Hallucinations in AI: How GSK is addressing a critical problem in drug development

Generative AI has become a key piece of infrastructure in many industries, and healthcare is no exception. Yet, as organizations like GSK push the boundaries of what generative AI can achieve, they face significant challenges, particularly when it comes to reliability. Hallucinations, in which AI models generate incorrect or fabricated information, are a persistent problem in high-stakes applications like drug discovery and healthcare. For GSK, tackling these challenges requires leveraging test-time compute scaling to improve gen AI systems. Here’s how they’re doing it.

The hallucination problem in gen AI for healthcare

Healthcare applications demand an exceptionally high level of accuracy and reliability. Errors are not merely inconvenient; they can have life-altering consequences. This makes hallucinations in large language models (LLMs) a critical issue for companies like GSK, where gen AI is applied to tasks such as scientific literature review, genomic analysis and drug discovery.

To mitigate hallucinations, GSK employs advanced inference-time compute strategies, including self-reflection mechanisms, multi-model sampling and iterative output evaluation. According to Kim Branson, SVP of AI and machine learning (ML) at GSK, these techniques help ensure that agents are “robust and reliable,” while enabling scientists to generate actionable insights more quickly.

Leveraging test-time compute scaling

Test-time compute scaling refers to the ability to increase computational resources during the inference phase of AI systems. This allows for more complex operations, such as iterative output refinement or multi-model aggregation, which are critical for reducing hallucinations and improving model performance.

Branson emphasized the transformative role of scaling in GSK’s AI efforts, noting that “we’re all about increasing the iteration cycles at GSK — how we think faster.” By using strategies like self-reflection and ensemble modeling, GSK can leverage these additional compute cycles to produce results that are both accurate and reliable.

Branson also touched on the broader industry trend, saying, “You’re seeing this war happening with how much I can serve, my cost per token and time per token. That allows people to bring these different algorithmic strategies which were before not technically feasible, and that also will drive the kind of deployment and adoption of agents.”

Strategies for reducing hallucinations

GSK has identified hallucinations as a critical challenge in gen AI for healthcare. The company employs two main strategies that require additional computational resources during inference. Applying more thorough processing steps ensures that each answer is examined for accuracy and consistency before it is delivered in clinical or research settings, where reliability is paramount.

Self-reflection and iterative output review

One core technique is self-reflection, where LLMs critique or edit their own responses to improve quality. The model “thinks step by step,” analyzing its initial output, pinpointing weaknesses and revising answers as needed. GSK’s literature search tool exemplifies this: It collects data from internal repositories and an LLM’s memory, then re-evaluates its findings through self-criticism to uncover inconsistencies. 

This iterative process results in clearer, more detailed final answers. Branson underscored the value of self-criticism, saying: “If you can only afford to do one thing, do that.” Refining its own logic before delivering results allows the system to produce insights that align with healthcare’s strict standards.
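
The draft, critique and revise loop described above can be sketched in a few lines of Python. This is an illustration only, not GSK's implementation: the `LLM` callable, the prompt wording, the "OK" stopping convention and the `max_rounds` parameter are all assumptions made for the example.

```python
from typing import Callable

# Hypothetical interface: any function that maps a prompt string to a completion.
LLM = Callable[[str], str]


def self_reflect(llm: LLM, question: str, max_rounds: int = 2) -> str:
    """Draft an answer, then let the same model critique and revise it."""
    answer = llm(f"Question: {question}\nThink step by step, then answer.")
    for _ in range(max_rounds):
        # Ask the model to review its own output.
        critique = llm(
            "Review the answer below for errors, unsupported claims and "
            "inconsistencies. Reply with exactly 'OK' if there are none.\n"
            f"Question: {question}\nAnswer: {answer}"
        )
        if critique.strip().upper() == "OK":
            break  # the model found nothing left to fix
        # Feed the critique back in and request a revised answer.
        answer = llm(
            "Revise the answer to address the critique.\n"
            f"Question: {question}\nAnswer: {answer}\nCritique: {critique}"
        )
    return answer
```

Each round adds up to two extra model calls per query, which is where the additional inference-time compute goes.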

Multi-model sampling

GSK’s second strategy relies on multiple LLMs or different configurations of a single model to cross-verify outputs. In practice, the system might run the same query at various temperature settings to generate diverse answers, employ fine-tuned versions of the same model specializing in particular domains or call on entirely separate models trained on distinct datasets.

Comparing and contrasting these outputs helps confirm the most consistent or convergent conclusions. “You can get that effect of having different orthogonal ways to come to the same conclusion,” said Branson. Although this approach requires more computational power, it reduces hallucinations and boosts confidence in the final answer — an essential benefit in high-stakes healthcare environments.
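
One simple way to picture this cross-verification is a majority vote over several samplers. The sketch below is an assumption-laden illustration: the article does not say how GSK compares outputs, and the `Sampler` type, the `normalize` helper, the `consensus_answer` function and the `min_agreement` threshold are hypothetical names chosen for the example. Exact-match voting only suits short answers; free-text outputs would need a semantic comparison instead.

```python
from collections import Counter
from typing import Callable, Optional, Sequence, Tuple

# Hypothetical interface: one sampler per model, fine-tune or temperature setting.
Sampler = Callable[[str], str]


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivially different answers match."""
    return " ".join(text.lower().split())


def consensus_answer(
    samplers: Sequence[Sampler], question: str, min_agreement: float = 0.5
) -> Tuple[Optional[str], float]:
    """Return (answer, agreement); answer is None when agreement is too low."""
    if not samplers:
        raise ValueError("at least one sampler is required")
    # Count how many samplers produced each (normalized) answer.
    votes = Counter(normalize(sample(question)) for sample in samplers)
    answer, count = votes.most_common(1)[0]
    agreement = count / len(samplers)
    # Only trust the top answer if more than min_agreement of samplers back it.
    return (answer if agreement > min_agreement else None), agreement
```

Returning `None` on low agreement lets the caller escalate to a human reviewer or spend further compute rather than pass along an unverified answer.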

The inference wars

GSK’s strategies depend on infrastructure that can handle significantly heavier computational loads. In what Branson calls “inference wars,” AI infrastructure companies — such as Cerebras, Groq and SambaNova — compete to deliver hardware breakthroughs that enhance token throughput, lower latency and reduce costs per token. 

Specialized chips and architectures enable complex inferencing routines, including multi-model sampling and iterative self-reflection, at scale. Cerebras’ technology, for example, processes thousands of tokens per second, allowing advanced techniques to work in real-world scenarios. “You’re seeing the results of these innovations directly impacting how we can deploy generative models effectively in healthcare,” Branson noted. 

When hardware keeps pace with software demands, compute-heavy techniques like these become practical without sacrificing accuracy or efficiency.

Challenges remain

Even with these advancements, scaling compute resources presents obstacles. Longer inference times can slow workflows, especially if clinicians or researchers need prompt results. Higher compute usage also drives up costs, requiring careful resource management. Nonetheless, GSK considers these trade-offs necessary for stronger reliability and richer functionality. 

“As we enable more tools in the agent ecosystem, the system becomes more useful for people, and you end up with increased compute usage,” Branson noted. Balancing performance, costs and system capabilities allows GSK to maintain a practical yet forward-looking strategy.

What’s next?

GSK plans to keep refining its AI-driven healthcare solutions with test-time compute scaling as a top priority. The combination of self-reflection, multi-model sampling and robust infrastructure helps to ensure that generative models meet the rigorous demands of clinical environments. 

This approach also serves as a road map for other organizations, illustrating how to reconcile accuracy, efficiency and scalability. Maintaining a leading edge in compute innovations and sophisticated inference techniques not only addresses current challenges, but also lays the groundwork for breakthroughs in drug discovery, patient care and beyond.

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

A CSO’s perspective: 8 cyber predictions for 2025

As we step into 2025, the cyberthreat landscape is once again more dynamic and challenging than the year before. In 2024, we witnessed a remarkable acceleration in cyberattacks of all types, many fueled by advancements in generative AI. For security leaders, the stakes are higher than ever. In this post,

Read More »

Ericsson unveils genAI assistant for 5G network operations

Telecommunications and networking provider Ericsson recently launched its generative AI-based virtual assistant that uses large language model (LLM) technology to read, understand, and generate new content to provide personalized answers for network operators configuring wireless 5G networks, troubleshooting problems, and creating policies. Ericsson’s AI-based NetCloud Assistant, or ANA, is LLM-based

Read More »

Next-gen Ethernet standards set to move forward in 2025

Metz noted that in addition to vendor participation growth there was a lot of technical innovation. Significant developments were made across the physical, link, transport, and software layers, including innovative congestion schemes, built-in security and optimized packet delivery.  “More than 25 individual projects contributed to the development of a finely

Read More »

HPE beats Dell and Supermicro in $1B AI server deal with X

The financial scope of the deal underscores its significance. Analysts see HPE’s landmark $1 billion deal with X as a major endorsement of its AI capabilities, but competition remains fierce in the high-growth AI server market. “HPE’s $1 billion deal with X not only enhances its credibility but also highlights

Read More »

Norway Awards 53 Offshore Hydrocarbon Production Licenses

Norway’s Energy Ministry on Tuesday awarded 53 hydrocarbon production leases on the country’s continental shelf under last year’s licensing round. A total of 20 firms, out of 21 that applied under the 2024 Awards in Pre-Defined Areas (APA), were offered ownership interests. Thirteen companies were offered one or more operatorships, according to a list published on the ministry’s website. “Continued development of the Norwegian continental shelf (NCS) is important for employment, value creation, and the ripple effects of petroleum activities on the mainland going forward”, Energy Minister Terje Aasland said in a statement. “We need new discoveries to ensure that Norway can remain a stable and predictable supplier of oil and gas to Europe. It is therefore very positive to see such great interest in new exploration areas”. Thirty-three of the new licenses are on Norway’s side of the North Sea. Nineteen are in the Norwegian Sea, while one is in the Norwegian portion of the Barents Sea. The Nordic nation, the top gas supplier for Europe having overtaken Russia since 2022, holds about 7.1 billion standard cubic meters of oil equivalent remaining resources in its continental shelf. The figure includes 3.5 billion standard cubic meters of oil equivalent undiscovered resources, according to the Norwegian Offshore Directorate’s 2024 “Resource Report”. Aker BP ASA has the most operatorships, numbering 16, among the 2024 APA winners. It is a participant in a total of 19 licenses under the 2024 APA. Fornebu, Norway-based Aker BP said in a separate online statement it plans to start drilling in the former Frigg field, which is among its new NCS operatorships, in the second quarter. “Phasing in oil and gas from new discoveries will be crucial to ensuring long-term activity on the shelf”, said Per Øyvind Seljebotn, senior vice president for exploration and reservoir development at

Read More »

NESO proposes “pause” in grid connection approvals ahead of reforms

The National Energy System Operator (NESO) has proposed pausing applications for new grid connections as it looks to deal with a project backlog. Following approval from energy regulator Ofgem, NESO will implement new transitional arrangements that will pause applications received as of Wednesday 29 January 2025. As part of the connections reforms planned for 2025, a new process will be implemented following final regulatory approval by Ofgem. Grid connections applications have continued to grow over the last year to the point that it is no longer possible to deliver in flight connections reforms in parallel with the existing connections process. In 2023/24 alone, NESO received over 1,700 queue applications, with more projects already in the queue than is required for the energy system in 2030 or even 2050. NESO director of connections reform Matt Vickers said: “This transitional arrangement is critical to delivering the connections reforms we will implement later this year, subject to Ofgem approval. It’s a significant step forward in changing the grid connections process for the better. “Our reforms prioritise projects which are ready to progress, and which are needed to deliver clean power by 2030. To reorder the queue, we need to start from a stable base. This short pause in applications will allow us to work with colleagues across the network companies to prepare for the new processes we need to bring forward the electricity projects needed for the delivery of clean power by 2030 and beyond.” The plan has been developed in partnership with SSEN, Scottish Power Network and National Grid Electricity Transmission To enable NESO, transmission owners and distribution network operators to focus on preparing for the new connections reform framework, a new starting point is needed for the connections process, to ensure all projects join the new framework on the same terms.

Read More »

EU Mulls Gradual Ban on Russian LNG

The European Union is considering import restrictions on Russian aluminum and phasing out liquefied natural gas from the nation as part of a new package of sanctions targeting Moscow for its full-scale invasion of Ukraine, according to people familiar with the matter. The draft measures, which would be part of the bloc’s 16th package of sanctions, include restrictions on dozens more vessels that are part of Moscow’s shadow fleet of tankers transporting Russian oil and further export controls on goods used for military purposes. The move would also see more banks cut off the international payments systems SWIFT, said the people, who spoke on condition of anonymity. Restrictions on aluminum would be gradual with a timeframe and scope still to be determined, the people said. Exiting LNG could be done either as a sanction or as part of a road map that the bloc’s executive arm is set to present next month, they said. Reuters earlier reported the discussion on aluminum. The draft proposals are still being discussed between member states and could change before they’re formally presented. While a ban on imports of Russian gas has been urged by several nations, the EU still needs to decide whether it should rely on sanctions to make it legally binding, regulations as part of a road map, or a mix of those two, according to officials and diplomats with knowledge of the talks. Sanctions may offer the strongest argument for terminating contracts with Russian suppliers, but they require unanimous approval from member states and are limited in time. Supply Shifts European governments, which had previously been reluctant to give up Russian LNG, are watching nervously as gas prices creep up because of cold weather and new US sanctions on Russian energy. The US and EU have been gradually sanctioning some Russian LNG projects

Read More »

EDF, Hypervolt Partner to Use EVs for Grid Balance

EDF Energy Holdings Ltd. has partnered with EV charge-point manufacturer and software provider Hypervolt on an industry first. EDF said in a media release that, through its Wholesale Market Services’ PowerShift technology, the collaboration aims to leverage Hypervolt’s UltraGrid software to offer a frequency response service using electric vehicles (EVs). Working with Britain’s National Electricity System Operator (NESO), this initiative will use Hypervolt EV chargers to support the grid at times when it is facing challenges maintaining the required frequency, EDF said.  The offer aligns with NESO’s Clean Power 2030 Report, emphasizing quicker responses to maintain system frequency near 50 Hz and increasing services through frequency markets, according to EDF. Using its PowerShift capability, EDF said it will automatically adjust Hypervolt EV chargers to balance the grid during peak demand or when there is excess renewable energy. This approach helps customers save on electricity costs, reduce their carbon footprint, and maximize renewable energy use, it said. Current and future Hypervolt charger owners will have access to an innovative smart charging tariff from EDF, providing the best value for those who frequently charge their EVs. Customers who enroll in this tariff will always have control over their charging preferences, including the desired level of charge and the time of day they want their EV charged. Charging will be managed automatically, requiring no manual intervention, and savings will be available on the EDF app, the company said. Hypervolt charger owners who are not EDF customers can also benefit from flexibility savings through PowerShift, EDF’s virtual power plant. PowerShift optimizes flexibility value from grid-scale and behind-the-meter assets by utilizing artificial intelligence, real-time data analytics, and advanced algorithms. 
EDF said it offers its multi-market flexibility trading capability to EV charge point manufacturers, helping capture value and reduce costs for all EV drivers. “Our partnership

Read More »

Scotland injects £20m in XLCC cable maker on former nuclear site

A subsea cable manufacturing facility in Ayrshire has won further backing with a £20 million investment from the Scottish National Investment Bank (SNIB). Project developer XLCC will use the Scottish Government funds to press ahead with its high-voltage direct current (HVDC) cable factory, which is being built to meet growing demand for electricity transmission projects needing subsea connection to the UK grid. The latest investment adds to public sector-backing the Essex-based developer has raised already. This includes a further £20m from UK Infrastructure Bank (UKIB) – now known as the National Wealth Fund (NWF) – in 2024, and £9m from the SNIB’s counterpart agency, Scottish Enterprise, the year prior to that. XLCC, which was established in 2020, initially won planning permission two years later to develop the factory on a disused coal yard on the site of the Hunterston B nuclear power plant, which is being decommissioned by French power giant EDF (PAR:EDF). © Supplied by XLCCRenderings of the future XLCC cable manufacturing site at Hunterston in Ayrshire. The company’s first order is for one of the longest subsea cables in the world, on behalf of its strategic partner Xlinks. This firm plans to build a massive 3.6GW solar farm in Morocco connected by an HVDC-powered trans-Atlantic link to the Alverdiscott substation in North Devon. Like fellow cable manufacturer Sumitomo, which is building a £350m cable factory in the Scottish Highlands, XLCC is also aiming to meet demand for subsea cable driven by the growth in European energy production from offshore wind, particularly floating wind power projects in deep water. XLCC estimates demand for high-voltage subsea cables is expected to be two and a half times greater than available supply by 2030. Scotland is currently a world-leader in its plans to develop floating offshore wind. Of the UK’s target to deliver

Read More »

Ceasefire News Cools Oil Rally

Oil slipped from a five-month high as Hamas and Israel tentatively agreed to a cease-fire, cooling a rally fueled by risks to Russian and Iranian supplies. West Texas Intermediate retreated 1.7% to settle at $77.50 a barrel after CBS reported Israel and Hamas agreed in principle to a draft deal for a cease-fire and hostage release. Such a deal would mark a potential end to a conflict that has buffeted global oil markets for more than 15 months. The relative strength index shows crude futures have been mostly overbought since the start of the year, a reading that signals prices are due for a pullback. Algorithmic-driven investors known as commodity trading advisers, or CTAs, are flashing signs of buying exhaustion, said Daniel Ghali, a commodity strategist at TD Securities. “Our simulations of future prices already suggest that in no scenario will CTAs add to their WTI crude length, suggesting a continued rise in supply risk premia associated with Biden’s farewell sanctions on Russia will now be needed to support prices further,” Ghali said. The US benchmark had climbed 6.6% over the previous two sessions, while oil shipping rates surged the most in months on Monday in response to the measures from Washington that target about 160 tankers involved in the Russian oil trade. While the full impact of the latest US sanctions package remains unclear, it may drive a rerouting of global flows as users across Asia, including refiners in India and China, are forced to reach far and wide for replacement barrels. Some early signs of disruption are already apparent. Among them, a senior Indian bureaucrat told reporters that sanctioned vessels won’t be allowed to discharge, although the country’s state-owned refiners expect Moscow to find workarounds. The potential for Russian oil to continue reaching its intended destinations is easing

Read More »

8 Trends That Will Shape the Data Center Industry In 2025

What lies ahead for the data center industry in 2025? At Data Center Frontier, our eyes are always on the horizon, and we’re constantly talking with industry thought leaders to get their take on key trends. Our Magic 8 Ball prognostications did pretty well last year, so now it’s time to look ahead at what’s in store for the industry over the next 12 months, as we identify eight themes that stand to shape the data center business going forward. We’ll be writing in more depth about many of these trends, but this list provides a view of the topics that we believe will be most relevant in 2025. A publication about the future frontiers of data centers and AI shouldn’t be afraid to put it’s money where its mouth is, and that’s why we used AI tools to help research and compose this year’s annual industry trends forecast. The article is meant to be a bit encyclopedic in the spirit of a digest, less than an exactly prescriptive forecast – although we try to go there as well. The piece contains some dark horse trends. Do we think immersion cooling is going to explode this year, suddenly giving direct-to-chip a run for its money? Not exactly. But do we think that, given the enormous and rapidly expanding parameters of the AI and HPC boom, the sector for immersion cooling could see some breakthroughs this year? Seems reasonable. Ditto for the trends forecasting natural gas and quantum computing advancements. Such topics are definitely on the horizon and highly visible on the frontier of data centers, so we’d better learn more about them, was our thought. Because as borne out by recent history, data center industry trends that start at the bleeding edge (pun intended – also, on the list) sometimes

Read More »

Podcast: Data Center and AI Sustainability Imperatives with iMasons Climate Accord Executive Director, Miranda Gardiner

Miranda was a featured speaker at last September’s inaugural Data Center Frontier Trends Summit. The call for speakers is now open for this year’s event, which will be held again in Reston, Virginia from Aug. 26-28. DCF Show Podcast Quotes from Miranda Gardiner, Executive Director, iMasons Climate Accord On Her Career Journey and Early Passion for Sustainability:   – “My goals have always been kind of sustainability, affordable housing. I shared a story last week on a panel that my mother even found a yearbook of me from my elementary school years. The question that year was like, what do you hope for the future? And mine was there’d be no pollution and everyone would have a home.” On Transitioning to Data Centers:   – “We started to see this mission-critical focus in facilities like data centers, airports, and healthcare buildings. For me, connecting sustainability into the performance of the building made data centers the perfect match.” Overview of the iMasons Climate Accord:   – “The iMasons Climate Accord is an initiative started in 2022. The primary focus is emission reductions, and the only requirement to join is having an emission reduction strategy.”   – “This year, we refined our roadmap to include objectives such as having a climate strategy, incentivizing low-GHG materials like green concrete, and promoting equity by supporting small, women-owned, and minority-owned businesses.” On Industry Collaboration and Leadership:   – “This year, through the Climate Accord, we issued a call to action on the value of environmental product declarations (EPDs). It was signed by AWS, Digital Realty, Google, Microsoft, Schneider Electric, and Meta—talk about a big initiative and impact!” On EPDs and Carbon Disclosure:   – “EPDs provide third-party verification of materials coming into buildings. Pairing that with the Open Compute Project’s carbon disclosure labels on equipment creates vast opportunities for transparency and

Read More »

Accelsius and iM Data Centers Demo Next-Gen Cooling and Sustainability at Miami Data Center

Miami Data Center Developments Update Miami has recently witnessed several significant developments and investments in its data center sector, underscoring the city’s growing importance as a digital infrastructure hub. Notable projects include: Project Apollo:  A proposed 15-megawatt (MW), two-story, 75,000-square-foot data center in unincorporated Miami-Dade County. With an estimated investment of $150 million, construction is slated to commence between 2026 and 2027. The development team has prior experience with major companies such as Amazon, Meta, and Iron Mountain.  RadiusDC’s Acquisition of Miami I:  In August 2024, RadiusDC acquired the Miami I data center located in the Sweetwater area. Spanning 170,000 square feet across two stories, the facility currently offers 3.2MW of capacity, with plans to expand to 9.2 MW by the first half of 2026. The carrier-neutral facility provides connectivity to 11 fiber optic and network service providers.  Iron Mountain’s MIA-1 Data Center: Iron Mountain is developing a 150,000-square-foot, 16 MW data center on a 3.4-acre campus in Central North West Miami. The facility, known as MIA-1, is scheduled to open in 2026 and aims to serve enterprises, cloud providers, and large-scale users in South Florida. It will feature fiber connections to other Iron Mountain facilities and a robust pipeline of carriers and software-defined networks.  EDGNEX’s Investment Plans:  As of this month, Dubai, UAE-based EDGNEX has announced plans to invest $20 billion in the U.S. data center market, with the potential to double this investment. This plan includes a boutique condo project in Miami, estimated to have a $1 billion gross development value, indicating a significant commitment to the region’s digital infrastructure.  All of these developments highlight Miami’s strategic position as a connectivity hub, particularly serving as a gateway to Latin America and the Caribbean. 
The city’s data center market is characterized by steady growth, with a focus on retail colocation and

Read More »

Tract Capital Unveils Fleet Data Centers, Specializing In 500 MW+ Build-to-Suit Megacampuses

Tract Capital has announced the launch of Fleet Data Centers, a new platform dedicated to the development of mega-scale data center campuses with capacities of 500 MW or more, specifically designed for single-user customers.  The initiative is led by Grant van Rooyen, CEO of Tract Capital and Executive Chairman of Fleet Data Centers, and Chris Vonderhaar, the newly appointed President of Fleet Data Centers.  Vonderhaar brings extensive experience to the role, having served as Vice President of Demand and Supply Management at Google Cloud and as a senior leader at Amazon Web Services (AWS) for over a decade, where he oversaw the design, planning, construction, and operation of AWS’s global data center platform.  The Fleet leadership team also includes veterans from hyperscalers, wholesale data center providers, network infrastructure firms, and equipment vendors, with a collective track record of deploying dozens of gigawatts of data center capacity across hundreds of facilities globally. A Two Prong Strategy Defining two distinct strategies, Fleet is the mega-campus vertical development arm of Tract Capital, an alternative asset manager specializing in scaling digital infrastructure, which also operates Tract to refine development sites at ground level for data centers in terms of lining up power, fiber, zoning and entitlements.  Fleet Data Centers will aim to address the next phase of hyperscale data center growth by offering customized gigawatt-level campuses that provide predictability, flexibility, and scalability for hyperscalers navigating increasing infrastructure demands. This new venture from Tract Capital underscores the growing need for innovative, large-scale digital infrastructure solutions, particularly as hyperscalers face mounting challenges in scaling their global platforms to meet the demands of the digital age. 
The unveiling of Fleet is just another example of the way Tract Capital has consistently demonstrated its expertise in accelerating the scaling of responsible technology infrastructure, combining operational capabilities from industry

Read More »

Call for Speakers: Second Annual Data Center Frontier Trends Summit, Aug. 26-28, Reston, VA

Data Center Frontier (DCF) is excited to announce the Call for Speakers for our highly anticipated second annual Data Center Frontier Trends Summit, set to take place from August 26-28, 2025 in Reston, Virginia.  This premier industry event will once again bring together the brightest minds and leaders in the data center and digital infrastructure sectors to explore cutting-edge trends shaping the future of the industry.   Submit Speaking Proposals Here The DCF Trends Summit focuses on delivering deep insights and actionable knowledge for professionals navigating the evolving challenges and opportunities in data center innovation, energy efficiency, sustainability, and advanced technology integration. This year’s event will feature keynote speakers, expert panels, and interactive discussions on topics such as AI workloads, modular and edge computing, renewable energy strategies, and the global expansion of hyperscale facilities.   Call for Papers Details The DCF Trends Summit welcomes paper submissions on a wide range of relevant topics, including but not limited to: Emerging Trends:  AI, machine learning, and edge computing in data center operations. Power: Utility and substation power, renewables and behind-the-meter onsite, battery backup, energy storage. Sustainability:  Innovations in energy efficiency, renewable energy integration, and sustainable design. Technology Innovations:  Next-gen cooling systems, advanced automation, and breakthroughs in network infrastructure. National & Global Perspectives:  Regional market dynamics for site selection and regulation plus strategies for addressing evolving customer needs and workforce development.   View the Full DCF Trends ‘Topics of Interest’ Listing Industry professionals, researchers, and thought leaders are encouraged to submit papers that reflect their expertise, insights, and forward-looking perspectives. 
Submissions should align with the core themes of the Summit and provide actionable takeaways for attendees.   The deadline for paper submissions is January 29, 2025. All speakers will receive complimentary registration and the opportunity to share their work with a diverse audience

Read More »

UAE company to invest $20B in U.S. AI data centers

A United Arab Emirates investment firm has pledged $20 billion to build new data centers targeting AI across a number of locations across the United States. Billionaire Hussain Sajwani, CEO and founder of the property development company DAMAC Properties in Dubai, made the announcement at president-elect Donald Trump’s Florida home, Mar-a-Lago. Sajwani is a close friend of Trump, according to news reports. Trump said the first phase of the planned investment will take place in Texas, Arizona, Oklahoma, Louisiana, Ohio, Illinois, Michigan and Indiana. And that’s just for starters. “They may go double, or even somewhat more than double, that amount of money,” Trump said of the deal.


Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments in AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs). In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple to devote $200 billion between them to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are far higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.


John Deere unveils more autonomous farm machines to address skill labor shortage

Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it has become a regular non-tech exhibitor at the big tech trade show in Las Vegas, and it is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually, and the agricultural workforce continues to shrink. (This is my hint to the anti-immigration crowd.) John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences its own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do


2025 playbook for enterprise AI success, from agents to evals

2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year.

1. Agents: the next generation of automation

AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to


OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks.

Going all-in on red teaming pays practical, competitive dividends

It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find.
What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle
