Stay Ahead, Stay ONMINE

The AI energy challenge is coming to a head

Krishna Rangasayee is CEO of SiMa.ai, a software-centric, embedded edge machine learning system-on-chip company. As the last year falls further into the rearview and we are full steam ahead into 2025, the discussion surrounding AI’s massive energy consumption has reached an inflection point. The rapid advancement of AI has resulted in unprecedented demands on global […]

Krishna Rangasayee is CEO of SiMa.ai, a software-centric, embedded edge machine learning system-on-chip company.

As the last year falls further into the rearview and we are full steam ahead into 2025, the discussion surrounding AI’s massive energy consumption has reached an inflection point. The rapid advancement of AI has resulted in unprecedented demands on global energy infrastructure, threatening to outpace our ability to deliver power — and AI benefits — where they’re needed most. 

With AI already accounting for up to 4% of U.S. electricity use (a figure projected to nearly triple to 11% by 2030), reducing the strain on our energy systems is a priority, requiring a thorough reexamination of how AI’s energy needs could affect our long-term climate goals, infrastructure, resource availability and the scale at which this technology operates.

And the hype doesn’t look to be slowing down anytime soon. A new executive order was issued last month to prioritize and speed up the development of AI infrastructure, including data centers and other power facilities, while proposing new restrictions on exports of AI chips to keep innovation local. 

While political debates rage about energy sources and environmental regulation, the fundamental challenge lies in the stark mismatch between AI’s accelerating power requirements and our aging energy distribution infrastructure. This has now become a race against time, and though AI has inevitably become a “problem” — it also offers the path to a solution.

The scale of AI’s energy challenge

Most AI applications we use today — from chatbots to image generators — rely on the cloud to run models and process queries. These AI data centers serve as a hub, steadily accounting for around 4.4% of U.S. electrical demand, potentially increasing to more than a tenth of the total U.S. electrical demand by 2028.

Recent data points to the growth of EV infrastructure and the expansion of AI data centers, with more on the way, as the primary contributors to these growing numbers. As edge computing proliferates across industries, it offers a partial solution to the pressing issues seen with the growing AI energy demands. While not immune to the power struggle, edge computing can reduce power use and latency by enabling localized AI deployments, offering a more efficient and less energy-intensive approach than our current infrastructure.

A report from Goldman Sachs revealed a stunning statistic: the average ChatGPT query requires nearly 10 times as much electricity to process compared to a Google search. While an alarming number by itself, especially when considering the rapid adoption of the chatbot, data like this underscores the fundamental mismatch between AI development cycles, which operate in 100-day sprints, and energy infrastructure projects that typically span 100-year timelines.

Regulatory changes are a false flag

On their first day in office, the new administration began taking action by not only withdrawing the United States from the Paris Agreement — limiting the country’s access to clean energy and green tech markets — but also vowing to end the “electric vehicle mandate” and reversing a 2021 executive order, which aimed to ensure that half of all new vehicles sold in the U.S. by 2030 would be electric, potentially eliminating EV tax credits.

Many anticipate the government repealing the existing executive order on AI will accelerate AI development, especially in Silicon Valley. Pundits presume that the 2021 Bipartisan Infrastructure Law, which supports the development of green energy projects, will eventually be rolled back, with fund disbursement already on hold.

The recent appointment of Lee Zeldin to head the EPA, along with the potential rollback of initiatives from the Inflation Reduction Act, may ease some environmental restrictions for now. But these policy shifts won’t address the fundamental challenges of energy distribution — the massive infrastructure required to distribute energy where it’s needed most. The critical bottleneck isn’t the energy source itself; it’s the massive infrastructure required to deliver it where needed.

It is these limitations, not any policy changes, that will continue stifling innovation in the space. In fact, the continued advancement of electric and autonomous vehicles will only emphasize the need for additional fuel sources. If energy efficiency continues improving at its current pace, the electricity required by in-vehicle computers could reach 26 terawatt-hours by 2040 — equivalent to the total consumption of about 59 million desktop PCs.

Market forces driving change

The aggressive competition for limited power infrastructure is already creating a natural selection process for providers in AI deployment. The growing gap between AI’s rapid development cycle and infrastructure’s glacial pace of change is forcing companies to innovate in three key areas:

  • Energy-Efficient AI Architecture: Companies are investing in specialized, smaller language models that can operate within existing power constraints while delivering targeted business value. 
  • Geographic Strategy: Businesses are making AI deployment decisions based primarily on power availability and distribution capabilities.
  • Competitive Innovation: As companies realize that waiting for infrastructure catch-up isn’t viable, they’re turning to new approaches to AI deployment that treat energy efficiency as a core competitive advantage rather than a compliance requirement. 

The data center capacity limitations we see today are forcing companies to make hard choices about where and how to deploy AI resources. It won’t be long until a complete reevaluation of how we distribute these resources becomes necessary.

Edge computing is the catalyst for change

So what’s the solution? As traditional infrastructure scaling remains inadequate when faced with the rise in demand, companies are increasingly turning to edge computing solutions that distribute computational loads closer to end users, reducing the strain on centralized data centers.

The emergence of smaller, specialized language models reflects a growing recognition that energy efficiency must be a core consideration in AI development. These specialized models not only reduce power use but often provide better performance for specific tasks, suggesting a future where AI development may be shaped as much by energy constraints as by technological capabilities.

Energy efficiency is the future

When looking at what is on the horizon for AI, it’s become obvious that industry collaboration on energy-efficient computing has evolved beyond an environmental imperative and into a business necessity as infrastructure limitations threaten to bottleneck AI deployment. 

There must be a seismic shift in how the industry defines and measures AI advancement, pivoting our idea of technological evolution from raw computing power to energy efficiency. The impact of edge computing goes beyond the macro level. Less bandwidth and lower network operating costs make energy more efficient and cost-effective, freeing up resources elsewhere to help propel necessary progress. 

The future of AI development will be shaped not by political decisions about energy sources, but by the physical realities of power distribution infrastructure. While this perspective may challenge popular narratives about environmental regulation being the primary constraint on AI development, the mathematics of energy distribution cannot be overcome by the wave of a politician’s magic wand.

Industry leaders must confront these infrastructure limitations head-on, driving innovation in both AI efficiency and power distribution solutions, ensuring we leave this planet better than we found it.

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

Fortinet speeds threat detection with improved FortiAnalyzer

The package also now integrates with FortiAI, the vendor’s genAI assistant, to better support analytics and telemetry to help security teams speed threat investigation and response, the vendor stated. “FortiAI identifies the threats that need analysis from the data collected by FortiAnalyzer, primarily collected from FortiGates. By automating the collection,

Read More »

Aryaka adds AI-powered observability to SASE platform

Nadkarni explained that Aryaka runs unsupervised machine learning models on the data to identify anomalies and outliers in the data. For example, the models may detect a sudden spike in traffic to a domain that has not been seen before. This unsupervised analysis helps surface potential issues or areas of

Read More »

USA, Ukraine Ramp Up Talks on Minerals Deal During Envoy Trip

Ukrainian and US negotiators are seeking to move past the breakdown in transatlantic relations this week to finalize a deal on critical minerals, a person with knowledge of the talks said.   Two days after President Donald Trump hectored President Volodymyr Zelenskiy as a “dictator” who needed to move quickly on a peace deal, Ukrainian officials are discussing the minerals issue with US special envoy Keith Kellogg during a visit to Kyiv, the person said on condition of anonymity as talks take place behind closed doors.  Zelenskiy, who rejected an initial US offer that involved securing half the income from Ukraine’s minerals, said that his meeting with Kellogg on Thursday had “restored hope.”  The US proposal envisaged securing 50% of license sales and other proceeds from the minerals, which would violate Ukrainian laws, a person familiar with the discussions said. It also covered revenue from oil, gas and ports, ABC News reported, citing a draft document.  After Kyiv suggested changes, the US has now sent a revised and improved version, which includes language on future assistance, a person familiar with the matter said. Trump has said he wanted the equivalent of $500 billion worth of rare earths, which are mainly used in high-strength magnets. But despite reports of $10 trillion worth of mineral deposits, Ukraine has no major rare-earth reserves that are internationally recognized as economically viable. Most deposits are likely by-products of producing materials like phosphates. Some are in areas of Russian control.  A completed deal on US access to Ukrainian minerals in exchange for security guarantees would be a key element of the Trump administration’s effort to end the three-year war. Trump has said he may meet with Russian President Vladimir Putin soon.  “We need strong agreements with the United States – agreements that will truly work,” Zelenskiy said in his daily address to

Read More »

Grangemouth protesters lay 400 hard hats outside Scottish Labour conference

Grangemouth workers targeted the opening day of the Scottish Labour conference on Friday, laying 400 hard hats on the ground to represent every job set to be lost. Workers and members of the Unite union gathered outside the Scottish Event Campus in Glasgow to urge the UK Government to do more to save the oil refinery. Protesters braved wet and windy weather as they chanted “You said our refinery was your top priority” and “Keep Grangemouth working”. Petroineos announced last year that more than 400 jobs would be lost as Scotland’s last oil refinery shuts this year. The Scottish and UK governments have committed £100 million to the Falkirk and Grangemouth Growth Deal to support jobs and skills in the area along with a report – called Project Willow – to look at the future of the site. Scotland’s First Minister John Swinney also announced a further £25 million to ensure a “just transition” in the area. Grangemouth future But trade unions have accused both governments of not doing enough to save the plant, while Scottish Labour leader Anas Sarwar has described the response as “not good enough”. Sharon Graham, general secretary of the Unite union, told the PA news agency that Grangemouth could still be saved. She said: “We are taking every opportunity we can to say to the UK Government, the Scottish Government and all politicians that they have not got long now to save Grangemouth. © Andrew Milligan/PA WireUnite General Secretary Sharon Graham speaks at a demonstration to protest at Petroineos plans to close Grangemouth oil refinery, during the Scottish Labour Party conference at the Scottish Exhibition Centre (SEC) in Glasgow. Image: Andrew Milligan/PA Wire “Let’s be really clear, Grangemouth can be saved, but I’m saying to all politicians today, if they do not do that, then

Read More »

USA Crude Oil Inventories Rise by 4.6MM Barrels WoW

U.S. commercial crude oil inventories, excluding those in the Strategic Petroleum Reserve (SPR), increased by 4.6 million barrels from the week ending February 7 to the week ending February 14, the U.S. Energy Information Administration (EIA) highlighted in its latest weekly petroleum status report. The EIA’s report, which was released on February 20 and included data for the week ending February 14, showed that crude oil stocks, not including the SPR, stood at 432.5 million barrels on February 14, 427.9 million barrels on February 7, and 443.0 million barrels on February 16, 2024. Crude oil in the SPR stood at 395.3 million barrels on February 14 and February 7, and 359.5 million barrels on February 16, 2024, the report outlined. Total petroleum stocks – including crude oil, total motor gasoline, fuel ethanol, kerosene type jet fuel, distillate fuel oil, residual fuel oil, propane/propylene, and other oils – stood at 1.607 billion barrels on February 14, the report showed. Total petroleum stocks were up 0.2 million barrels week on week and up 16.4 million barrels year on year, the report outlined. “At 432.5 million barrels, U.S. crude oil inventories are about three percent below the five year average for this time of year,” the EIA said in its report. “Total motor gasoline inventories decreased by 0.2 million barrels from last week and are one percent below the five year average for this time of year. Finished gasoline inventories increased, while blending components inventories decreased last week,” it added. “Distillate fuel inventories decreased by 2.1 million barrels last week and are about 12 percent below the five year average for this time of year. Propane/propylene inventories decreased by 3.6 million barrels from last week and are slightly below the five year average for this time of year,” the EIA continued. U.S. crude

Read More »

Power Moves: SOWEC’s new co-chair and more

Claire Mack has joined the Scottish Offshore Wind Energy Council (SOWEC) as its new industry co-chair. Mack was appointed by minister for climate action Dr Alasdair Allan to represent Scotland’s offshore wind industry for a two-year term. She will provide strategic leadership to SOWEC and work alongside the partnership’s subgroups and developer forum to deliver sector growth. In addition to her new role, Mack has been chief executive of Scottish Renewables since 2017. She is responsible for leading the organisation’s work to grow Scotland’s renewable energy industry and maintaining its position as a global leader in clean power. She replaces Brian McFarlane, SSE Renewables head of offshore development, who announced his intention to step down as SOWEC industry co-chair late last year. Since SOWEC’s inception in 2019, he has acted on behalf of Scotland’s offshore wind industry to drive forward SOWEC’s mission to coordinate and grow the sector. Acting minister for climate action Allan said: “Offshore wind is at the heart of the Scottish Government’s economic development agenda and the SOWEC is vital to driving forward action in support of the sector.” Mack added: “Offshore wind is one of Scotland’s key assets and our ability to turn our 40GW pipeline into successfully delivered projects is pivotal to our environmental and economic aspirations. “As I step into the role of co-chair at SOWEC during this critical point for Scotland’s £4 billion offshore wind sector, I would like to express my sincere gratitude to my predecessor, Brian McFarlane, for his untiring dedication and contribution to the sector.” © Supplied by Offshore Energies UKOEUK operator and developer chair Doris Reiter. Doris Reiter, BP senior vice-president for the North, has taken over as operator and developer chair on the board of Offshore Energies UK (OEUK). Having previously served as vice chair to the board,

Read More »

Expansion of Corpus Christi LNG in Texas Starts Delivery

Cheniere Energy Inc. has produced the first liquefied natural gas (LNG) cargo from a project expanding the Corpus Christi LNG (CCL) terminal in the namesake Texan city. CCL Stage 3 has seven midscale trains with an expected production capacity of over 10 million metric tons per annum (MMtpa), raising CCL’s output capacity to over 25 MMtpa from 10 trains. “First LNG production from the first train of the CCL Stage 3 Project was achieved in December 2024, and the first cargo of LNG was produced in February 2025”, Houston, Texas-based Cheniere said in its quarterly report. It plans to build two more mid-scale trains adjacent to the Stage 3 project for a further capacity addition of about 3 MMtpa. “In June 2024, we received a positive Environmental Assessment from the FERC and anticipate receiving all remaining necessary regulatory approvals for the project in 2025”, the report said. Citing new capacity unlocked from the expansion, Cheniere expects to surpass last year’s earnings. Consolidated adjusted EBITDA guidance for 2025 is $6.5 billion-$7 billion, compared to the actual 2024 figure of $6.2 billion. Distributable cash flow this year is expected to be $4.1 billion-$4.6 billion, compared to the actual 2024 figure of $3.7 billion. “We expect 2025 to be another record year for LNG production as Stage 3 trains are completed, and we look forward to delivering financial results within these ranges and further enhancing the long-term value proposition of Cheniere”, commented president and chief executive Jack Fusco. Last year Cheniere exported a record 646 LNG cargoes, or 2.33 trillion British thermal units. Cheniere posted $977 million in net profit for the fourth quarter (Q4) of 2024 and $3.25 billion for the full year. The figures were down 29 percent and 67 percent by year-ago comparisons, respectively. On a diluted basis, Q4 earnings

Read More »

Leveraging surplus interconnection could unleash 800 GW of energy the US needs today

Cassady Craighill is the technical education director at GridLab. The challenges facing the grid can often feel intimidating and beyond grasp. How much of the load growth is hype and how much is real? How do we ensure the financial burden of updating our grid and leading the world in energy innovation does not disproportionally fall on residential customers? How do operators and utilities prepare for the next cataclysmic flood, fire, or storm? How do we ensure that our 21st energy grid serves everyone across the country amidst a patchwork of different state policies and resources?  Those are all big questions that will take years to get exactly right. Luckily, there is an out-of-the-box solution laying on the table today that will get hundreds of energy projects online in no time and will save consumers money. Surplus interconnection, which allows new energy projects to plug into existing interconnection infrastructure at plants with low capacity factors, could nearly double the generation in the United States by 2030 and at a fraction of the cost and time of a traditional interconnection process.  Affordable, clean, and abundant power is key to keeping businesses operating in the U.S. This sentiment dominates the rhetoric in Energy Secretary Chris Wright’s first Secretarial order issued this month and his message of “energy addition.” At its core, surplus interconnection is adding energy resources to the grid as quickly and cheaply as possible. In his order, Secretary Wright insists “we must expand energy production and reduce energy costs for American families and businesses.” Leveraging existing interconnection capacity at thermal plants is the fastest way to expand energy production by shaving years off the interconnection timeline, saving about $200 billion by avoiding costly new infrastructure and lengthy new buildouts, and adding reliable resources to the grid. We need this solution

Read More »

Do data centers threaten the water supply?

In a new report, the Royal Academy of Engineering called upon the government to ensure tech companies accurately report how much energy and water their data centers are using and reducing the use of drinking water for cooling. Without such action, warns one of the report’s authors, Professor Tom Rodden, “we face a real risk that our development, deployment and use of AI could do irreparable damage to the environment.” The situation is a little different for the US as the country has large bodies of water offering a  water supply that the UK just does not have. It’s not an accident that there are many data centers around the Chicago area: they’ve also got the Great Lakes to draw upon. Likewise, the Columbia and Klamath Rivers have become magnets for data centers for both water supply and hydroelectric power. Other than the Thames River, the UK doesn’t have these massive bodies of water. Still, the problem is not unique to the UK, says Alan Howard, senior analyst with Omdia. He notes that Microsoft took heat last year because it was draining the water supply of a small Arizona town of Goodyear with a new AI-oriented data center.  The city of Chandler, Arizona passed an ordinance in 2015 that restricted new water-intensive businesses from setting up shop which slowed data center development.   “I believe some data center operators just bowed out,” said Howard.

Read More »

Ireland says there will be no computation without generation

Stanish said that, in 2023, she wrote a paper that predicted “by 2028, more than 70% of multinational enterprises will alter their data center strategies due to limited energy supplies and data center moratoriums, up from only about 5% in 2023. It has been interesting watching this trend evolve as expected, with Ireland being a major force in this conversation since the boycotts against data center growth started a few years ago.” Fair, equitable, and stable electricity allocation, she said, “means that the availability of electricity for digital services is not guaranteed in the future, and I expect these policies, data center moratoriums, and regional rejections will only continue and expand moving forward.” Stanish pointed out that this trend is not just occurring in Ireland. “Many studies show that, globally, enterprises’ digital technologies are consuming energy at a faster rate than overall growth in energy supply (though, to be clear, these studies mostly assume a static position on energy efficiency of current technologies, and don’t take into account potential for nuclear or hydrogen to assuage some of these supply issues).” If taken at face value, she said, this means that a lack of resources could cause widespread electricity shortages in data centers over the next several years. To mitigate this, Stanish said, “so far, data center moratoriums and related constraints (including reduced tax incentives) have been enacted in the US (specifically Virginia and Georgia), Denmark, Singapore, and other countries, in response to concerns about the excessive energy consumption of IT, particularly regarding compute-intense AI workloads and concerns regarding an IT energy monopoly in certain regions. As a result, governments (federal, state, county, etc.) are working to ensure that consumption does not outpace capacity.” Changes needed In its report, the CRU stated, “a safe and secure supply of energy is essential

Read More »

Perspective: Can We Solve the AI Data Center Power Crisis with Microgrids?

President Trump announced a$500 billion private sector investment in the nation’s Artificial Intelligence (AI) infrastructure last month. The investment will come from The Stargate Project, a joint venture between OpenAI, SoftBank, Oracle and MGX, which intends to build 20 new AI data centers in the U.S in the next four to five years. The Stargate Project committed$100 billion for immediate deployment and construction has already begun on its first data center in Texas. At approximately a half a million square feet each, the partners say these new facilities will cement America’s leadership in AI, create jobs and stimulate economic growth. Stargate is not the only game in town, either. Microsoft is expected to invest$80 billion in AI data center development in 2025, with Google, AWS and Meta also spending big. While all this investment in AI infrastructure is certainly exciting, experts say there’s one lingering question that’s yet to be answered and it’s a big one: How are we going to power all these AI data centers? This will be one of the many questions tackled duringMicrogrid Knowledge’s annual conference, which will be held in Texas April 15-17 at the Sheraton Dallas. “Powering Data Centers: Collaborative Microgrid Solutions for a Growing Market” will be one of the key sessions on April 16. Industry experts will gather to discuss how private entities, developers and utilities can work together to deploy microgrids and distributed energy technologies that address the data center industry’s power needs. The panel will share solutions, technologies and strategies that will favorably position data centers in the energy queue. In advance of this session, we sat down with two microgrid experts to learn more about the challenges facing the data center industry and how microgrids can address the sector’s growing energy needs. We spoke with Michael Stadler, co-founder and

Read More »

Data Center Tours: Iron Mountain VA-1, Manassas, Virginia

Iron Mountain Northern Virginia Overview Iron Mountain’s Northern Virginia data centers VA-1 through VA-7 are situated on a 142-acre highly secure campus in Prince William County, Virginia. Located at 11680 Hayden Road in Manassas, Iron Mountain VA-1 spans 167,958 sq. ft. and harbors 12.4 MW of total capacity to meet colocation needs. The 36 MW VA-2 facility stands nearby. The total campus features a mixture of single and multi-tenant facilities which together provide more than 2,000,000 SF of highly efficient green colocation space for enterprises, federal agencies, service providers and hyperscale clouds.  The company notes that its Manassas campus offers tax savings compared to Ashburn and exceptional levels of energy-efficiency as well as a diverse and accessible ecosystem of cloud, network and other service providers.  Iron Mountain’s Virginia campus has 9 total planned data centers, with 5 operational facilities to date and two more data centers coming soon. VA-2 recently became the first data center in the United States to achieve DCOS Maturity Level 3.    As we continued the tour, Kinra led the way toward the break room, an area where customers can grab coffee or catch up on work. Unlike the high-end aesthetic of some other colocation providers, Iron Mountain’s approach is more practical and focused on functionality. At the secure shipping and receiving area, Kinra explained the process for handling customer equipment. “This is where our customers ship their equipment into,” he said. “They submit a ticket, send their shipments in, and we’ll take it, put it aside for them, and let them know when it’s here. Sometimes they ask us to take it to their environment, which we’ll do for them via a smart hands ticket.” Power Infrastructure and Security Measures The VA-1 campus is supported by a single substation, providing the necessary power for its growing

Read More »

Land and Expand: DPO, Microsoft, JLL and BlackChamber, Prologis, Core Scientific, Overwatch Capital

Land and Expand is a periodic feature at Data Center Frontier highlighting the latest data center development news, including new sites, land acquisitions and campus expansions. Here are some of the new and notable developments from hyperscale and colocation data center developers and operators about which we’ve been reading lately. DPO to Develop $200 Million AI Data Center in Wisconsin Rapids; Strategic Partnership with Billerud’s CWPCo Unlocks Hydroelectric Power for High-Density AI Compute Digital Power Optimization (DPO) is moving forward with plans to build a $200 million high-performance computing (HPC) data center in Wisconsin Rapids, Wisconsin. The project, designed to support up to 20 megawatts (MW) of artificial intelligence (AI) computing, leverages an innovative partnership with Consolidated Water Power Company (CWPCo), a subsidiary of global packaging leader Billerud. DPO specializes in developing and operating data centers optimized for power-dense computing. By partnering with utilities and independent power producers, DPO colocates its facilities at energy generation sites, ensuring direct access to sustainable power for AI, HPC, and blockchain computing. The company is privately held. Leveraging Power Infrastructure for Speed-to-Energization CWPCo, a regulated utility subsidiary, has operated hydroelectric generation assets since 1894, reliably serving industrial and commercial customers in Wisconsin Rapids, Biron, and Stevens Point. Parent company Billerud is a global leader in high-performance packaging materials, committed to sustainability and innovation. The company operates nine production facilities across Sweden, the USA, and Finland, employing 5,800 people in over 19 countries.  The data center will be powered by CWPCo’s renewable hydroelectric assets, tapping into the utility’s existing 32 megawatts of generation capacity. The partnership grants DPO a long-term land lease—extending up to 50 years—alongside interconnection rights to an already-energized substation and a firm, reliable power supply. “AI infrastructure is evolving at an unprecedented pace, and access to power-dense sites is critical,” said Andrew

Read More »

Data center spending to top $1 trillion by 2029 as AI transforms infrastructure

His projections account for recent advances in AI and data center efficiency, he says. For example, the open-source AI model from Chinese company DeepSeek seems to have shown that an LLM can produce very high-quality results at a very low cost with some clever architectural changes to how the models work. These improvements are likely to be quickly replicated by other AI companies. “A lot of these companies are trying to push out more efficient models,” says Fung. “There’s a lot of effort to reduce costs and to make it more efficient.” In addition, hyperscalers are designing and building their own chips, optimized for their AI workloads. Just the accelerator market alone is projected to reach $392 billion by 2029, Dell’Oro predicts. By that time, custom accelerators will outpace commercially available accelerators such as GPUs. The deployment of dedicated AI servers also has an impact on networking, power and cooling. As a result, spending on data center physical infrastructure (DCPI) will also increase, though at a more moderate pace, growing by 14% annually to $61 billion in 2029.  “DCPI deployments are a prerequisite to support AI workloads,” says Tam Dell’Oro, founder of Dell’Oro Group, in the report. The research firm raised its outlook in this area due to the fact that actual 2024 results exceeded its expectations, and demand is spreading from tier one to tier two cloud service providers. In addition, governments and tier one telecom operators are getting involved in data center expansion, making it a long-term trend.

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »