Stay Ahead, Stay ONMINE

How do AI models generate videos?

Sure, the clips you see in demo reels are cherry-picked to showcase a company’s models at the top of their game. But with the technology in the hands of more users than ever before—Sora and Veo 3 are available in the ChatGPT and Gemini apps for paying subscribers—even the most casual filmmaker can now knock out something remarkable.  The downside is that creators are competing with AI slop, and social media feeds are filling up with faked news footage. Video generation also uses up a huge amount of energy, many times more than text or image generation.  With AI-generated videos everywhere, let’s take a moment to talk about the tech that makes them work. How do you generate a video? Let’s assume you’re a casual user. There are now a range of high-end tools that allow pro video makers to insert video generation models into their workflows. But most people will use this technology in an app or via a website. You know the drill: “Hey, Gemini, make me a video of a unicorn eating spaghetti. Now make its horn take off like a rocket.” What you get back will be hit or miss, and you’ll typically need to ask the model to take another pass or 10 before you get more or less what you wanted.  [embedded content] So what’s going on under the hood? Why is it hit or miss—and why does it take so much energy? The latest wave of video generation models are what’s known as latent diffusion transformers. Yes, that’s quite a mouthful. Let’s unpack each part in turn, starting with diffusion.  What’s a diffusion model? Imagine taking an image and adding a random spattering of pixels to it. Take that pixel-spattered image and spatter it again and then again. Do that enough times and you will have turned the initial image into a random mess of pixels, like static on an old TV set.  A diffusion model is a neural network trained to reverse that process, turning random static into images. During training, it gets shown millions of images in various stages of pixelation. It learns how those images change each time new pixels are thrown at them and, thus, how to undo those changes.  The upshot is that when you ask a diffusion model to generate an image, it will start off with a random mess of pixels and step by step turn that mess into an image that is more or less similar to images in its training set.  [embedded content] But you don’t want any image—you want the image you specified, typically with a text prompt. And so the diffusion model is paired with a second model—such as a large language model (LLM) trained to match images with text descriptions—that guides each step of the cleanup process, pushing the diffusion model toward images that the large language model considers a good match to the prompt.  An aside: This LLM isn’t pulling the links between text and images out of thin air. Most text-to-image and text-to-video models today are trained on large data sets that contain billions of pairings of text and images or text and video scraped from the internet (a practice many creators are very unhappy about). This means that what you get from such models is a distillation of the world as it’s represented online, distorted by prejudice (and pornography). It’s easiest to imagine diffusion models working with images. But the technique can be used with many kinds of data, including audio and video. To generate movie clips, a diffusion model must clean up sequences of images—the consecutive frames of a video—instead of just one image.  What’s a latent diffusion model?  All this takes a huge amount of compute (read: energy). That’s why most diffusion models used for video generation use a technique called latent diffusion. Instead of processing raw data—the millions of pixels in each video frame—the model works in what’s known as a latent space, in which the video frames (and text prompt) are compressed into a mathematical code that captures just the essential features of the data and throws out the rest.  A similar thing happens whenever you stream a video over the internet: A video is sent from a server to your screen in a compressed format to make it get to you faster, and when it arrives, your computer or TV will convert it back into a watchable video.  And so the final step is to decompress what the latent diffusion process has come up with. Once the compressed frames of random static have been turned into the compressed frames of a video that the LLM guide considers a good match for the user’s prompt, the compressed video gets converted into something you can watch.   With latent diffusion, the diffusion process works more or less the way it would for an image. The difference is that the pixelated video frames are now mathematical encodings of those frames rather than the frames themselves. This makes latent diffusion far more efficient than a typical diffusion model. (Even so, video generation still uses more energy than image or text generation. There’s just an eye-popping amount of computation involved.)  What’s a latent diffusion transformer? Still with me? There’s one more piece to the puzzle—and that’s how to make sure the diffusion process produces a sequence of frames that are consistent, maintaining objects and lighting and so on from one frame to the next. OpenAI did this with Sora by combining its diffusion model with another kind of model called a transformer. This has now become standard in generative video.  Transformers are great at processing long sequences of data, like words. That has made them the special sauce inside large language models such as OpenAI’s GPT-5 and Google DeepMind’s Gemini, which can generate long sequences of words that make sense, maintaining consistency across many dozens of sentences.  But videos are not made of words. Instead, videos get cut into chunks that can be treated as if they were. The approach that OpenAI came up with was to dice videos up across both space and time. “It’s like if you were to have a stack of all the video frames and you cut little cubes from it,” says Tim Brooks, a lead researcher on Sora. [embedded content] A selection of videos generated with Veo 3 and Midjourney. The clips have been enhanced in postproduction with Topaz, an AI video-editing tool. Credit: VaigueMan Using transformers alongside diffusion models brings several advantages. Because they are designed to process sequences of data, transformers also help the diffusion model maintain consistency across frames as it generates them. This makes it possible to produce videos in which objects don’t pop in and out of existence, for example.  And because the videos are diced up, their size and orientation do not matter. This means that the latest wave of video generation models can be trained on a wide range of example videos, from short vertical clips shot with a phone to wide-screen cinematic films. The greater variety of training data has made video generation far better than it was just two years ago. It also means that video generation models can now be asked to produce videos in a variety of formats.  What about the audio?  A big advance with Veo 3 is that it generates video with audio, from lip-synched dialogue to sound effects to background noise. That’s a first for video generation models. As Google DeepMind CEO Demis Hassabis put it at this year’s Google I/O: “We’re emerging from the silent era of video generation.”  [embedded content] The challenge was to find a way to line up video and audio data so that the diffusion process would work on both at the same time. Google DeepMind’s breakthrough was a new way to compress audio and video into a single piece of data inside the diffusion model. When Veo 3 generates a video, its diffusion model produces audio and video together in a lockstep process, ensuring that the sound and images are synched.   You said that diffusion models can generate different kinds of data. Is this how LLMs work too?  No—or at least not yet. Diffusion models are most often used to generate images, video, and audio. Large language models—which generate text (including computer code)—are built using transformers. But the lines are blurring. We’ve seen how transformers are now being combined with diffusion models to generate videos. And this summer Google DeepMind revealed that it was building an experimental large language model that used a diffusion model instead of a transformer to generate text.  Here’s where things start to get confusing: Though video generation (which uses diffusion models) consumes a lot of energy, diffusion models themselves are in fact more efficient than transformers. Thus, by using a diffusion model instead of a transformer to generate text, Google DeepMind’s new LLM could be a lot more efficient than existing LLMs. Expect to see more from diffusion models in the near future!

Sure, the clips you see in demo reels are cherry-picked to showcase a company’s models at the top of their game. But with the technology in the hands of more users than ever before—Sora and Veo 3 are available in the ChatGPT and Gemini apps for paying subscribers—even the most casual filmmaker can now knock out something remarkable. 

The downside is that creators are competing with AI slop, and social media feeds are filling up with faked news footage. Video generation also uses up a huge amount of energy, many times more than text or image generation. 

With AI-generated videos everywhere, let’s take a moment to talk about the tech that makes them work.

How do you generate a video?

Let’s assume you’re a casual user. There are now a range of high-end tools that allow pro video makers to insert video generation models into their workflows. But most people will use this technology in an app or via a website. You know the drill: “Hey, Gemini, make me a video of a unicorn eating spaghetti. Now make its horn take off like a rocket.” What you get back will be hit or miss, and you’ll typically need to ask the model to take another pass or 10 before you get more or less what you wanted. 

So what’s going on under the hood? Why is it hit or miss—and why does it take so much energy? The latest wave of video generation models are what’s known as latent diffusion transformers. Yes, that’s quite a mouthful. Let’s unpack each part in turn, starting with diffusion. 

What’s a diffusion model?

Imagine taking an image and adding a random spattering of pixels to it. Take that pixel-spattered image and spatter it again and then again. Do that enough times and you will have turned the initial image into a random mess of pixels, like static on an old TV set. 

A diffusion model is a neural network trained to reverse that process, turning random static into images. During training, it gets shown millions of images in various stages of pixelation. It learns how those images change each time new pixels are thrown at them and, thus, how to undo those changes. 

The upshot is that when you ask a diffusion model to generate an image, it will start off with a random mess of pixels and step by step turn that mess into an image that is more or less similar to images in its training set. 

But you don’t want any image—you want the image you specified, typically with a text prompt. And so the diffusion model is paired with a second model—such as a large language model (LLM) trained to match images with text descriptions—that guides each step of the cleanup process, pushing the diffusion model toward images that the large language model considers a good match to the prompt. 

An aside: This LLM isn’t pulling the links between text and images out of thin air. Most text-to-image and text-to-video models today are trained on large data sets that contain billions of pairings of text and images or text and video scraped from the internet (a practice many creators are very unhappy about). This means that what you get from such models is a distillation of the world as it’s represented online, distorted by prejudice (and pornography).

It’s easiest to imagine diffusion models working with images. But the technique can be used with many kinds of data, including audio and video. To generate movie clips, a diffusion model must clean up sequences of images—the consecutive frames of a video—instead of just one image. 

What’s a latent diffusion model? 

All this takes a huge amount of compute (read: energy). That’s why most diffusion models used for video generation use a technique called latent diffusion. Instead of processing raw data—the millions of pixels in each video frame—the model works in what’s known as a latent space, in which the video frames (and text prompt) are compressed into a mathematical code that captures just the essential features of the data and throws out the rest. 

A similar thing happens whenever you stream a video over the internet: A video is sent from a server to your screen in a compressed format to make it get to you faster, and when it arrives, your computer or TV will convert it back into a watchable video. 

And so the final step is to decompress what the latent diffusion process has come up with. Once the compressed frames of random static have been turned into the compressed frames of a video that the LLM guide considers a good match for the user’s prompt, the compressed video gets converted into something you can watch.  

With latent diffusion, the diffusion process works more or less the way it would for an image. The difference is that the pixelated video frames are now mathematical encodings of those frames rather than the frames themselves. This makes latent diffusion far more efficient than a typical diffusion model. (Even so, video generation still uses more energy than image or text generation. There’s just an eye-popping amount of computation involved.) 

What’s a latent diffusion transformer?

Still with me? There’s one more piece to the puzzle—and that’s how to make sure the diffusion process produces a sequence of frames that are consistent, maintaining objects and lighting and so on from one frame to the next. OpenAI did this with Sora by combining its diffusion model with another kind of model called a transformer. This has now become standard in generative video. 

Transformers are great at processing long sequences of data, like words. That has made them the special sauce inside large language models such as OpenAI’s GPT-5 and Google DeepMind’s Gemini, which can generate long sequences of words that make sense, maintaining consistency across many dozens of sentences. 

But videos are not made of words. Instead, videos get cut into chunks that can be treated as if they were. The approach that OpenAI came up with was to dice videos up across both space and time. “It’s like if you were to have a stack of all the video frames and you cut little cubes from it,” says Tim Brooks, a lead researcher on Sora.

A selection of videos generated with Veo 3 and Midjourney. The clips have been enhanced in postproduction with Topaz, an AI video-editing tool. Credit: VaigueMan

Using transformers alongside diffusion models brings several advantages. Because they are designed to process sequences of data, transformers also help the diffusion model maintain consistency across frames as it generates them. This makes it possible to produce videos in which objects don’t pop in and out of existence, for example. 

And because the videos are diced up, their size and orientation do not matter. This means that the latest wave of video generation models can be trained on a wide range of example videos, from short vertical clips shot with a phone to wide-screen cinematic films. The greater variety of training data has made video generation far better than it was just two years ago. It also means that video generation models can now be asked to produce videos in a variety of formats. 

What about the audio? 

A big advance with Veo 3 is that it generates video with audio, from lip-synched dialogue to sound effects to background noise. That’s a first for video generation models. As Google DeepMind CEO Demis Hassabis put it at this year’s Google I/O: “We’re emerging from the silent era of video generation.” 

The challenge was to find a way to line up video and audio data so that the diffusion process would work on both at the same time. Google DeepMind’s breakthrough was a new way to compress audio and video into a single piece of data inside the diffusion model. When Veo 3 generates a video, its diffusion model produces audio and video together in a lockstep process, ensuring that the sound and images are synched.  

You said that diffusion models can generate different kinds of data. Is this how LLMs work too? 

No—or at least not yet. Diffusion models are most often used to generate images, video, and audio. Large language models—which generate text (including computer code)—are built using transformers. But the lines are blurring. We’ve seen how transformers are now being combined with diffusion models to generate videos. And this summer Google DeepMind revealed that it was building an experimental large language model that used a diffusion model instead of a transformer to generate text. 

Here’s where things start to get confusing: Though video generation (which uses diffusion models) consumes a lot of energy, diffusion models themselves are in fact more efficient than transformers. Thus, by using a diffusion model instead of a transformer to generate text, Google DeepMind’s new LLM could be a lot more efficient than existing LLMs. Expect to see more from diffusion models in the near future!

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

Alkira advances NaaS for the agentic AI era

The practical difference spans a number of areas including the format of responses. Rather than returning raw JSON that requires parsing and interpretation, the MCP Server can deliver tabular summaries. An operator or AI agent can request a deployment overview and receive structured data showing region counts, segment configurations, connector

Read More »

Cato extends SASE platform to unmanaged devices

Cato Networks this week introduced the Cato Browser Extension that expands the company’s secure access service edge (SASE) platform and its universal zero-trust network access (ZTNA) capabilities to unmanaged devices and distributed bring-your-own-device (BYOD) endpoints. The Cato Browser Extension, according to the company, is a lightweight onramp to its SASE

Read More »

Energy Department Selects Four Companies for Advanced Nuclear Fuel Line Pilot Projects

WASHINGTON— The U.S. Department of Energy (DOE) today announced another step forward in the Trump administration’s efforts to strengthen domestic supply chains for nuclear fuel. DOE selected Oklo Inc., Terrestrial Energy Inc., TRISO-X LLC, and Valar Atomics Inc. for its new pilot program to build advanced nuclear fuel lines. Today’s action will help strengthen America’s national security, reduce reliance on foreign sources of enriched uranium and support the Department’s Reactor Pilot Program that aims to have at least three reactors achieve criticality by July 4, 2026. “President Trump has made clear that a strong nuclear sector is a central component of America’s energy security and prosperity,” said Deputy Secretary of Energy James P. Danly. “Restoring a secure domestic fuel supply will ensure that advanced reactors can move quickly from design to deployment and into operation. The ability to produce these fuels is essential to ensuring American leadership in nuclear energy and to meeting the nation’s growing demand for reliable power.” This is the second round of conditional selections under DOE’s Fuel Line Pilot Program. DOE previously selected Standard Nuclear to build and operate TRISO fuel fabrication facilities. The following projects will leverage the Department’s authorization process to ensure a robust supply of fuel is available for research, development, and demonstration purposes— including the 11 reactors initially selected to participate in DOE’s Reactor Pilot Program: Oklo Inc. (Santa Clara, California) – To build and operate three fuel fabrication facilities to support their Aurora and Pluto reactors, and possibly other fast reactors.  Terrestrial Energy Inc (Charlotte, NC) – To develop the Terrestrial Energy Fuel Line Assembly to demonstrate a fuel salt fabrication process in a phased approach.  TRISO-X Inc. (Oak Ridge, Tennessee) – To build and operate an additional fuel fabrication laboratory facility to enable pilot-scale integration, training, and system validation to

Read More »

Treasuries Gain as USA Gov Shutdown May Follow Month-End

Treasuries gained Monday, supported by a slump in oil prices and a rally in UK government bonds, and by anticipation of buying into Tuesday’s month-end index rebalancing. Yields declined as much as five basis points across tenors with the 10-year note’s falling to 4.14%. The 30-year bond’s dipped below 4.70% for the first time since Sept. 18. US benchmark crude oil futures dropped more than 4% on signs OPEC+ will hike production again in November. The prospect of a US government shutdown beginning Wednesday also has implications for the Treasuries market, as shutdowns are associated with gains for bonds based on their potential to restrain the economy. “There’s a global move lower today in yields,” said Angelo Manolatos, an interest-rate strategist at Wells Fargo Securities. “It’s likely a combination of quarter-end flow dynamics and the possibility of a government shutdown. Yields typically drop modestly during government shutdowns that last at least five days.” The market racked up gains even as Cleveland Fed President Beth Hammack — who becomes a voting member of the central bank’s rate-setting committee next year — reiterated her view that inflation remains too high to warrant cutting interest rates. Futures markets continue to anticipate about 100 basis points of additional Fed easing over the next 12 months. Expectations for Fed rate cuts rest mainly on signs of stress in the US labor market, where job creation has slowed precipitously in recent months. September data is set to be released on Friday. Tuesday’s month-end bond index rebalancing — to add eligible bonds created during the month and remove those that no longer fit the index criteria — typically drives buying by passive and other index-tracking investment funds that can support the market if their needs exceed expectations.  The rebalancing will increase the duration of the Bloomberg Treasury index by an estimated 0.06

Read More »

Russia Expects Oil and Gas Revenue at Lowest in Years

The Russian government expects the oil and gas industries’ contributions to the national budget this year to drop to the lowest since the pandemic of 2020 after prices for its fuels dipped and the ruble strengthened. Moscow will gather about 8.65 trillion rubles ($100 billion) in taxes from the oil and gas industry, according to amendments to Russia’s 2025 budget. That’s about 22 percent less than last year’s revenues, according to the amendments, published Monday on the website of the State Duma, the lower chamber of the parliament.  Levies from oil and gas producers are critical for the Kremlin, as they are projected to account for almost a quarter of revenues into state coffers this year. Moscow plans to keep boosting military spending to finance the ongoing war in Ukraine, and will raise taxes like VAT and increase borrowing to bridge the budget gap. In a drive to further bear down on Russia’s energy revenues, western nations and allies have adopted wide-ranging sanctions. Most recently, US President Donald Trump has been putting pressure on partners in the North Atlantic Treaty Organization, including Turkey, to stop buying Russia’s barrels altogether.  Urals, Russia’s main export blend, is projected to average $58 a barrel this year, compared with $66.60 last year. Despite sanctions, the decline is mainly driven by lower crude prices amid concerns over global economic growth.  Russia’s government sees the average Urals discount to global benchmark Brent at $12 a barrel. The gap has shrunk compared with the earlier years of the war, but is still markedly wider than the historic discount of $2-4 a barrel because of sanctions.  The strengthening of the national currency is another reason behind declining revenues. For this year, the government projects the exchange rate at 86.1 rubles to a US dollar, compared with 92.4 rubles a dollar

Read More »

Exxon to Cut 2,000 Jobs

Exxon Mobil Corp. plans to cut about 2,000 jobs globally as the Texas oil company consolidates smaller offices into regional hubs as part of its long-term restructuring plan.  The reductions represent about 3% to 4% of Exxon’s global workforce and are part of the company’s ongoing efficiency drive, Chief Executive Officer Darren Woods said in an memo to employees Tuesday. About half will be in the Europe and most of rest in Canada at Calgary-based Imperial Oil Ltd., which is nearly 70% owned by Exxon. Chevron Corp., ConocoPhillips and BP Plc are among major oil companies to have also announced thousands of job cuts in recent months as crude prices tumbled this year in response to increased supplies from OPEC and its allies. Exxon, however, has been on a major internal restructuring push since 2019 as Woods sought to simplify the company’s sprawling global footprint that came as a result of the merger with Mobil two decades ago.  Exxon is making “tough decisions” that build upon a years-long effort to improve competitiveness, Woods wrote in the memo. “The changes we’ve announced today will further strengthen our advantages and grow the gap with our competition, helping to keep us in the lead for decades to come,” he said.   The oil giant will cut 1,200 positions in the European Union and Norway by the end of 2027, with layoffs making up half of the reductions, it said in a statement. Imperial will cut about 900 positions, or 20% of its workforce, in the same time period, helping reduce operational expenses by C$150 million ($108 million) annually.  The regional hubs will focus on Exxon’s major growth initiatives such as oil in Guyana, liquefied natural gas along the Gulf Coast and trading globally. For example, the company recently announced plans to move employees from Brussels and Leatherhead, UK, to central London, where it’s recently

Read More »

Democratic House bill would reverse Trump energy policies, bolster RTO oversight

Democratic Reps. Sean Casten of Illinois and Mike Levin from California are preparing to introduce sweeping energy legislation that would reverse Trump administration policies, including by restoring clean energy tax credits and limiting the U.S. Department of Energy’s ability to declare “energy emergencies” to keep fossil-fueled power plants from retiring. The Trump administration is driving up the cost of electricity by creating barriers to clean energy investment to support higher-cost fossil fuel sectors, according to Casten. “You’ve now got this surging demand [from data centers and other loads] — and particularly if we’re not going to have an administration that’s going to encourage competitive markets — then that means that the only people who are going to be able to build stuff are regulated utilities with mandatory capital amortization and so it’s very hard to see anything but upward pressure on electric prices,” Casten said Monday in an interview. The draft Cheap Energy Act aims to put downward pressure on electricity bills by bolstering energy efficiency efforts, supporting grid-enhancing technologies and helping provide lower-cost renewable energy projects access to the grid, according to Casten. One section of the legislation would increase oversight of regional transmission organizations. “Everybody in the electric regulatory world, including lots of [Federal Energy Regulatory Commission] commissioners … and frankly a lot of utility executives, knows that there’s a massive governance problem in the RTO space,” Casten said. Governors in the PJM Interconnection footprint, for example, are pressing for a role in the RTO’s decisionmaking. Provisions in the bill would require FERC to reform the governance and stakeholder participation practices of RTOs and independent system operators, according to a summary of the draft bill. They would also require independent transmission monitors to facilitate the cost-effective construction of transmission. Partly, those provisions would make it easier for outsiders

Read More »

EPA mulls postponing coal plant wastewater compliance, changes to Regional Haze Rule

Dive Brief: The U.S. Environmental Protection Agency on Monday proposed giving coal power generators additional time to comply with new wastewater disposal guidelines and said it also would consider regulatory changes to the Clean Air Act’s Regional Haze Rule. EPA’s proposals are part of a package of coal-supporting actions the Trump administration unveiled this week, including $625 million to retrofit and recommission coal plants from the Department of Energy. Extending compliance for EPA’s effluent limitations guidelines would reduce electricity costs by approximately $30 million to $200 million annually, the agency estimated. Sierra Club Beyond Coal Campaign Director Laurie Williams countered that every day the requirements are delayed means more people “will be exposed to higher levels of toxic pollution.” Dive Insight: Residual waste from burning coal contains contaminants including mercury, cadmium and arsenic, according to the EPA. The federal government strengthened regulation on coal combustion residuals following a disastrous coal ash spill from the Tennessee Valley Authoriy’s Kingston plant in 2008, which contaminated some 300 acres of land, making it one of the largest industrial spills in U.S. history.  EPA revised its ELG requirements last year to include stronger protections around coal ash wastewater pollution. The changes included requiring plants to halt some types of discharges by 2029 or to commit by the end of this year to cease burning coal by 2034. EPA’s proposal would extend both the compliance and the notice deadlines. The agency said the move would “reduce costs for facilities and help with electricity reliability and affordability.” Comments are due 30 days after the proposal is published in the Federal Register. EPA’s current ELGs for wastewater discharges from steam electric power plants “are potentially costly to an electric power sector that is struggling with increasing demand as AI is booming, data centers are being constructed and operated around

Read More »

New Partnerships Form to Deploy Digital Energy Twins in Data Centers

Of course, partnerships have been ongoing in the data center and AI realms. For example, Microsoft and AMD have a long-standing collaboration to provide powerful cloud solutions for enterprise workloads, high-performance computing, and AI. Google Cloud and NVIDIA have a deep partnership across the AI stack, allowing Google Cloud to integrate NVIDIA’s latest hardware and software into its services. Colocation providers are collaborating with tech companies to offer turnkey AI solutions, which hardware, AI research labs, and cloud providers are joining forces in cross-industry alliances to support AI development. What’s Next for Digital Energy Twins Analysts anticipate that the next wave of digital energy twins will feature deeper integration with AI and the Internet of Things (IoT) for self-updating, autonomous operations, and enhanced predictive capabilities, moving beyond monitoring to proactive simulation and optimization. Key developments include increased integration with quantum computing, the development of interoperable and standardized ecosystems, and a greater focus on cybersecurity to protect these complex systems. These advancements aim to drive greater operational efficiency, improve asset performance, reduce downtime, and support sustainability goals across the energy sector.

Read More »

South Korea’s data center fire triggers global scrutiny of lithium-ion batteries and DR architecture

Kasthuri Jagadeesan, research director at Everest Group, said enterprises should audit centralization risks by mapping interdependencies across UPS, cooling, and shared power zones. “The NIRS case illustrates that redundancy alone cannot protect against weak compartmentalization,” she said, noting that outages can cost millions per hour. “Geographic redundancy is only effective if failover processes are tested and staff are trained to execute under pressure,” Jaura said. “CIOs must validate that DR plans are operational, not theoretical. This means regular, realistic testing, cross-functional engagement, and continuous improvement.” IDC research shows that centralized facilities offer economies of scale but concentrate risk, while distributed and modular approaches enhance resilience and reduce single points of failure, according to Jaura. “Diversify datacenter locations to mitigate regional risks,” he advised. “Invest in modular and mobile data center solutions for flexibility and rapid recovery.” Market implications Rai said the incident may instigate “heightened due diligence and a more cautious pace of adoption,” but lithium-ion technology’s advantages remain compelling. “What is likely to change is that enterprises will demand stronger safety certifications and vendor accountability, and accelerate investment in disaster recovery planning, geographical redundancy, and resilience frameworks.” Kalyani Devrukhkar, senior analyst at Everest Group, said both enterprises and regulators will be more demanding about safety standards. “Some organizations may look at alternatives like sodium-ion or advanced valve-regulated lead-acid, and insurers will almost certainly raise premiums where risk is seen as high,” she said, noting that NFPA 855 and newer International Fire Code editions now include stricter requirements for lithium-ion battery systems. “Enterprises are increasing budgets for business continuity management, IT disaster recovery planning, and alternative site management,” Jaura said. For CIOs, Jaura recommended a business impact analysis-driven framework that balances efficiency with safety and compliance. “The decision is not binary — mitigation such as advanced monitoring, fire suppression, and

Read More »

Zayo launches DynamicLink NaaS platform with core-based service delivery

At customer locations, only a lightweight demarcation device is required. This equipment creates a clean handoff point but contains no processing logic or intelligence. All routing, security, traffic management and network services run on the pre-deployed hardware in Zayo’s facilities. “We’ve basically taken the logic out of those boxes that used to sit on premise, sucked that into the core of the network, and then put it on hardware that can do extremely fast multi-terabyte level, single-packet processing. You know, line rate, speed performance,” Long explained. Service model: One port, multiple functions DynamicLink’s commercial model centers on port-based pricing. Organizations purchase ports at locations where they need connectivity. This includes headquarters, branch offices, colocation facilities, cloud on-ramps or any location Zayo’s network reaches. Each port provides flexible capacity that can be allocated across different services. A single 10 Gbps port might simultaneously support point-to-point Ethernet between facilities, dedicated internet access and cloud connectivity. Customers reconfigure these allocations through a self-service portal. “If you have a dynamic link port, that, if you have, like, a dynamic link in three different locations, let’s say it’s at a headquarters location and two of your data centers, you can use that to have a point to point Ethernet,” Long said. “If you want to use those 10 gigs at one location to go out to the internet, you can use it to go out to the internet.” Previously, each of these functions required ordering a separate service. An organization wanting dedicated internet access ordered a DIA port. Point-to-point Ethernet between data centers required two separate ports. Each service involved manual provisioning and fixed configuration.

Read More »

Scaling Stargate: OpenAI’s Five New U.S. Data Centers Push Toward 10 GW AI Infrastructure

Stargate is OpenAI’s massive AI infrastructure initiative, developed as a joint venture in partnership with Oracle and SoftBank. Formally announced in January 2025, the program is accelerating rapidly with the disclosure of five new U.S. data center sites. These additions—along with the flagship development in Abilene, Texas, and other ongoing projects—bring Stargate’s total planned capacity to nearly 7 gigawatts (GW). The cumulative investment estimate has now topped $400 billion as the program heads toward its ultimate goal: a 10 GW, $500 billion buildout. While the initiative focuses on building capacity with non-Microsoft partners, Microsoft remains a key technology partner and OpenAI’s primary cloud provider (Azure). Where Are the Five New Sites? The next wave of Stargate capacity is landing in regions already familiar with large-scale data center development. Based on public reporting and company statements, the five identified sites are: Shackelford County, Texas (greater Abilene expansion): An extension of the area already hosting Vantage Data Centers’ Frontier project, a $25 billion development on 1,200 acres. Milam County, Texas (Central Texas growth corridor): Previously announced as the home of a SoftBank-led Stargate data center campus. Doña Ana County, New Mexico (Las Cruces area): Linked to Project Jupiter, a proposed $165 billion build spearheaded by BorderPlex Digital Assets, with Stack Infrastructure reported as a potential participant. Lordstown, Ohio (Eastern PJM/FirstEnergy territory): Redevelopment of a former GM/Foxconn complex, being repositioned as a large-scale AI campus through a collaboration between OpenAI, Oracle, and SoftBank. An additional Midwest site (TBD): Location yet to be disclosed. These builds are being advanced under partnership models, with Oracle expected to lead three of the sites and SoftBank/SB Energy two. Together, they reinforce Stargate’s path toward a 10 GW national roadmap. Scale and Performance Goals With the addition of the five new campuses, plus Abilene and other previously announced

Read More »

Q3 Executive Roundtable Recap

AI-scale workloads are reshaping the fundamentals of data center design. For Data Center Frontier’s Q3 2025 Executive Roundtable, three industry leaders tackled the most urgent challenges: managing thermal and water risk at scale, balancing CapEx vs. OpEx in the race to build, and breaking down silos as cooling, water, and power systems converge. <!–> Sept. 26, 2025 3 min read –>

Read More »

‘Nomads at the Summit’ Podcasts – Recorded Live at DCF Trends Summit 2025

@import url(‘https://fonts.googleapis.com/css2?family=Inter:[email protected]&display=swap’); a { color: #1796c1; } .ebm-page__main h1, .ebm-page__main h2, .ebm-page__main h3, .ebm-page__main h4, .ebm-page__main h5, .ebm-page__main h6 { font-family: Inter; } body { line-height: 150%; letter-spacing: 0.025em; font-family: Inter; } button, .ebm-button-wrapper { font-family: Inter; } .label-style { text-transform: uppercase; color: var(–color-grey); font-weight: 600; font-size: 0.75rem; } .caption-style { font-size: 0.75rem; opacity: .6; } #onetrust-pc-sdk [id*=btn-handler], #onetrust-pc-sdk [class*=btn-handler] { background-color: #1796c1 !important; border-color: #1796c1 !important; } #onetrust-policy a, #onetrust-pc-sdk a, #ot-pc-content a { color: #1796c1 !important; } #onetrust-consent-sdk #onetrust-pc-sdk .ot-active-menu { border-color: #1796c1 !important; } #onetrust-consent-sdk #onetrust-accept-btn-handler, #onetrust-banner-sdk #onetrust-reject-all-handler, #onetrust-consent-sdk #onetrust-pc-btn-handler.cookie-setting-link { background-color: #1796c1 !important; border-color: #1796c1 !important; } #onetrust-consent-sdk .onetrust-pc-btn-handler { color: #1796c1 !important; border-color: #1796c1 !important; background-color: undefined !important; } Welcome to Nomads at the Summit, a new podcast series from Data Center Frontier in partnership with the Nomad Futurist Foundation. Recorded live at the 2025 Data Center Frontier Trends Summit (Aug. 26-28), here we sit down with industry leaders, innovators, and change-makers shaping the future of digital infrastructure. Join hosts Nabeel Mahmood and Phillip Koblence of Nomad Futurist, alongside DCF editorial leadership including Editor at Large Melissa Farney and Senior Editor David Chernicoff, for these candid conversations that highlight the ideas, talent, and technologies driving the next chapter of the data center industry. Whether you attended the DCF Trends Summit in person or are just now tuning in from afar, Nomads at the Summit gives you a behind-the-scenes look at the people and innovations defining what’s next in digital infrastructure. <!–> EPISODE LIST ]–> Waste Heat to Water – The Path Towards Water Positive Data Centers In this DCF Trends-Nomads at the Summit Podcast episode, Matt Grandbois, Vice President at AirJoule, introduces a game-changing approach to one of the data center industry’s most pressing challenges: water sustainability. As power-hungry, high-density environments collide with growing water scarcity

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »