Liquid AI has released LFM2-VL, a new generation of vision-language foundation models designed for efficient deployment across a wide range of hardware — from smartphones and laptops to wearables and embedded systems.
The models promise low-latency performance, strong accuracy, and flexibility for real-world applications.
LFM2-VL builds on the company’s existing LFM2 architecture, extending it into multimodal processing that supports both text and image inputs at variable resolutions.
According to Liquid AI, the models deliver up to twice the GPU inference speed of comparable vision-language models, while maintaining competitive performance on common benchmarks.
“Efficiency is our product,” wrote Liquid AI co-founder and CEO Ramin Hasani in a post on X announcing the new model family:
meet LFM2-VL: an efficient Liquid vision-language model for the device class. open weights, 440M & 1.6B, up to 2× faster on GPU with competitive accuracy, Native 512×512, smart patching for big images.
efficiency is our product @LiquidAI_
download them on @huggingface:… pic.twitter.com/3Lze6Hc6Ys
— Ramin Hasani (@ramin_m_h) August 12, 2025
Two variants for different needs
The release includes two model sizes:
- LFM2-VL-450M — a hyper-efficient model with fewer than half a billion parameters (the model's internal settings), aimed at highly resource-constrained environments.
- LFM2-VL-1.6B — a more capable model that remains lightweight enough for single-GPU and device-based deployment.
Both variants process images at native resolutions up to 512×512 pixels, avoiding distortion or unnecessary upscaling.
For larger images, the system applies non-overlapping patching and adds a thumbnail for global context, enabling the model to capture both fine detail and the broader scene.
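To make that preprocessing step concrete, here is a minimal sketch of how non-overlapping patching plus a global thumbnail could work, assuming 512×512 tiles and an illustrative 256×256 thumbnail. Liquid AI has not published this exact routine, so the tile and thumbnail sizes here are assumptions drawn from the description above.

```python
from PIL import Image

TILE = 512          # assumed patch size, matching the model's native resolution
THUMB = (256, 256)  # illustrative thumbnail size for global context

def split_into_patches(img: Image.Image, tile: int = TILE):
    """Cut an image into non-overlapping tile x tile patches, left to right, top to bottom."""
    w, h = img.size
    patches = []
    for top in range(0, h, tile):
        for left in range(0, w, tile):
            patches.append(img.crop((left, top, min(left + tile, w), min(top + tile, h))))
    return patches

def preprocess(img: Image.Image):
    """Images at or below the native resolution pass through untouched;
    larger images become a list of patches plus a downscaled thumbnail."""
    if img.width <= TILE and img.height <= TILE:
        return {"patches": [img], "thumbnail": None}
    return {"patches": split_into_patches(img), "thumbnail": img.resize(THUMB)}
```

In this scheme the thumbnail gives the model a low-resolution view of the whole scene, while the individual tiles preserve fine detail in each region.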
Background on Liquid AI
Liquid AI was founded by former researchers from MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) with the goal of building AI architectures that move beyond the widely used transformer model.
The company’s flagship innovation, the Liquid Foundation Models (LFMs), draws on principles from dynamical systems, signal processing, and numerical linear algebra to produce general-purpose AI models capable of handling text, video, audio, time series, and other sequential data.
Unlike traditional architectures, Liquid’s approach aims to deliver competitive or superior performance using significantly fewer computational resources, allowing for real-time adaptability during inference while maintaining low memory requirements. This makes LFMs well suited for both large-scale enterprise use cases and resource-limited edge deployments.
In July 2025, the company expanded its platform strategy with the launch of the Liquid Edge AI Platform (LEAP), a cross-platform SDK designed to make it easier for developers to run small language models directly on mobile and embedded devices.
LEAP offers OS-agnostic support for iOS and Android, integration with both Liquid’s own models and other open-source SLMs, and a built-in library with models as small as 300MB—small enough for modern phones with minimal RAM.
Its companion app, Apollo, enables developers to test models entirely offline, aligning with Liquid AI’s emphasis on privacy-preserving, low-latency AI. Together, LEAP and Apollo reflect the company’s commitment to decentralizing AI execution, reducing reliance on cloud infrastructure, and empowering developers to build optimized, task-specific models for real-world environments.
Speed/quality trade-offs and technical design
LFM2-VL uses a modular architecture combining a language model backbone, a SigLIP2 NaFlex vision encoder, and a multimodal projector.
The projector includes a two-layer MLP connector with pixel unshuffle, reducing the number of image tokens and improving throughput.
Users can adjust parameters such as the maximum number of image tokens or patches, allowing them to balance speed and quality depending on the deployment scenario. The training process involved approximately 100 billion multimodal tokens, sourced from open datasets and in-house synthetic data.
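Pixel unshuffle is a standard space-to-depth operation, and the short PyTorch sketch below illustrates why it reduces token counts: a downscale factor of 2 folds every 2×2 block of spatial positions into the channel dimension, cutting the number of image tokens fourfold before a two-layer MLP maps them into the language model's embedding space. The tensor shapes, downscale factor, and projector dimensions are illustrative assumptions, not LFM2-VL's actual configuration.

```python
import torch
import torch.nn as nn

# Illustrative shapes: 1 image, 768 vision-encoder channels on a 32x32 grid
features = torch.randn(1, 768, 32, 32)          # 32*32 = 1,024 spatial positions

unshuffle = nn.PixelUnshuffle(downscale_factor=2)
compact = unshuffle(features)                    # -> (1, 3072, 16, 16)

# Flattening the grid into tokens shows the 4x reduction described above
tokens_before = features.flatten(2).transpose(1, 2)   # (1, 1024, 768)
tokens_after = compact.flatten(2).transpose(1, 2)     # (1, 256, 3072)
print(tokens_before.shape[1], "->", tokens_after.shape[1])  # 1024 -> 256 image tokens

# A two-layer MLP connector (assumed dimensions) then projects each compacted
# token into the language model's embedding space
projector = nn.Sequential(nn.Linear(3072, 2048), nn.GELU(), nn.Linear(2048, 2048))
lm_inputs = projector(tokens_after)              # (1, 256, 2048)
```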
Performance and benchmarks
The models achieve competitive benchmark results across a range of vision-language evaluations. LFM2-VL-1.6B scores well on RealWorldQA (65.23), InfoVQA (58.68), and OCRBench (742), and maintains solid results in multimodal reasoning tasks.

In inference testing, LFM2-VL achieved the fastest GPU processing times in its class on a standard workload of a 1024×1024 image and a short prompt.

Licensing and availability
LFM2-VL models are available now on Hugging Face, along with example fine-tuning code in Colab. They are compatible with Hugging Face transformers and TRL.
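For orientation, a minimal loading-and-inference sketch with the transformers library might look like the following. The repository id, chat-template format, and generation settings are assumptions based on common conventions for image-text models on Hugging Face; consult Liquid AI's model cards for the exact usage.

```python
import torch
from PIL import Image
from transformers import AutoModelForImageTextToText, AutoProcessor

model_id = "LiquidAI/LFM2-VL-1.6B"  # assumed repo id; check the Hugging Face model card

processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForImageTextToText.from_pretrained(
    model_id, device_map="auto", torch_dtype=torch.bfloat16, trust_remote_code=True
)

# A single-turn conversation with one image and one text prompt
conversation = [
    {"role": "user", "content": [
        {"type": "image", "image": Image.open("example.jpg")},
        {"type": "text", "text": "Describe this image."},
    ]}
]

inputs = processor.apply_chat_template(
    conversation, add_generation_prompt=True, tokenize=True,
    return_dict=True, return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=128)
print(processor.batch_decode(outputs, skip_special_tokens=True)[0])
```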
The models are released under a custom “LFM1.0 license”. Liquid AI has described this license as based on Apache 2.0 principles, but the full text has not yet been published.
The company has indicated that commercial use will be permitted under certain conditions, with different terms for companies above and below $10 million in annual revenue.
With LFM2-VL, Liquid AI aims to make high-performance multimodal AI more accessible for on-device and resource-limited deployments, without sacrificing capability.
