Mistral launches API for building AI agents that run Python, generate images, perform RAG and more

Stay Ahead, Stay ONMINE

Mistral launches API for building AI agents that run Python, generate images, perform RAG and more

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More The well-funded and innovative French AI startup Mistral AI is introducing a new service for enterprise customers and independent software developers alike. Mistral’s Agents application programming interface (API) allows third-party software developers to easily and rapidly […]

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More

The well-funded and innovative French AI startup Mistral AI is introducing a new service for enterprise customers and independent software developers alike.

Mistral’s Agents application programming interface (API) allows third-party software developers to easily and rapidly add autonomous generative AI capabilities — such as pulling information securely from enterprise documents — to their existing enterprise and independent applications using the newest Mistral proprietary model, Medium 3, as the “brains” of each agent.

It’s essentially designed to be a “plug and play” platform, with nearly limitless customization, for getting AI agents up and running to handle enterprise and developer workflows.

“With Agents API, we empower enterprises to use AI in more practical and impactful ways,” wrote Mistral’s Head of Developer Relations, Sophia Yang, PhD, on X.

Designed to complement Mistral’s existing Chat Completion API, this latest release focuses on agentic orchestration, built-in connectors, persistent memory, and the flexibility to coordinate multiple AI agents to tackle complex tasks.

Surpassing the limits of typical LLMs…

While traditional language models excel at generating text, they often fall short in executing actions or maintaining conversational context over time.

Mistral’s Agents API addresses these limitations by providing developers with the tools to create AI agents capable of performing real-world tasks, managing interactions across conversations, and dynamically orchestrating multiple agents when needed.

The Agents API comes equipped with several built-in connectors, including:

Code Execution: Securely runs Python code, enabling applications in data visualization, scientific computing, and other technical tasks.
Image Generation: Leverages Black Forest Lab FLUX1.1 [pro] Ultra to create custom visuals for marketing, education, or artistic uses.
Document Library: Accesses documents stored in Mistral Cloud, enhancing retrieval-augmented generation (RAG) features.
Web Search: Allows agents to retrieve up-to-date information from online sources, news outlets, and other reputable platforms.

The API also supports MCP tools, which connect agents to external resources like APIs, databases, user data, and documents—extending the agents’ abilities to handle dynamic, real-world content.

Enhanced accuracy using web search

One significant feature of the Agents API is the integration of web search as a connector, which notably improves performance on tasks requiring accurate, up-to-date information.

In benchmark testing on the SimpleQA dataset, Mistral Large’s accuracy rose from 23% to 75% when web search was enabled. Mistral Medium showed a similar improvement, increasing from 22.08% to 82.32%.

Real-world use cases

Mistral AI has highlighted a range of use cases for the Agents API, demonstrating its flexibility across multiple sectors:

Coding Assistant with GitHub: An agent oversees a developer assistant powered by DevStral, managing tasks and automating code development workflows.
Linear Tickets Assistant: Transforms call transcripts into project deliverables using multi-server MCP architecture.
Financial Analyst: Sources financial metrics and securely compiles reports through orchestrated MCP servers.
Travel Assistant: Helps users plan trips, book accommodations, and manage travel needs.
Nutrition Assistant: Supports users in setting dietary goals, logging meals, and receiving personalized recommendations.

Managing context and conversations

The Agents API’s stateful conversation system ensures that agents maintain context throughout their interactions. Developers can start new conversations or continue existing ones without losing the thread, with conversation history stored and accessible for future use.

Additionally, the API supports streaming output, enabling real-time updates in response to user requests or agent actions.

Dynamic orchestration of multiple agents

A core capability of the Agents API is its ability to coordinate multiple agents seamlessly. Developers can create customized workflows, assigning specific tasks to specialized agents and enabling handoffs as needed. This modular approach allows enterprises to deploy AI agents that work together to solve complex problems more effectively.

What the Mistral Agents API means for enterprise technical decision-makers

For professionals like the Lead AI Engineer or Senior AI Engineer, the Mistral Agents API represents a powerful addition to their AI toolkit.

The ability to dynamically orchestrate agents and seamlessly integrate real-world data sources means these roles can deploy AI solutions faster and with greater precision—critical in environments where quick iteration and performance tuning are paramount.

Specifically, these professionals often balance tight deployment timelines and the need to maintain model performance across different environments.

The Agents API’s built-in connectors—like web search, document libraries, and secure code execution—can significantly reduce the need for ad hoc integrations and patchwork tooling. This streamlined approach saves time and lowers friction, allowing teams to focus more on fine-tuning models and less on building surrounding infrastructure.

Moreover, stateful conversation management and real-time updates through streaming output align well with the demands of AI orchestration and deployment. These features make it easier for engineers to maintain context across iterations and ensure consistent, high-quality interactions with end users.

For those responsible for introducing and integrating new AI tools into organizational workflows, the MCP tool support also ensures that agents can access data from a wide range of APIs and systems, further enhancing operational efficiency.

Continuing to bolster Mistral’s enterprise AI push

The release of the Agents API follows Mistral AI’s recent launch of Le Chat Enterprise, a unified AI assistant platform designed for enterprise productivity and data privacy. Le Chat Enterprise is powered by the new Mistral Medium 3 model, which offers impressive performance at a lower computational cost than larger models.

Mistral Medium 3 is particularly strong in software development tasks, outperforming comparable models in key coding benchmarks like HumanEval and MultiPL-E. It also shows competitive performance in multilingual and multimodal scenarios, making it an attractive option for businesses operating in diverse environments.

Le Chat Enterprise supports enterprise-grade features such as data sovereignty, hybrid deployment, and strict access controls, which can be crucial for organizations in regulated sectors. The platform consolidates AI functionality within a single environment, enabling customization, seamless integration with existing workflows, and full control over deployment and data security.

But it’s another proprietary service

Mistral’s earlier releases, like Mistral 7B, were open source and widely embraced by the developer community for their transparency and flexibility.

However, Mistral Medium 3 is a proprietary model—requiring access through Mistral’s platform, APIs, or partners—and is no longer available under an open-source license.

This shift has caused some frustration in the AI community, where open access and transparency are highly valued for experimentation and customization.

The Agents API itself also follows a proprietary framework: it is not available under an open-source license and is managed exclusively by Mistral, with access available via subscription and API calls.

Pricing and availability

Pricing for the Agents API aligns with Mistral’s broader suite of models and tools:

Mistral Medium 3: $0.4 per million input tokens and $2 per million output tokens.
Web Search Connector: $30 per 1,000 calls.
Code Execution: $30 per 1,000 calls.
Image Generation: $100 per 1,000 images.
Premium News Access: $50 per 1,000 calls.
Document Library with RAG: Included in plans like Team and Enterprise, with up to 30GB per user in some tiers.
Custom connectors, audit logs, SAML SSO, and other enterprise features: Available in Team and Enterprise plans (pricing typically requires contacting Mistral’s sales team).

For developers and enterprise customers, these costs can add up quickly—making budget considerations and careful integration planning essential.

Looking ahead

Mistral AI positions its Agents API as the backbone of enterprise-grade agentic platforms, empowering developers to create solutions that move beyond traditional text generation.

Despite the community debate around open source versus proprietary access, Mistral’s focus on enterprise-grade features, customizable workflows, and secure integrations positions this API as a significant option for businesses seeking advanced AI capabilities.

For developers and technical decision makers, the question will be whether the proprietary nature of the Agents API and the underlying models aligns with their own operational and budgetary needs. For those who prioritize rapid deployment, managed services, and full integration with enterprise systems, Mistral’s evolving platform could offer significant advantages.

For more information or to get started, Mistral AI encourages developers to explore the provided documentation and demos.

Let me know if you’d like to expand on the open-source versus proprietary discussion even more, or if you’d like to highlight another perspective!

Daily insights on business use cases with VB Daily

If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

Read our Privacy Policy

Thanks for subscribing. Check out more VB newsletters here.

An error occured.

Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy, bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

NextDecade names Mott as interim CFO

NextDecade Corp. has appointed company senior vice-president Michael (Mike) Mott as interim chief financial officer, effective Oct. 20, 2025. Mott will take over from Brent Wahl, who resigns from the company as chief financial officer, effective Oct. 20. Wahl was named chief financial officer of NextDecade in 2021 after having served

The business case for microsegmentation: Lower insurance costs, 33% faster ransomware response

Myths and misconceptions about microsegmentation The report detailed a number of reasons why organizations have not properly deployed microsegmentation. Network complexity topped the list of implementation barriers at 44%, but Weber questioned the legitimacy of that barrier. “Many organizations believe their network is too complex for microsegmentation, but once we

Three options for wireless power in the enterprise

Sensors such as these can be attached to pallets to track its location, says Srivastava. “People in Europe are very conscious about where their food is coming from and, to comply with regulations, companies need to have sensors on the pallets,” he says. “Or they might need to know that

IBM unveils advanced quantum computer in Spain

IBM executives and officials from the Basque Government and regional councils in front of Europe’s first IBM Quantum System Two, located at the IBM-Euskadi Quantum Computational Center in San Sebastián, Spain. The Basque Government and IBM unveil the first IBM Quantum System Two in Europe at the IBM-Euskadi Quantum Computational

Iberdrola Acquires 270 MW BESS Project in South Australia

Iberdrola SA bought the 270-megawatt (MW), 1,080-megawatt-hour (MWh) Tungkillo battery energy storage system (BESS) project in South Australia from RES Australia. The project “has all the key approvals and very advanced connection rights, in addition to the land already secured”, Iberdrola said in a press release. The Spanish power utility expects to start up the project 2028. Iberdrola estimates EUR 275 million ($322.04 million) in investment to complete the project. “The Australian system requires a significant increase in battery storage capacity to integrate the new renewable capacity and bring flexibility to the system”, Iberdrola said. “The location of the Tungkillo project, in the south of the country, is optimal for providing this service. “In the case of Iberdrola Australia, the storage systems provide backup capacity for its portfolio of energy sales contracts to customers”. Iberdrola has two other BESS projects under construction in Australia, it added. It expects the 65-MW Smithfield project in New South Wales and the 180-MW Broadsound project, which is part of a 360-MW solar project in Queensland, to start operations 2025 and 2026 respectively. Iberdrola already has two operational BESS projects in the country, according to an online directory of its Australian assets. Iberdrola has a 10-year dispatch control for TransGrid’s 50-MW Wallgrove Grid Battery, connected to the New South Wales transmission network. Meanwhile Iberdrola’s 25-MW Lake Bonney project is co-located with the Lake Bonney Wind Farms near Millicent in South Australia. Additionally Iberdrola has three proposed BESS projects in Australia, according to the directory. Burrenbring Battery is planned to have a capacity of one gigawatt with two to four hours duration. It is proposed to rise on privately owned land on the Widi People’s Country in the Isaac Region. Mount Doran Battery is planned to have a capacity of 200 MW and two-hour storage. Iberdrola plans to connect the project to Australian Energy Operations’ existing Elaine Terminal Station.

ChemOne Seeks $600MM Private Loan for Chemical Plant

Energy company ChemOne Holdings is seeking a $600 million private credit loan to support the construction of its chemical processing complex in Malaysia, according to people familiar with the matter. The proposed private credit deal would carry a tenor of up to 10 years and feature high single-digit pricing, the people said, who asked not to be identified discussing private matters. Several Korean institutional investors have committed around $150 million to the deal, said one of the people. A representative from ChemOne declined to comment. The deal reflects the growing global trend of using private credit for project financing. In the US, state and local infrastructure projects increasingly rely on alternative funding sources as pandemic-era stimulus fades and the Trump administration aims to reduce spending. In Asia, borrowers are also tapping the asset class to fund developments such as data centers and electric vehicle charging stations. The private loan will be subordinated to an existing $3.5 billion senior debt facility that was launched end of last year, and is now fully subscribed, the people said. Both financings are intended to fund the development of the Pengerang Energy Complex in the southern state of Johor. The project is designed to be a key component of a major oil, gas and petrochemical hub strategically positioned near international shipping lanes. Its deep-water access accommodates the world’s largest crude carriers. The low carbon facility is expected to produce 5.6 million metric tons annually of aromatic and energy products, according to a press release. Lenders of the $3.5 billion facility include export credit agencies such as the Export-Import Bank of the United States, Euler Hermes AG, and Export-Import Bank of Malaysia, the press release said. WHAT DO YOU THINK? Generated by readers, the comments included herein do not reflect the views and opinions of Rigzone. All comments are subject to editorial review. Off-topic, inappropriate or

Oil Ends Third Straight Weekly Loss

Oil limped to a third week of losses, weighed down by signs the market is tipping into the surplus analysts have been awaiting. West Texas Intermediate ended the day little changed near $57 a barrel, down 2.3% this week in its longest losing streak since March. News on oversupply kept piling up this week. The International Energy Agency raised its estimate for next year’s global overhang by about 18%. And a US storage broker reported a surge in bids for securing tank capacity at the country’s key crude hub in Cushing, Oklahoma, underscoring that traders are positioning themselves for the glut. Prices for flagship US oil grades have also weakened. Crude traders are also following the on-again, off-again tensions between Washington and Beijing. President Donald Trump on Friday said higher tariffs against China were not “sustainable” expressed optimism that an upcoming meeting with Xi could yield a lasting trade peace after last week threatening an additional 100% tariff on Beijing’s goods. The shift in sentiment eased some concerns that the ongoing tit-for-tat between the two biggest crude consumers could cripple energy consumption. Trump, meanwhile, said he would hold a second meeting with Russian counterpart Vladimir Putin “within two weeks or so” aimed at ending the war in Ukraine, a scenario that stands to send prices toward $50 a barrel, according to Citigroup Inc. The US president on Friday shrugged off concerns Putin may be manipulating him. “Crude is weighing Trump’s meeting with Putin against building evidence of oversupply,” said Joe DeLaura, global energy strategist at Rabobank. “With oil markets in contango from the second quarter of next year onward, the next path is down for crude unless demand surprises to the upside, which we view as unlikely.” Western nations are turning the screws on Russia’s energy sector in a bid

Colombia Warms to Fossil Fuels

Colombia, the world’s only significant oil producer to join a bloc of nations vowing to quit fossil fuels, is poised for an about face. As President Gustavo Petro’s term winds down, most prominent candidates running to replace him are promising to put oil and gas back at the forefront of the nation’s energy policy. Even moderate and center-left candidates are calling for Colombia to start using hydraulic fracturing, or fracking, which Petro has pushed to make illegal. “If God gave us oil, coal and gas — we’ll use oil, coal and gas,” Claudia López, a former Bogotá mayor with a record of supporting progressive causes, said in an X post. Petro, Colombia’s first leftist president, made global headlines in 2023 by announcing the nation would sign the “fossil fuel non-proliferation treaty” to end oil, gas and coal production. It made the one-time Marxist guerrilla fighter a celebrity at international climate gatherings and brought heft to the non-proliferation movement, which until then was made up of mostly small island nations. The effort, however, backfired in Colombia. Petro’s refusal to allow new drilling contracts exacerbated a domestic gas shortage, forcing the nation to import fuel at higher prices. Now with Petro’s popularity waning amid rising violence and a ballooning fiscal deficit, presidential hopefuls are distancing themselves from his policies, including energy. During a recent event in Bogotá, five candidates held up “YES” signs when asked if they would authorize fracking in Colombia. Only one, Petro’s former environment minister Susana Muhamad, said no. The shift in Colombia reflects a broader retreat from the fight against climate change globally. In the US, President Donald Trump is gutting Joe Biden’s climate policies, blocking renewable energy projects and withdrawing from the Paris climate accord. And the European Union is struggling to pass laws requiring further ambitious emissions cuts and walking back many of

Phillips 66 progresses California refinery shuttering plan, Chevron El Segundo fire adds to state’s refining uncertainty

Capacity crash Phillips 66’s announcement of its progress on the refinery shuttering comes amid growing concerns by California’s in-state refiners that unfavorable market conditions alongside regulatory pressure stemming from aggressive state legislation aimed at refiners will prevent long-term viability and competitiveness of crude processing activities in the region. While state officials recently voted to delay enactment of legislation blamed for decisions by California refiners to end operations at their conventional processing sites—which, in addition to Phillips 66 in Los Angeles, includes Valero Energy Corp.’s planned closure of its 145,000-b/d Benicia refinery, just northeast of San Francisco, by midyear 2026—refiners have yet to indicate any signs of backing down from advancing existing shutdown plans. With the state’s loss of about 20% of its traditional crude processing capacity since 2020—and nearly another estimated 20% at threat with the closure of the Los Angeles and Benicia sites—California’s refining capacity may face an additional short-term blow to its consumer fuels market following a late-evening fire on Oct. 2 at Chevron 290,000-b/d refinery in El Segundo. Chevron El Segundo refinery fire “Chevron fire department personnel, including emergency responders from the City of El Segundo and Manhattan Beach [were] actively responding to an isolated fire inside the Chevron El Segundo [r]efinery [on Oct. 2]. All refinery personnel and contractors have been accounted for and there are no injuries,” Chevron said in a statement posted to its official website. “No evacuation orders for area residents [were] put in place by emergency response agencies monitoring the incident, and no exceedances have been detected by the facilities fence line monitoring system,” the operator said. In a separate early morning statement on Oct. 3, the city of El Segundo said the fire “originated at a process unit at the southeast corner of the refinery.” Details of the impacted unit, however,

OPEC+ to hike oil production by 137,000 b/d starting in November

The Organization of the Petroleum Exporting Countries, along with Russia and a few smaller producers, known as OPEC+, on Oct. 5 announced plans to increase oil production by 137,000 b/d starting in November, maintaining the same increase as in October in response to ongoing worries about a potential supply surplus. OPEC+ described its decision as a reaction to stable global economic outlooks and current healthy market fundamentals, as reflected in low oil inventories. The group also indicates that adjustments to production could be halted or reversed if circumstances warrant. By end-September 2025, OPEC+ had fully phased out the 2.2 million b/d voluntary production cuts introduced in late 2023, completing the process months earlier than initially signaled. They also confirmed their intention to fully compensate for any overproduced volume since January 2024. The next meeting will be hold on Nov. 2. “The market was expecting a somewhat larger increase from OPEC+ as shown in the structure last week. However, the modest 137,000 b/d bloats the already-oversupplied balance for the fourth quarter of 2025 and 2026,” said Janiv Shah, an analyst at Rystad Energy. Global crude oil markets are experiencing a significant bearish trend, with ICE Brent prices dropping from last week’s peak of $70/bbl to $65/bbl on Oct. 5, following OPEC’s announcement to increase production. Rystad Energy forecasts that under these circumstances, price dynamics will likely shift substantially downward. It is unlikely that ICE Brent will remain above $60-65/bbl in 2026 unless OPEC+ alters its approach or sanctions on Russia and Iran drastically curtail exports from those countries.

Roundup: Digital Realty Marks Major Milestones in AI, Quantum Computing, Data Center Development

Key features of the DRIL include: • High-Density AI and HPC Testing. The DRIL supports AI and high-performance computing (HPC) workloads with high-density colocation, accommodating workloads up to 150 kW per cabinet. • AI Infrastructure Optimization. The ePlus AI Experience Center lets businesses explore AI-specific power, cooling, and GPU resource requirements in an environment optimized for AI infrastructure. • Hybrid Cloud Validation. With direct cloud connectivity, users can refine hybrid strategies and onboard through cross connects. • AI Workload Orchestration. Customers can orchestrate AI workloads across Digital Realty’s Private AI Exchange (AIPx) for seamless integration and performance. • Latency Testing Across Locations. Enterprises can test latency scenarios for seamless performance across multiple locations and cloud destinations. The firm’s Northern Virginia campus is the primary DRIL location, but companies can also test latency scenarios between there and other remote locations. DRIL rollout to other global locations is already in progress, and London is scheduled to go live in early 2026. Digital Realty, Redeployable Launch Pathway for Veteran Technical Careers As new data centers are created, they need talented workers. To that end, Digital Realty has partnered with Redeployable, an AI-powered career platform for veterans, to expand access to technical careers in the United Kingdom and United States. The collaboration launched a Site Engineer Pathway, now live on the Redeployable platform. It helps veterans explore, prepare for, and transition into roles at Digital Realty. Nearly half of veterans leave their first civilian role within a year, often due to unclear expectations, poor skill translation, and limited support, according to Redeployable. The Site Engineer Pathway uses real-world relevance and replaces vague job descriptions with an experience-based view of technical careers. Veterans can engage in scenario-based “job drops” simulating real facility and system challenges so they can assess their fit for the role before applying. They

BlackRock’s $40B data center deal opens a new infrastructure battle for CIOs

Everest Group partner Yugal Joshi said, “CIOs are under significant pressure to clearly define their data center strategy beyond traditional one-off leases. Given most of the capacity is built and delivered by fewer players, CIOs need to prepare for a higher-price market with limited negotiation power.” The numbers bear this out. Global data center costs rose to $217.30 per kilowatt per month in the first quarter of 2025, with major markets seeing increases of 17-18% year-over-year, according to CBRE. Those prices are at levels last seen in 2011-2012, and analysts expect them to remain elevated. Gogia said, “The combination of AI demand, energy scarcity, and environmental regulation has permanently rewritten the economics of running workloads. Prices that once looked extraordinary have now become baseline.” Hyperscalers get first dibs The consolidation problem is compounded by the way capacity is being allocated. North America’s data center vacancy rate fell to 1.6% in the first half of 2025, with Northern Virginia posting just 0.76%, according to CBRE Research. More troubling for enterprises: 74.3% of capacity currently under construction is already preleased, primarily to cloud and AI providers. “The global compute market is no longer governed by open supply and demand,” Gogia said. “It is increasingly shaped by pre-emptive control. Hyperscalers and AI majors are reserving capacity years in advance, often before the first trench for power is dug. This has quietly created a two-tier world: one in which large players guarantee their future and everyone else competes for what remains.” That dynamic forces enterprises into longer planning cycles. “CIOs must forecast their infrastructure requirements with the same precision they apply to financial budgets and talent pipelines,” Gogia said. “The planning horizon must stretch to three or even five years.”

Nvidia, Infineon partner for AI data center power overhaul

The solution is to convert power right at the GPU on the server board and to upgrade the backbone to 800 volts. That should squeeze more reliability and efficiency out of the system while dealing with the heat, Infineon stated. Nvidia announced the 800 Volt direct current (VDC) power architecture at Computex 2025 as a much-needed replacement for the 54 Volt backbone currently in use, which is overwhelmed by the demand of AI processors and increasingly prone to failure. “This makes sense with the power needs of AI and how it is growing,” said Alvin Nguyen, senior analyst with Forrester Research. “This helps mitigate power losses seen from lower voltage and AC systems, reduces the need for materials like copper for wiring/bus bars, better reliability, and better serviceability.” Infineon says a shift to a centralized 800 VDC architecture allows for reduced power losses, higher efficiency and reliability. However, the new architecture requires new power conversion solutions and safety mechanisms to prevent potential hazards and costly server downtimes such as service and maintenance.

Meta details cutting-edge networking technologies for AI infrastructure

ESUN initiative As part of its standardization efforts, Meta said it would be a key player in the new Ethernet for Scale-Up Networking (ESUN) initiative that brings together AMD, Arista, ARM, Broadcom, Cisco, HPE Networking, Marvell, Microsoft, NVIDIA, OpenAI and Oracle to advance the networking technology to handle the growing scale-up domain for AI systems. ESUN will focus solely on open, standards-based Ethernet switching and framing for scale-up networking—excluding host-side stacks, non-Ethernet protocols, application-layer solutions, and proprietary technologies. The group will focus on the development and interoperability of XPU network interfaces and Ethernet switch ASICs for scale-up networks, the OCP wrote in a blog. ESUN will actively engage with other organizations such as Ultra-Ethernet Consortium (UEC) and long-standing IEEE 802.3 Ethernet to align open standards, incorporate best practices, and accelerate innovation, the OCP stated. Data center networking milestones The launch of ESUN is just one of the AI networking developments Meta shared at the event. Meta engineers also announced three data center networking innovations aimed at making its infrastructure more flexible, scalable, and efficient: The evolution of Meta’s Disaggregated Scheduled Fabric (DSF) to support scale-out interconnect for large AI clusters that span entire data center buildings. A new Non-Scheduled Fabric (NSF) architecture based entirely on shallow-buffer, disaggregated Ethernet switches that will support our largest AI clusters like Prometheus. The addition of Minipack3N, based on Nvidia’s Ethernet Spectrum-4 ASIC, to Meta’s portfolio of 51Tbps OCP switches that use OCP’s Switch Abstraction Interface and Meta’s Facebook Open Switching System (FBOSS) software stack. DSF is Meta’s open networking fabric that completely separates switch hardware, NICs, endpoints, and other networking components from the underlying network and uses OCP-SAI and FBOSS to achieve that, according to Meta. It supports Ethernet-based RoCE RDMA over Converged Ethernet (RoCE/RDMA)) to endpoints, accelerators and NICs from multiple vendors, such as Nvidia,

Arm joins Open Compute Project to build next-generation AI data center silicon

Keeping up with the demand comes down to performance, and more specifically, performance per watt. With power limited, OEMs have become much more involved in all aspects of the system design, rather than pulling silicon off the shelf or pulling servers or racks off the shelf. “They’re getting much more specific about what that silicon looks like, which is a big departure from where the data center was ten or 15 years ago. The point here being is that they look to create a more optimized system design to bring the acceleration closer to the compute, and get much better performance per watt,” said Awad. The Open Compute Project is a global industry organization dedicated to designing and sharing open-source hardware configurations for data center technologies and infrastructure. It covers everything from silicon products to rack and tray design. It is hosting its 2025 OCP Global Summit this week in San Jose, Calif. Arm also was part of the Ethernet for Scale-Up Networking (ESUN) initiative announced this week at the Summit that included AMD, Arista, Broadcom, Cisco, HPE Networking, Marvell, Meta, Microsoft, and Nvidia. ESUN promises to advance Ethernet networking technology to handle scale-up connectivity across accelerated AI infrastructures. Arm’s goal by joining OCP is to encourage knowledge sharing and collaboration between companies and users to share ideas, specifications and intellectual property. It is known for focusing on modular rather than monolithic designs, which is where chiplets come in. For example, customers might have multiple different companies building a 64-core CPU and then choose IO to pair it with, whether like PCIe or an NVLink. They then choose their own memory subsystem, deciding whether to go HBM, LPDDR, or DDR. It’s all mix and match like Legos, Awad said.

BlackRock-Led Consortium to Acquire Aligned Data Centers in $40 Billion AI Infrastructure Deal

Capital Strategy and Infrastructure Readiness The AIP consortium has outlined an initial $30 billion in equity, with potential to scale toward $100 billion including debt over time as part of a broader AI infrastructure buildout. The Aligned acquisition represents a cornerstone investment within that capital roadmap. Aligned’s “ready-to-scale” platform – encompassing land, permits, interconnects, and power roadmaps – is far more valuable today than a patchwork of single-site developments. The consortium framed the transaction as a direct response to the global AI buildout crunch, targeting critical land, energy, and equipment bottlenecks that continue to constrain hyperscale expansion. Platform Overview: Aligned’s Evolution and Strategic Fit Aligned Data Centers has rapidly emerged as a scale developer and operator purpose-built for high-density, quick-turn capacity demanded by hyperscalers and AI platforms. Beyond the U.S., Aligned extended its reach across the Americas through its acquisition of ODATA in Latin America, creating a Pan-American presence that now spans more than 50 campuses and over 5 GW of capacity. The company has repeatedly accessed both public and private capital markets, most recently securing more than $12 billion in new equity and debt financing to accelerate expansion. Aligned’s U.S.–LATAM footprint provides geographic diversification and proximity to fast-growing AI regions. The buyer consortium’s global relationships – spanning utilities, OEMs, and sovereign-fund partners – help address power, interconnect, and supply-chain constraints, all of which are critical to sustaining growth in the AI data-center ecosystem. Macquarie Asset Management built Aligned from a niche U.S. operator into a 5 GW-plus, multi-market platform, the kind of asset infrastructure investors covet as AI demand outpaces grid and supply-chain capacity. Its sale at this stage reflects a broader wave of industry consolidation among large-scale digital-infrastructure owners. Since its own acquisition by BlackRock in early 2024, GIP has strengthened its position as one of the world’s top owners

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs). In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle