OpenAI’s API lead explains how enterprises are already succeeding with its Agents SDK and Responses API

Stay Ahead, Stay ONMINE

OpenAI’s API lead explains how enterprises are already succeeding with its Agents SDK and Responses API

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more At VentureBeat’s Transform 2025 conference, Olivier Godement, Head of Product for OpenAI’s API platform, provided a behind-the-scenes look at how enterprise teams are adopting and deploying AI agents at scale. In a […]

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more

At VentureBeat’s Transform 2025 conference, Olivier Godement, Head of Product for OpenAI’s API platform, provided a behind-the-scenes look at how enterprise teams are adopting and deploying AI agents at scale.

In a 20-minute panel discussion I hosted exclusively with Godement, the former Stripe researcher and current OpenAI API boss unpacked OpenAI’s latest developer tools—the Responses API and Agents SDK—while highlighting real-world patterns, security considerations, and cost-return examples from early adopters like Stripe and Box.

For enterprise leaders unable to attend the session live, here are top 8 most important takeaways:

Agents Are Rapidly Moving From Prototype to Production

According to Godement, 2025 marks a real shift in how AI is being deployed at scale. With over a million monthly active developers now using OpenAI’s API platform globally, and token usage up 700% year over year, AI is moving beyond experimentation.

“It’s been five years since we launched essentially GPT-3… and man, the past five years has been pretty wild.”

Godement emphasized that current demand isn’t just about chatbots anymore. “AI use cases are moving from simple Q&A to actually use cases where the application, the agent, can do stuff for you.”

This shift prompted OpenAI to launch two major developer-facing tools in March: the Responses API and the Agents SDK.

When to Use Single Agents vs. Sub-Agent Architectures

A major theme was architectural choice. Godement noted that single-agent loops, which encapsulate full tool access and context in one model, are conceptually elegant but often impractical at scale.

“Building accurate and reliable single agents is hard. Like, it’s really hard.”

As complexity increases—more tools, more possible user inputs, more logic—teams often move toward modular architectures with specialized sub-agents.

“A practice which has emerged is to essentially break down the agents into multiple sub-agents… You would do separation of concerns like in software.”

These sub-agents function like roles in a small team: a triage agent classifies intent, tier-one agents handle routine issues, and others escalate or resolve edge cases.

Why the Responses API Is a Step Change

Godement positioned the Responses API as a foundational evolution in developer tooling. Previously, developers manually orchestrated sequences of model calls. Now, that orchestration is handled internally.

“The Responses API is probably the biggest new layer of abstraction we introduced since pretty much GPT-3.”

It allows developers to express intent, not just configure model flows. “You care about returning a really good response to the customer… the Response API essentially handles that loop.”

It also includes built-in capabilities for knowledge retrieval, web search, and function calling—tools that enterprises need for real-world agent workflows.

Observability and Security Are Built In

Security and compliance were top of mind. Godement cited key guardrails that make OpenAI’s stack viable for regulated sectors like finance and healthcare:

Policy-based refusals
SOC-2 logging
Data residency support

Evaluation is where Godement sees the biggest gap between demo and production.

“My hot take is that model evaluation is probably the biggest bottleneck to massive AI adoption.”

OpenAI now includes tracing and eval tools with the API stack to help teams define what success looks like and track how agents perform over time.

“Unless you invest in evaluation… it’s really hard to build that trust, that confidence that the model is being accurate, reliable.”

Early ROI Is Visible in Specific Functions

Some enterprise use cases are already delivering measurable gains. Godement shared examples from:

Stripe, which uses agents to accelerate invoice handling, reporting “35% faster invoice resolution”
Box, which launched knowledge assistants that enable “zero-touch ticket triage”

Other high-value use cases include customer support (including voice), internal governance, and knowledge assistants for navigating dense documentation.

What It Takes to Launch in Production

Godement emphasized the human factor in successful deployments.

“There is a small fraction of very high-end people who, whenever they see a problem and see a technology, they run at it.”

These internal champions don’t always come from engineering. What unites them is persistence.

“Their first reaction is, OK, how can I make it work?”

OpenAI sees many initial deployments driven by this group — people who pushed early ChatGPT use in the enterprise and are now experimenting with full agent systems.

He also pointed out a gap many overlook: domain expertise. “The knowledge in an enterprise… does not lie with engineers. It lies with the ops teams.”

Making agent-building tools accessible to non-developers is a challenge OpenAI aims to address.

What’s Next for Enterprise Agents

Godement offered a glimpse into the roadmap. OpenAI is actively working on:

Multimodal agents that can interact via text, voice, images, and structured data
Long-term memory for retaining knowledge across sessions
Cross-cloud orchestration to support complex, distributed IT environments

These aren’t radical changes, but iterative layers that expand what’s already possible. “Once we have models that can think not only for a few seconds but for minutes, for hours… that’s going to enable some pretty mind-blowing use cases.”

Final Word: Reasoning Models Are Underhyped

Godement closed the session by reaffirming his belief that reasoning-capable models—those that can reflect before responding—will be the true enablers of long-term transformation.

“I still have conviction that we are pretty much at the GPT-2 or GPT-3 level of maturity of those models….We are still scratching the surface on what reasoning models can do.”

For enterprise decision makers, the message is clear: the infrastructure for agentic automation is here. What matters now is building a focused use case, empowering cross-functional teams, and being ready to iterate. The next phase of value creation lies not in novel demos—but in durable systems, shaped by real-world needs and the operational discipline to make them reliable.

Daily insights on business use cases with VB Daily

If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

Read our Privacy Policy

Thanks for subscribing. Check out more VB newsletters here.

An error occured.

Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy, bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Oncor to propose rate increase, says 5-year capital expenditures could ‘soar’ by $12B

Dive Brief: Oncor Electric said Thursday that it will ask Texas regulators to increase its base rates by approximately $834 million to support Texas investments that include the utility’s $36 billion five-year capital spending plan — though that spending figure is likely to climb, officials said. In February, Oncor announced a

Chronosphere unveils logging package with cost control features

According to a study by Chronosphere, enterprise log data is growing at 250% year-over-year, and Chronosphere Logs helps engineers and observability teams to resolve incidents faster while controlling costs. The usage and volume analysis and proactive recommendations can help reduce data before it’s stored, the company says. “Organizations are drowning

Cisco CIO on the future of IT: AI, simplicity, and employee power

AI can democratize access to information to deliver a “white-glove experience” once reserved for senior executives, Previn said. That might include, for example, real-time information retrieval and intelligent process execution for every employee. “Usually, in a large company, you’ve got senior executives, and you’ve got early career hires, and it’s

AMI MegaRAC authentication bypass flaw is being exploitated, CISA warns

The spoofing attack works by manipulating HTTP request headers sent to the Redfish interface. Attackers can add specific values to headers like “X-Server-Addr” to make their external requests appear as if they’re coming from inside the server itself. Since the system automatically trusts internal requests as authenticated, this spoofing technique

Oil Steady as OPEC+ Weighs Output Hike

Oil held steady as traders weighed the uncertain status of nuclear talks between the US and Iran against reports that OPEC+ may extend its run of super-sized production increases. West Texas Intermediate edged up to settle above $65 a barrel after swinging between gains and losses. Bloomberg reported that several OPEC delegates, who asked not to be identified, said their countries are ready consider another 411,000 barrel-a-day increase for August when they convene on July 6, following similarly sized hikes agreed upon in each of the previous three months. While that figure is broadly in-line with expectations, “the indications are that the group may go beyond the 411,000 barrel-a-day increase,” said John Kilduff, a partner at Again Capital. “Next, we should hear about the voluntary cuts under-shooting the goal from the group laggards. I expect the ultimate decision to be bearish for prices.” Crude had earlier advanced as much as 1.3% after US Energy Secretary Chris Wright told Bloomberg that sanctions against Iran will remain in place for now, and US President Donald Trump said he dropped plans to ease Iran sanctions. The statement comes just days after the president claimed that Iran and the US would meet for nuclear talk as soon as next week, which Iran denied. Oil still ended the week down roughly 13% — snapping three weeks of gains — after a ceasefire in the Israel-Iran conflict was reached, easing concerns about supply disruptions from a region that pumps about a third of the world’s crude. The focus has largely reverted to fundamental catalysts, including OPEC moves. Russia now also appears more receptive to a fresh output boost, in a reversal of an earlier stance, raising concerns of supply overhang in the second half of the year. Investors have also turned their attention to progress on

Oil Tanker Rates Collapse as Conflict in Middle East Abates

The cost of shipping Middle East crude to customers in Asia collapsed on Thursday, the latest sign of oil markets returning to normal after conflict eased in the world’s top petroleum-exporting region. Charter rates slumped by 17% to 55.50 industry-standard Worldscale points, according to data from the Baltic Exchange in London. It works out at roughly $1.60 a barrel. “Risk premiums have naturally faded,” said Fredrik Dybwad, an analyst at Fearnley Securities AS. “There is ample vessel availability, and considering normal seasonality, rates should naturally find a lower level.” Shipping prices soared two weeks ago amid concern Iran might disrupt maritime traffic around Hormuz Strait, a vital waterway through which 20% of the world’s oil and liquefied natural gas must pass. After almost two weeks of fighting between Iran and Israel that began on June 13, there’s since been a ceasefire, hitting oil prices and lowering the risks for ships that enter the region. The Joint Maritime Information Center, a naval liaison with commercial shipping in the region, said Thursday that no hostilities had been reported in the Strait of Hormuz over the past 48 hours and that traffic had returned to normal levels. “A sustained period of inactivity and strengthening of the ceasefire agreement will stabilize maritime tension in the Arabian Gulf,” it said in a note. “Now that the market has become sanguine about Iran shutting the Strait of Hormuz, ships are running fluidly again, the premium gas been removed, and rates are correcting lower meaningfully,” said Jonathan Chappell, senior managing director at Evercore ISI. The Worldscale system is designed to let owners and charterers quickly calculate the latest earnings and per-barrel costs on thousands of trade routes. Vessels on the industry’s benchmark Saudi Arabia-to-China route are earning $35,281 a day, according to the Baltic Exchange. They were

Equinor, Shell Unveil Name of UK North Sea JV

Shell PLC and Equinor ASA have named their United Kingdom North Sea joint venture Adura, which they announced December as the biggest independent producer on the UK’s side of the sea. “Work continues towards securing regulatory approvals, with launch of the IJV [incorporated JV] expected by the end of this year”, Norway’s majority state-owned Equinor said in an online statement. Adura, which will be equally owned, combines the two companies’ offshore assets in the UK, where Shell currently produces over 100,000 barrels of oil equivalent a day (boed) and Equinor about 38,000 boed. “Adura is expected to produce over 140,000 barrels of oil equivalent per day in 2025”, Equinor said. The name Adura is “rooted in their [the companies] respective heritage and focused on shaping the future of the basin in the years ahead”, Equinor explained. “Adura has been created to bring together the A of Aberdeen and the dura of durability. It’s a company built on firm foundations, much like the strong granite synonymous with the city”. “Adura will sustain domestic oil and gas production and security of energy supply in the UK and beyond”, Equinor added. Adura will include Equinor’s 29.89 percent stake in the CNOOC Ltd.-operated Buzzard field, which started production 2007; an operating stake of 65.11percent in Mariner, online since 2019; and an 80 percent operating stake in Rosebank, expected to come onstream 2026. Shell will contribute its 27.97 percent ownership in BP PLC-operated Clair, which began production 2005; a 50 percent operating stake in Gannet, started up 1992; a 100 percent stake in Jackdaw, for which Shell plans to seek a new consent following a court nullification; a 21.23 percent operating stake in Nelson, which started production 1994; a 50 percent operating stake in Penguins, which started production 2003; a 92.52 percent operating stake in Pierce,

New York offering up to $750K for facility decarbonization projects

Dive Brief: New York state is offering up to $750,000 in state cost-sharing funding for building and campus decarbonization efforts that use ground-source heat pumps, waste heat recovery, thermal energy storage and other low-emissions technologies. Applications are due July 31. The New York State Energy Research and Development Authority’s Large-Scale Thermal program encourages property owners to pursue high-efficiency, “grid-friendly” electrification projects, NYSERDA Program Manager Sue Dougherty said in a presentation at the International District Energy Association annual conference earlier this month. The $10 million program is open to systems that provide heating, cooling and hot water to single buildings with at least 100,000 square feet of conditioned space or multibuilding campuses with at least 250,000 conditioned square feet, NYSERDA says. Dive Insight: State funding opportunities like the Large-Scale Thermal program are key to New York’s efforts to significantly reduce the environmental impact of its roughly 6 million buildings in the coming decades, Dougherty said. The state wants 85% of its buildings to use clean heating technologies like heat pumps and thermal energy networks by 2050, the same year its statutory net-zero statewide GHG emissions target kicks in. “We’re not going to do all 6 million buildings, and we really don’t have to,” Dougherty said. “But we will need to do a significant number, and our solutions will need to address existing, older buildings and newer buildings getting built [today].” The Large-Scale Thermal program is accepting applications for its third funding round through July. Successful applicants will receive state funding equal to 50% of total project design costs, with maximum funding up to $300,000 for new construction and $750,000 for existing buildings. The project economics tend to work best for existing facilities with aging heating and cooling infrastructure, new construction and larger buildings or campuses that can achieve “economies of scale,”

Electrical manufacturers publish ‘digital substation’ standards

The National Electrical Manufacturers Association on Wednesday announced the publication of three standards aimed at helping utilities and equipment manufacturers develop the “digital substations” they say will underpin a modern, reliable and self-healing grid. More standards will be published in the future, with the initial set covering fault isolation and restoration issues in distributed energy resources, and on a looped single line feeder, potentially in instances of communications loss. The standards can assist utilities that want to install grid sensors, switches and automated reclosers, NEMA officials said. “You can think of them as basically blueprints for utilities to use when they are going to implement one of these systems,” NEMA Senior Vice President of Strategy, Technical and Industry Affairs Patrick Hughes said. “We’ve seen and heard from utilities that there are interoperability questions and there’s a need for technical guidance for utilities interested in installing these … self-healing grid technologies,” Hughes said. “These give utilities a very clear blueprint for how to implement these systems.” Modernized substations and automated restoration equipment will be key to managing the increasing demand for electricity across a strained grid, experts say. NEMA anticipates electricity demand in the U.S. will increase 2% annually and 50% by 2050, as more data centers are built, buildings are electrified and transportation shifts away from fossil fuels. “When you’re dealing with modern grid technologies, and it includes communication systems and smart technologies, you have a number of different vendors that are offering these technologies and the service,” Hughes said. “And it can be a little overwhelming with different manufacturers and different communication protocols and all the different specifications that go into implementing one of these systems.” Add to that, utilities must convince regulators who are concerned about how the investments are made, and whether the systems will have long-term support

Can oil and gas solve the AI power dilemma?

Joe Brettell is a partner at Prosody Group. The promise, peril and possibilities of artificial intelligence continue to capture the cultural and business zeitgeist worldwide. Hardly a conference or long-form interview can be held these days without a panelist or pundit commenting on the technology’s implications for their profession. Yet despite being the hottest topic in every circle, AI’s ultimate challenge isn’t technological but physical. After years of breathless speculation and prediction, the issue remains the same: AI needs more energy. Amidst this backdrop, the oil and gas industry faces a similarly fundamental challenge: a shifting production frontier and evolving path to continued growth. After a decade of efficiency-driven growth, the era of easy barrels is waning. Diamondback Energy CEO Travis Stice captured the new reality in a recent letter, warning of the increasingly dim prospects for expanding production amid geological constraints and rising costs. Other energy majors have issued similar cautions, a sharp departure from the boom years of the shale revolution when abundant, low-cost reserves, followed by shareholder-focused production, made the industry a market favorite. Now, with resource intensity rising, global volatility accelerating and economic conditions tightening, the industry is under pressure to find its next value horizon. That horizon may be converging with AI. The pairing makes increasing sense. While initially circling one another warily, major players in energy and technology have become increasingly intertwined. At major gatherings like CERAWeek, energy executives and tech leaders now share the same stage — and increasingly, the same strategic questions. How do we scale the infrastructure to match exponential AI growth? Who will supply the energy to power it? And how do we do so fast enough while dealing with rising environmental, social and regulatory concerns? These challenges come amid a stark reality: AI’s computational appetite isn’t just increasing —

Data center costs surge up to 18% as enterprises face two-year capacity drought

“AI workloads, especially training and archival, can absorb 10-20ms latency variance if offset by 30-40% cost savings and assured uptime,” said Gogia. “Des Moines and Richmond offer better interconnection diversity today than some saturated Tier-1 hubs.” Contract flexibility is also crucial. Rather than traditional long-term leases, enterprises are negotiating shorter agreements with renewal options and exploring revenue-sharing arrangements tied to business performance. Maximizing what you have With expansion becoming more costly, enterprises are getting serious about efficiency through aggressive server consolidation, sophisticated virtualization and AI-driven optimization tools that squeeze more performance from existing space. The companies performing best in this constrained market are focusing on optimization rather than expansion. Some embrace hybrid strategies blending existing on-premises infrastructure with strategic cloud partnerships, reducing dependence on traditional colocation while maintaining control over critical workloads. The long wait When might relief arrive? CBRE’s analysis shows primary markets had a record 6,350 MW under construction at year-end 2024, more than double 2023 levels. However, power capacity constraints are forcing aggressive pre-leasing and extending construction timelines to 2027 and beyond. The implications for enterprises are stark: with construction timelines extending years due to power constraints, companies are essentially locked into current infrastructure for at least the next few years. Those adapting their strategies now will be better positioned when capacity eventually returns.

Cisco backs quantum networking startup Qunnect

In partnership with Deutsche Telekom’s T-Labs, Qunnect has set up quantum networking testbeds in New York City and Berlin. “Qunnect understands that quantum networking has to work in the real world, not just in pristine lab conditions,” Vijoy Pandey, general manager and senior vice president of Outshift by Cisco, stated in a blog about the investment. “Their room-temperature approach aligns with our quantum data center vision.” Cisco recently announced it is developing a quantum entanglement chip that could ultimately become part of the gear that will populate future quantum data centers. The chip operates at room temperature, uses minimal power, and functions using existing telecom frequencies, according to Pandey.

HPE announces GreenLake Intelligence, goes all-in with agentic AI

Like a teammate who never sleeps Agentic AI is coming to Aruba Central as well, with an autonomous supervisory module talking to multiple specialized models to, for example, determine the root cause of an issue and provide recommendations. David Hughes, SVP and chief product officer, HPE Aruba Networking, said, “It’s like having a teammate who can work while you’re asleep, work on problems, and when you arrive in the morning, have those proposed answers there, complete with chain of thought logic explaining how they got to their conclusions.” Several new services for FinOps and sustainability in GreenLake Cloud are also being integrated into GreenLake Intelligence, including a new workload and capacity optimizer, extended consumption analytics to help organizations control costs, and predictive sustainability forecasting and a managed service mode in the HPE Sustainability Insight Center. In addition, updates to the OpsRamp operations copilot, launched in 2024, will enable agentic automation including conversational product help, an agentic command center that enables AI/ML-based alerts, incident management, and root cause analysis across the infrastructure when it is released in the fourth quarter of 2025. It is now a validated observability solution for the Nvidia Enterprise AI Factory. OpsRamp will also be part of the new HPE CloudOps software suite, available in the fourth quarter, which will include HPE Morpheus Enterprise and HPE Zerto. HPE said the new suite will provide automation, orchestration, governance, data mobility, data protection, and cyber resilience for multivendor, multi cloud, multi-workload infrastructures. Matt Kimball, principal analyst for datacenter, compute, and storage at Moor Insights & strategy, sees HPE’s latest announcements aligning nicely with enterprise IT modernization efforts, using AI to optimize performance. “GreenLake Intelligence is really where all of this comes together. I am a huge fan of Morpheus in delivering an agnostic orchestration plane, regardless of operating stack

MEF goes beyond metro Ethernet, rebrands as Mplify with expanded scope on NaaS and AI

While MEF is only now rebranding, Vachon said that the scope of the organization had already changed by 2005. Instead of just looking at metro Ethernet, the organization at the time had expanded into carrier Ethernet requirements. The organization has also had a growing focus on solving the challenge of cross-provider automation, which is where the LSO framework fits in. LSO provides the foundation for an automation framework that allows providers to more efficiently deliver complex services across partner networks, essentially creating a standardized language for service integration. NaaS leadership and industry blueprint Building on the LSO automation framework, the organization has been working on efforts to help providers with network-as-a-service (NaaS) related guidance and specifications. The organization’s evolution toward NaaS reflects member-driven demands for modern service delivery models. Vachon noted that MEF member organizations were asking for help with NaaS, looking for direction on establishing common definitions and some standard work. The organization responded by developing comprehensive industry guidance. “In 2023 we launched the first blueprint, which is like an industry North Star document. It includes what we think about NaaS and the work we’re doing around it,” Vachon said. The NaaS blueprint encompasses the complete service delivery ecosystem, with APIs including last mile, cloud, data center and security services. (Read more about its vision for NaaS, including easy provisioning and integrated security across a federated network of providers)

AMD rolls out first Ultra Ethernet-compliant NIC

The UEC was launched in 2023 under the Linux Foundation. Members include major tech-industry players such as AMD, Intel, Broadcom, Arista, Cisco, Google, Microsoft, Meta, Nvidia, and HPE. The specification includes GPU and accelerator interconnects as well as support for data center fabrics and scalable AI clusters. AMD’s Pensando Pollara 400GbE NICs are designed for massive scale-out environments containing thousands of AI processors. Pollara is based on customizable hardware that supports using a fully programmable Remote Direct Memory Access (RDMA) transport and hardware-based congestion control. Pollara supports GPU-to-GPU communication with intelligent routing technologies to reduce latency, making it very similar to Nvidia’s NVLink c2c. In addition to being UEC-ready, Pollara 400 offers RoCEv2 compatibility and interoperability with other NICs.

Can Intel cut its way to profit with factory layoffs?

Matt Kimball, principal analyst at Moor Insights & Strategy, said, “While I’m sure tariffs have some impact on Intel’s layoffs, this is actually pretty simple — these layoffs are largely due to the financial challenges Intel is facing in terms of declining revenues.” The move, he said, “aligns with what the company had announced some time back, to bring expenses in line with revenues. While it is painful, I am confident that Intel will be able to meet these demands, as being able to produce quality chips in a timely fashion is critical to their comeback in the market.” Intel, said Kimball, “started its turnaround a few years back when ex-CEO Pat Gelsinger announced its five nodes in four years plan. While this was an impressive vision to articulate, its purpose was to rebuild trust with customers, and to rebuild an execution discipline. I think the company has largely succeeded, but of course the results trail a bit.” Asked if a combination of layoffs and the moving around of jobs will affect the cost of importing chips, Kimball predicted it will likely not have an impact: “Intel (like any responsible company) is extremely focused on cost and supply chain management. They have this down to a science and it is so critical to margins. Also, while I don’t have insights, I would expect Intel is employing AI and/or analytics to help drive supply chain and manufacturing optimization.” The company’s number one job, he said, “is to deliver the highest quality chips to its customers — from the client to the data center. I have every confidence it will not put this mandate at risk as it considers where/how to make the appropriate resourcing decisions. I think everybody who has been through corporate restructuring (I’ve been through too many to count)

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs). In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Stay Ahead, Stay ONMINE