OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Stay Ahead, Stay ONMINE

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More

OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more.

The first paper, “OpenAI’s Approach to External Re d Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them.

In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks.

Going all-in on red teaming pays practical, competitive dividends

It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks.

Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find.

What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle design to combine human expertise and contextual intelligence on one side with AI-based techniques on the other.

“When automated red teaming is complemented by targeted human insight, the resulting defense strategy becomes significantly more resilient,” writes OpenAI in the first paper (Ahmad et al., 2024).

The company’s premise is that using external testers to identify the most high-impact real-world scenarios, while also evaluating AI outputs, leads to continuous model improvements. OpenAI contends that combining these methods delivers a multi-layered defense for their models that identify potential vulnerabilities quickly. Capturing and improving models with the human contextual intelligence made possible by a human-in-the-middle design is proving essential for red-teaming AI models.

Why red teaming is the strategic backbone of AI security

Red teaming has emerged as the preferred method for iteratively testing AI models. This kind of testing simulates a variety of lethal and unpredictable attacks and aims to identify their most potent and weakest points. Generative AI (gen AI) models are difficult to test through automated means alone, as they mimic human-generated content at scale. The practices described in OpenAI’s two papers seek to close the gaps automated testing alone leaves, by measuring and verifying a model’s claims of safety and security.

In the first paper (“OpenAI’s Approach to External Red Teaming”) OpenAI explains that red teaming is “a structured testing effort to find flaws and vulnerabilities in an AI system, often in a controlled environment and collaboration with developers” (Ahmad et al., 2024). Committed to leading the industry in red teaming, the company had over 100 external red teamers assigned to work across a broad base of adversarial scenarios during the pre-launch vetting of GPT-4 prior to launch.

Research firm Gartner reinforces the value of red teaming in its forecast, predicting that IT spending on gen AI will soar from $5 billion in 2024 to $39 billion by 2028. Gartner notes that the rapid adoption of gen AI and the proliferation of LLMs is significantly expanding these models’ attack surfaces, making red teaming essential in any release cycle.

Practical insights for security leaders

Even though security leaders have been quick to see the value of red teaming, few are following through by making a commitment to get it done. A recent Gartner survey finds that while 73% of organizations recognize the importance of dedicated red teams, only 28% actually maintain them. To close this gap, a simplified framework is needed that can be applied at scale to any new model, app, or platform’s red teaming needs.

In its paper on external red teaming OpenAI defines four key steps for using a human-in-the-middle design to make the most of human insights:

Defining testing scope and teams: Drawing on subject matter experts and specialists across key areas of cybersecurity, regional politics, and natural sciences, OpenAI targets risks that include voice mimicry and bias. The ability to recruit cross-functional experts is, therefore, crucial. (To gain an appreciation for how committed OpenAI is to this methodology and its implications for stopping deepfakes, please see our article “GPT-4: OpenAI’s shield against $40B deepfake threat to enterprises.”)

Selecting model versions for testing, then iterating them across diverse teams: Both of OpenAI’s papers emphasize that cycling red teams and models using an iterative approach delivers the most insightful results. Allowing each red team to cycle through all models is conducive to greater team learning of what is and isn’t working.

Clear documentation and guidance: Consistency in testing requires well-documented APIs, standardized report formats, and explicit feedback loops. These are essential elements for successful red teaming.

Making sure insights translate into practical and long-lasting mitigations: Once red teams log vulnerabilities, they drive targeted updates to models, policies and operational plans — ensuring security strategies evolve in lockstep with emerging threats.

Scaling adversarial testing with GPT-4T: The next frontier in red teaming

AI companies’ red teaming methodologies are demonstrating that while human expertise is resource-intensive, it remains crucial for in-depth testing of AI models.

In OpenAI’s second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning” (Beutel et al., 2024), OpenAI addresses the challenge of scaling adversarial testing using an automated, multi-pronged approach that combines human insights with AI-generated attack strategies.

The core of this methodology is GPT-4T, a specialized variant of the GPT-4 model engineered to produce a wide range of adversarial scenarios.

Here’s how each component of the methodology contributes to a stronger adversarial testing framework:

Goal diversification. OpenAI describes how it is using GPT-4T to create a broad spectrum of scenarios, starting with initially benign-seeming prompts and progressing to more sophisticated phishing campaigns. Goal diversification focuses on anticipating and exploring the widest possible range of potential exploits. By using GPT-4T’s capacity for diverse language generation, OpenAI contends that red teams avoid tunnel vision and stay focused on probing for vulnerabilities that manual-only methods miss.

Reinforcement learning (RL). A multi-step RL framework rewards the discovery of new and previously unseen vulnerabilities. The purpose is to train the automated red team by improving each iteration. This enables security leaders to refocus on genuine risks rather than sifting through volumes of low-impact alerts. It aligns with Gartner’s projection of a 30% drop in false pos i tives attributable to gen AI in application security testing by 2027. OpenAI writes, “Our multi-step RL approach systematically rewards the discovery of newly identified vulnerabilities, driving continuous improvement in adversarial testing.”

Auto-generated rewards: OpenAI defines this as a system that tracks and updates scores for partial successes by red teams, assigning incremental rewards for identifying each unprotected weak area of a model.

Securing the future of AI: Key takeaways for security leaders

OpenAI’s recent papers show why a structured, iterative process that combines internal and external testing delivers the insights needed to keep improving models’ accuracy, safety, security and quality.

Security leaders’ key takeaways from these papers should include:

Go all-in and adopt a multi-pronged approach to red teaming. The papers emphasize the value of combining external, human-led teams with real-time simulations of AI attacks generated randomly, as they reflect how chaotic intrusion attempts can be. OpenAI contends that while humans excel at spotting context-specific gaps, including biases, automated systems identify weaknesses that emerge only under stress testing and repeated sophisticated attacks.

Test early and continuously throughout model dev cycles. The white papers make a compelling argument against waiting for production-ready models and instead beginning testing with early-stage versions. The goal is to find emerging risks and retest later to make sure the gaps in models were closed before launch.

Whenever possible, streamline documentation and feedback with real-time feedback loops. Standardized reporting and well-documented APIs, along with explicit feedback loops, help convert red team findings into actionable, trackable mitigations. OpenAI emphasizes the need to get this process in place before beginning red teaming, to accelerate fixes and remediation of problem areas.

Using real-time reinforcement learning is critically important, as is the future of AI red teaming. OpenAI makes the case for automating frameworks that reward discoveries of new attack vectors as a core part of the real-time feedback loops. The goal of RL is to create a continuous loop of improvement.

Don’t settle for anything less than actionable insights from the red team process. It’s essential to treat every red team discovery or finding as a catalyst for updating security strategies, improving incident response plans, and revamping guidelines as required.

Budget for the added expense of enlisting external expertise for red teams. A central premise of OpenAI’s approach to red teaming is to actively recruit outside specialists who have informed perspectives and knowledge of advanced threats. Areas of expertise valuable to AI-model red teams include deepfake technology, social engineering, identity theft, synthetic identity creation, and voice-based fraud. “Involving external specialists often surfaces hidden attack paths, including sophisticated social engineering and deepfake threats.” (Ahmad et al., 2024)

Papers:

Beutel, A., Xiao, K., Heidecke, J., & Weng, L. (2024). “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning.” OpenAI.

Ahmad, L., Agarwal, S., Lampe, M., & Mishkin, P. (2024). “OpenAI’s Approach to External Red Teaming for AI Models and Systems.” OpenAI.

Daily insights on business use cases with VB Daily

If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

Read our Privacy Policy

Thanks for subscribing. Check out more VB newsletters here.

An error occured.

Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy, bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Trump meets with Intel CEO after calling for his resignation

The call for Tan’s resignation coincided with an Aug. 6 letter Sen. Tom Cotton (R-AK) sent to Intel Chairman Frank Yeary, in which he expressed concerns about “Intel’s operations and its potential impact on U.S. national security,” citing a report alleging Tan’s links to Chinese firms and the fact Cadence

US to maintain lower tariff rates on China imports for 90 more days

The U.S. is extending its pause on additional retaliatory tariffs for imports from China until Nov. 10, according to an executive order signed by President Donald Trump on Monday. The order said the extension is appropriate following “significant steps” from China on addressing U.S. trade concerns in ongoing discussions between the

Critical SSH vulnerabilities expose enterprise network infrastructure as patching lags

RegreSSHion (CVE-2024-6387) proved particularly dangerous, enabling unauthenticated remote code execution through a signal reentrance vulnerability in OpenSSH. The vulnerability affected countless Linux systems and network appliances running vulnerable OpenSSH versions, though exploitation proved challenging due to modern memory protections. The MOVEit vulnerability (CVE-2024-5806) demonstrated how third-party SSH libraries could introduce

Nvidia launches Blackwell-powered RTX Pro GPUs for compact AI workstations

“While AMD’s Radeon Pro series remains a primary competitor, offering strong performance for similar professional workloads, Nvidia maintains its market lead through its mature and widely-adopted software ecosystem,” said Neil Shah, vice president at Counterpoint Research. He added that the CUDA platform continues to be the industry standard for AI

Union Says 200+ Repsol Workers Back Enhanced Pay Offer

London, UK, headquartered union Unite announced, in a statement sent to Rigzone on Tuesday, that “over 200 Repsol Resources workers have backed an enhanced pay and conditions offer” and brought their offshore dispute with the company to an end. “The pay deal successfully negotiated by Unite is worth 8.5 percent over two years,” Unite said in the statement. “In 2025/26, the pay increase amounts to 4.5 percent and in the following year due to changes in shift rotation allowances, workers will receive a further four percent,” it added. Unite noted in the statement that the Repsol workers “had previously rejected several unacceptable pay offers”. It revealed that planned industrial action on August 6 was suspended to allow members to vote on the improved offer. The scheduled strikes on August 13 and 28, and September 4, are now cancelled following the successful resolution of the dispute, Unite confirmed in the statement. The pay agreement covers workers such as control room operators, supervisors, electricians, technicians, mechanics and HSE advisors on Repsol’s Arbroath, AUK, Bleoholm, Claymore, Clyde, Fulmer, Montrose, and Piper Bravo platforms, Unite highlighted in the statement. “Unite has successfully negotiated a significant pay deal for our Repsol members,” Unite General Secretary Sharon Graham said in the statement. “Let’s be clear that this deal only came about due to our members standing firm and being prepared to take strike action to get a better deal,” Graham added. Unite Industrial Officer John Boland said in the statement, “we are pleased that industrial action has been averted at Repsol after the company improved its pay offer after our members emphatically backed strike action”. Rigzone has contacted Repsol and Neo Next Energy Limited for comment on Unite’s statement. At the time of writing, Repsol and Neo Next have not responded to Rigzone. In a

XRG Extends Diligence Period for Planned Takeover of Santos

Santos Ltd. has granted a consortium led by Abu Dhabi National Oil Co. PJSC (ADNOC) more time to conduct due diligence for a potential acquisition of the Australian oil and gas company. The “process and exclusivity deed” signed June 27 between Santos and the consortium of sovereign investor Abu Dhabi Development Holding Co., Carlyle Group and ADNOC’s global investment arm XRG PJSC has now been extended to August 22, Santos said in a statement on its website. The parties announced a non-binding indicative proposal June 16, with Santos saying it intended to endorse to its shareholders the cash purchase offer of $5.76 per share. Announcing the extension of the due diligence review, Santos said, “The XRG consortium has now substantially completed due diligence in relation to the potential transaction under the process and exclusivity deed dated 27 June 2025. The XRG consortium has confirmed it has not discovered anything to date that would cause the XRG consortium to withdraw its indicative proposal and has confirmed its commitment to working constructively with Santos to complete the due diligence promptly and agree on a binding transaction”. “To complete due diligence and progress a binding transaction, the XRG consortium has requested a two-week extension to the Due Diligence and Exclusivity Period under the Process Deed”. The initial exclusivity period was to last six weeks from June 27. “The exclusivity provisions include customary ‘no shop’, ‘no talk’, ‘no due diligence’ and ‘notification’ obligations that apply during the exclusivity period”, Santos said June 27. “A fiduciary exception applies enabling the Santos board to deal with potentially superior proposals from competing acquirers from the date that is four weeks from today”. Confirming the extension, XRG said separately, “There remains strong alignment between both parties on the strategic rationale for the potential transaction, and the process to

Fluor Plans to Appeal Ruling in Santos Row over Gladstone LNG Costs

Fluor Corp. plans to appeal against the Queensland Supreme Court’s decision favoring Santos Ltd. in a dispute on costs over the Gladstone LNG project, majority-owned by Santos. “The court affirmed that Fluor must pay approximately AUD 692 million to Santos and its co-venturers, with further sums yet to be determined”, oil and gas explorer and developer Santos said in a statement on its website. Adelaide-based Santos, which initiated the case in December 2016, and Irving, Texas-based Fluor had signed a contract for the construction of the coal bed methane-to-liquefied natural gas (LNG) project. Gladstone started producing LNG 2015 after going over time and over budget. Santos’ case alleges overpayments totaling more than AUD 1.4 billion, about AUD 140 million for a purported breach of the Australian Consumer Law and liquidated damages of AUD 15 million for an alleged failure of Fluor to reach mechanical completion by the contractual dates, according to the court judgment. “[T]he court will hear the parties on the appropriate orders and directions and on the calculation of interest, and on costs”, read the ruling, published on the court’s online library. The case is Santos v Fluor [2025] QSC 184. Fluor the parent company is second defendant while Fluor Australia Pty. Ltd. is first defendant. Fluor said in a statement on its website, “Further arguments and input from both parties will be heard by the court before a final judgment is delivered sometime later this year”. “Fluor maintains the contracting principles addressed by the court have wide-sweeping consequences in the engineering and construction industry”, Fluor added. “The company is reviewing the court decision and exploring its response including the timing of its appeal. “We are also working with our insurance carriers to address the obligations arising from the final judgment”. Fluor said, “The court generally accepted the recommendations

Macquarie Strategists Forecast USA Crude Inventory Rise

In an oil and gas report sent to Rigzone by the Macquarie team late Monday, Macquarie strategists, including Walt Chancellor, revealed that they are forecasting that U.S. crude inventories will be up by 2.0 million barrels for the week ending August 8. “This follows a 3.0 million barrel draw in the prior week, with the crude balance realizing tighter than our expectations,” the strategists said in the report. “For this week’s crude balance, from refineries, we model a minimal reduction in crude runs. Among net imports, we model a small increase, with exports (+0.3 million barrels per day) and imports (+0.6 million barrels per day) up on a nominal basis,” they added. Timing of cargoes remains a source of potential volatility in this week’s crude balance, the Macquarie strategists warned in the report. They went on to state that, “from implied domestic supply (prod.+adj.+transfers)”, they “look for an increase (+0.3 million barrels per day) on a nominal basis this week”. “Rounding out the picture, we anticipate no change in SPR [Strategic Petroleum Reserve] stocks this week,” the strategists said. The strategists also noted in the report that, “among products”, they “look for builds in distillate (+3.8 million barrels) and jet (+0.5 million barrels), with a draw in gasoline (-0.9 million barrels)”. “We model implied demand for these three products at ~14.3 million barrels per day for the week ending August 8,” the strategists continued. In its latest weekly petroleum status report at the time of writing, which was released on August 6 and included data for the week ending August 1, the U.S. Energy Information Administration (EIA) highlighted that U.S. commercial crude oil inventories, excluding those in the SPR, decreased by three million barrels from the week ending July 25 to the week ending August 1. That EIA report showed

ADNOC Gas Achieves Record Profit

ADNOC Gas PLC has reported $1.39 billion in net income for the second quarter, rising 16 percent compared to the same three-month period last year and setting a quarterly record for the company. Last year, the gas processing and sales arm of Abu Dhabi National Oil Co. logged its highest annual net earnings – $5 billion – thanks to natural gas demand in the United Arab Emirates. For the April-June 2025 quarter, revenue dipped to $5.96 billion from $6.08 billion for Q2 2024 as a weakening of commodity prices offset an overall increase in sales volumes, according to figures reported to the local stock exchange. Domestic gas sales rose to 611 trillion British thermal units (TBtu) in Q2 2025 from 580 TBtu in Q2 2024. Export and traded liquids slid to 252 TBtu from 266 TBtu. Sales from the ALNG JV, in which ADNOC Gas owns a 70 percent stake, increased to 65 TBtu from 56 TBtu. ADNOC Gas expects sales volumes excluding sulfur to land between 3,630 TBtu and 3,700 TBtu this year. “As with prior years, sales volumes should follow a seasonal pattern with an uptick over the summer period”, it said. “Furthermore, it is also important to note that in 2025 our shutdown activity will be higher than normal especially in the Q4 2025 period”. Meanwhile the quarterly average Brent crude price fell 20 percent year-on-year to $68 a barrel from $85 per barrel. “Conversely, JKM prices saw a significant increase of 31 percent, rising from $9.6/mmbtu to $12.5/mmbtu”, ADNOC Gas told the Abu Dhabi Securities Exchange. “LPG prices were slightly up on average despite the drop in crude oil price, with propane increasing from $592/tonne to $608/tonne and butane marginally down from $590/tonne to $588/tonne. Naphtha prices averaged at $533/tonne in the period representing a 14 percent

Weaker Chinese Demand for Saudi Oil Signals Shift to Urals, EA Says

Chinese refiners are asking for less oil from Saudi Arabia, with the drop possibly pointing to a reshuffle of global flows as more Russian crude becomes available, according to Energy Aspects Ltd. A decline in so-called nominations for term cargoes from Saudi Aramco for September loading, led by trading-giant Unipec, indicated some Chinese refineries were holding back from purchases given the greater availability of Russia’s Urals, as well as comfortable stockpiles, the London-based consultant said in an Aug. 11 note, without saying how it got the information. Indian nominations for September, meanwhile, increased from a month earlier as the country seeks alternatives to Russian crude following Western pushback. The global oil market has zeroed in on a possible reordering of some crude flows after the US and European Union ramped up pressure against India over its imports of Russian energy. Given there’s been no comparable move against China, that’s raised the possibility that more of Moscow’s oil will be taken by mainland refiners, including Urals, which ships from Russia’s west. Saudi Aramco is set to sell 43 million barrels of contractual supplies of September-loading crude to China, traders informed by the producer told Bloomberg. That compares with 51 million barrels a month ago, and a monthly average of about 45 million so far this year. Chinese interest in Urals is picking up given it remains the “most competitive” compared with similar Middle Eastern crudes, Energy Aspects said. Still, there’s a limit to China’s appetite given Russian imports account for 17 percent of overseas supplies, with 20 percent seen as a cap for a single country, it added. Sinopec, the Beijing-based parent company of Unipec, didn’t reply to an email seeking comment outside working hours. What do you think? We’d love to hear from you, join the conversation on the Rigzone Energy Network.

New Compute Exchange service answers GPU pricing queries

Compute Exchange and Silicon Data, Bochev added “are also working on developing clearer benchmarks for the compute market, and will have more details to share on that in the coming weeks.” PIC ‘should serve to keep suppliers honest ..’ Scott Bickley, an advisory fellow at Info-Tech Research Group, said he views the offering “as a way for enterprises to source short-term GPU capacity and possibly get a deal, especially if it is stranded capacity from the neocloud providers.” This, he said, “would also help to benchmark costs when purchasing this capacity in general, so it’s good, but it is also straightforward in terms of the value proposition.” He also noted that most companies are not buying GPU capacity directly; “This is for those that are building their own models or deploying their own AI applications atop existing models.” Bickley added, “it should serve to keep suppliers honest to some degree in terms of the floors and ceilings of the price to access GPU capacity.” Soon after Compute Exchange first launched in February, Matt Kimball, VP and principal analyst for data center compute and storage at Moor Insights & Strategy, described the GPU compute situation as “pretty dire. This is driven by what most view as a single supplier (Nvidia) selling GPUs before they can even be made to a market that has an insatiable thirst.” On Tuesday, following the announcement, he said that the concept of PIC is appealing: “I really like the idea of PIC as a tool for customers and seeing the compute exchange become an arbitrageur of sorts. This delivers a real value to [anyone] who is looking to utilize AI infrastructure,” he said.

Data center sustainability efforts stall slightly in 2025

Data center operators reported limited advances—and even some declines—in energy efficiency, carbon tracking, and water usage due in part to rising power demand and easing regulatory pressure in some regions, according to the recently released results of the Uptime Institute’s 15th Annual Global Data Center Survey 2025. As artificial intelligence workloads continue to grow and legacy data centers remain operational, sustainability initiatives have stalled, according to the Uptime Institute, which attributes this in part to reporting challenges. Uptime Institute’s 2025 data center survey was conducted online from April 2025 to May 2025 and collected responses from more than 800 data center owners and operators and more than 1,000 vendors and consultants. “What’s interesting this year is that we have seen a far from startling increase over the last few years of the data being collected, but this year it actually fell. And this obviously led to some speculation that there is a backing off of sustainability, and that it is no longer a high priority,” said Andy Lawrence, executive director of research at Uptime Institute, during a webinar sharing the survey results. “I think that the data center industry has not yet adapted to being very good at sustainability reporting.”

Arista’s latest networking results: 4 critical takeaways

“We also think UALink is another spec that’s coming out, and that may run as an overlay on top of an Ethernet underlay. There needs to be some firm standards there because today, scale-up is frankly all proprietary NV Link. And we’re encouraged by—just like we worked hard to found the Ultra Ethernet Consortium as a member for some of the back-end Ethernet, and the migration from InfiniBand to Ethernet is literally happening in 3 to 5 years. We expect the same phenomenon on scale-up,” Ullal said. “The rise in Agentic AI ensures any-to-any conversations with bidirectional bandwidth utilization. Such AI agents are pushing the envelope of LAN and WAN traffic patterns in the enterprise,” Ullal said. Work to do on VeloCloud integration The recent acquisition of VeloCloud was also a hot topic of the second quarter results that included the introduction of former Cisco exec and industry veteran Todd Nightingale, as its newly appointed President & COO. “It’s only been a month, but I can’t tell you how impressed I am with the passion and focus of the team, the trust that Arista customers have in the technology and the enormous opportunity we have ahead of us in data center, AI, and in the campus,” Nightingale said. “VeloCloud’s secure AI optimized WAN portfolio offers seamless application-aware solutions to connect customer branch sites, complementing Arista’s leading spines in the data center and campus,” Ullal said. “In a classic leaf-spine atomic identifier, we are enabling multipathing, encryption, in-band network telemetry, segmentation, application identification, and traffic engineering across distributed enterprise sites. We are so excited to fill this missing void in our distributed enterprise puzzle to bring that holistic branch solution.” “We also intend to work closely with best-of-breed security partners to enable SASE overlays. Please do note that VeloCloud is not

Enterprise tips for cloud success

The remaining tips were cited by roughly two-thirds of the enterprises. Tip number three is to look especially at applications whose users are widely dispersed. And by “widely” here, they mean on different continents, not just different neighborhoods. The reason is that quality of experience and even availability can be compromised when work has to transit a lot of networks just to get to where it’s processed. This can lead to user dissatisfaction, and dispersing resources closer to the users may be the only solution. If an enterprise doesn’t already have their own data center located close to each user concentration, chances are that putting a new hosting point in themselves couldn’t achieve reasonable economy of scale in capex, power and cooling, and operations costs. The cloud would be cheaper. A qualifying comment here is to take great care in evaluating the real impact of dispersion of application users. In some cases, there may not be enough of a difference in QoE or availability to require dispersing hosting points, and in fact it may be that where the application is hosted isn’t even the problem. “The cloud may look like the easy way out,” one enterprise said, “but it may not be the economical way.” See where your QoE issues really lie before you go to the cloud’s distributed hosting to fix them. Tip four is to examine the user-to-application interaction model carefully, to see if there’s a large non-transactional component. Mission-critical business systems, and business core databases, are almost always in the data center. The stuff that changes them are the transactions that add, update, and delete records. If an application’s user interaction is tightly coupled to the creation of transactions, then its processing is tied to those data center resources. That makes it harder to move the user-interface

Stargate’s slow start reveals the real bottlenecks in scaling AI infrastructure

The CFO emphasized that SoftBank remains committed to its original target of $346 billion (JPY 500 billion) over 4 years for the Stargate project, noting that major sites have been selected in the US and preparations are taking place simultaneously across multiple fronts. Requests for comment to Stargate partners Nvidia, OpenAI, and Oracle remain unanswered. Infrastructure reality check for CIOs These challenges offer important lessons for enterprise IT leaders facing similar AI infrastructure decisions. Sanchit Vir Gogia, chief analyst and CEO at Greyhound Research, said that Goto’s confirmation of delays “reflects a challenge CIOs see repeatedly” in partner onboarding delays, service activation slips, and revised delivery commitments from cloud and datacenter providers. Oishi Mazumder, senior analyst at Everest Group, noted that “SoftBank’s Stargate delays show that AI infrastructure is not constrained by compute or capital, but by land, energy, and stakeholder alignment.” The analyst emphasized that CIOs must treat AI infrastructure “as a cross-functional transformation, not an IT upgrade, demanding long-term, ecosystem-wide planning.” “Scaling AI infrastructure depends less on the technical readiness of servers or GPUs and more on the orchestration of distributed stakeholders — utilities, regulators, construction partners, hardware suppliers, and service providers — each with their own cadence and constraints,” Gogia said.

Incentivizing the Digital Future: Inside America’s Race to Attract Data Centers

Across the United States, states are rolling out a wave of new tax incentives aimed squarely at attracting data centers, one of the country’s fastest-growing industries. Once clustered in only a handful of industry-friendly regions, today’s data-center boom is rapidly spreading, pushed along by profound shifts in federal policy, surging demand for artificial intelligence, and the drive toward digital transformation across every sector of the economy. Nowhere is this transformation more visible than in the intensifying state-by-state competition to land massive infrastructure investments, advanced technology jobs, and the alluring prospect of long-term economic growth. The past year alone has seen a record number of states introducing or expanding incentives for data centers, from tax credits to expedited permitting, reflecting a new era of proactive, tech-focused economic development policy. Behind these moves, federal initiatives and funding packages underscore the essential role of digital infrastructure as a national priority, encouraging states to lower barriers for data center construction and operation. As states watch their neighbors reap direct investment and job creation benefits, a real “domino effect” emerges: one state’s success becomes another’s blueprint, heightening the pressure and urgency to compete. Yet, this wave of incentives also exposes deeper questions about the local impact, community costs, and the evolving relationship between public policy and the tech industry. From federal levels to town halls, there are notable shifts in both opportunities and challenges shaping the landscape of digital infrastructure advancement. Industry Drivers: the Federal Push and Growth of AI The past year has witnessed a profound federal policy shift aimed squarely at accelerating U.S. digital infrastructure, especially for data centers in direct response both to the explosive growth of artificial intelligence and to intensifying international competition. In July 2025, the administration unveiled “America’s AI Action Plan,” accompanied by multiple executive orders that collectively redefined

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs). In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

The Download: Trump’s golden dome, and fueling AI with nuclear power

This is today’s edition of The Download, our weekday newsletter that provides a daily dose of what’s going on in the world of technology. Why

Why Trump’s “golden dome” missile defense idea is another ripped straight from the movies

In 1940, a fresh-faced Ronald Reagan starred as US Secret Service agent Brass Bancroft in Murder in the Air, an action film centered on a

OpenAI brings GPT-4o back as a default for all paying ChatGPT users, Altman promises ‘plenty of notice’ if it leaves again

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe

The end of perimeter defense: When your own AI tools become the threat actor

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe

Stay Ahead, Stay ONMINE

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Going all-in on red teaming pays practical, competitive dividends

Why red teaming is the strategic backbone of AI security

Practical insights for security leaders

Scaling adversarial testing with GPT-4T: The next frontier in red teaming

Securing the future of AI: Key takeaways for security leaders

Stay Ahead

Explore More Insights

Trump meets with Intel CEO after calling for his resignation

US to maintain lower tariff rates on China imports for 90 more days

Critical SSH vulnerabilities expose enterprise network infrastructure as patching lags

Nvidia launches Blackwell-powered RTX Pro GPUs for compact AI workstations

Union Says 200+ Repsol Workers Back Enhanced Pay Offer

XRG Extends Diligence Period for Planned Takeover of Santos

Fluor Plans to Appeal Ruling in Santos Row over Gladstone LNG Costs

Macquarie Strategists Forecast USA Crude Inventory Rise

ADNOC Gas Achieves Record Profit

Weaker Chinese Demand for Saudi Oil Signals Shift to Urals, EA Says

New Compute Exchange service answers GPU pricing queries

Data center sustainability efforts stall slightly in 2025

Arista’s latest networking results: 4 critical takeaways

Enterprise tips for cloud success

Stargate’s slow start reveals the real bottlenecks in scaling AI infrastructure

Incentivizing the Digital Future: Inside America’s Race to Attract Data Centers

Microsoft will invest $80B in AI data centers in fiscal 2025

John Deere unveils more autonomous farm machines to address skill labor shortage

2025 playbook for enterprise AI success, from agents to evals

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

The Download: Trump’s golden dome, and fueling AI with nuclear power

Why Trump’s “golden dome” missile defense idea is another ripped straight from the movies

OpenAI brings GPT-4o back as a default for all paying ChatGPT users, Altman promises ‘plenty of notice’ if it leaves again

The end of perimeter defense: When your own AI tools become the threat actor

Do you have any questions?

Quicklinks

Solutions

Company