Writer launches a ‘super agent’ that actually gets sh*t done, outperforms OpenAI on key benchmarks


Writer, the enterprise artificial intelligence company valued at $1.9 billion, launched an autonomous “super agent” Tuesday that can independently execute complex, multi-step business tasks across hundreds of software platforms, marking a significant escalation in the corporate AI arms race.

The new Action Agent represents a fundamental shift from AI chatbots that simply answer questions to systems that can autonomously complete entire projects. The agent can browse websites, analyze data, create presentations, write code, and coordinate work across an organization’s entire technology stack without human intervention.

“Other AI chatbots can tell you what to do. Action Agent does it,” said May Habib, Writer’s CEO and co-founder. “It’s the difference between getting a research report and having your entire sales pipeline updated and acted upon.”

The launch positions San Francisco-based Writer as a formidable competitor to Microsoft’s Copilot and OpenAI’s ChatGPT in the lucrative enterprise market, where companies are racing to deploy AI systems that can automate knowledge work. Unlike consumer-focused AI tools, Writer’s agent includes enterprise-grade security controls and audit trails that regulated industries like banking and healthcare require.


How Writer’s super agent executes tasks other AI can only describe

Writer’s Action Agent fundamentally differs from existing AI assistants by operating at what the company calls “level four orchestration” — the highest tier of AI automation. Most current enterprise AI tools operate at levels one or two, handling basic tasks like answering questions or retrieving documents.

“The reality is most of the market is anywhere between one to two,” explained Matan-Paul Shetrit, Writer’s head of product, in an interview with VentureBeat. “What we’ve done here is full orchestration. This is an agent that calls agents, writes its own tools when needed, can execute on that with full visibility.”

The distinction goes far beyond simple automation capabilities. While traditional AI assistants like ChatGPT or Copilot are “very much built for like a Q and A experience,” Shetrit noted, Action Agent is designed for execution. “The difference is, one is not just about like, let me do this back and forth brainstorming, but more like, once and if I want to do the brainstorming, I can also act on it.”

The agent operates within its own isolated virtual computer for each session, allowing it to independently browse web pages, build software, solve technical problems, and execute complex multi-step plans. When asked to perform a product analysis, for example, Action Agent will automatically process thousands of customer reviews, perform sentiment analysis, identify themes, and generate a presentation — all without human guidance.

The system’s capabilities extend to generating its own tools when existing ones prove insufficient. “It can action whether or not it has MCP or any tool access, because it can just generate its own tools on the fly for the purpose of the task,” Shetrit explained.

During a demonstration, Shetrit showed the agent conducting clinical trial site selection — a process that typically requires weeks of human research. The agent systematically analyzed demographics across multiple cities, ranked locations by suitability criteria, and generated comprehensive reports with supporting evidence.

“This is weeks worth of work by these companies,” Shetrit noted. “It’s not something that’s trivial to do.”

Breaking benchmarks: Action Agent outperforms OpenAI on key tests

Writer’s claims about the agent’s capabilities are backed by impressive benchmark results. Action Agent scored 61% on GAIA Level 3, the most challenging benchmark for AI agent performance, outperforming competing systems including OpenAI’s Deep Research. The agent also achieved a 10.4% score on the CUB (Computer Use Benchmark) leaderboard, making it the top performer for computer and browser use tasks.

These results demonstrate the agent’s ability to handle complex reasoning tasks that have traditionally stumped AI systems. GAIA Level 3 tests require agents to navigate multiple tools, synthesize information from various sources, and complete multi-step workflows — precisely the kind of work that enterprises need automated.

The performance stems from Writer’s Palmyra X5 model, which features a one-million-token context window — enough to process hundreds of pages of documents simultaneously while maintaining coherence across complex tasks. This massive context capability allows the agent to work with entire codebases, lengthy research reports, and comprehensive datasets without losing track of the overall objective.

Writer’s enterprise focus sets it apart in a market dominated by consumer-oriented AI companies attempting to adapt their products for business use. The company built Action Agent on its existing enterprise platform, which already serves hundreds of major corporations including Accenture, Vanguard, Qualcomm, Uber, and Salesforce.

The distinction proves crucial for enterprise adoption. While consumer AI tools often operate as “black boxes” with limited transparency, Writer’s system provides complete audit trails showing exactly how the agent reached its conclusions and what actions it took.

Shetrit emphasized this transparency as essential for regulated industries: “If you start talking about some of the largest companies in the world, whether it’s banks or pharmaceutical companies or healthcare companies, it’s unacceptable that you don’t know how these autonomous agents are behaving and what they’re doing, and you can audit and have full visibility on what the hell is happening in that box.”

The system provides “full traceability, auditability and visibility,” allowing IT administrators to set fine-grained permissions controlling which tools each agent can access and what actions they can perform.

Action Agent’s ability to connect with over 600 enterprise tools represents a significant technical achievement. The agent uses Model Context Protocol (MCP), an emerging standard for AI tool integration, but Writer has enhanced it with enterprise-grade controls that address security and governance concerns.

Writer has been working closely with Amazon Web Services and other industry players to bring MCP to enterprise standards. “There’s still place to bring it to enterprise grade,” Shetrit noted, referencing recent issues with MCP implementations at companies like Asana and GitHub.

The company’s approach allows granular control that extends beyond simple user permissions. “It’s not just by a user. It will also have it by the specific agent,” Shetrit explained. “So as an IT persona or a security persona, I have the controls I need to feel comfortable with this data access.”

For example, administrators can permit certain agents to publish messages to Slack while preventing them from deleting messages. “You need that fine grained control, and that’s something we’re baking in as part of the system,” Shetrit said.
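The Slack example above can be sketched as a per-agent allow-list with an audit record for every decision. This is a minimal illustration, not Writer's implementation: the class names, the `notifier` agent, and the `publish_message` / `delete_message` action names are all hypothetical, and the sketch assumes a default-deny policy.

```python
from dataclasses import dataclass, field


@dataclass
class AgentPolicy:
    """Hypothetical allow-list of (tool, action) pairs for a single agent."""
    agent_id: str
    allowed: set = field(default_factory=set)

    def permits(self, tool: str, action: str) -> bool:
        return (tool, action) in self.allowed


@dataclass
class PolicyEngine:
    """Checks requests against per-agent policies and records each decision."""
    policies: dict = field(default_factory=dict)
    audit_log: list = field(default_factory=list)

    def register(self, policy: AgentPolicy) -> None:
        self.policies[policy.agent_id] = policy

    def authorize(self, agent_id: str, tool: str, action: str) -> bool:
        policy = self.policies.get(agent_id)
        # Default deny: unknown agents and unlisted actions are refused.
        allowed = policy is not None and policy.permits(tool, action)
        # Every decision, allowed or not, is appended to the audit trail.
        self.audit_log.append(
            {"agent": agent_id, "tool": tool, "action": action, "allowed": allowed}
        )
        return allowed


engine = PolicyEngine()
# This agent may publish to Slack but was never granted delete rights.
engine.register(AgentPolicy("notifier", {("slack", "publish_message")}))

can_publish = engine.authorize("notifier", "slack", "publish_message")
can_delete = engine.authorize("notifier", "slack", "delete_message")
```

Keying the policy on the agent rather than only on the user mirrors the distinction Shetrit draws, and the audit log gives administrators a record of refused requests as well as granted ones.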

The company has pre-announced support for more than 600 tools, each offering fine-grained control both at the integration level and for specific agents. This capability allows Action Agent to coordinate work across an organization’s entire technology ecosystem, from customer relationship management systems to financial databases.

Free AI agents challenge traditional software pricing models

Writer’s decision to offer Action Agent free to existing customers challenges traditional software pricing models and reflects broader shifts in the AI industry. The move comes despite the significant computational costs associated with the agent’s extensive token usage.

“Token pricing is extremely problematic when you start thinking about enterprises,” Shetrit explained. “They need a budget line item. They need to figure out the cost structure. This highly variable cost model does not work for these companies, and that is why we’ve been moving away from this for a while now.”
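The budgeting problem Shetrit describes can be illustrated with simple arithmetic. Every number below is hypothetical, chosen only to show how a usage-based bill swings from month to month while a flat fee does not. None of the figures reflects Writer's or any other vendor's actual pricing.

```python
def usage_based_cost(tokens_used: int, price_per_million: float) -> float:
    """Bill for one month under per-token pricing."""
    return tokens_used / 1_000_000 * price_per_million


# Hypothetical monthly token consumption for one team's agent workloads.
monthly_tokens = [40_000_000, 250_000_000, 90_000_000]
PRICE_PER_MILLION = 5.0   # hypothetical dollars per million tokens
FLAT_FEE = 1000.0         # hypothetical fixed monthly fee

usage_bills = [usage_based_cost(t, PRICE_PER_MILLION) for t in monthly_tokens]
flat_bills = [FLAT_FEE] * len(monthly_tokens)

# The gap between the cheapest and most expensive month is what makes
# a usage-based bill hard to fit into a fixed budget line item.
usage_spread = max(usage_bills) - min(usage_bills)
flat_spread = max(flat_bills) - min(flat_bills)

print("usage-based bills:", usage_bills, "spread:", usage_spread)
print("flat bills:", flat_bills, "spread:", flat_spread)
```

With these made-up inputs the usage-based bill varies by more than a thousand dollars between months, while the flat fee has no spread at all, which is the predictability enterprises are asking for.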

The strategy reflects Writer’s confidence in its cost-efficient model development. The company spent just $700,000 to train its Palmyra X4 model, compared to an estimated $4.6 million for a similarly sized OpenAI model. This efficiency stems from Writer’s use of synthetic data and innovative training techniques that reduce computational requirements.

Writer’s reasoning for the free offering goes beyond competitive positioning. “We think this shows the full value of the ecosystem and the platform, and really starts delivering on the promise of AI,” Shetrit said. Internal users have reported being more excited about this AI product than any previous AI tool they’ve used, including other copilot systems.

Enterprise AI market heats up as startups target Microsoft and Google

Writer’s Action Agent launch escalates competition in the rapidly expanding enterprise AI market, projected to grow from $58 billion to $114 billion by 2027. The company competes directly with Microsoft’s Copilot suite, Google’s enterprise AI offerings, and OpenAI’s business products, but targets a different market segment with its enterprise-first approach.

The competitive positioning reflects a broader industry split between companies building general-purpose AI systems and those focusing specifically on enterprise needs. Writer’s approach prioritizes security, governance, and reliability over raw capability, betting that enterprise customers will choose specialized tools over consumer products adapted for business use.

“Most of their focus is on the consumer realm versus us, which was like, this is not where we’re at,” Shetrit emphasized regarding competitors. “We are fully on the Enterprise B to B side.”

This focus has paid off financially. Writer raised $200 million in Series C funding in November 2024 at a $1.9 billion valuation, nearly quadrupling its previous valuation. The round was co-led by Premji Invest, Radical Ventures, and ICONIQ Growth, with participation from major enterprise players including Salesforce Ventures, Adobe Ventures, and IBM Ventures.

From automation to transformation: How AI will reshape corporate work

Writer’s vision extends beyond current automation to fundamentally reshape how enterprises operate. The company identifies two clusters of use cases emerging in large organizations: traditional “90% workflow, 10% AI” optimization and new “90% AI, 10% workflow” experiences that unlock entirely new capabilities.

“Each employee will have a thing like this next to them that helps them do their work, helps them automate a lot of it, so you can do much higher leverage work across the organization,” Shetrit predicted.

This transformation addresses a critical shift in enterprise software expectations. As employees become accustomed to sophisticated AI tools in their personal lives, enterprise software must match or exceed that quality. “You cannot afford for enterprise software to not be as good, and in a lot of cases, significantly better,” Shetrit noted.

The shift is already changing internal dynamics at Writer itself. “Historically, as a PM, I can say that execution was the bottleneck. So I can always say no, because I don’t have capacity. Capacity is no longer the bottleneck,” Shetrit explained. When his product managers claim they don’t have time for projects, he now uses Action Agent to generate “at least 80% and 70% and 90% of the work for them so they can start working on it.”

This represents a fundamental change from “scarcity to an abundance mentality” that will require “a lot of retraining element that has to happen within the org.”

Inside Writer’s collaboration with Uber to build real-world AI agents

Writer’s collaboration with Uber on Action Agent development illustrates how the company leverages customer relationships to improve its technology. Uber’s AI Solutions team provided operational expertise for scaling high-quality annotations across complex enterprise domains, while simultaneously validating the agent’s capabilities in real-world use cases.

“Our collaboration with WRITER allowed us to contribute our deep operational expertise in high-quality data annotation to help shape an agent capable of tackling the most complex enterprise challenges,” said Megha Yethadka, GM and Head of Uber AI Solutions.

This partnership model allows Writer to develop agents that solve actual enterprise problems rather than theoretical use cases. The approach has generated diverse applications across industries, from HR candidate sourcing and securities analysis to clinical trial site selection and competitive intelligence.

Shetrit noted that customer creativity continues to surprise the team: “I’m sure, because that’s the nature of platform and technology, is if we have this conversation again in a week after tomorrow, I’ll have completely different use cases, because our customers will be very, very creative in how they use them.”

What’s next: Rollout timeline and enterprise adoption strategy

Writer plans to expand Action Agent’s capabilities significantly over the coming weeks. The company will add connections to 80 enterprise platforms and third-party data providers like PitchBook and FactSet, enabling access to the full suite of 600+ agent tools.

The rollout strategy reflects lessons learned from enterprise AI deployments. Rather than launching with full capabilities, Writer is starting with core functionality and gradually adding integrations based on customer feedback and real-world testing.

Action Agent is available immediately in beta to Writer’s existing customer base, with a 14-day trial available for new users. The gradual rollout allows the company to refine the system based on enterprise feedback while maintaining the security and reliability standards that regulated industries require.

The launch signals a pivotal moment in the enterprise AI revolution, where autonomous agents are moving from experimental curiosities to mission-critical business tools. As traditional software vendors scramble to add AI features to existing products, Writer’s agent-first approach may determine which companies successfully navigate the transition from human-driven to AI-augmented work.

But perhaps the most telling sign of this shift came from Shetrit himself during the interview: “We will all become, you know, quote, unquote, managers of these fleet of agents, whether they’re humans or synthetic agents.” In this future, the companies that learn to orchestrate AI agents alongside human workers may find themselves with an insurmountable advantage over those still clinging to purely human-driven processes.

Daily insights on business use cases with VB Daily

If you want to impress your boss, VB Daily has you covered. We give you the inside scoop on what companies are doing with generative AI, from regulatory shifts to practical deployments, so you can share insights for maximum ROI.

Read our Privacy Policy

Thanks for subscribing. Check out more VB newsletters here.

An error occured.

Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy, bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

For many NFL teams, a new season means infrastructure modernization

Cisco has been aggressive in its platform strategy, and the willingness of six more teams to embrace it demonstrates the maturity of its AI-ready infrastructure. Cisco has been an official NFL partner for the past five seasons. Aaron Amendolia, the NFL’s deputy CIO, cited the vendor’s delivery of “world-class networking

Quinas readies UltraRam, flash memory with DRAM speed

For starters, the memory is built on what is called a” III-V technology,” a class of semiconductor materials that are composed of elements from groups III and V of the periodic table, the company stated. These materials have unique properties that make them ideal for use in electronic devices such

7 Wi-Fi certifications to bolster wireless networking skills

Organization: Certified Wireless Network Professionals (CWNP) Price: $149.99 for the CWTS-102 exam How to prepare: CWNP offers several resources to prepare including: a live training class, a self-paced training kit, a study and reference guide, an electronic practice test, an eLearning module, an eLearning bundle, and a test and go

Running the numbers on VMware Cloud Foundation: How one midsize enterprise is making it work

Grinnell leases its infrastructure, and one big renewal coming up was that of all of its hardware, which would happen a little over a year after the VMware subscription renewal. “We were all-in on HPE,” said Wright. “It’s a great platform.” Grinnell was leasing its servers and storage, not buying,

Saudi Arabia’s $5.5B Bond Sets Course for Record Issuance

Saudi Arabia sold $5.5 billion of international bonds on Tuesday to help plug its budget deficit, putting it on course for a record year of issuance as it continues to spend heavily on Crown Prince Mohammed bin Salman’s economic-diversification projects. The two-part Sukuk, or Islamic debt, sale was made up of a $2.25 billion five-year note and a $3.25 billion 10-year bond. The shorter tranche priced with a spread of 65 basis points over US Treasuries, while the longer one was sold at 75 basis points. Investors had placed around $17.5 billion of orders, underscoring the strong demand for Saudi debt even as the Gulf nation ramps ups issuance in the face of lower oil prices and high spending that’s squeezing the government’s finances. Saudi Arabia has now sold almost $20 billion in dollar- and euro-denominated debt this year, according to data compiled by Bloomberg. That cements its status as one of the busiest issuers in emerging markets. It is also well above the 2024 full-year tally and within a whisker of the annual record of $21.5 billion set in 2017. The latest sale adds to a pick up in syndicated loan activity and a fresh wave of local bank issuance as Saudi companies, the government and sovereign wealth fund come up with extra financing to back the crown prince’s Vision 2030 agenda. Those financing needs are increasingly pressing, with Brent oil prices down about 8% this year to around $69 a barrel. With prices subdued, the Saudi government projects a fiscal shortfall of about 2.3% of gross domestic product this year. It has widely telegraphed its intention to issue bonds to fill the hole. That’s in addition to measures such as privatizing some assets. While the kingdom’s ratio of debt to GDP is low by global standards at under 30%, the International Monetary Fund sees

It’s time for customer-oriented approaches to generator interconnection

Travis Kavulla is vice president of regulatory affairs at NRG Energy and a past president of the National Association of Regulatory Utility Commissioners. Eric Blank is chairman of the Colorado Public Utilities Commission. Hardly a week goes by in the power sector these days without recriminations arising from a new era of tight supply conditions relative to soaring demand in the power sector. One of the hottest points of contention is the broken process by which new power plants are interconnected to the power grid. In order to gain grid access, a would-be power plant files an interconnection request to a transmission utility, and, having staked that claim, it takes a place in line — a line that has grown and grown, with years of delay for interconnecting projects to gain access to the grid, with the amount of projects in that line growing in all markets to many multiples of the given market’s total demand. These Gold Rush rules were appropriate at the origins of the industry’s restructuring when substantial latent capacity in the power grid was being underutilized, or where additional grid capacity could be opened up for only modest incremental cost. The transformational “open access” orders issued by the Federal Energy Regulatory Commission during this era unleashed an era of investment in power generators by standardizing utility transmission tariffs and making it clear that anyone who wanted to make a go of it in the power-generation business could do so. But like most Gold Rushes, this early bonanza that was meant to quickly tap a resource has ended up in a bureaucratic snarl. The grid today is largely tapped out, yet there are too many stakes in the ground that will never prove out. The messy backlog of the generator interconnection queue that has been left in

US DOE Earmarks $35MM to Support Emerging Energy Tech

The United States Department of Energy (DOE) will channel more than $35 million toward developing emerging energy technologies. The DOE said in a media release that the funds will be divided among 42 projects related to grid security, artificial intelligence, nuclear energy, and advanced manufacturing, and located at DOE national laboratories, plants, and sites. The selected projects will leverage over $21 million in cost share from private and public partners, bringing total funding to more than $57.5 million, according to DOE. The funds are provided through DOE’s Technology Commercialization Fund (TCF) program, managed through the Office of Technology Commercialization’s Core Laboratory Infrastructure for Market Readiness (CLIMR) Lab Call. The program, according to DOE, strengthens America’s economic and national security by supporting public-private partnerships that maximize taxpayer investments, advance American innovation, and ensure the U.S. stays ahead in global competitiveness. “The Energy Department’s National Labs play an important role in ensuring the United States leads the world in innovation”, DOE Secretary Chris Wright said. “These projects have the potential to accelerate technological breakthroughs that will define the future of science and help secure America’s energy future”. This year’s selections span across 19 DOE national labs, plants, and sites, DOE said, highlighting Lawrence Berkeley National Laboratory’s launch of America’s Cradle to Commerce (AC2C), which builds on the Cradle to Commerce (C2C) program. It provides wraparound support for lab-to-market innovation. In just 18 months, C2C has proven impact with more than $15M raised by participating startups and five commercial pilots launched, DOE said. Pacific Northwest National Laboratory plans to enhance and broaden the free Visual Intellectual Property Search (VIPS) tool with the VIPS 2.0 project. 
The new platform will enable smooth searches across a wide range of National Lab innovations available for licensing or open-source sharing, DOE said. Meanwhile, Argonne National Laboratory

California Geothermal Lease Sales Net Over $2.7MM

The U.S. Bureau of Land Management announced, in a statement posted on its website recently, that Bureau of Land Management geothermal lease sales in California netted over $2.7 million. The Bureau noted in that statement that it accepted winning bids on 13 parcels across 22,685 public acres in Imperial, Lassen, and Modoc counties for $2,711,858 in total receipts for a geothermal lease sale. The Bureau said in the statement that it may issue leases once review and payment are complete. “The sale generated an average of $117 per acre offered, supporting American prosperity by increasing potential for domestic energy production,” the Bureau stated. “For each parcel leased, 50 percent of the bid, rental receipts, and subsequent royalties will go to the state of California, 25 percent will go to the county where the lease is located, and the remaining 25 percent will go to the U.S. Treasury,” it added. “Geothermal lease sales support domestic energy production and American energy independence, while contributing to the nation’s economic and military security,” the Bureau continued. “Consistent with Executive Order 14154, ‘Unleashing American Energy’, the BLM’s geothermal lease sales help meet the energy needs of U.S. citizens and solidify the nation as a global energy leader long into the future and achieve American Energy Dominance,” it went on to state. The Bureau noted in the statement that leasing is the first step in the process to develop federal geothermal resources. The organization added that it ensures geothermal development meets the requirements set forth by the National Environmental Policy Act of 1969 and other applicable legal authorities. In its statement, the Bureau described geothermal as “an abundant resource, especially in the West, where the BLM has authority to manage geothermal resource leasing, exploration, and development on approximately 245 million surface acres of public lands and the 700 million acres

Texas Critical Data Centers and Thunderhead Ink Preliminary Power Deal

Texas Critical Data Centers LLC (TCDC), a 50-50 venture between New Era Energy & Digital Inc. and Sharon AI Inc., has signed a non-binding term sheet with Thunderhead Energy Solutions LLC for a natural gas-fired generation facility with a capacity of about 250 megawatts. Thunderhead will fund, construct and operate the facility using a hybrid deployment of reciprocating engines and turbines. The facility will serve “as the energy backbone for TCDC’s high-performance, AI-optimized compute campus”, a joint statement said. The parties expect to start construction of the power facility this year, targeting completion over the next 18 months. Planned to rise in Ector County, Texas, TCDC would be scalable to up to one gigawatt, according to New Era. In July TCDC completed the acquisition of 235 acres from Grow Odessa near the City of Odessa. It has entered into a letter of intent with the same seller for the purchase of an additional 203 contiguous acres. “The agreement with Thunderhead is one more major milestone in our buildout and reinforces our vision of delivering energy-resilient, AI-native infrastructure”, said New Era chief executive E. Will Gray II. “It also ensures TCDC will provide robust, SB6-compliant power to support the next wave of AI growth in West Texas”. This is the first agreement announced by New Era Energy & Digital since rebranding from New Era Helium Inc. to reflect its shift into a vertically integrated energy supplier. The rebranded New Era aims to develop “next-generation digital infrastructure and integrated power assets, including powered land and powered shells”, it said in a statement August 12. “The company delivers turnkey solutions that will enable hyperscale, enterprise and edge operators to accelerate data center deployment, optimize total cost of ownership and future-proof their infrastructure investments”. The Midland, Texas-based company “projects generational AI infrastructure demand will grow exponentially

Plains to Become Majority Owner of EPIC Crude Pipeline System

Diamondback Energy Inc. and Kinetik Holdings Inc. have signed agreements to sell each of their 27.5 percent stakes in EPIC Crude Holdings LP to Plains for around $1.57 billion. Plains will become the majority owner with a 55 percent interest in EPIC Crude Holdings, owner of the EPIC Crude Oil Pipeline. Ares Management Corp.’s EPIC Midstream Holdings LP will retain an operating stake of 45 percent. Stretching 800 miles, the pipeline system carries Delaware Basin and Midland Basin supply from locations near Crane, Midland, Orla and Wink, Texas, and Eagle Ford supply from locations near Gardendale and Hobson, Texas. The pipeline system delivers the oil to EPIC Crude Holdings’ 3.4-million-barrel Robstown Terminal near Corpus Christi, according to EPIC Midstream. The pipeline system, which became fully operational April 2020, has a nameplate capacity of 600,000 barrels per day (bpd), expandable up to one million bpd, and nearly seven million barrels of operational storage, according to EPIC Midstream. The assets boost Plains’ Permian wellhead to water strategy, Plains said in a statement on its website, noting the pipeline system is “underpinned by long-term minimum volume commitments from high-quality customers”. “This transaction strengthens our position as the premier crude oil midstream provider, complements our asset footprint and enhances our customer offering”, said Plains chair, chief executive and president Willie Chiang. “The combination of our stake in EPIC Crude Holdings coupled with our existing integrated Permian and Eagle Ford assets enhances our commitment to offering a high level of connectivity and flexibility for our customers. “By further linking our Permian and Eagle Ford gathering systems to Corpus Christi, we are enhancing market access and ensuring our customers have reliable, cost-effective routes to multiple demand centers”. Plains agreed to pay Diamondback and Kinetik an additional $193 million should an expansion of the pipeline system to a

SAP data sovereignty service lets customers run cloud workloads inside their data centers

A range of developments, primarily geo-political in nature, have transformed this outlook. Now, sovereignty is as much tied up with the growing sense that operational, political, and even technological independence is essential, especially for EU-based enterprises. SAP has embraced this concern. “The digital resilience of Europe depends on sovereignty that is secure, scalable and future-ready,” said Martin Merz, president, SAP Sovereign Cloud. “SAP’s full-stack sovereign cloud offering delivers exactly that, giving customers the freedom to choose their deployment model while helping ensure compliance up to the highest standards.” This reflects the company’s commitment to supporting the EU’s “digital autonomy,” he said. The company has made digital sovereignty a strategic priority, and will invest €20 billion ($23.3 billion) to develop new digital sovereignty products for the EU as well as for other territories. A decade ago, the idea of cloud services promoted the notion of a single global infrastructure market. Now it looks just as likely that there will be a balkanization of global cloud infrastructure into geographical domains. “For decades, enterprises have handed over too much power to their cloud providers – power over infrastructure, power over availability, and most importantly, power over their own data,” commented Garima Kapoor, co-founder and co-CEO of US AI object storage company, MinIO. “CIOs are realizing that outsourcing control to a public cloud provider is no longer an option. The concept of sovereignty is evolving. It’s no longer just as a means of maintaining compliance with data regulations but is now viewed as a strategic and architectural imperative for enterprises that want to own their digital destiny,” she said.

Alibaba Cloud tweaks software for networking efficiency gains

Alibaba Cloud said that it has been using ZooRoute in AliCloud for the last 18 months, where it has reduced outage time by 92.71%.

Nezha for network performance in high-demand VMs

Another software upgrade is helping Alibaba Cloud maintain network performance for high-demand virtual machines (VMs) without spending more on SmartNIC-accelerated virtual switches (vSwitches). Nezha, a distributed vSwitch load-sharing system, identifies idle SmartNICs and uses them to create a remote resource pool for high-demand virtual NICs (vNICs). Alibaba has tested the system in its data centers for a year and said in the paper that “Nezha effectively resolves vSwitch overloads and removes it as a bottleneck.” With the number of concurrent flows improved by up to 50x, and the number of vNICs by up to 40x, the bottleneck is now the VM kernel stack, the researchers wrote. Forrester’s Dai said that Nezha’s stateless offloading and cluster-wide pooling design is superior to solutions being pursued by rival cloud service providers. Separately, Alibaba’s cloud computing division has also been working on another software update that will enable it to provide better network performance for AI workloads.

AI networking success requires deep, real-time observability

Most research participants also told us they need to improve visibility into their data center network fabrics and WAN edge connectivity services.

The need for real-time data

Observability of AI networks will require many enterprises to optimize how their tools collect network data. For instance, most observability tools rely on SNMP polling to pull metrics from network infrastructure, and these tools typically poll devices at five-minute intervals. Shorter polling intervals can adversely impact network performance and tool performance. Sixty-nine percent of survey participants told EMA that AI networks require real-time infrastructure monitoring that SNMP simply cannot support. Real-time telemetry closes visibility gaps. For instance, AI traffic bursts that create congestion and packet drops may last only seconds, an issue that a five-minute polling interval would miss entirely. To achieve this level of metric granularity, network teams will have to adopt streaming network telemetry. Unfortunately, support of such technology is still uneven among network infrastructure and network observability vendors due to a lack of industry standardization and a perception among vendors that customers simply don’t need it. Well, AI is about to create a lot of demand for it. In parallel to the need for granular infrastructure metrics, 51% of respondents told EMA that they need more real-time network flow monitoring. In general, network flow technologies such as NetFlow and IPFIX can deliver data in near real time, with delays of seconds or a couple of minutes depending on the implementation. However, other technologies are less timely. In particular, the VPC flow logs generated by cloud providers do not offer the same data granularity. Network teams may need to turn to real-time packet monitoring to close cloud visibility gaps.
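To make the polling-interval point concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not a model of any real tool: the utilization figures, the four-second burst, the 90% congestion threshold, and all function names are hypothetical. It averages the same per-second trace over five-minute windows, roughly what a poller reading counter deltas would report, and compares that with one-second samples.

```python
# Hypothetical figures for illustration only; none come from the EMA research.
BASELINE = 0.20               # assumed steady link utilization (fraction of capacity)
BURST = 0.98                  # assumed utilization during a short AI traffic burst
CONGESTION_THRESHOLD = 0.90   # assumed level at which drops become likely


def build_trace(duration_s=600, burst_start=130, burst_len=4):
    """Per-second utilization with a single burst lasting a few seconds."""
    return [
        BURST if burst_start <= t < burst_start + burst_len else BASELINE
        for t in range(duration_s)
    ]


def polled_averages(trace, interval_s):
    """Average utilization per interval, as a counter-delta poller would see it."""
    averages = []
    for start in range(0, len(trace), interval_s):
        window = trace[start:start + interval_s]
        averages.append(sum(window) / len(window))
    return averages


def congested_samples(samples):
    """Samples at or above the congestion threshold."""
    return [s for s in samples if s >= CONGESTION_THRESHOLD]


trace = build_trace()
five_minute = polled_averages(trace, 300)   # typical SNMP polling interval
per_second = polled_averages(trace, 1)      # streaming-style granularity

print("5-minute samples flagged:", len(congested_samples(five_minute)))
print("1-second samples flagged:", len(congested_samples(per_second)))
```

In this toy trace the burst is averaged away in the five-minute view and no sample crosses the threshold, while the per-second view flags each second of the burst. Real pollers and streaming telemetry pipelines differ in many details this sketch ignores.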
Smarter analysis for smarter networks Network teams also need their network

Equinix Bets on Nuclear and Fuel Cells to Meet Exploding Data Center Energy Demand

A New Chapter in Data Center Energy Strategy

Equinix’s strategic investments in advanced nuclear and fuel cell technologies mark a pivotal moment in the evolution of data center energy infrastructure. By proactively securing power sources like Oklo’s fast reactors and Radiant’s microreactors, Equinix is not merely adapting to the industry’s growing energy demands but is actively shaping the future of sustainable, resilient power solutions. This forward-thinking approach is mirrored across the tech sector. Google, for instance, has partnered with Kairos Power to develop small modular reactors (SMRs) in Tennessee, aiming to supply power to its data centers by 2030. Similarly, Amazon has committed to deploying 5 gigawatts of nuclear energy through partnerships with Dominion Energy and X-energy, underscoring the industry’s collective shift towards nuclear energy as a viable solution to meet escalating power needs. The urgency of these initiatives is underscored by projections from the U.S. Department of Energy, which anticipates data center electricity demand could rise to 6.7%–12% of total U.S. production by 2028, up from 4.4% in 2023. This surge, primarily driven by AI technologies, is straining existing grid infrastructure and prompting both public and private sectors to explore innovative solutions. Equinix’s approach, i.e. investing in both immediate and long-term energy solutions, sets a precedent for the industry. By integrating fuel cells for near-term needs and committing to advanced nuclear projects for future scalability, Equinix exemplifies a balanced strategy that addresses current challenges while preparing for future demands. As the industry moves forward, the collaboration between data center operators, energy providers, and policymakers will be crucial.
The path to a sustainable, resilient energy future for data centers lies in continued innovation, strategic partnerships, and a shared commitment to meeting the digital economy’s power needs responsibly.

Evolving to Meet AI-Era Data Center Power Demands: A Conversation with Rehlko CEO Brian Melka

On the latest episode of the Data Center Frontier Show Podcast, we sat down with Brian Melka, CEO of Rehlko, to explore how the century-old mission-critical power provider is reinventing itself to support the new realities of AI-driven data center growth. Rehlko, formerly known as Kohler Energy, rebranded a year ago but continues to draw on more than a century of experience in power generation and backup systems. Melka emphasized that while the name has changed, the mission has not: delivering reliable, scalable, and flexible energy solutions to support always-on digital infrastructure. Meeting Surging AI Power Demands Asked how Rehlko is evolving to support the next wave of data center development, Melka pointed to two major dynamics shaping the market: Unprecedented capacity needs driven by AI training and inference. New, “spiky” usage patterns that strain traditional backup systems. “Power generation is something we’ve been doing longer than anyone else, starting in 1920,” Melka noted. “As we look forward, it’s not just about the scale of backup power required — it’s about responsiveness. AI has very large short-duration power demands that put real strain on traditional systems.” To address this, Rehlko is scaling its production capacity fourfold over the next three to four years, while also leveraging its global in-house EPC (engineering, procurement, construction) capabilities to design and deliver hybrid systems. These combine diesel or gas generation with battery storage and short-duration modulation, creating a more responsive power backbone for AI data centers. “We’re the only ones out there that can deliver that breadth of capability on a full turnkey basis,” Melka said. “It positions us to support customers as they navigate these new patterns of energy demand.” Speed to Power Becomes a Priority In today’s market, “speed to power” has become the defining theme. Developers and operators are increasingly considering

Data Center Chip Giants Negotiate Political Moves, Tariffs, and Corporate Strategies

And with the current restrictions being placed on US manufacturers selling AI parts to China, reporting says NVIDIA is developing a Blackwell-based China chip, more capable than the current H20 but still structured to comply with U.S. export rules. Reuters reported that it would be a single-die design (roughly half the compute of the dual-die B300), with HBM and NVLink, sampling as soon as next month. A second compliant workstation/inference product (RTX6000D) is also in development. Chinese agencies have reportedly discouraged use of NVIDIA H20 in government work, favoring Huawei Ascend. However, there have been reports describing AI training using the Ascend to be “challenging”, forcing some AI firms to revert to NVIDIA for large-scale training while using Ascend for inference. This keeps China demand alive for compliant NVIDIA/AMD parts, hence the U.S. interest in revenue-sharing. Meanwhile, AMD made its announcements at June’s “Advancing AI 2025” to set MI350 (CDNA 4) expectations and a yearly rollout rhythm that’s designed to erase NVIDIA’s time lead as much as fight on absolute perf/Watt. If the MI350 systems ramp aligns with major cloud designs in 2026, AMD’s near-term objective is defending MI300X momentum while converting large customers to multi-vendor strategies (often pairing MI clusters with NVIDIA estates for redundancy and price leverage). The 15% China license fee will shape how AMD prices MI-series export SKUs and whether Chinese hyperscalers still prefer them to the domestic alternative (Huawei Ascend), which continues to face software/toolchain challenges. If Chinese buyers balk or Beijing discourages purchases, the revenue-share may be moot; if they don’t, AMD has a path to keep seats warm in China while building MI350 demand elsewhere. Beyond China export licenses, the U.S. and EU recently averted a larger trade war by settling near 15% on certain sectors, which included semiconductors, as opposed to the far more

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs). In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple to devote $200 billion between them to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

John Deere unveils more autonomous farm machines to address skill labor shortage

Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences its own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

2025 playbook for enterprise AI success, from agents to evals

2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find.
What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle
