Blaxel raises $7.3M seed round to build ‘AWS for AI agents’ after processing billions of agent requests

Blaxel, a startup building cloud infrastructure specifically designed for artificial intelligence agents, has raised $7.3 million in seed funding led by First Round Capital, the company announced Tuesday. The financing comes just three months after the six-founder team graduated from Y Combinator’s Spring 2025 batch, underscoring investor appetite for infrastructure plays in the rapidly expanding AI agent market.

The San Francisco-based company is betting that the current generation of cloud providers — Amazon Web Services, Google Cloud, and Microsoft Azure — is fundamentally mismatched with the new wave of autonomous AI systems that can take actions without human intervention. These AI agents, which handle everything from managing calendars to generating code, require dramatically different infrastructure than traditional web applications built for human users.

“The current cloud providers have been designed for the Web 2.0, Software as a Service era,” said Paul Sinaï, Blaxel’s co-founder and CEO, in an exclusive interview with VentureBeat. “But with this new wave of agentic AI, we believe that there is a need for a new type of infrastructure which is dedicated to AI agents.”

The timing reflects a broader shift in enterprise computing as companies increasingly deploy AI agents for customer service, data processing, and workflow automation. Unlike traditional applications where databases sit alongside web servers in predictable patterns, AI agents create unique networking challenges by connecting to language models in one region, APIs in another cloud, and knowledge bases elsewhere—all while users expect instant responses.

Blaxel has already demonstrated significant traction, processing millions of agent requests daily across 16 global regions by the end of its Y Combinator batch. One customer has logged more than 1 billion seconds of agent runtime processing millions of videos, a scale that illustrates the infrastructure demands of AI-first companies.

“One of our customers is processing session replays to enable product managers to understand better how the user behavior of their product,” Sinaï explained. “They need to process millions of session replays every month. So it represents millions of minutes of sessions. They are using our agentic infrastructure to process those session replays and provide insights for product managers.”

The company’s approach centers on providing infrastructure that AI agents can operate themselves, rather than requiring human administrators. This includes sandboxed virtual machines that boot in under 25 milliseconds, automatic scaling based on agent activity patterns, and APIs designed to be consumed directly by AI systems rather than human developers.
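The scale-to-zero pattern described above can be modeled in a few lines. This is a generic illustration of the idea, not Blaxel's actual implementation: a sandbox boots on demand and is shut down once it has been idle past a threshold.

```python
class IdleShutdownController:
    """Toy scale-to-zero controller: the sandbox starts on demand and
    stops after an idle window expires. Purely illustrative; the class
    and parameter names are hypothetical, not Blaxel's API."""

    def __init__(self, idle_timeout_s: float = 1.0):
        self.idle_timeout_s = idle_timeout_s
        self.running = False
        self.last_activity = 0.0

    def handle_request(self, now: float) -> None:
        # Boot on demand (the article cites sub-25 ms cold starts).
        self.running = True
        self.last_activity = now

    def tick(self, now: float) -> None:
        # Stop the sandbox once it has been idle past the threshold.
        if self.running and now - self.last_activity >= self.idle_timeout_s:
            self.running = False


ctl = IdleShutdownController(idle_timeout_s=1.0)
ctl.handle_request(now=0.0)
ctl.tick(now=0.5)   # still within the idle window -> stays up
print(ctl.running)  # True
ctl.tick(now=1.5)   # idle past the threshold -> shut down
print(ctl.running)  # False
```

The appeal of the pattern is economic: compute only exists, and only costs money, while an agent is actually doing work.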

How six co-founders with a successful exit plan to take on Big Tech

Blaxel’s unusual six-founder structure stems from the team’s shared experience building and selling a previous company to OVHcloud, Europe’s largest cloud provider. That company became OVH’s entire analytics product suite, giving the team firsthand experience with both cloud infrastructure challenges and successful exits.

“I know it sounds unusual, pretty big team. We didn’t fit exactly on the stage for demo day,” Sinaï said, referencing Y Combinator’s signature event. “But we already did that. My previous company, which I sold to OVH cloud, we were also six co-founders.”

The team includes Charles Drappier, whom Sinaï has known for over 30 years, along with co-founders Christophe Ploujoux, Nicolas Lecomte, Thomas Crochet, and Mathis Joffre. Their collective experience spans infrastructure, developer tools, and platform engineering — critical expertise for competing against tech giants with virtually unlimited resources.

“I think it’s important to be six right now, because we have a lot of ambition,” Sinaï said. “What we are doing is building this next generation of cloud computing for this new agentic era.”

What sets Blaxel apart in the competitive cloud infrastructure market

The cloud infrastructure market is notoriously competitive, with AWS commanding roughly one-third market share and newer players like Modal, Replicate, and RunPod targeting AI workloads. Blaxel differentiates itself by focusing specifically on AI agents rather than model inference or training.

“Most of the competitors you mentioned are solving a very difficult problem, which is around the inference — how you can host your model, how you can make those models as fast as you can in terms of number of tokens,” Sinaï said. “But there is not that many people working on infrastructure for the agents, and it’s exactly what we are doing.”

The company’s platform includes three main components: agent hosting for deploying AI systems as serverless APIs, MCP (Model Context Protocol) servers for connecting agents to external tools, and a unified gateway for accessing multiple AI models. The infrastructure is designed to handle the variable resource demands of AI agents, which might require minimal computing power while waiting for responses but need significant resources during active processing.
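The third component, a unified gateway, is conceptually a single entry point that routes each request to whichever provider hosts the requested model. A minimal sketch of that routing idea, with entirely hypothetical names and no resemblance claimed to Blaxel's real interface:

```python
class UnifiedGateway:
    """Minimal sketch of a unified model gateway: one entry point,
    many providers. Names are hypothetical, for illustration only."""

    def __init__(self):
        self.providers = {}  # model name -> callable that runs the prompt

    def register(self, model: str, handler) -> None:
        self.providers[model] = handler

    def complete(self, model: str, prompt: str) -> str:
        # Route the request to the provider that serves this model.
        if model not in self.providers:
            raise KeyError(f"no provider registered for {model!r}")
        return self.providers[model](prompt)


gw = UnifiedGateway()
gw.register("provider-a/small", lambda p: f"[a] {p}")
gw.register("provider-b/large", lambda p: f"[b] {p}")
print(gw.complete("provider-a/small", "hello"))  # [a] hello
```

In practice a gateway like this also handles credentials, retries, and usage metering, which is what makes it valuable to an agent that juggles several models at once.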

Enterprise security and compliance features target regulated industries

Despite targeting younger AI-first companies, Blaxel has implemented enterprise-grade security measures including SOC 2 and HIPAA compliance. The platform offers data residency controls that allow customers to restrict workloads to specific geographic regions—critical for companies in regulated industries.

“We provide a policy framework where you can attach, for example, to workloads to say, this agent cannot run outside of those subsets of regions,” Sinaï explained. “You can attach a policy to say this agent cannot run outside of the United States, so you are sure that this agent will process the data only in the regions you have chosen.”

This approach reflects the company’s belief that even early-stage AI companies need robust infrastructure practices because they’re building the enterprises of tomorrow. “We believe that it’s very important to have, even for young companies, the best infrastructure with the best practices, because they are going to become enterprises,” Sinaï said.

Pay-as-you-go pricing delivers 50% cost savings over traditional serverless

Blaxel has adopted a pay-as-you-go pricing model similar to established cloud providers, moving away from an initial subscription approach after validating market demand during their Y Combinator batch. The model charges customers only when their agents are actively processing tasks, shutting down infrastructure during idle periods to optimize costs.

“We provide infrastructure that spin up in just few milliseconds and shut down in just one second,” Sinaï said. “So you just pay for the time your agent is actually processing something. When your agent is waiting for something else, you don’t have to pay for it because we shut it down.”

The approach has already delivered cost savings for customers, with one client achieving a 50% cost reduction compared with typical serverless solutions while processing terabytes of data monthly.
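The arithmetic behind active-time billing is straightforward. In this sketch the rate and utilization figures are invented for illustration; it simply shows that when an agent is busy about half the time, billing only for active seconds halves the bill, which mirrors the roughly 50% savings the article reports for one customer.

```python
# Hypothetical figures, chosen only to illustrate the billing model.
rate_per_second = 0.00002          # made-up $/vCPU-second
seconds_per_month = 30 * 24 * 3600
active_fraction = 0.5              # agent busy half the time

# Always-on billing charges for every second the instance exists;
# active-time billing charges only while the agent is processing.
always_on_cost = rate_per_second * seconds_per_month
active_only_cost = rate_per_second * seconds_per_month * active_fraction

savings = 1 - active_only_cost / always_on_cost
print(f"savings: {savings:.0%}")   # savings: 50%
```

The savings scale with idleness: an agent that mostly waits on external APIs or model responses pays for very little of the wall-clock time it exists.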

Gartner predicts 75% of apps will use AI agents by 2028

The investment comes as industry analysts predict explosive growth in AI agent adoption. Gartner forecasts that 75% of application development will involve AI agents by 2028, though Sinaï believes current enterprise adoption remains largely experimental.

“Right now, most of companies working actively in production are mostly smaller companies, not yet enterprise companies,” he said. “So we are focusing really on serving them exactly like the big cloud providers did in the past.”

The strategy mirrors how Amazon Web Services initially focused on startups and developer-friendly companies before expanding to enterprise customers. Blaxel plans to follow a similar path, using the $7.3 million to expand their software platform before potentially moving into custom hardware and data center optimization.

“Seven millions is not enough to build data centers, obviously, but I think it’s important to go step by step,” Sinaï said. “Being sure that right now we have the best interfaces we can provide to our customers, the best services for their agents, and then going into the deeper infrastructure optimization.”

The company’s roadmap includes features like snapshot forking for agent experimentation, automatic failover capabilities, and deeper optimization for the massive scale they anticipate. With projections of hundreds of billions of AI agents in the coming decades, Blaxel sees an opportunity to build infrastructure designed for this new computing paradigm from the ground up.

“We believe that there is a huge economy which is starting around the agents,” Sinaï said. “There are going to be hundreds of billions of AI agents, and the infrastructure we have today has not been designed for this new wave.”

The funding round included participation from Y Combinator, Liquid2, Transpose, and angel investors who share the company’s vision of purpose-built agent infrastructure. As AI agents transition from experimental tools to production systems handling critical business processes, Blaxel’s specialized approach could position it to capture significant market share in what may become the next major category of cloud computing.

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

Why enterprises need to drive telecom standards

Cutting access costs by supporting VPN-over-FWA or standardizing SD-WAN interconnects could save enterprises as much as a quarter of their VPN costs, but neither is provided in 5G or assured in 6G. Enterprises could change that if they applied appropriate pressure. Reason No. 3: Satellite, private mobile, public mobile, and

Read More »

Broadcom scales up Ethernet with Tomahawk Ultra for low latency HPC and AI

Broadcom Support for minimum packet size allows streaming of those packets at full bandwidth. That capability is essential for efficient communication in scientific and computational workloads. It is particularly important for scale-up networks where GPU-to-switch-to-GPU communication happens in a single hop. Lossless Ethernet gets an ‘Ultra’ boost Another specific area

Read More »

Petrobras Eyes Retail Return to Hold Down Pump Prices

Petrobras is considering a return to retail fuel sales after President Luiz Inacio Lula da Silva and the state-controlled oil company’s top executive complained about high pump prices. Four years after exiting the business now known as Vibra Energia SA, Petrobras’ board of directors will meet this week to discuss amending the company’s strategic plan to include a presence in the retail sector, according to a person familiar with the matter who asked not to be named discussing private matters.  It’s unclear if such a move would involve trying to fully re-nationalize Vibra or buying a stake in the convenience-store operator and distributor of cooking fuels and other petroleum products. The proposal to be discussed for the 2026-2030 strategic plan would position Petrobras as a diversified and integrated energy company, the person said. Vibra was privatized during the Jair Bolsonaro administration. Petrobras’ media-relations office declined to comment. Lula has complained that wholesale price cuts by Petrobras for gasoline, diesel and other products haven’t flowed through to consumers at the retail level. He has blamed both filling stations and state-level taxes for the disparities. “It’s not possible for Petrobras to announce such a huge discount on diesel and for this discount not to reach the consumer,” Lula said earlier this month while announcing refinery investments. “Even when Petrobras cuts back, many gas stations don’t.” Lula has also said privatization has created multiple layers in the distribution system that result in higher prices for consumers.  “Petrobras currently releases a 13-kilogram gas cylinder for 37 reais and it gets at a poor person’s house for 140 reais,” Lula said at the early July event. State control of retail outlets would allow more efficient delivery of the fuel, he added. Petrobras Chief Executive Officer Magda Chambriard has also expressed concern that filling stations aren’t

Read More »

Quantum Capital Says Oil in Mid $60s Is Profit Red Zone

Activity is slowing in US oil fields as drillers remain in the crude-price danger zone for profits, according to one of the biggest investors of private operators in the shale patch. “In the mid-$60s, you get dangerously close to where oil prices don’t really drive appropriate returns for new drilling,” Dwight Scott, who joined Quantum Capital Group at the start of this month as executive vice chairman, said on Bloomberg TV Wednesday. “So, activity in the oil field is slowing; I think that’s a temporary thing.” West Texas Intermediate, the US benchmark, has fallen 8% since the start of this year, trading at $65.82 a barrel on Wednesday. Scott, who helped build Blackstone Inc.’s credit arm into a $330 billion business, said while uncertainty around tariffs has contributed to reduced drilling activity, he expects the US “will continue to be a leader in oil and gas.” WHAT DO YOU THINK? Generated by readers, the comments included herein do not reflect the views and opinions of Rigzone. All comments are subject to editorial review. Off-topic, inappropriate or insulting comments will be removed.

Read More »

Trump taps Project 2025 contributor to fill vacant FERC seat

The White House on Wednesday named David LaCerte, an official in the U.S. Office of Personnel Management, to fill a vacant seat at the Federal Energy Regulatory Commission. LaCerte has served as the principal White House liaison and senior advisor to the director of the OPM since January, according to his LinkedIn page. He worked at OPM during the first Trump administration. The office is the chief human resources agency and personnel policy manager for the federal government. When he joined the OPM, LaCerte was set to help craft policy on workforce relations, collective bargaining and employee accountability, according to his former law firm in New Orleans, Sternberg, Naccari & White. LaCerte contributed to Project 2025, a presidential transition effort organized by the conservative Heritage Foundation that includes The Mandate for Leadership, a road map to “deconstruct the Administrative State.” LaCerte also worked as acting managing director at the U.S. Chemical Safety and Hazard Investigation Board starting at the end of President Donald Trump’s first term. LaCerte was a special counsel at the Baker Botts law firm for two years, starting in January 2023. While there, he worked on energy litigation and environmental, safety and incident response issues. FERC regulates natural gas infrastructure, wholesale electricity and gas markets, hydroelectric projects and interstate electric transmission. If confirmed by the Senate, LaCerte would serve for the remainder of former FERC Chairman Willie Phillips’ term, which expires June 30, 2026, according to the White House. LaCerte will likely move through the Senate confirmation process with Laura Swett, an energy attorney at Vinson & Elkins who Trump nominated for a FERC seat on June 2. Swett would assume the seat held by FERC Chairman Mark Christie. It is unclear how quickly the Senate will be able to act on the nominations. If confirmed, FERC

Read More »

Utilities may speed renewable projects under new tax credit timeline: Jefferies

Dive Brief: Utilities are set to accelerate the development of their renewable energy projects in order to qualify for Inflation Reduction Act tax credits within the new one-year safe harbor period set by the Republican megabill that passed earlier this month, according to a July 10 report from investment bank Jefferies.  Jefferies anticipates utilities “with renewables-heavy plans” – like Xcel Energy, WEC Energy Group, CMS Energy, and Ameren – “to accelerate projects originally slated for 2030–31 into 2027–28 … While affordability concerns linger, we believe investors are too focused on potential capital pullbacks and not enough on who’s actually accelerating spend.” “The provisions of the new law provide a sufficient path for us to continue delivering new, affordable, clean energy to our customers through the end of the decade,” said Theo Keith, a senior media relations representative at Xcel Energy. “Our well-established planning process ensures we can manage policy changes while working to meet our states’ energy goals and keeping bills as low as possible for our customers.” Dive Insight: “Meeting the unprecedented demand for energy in the U.S. to support our growing economy will require a wide range of energy sources and strengthened infrastructure,” Keith said. “While we supported a longer-term phase-down of the wind and solar tax credits, we recognize that budgets require compromise … we remain focused on an ‘all-of-the-above’ approach for the energy we provide.” The Republican budget megabill, which President Donald Trump signed into law July 4, stipulates that wind and solar projects must start construction within a year of the law’s enactment to qualify for the IRA’s clean electricity production and investment tax credits, or be subjected to an end of 2027 “placed in service” deadline to be eligible. As part of a reported deal with the Freedom Caucus, Trump also issued an executive order

Read More »

Trump unveils $92B in energy and AI investments for Pennsylvania

President Donald Trump made an appearance at the Pennsylvania Energy and Innovation Summit on Tuesday and announced that companies including Google, Blackstone and FirstEnergy plan to make $92 billion in energy and AI investments in the state. Blackstone will be building and operating “new natural gas-based, combined-cycle generation stations” in a joint venture with PPL Corp “to power data centers under long-term energy services agreements with regulated-like risk profiles that do not expose the companies to merchant energy and capacity price volatility,” said Edison Electric Institute in a release. FirstEnergy Chair, President and CEO Brian Tierney announced at the summit that his utility plans to invest more than $28 billion “systemwide to modernize local distribution systems and strengthen the transmission network. In Pennsylvania, that includes spending $15 billion in the infrastructure enhancements, people, processes, and facilities needed to deliver safe, reliable power.” Thar Casey, CEO of AmberSemi, a developer of power management technologies, including a power conversion solution for AI data centers, attended the summit and said his “first impression from talking to Pennsylvanians is that they’re excited about getting that kind of attention.” “It’s fantastic for the state; it tells me that [Sen. Dave McCormick, R-Pa.] is doing his job,” Casey said. However, he added, he doesn’t only see the announcements as a plus for Pennsylvania, but the U.S. in general. “I had a chance to talk to some very key influential people and speak with them about the efficiency aspect of things, in addition to the focus that they have on power,” he said. “You see it in their eyes when you bring up efficiency — it’s a subject that they’re focused on.” Trump’s announcement was criticized by environmental groups like Evergreen Action, which issued a release saying the president had “unveiled a plan to double down on expensive fossil fuels” after

Read More »

With Blackstone venture, PPL emerges ‘biggest winner’ from data center summit

Dive Brief: Utility company PPL Corp. is the “biggest winner” from the Pennsylvania Energy and Innovation Summit in Pittsburgh this week with its joint venture with Blackstone Infrastructure to build gas-fired power plants to serve data centers in Pennsylvania and across the PJM Interconnection, Jefferies analysts said Wednesday. PPL and Blackstone are negotiating with multiple potential hyperscale counterparties, according to the analysts, who noted that any new power plants would be operating by 2030 at the earliest. “The joint venture is actively engaged with landowners, natural gas pipeline companies and turbine manufacturers, and has secured multiple land parcels to enable this new generation buildout; however, no [energy services agreements] with hyperscalers have been signed to date,” PPL said Tuesday. Dive Insight: Plans to build gas-fired generation in Pennsylvania comes amid a surge in data center development across the United States, fueled in part by a race to develop artificial intelligence capacity. In PPL Electric Utilities’ service territory in Pennsylvania, there is more than 13 GW of potential data center load in advanced stages of planning, according to PPL. If all those data centers are built, there would be a 6 GW generation shortfall in PPL Electric Utilities’ service territory in the next five to six years, PPL said.  It would cost about $15 billion to build enough gas-fired, combined-cycle units to meet the shortfall, PPL said, noting that it expects the power plants would be built by the joint venture, independent power producers and — if legislation is passed to change Pennsylvania law — PPL Electric Utilities. Blackstone said it expects to spend $25 billion on data centers and energy infrastructure in Pennsylvania. QTS, a data center operator backed by Blackstone, has secured land sites across northeastern Pennsylvania for data centers, the private equity firm said. PPL owns 51% of

Read More »

Cisco upgrades 400G optical receiver to boost AI infrastructure throughput

“In the data center, what’s really changed in the last year or so is that with AI buildouts, there’s much, much more optics that are part of 400G and 800G. It’s not so much using 10G and 25G optics, which we still sell a ton of, for campus applications. But for AI infrastructure, the 400G and 800G optics are really the dominant optics for that application,” Gartner said. Most of the AI infrastructure builds have been for training models, especially in hyperscaler environments, Gartner said. “I expect, towards the tail end of this year, we’ll start to see more enterprises deploying AI infrastructure for inference. And once they do that, because it has an Nvidia GPU attached to it, it’s going to be a 400G or 800G optic.” Core enterprise applications – such as real-time trading, high-frequency transactions, multi-cloud communications, cybersecurity analytics, network forensics, and industrial IoT – can also utilize the higher network throughput, Gartner said. 

Read More »

Supermicro bets big on 4-socket X14 servers to regain enterprise trust

In April, Dell announced its PowerEdge R470, R570, R670, and R770 servers with Intel Xeon 6 Processors with P-cores, but with single and double-socket servers. Similarly, Lenovo’s ThinkSystem V4 servers are also based on the Intel Xeon 6 processor but are limited to dual socket configurations. The launch of 4-socket servers by Supermicro reflects a growing enterprise need for localized compute that can support memory-bound AI and reduce the complexity of distributed architectures. “The modern 4-socket servers solve multiple pain points that have intensified with GenAI and memory-intensive analytics. Enterprises are increasingly challenged by latency, interconnect complexity, and power budgets in distributed environments. High-capacity, scale-up servers provide an architecture that is more aligned with low-latency, large-model processing, especially where data residency or compliance constraints limit cloud elasticity,” said Sanchit Vir Gogia, chief analyst and CEO at Greyhound Research. “Launching a 4-socket Xeon 6 platform and packaging it within their modular ‘building block’ strategy shows Supermicro is focusing on staying ahead in enterprise and AI data center compute,” said Devroop Dhar, co-founder and MD at Primus Partner. A critical launch after major setbacks Experts peg this to be Supermicro’s most significant product launch since it became mired in governance and regulatory controversies. In 2024, the company lost Ernst & Young, its second auditor in two years, following allegations by Hindenburg Research involving accounting irregularities and the alleged export of sensitive chips to sanctioned entities. Compounding its troubles, Elon Musk’s AI startup xAI redirected its AI server orders to Dell, a move that reportedly cost Supermicro billions in potential revenue and damaged its standing in the hyperscaler ecosystem. Earlier this year, HPE signed a $1 billion contract to provide AI servers for X, a deal Supermicro was also bidding for. 
“The X14 launch marks a strategic reinforcement for Supermicro, showcasing its commitment

Read More »

Moving AI workloads off the cloud? A hefty data center retrofit awaits

“If you have a very specific use case, and you want to fold AI into some of your processes, and you need a GPU or two and a server to do that, then, that’s perfectly acceptable,” he says. “What we’re seeing, kind of universally, is that most of the enterprises want to migrate to these autonomous agents and agentic AI, where you do need a lot of compute capacity.” Racks of brand-new GPUs, even without new power and cooling infrastructure, can be costly, and Schneider Electric often advises cost-conscious clients to look at previous-generation GPUs to save money. GPU and other AI-related technology is advancing so rapidly, however, that it’s hard to know when to put down stakes. “We’re kind of in a situation where five years ago, we were talking about a data center lasting 30 years and going through three refreshes, maybe four,” Carlini says. “Now, because it is changing so much and requiring more and more power and cooling you can’t overbuild and then grow into it like you used to.”

Read More »

My take on the Gartner Magic Quadrant for LAN infrastructure? Highly inaccurate

Fortinet being in the leader quadrant may surprise some given they are best known as a security vendor, but the company has quietly built a broad and deep networking portfolio. I have no issue with them being considered a leader and believe for security conscious companies, Fortinet is a great option. Challenger Cisco is the only company listed as a challenger, and its movement out of the leader quadrant highlights just how inaccurate this document is. There is no vendor that sells more networking equipment in more places than Cisco, and it has led enterprise networking for decades. Several years ago, when it was a leader, I could argue the division of engineering between Meraki and Catalyst could have pushed them out, but it didn’t. So why now? At its June Cisco Live event, the company launched a salvo of innovation including AI Canvas, Cisco AI Assistant, and much more. It’s also continually improved the interoperability between Meraki and Catalyst and announced several new products. AI Canvas is a completely new take, was well received by customers at Cisco Live, and reinvents the concept of AIOps. As I stated above, because of the December cutoff time for information gathering, none of this was included, but that makes Cisco’s representation false. Also, I find this MQ very vague in its “Cautions” segment. As an example, it states: “Cisco’s product strategy isn’t well-aligned with key enterprise needs.” Some details here would be helpful. In my conversations with Cisco, which includes with Chief Product Officer and President Jeetu Patel, the company has reiterated that its strategy is to help customers be AI-ready with products that are easier to deploy and manage, more automated, and with a lower cost to run. That seems well-aligned with customer needs. If Gartner is hearing customers want networks

Read More »

Equinix, AWS embrace liquid cooling to power AI implementations

With AWS, it deployed In-Row Heat Exchangers (IRHX), a custom-built liquid cooling system designed specifically for servers using Nvidia’s Blackwell GPUs, it’s most powerful but also its hottest running processors used for AI training and inference. The IRHX unit has three components: a water‑distribution cabinet, an integrated pumping unit, and in‑row fan‑coil modules. It uses direct to chip liquid cooling just like the equinox servers, where cold‑plates attached to the chip draw heat from the chips and is cooled by the liquid. The warmed coolant then flows through the coils of heat exchangers, where high‑speed fans Blow on the pipes to cool them, like a car radiator. This type of cooling is nothing new, and there are a few direct to chip liquid cooling solutions on the market from Vertiv, CoolIT, Motivair, and Delta Electronics all sell liquid cooling options. But AWS separates the pumping unit from the fan-coil modules, letting a single pumping system to support large number of fan units. These modular fans can be added or removed as cooling requirements evolve, giving AWS the flexibility to adjust the system per row and site. This led to some concern that Amazon would disrupt the market for liquid cooling, but as a Dell’Oro Group analyst put it, Amazon develops custom technologies for itself and does not go into competition or business with other data center infrastructure companies.

Read More »

Intel CEO: We are not in the top 10 semiconductor companies

The Q&A session came on the heels of layoffs across the company. Tan was hired in March, and almost immediately he began to promise to divest and reduce non-core assets. Gelsinger had also begun divesting the company of losers, but they were nibbles around the edge. Tan is promising to take an axe to the place. In addition to discontinuing products, the company has outsourced marketing and media relations — for the first time in more than 25 years of covering this company, I have no internal contacts at Intel. Many more workers are going to lose their jobs in coming weeks. So far about 500 have been cut in Oregon and California but many more is expected — as much as 20% of the overall company staff may go, and Intel has over 100,000 employees, according to published reports. Tan believes the company is bloated and too bogged down with layers of management to be reactive and responsive in the same way that AMD and Nvidia are. “The whole process of that (deciding) is so slow and eventually nobody makes a decision,” he is quoted as saying. Something he has decided on is AI, and he seems to have decided to give up. “On training, I think it is too late for us,” Tan said, adding that Nvidia’s position in that market is simply “too strong.” So there goes what sales Gaudi3 could muster. Instead, Tan said Intel will focus on “edge” artificial intelligence, where AI capabilities Are brought to PCs and other remote devices rather than big AI processors in data centers like Nvidia and AMD are doing. “That’s an area that I think is emerging, coming up very big and we want to make sure that we capture,” Tan said.

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one ramping up its investments in AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs). In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple to devote $200 billion between them to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are far higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.
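Taken at face value, those figures imply steep growth. A quick back-of-the-envelope check, using only the numbers quoted above:

```python
# Figures from the Bloomberg Intelligence report cited above ($ billions).
capex_2023_total = 110     # six-company total, 2023
capex_2025_total = 200     # six-company total, projected 2025
msft_2025_estimate = 62.4  # Microsoft alone, calendar 2025

# Roughly 82% growth over two years...
growth_pct = (capex_2025_total - capex_2023_total) / capex_2023_total * 100

# ...with Microsoft accounting for about 31% of the projected total.
msft_share_pct = msft_2025_estimate / capex_2025_total * 100
```

That one company could represent nearly a third of the combined spend of six of the world’s largest tech firms underlines how concentrated the AI infrastructure buildout is.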

Read More »

John Deere unveils more autonomous farm machines to address skilled labor shortage

Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. Moline, Illinois-based John Deere has been in business for 187 years, yet the non-tech company has become a regular at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually, even as the agricultural workforce continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences its own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement learning and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models with these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends. It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S. National Institute of Standards and Technology (NIST), all of which had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find.
What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »