Stay Ahead, Stay ONMINE

Chip sales are set to soar in 2025 — so long as there isn’t a trade war | Deloitte

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Semiconductor chip sales are set to soar in 2025, led by generative AI and data center build-outs, even as demand from PC and mobile markets may be weak, according to Deloitte’s 2025 chip outlook report. The […]

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More


Semiconductor chip sales are set to soar in 2025, led by generative AI and data center build-outs, even as demand from PC and mobile markets may be weak, according to Deloitte’s 2025 chip outlook report.

The semiconductor industry had a robust 2024, with expected double digit (19%) growth, and sales of $627 billion for the year. But that’s even better than the earlier forecast of $611 billion. And 2025 could be even better, with predicted sales of $697 billion, reaching a new all-time high, and well on track to reach the widely accepted aspirational goal of $1 trillion in chip sales by 2030. To get there, the chip industry only has to grow at a compound annual growth rate of 7.5% from 2025 to 2030.

Of course, all of this assumes that the U.S. doesn’t get into a massive trade war as a result of Donald Trump’s plan to place tariffs on computer chips and other semiconductors. He reaffirmed last week that he not only planned to place tariffs on China of 10%, but that he would also put them on Taiwan, where big U.S. companies like Nvidia get their chips. The Consumer Technology Association estimates that tariffs could make game consoles 40% more expensive for U.S. consumers, with a 26% price increase for smartphones and 46% price increase for laptops.

I am guessing this probably came up as Jensen Huang, CEO of Nvidia, visited Trump on Friday just ahead of the tariff announcements on Saturday. And so far, Trump has not yet placed any tariffs on Taiwan or the chip industry. But it’s a fluid situation and it’s complicated.

“When you look at the idea of tariff in general across all industries, if you put a tariff on maple syrup, it’s either maple syrup or it isn’t maple syrup, and it either comes from outside the U.S. or it doesn’t. There are many, many, many kinds of chips. They are manufactured almost always in a highly complex supply chain with bits of them going from country A to B to C, back to A, over to D,” said Duncan Stewart, TMT Center research director at Deloitte, in an interview with GamesBeat. “Given the global nature of supply chains, anything along the line of chip restrictions or tariffs, likely will have an impact and would make supply chains more complex to administer and just in general complicate them.”

Assuming the industry continues to grow at 7.5% CAGR, it could reach $2 trillion in 2040. The stock market is often a leading indicator of industry performance: As of mid-December 2024, the combined market capitalization of the top 10 global chip companies was $6.5 trillion, up 93% from $3.4 trillion in mid-December 2023 and 235% higher than the $1.9 trillion we saw in mid-November 2022. Much of the reason for that was the growth of Nvidia, the AI chip maker, in my opinion.

Are subsidies working for reshoring chip production in the U.S.?

Jeroen Kusters, U.S. semiconductor leader at Deloitte, said in an interview with GamesBeat there are a lot of investments happening on reshoring of chip factories in the U.S. The big ones are under way with companies like Intel, Globalfoundries and TSMC building factories in the U.S. Stewart said that chip executives have said that the investments so far are just the first of what should be even larger numbers in terms of the investment required to bring semiconductor manufacturing back.

“After all of the plants that are in the process of being built and started and launched, at the end of all of that, by 2032, the U.S. may be up around 14% or something. It takes time. It is an absolutely massive industry. And moving the needle from 10% to 14% is in fact a remarkably good number. It’s a sign of how hard it is to move. And it’s the same for Europe, of course,” Stewart said.

How much should the industry invest in AI?

Jeroen Kusters, U.S. semiconductor leader at Deloitte.

This is the trillion dollar questions.

Regarding the risks of smaller models working like DeepSeek and the impact on AI chip demand, Stewart said, “Various people saying AI would be $400 billion or even $500 billion in addressable market, which would be on the order of 2028 or something like that. One of the complicating factors is that there are smaller models out there and more efficient models as well as edge computing. And all of those could change the demand. In other words, you still need GenAI chips, but you need different GenAI chips. Or it could even reduce the demand for GenAI. We actually said that in our outlook as a potential risk factor.”

He added, “Without commenting on any given small model, this has all been known for some time. Somebody comes out and says I have a thing where I used to use many, many expensive chips, and I can now use either newer chips or cheaper chips — that could change the size of the GenAI chip industry. We actually anticipated that.”

He noted that while many of the large hyperscaler data center operators re saying that they are reducing their capital expenditure plans for the next quarter of the next year.

“Although there is always and will always be a threat, one of the things that in recent weeks is the idea that maybe you don’t need as much generative AI infrastructure because of more efficient, smaller models,” Stewart said. “Two weeks before that, there was some remarkable gains in GenAI programming where they did what’s called chain of reasoning. And this was like last month. Those are significantly more accurate. And in fact they can use ten, a hundred, or even a thousand times as many chips as the previous models. So I think I’m comfortable saying that given which week it is, sometimes the focus is on more efficient AI models, but at the same time are news events that make it look like we need even more chips to do better AI.”

Jeroen Kusters, U.S. semiconductor leader at Deloitte, said in an interview with GamesBeat, “I think you’ll find that in terms of demand, there will be step changes in demand. This is sort of a step change in efficiency. We all expect it, and actually we absolutely need models to become more efficient. We expected a relatively linear trend on this. Everyone knows that models are going to get more attention. What we saw now see was a step change, and it confused people a little. That’s okay. We’re going to see more of these step changes. And most of the time, what you’ll find is that, indeed, as the models get better, they will require more performance. They will require more compute. We then get more efficient.”

Stewart said, “One of the large platform companies announced that they have five million users of their generative AI tool for a specific application. And people say, oh, it’s a pilot. No, it isn’t. They have five million companies doing this weekly. The idea that we’re still entirely a group of concepts and pilots is just plain wrong. They are one of the world’s largest companies, and this is a tool that not only their customers are using, but is a thing that the company itself says drives the effectiveness of their product.”

A tale of two markets

Duncan Stewart TMT Center research director at Deloitte

That said, it is worth noting that “average” chip stock performance in the last two years has been a “tale of two markets”: companies that are involved in the generative AI chip market outperformed that average, while companies without that exposure (automotive, computer, smartphone, and communications semiconductor companies, for example) underperformed the semiconductor market average.

One driver of industry sales has been the demand for generative AI (gen AI) chips: a mix of CPUs, GPUs, data center communications chips, memory, power chips, and more. The Deloitte 2024 TMT Predictions report predicted that those gen AI chips collectively would be worth “more than” $50 billion, which was a much too conservative forecast, as the market was likely over $125 billion in 2024 – and represented over 20% of total chip sales for the year.

Last year, Deloitte estimated that AI chips would grow by a strong number, but the AI industry sailed right past those optimistic numbers and grew even bigger.

At the time of publication, Deloitte predicts that gen AI chips will be over $150 billion in 2025. Further, AMD CEO Lisa Su has moved her estimate for the total addressable market for AI accelerator chips up to $500 billion in 2028, a number which is larger than sales for the entire chip industry in 2023.

Chip demand is exploding because of AI.

In terms of end markets, after being flat at around 262 million units in 2024 over 2023, PC sales are expected to grow in 2025 by over 4% to about 273 million units. Meanwhile, smartphone sales are expected to grow at low-single digits in 2025 (and beyond) to reach an estimated 1.24 billion units in 2024 (+6.2% year-over-year growth). These two end markets are important for the semi industry: In 2023, communication and computer chip sales (which include data center chips) made up 57% of overall semiconductor sales for the year compared to auto and industrial, which accounted for only 31% of sales combined, for example.

One challenge for the industry is that while gen AI chips and associated revenues (memory, advanced packaging, communications, and more) are responsible for outsized revenues and profits, they represent a small number of very high value chips, meaning that wafer capacity—and therefore utilization—for the industry as a whole isn’t as high as it might appear. In 2023, nearly a trillion chips were sold at an average selling price of US$0.61 per chip. At a rough estimate, although gen AI chips might account for 20% of revenues in 2024, they were less than 0.2% of wafers.

Even though global chip revenues for 2024 was forecast to rise 19%, silicon wafer shipments for the year actually declined an estimated 2.4% for the year. 16 That number is expected to grow by almost 10% in 2025, fueled by demand for components and technologies used largely in gen AI chips, such as chiplets, as mentioned in the 2025 TMT Predictions report. Of course, silicon wafers are not the only kind of capacity to track: Advanced packaging is growing even faster.

As an example, some analysts estimate that TSMC’s CoWoS (chip-on-wafer-on-substrate) 2.5D advanced packaging production capacity will reach 35,000 wafers per month (wpm) in 2024 and could increase to 70,000 wpm (100% YoY) and further by 30% YoY to 90,000 wpm by end of 2026.

Further, driving innovation in the industry is not cheap. In 2015, overall chip industry average spending on R&D was 45% of its EBIT (earnings before interest and taxes), but by 2024 it was an estimated 52% of EBIT. R&D seems to be growing at a 12% CAGR, white EBIT is only growing at 10% (see figure 2).

The semiconductor outlook: R&D and EBIT growth. (figure 2)

Finally, it’s worth reminding readers that the chip industry can be notoriously cyclical. The industry has flipped from growth to shrinkage nine times in the last 34 years (figure 3). 21 So it may seem that the industry is seeing less extreme growth or shrinkage in the last 14 years, compared to the 1990-2010 period, but the frequency of contractions seems to increase. 2025 looks solid for now, it’s hard to tell what 2026 will bring.

Making investments in a resilient supply chain will make sense around the world.

“Companies with or without incentives are deciding to build new plants in new places to shorten or make resilient supply chains. This is an industry where staying on top of the ball has been a thing they’ve been doing for half a decade now. It has been a constantly shifting mix of various incentives and restrictions. That is a fairly normal thing for the semiconductor industry,” Stewart said.

Global semiconductor industry—Historic billings (Three month moving average), 1990 to 2024 October YTD. (figure 3).

These trends and others play into the 2025 semiconductor industry outlook, where the firm drills down into four big topics for the year ahead: generative AI accelerator chips for PCs and smartphones and the enterprise edge; a new “shift left” approach to chip design; the growing global talent shortage; and the need to build resilient supply chains amid escalating geopolitical tensions.

Generative AI chips in PCs, smartphones, the enterprise edge, and IoT

Many of the chips that are being used for training and inference of gen AI cost tens of thousands of dollars and are destined for large cloud data centers. In 2024 and 2025, these chips or lightweight versions of these chips are also finding homes in the enterprise edge, in computers, in smartphones, and (over time) in other edge devices such as Internet of Things (IoT) applications. To be clear, in many cases these chips are being used for either gen AI, traditional AI (machine learning) or, increasingly, a combination of both.

The enterprise edge market was already a factor in 2024, but the question in 2025 will be about smaller, cheaper, less powerful versions of these chips becoming a key part of computers and smartphones. What they lack in per-chip value, they can make up for in volume: PC sales are expected to be over 260 million units in 2025, while smartphones are expected to be over 1.24 billion units.

Sometimes the “gen AI chip” can be a standalone single piece of silicon, but more commonly it’s a few square millimeters of dedicated AI processing real estate that is tiny part of a much larger chip.

Enterprise edge: Although generative AI via the cloud will likely continue to be a dominant option for many enterprises, about half of the enterprises worldwide are predicted to add AI data center infrastructure on-premises—an example of enterprise edge computing. 23 This could be, in part, to help protect their intellectual property and sensitive data and comply with data sovereignty or other regulations, but also to help them save money.

These chips are largely the same as those found in hyperscale data centers, with server racks costing millions of dollars and requiring hundreds of kilowatts. Although smaller than hyperscale chip demand, we estimate the chips for enterprise edge server chips will likely be worth tens of billions of dollars globally in 2025.

Demand for NPU-enabled PCs. (figure 4)


Personal computers: Sales of gen AI powered PCs are predicted to be half of all PCs in 2025, 26 with some forecasts suggesting that almost all PCs will have at least some on-board gen AI processing—also known as neural processing units (NPUs)—by 2028 (see figure 4). 27 These NPU-powered machines are expected to command a price premium of 10-15%, but it’s important to note that not all gen AI PCs are equal.

There’s a dividing line at the 40 TOPS (trillion operations per second) level, following a recommendation from major PC ecosystem companies that only computers with more than TOPS be considered true AI-enabled PCs. 29 As at the time of writing, some buyers are cautious about the new PCs, either unwilling to pay the premium, or waiting until more powerful gen AI NPUs are introduced in the back half of 2025.

As of December 2024, many of the installed base of PCs were running on x86 CPUs, with the balance being on CPUs based on the Arm architecture. MediaTek, Microsoft, and Qualcomm announced in 2024 that they would make Arm-powered PCs, specifically gen AI PCs. It’s unclear how successful these achines will be in the next 12 months, but it will likely be a key issue for the various chipmakers, with Qualcomm predicting it will sell $4 billion worth of PC chips annually by 2029.

Smartphones: Where PC NPUs might be worth tens of dollars in value, smartphone equivalent gen AI chips may be worth much less, and Deloitte estimated under $1 worth of silicon on next generation smartphone processors. Even though the smartphone market is over a billion units sold annually, and even though we predict gen AI smartphones will be 30% of phones sold in 2025, the semiconductor impact is likely smaller than PCs in dollar terms. Instead, an interesting angle for chipmakers could be to see if consumers are excited enough about new gen AI phones and features to shorten the replacement cycle. Consumers have been keeping phones longer before upgrading, and sales have been flat for
years now. 35 If gen AI enthusiasm causes an uptick in smartphone sales, that could benefit all kinds of chip companies, not just those that make the gen AI chips themselves.

IoT: A gen AI chip in a data center might cost $30,000. A gen AI chip on a PC might cost $30. A gen AI chip on a smartphone might be $3. For gen AI chips to work on the low-cost Internet of Things market, they should cost about $0.30. That’s unlikely to happen anytime soon, but with the possibility of tens of billions of IoT endpoints needing AI processors, this is a market to watch for the longer term.

“As good as Gen AI is,” Stewart said, other categories like PCs and smartphones are up a little or mostly flat, and automotive is actually down from a year ago.

“It was the best of times, it was the worst of times,” Stewart said. “Sometimes that is true, even when there are pockets of enormous growth in the semiconductor industry. It’s really important to remember there are other kinds of chips that are not growing at the same level. To some extent, the growth in GenAI is a spectacular success story, but it is masking some pockets of weakness out there in other parts of semiconductor manufacturing. And we just think it’s really important to remind people about that, because as an industry, there are companies that make GenAI chips and don’t make the other kinds and then there are companies that make the slower growing ones and aren’t benefiting from AI.”

As far as strategic questions for the industry go, Deloitte asked, “Although gen AI chips for data centers are in demand now, given their importance to industry growth, are there any signs that demand is weakening, or that processing is moving away from data centers to edge devices?”

Chip design ‘shifts left’ and calls for a greater collaboration across the industry

Deloitte predicted that, by 2023, AI would emerge as a powerful aid to human semiconductor engineers, assisting them on extreme complex chip design processes, and enabling them to find ways to improve and optimize PPA (power, performance and area). As of 2024, gen AI has enabled rapid iterations to enhance existing designs and discover entirely new ones and can do it in less time.

In 2025, there will likely be more emphasis towards ‘shift left’—an approach to chip design and development where testing, verification, and validation are moved up earlier in the chip design and development process — as optimization strategies could evolve from simple PPA metrics to system-level metrics like performance per watt, FLOPs per watt (FLOPs denotes floating point operations per second), and thermal factors. And the combination of advanced AI capabilities—graph neural networks (GNN) and reinforcement learning (RL)—will likely continue to help design chips that are more power-efficient than typical chips produced by human engineers.

Domain-specific and specialized chips are expected to continue to gain prominence over general-purpose ones, as several industries (such as automotive) and certain AI workloads would require customized approaches to designing chips. However, a widespread adoption of application-specific integrated circuits (or ASICs) remains less clear, as the development and maintenance of such hardware can be costly and could divert focus from other AI advancements. But here’s where gen AI tools can allow companies to design more specialized and competitive products including custom silicon.

3D ICs and heterogeneous architectures are introducing challenges related to arranging, assembling, validating, and testing the various chiplets, which can sometimes be pre-assembled. This shift towards system design over individual product design can incorporate software and digital twins early on—stressing the importance of early and frequent testing.

By 2025, synchronizing hardware, system, and software development upstream in the process will likely help redefine future system engineering and enhance overall efficiency, quality, and time-to-market.

To evolve and keep pace with the changing face of design, the industry may want to consider new ways to handle the complex design processes. Already, the chip industry is exploring digital twins to emulate and visualize complex design processes step-by-step, including the ability to move around or swap chiplets to measure and assess performance of a multi-chiplet system. And digital twins could increasingly be used to give a visual representation (via 3D modeling) of the physical end-device or the system to assist with all aspects of design, including mechanical as well as electrical (software and hardware).

Designers should work with EDA (electronic design automation) and other hi-tech CAD/CAE (computer-aided design/computer-aided engineering) companies to strengthen design, simulation, and verification and validation tools and capabilities for hybrid and complex heterogenous systems. And they also should consider using and adapting model-based system engineering (MBSE) tools as part of the broader EDA ‘shift left’ approach.

As design and software are expected to play crucial roles in the development of next-generation advanced chip products, bolstering cyber defense becomes more important, heading into 2025. To help align with shift left approach, chip designers should integrate security and safety testing early in the chip design process. They should implement redundancy and error correction and detection mechanisms to help ensure that systems can continue to operate even when some of the components fail, and hardware-based security features such as secure boot mechanisms and encryption engines.

Deloitte said among the strategic questions to consider: As AI in chip design becomes more prevalent and common and EDA becomes more and more AI-enabled, how can the industry proactively ensure trust and transparency in the complex design process by always keeping human engineers in the loop and giving them a major role in the overall process?

The intensifying talent challenges in semiconductor industry

A skill gap in chips looms. (figure 5)

In Deloitte’s 2023 Semiconductor industry outlook report, the firm wrote that the industry needs to add a million skilled workers by 2030, or more than 100,000 every year. Two years after, not only does that forecast hold good, but the talent challenge is expected to intensify further in 2025. Globally, countries are not producing enough skilled talent to meet their workforce needs.


From core engineering to chip design and manufacturing, operations, and maintenance, AI may help alleviate some engineering talent shortages, but the skill gap looms (see figure 5). Attracting and retaining talent will likely continue to be a challenge for many organizations in 2025, and a big part of the problem is an aging workforce, which is more prominent in the United States and even Europe. Add the complex geopolitical landscape and supply chain fragility to this equation, and it becomes clear that the availability of talent supply is under stress globally. With onshoring and reshoring of fabrication, assembly, and test in the US and Europe, there will likely be pressure on chip companies and foundries as they source more of the talent locally in 2025.

For example, talent challenges are contributing to delays in opening new plants. On a related note, “friendshoring” (collaborating with companies from countries considered to be allies) can provide stability and resilience to supply chains, especially for the United States and European Union. But it also demands scouting for the right skills to help meet new capacity demands and talent roles in destinations such as Malaysia, India, Japan, and Poland.

Chip companies can’t continue to wrestle over the same finite talent pool and still expect to match up to the industry’s pace of technological advancement and capacity expansion. So, what can semiconductor companies do in 2025 to address the talent conundrum?

To help attract AI and chip talent, chip companies should consider offering a sense of trust, stability, and projected market growth; with this, they can help make the industry more appealing to recent high school grads and fresh entrants to help reinvigorate talent pipelines.

Countries aiming to benefit from their respective domestic chips acts should consider weaving in strategic goals and aspects related to workforce development and activation. Some examples could include training programs, expanded vocational and professional education, and employment opportunities that their local chip companies would commit to receive funding. Semi companies should consider collaborating with educational institutions (high schools, technical colleges, and universities) and local government organizations to leverage chip funds to develop and curate targeted workforce training and development programs aligned with specific industry needs in the region.

Semi companies should design flexible upskilling and reskilling programs for career path flexibility to help address future workforce skills and gaps. Additionally, they should implement and leverage advanced tech and AI-based tools to assess diverse talent related factors such as supply, demand, and current and projected spend, to perform complex workforce scenario modeling to support strategic talent decision-making.

Deloitte said among the strategic questions to consider: How should the workforce be characterized and segmented based on specialization areas, for example, design and IP, and manufacturing, operator, engineering, and technical roles? And how can the industry customize talent sourcing and skill development strategies based on these roles, as well as based on specific geographic regions where hiring takes place?

Stewart said one thing that could hold back the reshoring of the chip industry in the U.S. is a big talent shortage. But he noted that talent shortage is global as every country is struggling to find enough people. That means retraining and research investments have to be made in order to keep the growth going.

Building resilient supply chains amid geopolitical tensions

Deloitte 2024 Semiconductor Outlook report already talked about geopolitical tensions in depth, so what’s new for 2025?

The same…but even more. As one example, in December of 2024 the outgoing administration issued a new list of US export restrictions mainly still focused on advanced nodes (despite some speculation that restrictions might be broadened to include some relatively less advanced nodes). These restrictions now include additional separate categories around advanced inspection and metrology. Additionally, many (over 100) new entities (mainly Chinese) have been added to the restricted entity list.

As part of these restrictions, the US seems to be adopting the “small yard, high fence” approach toward semiconductor export restrictions. This aims to impose a high level of restrictions on a relatively small subset of chip technologies with a focus on those that defense, including advanced weapon systems, and advanced AI used in military applications.

The new restrictions (if implemented by the new administration) go on to flag that AI advancements are increasingly being viewed as matters of national security. The day after those new restrictions, China announced further restrictions on the export of gallium and germanium (as well as other materials), both key for the manufacture of multiple semiconductors.

As Deloitte predicted in 2024, ongoing materials restrictions will likely pose a challenge for the chip industry, but also an imperative for the industry to do more recycling of e-waste. In mid-January of 2025, the outgoing administration announced Interim Final Rule on AI Technology Diffusion. The Interim Final Rule will impose new controls for chip exports.

At time of writing, it is unknown whether the incoming administration will roll back the December and January restrictions, modify them, or even propose additional restrictions.

Additionally, the new administration has proposed increasing its use of tariffs, including tariffs on goods from China, Mexico, and Canada. Given the global nature of most semi supply chains, the proposed new AI related chip export controls (by the outgoing administration) and the planned higher tariffs would likely have an impact and could make supply chains more complex to administer, shifting profits, costs, and more. And the impact could be felt across the supply chain – including R&D and manufacturing – as well as how industry policies are shaped across countries and regions.

Of course, there are additional geopolitical risks or changes: Conflicts in Ukraine/Russia and the Middle-East continue, potentially affecting semiconductor manufacturing, supply chains, and critical raw materials. But the chip industry has other vulnerable points: the December martial law order in South Korea highlighted the global supply chain dependency and concentration of certain types of semiconductors, especially in the most advanced technologies.

As an example of concentration, almost 75% of DRAM memory chips globally are made in South Korea. It’s not just geopolitics that can interrupt key materials: 2024’s Hurricane Helene briefly shut down two mines in North Carolina that are sources for nearly all of the world’s ultra-high purity quartz, essential for making the crucibles which are a key part of the chipmaking process. With hurricanes, typhoons, and other extreme weather events projected to become more frequent and intense due to climate change, expanding the sources for key materials is likely to continue to be a supply chain priority.

It is worth noting that, as of late 2024, a key part of the export restrictions from the United States and allies is having an effect: The restrictions around extreme ultraviolet (EUV) lithography machines seem to be posing a barrier, preventing Chinese companies from making advanced node chips at scale and with acceptable yields. Although there are 7 nm and 6 nm chips being made in limited numbers using older deep ultraviolet (DUV) technology, the volumes are low, yields are uneconomical, and that situation is expected to persist at least until 2026.

To be clear, semiconductor supply chains worked well in 2024, even as the industry grew by almost 20%. At this time, there’s no reason to believe 2025 supply chains will be less resilient, but as always, the risk is there. And given how important gen AI chips are expected to be in 2025 and beyond (up to 50% of sales, perhaps 76 ) and the relatively higher concentration of processor, memory, and packaging required for cutting-edge chips, the industry may be more vulnerable to supply chain disruptions than ever before. Although the industry is likely to become less concentrated geographically thanks to the various chips acts – and initiatives like onshoring, re-shoring, near shoring, and friendshoring are all still in their early days – the industry remains highly vulnerable for the next year or two, at least.

Deloitte said that among the strategic questions to consider was, “Given the fluid geopolitical environment and escalating export restrictions, what should be the mix of reshoring vs. offshoring? And how should the industry factor potential disruptions to any existing supply chain channel partner relationships in erstwhile friendly countries and allies, aka friendshoring?”

Signposts for the future

For 2025, semiconductor industry executives should be mindful of the following signposts:

  1. There is currently a mismatch between very high spending on semiconductors for gen AI, and companies being able to monetize their gen AI offerings. For 2025, the argument of “the risk of underinvesting is greater than the risk of overinvesting” seems to be still dominant, but if that attitude shifts, demand for gen AI chips could become weaker than predicted.
  2. Competition from agile chip startups could intensify, challenging incumbents in the broader semiconductor industry. Notably, AI chip startups secured a cumulative $7.6 billion in venture capital funding globally during Q2, Q3 and Q4 of 2024, and several of these startups offer specialized solutions including customizable RISC-V-based applications, chiplets, LLM inference chips, photonic ICs, chip design, and chip equipment.
  3. With interest rates in the United States and other major markets likely to drop further, a favorable credit environment could act as a tailwind for the chip industry’s M&A scene, which has already seen an uptick.
  4. Moreover, with two different chip markets evolving—one for AI chips and one for all other types of
    chips—the industry may experience M&A and consolidation, especially if companies with valuable IP lag their peers and are seen as attractive targets. Nonetheless, potential tighter regulations and trade conflicts, globally, could potentially dampen the deal environment.
  5. As geopolitical challenges ripple across the globe, chip companies should brace themselves for further disruptions. Traditional channel partner models and allyships could get upended, even as reshoring, friendshoring, and nearshoring have gained momentum. Prolonging regional conflicts and wars could further impact the flow of vital materials and inventories. All of these could disrupt semi companies’ demand planning, requiring them to be more agile and adapt supply chain and sourcing contracts, and pricing terms.
  6. A significant part of capex spending and revenues was driven by AI and the advanced wafers needed to produce those highly advanced AI chips. However, wafer demand from auto, industrial, and consumer segments continue to be lackluster, while there’s some uptick in demand from mobile handset and other consumer products.

Through 2025 and 2026, though the overall revenue and capex seem to continue trending upward (at least for the next nine to 12 months), any downward movement in AI-related spending and components shortage could have an adverse impact ripping through the broader global semiconductor and electronics supply chain.

The pandemic caused havoc in the supply chain for a couple of years. But Stewart said good things have happened to global semiconductor supply chains in the last four years. They are more resilient than they were last time, he said.

“And every single semiconductor supply chain expert in the world says that this is a good thing and we can continue to do it. And that there will also be another supply chain interruption and shortage that has severe problems at some point within the reasonable future,” he said. “You can make the supply chain more resilient. You cannot make it bulletproof. Things happen. There are droughts and fights and trade wars and restrictions and all of the different things that the economy [deals with.]”

Kusters noted that making the supply chain more resilient doesn’t happen overnight. That resilience is still building but it isn’t completely finished yet.

“We have talked about making more resilient supply chains very much a kind of incentive-based program, like the European Chips Act would be an example. We will build this plant in Poland because we don’t have one in Europe, and we need that, and it’s very directive. One of the things that we might see with restrictions and tariffs and other forms of things that shape supply chains is those might end up making different kinds of outcomes than you would have gotten [otherwise],” Stewart said.

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

Pantheon of college football gets a Wi-Fi upgrade

Notre Dame has fully adopted mobile ticketing and introduced grab-and-go concession stands, with plans to expand them further. Alcohol sales were recently approved, prompting efforts to support new services like mobile carts. In premium areas, fans can stream various games during events. Notre Dame also tested mobile ordering for concessions

Read More »

The U.S. leads the world in AI (job) anxiety

The Americans have the highest search volume with a population-adjusted value of 440,000 search queries on the topic of AI job loss, while their attitude towards AI is moderately positive at 54.5%. The intensity score of 3 for the U.S. shows that the concern of losing jobs to AI is

Read More »

Tigera extends cloud-native networking with Calico 3.30

This logging capability is exposed through two new components: Goldmane: A gRPC-based API endpoint that aggregates flow logs from Calico’s Felix component, which runs on each node. Whisker: A web-based visualization tool built with React and TypeScript that connects to the Goldmane API. The combination of these components provides detailed

Read More »

GOP Plans Billions in Oil, Gas Sales to Help Pay for Trump’s Tax Bill

House Republicans plan to raise more than $15 billion in revenue through increasing US oil, gas and coal lease sales, as well as other measures, to help pay for President Donald Trump’s massive tax cut package, according to a document seen by Bloomberg News.  The document, prepared by the House Natural Resources Committee, details plans to mandate at least four sales in the coastal plain of Alaska’s Arctic Arctic National Wildlife Refuge within the next 10 years, and resume lease sales in the National Petroleum Reserve-Alaska. Republicans also plan to resume quarterly onshore oil and gas lease sales as well as mandate new off shore leases sales, according to the document.  In addition, Republicans are planning to raise revenue through required sales of coal leases and also requiring the Forest Service to conduct timber sales, while rescinding unspecified funds for agencies like the National Oceanic and Atmospheric Administration and National Park Service.  In addition, the legislation, which is slated to receive a vote in by the committee next week, includes a measure streamlining the federal permitting process for big projects, with a goal of major environmental reviews being completed in one year.  House Republicans are aiming for a total of $2 trillion in spending reductions paired with a $4.5 trillion in reduced revenue from tax cuts. WHAT DO YOU THINK? Generated by readers, the comments included herein do not reflect the views and opinions of Rigzone. All comments are subject to editorial review. Off-topic, inappropriate or insulting comments will be removed. MORE FROM THIS AUTHOR Bloomberg

Read More »

Energy Secretary Visits Appliance Manufacturing Facility in Georgia to Mark 100 Days of Unleashing American Energy

GRIFFIN, GA— U.S. Secretary of Energy Chris Wright today visited Rinnai America Corporation’s manufacturing facility for non-condensed tankless gas water heaters in Griffin, Georgia, to celebrate the first 100 days of the Trump Administration’s efforts to unleash American energy and innovation. The visit underscored the Department of Energy’s commitment to protecting consumer freedom, defending American manufacturing jobs, and restoring American energy dominance. “President Trump was elected to bring back common sense—to get the barriers out of the way and let Americans pursue their own dreams,” said Secretary Wright. “In these first 100 days, that’s exactly what we’ve started doing. Rinnai America is a perfect example of what’s at stake when Washington pushes reckless regulations, bureaucrats tried to end hundreds of jobs with a rule no one asked for, slipped in the day after Christmas—without a single consideration for the people it would hurt. But these workers stood strong. They didn’t back down and because of their courage and hard work, we won this battle together. What they build here changes lives—millions of people choose these products to make their lives better. That’s worth fighting for. Let’s stand together for our dreams, for the American Dream.” Rinnai is the only company manufacturing non-condensing tankless water heaters in the United States—an energy-efficient technology targeted by a Biden-era rule that would have effectively banned the product and forced this facility to shutter its doors. The Department’s swift action to halt the rule has saved more than 200 Georgia jobs and preserved an affordable, high-efficiency option for American families.  “We were honored to host Secretary Wright and are grateful for the Department’s decisive actions to support American manufacturing and consumer freedom,” said Frank Windsor, President of Rinnai America Corporation. “By preserving access to high-efficient, cost-effective tankless water heaters, the Department is helping companies like ours continue

Read More »

Crude Drops Nearing 2021 Lows Amid Supply Surge

Oil slumped as OPEC+ discussed making a second major production increase, inflaming concerns about swelling global supplies that have dragged down crude prices this year. West Texas Intermediate futures fell 1.6% to settle near $58 a barrel, down more than 7% for the week, with prices holding near the lowest since early 2021. Key OPEC+ nations are considering another production increase of about 400,000 barrels a day in June ahead of a meeting the group pushed forward two days to May 3. Another aggressive supply boost from the cartel threatens to batter a market already pressured by soft Chinese demand and plentiful output from outside the group. The increase would be in line with figures previously telegraphed by the group and roughly matches last month’s shock hike, which was seen as a bid to discipline over-producing members. “OPEC’s decision framework appears to be fueled by the persistent cheating, particularly from the likes of Iraq, Kazakhstan, Russia among others,” TD Cowen strategists including Dan Ghali and Bart Melek said in a note to clients. Inventories may increase by about 200 million barrels over the next three quarters, which could drop crude toward the low $50s, they wrote. Brent prompt spread — the difference between its two nearest contracts — has narrowed to 36 cents a barrel in backwardation, compared with a gap of $1.07 four days ago. The narrowing spread signals expectations that near-term supplies will be readily available. Crude has shed about 19% this year — and briefly touched a four-year low last month — as the Trump administration’s tariffs fan concerns that energy demand will fall. The drop in prices is already showing signs of squeezing a key industry that US President Donald Trump pledged to help. Some of the biggest US shale-oil producers plan to slash about 4%

Read More »

Energy Department Lifts Regulations on Miscellaneous Gas Products

WASHINGTON— The U.S. Department of Energy (DOE) today announced the withdrawal of the determination of miscellaneous gas products as a covered consumer product under the Energy Policy and Conservation Act (EPCA). This action is yet another step toward President Trump’s pledge to lower costs for the American people by expanding choice and cutting red tape. By withdrawing this rule, DOE will exempt miscellaneous gas products—a category that includes decorative hearths and outdoor heaters—from a range of unnecessary regulations on their manufacture and sale.  “Under President Trump’s leadership, the Department of Energy is returning to common sense – and that means giving the American people the ability to choose which heaters they use in their own backyards,” U.S. Secretary of Energy Chris Wright said. “To date, rescinding or delaying unnecessary consumer regulations such as this have saved the taxpayers nearly $24 billion – and we’re just getting started.”   “Previous DOE rulemaking on this subject lumped together several products that are dissimilar in form and function, subjecting manufacturers to an awkward and unnecessary regulatory framework,” Principal Deputy Assistant Secretary for Energy Efficiency and Renewable Energy Lou Hrkman said. “By withdrawing the previous determination and repealing these unclear definitions, the Trump Administration is sending a clear signal that these markets will be allowed to thrive without fear of undue government interference.” Prior to today’s action, miscellaneous gas products were classified as covered products under Part A of Title III of the EPCA, and therefore potentially subject to burdensome standards for energy conservation. The withdrawal of this classification, along with the repeal of the definitions for “miscellaneous gas products,” “decorative hearth product,” and “outdoor heater” from the Code of Federal Regulations, will allow the market for these products to freely develop without needing to account for new conservation standards from DOE. In addition to today’s action, DOE has officially withdrawn four

Read More »

DOE Announces New Leadership to Tackle Challenges of Growing Energy Demand

WASHINGTON—The Department of Energy (DOE) today announced new leadership to tackle the challenge of strengthening and securing the U.S. energy system and ensuring America can lead the global race for AI leadership. To unleash American Energy Dominance, the systems and infrastructure that produce and deliver energy to the American people must be reliable, resilient, and secure. As energy demand continues to grow, the U.S. needs to upgrade both existing energy infrastructure and build new infrastructure – all of which must be done with resilience and security as priorities.  To advance these goals, today DOE is announcing that the Office of Cybersecurity, Energy Security, and Emergency Response (CESER) will be led by DOE Chief of Staff Alex Fitzsimmons. Carl Coe, who currently leads the Department of Government Efficiency (DOGE) at DOE, will assume the role of DOE Chief of Staff. “The race for global leadership in AI is the new Manhattan Project, and winning this race depends on our ability to increase access to abundant supplies of reliable, affordable energy and build secure infrastructure,” said U.S. Secretary of Energy Chris Wright. “The Department of Energy is focused on the need to meet growing energy demand while strengthening the resilience and security of U.S. energy infrastructure against all threats and hazards.   “Alex has served as a critical leader across the Department in our first 100 days, and his expertise and ability to take on complex problems make him the right person to spearhead this important office. I am grateful for his ongoing leadership within the Department, and I look forward to continuing to work with Carl Coe in his new role as Chief of Staff.” As Chief of Staff to the Secretary, Alex Fitzsimmons led the DOE beach-head team on day one and through the first 100 days of the Administration.

Read More »

Chevron Cuts Buybacks and Exxon Sits Tight as Oil Plunges

Chevron Corp. will reduce share buybacks this quarter after oil prices tumbled, indicating that President Donald Trump’s trade war is hurting a key US industry he pledged to help. The Houston-based company said Friday it will repurchase about $2.75 billion of stock in the second quarter, about 30% less than it bought in the first three months of the year. It comes despite Chevron beating earnings estimates on more low-cost production from Kazakhstan and the Permian Basin.  Exxon Mobil Corp., which also reported earnings Friday, is sticking to its plan to buy back about $5 billion in shares per quarter. And Shell Plc said it has the financial wherewithal to keep repurchasing upwards of $3 billion of shares each quarter even if crude plunges as low as $50 a barrel. Big Oil is finding it increasingly difficult to maintain share buybacks as Brent crude slumped 17% this year to about $62 a barrel at the close Thursday. Trump’s tariffs are poised to slow demand growth for crude and increase the cost of steel and other materials needed to produce oil and gas. At the same time, OPEC and its allies surprised markets last month with a plan to increase oil supplies more than expected later this year.  “Oil prices have changed,” Chief Financial Officer Eimear Bonner said in an interview. “The market, from a supply and demand perspective, appears to be softening.” The downturn in oil prices is starting to show the relative strength and weakness between the world’s supermajors. BP plc and Chevron cut their buybacks while Exxon, Shell and TotalEnergies SE maintained their payouts. Still, with debt levels rising across the group, it remains to be seen which is the right approach — especially if crude prices continue to decline. Brent crude futures slipped about 0.3% Friday, to

Read More »

ExtraHop looks to eliminate ‘extra hops’ in NDR stack

This deep visibility allows ExtraHop to provide insights across the entire network stack, from basic connectivity to application-level transactions. “The benefit of going all the way through Layer 7 is I can actually see a database transaction going through on the wire,” Vasani said. “If you have application teams complaining about database query latency, we can map it to what session was that tied to and what flows was it tied to from a network perspective and is this really an app server issue, or is it a network issue, or is it an endpoint issue?” The new sensor integrates with ExtraHop’s RevealX platform, feeding telemetry into the company’s cloud-scale ML/AI engine that powers its detection and analysis capabilities. “The sensor collects the telemetry, feeds it into an ML/AI engine that sits in the cloud, and then we layer in workflow engines on top to enable the various use cases,” Vasani said. In modern distributed enterprise environments, network visibility must extend beyond traditional data centers. ExtraHop’s all-in-one sensor is designed to address this reality with deployment options that span physical appliances, virtual machines and cloud environments. ExtraHop has both virtual and physical hardware appliances for sensor deployment. ExtraHop sensors can plug into a network through multiple methods including, Network Tap, SPAN (Switched Port Analyzer) port, packet broker or a cloud provider’s vTAP capabilities.

Read More »

AI’s energy appetite drives interest in nuclear power

In its new report, Deloitte said that its analysis of figures from the World Nuclear Association, the American Nuclear Society, the U.S. Department of Energy, and others showed that new nuclear power could potentially meet about 10% of the projected increase in data center demand over the next decade, assuming capacity is also significantly expanded by between 35GW and 62GW, and 30% of the expansion is earmarked for data centers. “Nuclear energy presents a potential solution for meeting some of the growing electricity demands of data centers, with its reliable and clean energy profile,” Deloitte’s report said, noting five key advantages of the technology: Reliable baseload power: Nuclear reactors operate 24/7, regardless of the weather, providing the reliable power so important to data centers. In addition, Deloitte said, “Their capacity factor, exceeding 92.5%, outperforms other sources like natural gas (56%) and renewables like wind (35%) and solar (25%).” High energy density: A small amount of fuel generates a lot of power, which minimizes the need for fuel storage and transportation. “This efficiency can translate to a smaller physical footprint and enhanced sustainability,” Deloitte said. Scalable power output: A full-sized reactor typically generates 800 megawatts (MW) or more of electricity, which accommodates the needs of large data centers. Low carbon emissions: Nuclear power plants produce virtually no greenhouse gas emissions during operation. Enhanced land use efficiency: Compared to other energy sources, nuclear power plants require relatively little land. Gartner’s Johnson echoed these advantages, and also predicted that nuclear energy, and small modular reactors (SMRs) in particular, will “provide a viable answer” to the question of what to do when electricity demand exceeds supply. They can, he said, “ensure independence from grid power fluctuations by providing dedicated on-site power for large data centers.” However, both Gartner and Deloitte also highlighted challenges in

Read More »

Nvidia AI supercluster targets agents, reasoning models on Oracle Cloud

Oracle has previously built an OCI Supercluster with 65,536 Nvidia H200 GPUs using the older Hopper GPU technology and no CPU that offers up to 260 exaflops of peak FP8 performance. According to the blog post announcing the availability, the Blackwell GPUs are available via Oracle’s public, government, and sovereign clouds, as well as in customer-owned data centers through its OCI Dedicated Region and Alloy offerings. Oracle joins a growing list of cloud providers that have made the GB200 NVL72 system available, including Google, CoreWeave and Lambda. In addition, Microsoft offers the GB200 GPUs, though they are not deployed as an NVL72 machine.

Read More »

Deep Data Center: Neoclouds as the ‘Picks and Shovels’ of the AI Gold Rush

In 1849, the discovery of gold in California ignited a frenzy, drawing prospectors from around the world in pursuit of quick fortune. While few struck it rich digging and sifting dirt, a different class of entrepreneurs quietly prospered: those who supplied the miners with the tools of the trade. From picks and shovels to tents and provisions, these providers became indispensable to the gold rush, profiting handsomely regardless of who found gold. Today, a new gold rush is underway, in pursuit of artificial intelligence. And just like the days of yore, the real fortunes may lie not in the gold itself, but in the infrastructure and equipment that enable its extraction. This is where neocloud players and chipmakers are positioned, representing themselves as the fundamental enablers of the AI revolution. Neoclouds: The Essential Tools and Implements of AI Innovation The AI boom has sparked a frenzy of innovation, investment, and competition. From generative AI applications like ChatGPT to autonomous systems and personalized recommendations, AI is rapidly transforming industries. Yet, behind every groundbreaking AI model lies an unsung hero: the infrastructure powering it. Enter neocloud providers—the specialized cloud platforms delivering the GPU horsepower that fuels AI’s meteoric rise. Let’s examine how neoclouds represent the “picks and shovels” of the AI gold rush, used for extracting the essential backbone of AI innovation. Neoclouds are emerging as indispensable players in the AI ecosystem, offering tailored solutions for compute-intensive workloads such as training large language models (LLMs) and performing high-speed inference. Unlike traditional hyperscalers (e.g., AWS, Azure, Google Cloud), which cater to a broad range of use cases, neoclouds focus exclusively on optimizing infrastructure for AI and machine learning applications. This specialization allows them to deliver superior performance at a lower cost, making them the go-to choice for startups, enterprises, and research institutions alike.

Read More »

Soluna Computing: Innovating Renewable Computing for Sustainable Data Centers

Dorothy 1A & 1B (Texas): These twin 25 MW facilities are powered by wind and serve Bitcoin hosting and mining workloads. Together, they consumed over 112,000 MWh of curtailed energy in 2024, demonstrating the impact of Soluna’s model. Dorothy 2 (Texas): Currently under construction and scheduled for energization in Q4 2025, this 48 MW site will increase Soluna’s hosting and mining capacity by 64%. Sophie (Kentucky): A 25 MW grid- and hydro-powered hosting center with a strong cost profile and consistent output. Project Grace (Texas): A 2 MW AI pilot project in development, part of Soluna’s transition into HPC and machine learning. Project Kati (Texas): With 166 MW split between Bitcoin and AI hosting, this project recently exited the Electric Reliability Council of Texas, Inc. planning phase and is expected to energize between 2025 and 2027. Project Rosa (Texas): A 187 MW flagship project co-located with wind assets, aimed at both Bitcoin and AI workloads. Land and power agreements were secured by the company in early 2025. These developments are part of the company’s broader effort to tackle both energy waste and infrastructure bottlenecks. Soluna’s behind-the-meter design enables flexibility to draw from the grid or directly from renewable sources, maximizing energy value while minimizing emissions. Competition is Fierce and a Narrower Focus Better Serves the Business In 2024, Soluna tested the waters of providing AI services via a  GPU-as-a-Service through a partnership with HPE, branded as Project Ada. The pilot aimed to rent out cloud GPUs for AI developers and LLM training. However, due to oversupply in the GPU market, delayed product rollouts (like NVIDIA’s H200), and poor demand economics, Soluna terminated the contract in March 2025. The cancellation of the contract with HPE frees up resources for Soluna to focus on what it believes the company does best: designing

Read More »

Quiet Genius at the Neutral Line: How Onics Filters Are Reshaping the Future of Data Center Power Efficiency

Why Harmonics Matter In a typical data center, nonlinear loads—like servers, UPS systems, and switch-mode power supplies—introduce harmonic distortion into the electrical system. These harmonics travel along the neutral and ground conductors, where they can increase current flow, cause overheating in transformers, and shorten the lifespan of critical power infrastructure. More subtly, they waste power through reactive losses that don’t show up on a basic utility bill, but do show up in heat, inefficiency, and increased infrastructure stress. Traditional mitigation approaches—like active harmonic filters or isolation transformers—are complex, expensive, and often require custom integration and ongoing maintenance. That’s where Onics’ solution stands out. It’s engineered as a shunt-style, low-pass filter: a passive device that sits in parallel with the circuit, quietly siphoning off problematic harmonics without interrupting operations.  The result? Lower apparent power demand, reduced electrical losses, and a quieter, more stable current environment—especially on the neutral line, where cumulative harmonic effects often peak. Behind the Numbers: Real-World Impact While the Onics filters offer a passive complement to traditional mitigation strategies, they aren’t intended to replace active harmonic filters or isolation transformers in systems that require them—they work best as a low-complexity enhancement to existing power quality designs. LoPilato says Onics has deployed its filters in mission-critical environments ranging from enterprise edge to large colos, and the data is consistent. In one example, a 6 MW data center saw a verified 9.2% reduction in energy consumption after deploying Onics filters at key electrical junctures. Another facility clocked in at 17.8% savings across its lighting and support loads, thanks in part to improved power factor and reduced transformer strain. The filters work by targeting high-frequency distortion—typically above the 3rd harmonic and up through the 35th. By passively attenuating this range, the system reduces reactive current on the neutral and helps stabilize

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »