Stay Ahead, Stay ONMINE

Chip sales are set to soar in 2025 — so long as there isn’t a trade war | Deloitte

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Semiconductor chip sales are set to soar in 2025, led by generative AI and data center build-outs, even as demand from PC and mobile markets may be weak, according to Deloitte’s 2025 chip outlook report. The […]

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More


Semiconductor chip sales are set to soar in 2025, led by generative AI and data center build-outs, even as demand from PC and mobile markets may be weak, according to Deloitte’s 2025 chip outlook report.

The semiconductor industry had a robust 2024, with expected double digit (19%) growth, and sales of $627 billion for the year. But that’s even better than the earlier forecast of $611 billion. And 2025 could be even better, with predicted sales of $697 billion, reaching a new all-time high, and well on track to reach the widely accepted aspirational goal of $1 trillion in chip sales by 2030. To get there, the chip industry only has to grow at a compound annual growth rate of 7.5% from 2025 to 2030.

Of course, all of this assumes that the U.S. doesn’t get into a massive trade war as a result of Donald Trump’s plan to place tariffs on computer chips and other semiconductors. He reaffirmed last week that he not only planned to place tariffs on China of 10%, but that he would also put them on Taiwan, where big U.S. companies like Nvidia get their chips. The Consumer Technology Association estimates that tariffs could make game consoles 40% more expensive for U.S. consumers, with a 26% price increase for smartphones and 46% price increase for laptops.

I am guessing this probably came up as Jensen Huang, CEO of Nvidia, visited Trump on Friday just ahead of the tariff announcements on Saturday. And so far, Trump has not yet placed any tariffs on Taiwan or the chip industry. But it’s a fluid situation and it’s complicated.

“When you look at the idea of tariff in general across all industries, if you put a tariff on maple syrup, it’s either maple syrup or it isn’t maple syrup, and it either comes from outside the U.S. or it doesn’t. There are many, many, many kinds of chips. They are manufactured almost always in a highly complex supply chain with bits of them going from country A to B to C, back to A, over to D,” said Duncan Stewart, TMT Center research director at Deloitte, in an interview with GamesBeat. “Given the global nature of supply chains, anything along the line of chip restrictions or tariffs, likely will have an impact and would make supply chains more complex to administer and just in general complicate them.”

Assuming the industry continues to grow at 7.5% CAGR, it could reach $2 trillion in 2040. The stock market is often a leading indicator of industry performance: As of mid-December 2024, the combined market capitalization of the top 10 global chip companies was $6.5 trillion, up 93% from $3.4 trillion in mid-December 2023 and 235% higher than the $1.9 trillion we saw in mid-November 2022. Much of the reason for that was the growth of Nvidia, the AI chip maker, in my opinion.

Are subsidies working for reshoring chip production in the U.S.?

Jeroen Kusters, U.S. semiconductor leader at Deloitte, said in an interview with GamesBeat there are a lot of investments happening on reshoring of chip factories in the U.S. The big ones are under way with companies like Intel, Globalfoundries and TSMC building factories in the U.S. Stewart said that chip executives have said that the investments so far are just the first of what should be even larger numbers in terms of the investment required to bring semiconductor manufacturing back.

“After all of the plants that are in the process of being built and started and launched, at the end of all of that, by 2032, the U.S. may be up around 14% or something. It takes time. It is an absolutely massive industry. And moving the needle from 10% to 14% is in fact a remarkably good number. It’s a sign of how hard it is to move. And it’s the same for Europe, of course,” Stewart said.

How much should the industry invest in AI?

Jeroen Kusters, U.S. semiconductor leader at Deloitte.

This is the trillion dollar questions.

Regarding the risks of smaller models working like DeepSeek and the impact on AI chip demand, Stewart said, “Various people saying AI would be $400 billion or even $500 billion in addressable market, which would be on the order of 2028 or something like that. One of the complicating factors is that there are smaller models out there and more efficient models as well as edge computing. And all of those could change the demand. In other words, you still need GenAI chips, but you need different GenAI chips. Or it could even reduce the demand for GenAI. We actually said that in our outlook as a potential risk factor.”

He added, “Without commenting on any given small model, this has all been known for some time. Somebody comes out and says I have a thing where I used to use many, many expensive chips, and I can now use either newer chips or cheaper chips — that could change the size of the GenAI chip industry. We actually anticipated that.”

He noted that while many of the large hyperscaler data center operators re saying that they are reducing their capital expenditure plans for the next quarter of the next year.

“Although there is always and will always be a threat, one of the things that in recent weeks is the idea that maybe you don’t need as much generative AI infrastructure because of more efficient, smaller models,” Stewart said. “Two weeks before that, there was some remarkable gains in GenAI programming where they did what’s called chain of reasoning. And this was like last month. Those are significantly more accurate. And in fact they can use ten, a hundred, or even a thousand times as many chips as the previous models. So I think I’m comfortable saying that given which week it is, sometimes the focus is on more efficient AI models, but at the same time are news events that make it look like we need even more chips to do better AI.”

Jeroen Kusters, U.S. semiconductor leader at Deloitte, said in an interview with GamesBeat, “I think you’ll find that in terms of demand, there will be step changes in demand. This is sort of a step change in efficiency. We all expect it, and actually we absolutely need models to become more efficient. We expected a relatively linear trend on this. Everyone knows that models are going to get more attention. What we saw now see was a step change, and it confused people a little. That’s okay. We’re going to see more of these step changes. And most of the time, what you’ll find is that, indeed, as the models get better, they will require more performance. They will require more compute. We then get more efficient.”

Stewart said, “One of the large platform companies announced that they have five million users of their generative AI tool for a specific application. And people say, oh, it’s a pilot. No, it isn’t. They have five million companies doing this weekly. The idea that we’re still entirely a group of concepts and pilots is just plain wrong. They are one of the world’s largest companies, and this is a tool that not only their customers are using, but is a thing that the company itself says drives the effectiveness of their product.”

A tale of two markets

Duncan Stewart TMT Center research director at Deloitte

That said, it is worth noting that “average” chip stock performance in the last two years has been a “tale of two markets”: companies that are involved in the generative AI chip market outperformed that average, while companies without that exposure (automotive, computer, smartphone, and communications semiconductor companies, for example) underperformed the semiconductor market average.

One driver of industry sales has been the demand for generative AI (gen AI) chips: a mix of CPUs, GPUs, data center communications chips, memory, power chips, and more. The Deloitte 2024 TMT Predictions report predicted that those gen AI chips collectively would be worth “more than” $50 billion, which was a much too conservative forecast, as the market was likely over $125 billion in 2024 – and represented over 20% of total chip sales for the year.

Last year, Deloitte estimated that AI chips would grow by a strong number, but the AI industry sailed right past those optimistic numbers and grew even bigger.

At the time of publication, Deloitte predicts that gen AI chips will be over $150 billion in 2025. Further, AMD CEO Lisa Su has moved her estimate for the total addressable market for AI accelerator chips up to $500 billion in 2028, a number which is larger than sales for the entire chip industry in 2023.

Chip demand is exploding because of AI.

In terms of end markets, after being flat at around 262 million units in 2024 over 2023, PC sales are expected to grow in 2025 by over 4% to about 273 million units. Meanwhile, smartphone sales are expected to grow at low-single digits in 2025 (and beyond) to reach an estimated 1.24 billion units in 2024 (+6.2% year-over-year growth). These two end markets are important for the semi industry: In 2023, communication and computer chip sales (which include data center chips) made up 57% of overall semiconductor sales for the year compared to auto and industrial, which accounted for only 31% of sales combined, for example.

One challenge for the industry is that while gen AI chips and associated revenues (memory, advanced packaging, communications, and more) are responsible for outsized revenues and profits, they represent a small number of very high value chips, meaning that wafer capacity—and therefore utilization—for the industry as a whole isn’t as high as it might appear. In 2023, nearly a trillion chips were sold at an average selling price of US$0.61 per chip. At a rough estimate, although gen AI chips might account for 20% of revenues in 2024, they were less than 0.2% of wafers.

Even though global chip revenues for 2024 was forecast to rise 19%, silicon wafer shipments for the year actually declined an estimated 2.4% for the year. 16 That number is expected to grow by almost 10% in 2025, fueled by demand for components and technologies used largely in gen AI chips, such as chiplets, as mentioned in the 2025 TMT Predictions report. Of course, silicon wafers are not the only kind of capacity to track: Advanced packaging is growing even faster.

As an example, some analysts estimate that TSMC’s CoWoS (chip-on-wafer-on-substrate) 2.5D advanced packaging production capacity will reach 35,000 wafers per month (wpm) in 2024 and could increase to 70,000 wpm (100% YoY) and further by 30% YoY to 90,000 wpm by end of 2026.

Further, driving innovation in the industry is not cheap. In 2015, overall chip industry average spending on R&D was 45% of its EBIT (earnings before interest and taxes), but by 2024 it was an estimated 52% of EBIT. R&D seems to be growing at a 12% CAGR, white EBIT is only growing at 10% (see figure 2).

The semiconductor outlook: R&D and EBIT growth. (figure 2)

Finally, it’s worth reminding readers that the chip industry can be notoriously cyclical. The industry has flipped from growth to shrinkage nine times in the last 34 years (figure 3). 21 So it may seem that the industry is seeing less extreme growth or shrinkage in the last 14 years, compared to the 1990-2010 period, but the frequency of contractions seems to increase. 2025 looks solid for now, it’s hard to tell what 2026 will bring.

Making investments in a resilient supply chain will make sense around the world.

“Companies with or without incentives are deciding to build new plants in new places to shorten or make resilient supply chains. This is an industry where staying on top of the ball has been a thing they’ve been doing for half a decade now. It has been a constantly shifting mix of various incentives and restrictions. That is a fairly normal thing for the semiconductor industry,” Stewart said.

Global semiconductor industry—Historic billings (Three month moving average), 1990 to 2024 October YTD. (figure 3).

These trends and others play into the 2025 semiconductor industry outlook, where the firm drills down into four big topics for the year ahead: generative AI accelerator chips for PCs and smartphones and the enterprise edge; a new “shift left” approach to chip design; the growing global talent shortage; and the need to build resilient supply chains amid escalating geopolitical tensions.

Generative AI chips in PCs, smartphones, the enterprise edge, and IoT

Many of the chips that are being used for training and inference of gen AI cost tens of thousands of dollars and are destined for large cloud data centers. In 2024 and 2025, these chips or lightweight versions of these chips are also finding homes in the enterprise edge, in computers, in smartphones, and (over time) in other edge devices such as Internet of Things (IoT) applications. To be clear, in many cases these chips are being used for either gen AI, traditional AI (machine learning) or, increasingly, a combination of both.

The enterprise edge market was already a factor in 2024, but the question in 2025 will be about smaller, cheaper, less powerful versions of these chips becoming a key part of computers and smartphones. What they lack in per-chip value, they can make up for in volume: PC sales are expected to be over 260 million units in 2025, while smartphones are expected to be over 1.24 billion units.

Sometimes the “gen AI chip” can be a standalone single piece of silicon, but more commonly it’s a few square millimeters of dedicated AI processing real estate that is tiny part of a much larger chip.

Enterprise edge: Although generative AI via the cloud will likely continue to be a dominant option for many enterprises, about half of the enterprises worldwide are predicted to add AI data center infrastructure on-premises—an example of enterprise edge computing. 23 This could be, in part, to help protect their intellectual property and sensitive data and comply with data sovereignty or other regulations, but also to help them save money.

These chips are largely the same as those found in hyperscale data centers, with server racks costing millions of dollars and requiring hundreds of kilowatts. Although smaller than hyperscale chip demand, we estimate the chips for enterprise edge server chips will likely be worth tens of billions of dollars globally in 2025.

Demand for NPU-enabled PCs. (figure 4)


Personal computers: Sales of gen AI powered PCs are predicted to be half of all PCs in 2025, 26 with some forecasts suggesting that almost all PCs will have at least some on-board gen AI processing—also known as neural processing units (NPUs)—by 2028 (see figure 4). 27 These NPU-powered machines are expected to command a price premium of 10-15%, but it’s important to note that not all gen AI PCs are equal.

There’s a dividing line at the 40 TOPS (trillion operations per second) level, following a recommendation from major PC ecosystem companies that only computers with more than TOPS be considered true AI-enabled PCs. 29 As at the time of writing, some buyers are cautious about the new PCs, either unwilling to pay the premium, or waiting until more powerful gen AI NPUs are introduced in the back half of 2025.

As of December 2024, many of the installed base of PCs were running on x86 CPUs, with the balance being on CPUs based on the Arm architecture. MediaTek, Microsoft, and Qualcomm announced in 2024 that they would make Arm-powered PCs, specifically gen AI PCs. It’s unclear how successful these achines will be in the next 12 months, but it will likely be a key issue for the various chipmakers, with Qualcomm predicting it will sell $4 billion worth of PC chips annually by 2029.

Smartphones: Where PC NPUs might be worth tens of dollars in value, smartphone equivalent gen AI chips may be worth much less, and Deloitte estimated under $1 worth of silicon on next generation smartphone processors. Even though the smartphone market is over a billion units sold annually, and even though we predict gen AI smartphones will be 30% of phones sold in 2025, the semiconductor impact is likely smaller than PCs in dollar terms. Instead, an interesting angle for chipmakers could be to see if consumers are excited enough about new gen AI phones and features to shorten the replacement cycle. Consumers have been keeping phones longer before upgrading, and sales have been flat for
years now. 35 If gen AI enthusiasm causes an uptick in smartphone sales, that could benefit all kinds of chip companies, not just those that make the gen AI chips themselves.

IoT: A gen AI chip in a data center might cost $30,000. A gen AI chip on a PC might cost $30. A gen AI chip on a smartphone might be $3. For gen AI chips to work on the low-cost Internet of Things market, they should cost about $0.30. That’s unlikely to happen anytime soon, but with the possibility of tens of billions of IoT endpoints needing AI processors, this is a market to watch for the longer term.

“As good as Gen AI is,” Stewart said, other categories like PCs and smartphones are up a little or mostly flat, and automotive is actually down from a year ago.

“It was the best of times, it was the worst of times,” Stewart said. “Sometimes that is true, even when there are pockets of enormous growth in the semiconductor industry. It’s really important to remember there are other kinds of chips that are not growing at the same level. To some extent, the growth in GenAI is a spectacular success story, but it is masking some pockets of weakness out there in other parts of semiconductor manufacturing. And we just think it’s really important to remind people about that, because as an industry, there are companies that make GenAI chips and don’t make the other kinds and then there are companies that make the slower growing ones and aren’t benefiting from AI.”

As far as strategic questions for the industry go, Deloitte asked, “Although gen AI chips for data centers are in demand now, given their importance to industry growth, are there any signs that demand is weakening, or that processing is moving away from data centers to edge devices?”

Chip design ‘shifts left’ and calls for a greater collaboration across the industry

Deloitte predicted that, by 2023, AI would emerge as a powerful aid to human semiconductor engineers, assisting them on extreme complex chip design processes, and enabling them to find ways to improve and optimize PPA (power, performance and area). As of 2024, gen AI has enabled rapid iterations to enhance existing designs and discover entirely new ones and can do it in less time.

In 2025, there will likely be more emphasis towards ‘shift left’—an approach to chip design and development where testing, verification, and validation are moved up earlier in the chip design and development process — as optimization strategies could evolve from simple PPA metrics to system-level metrics like performance per watt, FLOPs per watt (FLOPs denotes floating point operations per second), and thermal factors. And the combination of advanced AI capabilities—graph neural networks (GNN) and reinforcement learning (RL)—will likely continue to help design chips that are more power-efficient than typical chips produced by human engineers.

Domain-specific and specialized chips are expected to continue to gain prominence over general-purpose ones, as several industries (such as automotive) and certain AI workloads would require customized approaches to designing chips. However, a widespread adoption of application-specific integrated circuits (or ASICs) remains less clear, as the development and maintenance of such hardware can be costly and could divert focus from other AI advancements. But here’s where gen AI tools can allow companies to design more specialized and competitive products including custom silicon.

3D ICs and heterogeneous architectures are introducing challenges related to arranging, assembling, validating, and testing the various chiplets, which can sometimes be pre-assembled. This shift towards system design over individual product design can incorporate software and digital twins early on—stressing the importance of early and frequent testing.

By 2025, synchronizing hardware, system, and software development upstream in the process will likely help redefine future system engineering and enhance overall efficiency, quality, and time-to-market.

To evolve and keep pace with the changing face of design, the industry may want to consider new ways to handle the complex design processes. Already, the chip industry is exploring digital twins to emulate and visualize complex design processes step-by-step, including the ability to move around or swap chiplets to measure and assess performance of a multi-chiplet system. And digital twins could increasingly be used to give a visual representation (via 3D modeling) of the physical end-device or the system to assist with all aspects of design, including mechanical as well as electrical (software and hardware).

Designers should work with EDA (electronic design automation) and other hi-tech CAD/CAE (computer-aided design/computer-aided engineering) companies to strengthen design, simulation, and verification and validation tools and capabilities for hybrid and complex heterogenous systems. And they also should consider using and adapting model-based system engineering (MBSE) tools as part of the broader EDA ‘shift left’ approach.

As design and software are expected to play crucial roles in the development of next-generation advanced chip products, bolstering cyber defense becomes more important, heading into 2025. To help align with shift left approach, chip designers should integrate security and safety testing early in the chip design process. They should implement redundancy and error correction and detection mechanisms to help ensure that systems can continue to operate even when some of the components fail, and hardware-based security features such as secure boot mechanisms and encryption engines.

Deloitte said among the strategic questions to consider: As AI in chip design becomes more prevalent and common and EDA becomes more and more AI-enabled, how can the industry proactively ensure trust and transparency in the complex design process by always keeping human engineers in the loop and giving them a major role in the overall process?

The intensifying talent challenges in semiconductor industry

A skill gap in chips looms. (figure 5)

In Deloitte’s 2023 Semiconductor industry outlook report, the firm wrote that the industry needs to add a million skilled workers by 2030, or more than 100,000 every year. Two years after, not only does that forecast hold good, but the talent challenge is expected to intensify further in 2025. Globally, countries are not producing enough skilled talent to meet their workforce needs.


From core engineering to chip design and manufacturing, operations, and maintenance, AI may help alleviate some engineering talent shortages, but the skill gap looms (see figure 5). Attracting and retaining talent will likely continue to be a challenge for many organizations in 2025, and a big part of the problem is an aging workforce, which is more prominent in the United States and even Europe. Add the complex geopolitical landscape and supply chain fragility to this equation, and it becomes clear that the availability of talent supply is under stress globally. With onshoring and reshoring of fabrication, assembly, and test in the US and Europe, there will likely be pressure on chip companies and foundries as they source more of the talent locally in 2025.

For example, talent challenges are contributing to delays in opening new plants. On a related note, “friendshoring” (collaborating with companies from countries considered to be allies) can provide stability and resilience to supply chains, especially for the United States and European Union. But it also demands scouting for the right skills to help meet new capacity demands and talent roles in destinations such as Malaysia, India, Japan, and Poland.

Chip companies can’t continue to wrestle over the same finite talent pool and still expect to match up to the industry’s pace of technological advancement and capacity expansion. So, what can semiconductor companies do in 2025 to address the talent conundrum?

To help attract AI and chip talent, chip companies should consider offering a sense of trust, stability, and projected market growth; with this, they can help make the industry more appealing to recent high school grads and fresh entrants to help reinvigorate talent pipelines.

Countries aiming to benefit from their respective domestic chips acts should consider weaving in strategic goals and aspects related to workforce development and activation. Some examples could include training programs, expanded vocational and professional education, and employment opportunities that their local chip companies would commit to receive funding. Semi companies should consider collaborating with educational institutions (high schools, technical colleges, and universities) and local government organizations to leverage chip funds to develop and curate targeted workforce training and development programs aligned with specific industry needs in the region.

Semi companies should design flexible upskilling and reskilling programs for career path flexibility to help address future workforce skills and gaps. Additionally, they should implement and leverage advanced tech and AI-based tools to assess diverse talent related factors such as supply, demand, and current and projected spend, to perform complex workforce scenario modeling to support strategic talent decision-making.

Deloitte said among the strategic questions to consider: How should the workforce be characterized and segmented based on specialization areas, for example, design and IP, and manufacturing, operator, engineering, and technical roles? And how can the industry customize talent sourcing and skill development strategies based on these roles, as well as based on specific geographic regions where hiring takes place?

Stewart said one thing that could hold back the reshoring of the chip industry in the U.S. is a big talent shortage. But he noted that talent shortage is global as every country is struggling to find enough people. That means retraining and research investments have to be made in order to keep the growth going.

Building resilient supply chains amid geopolitical tensions

Deloitte 2024 Semiconductor Outlook report already talked about geopolitical tensions in depth, so what’s new for 2025?

The same…but even more. As one example, in December of 2024 the outgoing administration issued a new list of US export restrictions mainly still focused on advanced nodes (despite some speculation that restrictions might be broadened to include some relatively less advanced nodes). These restrictions now include additional separate categories around advanced inspection and metrology. Additionally, many (over 100) new entities (mainly Chinese) have been added to the restricted entity list.

As part of these restrictions, the US seems to be adopting the “small yard, high fence” approach toward semiconductor export restrictions. This aims to impose a high level of restrictions on a relatively small subset of chip technologies with a focus on those that defense, including advanced weapon systems, and advanced AI used in military applications.

The new restrictions (if implemented by the new administration) go on to flag that AI advancements are increasingly being viewed as matters of national security. The day after those new restrictions, China announced further restrictions on the export of gallium and germanium (as well as other materials), both key for the manufacture of multiple semiconductors.

As Deloitte predicted in 2024, ongoing materials restrictions will likely pose a challenge for the chip industry, but also an imperative for the industry to do more recycling of e-waste. In mid-January of 2025, the outgoing administration announced Interim Final Rule on AI Technology Diffusion. The Interim Final Rule will impose new controls for chip exports.

At time of writing, it is unknown whether the incoming administration will roll back the December and January restrictions, modify them, or even propose additional restrictions.

Additionally, the new administration has proposed increasing its use of tariffs, including tariffs on goods from China, Mexico, and Canada. Given the global nature of most semi supply chains, the proposed new AI related chip export controls (by the outgoing administration) and the planned higher tariffs would likely have an impact and could make supply chains more complex to administer, shifting profits, costs, and more. And the impact could be felt across the supply chain – including R&D and manufacturing – as well as how industry policies are shaped across countries and regions.

Of course, there are additional geopolitical risks or changes: Conflicts in Ukraine/Russia and the Middle-East continue, potentially affecting semiconductor manufacturing, supply chains, and critical raw materials. But the chip industry has other vulnerable points: the December martial law order in South Korea highlighted the global supply chain dependency and concentration of certain types of semiconductors, especially in the most advanced technologies.

As an example of concentration, almost 75% of DRAM memory chips globally are made in South Korea. It’s not just geopolitics that can interrupt key materials: 2024’s Hurricane Helene briefly shut down two mines in North Carolina that are sources for nearly all of the world’s ultra-high purity quartz, essential for making the crucibles which are a key part of the chipmaking process. With hurricanes, typhoons, and other extreme weather events projected to become more frequent and intense due to climate change, expanding the sources for key materials is likely to continue to be a supply chain priority.

It is worth noting that, as of late 2024, a key part of the export restrictions from the United States and allies is having an effect: The restrictions around extreme ultraviolet (EUV) lithography machines seem to be posing a barrier, preventing Chinese companies from making advanced node chips at scale and with acceptable yields. Although there are 7 nm and 6 nm chips being made in limited numbers using older deep ultraviolet (DUV) technology, the volumes are low, yields are uneconomical, and that situation is expected to persist at least until 2026.

To be clear, semiconductor supply chains worked well in 2024, even as the industry grew by almost 20%. At this time, there’s no reason to believe 2025 supply chains will be less resilient, but as always, the risk is there. And given how important gen AI chips are expected to be in 2025 and beyond (up to 50% of sales, perhaps 76 ) and the relatively higher concentration of processor, memory, and packaging required for cutting-edge chips, the industry may be more vulnerable to supply chain disruptions than ever before. Although the industry is likely to become less concentrated geographically thanks to the various chips acts – and initiatives like onshoring, re-shoring, near shoring, and friendshoring are all still in their early days – the industry remains highly vulnerable for the next year or two, at least.

Deloitte said that among the strategic questions to consider was, “Given the fluid geopolitical environment and escalating export restrictions, what should be the mix of reshoring vs. offshoring? And how should the industry factor potential disruptions to any existing supply chain channel partner relationships in erstwhile friendly countries and allies, aka friendshoring?”

Signposts for the future

For 2025, semiconductor industry executives should be mindful of the following signposts:

  1. There is currently a mismatch between very high spending on semiconductors for gen AI, and companies being able to monetize their gen AI offerings. For 2025, the argument of “the risk of underinvesting is greater than the risk of overinvesting” seems to be still dominant, but if that attitude shifts, demand for gen AI chips could become weaker than predicted.
  2. Competition from agile chip startups could intensify, challenging incumbents in the broader semiconductor industry. Notably, AI chip startups secured a cumulative $7.6 billion in venture capital funding globally during Q2, Q3 and Q4 of 2024, and several of these startups offer specialized solutions including customizable RISC-V-based applications, chiplets, LLM inference chips, photonic ICs, chip design, and chip equipment.
  3. With interest rates in the United States and other major markets likely to drop further, a favorable credit environment could act as a tailwind for the chip industry’s M&A scene, which has already seen an uptick.
  4. Moreover, with two different chip markets evolving—one for AI chips and one for all other types of
    chips—the industry may experience M&A and consolidation, especially if companies with valuable IP lag their peers and are seen as attractive targets. Nonetheless, potential tighter regulations and trade conflicts, globally, could potentially dampen the deal environment.
  5. As geopolitical challenges ripple across the globe, chip companies should brace themselves for further disruptions. Traditional channel partner models and allyships could get upended, even as reshoring, friendshoring, and nearshoring have gained momentum. Prolonging regional conflicts and wars could further impact the flow of vital materials and inventories. All of these could disrupt semi companies’ demand planning, requiring them to be more agile and adapt supply chain and sourcing contracts, and pricing terms.
  6. A significant part of capex spending and revenues was driven by AI and the advanced wafers needed to produce those highly advanced AI chips. However, wafer demand from auto, industrial, and consumer segments continue to be lackluster, while there’s some uptick in demand from mobile handset and other consumer products.

Through 2025 and 2026, though the overall revenue and capex seem to continue trending upward (at least for the next nine to 12 months), any downward movement in AI-related spending and components shortage could have an adverse impact ripping through the broader global semiconductor and electronics supply chain.

The pandemic caused havoc in the supply chain for a couple of years. But Stewart said good things have happened to global semiconductor supply chains in the last four years. They are more resilient than they were last time, he said.

“And every single semiconductor supply chain expert in the world says that this is a good thing and we can continue to do it. And that there will also be another supply chain interruption and shortage that has severe problems at some point within the reasonable future,” he said. “You can make the supply chain more resilient. You cannot make it bulletproof. Things happen. There are droughts and fights and trade wars and restrictions and all of the different things that the economy [deals with.]”

Kusters noted that making the supply chain more resilient doesn’t happen overnight. That resilience is still building but it isn’t completely finished yet.

“We have talked about making more resilient supply chains very much a kind of incentive-based program, like the European Chips Act would be an example. We will build this plant in Poland because we don’t have one in Europe, and we need that, and it’s very directive. One of the things that we might see with restrictions and tariffs and other forms of things that shape supply chains is those might end up making different kinds of outcomes than you would have gotten [otherwise],” Stewart said.

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

AI-Powered Policing: The Future of Traffic Safety in Kazakhstan

Traffic management is a growing challenge for cities worldwide, requiring a balance between enforcement, efficiency, and public trust. In Kazakhstan, the Qorgau system is redefining road safety through an innovative fusion of artificial intelligence (AI), computer vision, and mobile technology. Designed to assist traffic police in real-time violation detection and

Read More »

Quantum networking advances on Earth and in space

“Currently, the U.S. government is not investing in such testbeds or demonstrations, ensuring it will be a follower and not a leader in the development of technical advances in the field,” said a report released last year by the Quantum Economic Development Consortium, a global association of more than 250

Read More »

Who wins in networking in 2025 and beyond?

In this third team, we also find vendors most interested and active in the IoT opportunity, two of which are usually not considered enterprise vendors at all: Ericsson and Nokia. Both these companies are actively pursuing new IoT strategies, and while they are surely hoping that broad public-sensor IoT will

Read More »

Dorian LPG Q4 Profit Slashed 79 Percent as Shipping Rates Down

Dorian LPG Ltd. has reported $21.36 million in net income for the fourth quarter of 2024 (third quarter of fiscal year 2025), down 78.63 percent compared to the same three-month period 2023 as freight rates fell. The Stamford, Connecticut-based owner and operator of very large gas carriers (VLGCs) logged $18.5 million in adjusted net profit, or $0.43 per diluted share, according to results it published online. That was down from $106 million for the fourth quarter of 2023. The adjusted net earnings per share figure missed the $0.56 Zacks Consensus Estimate, an average of projections by brokerage analysts. “The $87.5 million decrease in adjusted net income for the three months ended December 31, 2024, compared to the three months ended December 31, 2023, is primarily attributable to (i) a decrease of $82.4 million in revenues; (ii) increases of $2.2 million in charter hire expenses, $2.2 million in vessel operating expenses, $0.2 million in voyage expenses, and $0.1 million in depreciation and amortization expenses; and (iii) decreases of $1.1 million in realized gain on derivatives and $1.6 million in other gain/(loss), net, partially offset by (i) decreases of $1.2 million in interest and finance costs and $0.2 million in general and administrative expenses and (ii) an increase of $0.9 million in interest income”, Dorian LPG said. Revenues for the October-December 2024 period totaled $80.67 million, compared to $163.06 million for the comparable period in the prior year. Dorian LPG’s time charter equivalent rate per available day in the fourth quarter of 2024 dropped 49.86 percent year-on-year to $36,071. Available days decreased from 2,256 to 2,210. “Weaker import demand from China, driven in part by lower steam cracking demand, resulted in a decline in LPG imports from high levels of 3.5 MMT [million metric tons] in July 2024 to 2.3 MMT in

Read More »

Trump Signs Memorandum ‘Restoring Maximum Pressure’ on Iran

A fact sheet posted on the White House website on Tuesday stated that U.S. President Donald J. Trump signed a National Security Presidential Memorandum (NSPM) “restoring maximum pressure on the government of the Islamic Republic of Iran”. “The NSPM directs the Secretary of the Treasury to impose maximum economic pressure on the Government of Iran, including by sanctioning or imposing enforcement mechanisms on those acting in violation of existing sanctions,” the fact sheet noted. “The Secretary of State will also modify or rescind existing sanctions waivers and cooperate with the Secretary of Treasury to implement a campaign aimed at driving Iran’s oil exports to zero,” it went on to state. Rigzone has contacted Iran’s ministry of foreign affairs for comment on the fact sheet. At the time of writing, the ministry has not yet responded to Rigzone’s request. In a report sent to Rigzone by the Skandinaviska Enskilda Banken AB (SEB) team on Wednesday morning, Bjarne Schieldrop, the chief commodities analyst at the company, said Brent “turned higher yesterday as Trump ramps up pressure on Iran” but added that it was “slightly lower this morning”. “Brent traded as low as $74.15 per barrel (-2.4 percent) yesterday but managed to close with a gain of 0.3 percent at $76.2 per barrel,” Schieldrop highlighted in the report. “The almost linear downward trend since the recent peak in mid-January seems to have faded a bit with price action now a little more sideways it seems,” he added. In the report, Schieldrop pointed out that, on Tuesday, “Trump signed actions for harder pressure on Iran with the potential to drive its exports significantly lower”. “That Trump would try to drive Iranian oil exports lower has been our expectation all along,” he said in the report. “The oil market is now caught between increasing fears

Read More »

Orsted, PGE Make FID on 1.5-GW Wind Farm Near Poland

Danish wind developer Ørsted A/S and PGE Polska Grupa Energetyczna S.A. (PGE) have made a final investment decision (FID) on the Baltica 2 Offshore Wind Farm, planned to have a capacity of 1.5 gigawatts (GW). The project will be built, owned and operated in a 50/50 partnership between the two firms. Baltica 2, which will be located approximately 25 miles (40 kilometers) off the Polish coast near Ustka, is expected to be fully commissioned in 2027, Orsted said in a news release. Baltica 2 has a 25-year inflation-protected contract for difference (CfD) in place with the Polish state. The wind farm has obtained all permits and has signed a grid connection contract with the Polish transmission system operator PSE, according to the release. The wind farm will use the Port of Gdansk for the storage, pre-assembly, and offshore installation of wind turbine components. All major component and vessel contracts for Baltica 2 have been signed, locking in the majority of the project’s capital expenditures, “which significantly de-risks the project,” Orsted noted. According to a separate report from PGE, the total budget for the project, including capital expenditures in development and construction, is estimated at approximately $7.41 billion (PLN 30 billion). Newly appointed Orsted CEO Rasmus Errboe said, “With today’s announcement, we’re ready to build Baltica 2, a flagship project for offshore wind in Poland. We’re satisfied with the value creation of the project, which has an attractive risk-reward profile. I wish to thank the Polish government for its support, and I want to thank our partner PGE for working with us to reach this moment. Together, we’re writing a new chapter in the Polish energy sector, and we’re setting up an industry that will bring jobs and industrial development to Poland for decades to come”. PGE CEO Dariusz Marzec said,

Read More »

Moomba CCS in Australia on Track to Achieve Declared Work Rate, Says Santos

The Moomba carbon capture and storage (CCS) project onshore South Australia stored 340,000 metric tons of carbon dioxide equivalent (CO2e) at yearend after starting service October 2024, operator Santos Ltd. has said, touting the facility as a showcase of Australia’s potential for the technology. “The technology and reservoir is [sic] performing as expected, putting Moomba CCS on track to safely and permanently sequester up to 1.7 million tonnes per annum of CO2e, depending on CO2 availability”, the local gas and oil company said in an online statement. A recent analysis by the Institute for Energy Economics and Financial Analysis (IEEFA) of another Australian CCS project found underperformance and cast doubt about the technology’s viability for abating emissions. The Chevron Corp.-led Gorgon CCS injection system captured, in the last Australian fiscal year (July 2023-June 2024), just 30 percent of the CO2 emitted from natural gas extraction by the Gorgon LNG and domestic gas project, IEEFA reported November 28, 2024. Gorgon CCS had been approved on the condition it captures, on a five-year rolling average from 2016, at least 80 percent of CO2 emissions from wells drilled for the gas facility, according to information published online by Chevron Australia Pty. Ltd. Santos assured its project “is delivering immediate and real large-scale emissions reduction for the company and for Australia at a very competitive lifecycle cost”. “The project is providing a real confidence boost for the potential of CCS technology to help Australia reach net zero and decarbonize faster, at scale and affordably”, the Adelaide-based exploration and production company added. At a full injection rate, Moomba CCS avoids more CO2 in four days than what 10,000 electric vehicles save in one year, according to Santos. “And in just one year, Moomba CCS will achieve around 28 percent of the total emissions reduction achieved by

Read More »

SSE on track despite stormy weather

SSE (LON: SSE) has upped its energy production across its portfolio of wind, gas and coal power through the most recent quarter affected by Storm Eowyn. The Perth-headquartered firm “good operational performance against variable weather conditions” in an update on its third quarter. It added operating profit expectations across its business units “remain unchanged” albeit it cautioned full year performance remains subject to a number of factors, including more weather. Generation output from its SSE Renewables division increased 26% in first nine months to the end of December compared to same period in prior year, SSE said. It added its “renewables fleet continue to experience periods of variable weather conditions” in January when Storm Eowyn hit, which the Met Office has described as “the UK’s most powerful windstorm for over a decade“. With its growing portfolio of investments in onshore and offshore wind in the UK, it said it’s massive Dogger Bank wind farm was still expected to complete in the second half of 2025.  SSE has a 40% stake in the project alongside Equinor (OSL: EQNR) 40% and Eni (IT: ENI) 20%. It added a second vessel has been reserved for the project from 2026 to support turbine installation across the second and third phases of the project. © Supplied by ProservProserv’s holistic cable monitoring system has been deployed at Dogger Bank wind farm. It also reported it has achieved first power at its 101MW Yellow River onshore wind farm that it has made a financial investment decision (FID) in its 208MW Strathy South onshore wind farm. It said it’s SSEN Transmission business, in which it holds a 75% stake along with Ontario Teachers’ Pension Plan Board which owns 25%, published its “bold blueprint to deliver at least £22 billion of critical grid infrastructure in the five years to 2031”.

Read More »

TriMas Completes Arrow Engine Sale

TriMas Corp., a Michigan-based manufacturer of engineered products, has completed the sale of its Arrow Engine business to IES Infrastructure Solutions LLC, a division of IES Holdings Inc. TriMas confirmed in a media release that the sale marks the end of its direct presence in the oil and gas market. “Founded in 1955, Arrow Engine was a legacy TriMas business and part of the TriMas portfolio for several decades”, Thomas Amato, TriMas President and Chief Executive Officer, said. “We are pleased to place this business with IES Infrastructure Solutions, the right partner to take Arrow Engine to the next level. This move is another important step in optimizing TriMas’ business portfolio”. Arrow Engine supplies natural gas engines commonly used in remote applications, including oil field pump jacks and compressors. These engines and replacement parts are designed for use in oil and natural gas production, as well as other industrial sectors. Arrow Engine distributes its products globally, with a primary focus on the United States and Canada, according to TriMas. Arrow Engine manufactures its engine line and offers a wide array of spare parts for various industrial engines, even those not produced by Arrow Engine, TriMas said. With estimated 2024 revenues of approximately $20 million, Arrow Engine will operate within IES’ Infrastructure Solutions segment and retain its existing brand name, TriMas said. “We remain committed to ensuring a smooth transition to IES Infrastructure Solutions while continuing to deliver the highest level of service to Arrow Engine’s customers”, said Amato. TriMas selected Holland & Knight as its outside legal counsel, while Woodward Park Partners acted as exclusive financial advisor and led the sale process. The company added that as a result of the sale, the Specialty Products business will include only Norris Cylinder’s financial performance in 2025, with just one month of Arrow

Read More »

Linux containers in 2025 and beyond

The upcoming years will also bring about an increase in the use of standard container practices, such as the Open Container Initiative (OCI) standard, container registries, signing, testing, and GitOps workflows used for application development to build Linux systems. We’re also likely see a significant rise in the use of bootable containers, which are self-contained images that can boot directly into an operating system or application environment. Cloud platforms are often the primary platform for AI experimentation and container development because of their scalability and flexibility along the integration of both AI and ML services. They’re giving birth to many significant changes in the way we process data. With data centers worldwide, cloud platforms also ensure low-latency access and regional compliance for AI applications. As we move ahead, development teams will be able to collaborate more easily through shared development environments and efficient data storage.

Read More »

Let’s Go Build Some Data Centers: PowerHouse Drives Hyperscale and AI Infrastructure Across North America

PowerHouse Data Centers, a leading developer and builder of next-generation hyperscale data centers and a division of American Real Estate Partners (AREP), is making significant strides in expanding its footprint across North America, initiating several key projects and partnerships as 2025 begins.  The new developments underscore the company’s commitment to advancing digital infrastructure to meet the growing demands of hyperscale and AI-driven applications. Let’s take a closer look at some of PowerHouse Data Centers’ most recent announcements. Quantum Connect: Bridging the AI Infrastructure Gap in Ashburn On January 17, PowerHouse Data Centers announced a collaboration with Quantum Connect to develop Ashburn’s first fiber hub specifically designed for AI and high-density workloads. This facility is set to provide 20 MW of critical power, with initial availability slated for late 2026.  Strategically located in Northern Virginia’s Data Center Alley, Quantum Connect aims to offer scalable, high-density colocation solutions, featuring rack densities of up to 30kW to support modern workloads such as AI inference, edge caching, and regional compute integration. Quantum Connect said it currently has 1-3 MW private suites available for businesses seeking high-performance infrastructure that bridges the gap between retail colocation and hyperscale facilities. “Quantum Connect redefines what Ashburn’s data center market can deliver for businesses caught in the middle—those too large for retail colocation yet underserved by hyperscale environments,” said Matt Monaco, Senior Vice President at PowerHouse Data Centers. “We’re providing high-performance solutions for tenants with demanding needs but without hyperscale budgets.” Anchored by 130 miles of private conduit and 2,500 fiber pathways, Quantum Connect’s infrastructure offers tenants direct, short-hop connections to adjacent facilities and carrier networks.  With 14 campus entrances and secure, concrete-encased duct banks, the partners said the new facility minimizes downtime risks and reduces operational costs by eliminating the need for new optics or extended fiber runs.

Read More »

Blue Owl Swoops In As Major Backer of New, High-Profile, Sustainable U.S. Data Center Construction

With the global demand for data centers continuing to surge ahead, fueled by the proliferation of artificial intelligence (AI), cloud computing, and digital services, it is unsurprising that we are seeing aggressive investment strategies, beyond those of the existing hyperscalers. One of the dynamic players in this market is Blue Owl Capital, a leading asset management firm that has made significant strides in the data center sector. Back in October 2024 we reported on its acquisition of IPI Partners, a digital infrastructure fund manager, for approximately $1 billion. This acquisition added over $11 billion to the assets Blue Owl manages and focused specifically on digital infrastructure initiatives. This acquisition was completed as of January 5, 2025 and IPI’s Managing Partner, Matt A’Hearn has been appointed Head of Blue Owl’s digital infrastructure strategy. A Key Player In Digital Infrastructure and Data Centers With multi-billion-dollar joint ventures and financing initiatives, Blue Owl is positioning itself as a key player in the digital infrastructure space. The company investments in data centers, the implications of its strategic moves, and the broader impact on the AI and digital economy highlights the importance of investment in the data center to the economy overall. With the rapid growth of the data center industry, it is unsurprising that aggressive investment fund management is seeing it as an opportunity. Analysts continue to emphasize that the global data center market is expected to grow at a compound annual growth rate (CAGR) of 10.2% from 2023 to 2030, reaching $517.17 billion by the end of the decade. In this rapidly evolving landscape, Blue Owl Capital has emerged as a significant contributor. The firm’s investments in data centers are not just about capitalizing on current trends but also about shaping the future of digital infrastructure. Spreading the Wealth In August 2024, Blue Owl

Read More »

Global Data Center Operator Telehouse Launches Liquid Cooling Lab in the UK to Meet Ongoing AI and HPC Demand

@import url(‘/fonts/fira_sans.css’); a { color: #0074c7; } .ebm-page__main h1, .ebm-page__main h2, .ebm-page__main h3, .ebm-page__main h4, .ebm-page__main h5, .ebm-page__main h6 { font-family: “Fira Sans”, Arial, sans-serif; } body { letter-spacing: 0.025em; font-family: “Fira Sans”, Arial, sans-serif; } button, .ebm-button-wrapper { font-family: “Fira Sans”, Arial, sans-serif; } .label-style { text-transform: uppercase; color: var(–color-grey); font-weight: 600; font-size: 0.75rem; } .caption-style { font-size: 0.75rem; opacity: .6; } #onetrust-pc-sdk [id*=btn-handler], #onetrust-pc-sdk [class*=btn-handler] { background-color: #005ea0 !important; border-color: #005ea0 !important; } #onetrust-policy a, #onetrust-pc-sdk a, #ot-pc-content a { color: #005ea0 !important; } #onetrust-consent-sdk #onetrust-pc-sdk .ot-active-menu { border-color: #005ea0 !important; } #onetrust-consent-sdk #onetrust-accept-btn-handler, #onetrust-banner-sdk #onetrust-reject-all-handler, #onetrust-consent-sdk #onetrust-pc-btn-handler.cookie-setting-link { background-color: #005ea0 !important; border-color: #005ea0 !important; } #onetrust-consent-sdk .onetrust-pc-btn-handler { color: #005ea0 !important; border-color: #005ea0 !important; background-color: undefined !important; } Starting in early 2025, Telehouse International Corporation of Europe will offer an advanced liquid cooling lab at their newest data center, Telehouse South at the London Docklands campus in Blackwall Yard. Telehouse has partnered with four leading liquid-cooling technology vendors — Accelsius, JetCool, Legrand, and EkkoSense — to allow customers to explore different cooling technologies and management tools while evaluating suitability for their use in the customer applications. Dr. Stu Redshaw, Chief Technology and Innovation Officer at EkkoSense, said about the project: Given that it’s not possible to run completely liquid-cooled data centers, the reality for most data center operators is that liquid cooling and air cooling will have an important role to play in the cooling mix – most likely as part of an evolving hybrid cooling approach. However, key engineering questions need answering before simply deploying liquid cooling – including establishing the exact blend of air and liquid cooling technologies you’ll need. And also recognizing the complexity of managing the operation of a hybrid air cooling and liquid cooling approach within the same room. This increases the

Read More »

Flexential Partners with Lonestar to Support First Lunar Data Center

Flexential, a leading provider of secure and flexible data center solutions, this month announced that it has joined forces with Lonestar Data Holdings Inc. to support the upcoming launch of Freedom, Lonestar’s second lunar data center. Scheduled to launch aboard a SpaceX Falcon 9 rocket via Intuitive Machines, this mission is a critical step toward establishing a permanent data center on the Moon. Ground-Based Support for Lunar Data Storage Flexential’s Tampa data center will serve as the mission control platform for Lonestar’s lunar operations, providing colocation, interconnection, and professional services. The facility was chosen for its proximity to Florida’s Space Coast launch operations and its ability to deliver low-latency connectivity for critical functions. Flexential operates two data centers in Tampa and four in Florida as part of its FlexAnywhere® Platform, comprising more than 40 facilities across the U.S. “Flexential’s partnership with Lonestar represents our commitment to advancing data center capabilities beyond conventional boundaries,” said Jason Carolan, Chief Innovation Officer at Flexential. “By supporting Lonestar’s space-based data center initiative, we are helping to create new possibilities for data storage and disaster recovery. This project demonstrates how innovative data center expertise can help organizations prepare for a resilient future with off-world storage solutions.” A New Era of Space-Based Resiliency The growing demand for data center capacity, with U.S. power consumption expected to double from 17 GW in 2022 to 35 GW by 2030 (according to McKinsey & Company), is driving interest in space-based solutions. Storing data off-planet reduces reliance on terrestrial resources while enhancing security against natural disasters, warfare, and cyber threats. The Freedom data center will provide resiliency, disaster recovery, and edge processing services for government and enterprise customers requiring the highest levels of data protection. The solar-powered data center leverages Solid-State Drives (SSDs) and a Field Programmable Gate Array (FPGA) edge

Read More »

Why DeepSeek Is Great for AI and HPC and Maybe No Big Deal for Data Centers

In the rapid and ever-evolving landscape of artificial intelligence (AI) and high-performance computing (HPC), the emergence of DeepSeek’s R1 model has sent ripples across industries. DeepSeek has been the data center industry’s topic of the week, for sure. The Chinese AI app surged to the top of US app store leaderboards last weekend, sparking a global selloff in technology shares Monday morning.  But while some analysts predict a transformative impact within the industry, a closer examination suggests that, for data centers at large, the furor over DeepSeek might ultimately be much ado about nothing. DeepSeek’s Breakthrough in AI and HPC DeepSeek, a Chinese AI startup, this month unveiled its R1 model, claiming performance on par with, or even surpassing, leading models like OpenAI’s ChatGPT-4 and Anthropic’s Claude-3.5-Sonnet. Remarkably, DeepSeek developed this model at a fraction of the cost typically associated with such advancements, utilizing a cluster of 256 server nodes equipped with 2,048 GPUs. This efficiency has been attributed to innovative techniques and optimized resource utilization. AI researchers have been abuzz about the performance of the DeepSeek chatbot that produces results similar to ChatGPT, but is based on open-source models and reportedly trained on older GPU chips. Some researchers are skeptical of claims about DeepSeek’s development costs and means, but its performance appears to challenge common assumptions about the computing cost of developing AI applications. This efficiency has been attributed to innovative techniques and optimized resource utilization.  Market Reactions and Data Center Implications The announcement of DeepSeek’s R1 model led to significant market reactions, with notable declines in tech stocks, including a substantial drop in Nvidia’s valuation. This downturn was driven by concerns that more efficient AI models could reduce the demand for high-end hardware and, by extension, the expansive data centers that house them. For now, investors are re-assessing the

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »