Stay Ahead, Stay ONMINE

Security leaders lose visibility as consultants deploy shadow AI copilots to stay employed

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Consulting firms’ accelerating adoption of generative AI to automate knowledge work is sending shockwaves across the industry, triggering workforce shakeups and layoffs. This month, PwC cut roughly 2% of its U.S. staff, approximately 1,500 jobs, in […]

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More


Consulting firms’ accelerating adoption of generative AI to automate knowledge work is sending shockwaves across the industry, triggering workforce shakeups and layoffs.

This month, PwC cut roughly 2% of its U.S. staff, approximately 1,500 jobs, in audit and tax linesEY eliminated 150 roles even as it announced a $1.4 billion investment to build an enterprise AI platform. Accenture slashed 19,000 positions (2.5% of its workforce) amid slowing growth and rising tech costs in 2023.

McKinsey & Company is reportedly paying senior staff up to nine months’ salary to quit voluntarily, a strategy industry observers link to a downturn in consulting spending accelerated by AI-driven change.

KPMG is realigning skills as AI platforms replace parts of the audit process and other routine tasks. In November of last year, 333 jobs were cut, or approximately 4% of U.S. audit employees.

This escalating wave of AI-driven layoffs is reflected in IBM CEO Arvind Krishna’s recent acknowledgment that IBM has replaced several hundred routine human resources roles with AI agents. Krishna’s comments underscore the unsettling reality confronting high-performing employees: roles centered around “rote process work” are rapidly becoming obsolete. Although IBM has reallocated some resources toward roles in software development and sales, the underlying message is clear. Employees increasingly perceive AI agents as existential threats, fueling anxiety and driving many to build shadow AI apps to preserve their relevance defensively.

The bottom line is that gen AI is redefining all forms of knowledge work much faster than anyone, including the industry’s elite consultants, leaders and partners, expected.

AI layoffs are sparking a survival mindset

Fearing they may be caught up in sweeping layoffs driven by AI and automation, many of the industry’s elite consultants and high performers are reinventing themselves quickly before their roles vanish.

Teams in the hardest hit areas often have dozens of shadow AI apps designed to improve efficiency and team productivity. Proposal and pitch automation, operations and workflow automation, financial modeling, scenario analysis, and client relationship managers being replaced by firm-specific copilots are where shadow AI flourishes. Many are relying on Python-based shadow AI to build custom automation tools, bypass internal IT bottlenecks, and deliver faster, differentiated insights that protect their roles in an industry under pressure from gen AI.

Python is becoming the language of reinvention

VentureBeat has learned that many of the top-tier strategists, marketers, practice leaders and their teams are becoming proficient in creating Python-based apps that can take analysis and insights beyond the existing genAI tools provided by IT. Teams creating these apps are proficient with Open AI, Google programmable search engines, Google Gemini 2.5 Pro, Perplexity and other AI platforms’ API keys and calls. Platforms of choice for fine-tuning shadow AI apps include Google Colab and Google AI Studio. Many are using Replit to create standalone apps.  

Building Shadow AI apps with enterprise-grade reach

By combining a series of APIs and search engine IDs from Anthropic, Perplexity, Open AI and Google, the speed, accuracy and acuity of insights, associates’  shadow AI apps deliver reach beyond the current scope of legitimate, IT-approved copilots and chatbots. One SME leader confided to VentureBeat that the combination of APIs and Python fine-tuning makes it possible to create apps so hyper-customized to a client’s goals that it’s saving him days of manual work aggregating and analyzing data.  

Associates at top firms globally have created dozens, and in some cases hundreds, of unique Google Search Engine APIs and IDs to power their Python apps. These APIs provide precise, real-time integration of external data directly into their shadow AI tools, further boosting their analytical edge.

Shadow AI is quickly emerging as the new consulting stack

An analysis by Cyberhaven of AI usage across three million employees found that 73.8% of workplace ChatGPT accounts were personal rather than corporate, indicating that most consultants turn to these tools independently. Cameron Coles, VP of Marketing at Cyberhaven’s blog post last month, AI Usage at Work Is Exploding — But 71% of Tools Put Your Data at Risk, provides insights into what kind of data is most often shared across shadow AI apps and the rapid growth of the category.

Coles writes, “AI usage at work continues its remarkable growth trajectory. In the past 12 months alone, usage has increased 4.6x, and over the past 24 months, AI usage has grown an astounding 61x. This represents one of the fastest adoption rates for any workplace technology, substantially outpacing even SaaS adoption, which took years to achieve similar penetration levels.”

Within top consulting firms, the proliferation of self-built, unauthorized apps continues to be explosive. Itamar Golan, CEO of Prompt Security, notes, “We see 50 new AI apps a day, and we’ve already cataloged over 12,000,” highlighting how rapidly these shadow tools are emerging. He recently told VentureBeat during an interview that “many default to indiscriminately training on proprietary data inputs,” exposing firms to substantial risk. Internal data confirms that 70–75% of consultants now regularly rely on generative AI apps, directly attributing productivity gains to shadow AI apps. It’s become the weapon of choice for consulting’s top talent, enabling them to produce exceptional work in a fraction of the time.

VentureBeat interviewed Golan, Vineet Arora, CTO of WinWire, and senior leaders at fourteen leading global consultancies to understand the breadth of shadow AI adoption.:

Estimating the true scale of shadow AI in consulting

While most enterprise tools still fail to detect the scale of shadow AI use, field interviews and telemetry from Prompt Security, WinWire and interviews with 14 top-tier consulting firms make one thing clear: shadow AI is no longer a fringe phenomenon. It’s emerging as a parallel tech stack built from the ground up by consultants themselves.

VentureBeat’s estimate incorporates:

  • Prompt Security telemetry, which detects ~50 new shadow AI apps per day and has already cataloged 12,000+ tools across consulting firms globally.
  • WinWire enterprise AI data, covering Gemini, GPT-4, Claude 3 and Colab-based deployments.
  • Cyberhaven usage data, which reveals that 73.8% of ChatGPT workplace accounts are unauthorized, and enterprise AI usage has grown 61x in 24 months.
  • 14 executive interviews with CTOs, CISOs, AI leads and partners across Tier 1 firms.

Only actively deployed, production-grade tools are counted, not one-off prompts, temporary Google Colab notebooks, or ChatGPT browser sessions. These numbers reflect a validated lower bound, likely far short of the true total.

Shadow AI app landscape in consulting, 2025 (Verified Estimate)

Use Case CategoryEstimated Shadow AI Apps   (Q2 2025)Primary Tools Used
Pitch & Proposal Automation12,000GPT-4, Gemini, Replit, Colab
Market Segmentation & Targeting9,000Perplexity, Gemini APIs, RAG apps
Research Assistants & Knowledge Bots15,000Claude 3, Gemini Pro, Google Search APIs
Client-Facing Chatbots & Agents7,500OpenAI Assistants, LangChain, custom LLMs
Workflow & Productivity Automation13,000Python automations, Sheets, Zapier
Financial Analysis & Scenario Models18,000Monte Carlo models, Gemini + Python
Total (Validated Estimate)74,500+

Sources: Prompt Security, WinWire, Cyberhaven, VentureBeat interviews with 14 global firms

Shadow AI growth trajectory: What comes next

Shadow AI is scaling faster than any sanctioned internal platform, and most firms have no real way to slow it down. Based on a conservative 5% month-over-month growth rate, the number of actively used shadow apps could more than double by mid-2026.

What started as isolated productivity scripts has evolved into something more durable. Shadow AI is no longer a fringe toolset. It is now a parallel delivery stack. It operates outside IT, without formal governance, yet it powers many of the high-value outputs firms deliver to clients every day.

Projected shadow AI app growth in consulting

QuarterProjected App CountDrivers of Growth
Q2 202574,500+Verified active apps from Prompt Security, WinWire, and interviews
Q3 202590,000 to 95,000Growth in Gemini and Claude apps, partner-led development
Q4 2025110,000 to 115,000Shadow tools become embedded in client delivery workflows
Q1 2026130,000 to 140,000Emergence of gray-market copilots and self-maintained apps
Q2 2026150,000 to 160,000+Shadow AI evolves into an ungoverned parallel delivery stack

These projections exclude one-off use of ChatGPT or Gemini in browser sessions. They reflect persistent apps and workflows built using APIs, scripting, or automated agents developed inside consulting teams.

How to strategically manage shadow AI risks

Shadow AI is thriving because traditional IT and cybersecurity frameworks aren’t designed to track its use. IT and security teams in nearly every enterprise VentureBeat spoke with have three to five times the number of projects they can complete this year. While getting a new copilot out is a high priority, it can face resource and approval hurdles as a result.

Arora underscores that “most traditional management tools lack comprehensive visibility into AI apps,” enabling unauthorized AI to quietly embed itself within enterprise workflows.” Arora’s insights reveal an underlying truth: Employees aren’t acting maliciously; they’re fearful of being let go while simultaneously overwhelmed with work, leveraging AI to cope with escalating workloads, shrinking deadlines, and relentless performance expectations.

Rather than stifling AI adoption, Arora advocates proactive empowerment through strategic, centralized governance. By institutionalizing clear oversight, organizations can harness AI securely, transforming shadow AI from an unseen threat into a controlled asset.

A blueprint for governance

Consultancies’ senior management teams need a clear, practical roadmap to get in front of shadow AI risks and harness its strategic potential. Arora outlined a detailed governance framework during a recent interview with VentureBeat, explicitly designed for enterprises navigating the complexities of shadow AI:

  • Shadow AI audits are table stakes:
    Regularly take inventory of all unauthorized AI activity through robust network monitoring and detailed software asset management.
  • Create an Office of Responsible AI:
    Centralize AI governance functions spanning policy creation, vendor assessments and risk analysis, and maintain an approved AI tools catalog accessible to all teams.
  • Get AI-aware security controls in place immediately:
    Deploy specialized Data Loss Prevention (DLP) tools and real-time inference monitoring capable of detecting sensitive data leaks specific to AI applications in real-time.
  • Go all in on applying zero trust to AI architectures:
    Adopt strict output validation protocols, anonymize or tokenize sensitive inputs, and rigorously manage data flows to minimize exposure and prevent unauthorized data training.
  • Find out where the roadblocks are to getting more gen AI tools out now:
    Every organization can improve on the speed at which it deploys new technologies. Find out where the gaps and roadblocks are holding the consultancy back from delivering more adept copilots and chatbots. It is essential to get a roadmap defined for IT and DevOps to work on for internally suggested Python apps, fine-tuned to client needs.
  • GRC integration and continuous training:
    Integrate AI governance within existing governance, risk, and compliance (GRC) frameworks, and consistently consult on secure, compliant AI practices.
  • Avoid blanket bans, it’s fuel for even more shadow AI app development:
    Recognize that outright AI bans inevitably backfire, increasing shadow AI proliferation. Instead, rapidly deploy secure, sanctioned alternatives that enable compliant, productive innovation.

Initially an underground productivity hack, shadow AI has emerged as a decisive factor in how top-tier consultants deliver differentiated client value. Driven by a stark survival imperative amid widespread AI-triggered layoffs, elite talent now relies on Python-driven, generative AI-powered solutions, enabling uniquely tailored client insights and rapid responses to their clients.

Consulting firms that are slow to adapt or hesitant to strategically harness these innovations strategically risk forfeiting their future competitive edge. The path forward demands not prohibition but thoughtful, secure integration of shadow AI and the transformation of potential risks into decisive strategic advantages.

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

Many ways to use the date command on Linux

$ date +%s1746203311 You can also convert a timestamp back to a human-friendly date: $ date -d @1746203311Fri May 2 12:28:31 PM EDT 2025 The date -u commands displays the date and time in UTC (coordinated universal time): $ date -uFri May 2 04:30:55 PM UTC 2025 You can display

Read More »

Nvidia eyes China rebound with stripped-down AI chip tailored to export limits

“Nvidia’s region-specific, compliance-driven chip strategy introduces manageable fragmentation risks, but also unlocks significant opportunities for global enterprises,” said Prabhu Ram, VP of the industry research group at Cybermedia Research. “While hardware and software inconsistencies may complicate unified AI deployments, these variants enable legal market access, cost optimization, and hybrid architecture

Read More »

GasBuddy Head Predicts ‘Relatively Stable Stretch for Gas Prices’

In a blog posted on the GasBuddy website on Tuesday, Patrick De Haan, the company’s head of petroleum analysis, said, “as we move into the heart of summer”, he believes “we’re likely to see a relatively stable stretch for gas prices as refinery maintenance wraps up”. “Don’t expect the national average to rise above $3.30 per gallon, nor drop much below $3 for now,” he added. However, De Haan warned in the blog that several factors could still influence prices in the weeks ahead, “including growing uncertainty around the upcoming hurricane season”. “We’ll also be watching OPEC+’s meeting this week to see if they boost oil production again for July as well,” De Haan said. The blog noted that the U.S. average price of gasoline has risen, then fallen, and remains just below its week ago level at $3.13 per gallon, according to GasBuddy data compiled from more than 12 million individual price reports covering over 150,000 gas stations across the country. “The national average is up 3.1 cents from a month ago and is 43.0 cents per gallon lower than a year ago,” the blog added. The U.S. Energy Information Administration’s (EIA) latest gasoline fuel update, which was released on May 28, showed that the U.S. regular gasoline price averaged $3.120 per gallon on May 12, $3.173 per gallon on May 19, and $3.160 per gallon on May 26. The May 26 price is down $0.417 compared to the year ago price, the fuel update outlined. Of the five Petroleum Administration for Defense District (PADD) regions highlighted in the EIA’s latest fuel update, the West Coast was shown to have the highest U.S. regular gasoline price as of May 26, at $4.258 per gallon. The Gulf Coast was shown to have the lowest U.S. regular gasoline price as of May

Read More »

Equinor Offers $1.75B Bonds

Equinor ASA has entered into underwriting agreements for the sale of debt instruments with a total principal amount of $1.75 billion. The offering consists of $550 million notes due June 2028 with a 4.25 percent interest, $400 million notes due September 2030 with a 4.5 percent interest and $800 million notes due June 2035 with a 5.125 percent interest. The interests are payable semi-annually, according to a regulatory filing Wednesday. The unsecured notes will be issued in denominations of $1,000 and integral multiples of $1,000 in excess thereof. The underwriters are Barclays Capital Inc., BofA Securities Inc., Deutsche Bank Securities Inc., Goldman Sachs & Co. LLC and JP Morgan Securities LLC. Equinor Energy AS is guarantor. The offering is scheduled to close June 3. “[T]he net proceeds from the sale of securities will be used for general corporate purposes, which may include working capital, the repayment of existing debt (including debt incurred in connection with acquisitions) or the financing of acquisitions”, Equinor told the United States Securities and Exchange Commission. As of the end of the first quarter the Norwegian majority state-owned company had a current finance debt including lease liabilities of $7.03 billion. Non-current finance debt including lease liabilities stood at $22.74 billion, of which $21.16 billion is unsecured and $1.58 billion secured. Total equity was $45.86 billion. Total finance debt and equity was $75.63 billion. Last week Equinor and Polenergia reached financial close securing over EUR 3 billion for the Baltyk II offshore wind project and over EUR 3 billion for Baltyk III (around $6.76 billion in total). The wind projects on the Polish side of the Baltic Sea will have a combined capacity of 1,440 megawatts (720 MW each), enough to power two million Polish homes, according to the 50-50 co-venturers. Full commercial generation is expected 2028. “The

Read More »

Standard Chartered Expects OPEC+ 8 to ‘Continue with Accelerated Unwinding’

In a report sent to Rigzone on Wednesday by Emily Ashford, an energy analyst at Standard Chartered Bank, analysts at the bank, including Ashford and the bank’s commodities research head Paul Horsnell, said they expect the eight OPEC+ producers who contributed additional voluntary curbs “to continue with the accelerated unwinding of the November 2023 tranche of 2.22 million barrels per day of voluntary cuts”. “The eight OPEC+ producers who contributed additional voluntary cuts are scheduled to meet virtually on 31 May to discuss production strategy for July loading programs,” the analysts highlighted in the report. “Adding a further 411,000 barrels per day to targets for July would bring the cumulative unwinding over four months to 1.4 million barrels per day,” they added. The analysts noted in the report that the increase in targets has not translated into barrel for barrel increases in OPEC+ output. They outlined that this is “partly because of output constraints in some countries and partly because for the main overproducers the increase simply cuts the extent of the overshoot above target”. “We estimate total OPEC+ crude oil output was 40.994 million barrels per day in March, the last month before the unwind began,” the analysts said in the report. “We expect it to be 41.899 million barrels per day in July assuming the 411,000 barrel per day increase goes ahead, just 905,000 barrels per day higher despite the 1.4 million barrel per day relaxation in targets,” they added. In the report, the Standard Chartered Bank analysts forecast that the global market will be “broadly balanced in both Q2 and Q3, even if the November 2023 tranche of voluntary cuts is fully unwound by the end of Q3”. “We expect the call on OPEC+ crude oil and inventories to average 42.1 million barrels per day in Q3,

Read More »

North Atlantic in Talks with ExxonMobil to Buy French Refinery Complex

Exxon Mobil Corp. has agreed to enter exclusive negotiations with North Atlantic for the Canadian company’s potential takeover of a refining and petrochemical complex in France. The 1,500-acre site in Gravenchon, Normandy hosts the second-biggest refinery in France and one of the largest integrated chemical complexes in Western Europe, North Atlantic noted. With two distillation trains, several conversion units and associated logistics facilities, the site can process up to 230,000 barrels a day of crude and other feedstocks. The planned transaction involves the divestment of an 82.89 percent stake in Esso Société Anonyme Française SA (Esso SAF), which manages the refinery, and 100 percent of ExxonMobil Chemical France SAS (EMCF), which manages the site’s petrochemical operations, from ExxonMobil France Holding SAS to North Atlantic France SAS. All affected employees, about 1,350, would be retained by the prospective new owner on the same employment terms, according to an online statement from ExxonMobil. “ExxonMobil has been operating in France for over 120 years and we plan to maintain a significant commercial presence with the Esso brand at around 750 retail sites across the country”, said Tanya Bryja, senior vice president of ExxonMobil Product Solutions. “France remains an important market for us, and we will continue to support customers with sales of chemicals, finished lubricants, base stocks, synthetics, and other specialty products as well”. ExxonMobil will keep trademark registrations and other intellectual property rights owned by Esso SAF and EMCF. “ExxonMobil continually evaluates its business globally and the proposed sale is aligned with its business strategy”, the company said. “Europe is an important region for ExxonMobil where there will continue to be a meaningful presence”. ExxonMobil is high-grading its portfolio by selling non-strategic assets. The planned sale to North Atlantic follows ExxonMobil’s sale last year of the Fos-sur-Mer refinery in France to a Trafigura consortium,

Read More »

OPEC+ Ratifies Group-Wide Quotas Before July Output Decision

OPEC+ ratified group-wide production quotas for this year and next, ahead of a decision by eight key members at the weekend whether to bolster output again in July. The Organization of the Petroleum Exporting Countries and its partners reaffirmed their longer-term targets for 2025 and 2026, which underpin its current supply restraints, according to a statement after a video conference on Wednesday.  A more market-sensitive discussion on whether to continue their 411,000 barrel-a-day hikes, which have sent prices crashing over the past two months, will be finalized in a call on Saturday, according to delegates who asked not to be identified. The sequencing of the meetings underscores how oil quotas for the full 22-nation OPEC+ alliance have receded in importance over the past two years, as actual supply adjustments are carried out by sub-group of eight countries, led by Saudi Arabia and Russia. It was these nations that shattered market expectations on April 3, when they announced their first super-sized hike — triple the volume originally scheduled. The shock move, unveiled just hours after President Donald Trump launched a global trade war, helped send crude futures to a four-year low below $60 a barrel in the ensuing days, marking a rupture with years of efforts by the coalition to try to shore up prices.   Brent contracts have since stabilized near $65 as Trump has reversed some of his trade tariffs.  OPEC+ delegates have offered a range of explanations for the policy reversal: from satisfying summer fuel demand to punishing over-producing members, and from placating President Trump to recouping lost market share. At Wednesday’s gathering, they also agreed to “develop a mechanism” for setting production baselines in 2027 based on their maximum sustainable capacity, according to the statement. The 22-nation alliance will hold its next full ministerial meeting on Nov. 30. Its Joint Ministerial

Read More »

AI and microgrids: Moving towards seamless, resilient energy solutions

Brad Meissner is director of product management at Generac. Global energy demand is on the rise and will continue to experience significant growth — particularly electricity use, which is expected to grow at an annual rate of around 4%, according to the International Energy Agency.  As this trend persists, driven by factors like the electrification of transportation and the rising demand for data centers, microgrids are emerging as essential solutions to address the challenges of modern energy systems. These decentralized systems self-generate, store and distribute power intelligently, alleviating strain on traditional grids. However, while the potential of microgrids is vast, they are still largely fragmented and custom-designed, limiting scalability. The integration of artificial intelligence is transforming this space, enabling more efficient, resilient and secure energy management solutions. The future of energy isn’t just about generation — it’s about intelligent, adaptive networks that maximize reliability and sustainability. Is your business ready to embrace this shift?  Let’s unpack how AI-powered microgrids are overhauling energy systems — optimizing distribution, enhancing resilience and reducing reliance on traditional grids. Current state of microgrid projects and the need for clear standards Today, microgrid projects are mostly tailored to specific needs, making them bespoke solutions that vary widely in design and operation. While this customization meets particular requirements, it creates a fragmented market where the integration of various energy resources (e.g., solar, battery storage and backup generators) is often cobbled together. This lack of standardization leads to inefficiencies and higher costs, limiting scalability and interoperability. The lack of a cohesive framework makes it difficult to achieve the kind of seamless integration needed for large-scale deployment. For example, integrating renewable energy sources into microgrids, while essential for sustainability, adds complexity when each project uses different hardware, software and operational models. To address these challenges, the industry needs clear

Read More »

HPE Aruba unveils raft of new switches for data center, campus modernization

And in large-scale enterprise environments embracing collapsed-core designs, the switch acts as a high-performance aggregation layer. It consolidates services, simplifies network architecture, and enforces security policies natively, reducing complexity and operational cost, Gray said. In addition, the switch offers the agility and security required at colocation facilities and edge sites. Its integrated Layer 4 stateful security and automation-ready platform enable rapid deployment while maintaining robust control and visibility over distributed infrastructure, Gray said. The CX 10040 significantly expands the capacity it can provide and the roles it can serve for enterprise customers, according to one industry analyst. “From the enterprise side, this expands on the feature set and capabilities of the original 10000, giving customers the ability to run additional services directly in the network,” said Alan Weckel, co-founder and analyst with The 650 Group. “It helps drive a lower TCO and provide a more secure network.”  Aimed as a VMware alternative Gray noted that HPE Aruba is combining its recently announced Morpheus VM Essentials plug-in package, which offers a hypervisor-based package aimed at hybrid cloud virtualization environments, with the CX 10040 to deliver a meaningful alternative to Broadcom’s VMware package. “If customers want to get out of the business of having to buy VM cloud or Cloud Foundation stuff and all of that, they can replace the distributed firewall, microsegmentation and lots of the capabilities found in the old VMware NSX [networking software] and the CX 10k, and Morpheus can easily replace that functionality [such as VM orchestration, automation and policy management],” Gray said. The 650 Group’s Weckel weighed in on the idea of the CX 10040 as a VMware alternative:

Read More »

Indian startup Refroid launches India’s first data center CDUs

They use heat exchangers and pumps to regulate the flow and temperature of fluid delivered to equipment for cooling, while isolating the technology cooling system loop from facility systems. The technology addresses limitations of traditional air cooling, which industry experts say cannot adequately handle the heat generated by modern AI processors and high-density computing applications. Strategic significance for India Industry analysts view the development as a critical milestone for India’s data center ecosystem. “India generates 20% of global data, yet contributes only 3% to global data center capacity. This imbalance is not merely spatial — it’s systemic,” said Sanchit Vir Gogia, chief analyst and CEO at Greyhound Research. “The emergence of indigenously developed CDUs signals a strategic pivot. Domestic CDU innovation is a defining moment in India’s transition from data centre host to technology co-creator.” Neil Shah, VP for research and partner at Counterpoint Research, noted that major international players like Schneider, Vertiv, Asetek, Liquidstack, and Zutacore have been driving most CDU deployments in Indian enterprises and data centers. “Having a local indigenous CDU tech and supplier designed with Indian weather, infrastructure and costs in mind expands options for domestic data center demand,” he said. AI driving data center cooling revolution India’s data center capacity reached approximately 1,255 MW between January and September 2024 and was projected to expand to around 1,600 MW by the end of 2024, according to CBRE India’s 2024 Data Center Market Update. Multiple market research firms have projected the India data center market to grow from about $5.7 billion in 2024 to $12 billion by 2030. Bhavaraju cited aggressive projections for the sector’s expansion, with AI workloads expected to account for 30% of total workloads by 2030. “All of them need liquid cooling,” he said, noting that “today’s latest GPU servers – GB200 from Nvidia

Read More »

Platform approach gains steam among network teams

Revisting the platform vs. point solutions debate The dilemma of whether to deploy an assortment of best-of-breed products from multiple vendors or go with a unified platform of “good enough” tools from a single vendor has vexed IT execs forever. Today, the pendulum is swinging toward the platform approach for three key reasons. First, complexity, driven by the increasingly distributed nature of enterprise networks, has emerged as a top challenge facing IT execs. Second, the lines between networking and security are blurring, particularly as organizations deploy zero trust network access (ZTNA). And third, to reap the benefits of AIOps, generative AI and agentic AI, organizations need a unified data store. “The era of enterprise connectivity platforms is upon us,” says IDC analyst Brandon Butler. “Organizations are increasingly adopting platform-based approaches to their enterprise connectivity infrastructure to overcome complexity and unlock new business value. When enhanced by AI, enterprise platforms can increase productivity, enrich end-user experiences, enhance security, and ultimately drive new opportunities for innovation.” In IDC’s Worldwide AI in Networking Special Report, 78% of survey respondents agreed or strongly agreed with the statement: “I am moving to an AI-powered platform approach for networking.” Gartner predicts that 70% of enterprises will select a broad platform for new multi-cloud networking software deployments by 2027, an increase from 10% in early 2024. The breakdown of silos between network and security operations will be driven by organizations implementing zero-trust principles as well as the adoption of AI and AIOps. “In the future, enterprise networks will be increasingly automated, AI-assisted and more tightly integrated with security across LAN, data center and WAN domains,” according to Gartner’s 2025 Strategic Roadmap for Enterprise Networking. While all of the major networking vendors have announced cloud-based platforms, it’s still relatively early days. For example, Cisco announced a general framework for Cisco

Read More »

Oracle to spend $40B on Nvidia chips for OpenAI data center in Texas

OpenAI has also expanded Stargate internationally, with plans for a UAE data center announced during Trump’s recent Gulf tour. The Abu Dhabi facility is planned as a 10-square-mile campus with 5 gigawatts of power. Gogia said OpenAI’s selection of Oracle “is not just about raw compute, but about access to geographically distributed, enterprise-grade infrastructure that complements its ambition to serve diverse regulatory environments and availability zones.” Power demands create infrastructure dilemma The facility’s power requirements raise serious questions about AI’s sustainability. Gogia noted that the 1.2-gigawatt demand — “on par with a nuclear facility” — highlights “the energy unsustainability of today’s hyperscale AI ambitions.” Shah warned that the power envelope keeps expanding. “As AI scales up and so does the necessary compute infrastructure needs exponentially, the power envelope is also consistently rising,” he said. “The key question is how much is enough? Today it’s 1.2GW, tomorrow it would need even more.” This escalating demand could burden Texas’s infrastructure, potentially requiring billions in new power grid investments that “will eventually put burden on the tax-paying residents,” Shah noted. Alternatively, projects like Stargate may need to “build their own separate scalable power plant.” What this means for enterprises The scale of these facilities explains why many organizations are shifting toward leased AI computing rather than building their own capabilities. The capital requirements and operational complexity are beyond what most enterprises can handle independently.

Read More »

New Intel Xeon 6 CPUs unveiled; one powers rival Nvidia’s DGX B300

He added that his read is that “Intel recognizes that Nvidia is far and away the leader in the market for AI GPUs and is seeking to hitch itself to that wagon.” Roberts said, “basically, Intel, which has struggled tremendously and has turned over its CEO amidst a stock slide, needs to refocus to where it thinks it can win. That’s not competing directly with Nvidia but trying to use this partnership to re-secure its foothold in the data center and squeeze out rivals like AMD for the data center x86 market. In other words, I see this announcement as confirmation that Intel is looking to regroup, and pick fights it thinks it can win. “ He also predicted, “we can expect competition to heat up in this space as Intel takes on AMD’s Epyc lineup in a push to simplify and get back to basics.” Matt Kimball, vice president and principal analyst, who focuses on datacenter compute and storage at Moor Insights & Strategy, had a much different view about the announcement. The selection of the Intel sixth generation Xeon CPU, the 6776P, to support Nvidia’s DGX B300 is, he said, “important, as it validates Intel as a strong choice for the AI market. In the big picture, this isn’t about volumes or revenue, rather it’s about validating a strategy Intel has had for the last couple of generations — delivering accelerated performance across critical workloads.”  Kimball said that, In particular, there are a “couple things that I would think helped make Xeon the chosen CPU.”

Read More »

AWS clamping down on cloud capacity swapping; here’s what IT buyers need to know

As of June 1, AWS will no longer allow sub-account transfers or new commitments to be pooled and reallocated across customers. Barrow says the shift is happening because AWS is investing billions in new data centers to meet demand from AI and hyperscale workloads. “That infrastructure requires long-term planning and capital discipline,” he said. Phil Brunkard, executive counselor at Info-Tech Research Group UK, emphasized that AWS isn’t killing RIs or SPs, “it’s just closing a loophole.” “This stops MSPs from bulk‑buying a giant commitment, carving it up across dozens of tenants, and effectively reselling discounted EC2 hours,” he said. “Basically, AWS just tilted the field toward direct negotiations and cleaner billing.” What IT buyers should do now For enterprises that sourced discounted cloud resources through a broker or value-added reseller (VAR), the arbitrage window shuts, Brunkard noted. Enterprises should expect a “modest price bump” on steady‑state workloads and a “brief scramble” to unwind pooled commitments.  If original discounts were broker‑sourced, “budget for a small uptick,” he said. On the other hand, companies that buy their own RIs or SPs, or negotiate volume deals through AWS’s Enterprise Discount Program (EDP), shouldn’t be impacted, he said. Nothing changes except that pricing is now baselined.

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »