Stay Ahead, Stay ONMINE

Nvidia: Latest news and insights

Nvidia: ‘Graphics 3.0’ will drive physical AI productivity August 15, 2025: Nvidia has floated the idea of “Graphics 3.0” with the hope of making AI-generated graphics central to physical productivity. The concept revolves around graphics created by genAI tools. Nvidia say AI-generated graphics could help in training robots to do their jobs in the physical […]

Nvidia: ‘Graphics 3.0’ will drive physical AI productivity

August 15, 2025: Nvidia has floated the idea of “Graphics 3.0” with the hope of making AI-generated graphics central to physical productivity. The concept revolves around graphics created by genAI tools. Nvidia say AI-generated graphics could help in training robots to do their jobs in the physical world or by helping AI assistants automate the creation of equipment and structures.

Nvidia launches Blackwell-powered RTX Pro GPUs for compact AI workstations

August 12, 2025: Nvidia announced two new professional GPUs, the RTX Pro 4000 Small Form Factor (SFF) and the RTX Pro 2000. Built on its Blackwell architecture, Nvidia’s new GPUs aim to deliver powerful AI capabilities in compact desktop and workstation deployments.

Nvidia’s new genAI model helps robots think like humans

August 11, 2025: Nvidia has developed a genAI model to help robots make human-like decisions by analyzing surrounding scenes. The Cosmos Reason model in robots can take in information from video and graphics input, analyze the data, and use its understanding to make decisions.

Nvidia patches critical Triton server bugs that threaten AI model security

August 5, 2025: A surprising attack chain in Nvidia’s Triton Inference Server, starting with a seemingly minor memory-name leak, could allow full remote server takeover without user authentication.

China demands ‘security evidence’ from Nvidia over H20 chip backdoor fears

August 4, 2025: China escalated pressure on Nvidia with the state-controlled People’s Daily publishing an opinion piece titled “Nvidia, how can I trust you?” — a day after regulators summoned company officials over alleged security vulnerabilities in H20 artificial intelligence chips.

Nvidia to restart H20 exports to China, unveils new export-compliant GPU

July 15, 2025: Nvidia will restart H20 AI chip sales to China and release a new GPU model compliant with export rules, a move that could impact global AI hardware strategies for enterprise IT teams. Nvidia has applied for US approval to resume sales and says that the government has indicated licenses will be granted and deliveries could begin soon.

Nvidia GPUs are vulnerable to Rowhammer attacks

July 15, 2025: Nvidia has issued a security reminder to application developers, computer manufacturers, and IT leaders that modern memory chips in graphic processors are potentially susceptible to so-called Rowhammer exploits after Canadian university researchers proved that an Nvidia A6000 GPU could be successfully compromised with a similar attack.

Nvidia hits $4T market cap as AI, high-performance semiconductors hit stride

July 11, 2025: Nvidia became the first publicly traded company to surpass a $4 trillion market capitalization value, 13 months after surpassing the $3 trillion mark. This makes Nvidia the world’s most valuable company ahead of Apple and Microsoft.

New Nvidia technology provides instant answers to encyclopedic-length questions

Jul 8, 2025: Have a question that needs to process an encyclopedia-length dataset? Nvidia says its new technique can answer it instantly. Built leveraging the company’s Blackwell processor’s capabilities, the new “Helix Parallelism” method allows AI agents to process millions of words — think encyclopedia-length — and support up to 32x more users at a time.

Nvidia doubles down on GPUs as a service

July 8, 2025: Nvidia’s recent initiative to dive deeper into the GPU-as-a-service (GPUaaS) model marks a significant and strategic shift that reflects an evolving landscape within the cloud computing market. 

Nvidia, Perplexity to partner with EU and Middle East AI firms to build sovereign LLMs

June 12, 2025: Nvidia and AI search firm Perplexity said they are joining hands with model builders and cloud providers across Europe and the Middle East to refine sovereign large-language models (LLMs) and accelerate enterprise AI uptake in local industries.

Nvidia: ‘Sovereign AI’ will change digital work

June 11, 2025: Nvidia executives think sovereign AI has the potential to change digital work as generative AI (genAI) aligns with national priorities and local regulations.

AWS cuts prices of some EC2 Nvidia GPU-accelerated instances

June 9, 2025: AWS has reduced the prices of some of its Nvidia GPU-accelerated instances to attract more AI workloads while competing with rivals, such as Microsoft and Google, as demand for GPUs and the cost of securing them continues to grow.

Nvidia aims to bring AI to wireless

June 6, 2025: Nvidia hopes to maximize RAN infrastructure use (traditional networks average a low 30% to 35%), use AI to rewrite the air interface, and enhance performance and efficiency through radio signal processing. The longer-term goal is to seamlessly process AI traffic at the network edge to create new monetization opportunities for service providers.

Oracle to spend $40B on Nvidia chips for OpenAI data center in Texas

May 26, 2025: Oracle is reportedly spending about $40 billion on Nvidia’s high-performance computer chips to power OpenAI’s new data center in Texas, marking a pivotal shift in the AI infrastructure landscape that has significant implications for enterprise IT strategies.

Nvidia eyes China rebound with stripped-down AI chip tailored to export limits

May 26, 2025: Nvidia plans to launch a lower-cost AI chip for China in June, aiming to protect market share under the US export controls and signal a broader shift toward affordable, segmented products that could impact global enterprise AI spending.

Nvidia introduces ‘ridesharing for AI’ with DGX Cloud Lepton

May 19, 2025: Nvidia introduced DGX Cloud Lepton, an AI-centric cloud software program that makes it easier for AI factories to rent out their hardware to developers who wish to access performant compute globally.

May 19, 2025: Nvidia kicked off the Computex systems hardware tradeshow with the news it has opened the NVLink interconnect technology to the competition with the introduction of NVLink Fusion. NVLink is a high-speed interconnect born out of its Mellanox networking group which lets multiple GPUs in a system or rack share compute and memory resources, thus making many GPUs appear to the system as a single processor.

AMD, Nvidia partner with Saudi startup to build multi-billion dollar AI service centers

May 15, 2025: As part of the avalanche of business deals coming from President Trump’s Middle East tour, both AMD and Nvidia have struck multi-billion dollar deals with an emerging Saudi AI firm. The deals served as the coming out party for Humain, a state-backed artificial intelligence (AI) company that operates under the Kingdom’s Public Investment Fund (PIF) and is chaired by Crown Prince Mohammed bin Salman. 

Nvidia, ServiceNow engineer open-source model to create AI agents

May 6, 2025: Nvidia and ServiceNow have created an AI model that can help companies create learning AI agents to automate corporate workloads..The open-source Apriel model, available generally in the second quarter on HuggingFace, will help create AI agents that can make decisions around IT, human resources and customer-service functions.

Nvidia AI supercluster targets agents, reasoning models on Oracle Cloud

April 29, 2025: The move marks the first wave of liquid-cooled Nvidia GB200 NVL72 racks in OCI data centers, involving thousands of Nvidia Grace CPUs and Blackwell GPUs. 

Nvidia says NeMo microservices now generally available

April 23, 2025: Nvidia announced the general availability of neural module (NeMo) microservices, a modular platform for building and customizing gen AI models and AI agents.NeMo microservices integrate with partner platforms to provide features including prompt tuning, supervised fine-tuning, and knowledge retrieval tools.

Nvidia expects ban on chip exports to China to cost $5.5B

April 16, 2025: Nvidia now expects new US government restrictions on exports of its H20 chip to China will cost the company as much as $5.5 billion.

Incomplete patching leaves Nvidia, Docker exposed to DOS attacks

April 15, 2025: A critical race condition bug affecting the Nvidia Container Toolkit, which received a fix in September, might still be open to attacks owing to incomplete patching.

Nvidia lays out plans to build AI supercomputers in the US

April 14, 2025: There was mixed reaction from industry analysts over an announcement that Nvidia plans to produce AI supercomputers entirely in the US. The company said in a blog post that, together with its manufacturing partners, it has commissioned more than one million square feet (92,900 square meters) of manufacturing space to build and test Nvidia Blackwell chips in Arizona and AI supercomputers in Texas.

Potential Nvidia chip shortage looms as Chinese customers rush to beat US sales ban

April 2, 2025: The AI chip shortage could become even more dire as Chinese customers are purportedly looking to hoard Nvidia chips ahead of a proposed US sales ban. According to inside sources, Chinese companies including ByteDance, Alibaba Group, and Tencent Holdings have ordered at least $16 billion worth of Nvidia’s H20 server chips for running AI workloads in just the first three months of this year.

Nvidia’s Blackwell raises the bar with new MLPerf Inference V5.0 results

April 2, 2025: Nvidia released a set of MLPerf Inference V5.0 benchmark results for its Blackwell GPU, the successor to Hopper, saying that its GB200 NVL72 system, a rack-scale offering designed for AI reasoning, set a series of performance records.

5 big takeaways from Nvidia GTC

March 25, 2025: Now that the dust has settled from Nvidia’s GTC 2025, a few industry experts weighed in on some core big picture developments from the conference. Here are five of their top observations.

Nvidia wants to be a one-stop enterprise technology shop

March 24, 2025: After last week’s Nvidia GTC 2025 event, a new, fuller picture of the vendor emerged. Analysts agree that Nvidia is not just a graphics chip provider anymore. It’s a full-stack solution provider, and GPUs are just one of many parts.

Nvidia launches AgentIQ toolkit to connect disparate AI agents

March 21, 2025: As enterprises look to adopt agentic AI to boost the efficiency of their applications, Nvidia introduced a new open-source software library — AgentIQ toolkit — to help developers connect disparate agents and agent frameworks. The toolkit, according to Nvidia, packs in a variety of tools, including ones to weave in RAG, search, and conversational UI into agentic AI applications.

Nvidia launches research center to accelerate quantum computing breakthrough

March 21, 2025: In a move to help accelerate the timeline for practical, real-world quantum applications, Nvidia is establishing the Nvidia Accelerated Quantum Research Center. “Quantum computing will augment AI supercomputers to tackle some of the world’s most important problems,” Nvidia CEO Jensen Huang said.

Nvidia, xAI and two energy giants join genAI infrastructure initiative

March 19, 2025: An industry generative artificial intelligence (genAI) alliance, the AI Infrastructure Partnership (AIP), on Wednesday announced that xAI, Nvidia, GE Vernova, and NextEra Energy were joining BlackRock, Microsoft, and Global Infrastructure Partners as members.

IBM broadens access to Nvidia technology for enterprise AI

March 19, 2025: New collaborations between IBM and Nvidia have yielded a content-aware storage capability for IBM’s hybrid cloud infrastructure, expanded integration between watsonx and Nvidia NIM, and AI services from IBM Consulting that use Nvidia Blueprints.

Nvidia’s silicon photonics switches bring better power efficiency to AI data centers

March 19, 2025: Amid the flood of news from Nvidia’s annual GTC event, one item stood out. Nvidia introduced new silicon photonics network switches that integrate network optics into the switch using a technique called co-packaged optics (CPO), replacing traditional external pluggable transceivers. While Nvidia alluded to its new switches providing a cost savings, the primary benefit is to reduce power consumption with an improvement in network resiliency.

What is Nvidia Dynamo and why it matters to enterprises?

March 19, 2025: Chipmaker Nvidia has released a new open-source inferencing software — Dynamo, at its GTC 2025 conference, that will allow enterprises to increase throughput and reduce cost while using large language models on Nvidia GPUs.

Nvidia, xAI and two energy giants join genAI infrastructure initiative

March 19, 2025:  AI Infrastructure Partnership (AIP) announced that xAI, Nvidia, GE Vernova, and NextEra Energy joined the AIP. But given that no financial commitments or any other details were released, will it make a difference?

HPE, Nvidia broaden AI infrastructure lineup

March 19, 2025: HPE news from Nvidia GTC includes a new Private Cloud AI developer kit, Nvidia AI blueprints, GPU optimization capabilities, and servers built with Nvidia Blackwell Ultra and Blackwell architecture.

Cisco, Nvidia team to deliver secure AI factory infrastructure

March 18, 2025: Cisco and Nvidia have expanded their partnership to create their most advanced AI architecture package to date, designed to promote secure enterprise AI networking.

Nvidia’s ‘hard pivot’ to AI reasoning bolsters Llama models for agentic AI

March 18, 2025: The company has post-trained its new Llama Nemotron family of reasoning models to improve multistep math, coding, reasoning, and complex decision-making. The enhancements aim to provide developers and enterprises with a business-ready foundation for creating AI agents that can work independently or as part of connected teams.

Nvidia details its GPU, CPU, and system roadmap for the next three years

March 18, 2025: Nvidia CEO Jensen Huang shared previously unreleased specifications for its Rubin graphics processing unit (GPU), due in 2026, the Rubin Ultra coming in 2027, and announced the addition of a new GPU called Feynman to the mix for 2028.

Oracle, Nvidia partner to add AI software into OCI services

March 18, 2025: Nvidia’s AI Enterprise stack will be available natively through the OCI Console and will be available anywhere in OCI’s distributed cloud while providing enterprises access to over 160 AI tools for training and inference, including NIM microservices, the companies said in a joint statement at Nvidia’s annual GTC conference.

Nvidia GTC 2025: What to expect from the AI leader

March 3, 2025: Last year, Nvidia’s GTC 2024 grabbed headlines with the introduction of the Blackwell architecture and the DGX systems powered by it. With Nvidia GTC 2025 right around the corner, the tech world is eager to see what Nvidia – and its partners and competitors – will unveil next. 

Cisco, Nvidia expand AI partnership to include Silicon One technology

February 25, 2025; Cisco and Nvidia have expanded their collaboration to support enterprise AI implementations by tying Cisco’s Silicon One technology to Nvidia’s Ethernet networking platform. The extended agreement is designed to offer customers yet another way to support AI workloads across the data center and strengthens both companies’ strategies to expand the role of Ethernet networking for AI in the enterprise.

Nvidia forges healthcare partnerships to advance AI-driven genomics, drug discovery

February 14, 2025: Through new partnerships with industry leaders, Nvidia aims to advance practical use cases for AI in healthcare and life sciences. It’s a logical move: Healthcare has the most significant upside, particularly in patient care, among all the industries applicable to AI. 

Nvidia partners with cybersecurity vendors for real-time monitoring

February 12, 2025: Nvidia partnered with leading cybersecurity firms to provide real-time security protection using its accelerator and networking hardware in combination with its AI software. Under the agreement, Nvidia will provide integration of its BlueField and Morpheus hardware with cyber defenses software from Armis, Check Point Software Technologies, CrowdStrike, Deloitte and World Wide Technology .

Nvidia claims near 50% boost in AI storage speed

February 7, 2025: Nvidia is touting a near 50% improvement in storage read bandwidth thanks to intelligence in its Spectrum-X Ethernet networking equipment, according to the vendor’s technical blog post. Spectrum-X is a combination of the company’s Spectrum-4 Ethernet switch and BlueField-3 SuperNIC smart networking card, which supports RoCE v2 for remote direct memory access (RDMA) over Converged Ethernet.

Nvidia unveils preview of DeepSeek-R1 NIM microservice

February 3, 2025: The chipmaker stock plummeted 17% after Chinese AI developer DeepSeek unveiled its DeepSeek-R1 LLM. Last week, Nvidia announced the DeepSeek-R1 model is now available as a preview Nvidia inference microservice (NIM) on build.nvidia.com.

Nvidia unveils preview of DeepSeek-R1 NIM microservice

January 31, 2025: Nvidia stock plummeted 17% after Chinese AI developer, DeepSeek, unveiled its DeepSeek-R1 LLM. Later the same week, the chipmaker turned around and announced the DeepSeek-R1 model is available as a preview Nvidia inference microservice (NIM) on build.nvidia.com.

Nvidia intros new guardrail microservices for agentic AI

January 16, 2025: Nvidia added new Nvidia inference microservices (NIMs) for AI guardrails to its Nvidia NeMo Guardrails software tools. The new microservices aim to help enterprises improve accuracy, security, and control of agentic AI applications, addressing a key reservation IT leaders have about adopting the technology.

Nvidia year in review

January 10, 2025: Last year was Nvidia’s year. Its command of mindshare and market share was unequaled among tech vendors. Here’s a recap of some of the key Nvidia events of 2024 that highlight just how powerful the world’s most dominant chip player is.

Nvidia launches blueprints to help jumpstart AI projects

January 8, 2025: Nvidia recently issued designs for AI factories after hyping up the idea for several months. Now it has come out with AI blueprints, essentially prebuilt templates that give developers a jump start on creating AI systems.

Nvidia’s Project DIGITS puts AI supercomputing chips on the desktop

January 6, 2025: Nvidia is readying a tiny desktop device called Project DIGITS, a “personal AI supercomputer” with a lightweight version of the Grace Blackwell platform found in its most powerful servers; it’s aimed at data scientists, researchers, and students who will be able to prototype, tune, and run large genAI models.

Nvidia unveils generative physical AI platform, agentic AI advances at CES

January 6, 2025: At CES in Las Vegas, Nvidia trumpeted a slew of AI announcements, with an emphasis on generative physical AI that promises a new revolution in factory and warehouse automation. “AI requires us to build an entirely new computing stack to build AI factories, accelerated computing at data center scale,” Rev Lebaredian, vice president of omniverse and simulation technology at Nvidia.

Verizon, Nvidia team up for enterprise AI networking

December 30, 2024: Verizon and Nvidia partnered to build AI services for enterprises that run workloads over Verizon’s 5G private network. The new offering, 5G Private Network with Enterprise AI, will run a range of AI applications and workloads over Verizon’s private 5G network with Mobile Edge Compute (MEC). MEC is a colocated infrastructure that is a part of Verizon’s public wireless network, bringing compute and storage closer to devices and endpoints for ultra-low latency.

Nvidia’s Run:ai acquisition waved through by EU

December 20, 2024: Nvidia will face no objections to its plan to acquire Israeli AI orchestration software vendor Run:ai Labs in Europe, after the European Commission gave the deal its approval today. But Nvidia may not be out of the woods yet. Competition authorities in other markets are closely examining the company’s acquisition strategy.

China launches anti-monopoly probe into Nvidia amid rising US-China chip tensions

December 10, 2024: China has initiated an investigation into Nvidia over alleged violations of the country’s anti-monopoly laws, signaling a potential escalation in the ongoing tech and trade tensions between Beijing and Washington.

Nvidia Blackwell chips face serious heating issues

November 18, 2024: Nvidia’s next-generation Blackwell data center processors have significant problems with overheating when installed in high-capacity server racks, forcing redesigns of the racks themselves, according to a report by The Information. These issues have reportedly led to design changes, meaning delays in shipping product and raising concern that its biggest customers, including Google, Meta, and Microsoft, will be able to deploy Blackwell servers according to their schedules.

Nvidia to power India’s AI factories with tens of thousands of AI chips

October 24, 2024: Nvidia plans to deploy thousands of Hopper GPUs in India to create AI factories and collaborate with Reliance Industries to develop AI infrastructure.. Yotta Data Services, Tata Communications, E2E Networks, and Netweb will lead the AI factories — large-scale data centers for producing AI. Nvidia added that the expansion will provide nearly 180 exaflops of computing power.

Nvidia contributes Blackwell rack design to Open Compute Project

October 15, 2024: Nvidia contributed to the Open Compute Project its Blackwell GB200 NVL72 electro-mechanical designs – including the rack architecture, compute and switch tray mechanicals, liquid cooling and thermal environment specifications, and Nvidia NVLink cable cartridge volumetrics –.

As global AI energy usage mounts, Nvidia claims efficiency gains of up to 100,000X

October 08, 2024: As concerns over AI energy consumption ratchet up, chip maker Nvidia is defending what it calls a steadfast commitment to sustainability. The company reports that its GPUs have experienced a 2,000X reduction in energy use over the last 10 years in training and a 100,000X energy reduction over that same time in generating tokens.

Accenture forms new Nvidia business group focused on agentic AI adoption

October 4, 2024: Accenture and Nvidia announced an expanded partnership focused on helping customers rapidly scale AI adoption. Accenture said the new group will use Accenture’s AI Refinery platform — built on the Nvidia AI stack, including Nvidia AI Foundry, Nvidia AI Enterprise, and Nvidia Omniverse — to help clients create a foundation for use of agentic AI.

IBM expands Nvidia GPU options for cloud customers

October 1, 2024: IBM expanded access to Nvidia GPUs on IBM Cloud to help enterprise customers advance their AI implementations, including large language model (LLM) training. IBM Cloud users can now access Nvidia H100 Tensor Core GPU instances in virtual private cloud and managed Red Hat OpenShift environments.

Oracle to offer 131,072 Nvidia Blackwell GPUs via its cloud

September 12, 2024: Oracle started taking pre-orders for 131,072 Nvidia Blackwell GPUs in the cloud via its Oracle Cloud Infrastructure (OCI) Supercluster to aid large language model (LLM) training and other use cases, the company announced at the CloudWorld 2024 conference.  The launch of an offering that provides these many Blackwell GPUs, also known as Grace Blackwell (GB) 200, is significant as enterprises globally are faced with the unavailability of high-bandwidth memory (HBM) — a key component used in making GPUs.

Why is the DOJ investigating Nvidia?

September 11, 2024: After a stock sell-off following its quarterly earnings report, Nvidia’s pain was aggravated by news that the Department of Justice is escalating its investigation into the company for anticompetitive practices. According to a Bloomberg report, the DOJ sent a subpoena to Nvidia as part of a probe into alleged antitrust practices.

Cisco, HPE, Dell announce support for Nvidia’s pretrained AI workflows

September 4, 2024: Cisco, HPE, and Dell are using Nvidia’s new AI microservices blueprints to help enterprises streamline the deployment of generative AI applications. Nvidia’s announced its NIM Agent Blueprints, a catalogue of pretrained, customizable AI workflows that are designed to provide a jump-start for developers creating AI applications. NIM Agent Blueprints target a number of use cases, including customer service, virtual screening for computer-aided drug discovery, and a multimodal PDF data extraction workflow for retrieval-augmented generation (RAG) that can ingest vast quantities of data.

Nvidia reportedly trained AI models on YouTube data

August 4, 2024: Nvidia scraped huge amounts of data from YouTube to train its AI models, even though neither Youtube nor individual YouTube channels approved the move, according to leaked documents. Among other things, Nvidia reportedly used the YouTube data to train its deep learning model Cosmos, an algorithm for automated driving, a human-like AI avatar, and Omniverse, a tool for building 3D worlds.

Can Intel’s new chips compete with Nvidia in the AI universe?

June 9, 2024: Intel is aiming its next-generation X86 processors at AI tasks, even though the chips won’t actually run AI workloads themselves.mAt Computex, Intel announced its Xeon 6 processor line, talking up what it calls Efficient-cores (E-cores) that it said will deliver up to 4.2 times the performance of Xeon 5 processors. The first Xeon 6 CPU is the Sierra Forest version (6700 series) a more performance-oriented line, Granite Rapids with Performance cores (P-cores or 6900 series), will be released next quarter.

Everyone but Nvidia joins forces for new AI interconnect

May 30, 2024: A clear sign of Nvidia’s dominance is when Intel and AMD link arms to deliver a competing product. That’s what happened when AMD and Intel – along with Broadcom, Cisco, Google, Hewlett Packard Enterprise, Meta and Microsoft – formed the Ultra Accelerator Link (UALink) Promoter Group to develop high-speed interconnections between AI processors.

Nvidia to build supercomputer for federal AI research

May 15, 2024: The U.S. government will use an Nvidia DGX SuperPOD to provide researchers and developers access to much more computing power than they have had in the past to produce generative AI advances in areas such as climate science, healthcare and cybersecurity.

Nvidia, Google Cloud team to boost AI startups

April 11, 2024: Alphabet’s Google Cloud unveiled a slew of new products and services at Google Cloud Next 2024, among them a program to help startups and small businesses build generative AI applications and services. The initiative brings together the Nvidia Inception program for startups and the Google for Startups Cloud Program.

Nvidia GTC 2024 wrap-up: Blackwell not the only big news

March 29, 2024: Nvidia’s GDC is in our rearview mirror, and there was plenty of news beyond the major announcement of the Blackwell architecture and the massive new DGX systems powered by it. Here’s a rundown of some of the announcements you might have missed.

Nvidia expands partnership with hyperscalers to boost AI training and development

March 19, 2024: Nvidia extended its existing partnerships with hyperscalers Amazon Web Services (AWS), Google Cloud Platform, Microsoft Azure, and Oracle Cloud Infrastructure, to make available its latest GPUs and foundational large language models and to integrate its software across their platforms.

Nvidia launches Blackwell GPU architecture

March 18, 2024: Nvidia kicked off its GTC 2024 conference with the formal launch of Blackwell, its next-generation GPU architecture due at the end of the year. Blackwell uses a chiplet design, to a point. Whereas AMD’s designs have several chiplets, Blackwell has two very large dies that are tied together as one GPU with a high-speed interlink that operates at 10 terabytes per second, according to Ian Buck, vice president of HPC at Nvidia.

Cisco, Nvidia target secure AI with expanded partnership

February 9, 2024: Cisco and Nvidia expanded their partnership to offer integrated software and networking hardware that promises to help customers more easily spin up infrastructure to support AI applications. The agreement deepens both companies’ strategy to expand the role of Ethernet networking for AI workloads in the enterprise. It also gives both companies access to each other’s sales and support systems.

Nvidia and Equinix partner for AI data center infrastructure

January 9, 2024: Nvidia partnered with data center giant Equinix to offer what the vendors are calling Equinix Private AI with Nvidia DGX, a turnkey solution for companies that are looking to get into the generative AI game but lack the data center infrastructure and expertise to do it.

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

AI agent traffic drives first profitable year for Fastly

Fetcher bots, which retrieve content in real time when users make queries to AI assistants, show different concentration patterns. OpenAI’s ChatGPT and related bots generated 68% of fetcher bot requests. In some cases, fetcher bot request volumes exceeded 39,000 requests per minute to individual sites. AI agents check multiple websites

Read More »

IBM Research: When AI and quantum merge

IBM’s research laboratory in Zurich. A look inside an IBM Quantum System Two. Advances in tape development. On the left is a quantum-secure tape drive. Scanning tunneling microscope in one of the Zurich laboratories. The innovation won the Nobel Prize in Physics in 1986.

Read More »

Oil Posts Second Straight Weekly Drop

Oil notched its first back-to-back weekly drop this year as traders weighed the prospect of expanded OPEC+ supplies against US-Iran nuclear talks and recent weakness in wider markets. West Texas Intermediate fell 1% for the week and ended the day little changed on Friday. President Donald Trump said the US deployed an additional aircraft carrier to the Middle East in case a nuclear deal is not reached with Iran. “If we don’t have a deal, we’ll need it,” Trump said at the White House. He added he thinks negotiations will ultimately be successful. Traders have been watching for any uptick in tensions between Washington and Tehran that could pose a threat to supply from the Middle East. The commodity was down earlier as OPEC+ members see scope for output increases to resume in April, believing concerns about a glut are overblown, delegates said. The group has not yet committed to any course of action or begun formal discussions for a March 1 meeting, they added. A second weekly decline in the futures market stands to snap a long run of gains for early 2026, when recurrent bouts of geopolitical tension including the US stand-off with Iran supported oil prices. At an energy conference in London this week, attendees flagged that they expect worldwide supplies to top demand this year, potentially feeding into higher inventories in the Atlantic basin, the region where global prices are set. Still, a pile-up of sanctioned oil coupled with supply disruptions in various nations has limited the impact thus far. Trading may be thinner ahead of the Presidents’ Day holiday in the US, contributing to exaggerated price swings. Oil Prices WTI for March delivery settled up 0.1% at $62.89 a barrel in New York. Brent for April settlement edged 0.3% higher to $67.75 a barrel. What

Read More »

Reliance Gets USA License to Directly Buy VEN Crude

Indian refiner Reliance Industries Ltd. has received a general license from the US government that will allow it to purchase Venezuelan oil directly, according to a person familiar with the matter.  Reliance, owned by billionaire Mukesh Ambani, applied for the permit last month and received it from the Treasury Department a few days ago, the person said, asking not to be named as the matter is not public. The move comes immediately on the heels of a trade deal with the US that slashes punitive tariffs for Indian exports but demands that the country stop importing discounted Russian oil. The Indian government has asked state-owned refiners to consider buying more Venezuelan crude, as well as oil from the US.  Venezuela is unlikely to produce large volumes of crude anytime soon, but even limited supplies provide a fallback option for India’s largest refiner. The US — which has stepped up involvement in Venezuela’s oil sector after capturing the country’s president last month — has been considering general licenses to permit purchases, trading and investment in a sprawling but threadbare industry. Reliance is the first Indian refiner to receive clearance in the current push.  Reliance has historically been an important consumer of the country’s heavy crude, having struck a term deal to secure as much as 400,000 barrels a day from Petroleos de Venezuela SA in 2012. It is among only a handful of refiners in India that have the capacity to process the high-viscosity, sour oil, which is difficult to extract and refine without diluent.  The Indian refining giant took about 25% of Venezuela’s exports in 2019, before its term deal got suspended in 2019 due to US sanctions. It last received a general license in 2024 and took crude until that expired last year, and was not renewed. Reuters first reported the issuance of

Read More »

Baker Hughes Explores $1.5B Sale of Waygate Unit

Baker Hughes Co. is exploring a potential sale of its Waygate Technologies unit, which provides industrial testing and inspection equipment, people with knowledge of the matter said.  The world’s second-biggest oilfield contractor is working with advisers to study a possible divestment of the Waygate business, which could fetch around $1.5 billion, according to the people. A sale process could kick off in the next few months and attract interest from private equity firms, the people said, asking not to be identified because the information is private.  Deliberations are ongoing and there’s no certainty they will lead to a transaction, the people said. A representative for Baker Hughes declined to comment.  Waygate, based in Hürth, Germany, makes radiographic testing systems, industrial CT scanners, remote visual inspection machines and ultrasonic testing devices. It operates in more than 80 countries and is known for brands including Krautkrämer, phoenix|x-ray, Seifert, Everest and Agfa NDT.  The company was started in 2004 as GE Inspection Technologies. It’s been under the current ownership since 2017, when General Electric Co. combined its oil and gas division with Baker Hughes in a $32 billion deal.  Baker Hughes is selling the non-core asset after agreeing last year to buy industrial equipment maker Chart Industries Inc. for about $9.6 billion in one of its biggest-ever acquisitions. Chief Executive Officer Lorenzo Simonelli said in October last year that Baker Hughes is undertaking a “comprehensive evaluation” of its capital allocation focus following the Chart deal in order to boost shareholder value.  The pending sale would join other sizeable corporate divestments in Europe. Volkswagen AG has launched the sale of a majority stake in its heavy diesel engine maker Everllence, while Continental AG is selling its Contitech business. WHAT DO YOU THINK? Generated by readers, the comments included herein do not reflect the views and opinions

Read More »

EIA Raises 2026 WTI Forecast, Lowers 2027 Projection

The U.S. Energy Information Administration (EIA) increased its 2026 West Texas Intermediate (WTI) crude oil average spot price forecast, and lowered its 2027 projection, in its latest short term energy outlook (STEO). According to the EIA’s February STEO, which was released on February 10, the EIA now sees the WTI spot price averaging $53.42 per barrel this year and $49.34 per barrel next year. In its previous STEO, which was released in January, the EIA projected that the WTI spot price would average $52.21 per barrel in 2026 and $50.36 per barrel in 2027. A quarterly breakdown included in the EIA’s latest STEO projected that the WTI average spot price will come in at $58.62 per barrel in the first quarter of this year, $53.65 per barrel in the second quarter, $51.69 per barrel in the third quarter, $50.00 per barrel in the fourth quarter, $49.00 per barrel in the first quarter of next year, $49.66 per barrel in the second quarter, $49.68 per barrel in the third quarter, and $49.00 per barrel in the fourth quarter of 2027. In its previous STEO, the EIA forecast that the WTI spot price would average $54.93 per barrel in the first quarter of this year, $52.67 per barrel in the second quarter, $52.03 per barrel in the third quarter, $49.34 per barrel in the fourth quarter, $49.00 per barrel in the first quarter of next year, $50.66 per barrel in the second quarter, $50.68 per barrel in the third quarter, and $51.00 per barrel in the fourth quarter of 2027. In a BMI report sent to Rigzone by the Fitch Group on Friday, BMI projected that the front month WTI crude price will average $64.00 per barrel in 2026 and $68.00 per barrel in 2027. Standard Chartered sees the NYMEX WTI nearby

Read More »

Some OPEC+ Members See Scope to Resume Hikes in April

Some OPEC+ members see scope for the alliance to resume supply increases in April, believing concerns of a glut in global oil markets to be overblown. The group led by Saudi Arabia and Russia hasn’t committed to any course of action or begun formal discussions ahead of its meeting on March 1, according to several delegates, who asked not to be identified as the process is private. Their ultimate decision may depend on whether US President Donald Trump launches military action against — or reaches a nuclear deal with — OPEC member Iran, one added.  Nonetheless, some nations in the Organization of the Petroleum Exporting Countries and its allies said they see room to resume the output increases the coalition paused during the seasonal demand slowdown of the first quarter.  Trump’s assertive stance toward OPEC members Venezuela and Iran, along with disruptions spanning from North America to Kazakhstan, drove oil prices to a strong start of the year despite warnings of a supply glut. Several top traders have said that prices are supported by tightness in key markets, as many of the surplus barrels are from producers subject to sanctions like Russia and Iran, and thus remain unavailable to a wider pool of buyers. That has made the market surprisingly resilient. Brent futures are up 11% this year, after spiking to a six-month high near $72 a barrel at the end of January over concerns a conflict might erupt in the Middle East. Oil inventories piled up last year at the fastest pace since the 2020 pandemic amid swelling output from both OPEC+ and its competitors in the Americas, according to the International Energy Agency, though the impact on prices was tempered as China scooped up barrels for its strategic reserves. Last April, the Saudis stunned crude traders by steering OPEC+ to

Read More »

ADNOC Drilling Delivers ‘Best Year on Record’

In a release sent to Rigzone on Thursday, ADNOC Drilling said it had delivered its “best year on record with 2025 net profit of $1.45 billion”. “ADNOC Drilling Company PJSC announced today record fourth quarter and full year 2025 results, marking a step change in scale, technology-enabled performance and excellence in execution,” the company noted in the release. “This performance represents the strongest in the company’s history, reflecting high asset utilization and continued growth across integrated drilling and oilfield services, and driven by strong operational execution across the fleet,” it added. ADNOC Drilling highlighted in the release that its 2025 net profit marked an 11 percent year on year increase. In 2025, the company reported revenue of $4.9 billion, which it pointed out was a 22 percent increase year on year, and EBITDA of $2.2 billion, which it highlighted was an increase of nine percent year on year. “2025 was a defining year for ADNOC Drilling,” ADNOC Drilling CEO Abdulla Ateya Al Messabi said in the release. “Our record breaking results were delivered by our people, whose discipline, innovation and commitment to operational excellence and safety underpin every milestone we achieve,” Messabi added. “Our resilience as a business, built on strong systems, disciplined operations and the ability to adapt at pace, continues to reinforce our competitive strength,” the CEO continued. “Through execution excellence, technology‑led efficiency and a disciplined approach to capital allocation and operations, we continue our transformation into the region’s most advanced energy services company,” Messabi said. “By expanding across the GCC, pioneering AI‑driven operations and setting new benchmarks in sustainability, we are unlocking value and helping power the UAE’s energy future. This is just the beginning of a new era of growth, innovation and impact,” Messabi went on to note. In a release posted on its site in November last year, ADNOC Drilling announced that

Read More »

FTC digs deeper into Microsoft’s bundling and licensing practices

Relationship with OpenAI While much of the initial query, and subsequent ones, have focused on licensing and bundling, the FTC is also looking into the company’s relationship with OpenAI, and raising questions about Microsoft’s data centers, capacity constraints, and AI spending and research. Notably, the tech giant’s initial $1 billion investment in OpenAI has grown into a multi-billion-dollar partnership, with Microsoft rolling out ChatGPT-powered features across its product line in 2023. The FTC is examining whether the relationship is an undisclosed merger that should have been subject to antitrust review. Further, the federal agency is scrutinizing Microsoft’s alleged decision to scale back its own AI research following the OpenAI investment, potentially reducing competition. Ultimately, all of this recalls the industry-shaping 1990s US federal investigation into Microsoft’s monopoly of desktop software and web browsers. A federal judge ruled at the time that the company deliberately built the Internet Explorer (IE) browser into Windows to edge out rivals like the now-defunct Netscape. And, analysts note, it’s an indication that Microsoft hasn’t learned from those past lessons. “While technology and trends may have evolved since Microsoft’s first anti-trust case in 1998, where they were forced to unbundle IE from Windows OS, their tactics have stayed remarkably the same,” Bickley noted.

Read More »

Nvidia: Latest news and insights

Nvidia: ‘Graphics 3.0’ will drive physical AI productivity August 15, 2025: Nvidia has floated the idea of “Graphics 3.0” with the hope of making AI-generated graphics central to physical productivity. The concept revolves around graphics created by genAI tools. Nvidia say AI-generated graphics could help in training robots to do their jobs in the physical world or by helping AI assistants automate the creation of equipment and structures. Nvidia launches Blackwell-powered RTX Pro GPUs for compact AI workstations August 12, 2025: Nvidia announced two new professional GPUs, the RTX Pro 4000 Small Form Factor (SFF) and the RTX Pro 2000. Built on its Blackwell architecture, Nvidia’s new GPUs aim to deliver powerful AI capabilities in compact desktop and workstation deployments. Nvidia’s new genAI model helps robots think like humans August 11, 2025: Nvidia has developed a genAI model to help robots make human-like decisions by analyzing surrounding scenes. The Cosmos Reason model in robots can take in information from video and graphics input, analyze the data, and use its understanding to make decisions. Nvidia patches critical Triton server bugs that threaten AI model security August 5, 2025: A surprising attack chain in Nvidia’s Triton Inference Server, starting with a seemingly minor memory-name leak, could allow full remote server takeover without user authentication. China demands ‘security evidence’ from Nvidia over H20 chip backdoor fears August 4, 2025: China escalated pressure on Nvidia with the state-controlled People’s Daily publishing an opinion piece titled “Nvidia, how can I trust you?” — a day after regulators summoned company officials over alleged security vulnerabilities in H20 artificial intelligence chips. Nvidia to restart H20 exports to China, unveils new export-compliant GPU July 15, 2025: Nvidia will restart H20 AI chip sales to China and release a new GPU model compliant with export rules, a move that could impact global AI

Read More »

Starcloud prepares to launch AWS Outpost into space

One executive skeptical of the idea of data centers in space is AWS’s own CEO, Matt Garman. “There are not enough rockets to launch a million satellites yet, so we’re, like, pretty far from that. If you think about the cost of getting a payload in space today, it’s massive,” Garman told attendees at the Cisco AI summit, according to a Reuters report. Garman is just one of many critics of the notion that data centers can be a viable alternative. Issues such as collisions with space debris, the difficulty of supplying water as a coolant, the impossibility of fixing hardware issues and latency have all been highlighted as potential problems.

Read More »

Data center capex to hit $1.7 trillion by 2030 due to AI boom

Asked on Thursday about the fact that capex is expected to approach the $1 trillion mark in 2026, he said it is somewhat surprising. “Last year, I thought it would take at least three years to get to that trillion dollar mark,” he said. “It seems increases are supported by the result of larger models needed for training infrastructure, and in turn, you need inference as well. You also need a supporting infrastructure in storage, networking, power, and cooling.” AI, he said, has become “the tide that lift all boats, meaning that in addition to the core accelerated compute, AI also positively impacts complementary infrastructure, such as storage, networking, and physical infrastructure.” Fung added that while much of the achievement of projected spending estimates will depend on whether or not this growth is sustainable, he pointed out, “it seems like the large hyperscalers have a lot of weight in optimizing cash flow and cost structures. They’re trying to get as creative as possible, generally moving towards a more vertical, integrated stack with their own custom networking and external financing, which would help  [create] more sustainable deployments and operations.” Enterprises thinking of expanding their own infrastructure can learn from this growth. In a recent article on the hyper spending of hyperscalers, Greyhound Research chief analyst Sanchit Vir Gogia said their capex spending  levels can help pinpoint where the hyperscalers are expecting bottlenecks, which is useful information for enterprises planning their own cloud strategy across multiple geographies. These and other factors can help enterprises plan their own execution timelines, he said. This article originally appeared on CIO.com.

Read More »

Cisco highlights memory costs, Silicon One growth in Q2 recap

“AI infrastructure orders taken from hyperscalers totaled $2.1 billion in Q2 compared to $1.3 billion just last quarter and equal to the total orders taken in all of fiscal year ’25, marking another significant acceleration in growth across our silicon, systems and optics,” Robbins said. “Given the strong demand for our Silicon One systems and optics, we now expect to take AI orders in excess of $5 billion and to recognize over $3 billion in AI infrastructure revenue from hyperscalers in FY ’26.” Regarding enterprise uptake, Robbins said Cisco took in $350 million in AI orders from enterprise customers in Q2 and has a pipeline in excess of $2.5 billion for its high-performance AI infrastructure portfolio. Cisco is seeing early enterprise use cases for AI around fraud detection and video analytics in sectors such as financial, manufacturing and pharmaceuticals, for example. “I also see examples in retail, where customers are leveraging agents on mobile devices in retail to help their staff do a better job engaging with their customers. We’re seeing a combination of both investment in cloud-based architectures as well as on prem,” Robbins said. Networking rules Cisco is experiencing a faster-than-historical ramp-up of next-generation platforms, including its Catalyst 9K, Wi-Fi 7, and smart switches, stated Sebastien Naji, a research analyst with William Blair, in a report after the call. He attributed it to three factors: an accelerated refresh cycle in the data center; early AI-readiness efforts in the enterprise; and end-of-support for legacy Catalyst and Nexus switches.  “We are seeing strong demand for our next-generation switching, routing and wireless products, which continue to ramp faster than prior product launches. We’re delivering AI-native capabilities across these products, including weaving security into the fabric of the network and modernizing the operational stack of campus networks,” Robbins said. Co-packaged optics? When asked

Read More »

Energy providers seek flexible load strategies for data center operations

“In theory, yes, they’d have to wait a little bit longer while their queries are routed to a data center that has capacity,” said Lawrence. The one thing the industry cannot do is operate like it has in the past, where data center power was tuned and then forgotten for six months. Previously, data centers would test their power sources once or twice a year. They don’t have that luxury anymore. They need to check their power sources and loads far more regularly, according to Lawrence. “I think that for that for the data center industry to continue to survive like we all need it, there’s going to have to be some realignment on the incentives to why somebody would become flexible,” said Lawrence. The survey suggests that utilities and load operators expect to expand their demand response activities and budgets in the near term. Sixty-three percent of respondents anticipate DR program funding to grow by 50% or more over the next three years. While they remain a major source of load growth and system strain, 57% of respondents indicate that onsite power generation from data centers will be most important to improving grid stability over the next five years. One of the proposed fixes to the power shortage has been small modular nuclear reactors. These have gained a lot of traction in the marketplace even if they have nothing to sell yet. But Lawrence said that that’s not an ideal solution for existing power generators, ironically enough.

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »