Stay Ahead, Stay ONMINE

Why Data Scientists Should Care about Containers — and Stand Out with This Knowledge

“I train models, analyze data and create dashboards — why should I care about Containers?” Many people who are new to the world of data science ask themselves this question. But imagine you have trained a model that runs perfectly on your laptop. However, error messages keep popping up in the cloud when others access […]

“I train models, analyze data and create dashboards — why should I care about Containers?”

Many people who are new to the world of data science ask themselves this question. But imagine you have trained a model that runs perfectly on your laptop. However, error messages keep popping up in the cloud when others access it — for example because they are using different library versions.

This is where containers come into play: They allow us to make machine learning models, data pipelines and development environments stable, portable and scalable — regardless of where they are executed.

Let’s take a closer look.

Table of Contents
1 — Containers vs. Virtual Machines: Why containers are more flexible than VMs
2 — Containers & Data Science: Do I really need Containers? And 4 reasons why the answer is yes.
3 — First Practice, then Theory: Container creation even without much prior knowledge
4 — Your 101 Cheatsheet: The most important Docker commands & concepts at a glance
Final Thoughts: Key takeaways as a data scientist
Where Can You Continue Learning?

1 — Containers vs. Virtual Machines: Why containers are more flexible than VMs

Containers are lightweight, isolated environments. They contain applications with all their dependencies. They also share the kernel of the host operating system, making them fast, portable and resource-efficient.

I have written extensively about virtual machines (VMs) and virtualization in ‘Virtualization & Containers for Data Science Newbiews’. But the most important thing is that VMs simulate complete computers and have their own operating system with their own kernel on a hypervisor. This means that they require more resources, but also offer greater isolation.

Both containers and VMs are virtualization technologies.

Both make it possible to run applications in an isolated environment.

But in the two descriptions, you can also see the 3 most important differences:

  • Architecture: While each VM has its own operating system (OS) and runs on a hypervisor, containers share the kernel of the host operating system. However, containers still run in isolation from each other. A hypervisor is the software or firmware layer that manages VMs and abstracts the operating system of the VMs from the physical hardware. This makes it possible to run multiple VMs on a single physical server.
  • Resource consumption: As each VM contains a complete OS, it requires a lot of memory and CPU. Containers, on the other hand, are more lightweight because they share the host OS.
  • Portability: You have to customize a VM for different environments because it requires its own operating system with specific drivers and configurations that depend on the underlying hardware. A container, on the other hand, can be created once and runs anywhere a container runtime is available (Linux, Windows, cloud, on-premise). Container runtime is the software that creates, starts and manages containers — the best-known example is Docker.
Created by the author

You can experiment faster with Docker — whether you’re testing a new ML model or setting up a data pipeline. You can package everything in a container and run it immediately. And you don’t have any “It works on my machine”-problems. Your container runs the same everywhere — so you can simply share it.

2 — Containers & Data Science: Do I really need Containers? And 4 reasons why the answer is yes.

As a data scientist, your main task is to analyze, process and model data to gain valuable insights and predictions, which in turn are important for management.

Of course, you don’t need to have the same in-depth knowledge of containers, Docker or Kubernetes as a DevOps Engineer or a Site Reliability Engineer (SRE). Nevertheless, it is worth having container knowledge at a basic level — because these are 4 examples of where you will come into contact with it sooner or later:

Model deployment

You are training a model. You not only want to use it locally but also make it available to others. To do this, you can pack it into a container and make it available via a REST API.

Let’s look at a concrete example: Your trained model runs in a Docker container with FastAPI or Flask. The server receives the requests, processes the data and returns ML predictions in real-time.

Reproducibility and easier collaboration

ML models and pipelines require specific libraries. For example, if you want to use a deep learning model like a Transformer, you need TensorFlow or PyTorch. If you want to train and evaluate classic machine learning models, you need Scikit-Learn, NumPy and Pandas. A Docker container now ensures that your code runs with exactly the same dependencies on every computer, server or in the cloud. You can also deploy a Jupyter Notebook environment as a container so that other people can access it and use exactly the same packages and settings.

Cloud integration

Containers include all packages, dependencies and configurations that an application requires. They therefore run uniformly on local computers, servers or cloud environments. This means you don’t have to reconfigure the environment.

For example, you write a data pipeline script. This works locally for you. As soon as you deploy it as a container, you can be sure that it will run in exactly the same way on AWS, Azure, GCP or the IBM Cloud.

Scaling with Kubernetes

Kubernetes helps you to orchestrate containers. But more on that below. If you now get a lot of requests for your ML model, you can scale it automatically with Kubernetes. This means that more instances of the container are started.

3 — First Practice, then Theory: Container creation even without much prior knowledge

Let’s take a look at an example that anyone can run through with minimal time — even if you haven’t heard much about Docker and containers. It took me 30 minutes.

We’ll set up a Jupyter Notebook inside a Docker container, creating a portable, reproducible Data Science environment. Once it’s up and running, we can easily share it with others and ensure that everyone works with the exact same setup.

0 — Install Docker Dekstop and create a project directory

To be able to use containers, we need Docker Desktop. To do this, we download Docker Desktop from the official website.

Now we create a new folder for the project. You can do this directly in the desired folder. I do this via Terminal — on Windows with Windows + R and open CMD.

We use the following command:

Screenshot taken by the author

1. Create a Dockerfile

Now we open VS Code or another editor and create a new file with the name ‘Dockerfile’. We save this file without an extension in the same directory. Why doesn’t it need an extension?

We add the following code to this file:

# Use the official Jupyter notebook image with SciPy
FROM jupyter/scipy-notebook:latest  

# Set the working directory inside the container
WORKDIR /home/jovyan/work  

# Copy all local files into the container
COPY . .

# Start Jupyter Notebook without token
CMD ["start-notebook.sh", "--NotebookApp.token=''"]

We have thus defined a container environment for Jupyter Notebook that is based on the official Jupyter SciPy Notebook image.

First, we define with FROM on which base image the container is built. jupyter/scipy-notebook:latest is a preconfigured Jupyter notebook image and contains libraries such as NumPy, SiPy, Matplotlib or Pandas. Alternatively, we could also use a different image here.

With WORKDIR we set the working directory within the container. /home/jovyan/work is the default path used by Jupyter. User jovyan is the default user in Jupyter Docker images. Another directory could also be selected — but this directory is best practice for Jupyter containers.

With COPY . . we copy all files from the local directory — in this case the Dockerfile, which is located in the jupyter-docker directory — to the working directory /home/jovyan/work in the container.

With CMD [“start-notebook.sh”, “ — NotebookApp.token=‘’’”] we specify the default start command for the container, specify the start script for Jupyter Notebook and define that the notebook is started without a token — this allows us to access it directly via the browser.

2. Create the Docker image

Next, we will build the Docker image. Make sure you have the previously installed Docker desktop open. We now go back to the terminal and use the following command:

cd jupyter-docker
docker build -t my-jupyter .

With cd jupyter-docker we navigate to the folder we created earlier. With docker build we create a Docker image from the Dockerfile. With -t my-jupyter we give the image a name. The dot means that the image will be built based on the current directory. What does that mean? Note the space between the image name and the dot.

The Docker image is the template for the container. This image contains everything needed for the application such as the operating system base (e.g. Ubuntu, Python, Jupyter), dependencies such as Pandas, Numpy, Jupyter Notebook, the application code and the startup commands. When we “build” a Docker image, this means that Docker reads the Dockerfile and executes the steps that we have defined there. The container can then be started from this template (Docker image).

We can now watch the Docker image being built in the terminal.

Screenshot taken by the author

We use docker images to check whether the image exists. If the output my-jupyter appears, the creation was successful.

docker images

If yes, we see the data for the created Docker image:

Screenshot taken by the author

3. Start Jupyter container

Next, we want to start the container and use this command to do so:

docker run -p 8888:8888 my-jupyter

We start a container with docker run. First, we enter the specific name of the container that we want to start. And with -p 8888:8888 we connect the local port (8888) with the port in the container (8888). Jupyter runs on this port. I do not understand.

Alternatively, you can also perform this step in Docker desktop:

Screenshot taken by the author

4. Open Jupyter Notebook & create a test notebook

Now we open the URL [http://localhost:8888](http://localhost:8888/) in the browser. You should now see the Jupyter Notebook interface.

Here we will now create a Python 3 notebook and insert the following Python code into it.

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(0, 10, 100)
y = np.sin(x)

plt.plot(x, y)
plt.title("Sine Wave")
plt.show()

Running the code will display the sine curve:

Screenshot taken by the author

5. Terminate the container

At the end, we end the container either with ‘CTRL + C’ in the terminal or in Docker Desktop.

With docker ps we can check in the terminal whether containers are still running and with docker ps -a we can display the container that has just been terminated:

Screenshot taken by the author

6. Share your Docker image

If you now want to upload your Docker image to a registry, you can do this with the following command. This will upload your image to Docker Hub (you need a Docker Hub account for this). You can also upload it to a private registry of AWS Elastic Container, Google Container, Azure Container or IBM Cloud Container.

docker login

docker tag my-jupyter your-dockerhub-name/my-jupyter:latest

docker push dein-dockerhub-name/mein-jupyter:latest

If you then open Docker Hub and go to your repositories in your profile, the image should be visible.

This was a very simple example to get started with Docker. If you want to dive a little deeper, you can deploy a trained ML model with FastAPI via a container.

4 — Your 101 Cheatsheet: The most important Docker commands & concepts at a glance

You can actually think of a container like a shipping container. Regardless of whether you load it onto a ship (local computer), a truck (cloud server) or a train (data center) — the content always remains the same.

The most important Docker terms

  • Container: Lightweight, isolated environment for applications that contains all dependencies.
  • Docker: The most popular container platform that allows you to create and manage containers.
  • Docker Image: A read-only template that contains code, dependencies and system libraries.
  • Dockerfile: Text file with commands to create a Docker image.
  • Kubernetes: Orchestration tool to manage many containers automatically.

The basic concepts behind containers

  • Isolation: Each container contains its own processes, libraries and dependencies
  • Portability: Containers run wherever a container runtime is installed.
  • Reproducibility: You can create a container once and it runs exactly the same everywhere.

The most basic Docker commands

docker --version # Check if Docker is installed
docker ps # Show running containers
docker ps -a # Show all containers (including stopped ones)
docker images # List of all available images
docker info # Show system information about the Docker installation

docker run hello-world # Start a test container
docker run -d -p 8080:80 nginx # Start Nginx in the background (-d) with port forwarding
docker run -it ubuntu bash # Start interactive Ubuntu container with bash

docker pull ubuntu # Load an image from Docker Hub
docker build -t my-app . # Build an image from a Dockerfile

Final Thoughts: Key takeaways as a data scientist

👉 With Containers you can solve the “It works on my machine” problem. Containers ensure that ML models, data pipelines, and environments run identically everywhere, independent of OS or dependencies.

👉 Containers are more lightweight and flexible than virtual machines. While VMs come with their own operating system and consume more resources, containers share the host operating system and start faster.

👉 There are three key steps when working with containers: Create a Dockerfile to define the environment, use docker build to create an image, and run it with docker run — optionally pushing it to a registry with docker push.

And then there’s Kubernetes.

A term that comes up a lot in this context: An orchestration tool that automates container management, ensuring scalability, load balancing and fault recovery. This is particularly useful for microservices and cloud applications.

Before Docker, VMs were the go-to solution (see more in ‘Virtualization & Containers for Data Science Newbiews’.) VMs offer strong isolation, but require more resources and start slower.

So, Docker was developed in 2013 by Solomon Hykes to solve this problem. Instead of virtualizing entire operating systems, containers run independently of the environment — whether on your laptop, a server or in the cloud. They contain all the necessary dependencies so that they work consistently everywhere.

I simplify tech for curious minds🚀 If you enjoy my tech insights on Python, data science, Data Engineering, machine learning and AI, consider subscribing to my substack.

Where Can You Continue Learning?

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

IBM Research: When AI and quantum merge

IBM’s research laboratory in Zurich. A look inside an IBM Quantum System Two. Advances in tape development. On the left is a quantum-secure tape drive. Scanning tunneling microscope in one of the Zurich laboratories. The innovation won the Nobel Prize in Physics in 1986.

Read More »

Petronas launches Malaysia Bid Round 2026

@import url(‘https://fonts.googleapis.com/css2?family=Inter:[email protected]&display=swap’); a { color: var(–color-primary-main); } .ebm-page__main h1, .ebm-page__main h2, .ebm-page__main h3, .ebm-page__main h4, .ebm-page__main h5, .ebm-page__main h6 { font-family: Inter; } body { line-height: 150%; letter-spacing: 0.025em; font-family: Inter; } button, .ebm-button-wrapper { font-family: Inter; } .label-style { text-transform: uppercase; color: var(–color-grey); font-weight: 600; font-size: 0.75rem; } .caption-style

Read More »

Intel says Google engineers spotted Xeon vulnerabilities

“In a perfect world, the [Trusted Computer Base] would be bug-free; in reality, the complexity of modern systems makes continuous assessment essential. Collaborative reviews allow industry leaders to proactively fix vulnerabilities while fostering transparency for everyone who relies on the technology,” Google researchers wrote. The main problem arose when using

Read More »

How Cisco’s platform mindset is meeting the AI era

4. Sovereignty, trust, and the rise of sovereign AI In EMEA, trust and sovereignty were more than talk—they were central to almost every discussion. This came across loud and clear at the event and in Davos in January. Cisco emphasized four dimensions of trust: security, innovation, execution, and sovereign control.​

Read More »

Secretary Wright Delivers Remarks Alongside Interim Venezuelan President Delcy Rodriguez

CARACAS, VENEZUELA—U.S. Secretary of Energy Chris Wright deliveredremarks today alongside Interim President Delcy Rodriguez after meetings at the Miraflores Palace in Caracas. Secretary Wright’s full remarks are below: Thank you so much, Interim President Rodriguez. It is an honor to stand here with you today, and to be among the tremendous people of Venezuela. As Interim President Rodriguez mentioned, our countries share a long history. It has gone through different chapters as all relationships do. But today, I bring a message from President Trump: He is passionately committed to absolutely transforming the relationship between the United States and Venezuela. This is part of a broader agenda to make the Americas great again, to bring our countries closer together, and to bring commerce, peace, prosperity, jobs, and opportunity to the people of Venezuela in partnership with the United States. These are not just words or ambitions. We have very specific plans and very specific actions already. This is President Trump’s broader agenda: peace, commerce, and trade, not conflict, not military action, not what has dominated so much of our world. Whether it’s in the Middle East, whether it’s in South Asia, or maybe, most importantly, in the Americas, we want commerce, we want peace, we want prosperity, we want security. Our government in Washington, DC, has been working 7 days a week to issue licenses, so existing businesses in Venezuela, new businesses that want to enter Venezuela, Venezuelan national companies can buy products, invest money, raise oil production, create new jobs, grow export revenue. All of the things that have constricted the Venezuelan economy, we want to set the Venezuelan people and the economy free. We had very wonderful and candid dialogues today. We both spoke very candidly about the tremendous opportunities in front of us, and some of the problems

Read More »

Energy Department Announces 26 Genesis Mission Science and Technology Challenges to Accelerate AI-Enabled American Innovation and Leadership

WASHINGTON—The U.S. Department of Energy (DOE) today announced 26 science and technology challenges of national importance to advance the Genesis Mission and accelerate innovation and discovery through artificial intelligence (AI). Building on President Trump’s Executive Orders Launching The Genesis Mission and Removing Barriers to American Leadership In Artificial Intelligence, the challenges span DOE’s discovery science, energy, and national security missions. Each was selected for its potential to deliver measurable benefits for the American people and to accelerate advancements through the Genesis Mission’s AI platforms, world-class facilities, and public-private partnerships. “These challenges represent a bold step toward a future where science moves at the speed of imagination because of AI. It’s a game-changer for science, energy, and national security,” said DOE Under Secretary for Science and Genesis Mission Lead Dr. Darío Gil. “By uniting the U.S. Government’s unparalleled data resources and DOE’s experimental facilities with cutting-edge AI, we can unlock discoveries that will power the economy, secure our energy future, and keep America at the forefront of global innovation.” “President Trump’s Genesis Mission is mobilizing America’s unmatched scientific infrastructure and AI ingenuity to double the pace of discovery. These 26 challenges are a direct call to action to America’s researchers and innovators to join the Genesis Mission and deliver science and technology breakthroughs that will benefit the American people,” said Assistant to the President and Director of The White House Office of Science and Technology Policy Michael Kratsios. “We look forward to expanding the list of challenges across Federal agencies to bring even greater impact to the Mission.” Working in partnership with DOE’s National Laboratories, industry, and academia, these efforts will deliver tangible results for the American people. Examples include: Scaling the Grid to Power the American Economy: Using AI to improve power grid planning, interconnection, operations, and security — enabling decisions up to 20–100 times faster and improving electricity cost and reliability by up to 10 percent. Harnessing America’s Historic Nuclear Data: Digitizing eight decades of

Read More »

Oil Drops Amid Supply And Equity Jitters

Oil slid as risk-averse sentiment pervaded global markets and investors digested fresh developments in US-Iran tensions that continue to cloud the supply outlook. West Texas Intermediate fell 2.8% to end the day below $63 a barrel, as equities weakened amid concerns over technology profits. US President Donald Trump said negotiations with Iran over its nuclear program could stretch for as long as a month, but it would be “very traumatic” for Tehran if it failed to reach an agreement. Traders remain concerned about the potential for military strikes and risks to supply from the Middle East. Crude has gained every week this year, with a single exception, as geopolitical risks and supply disruptions drove futures higher. The US intervened in Venezuela in January, then pivoted to Iran after a wave of protests challenged the Islamic Republic’s leadership. A vast hoard of sanctioned oil at sea is also keeping a floor under prices as buyers compete for other barrels. Still, banks maintain that there is abundant supply, with Goldman Sachs Group Inc. saying that the surplus was appearing, but mainly in locations that are less significant for price-setting. The International Energy Agency said on Thursday that global oil stockpiles grew at the strongest pace since 2020 last year, underscoring the view that a period of oversupply has arrived, even if it is not being felt evenly across the globe. As the US-Iranian tensions play out, the Pentagon has positioned a naval force in the region. Israeli President Benjamin Netanyahu met with Trump on Wednesday, with the US leader saying his “preference” is to reach a deal with Tehran on its nuclear ambitions. Netanyahu visited Washington in a bid to warn Trump against such a move, instead hoping to press him to endorse a more sweeping rollback of Iran’s military influence in

Read More »

Verde Clean Fuels suspends Permian basin GTG project

@import url(‘https://fonts.googleapis.com/css2?family=Inter:[email protected]&display=swap’); a { color: var(–color-primary-main); } .ebm-page__main h1, .ebm-page__main h2, .ebm-page__main h3, .ebm-page__main h4, .ebm-page__main h5, .ebm-page__main h6 { font-family: Inter; } body { line-height: 150%; letter-spacing: 0.025em; font-family: Inter; } button, .ebm-button-wrapper { font-family: Inter; } .label-style { text-transform: uppercase; color: var(–color-grey); font-weight: 600; font-size: 0.75rem; } .caption-style { font-size: 0.75rem; opacity: .6; } #onetrust-pc-sdk [id*=btn-handler], #onetrust-pc-sdk [class*=btn-handler] { background-color: #c19a06 !important; border-color: #c19a06 !important; } #onetrust-policy a, #onetrust-pc-sdk a, #ot-pc-content a { color: #c19a06 !important; } #onetrust-consent-sdk #onetrust-pc-sdk .ot-active-menu { border-color: #c19a06 !important; } #onetrust-consent-sdk #onetrust-accept-btn-handler, #onetrust-banner-sdk #onetrust-reject-all-handler, #onetrust-consent-sdk #onetrust-pc-btn-handler.cookie-setting-link { background-color: #c19a06 !important; border-color: #c19a06 !important; } #onetrust-consent-sdk .onetrust-pc-btn-handler { color: #c19a06 !important; border-color: #c19a06 !important; } <!–> Verde Clean Fuels Inc., Houston, has suspended development of a natural gas-to-gasoline (GTG) project in the Permian basin. The company cited “changing market conditions driven by increasing demand for natural gas,” in the region. ]–> <!–> –><!–> –> June 5, 2024 In 2024, Verde Clean Fuels and Midland-based Diamondback Energy Inc. subsidiary Cottonmouth Ventures LLC agreed to a joint development of a GTG plant in Martin County, Tex., in the Permian’s Midland basin. The project was originally slated to combine Verde’s technology with a feedstock of stranded or otherside-flared associated natural from Diamondback’s Permian basin operations for commercial-scale production of almost 3,000 b/d of fully finished reformulated blend stock for oxygenate blending (RBOB) gasoline. A front-end engineering and design (FEED) study was completed in December 2025. Verde’s chief executive officer, Ernest Miller, said knowledge collected from the work completed “will continue to be useful as we explore other opportunities to deploy our technology.” He said the company will devote resources toward other opportunities “in regions where natural gas is stranded or flared without access to a higher value outlet to market.”

Read More »

Oil prices ride the Iran seesaw

Oil, fundamental analysis Crude prices were up-and-down this week as traders eyed both optimistic and pessimistic signs regarding the proposed US/Iran talks held Friday. A large drop in crude and distillate inventories were the main fundamental factors noticed. Despite the rollercoaster ride, US prices remained above the key $60/bbl level. WTI had a High of $65.55/bbl on Wednesday with a weekly Low of $61.10 on Tuesday. Brent crude’s High was $69.75/bbl on Wednesday while its Low was $65.75 Monday. Both grades settled lower week-on-week. The WTI/Brent spread has tightened to ($4.50). The week started with a bearish tone as US President Trump spoke with optimism about the upcoming US/Iran talks. However, later in the week, the Iranian government objected to the specific topics the US wants to discuss beyond their nuclear developments and the meeting appeared doomed. The two sides did decide to follow through with the planned meeting Friday with Iranian officials labeling the meeting as a ‘very good start.’ Meanwhile, the Iranian Revolutionary Guard Corps (IRGCC) attempted to stop a US-flagged vessel in the Strait of Hormuz and sent drones to a US Navy ship in the area. Both attempts were thwarted by US naval forces. Some Very Large Crude Carriers (VLCC) are said to have been increasing their normal speeds for faster passage through the Strait of Hormuz while it’s still open. It is estimate that slightly more than 25% of global oil supplies move through the Strait. Russia is now relying heavily on its relationship with China to buy its oil exports after the US offered India a deal whereby tariffs could be cut if India halts buying Russian Urals. Additionally, Indian refiners would have access to Venezuelan exports. No official action has yet to be taken by the Indian government in terms of such a

Read More »

Perenco Congo installs Kombi 2 platform on KLL field

@import url(‘https://fonts.googleapis.com/css2?family=Inter:[email protected]&display=swap’); a { color: var(–color-primary-main); } .ebm-page__main h1, .ebm-page__main h2, .ebm-page__main h3, .ebm-page__main h4, .ebm-page__main h5, .ebm-page__main h6 { font-family: Inter; } body { line-height: 150%; letter-spacing: 0.025em; font-family: Inter; } button, .ebm-button-wrapper { font-family: Inter; } .label-style { text-transform: uppercase; color: var(–color-grey); font-weight: 600; font-size: 0.75rem; } .caption-style { font-size: 0.75rem; opacity: .6; } #onetrust-pc-sdk [id*=btn-handler], #onetrust-pc-sdk [class*=btn-handler] { background-color: #c19a06 !important; border-color: #c19a06 !important; } #onetrust-policy a, #onetrust-pc-sdk a, #ot-pc-content a { color: #c19a06 !important; } #onetrust-consent-sdk #onetrust-pc-sdk .ot-active-menu { border-color: #c19a06 !important; } #onetrust-consent-sdk #onetrust-accept-btn-handler, #onetrust-banner-sdk #onetrust-reject-all-handler, #onetrust-consent-sdk #onetrust-pc-btn-handler.cookie-setting-link { background-color: #c19a06 !important; border-color: #c19a06 !important; } #onetrust-consent-sdk .onetrust-pc-btn-handler { color: #c19a06 !important; border-color: #c19a06 !important; } Perenco Congo installed the new Kombi 2 platform on Kombi-Likalala-Libondo II (KLL II) field offshore Congo-Brazzaville. The new mobile offshore production unit has improved water and effluent treatment, increased associated gas recovery, and 8 MW of electricity generated by two gas turbines. Total investment in the KLL II project was over $200 million. Kombi 2 will host a six-well drilling campaign starting in 2026, with the aim of increasing production, extending the life of the field. Connection work is currently under way, with commissioning scheduled for early March 2026.

Read More »

Data center capex to hit $1.7 trillion by 2030 due to AI boom

Asked on Thursday about the fact that capex is expected to approach the $1 trillion mark in 2026, he said it is somewhat surprising. “Last year, I thought it would take at least three years to get to that trillion dollar mark,” he said. “It seems increases are supported by the result of larger models needed for training infrastructure, and in turn, you need inference as well. You also need a supporting infrastructure in storage, networking, power, and cooling.” AI, he said, has become “the tide that lift all boats, meaning that in addition to the core accelerated compute, AI also positively impacts complementary infrastructure, such as storage, networking, and physical infrastructure.” Fung added that while much of the achievement of projected spending estimates will depend on whether or not this growth is sustainable, he pointed out, “it seems like the large hyperscalers have a lot of weight in optimizing cash flow and cost structures. They’re trying to get as creative as possible, generally moving towards a more vertical, integrated stack with their own custom networking and external financing, which would help  [create] more sustainable deployments and operations.” Enterprises thinking of expanding their own infrastructure can learn from this growth. In a recent article on the hyper spending of hyperscalers, Greyhound Research chief analyst Sanchit Vir Gogia said their capex spending  levels can help pinpoint where the hyperscalers are expecting bottlenecks, which is useful information for enterprises planning their own cloud strategy across multiple geographies. These and other factors can help enterprises plan their own execution timelines, he said. This article originally appeared on CIO.com.

Read More »

Cisco highlights memory costs, Silicon One growth in Q2 recap

“AI infrastructure orders taken from hyperscalers totaled $2.1 billion in Q2 compared to $1.3 billion just last quarter and equal to the total orders taken in all of fiscal year ’25, marking another significant acceleration in growth across our silicon, systems and optics,” Robbins said. “Given the strong demand for our Silicon One systems and optics, we now expect to take AI orders in excess of $5 billion and to recognize over $3 billion in AI infrastructure revenue from hyperscalers in FY ’26.” Regarding enterprise uptake, Robbins said Cisco took in $350 million in AI orders from enterprise customers in Q2 and has a pipeline in excess of $2.5 billion for its high-performance AI infrastructure portfolio. Cisco is seeing early enterprise use cases for AI around fraud detection and video analytics in sectors such as financial, manufacturing and pharmaceuticals, for example. “I also see examples in retail, where customers are leveraging agents on mobile devices in retail to help their staff do a better job engaging with their customers. We’re seeing a combination of both investment in cloud-based architectures as well as on prem,” Robbins said. Networking rules Cisco is experiencing a faster-than-historical ramp-up of next-generation platforms, including its Catalyst 9K, Wi-Fi 7, and smart switches, stated Sebastien Naji, a research analyst with William Blair, in a report after the call. He attributed it to three factors: an accelerated refresh cycle in the data center; early AI-readiness efforts in the enterprise; and end-of-support for legacy Catalyst and Nexus switches.  “We are seeing strong demand for our next-generation switching, routing and wireless products, which continue to ramp faster than prior product launches. We’re delivering AI-native capabilities across these products, including weaving security into the fabric of the network and modernizing the operational stack of campus networks,” Robbins said. Co-packaged optics? When asked

Read More »

Energy providers seek flexible load strategies for data center operations

“In theory, yes, they’d have to wait a little bit longer while their queries are routed to a data center that has capacity,” said Lawrence. The one thing the industry cannot do is operate like it has in the past, where data center power was tuned and then forgotten for six months. Previously, data centers would test their power sources once or twice a year. They don’t have that luxury anymore. They need to check their power sources and loads far more regularly, according to Lawrence. “I think that for that for the data center industry to continue to survive like we all need it, there’s going to have to be some realignment on the incentives to why somebody would become flexible,” said Lawrence. The survey suggests that utilities and load operators expect to expand their demand response activities and budgets in the near term. Sixty-three percent of respondents anticipate DR program funding to grow by 50% or more over the next three years. While they remain a major source of load growth and system strain, 57% of respondents indicate that onsite power generation from data centers will be most important to improving grid stability over the next five years. One of the proposed fixes to the power shortage has been small modular nuclear reactors. These have gained a lot of traction in the marketplace even if they have nothing to sell yet. But Lawrence said that that’s not an ideal solution for existing power generators, ironically enough.

Read More »

Nokia predicts huge WAN traffic growth, but experts question assumptions

Consumer, which includes both mobile access and fixed access, including fixed wireless access. Enterprise and industrial, which covers wide-area connectivity that supports knowledge work, automation, machine vision, robotics coordination, field support, and industrial IoT. AI, including applications that people directly invoke, such as assistants, copilots, and media generation, as well as autonomous use cases in which AI systems trigger other AI systems to perform functions and move data across networks. The report outlines three scenarios: conservative, moderate, and aggressive. “Our goal is to present scenarios that fall within a realistic range of possible outcomes, encouraging stakeholders to plan across the full spectrum of high-impact demand possibilities,” the report says. Nokia’s prediction for global WAN traffic growth ranges from a 13% CAGR for the conservative scenario to 16% CAGR for moderate and 22% CAGR for aggressive. Looking more closely at the moderate scenario, it’s clear that consumer traffic dominates. Enterprise and industrial traffic make up only about 14% to 17% of overall WAN traffic, although their share is expected to grow during the 10-year forecast period. “On the consumer side, the vast majority of traffic by volume is video,” says William Webb, CEO of the consulting firm Commcisive. Asked whether any of that consumer traffic is at some point served up by enterprises, the answer is a decisive “no.” It’s mostly YouTube and streaming services like Netflix, he says. In short, that doesn’t raise enterprise concerns. Nokia predicts AI traffic boom AI is a different story. “Consumer- and enterprise-generated AI traffic imposes a substantial impact on the wide-area network (WAN) by adding AI workloads processed by data centers across the WAN. AI traffic does not stay inside one data center; it moves across edge, metro, core, and cloud infrastructure, driving dense lateral flows and new capacity demands,” the report says. An

Read More »

Cisco amps up Silicon One line, delivers new systems and optics for AI networking

Those building blocks include the new G300 as well as the G200 51.2 Tbps chip, which is aimed at spine and aggregation applications, and the G100 25.6 Tbps chip, which is aimed at leaf operations. Expanded portfolio of Silicon One P200-powered systems Cisco in October rolled out the P200 Silicon One chip and the high-end, 51.2 Tbps 8223 router aimed at distributed AI workloads. That system supports Octal Small Form-Factor Pluggable (OSFP) and Quad Small Form-Factor Pluggable Double Density (QSFP-DD) optical form factors that help the box support geographically dispersed AI clusters. Cisco grew the G200 family this week with the addition of the 8122X-64EF-O, a 64x800G switch that will run the SONiC OS and includes support for Cisco 800G Linear Pluggable Optics (LPO) connectivity. LPO components typically set up direct links between fiber optic modules, eliminating the need for traditional components such as a digital signal processor. Cisco said its P200 systems running IOS XR software now better support core routing services to allow data-center-to-data-center links and data center interconnect applications. In addition, Cisco introduced a P200-powered 88-LC2-36EF-M line card, which delivers 28.8T of capacity. “Available for both our 8-slot and 18-slot modular systems, this line card enables up to an unprecedented 518.4T of total system bandwidth, the highest in the industry,” wrote Guru Shenoy, senior vice president of the Cisco provider connectivity group, in a blog post about the news. “When paired with Cisco 800G ZR/ZR+ coherent pluggable optics, these systems can easily connect sites over 1,000 kilometers apart, providing the high-density performance needed for modern data center interconnects and core routing.”

Read More »

NetBox Labs ships AI copilot designed for network engineers, not developers

Natural language for network engineers Beevers explained that network operations teams face two fundamental barriers to automation. First, they lack accurate data about their infrastructure. Second, they aren’t software developers and shouldn’t have to become them. “These are not software developers. They are network engineers or IT infrastructure engineers,” Beevers said. “The big realization for us through the copilot journey is they will never be software developers. Let’s stop trying to make them be. Let’s let these computers that are really good at being software developers do that, and let’s let the network engineers or the data center engineers be really good at what they’re really good at.”  That vision drove the development of NetBox Copilot’s natural language interface and its capabilities. Grounding AI in infrastructure reality The challenge with deploying AI  in network operations is trust. Generic large language models hallucinate, produce inconsistent results, and lack the operational context to make reliable decisions. NetBox Copilot addresses this by grounding the AI agent in NetBox’s comprehensive infrastructure data model. NetBox serves as the system of record for network and infrastructure teams, maintaining a semantic map of devices, connections, IP addressing, rack layouts, power distribution and the relationships between these elements. Copilot has native awareness of this data structure and the context it provides. This enables queries that would be difficult or impossible with traditional interfaces. Network engineers can ask “Which devices are missing IP addresses?” to validate data completeness, “Who changed this prefix last week?” for change tracking and compliance, or “What depends on this switch?” for impact analysis before maintenance windows.

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »