Stay Ahead, Stay ONMINE

Why Data Scientists Should Care about Containers — and Stand Out with This Knowledge

“I train models, analyze data and create dashboards — why should I care about Containers?” Many people who are new to the world of data science ask themselves this question. But imagine you have trained a model that runs perfectly on your laptop. However, error messages keep popping up in the cloud when others access […]

“I train models, analyze data and create dashboards — why should I care about Containers?”

Many people who are new to the world of data science ask themselves this question. But imagine you have trained a model that runs perfectly on your laptop. However, error messages keep popping up in the cloud when others access it — for example because they are using different library versions.

This is where containers come into play: They allow us to make machine learning models, data pipelines and development environments stable, portable and scalable — regardless of where they are executed.

Let’s take a closer look.

Table of Contents
1 — Containers vs. Virtual Machines: Why containers are more flexible than VMs
2 — Containers & Data Science: Do I really need Containers? And 4 reasons why the answer is yes.
3 — First Practice, then Theory: Container creation even without much prior knowledge
4 — Your 101 Cheatsheet: The most important Docker commands & concepts at a glance
Final Thoughts: Key takeaways as a data scientist
Where Can You Continue Learning?

1 — Containers vs. Virtual Machines: Why containers are more flexible than VMs

Containers are lightweight, isolated environments. They contain applications with all their dependencies. They also share the kernel of the host operating system, making them fast, portable and resource-efficient.

I have written extensively about virtual machines (VMs) and virtualization in ‘Virtualization & Containers for Data Science Newbiews’. But the most important thing is that VMs simulate complete computers and have their own operating system with their own kernel on a hypervisor. This means that they require more resources, but also offer greater isolation.

Both containers and VMs are virtualization technologies.

Both make it possible to run applications in an isolated environment.

But in the two descriptions, you can also see the 3 most important differences:

  • Architecture: While each VM has its own operating system (OS) and runs on a hypervisor, containers share the kernel of the host operating system. However, containers still run in isolation from each other. A hypervisor is the software or firmware layer that manages VMs and abstracts the operating system of the VMs from the physical hardware. This makes it possible to run multiple VMs on a single physical server.
  • Resource consumption: As each VM contains a complete OS, it requires a lot of memory and CPU. Containers, on the other hand, are more lightweight because they share the host OS.
  • Portability: You have to customize a VM for different environments because it requires its own operating system with specific drivers and configurations that depend on the underlying hardware. A container, on the other hand, can be created once and runs anywhere a container runtime is available (Linux, Windows, cloud, on-premise). Container runtime is the software that creates, starts and manages containers — the best-known example is Docker.
Created by the author

You can experiment faster with Docker — whether you’re testing a new ML model or setting up a data pipeline. You can package everything in a container and run it immediately. And you don’t have any “It works on my machine”-problems. Your container runs the same everywhere — so you can simply share it.

2 — Containers & Data Science: Do I really need Containers? And 4 reasons why the answer is yes.

As a data scientist, your main task is to analyze, process and model data to gain valuable insights and predictions, which in turn are important for management.

Of course, you don’t need to have the same in-depth knowledge of containers, Docker or Kubernetes as a DevOps Engineer or a Site Reliability Engineer (SRE). Nevertheless, it is worth having container knowledge at a basic level — because these are 4 examples of where you will come into contact with it sooner or later:

Model deployment

You are training a model. You not only want to use it locally but also make it available to others. To do this, you can pack it into a container and make it available via a REST API.

Let’s look at a concrete example: Your trained model runs in a Docker container with FastAPI or Flask. The server receives the requests, processes the data and returns ML predictions in real-time.

Reproducibility and easier collaboration

ML models and pipelines require specific libraries. For example, if you want to use a deep learning model like a Transformer, you need TensorFlow or PyTorch. If you want to train and evaluate classic machine learning models, you need Scikit-Learn, NumPy and Pandas. A Docker container now ensures that your code runs with exactly the same dependencies on every computer, server or in the cloud. You can also deploy a Jupyter Notebook environment as a container so that other people can access it and use exactly the same packages and settings.

Cloud integration

Containers include all packages, dependencies and configurations that an application requires. They therefore run uniformly on local computers, servers or cloud environments. This means you don’t have to reconfigure the environment.

For example, you write a data pipeline script. This works locally for you. As soon as you deploy it as a container, you can be sure that it will run in exactly the same way on AWS, Azure, GCP or the IBM Cloud.

Scaling with Kubernetes

Kubernetes helps you to orchestrate containers. But more on that below. If you now get a lot of requests for your ML model, you can scale it automatically with Kubernetes. This means that more instances of the container are started.

3 — First Practice, then Theory: Container creation even without much prior knowledge

Let’s take a look at an example that anyone can run through with minimal time — even if you haven’t heard much about Docker and containers. It took me 30 minutes.

We’ll set up a Jupyter Notebook inside a Docker container, creating a portable, reproducible Data Science environment. Once it’s up and running, we can easily share it with others and ensure that everyone works with the exact same setup.

0 — Install Docker Dekstop and create a project directory

To be able to use containers, we need Docker Desktop. To do this, we download Docker Desktop from the official website.

Now we create a new folder for the project. You can do this directly in the desired folder. I do this via Terminal — on Windows with Windows + R and open CMD.

We use the following command:

Screenshot taken by the author

1. Create a Dockerfile

Now we open VS Code or another editor and create a new file with the name ‘Dockerfile’. We save this file without an extension in the same directory. Why doesn’t it need an extension?

We add the following code to this file:

# Use the official Jupyter notebook image with SciPy
FROM jupyter/scipy-notebook:latest  

# Set the working directory inside the container
WORKDIR /home/jovyan/work  

# Copy all local files into the container
COPY . .

# Start Jupyter Notebook without token
CMD ["start-notebook.sh", "--NotebookApp.token=''"]

We have thus defined a container environment for Jupyter Notebook that is based on the official Jupyter SciPy Notebook image.

First, we define with FROM on which base image the container is built. jupyter/scipy-notebook:latest is a preconfigured Jupyter notebook image and contains libraries such as NumPy, SiPy, Matplotlib or Pandas. Alternatively, we could also use a different image here.

With WORKDIR we set the working directory within the container. /home/jovyan/work is the default path used by Jupyter. User jovyan is the default user in Jupyter Docker images. Another directory could also be selected — but this directory is best practice for Jupyter containers.

With COPY . . we copy all files from the local directory — in this case the Dockerfile, which is located in the jupyter-docker directory — to the working directory /home/jovyan/work in the container.

With CMD [“start-notebook.sh”, “ — NotebookApp.token=‘’’”] we specify the default start command for the container, specify the start script for Jupyter Notebook and define that the notebook is started without a token — this allows us to access it directly via the browser.

2. Create the Docker image

Next, we will build the Docker image. Make sure you have the previously installed Docker desktop open. We now go back to the terminal and use the following command:

cd jupyter-docker
docker build -t my-jupyter .

With cd jupyter-docker we navigate to the folder we created earlier. With docker build we create a Docker image from the Dockerfile. With -t my-jupyter we give the image a name. The dot means that the image will be built based on the current directory. What does that mean? Note the space between the image name and the dot.

The Docker image is the template for the container. This image contains everything needed for the application such as the operating system base (e.g. Ubuntu, Python, Jupyter), dependencies such as Pandas, Numpy, Jupyter Notebook, the application code and the startup commands. When we “build” a Docker image, this means that Docker reads the Dockerfile and executes the steps that we have defined there. The container can then be started from this template (Docker image).

We can now watch the Docker image being built in the terminal.

Screenshot taken by the author

We use docker images to check whether the image exists. If the output my-jupyter appears, the creation was successful.

docker images

If yes, we see the data for the created Docker image:

Screenshot taken by the author

3. Start Jupyter container

Next, we want to start the container and use this command to do so:

docker run -p 8888:8888 my-jupyter

We start a container with docker run. First, we enter the specific name of the container that we want to start. And with -p 8888:8888 we connect the local port (8888) with the port in the container (8888). Jupyter runs on this port. I do not understand.

Alternatively, you can also perform this step in Docker desktop:

Screenshot taken by the author

4. Open Jupyter Notebook & create a test notebook

Now we open the URL [http://localhost:8888](http://localhost:8888/) in the browser. You should now see the Jupyter Notebook interface.

Here we will now create a Python 3 notebook and insert the following Python code into it.

import numpy as np
import matplotlib.pyplot as plt

x = np.linspace(0, 10, 100)
y = np.sin(x)

plt.plot(x, y)
plt.title("Sine Wave")
plt.show()

Running the code will display the sine curve:

Screenshot taken by the author

5. Terminate the container

At the end, we end the container either with ‘CTRL + C’ in the terminal or in Docker Desktop.

With docker ps we can check in the terminal whether containers are still running and with docker ps -a we can display the container that has just been terminated:

Screenshot taken by the author

6. Share your Docker image

If you now want to upload your Docker image to a registry, you can do this with the following command. This will upload your image to Docker Hub (you need a Docker Hub account for this). You can also upload it to a private registry of AWS Elastic Container, Google Container, Azure Container or IBM Cloud Container.

docker login

docker tag my-jupyter your-dockerhub-name/my-jupyter:latest

docker push dein-dockerhub-name/mein-jupyter:latest

If you then open Docker Hub and go to your repositories in your profile, the image should be visible.

This was a very simple example to get started with Docker. If you want to dive a little deeper, you can deploy a trained ML model with FastAPI via a container.

4 — Your 101 Cheatsheet: The most important Docker commands & concepts at a glance

You can actually think of a container like a shipping container. Regardless of whether you load it onto a ship (local computer), a truck (cloud server) or a train (data center) — the content always remains the same.

The most important Docker terms

  • Container: Lightweight, isolated environment for applications that contains all dependencies.
  • Docker: The most popular container platform that allows you to create and manage containers.
  • Docker Image: A read-only template that contains code, dependencies and system libraries.
  • Dockerfile: Text file with commands to create a Docker image.
  • Kubernetes: Orchestration tool to manage many containers automatically.

The basic concepts behind containers

  • Isolation: Each container contains its own processes, libraries and dependencies
  • Portability: Containers run wherever a container runtime is installed.
  • Reproducibility: You can create a container once and it runs exactly the same everywhere.

The most basic Docker commands

docker --version # Check if Docker is installed
docker ps # Show running containers
docker ps -a # Show all containers (including stopped ones)
docker images # List of all available images
docker info # Show system information about the Docker installation

docker run hello-world # Start a test container
docker run -d -p 8080:80 nginx # Start Nginx in the background (-d) with port forwarding
docker run -it ubuntu bash # Start interactive Ubuntu container with bash

docker pull ubuntu # Load an image from Docker Hub
docker build -t my-app . # Build an image from a Dockerfile

Final Thoughts: Key takeaways as a data scientist

👉 With Containers you can solve the “It works on my machine” problem. Containers ensure that ML models, data pipelines, and environments run identically everywhere, independent of OS or dependencies.

👉 Containers are more lightweight and flexible than virtual machines. While VMs come with their own operating system and consume more resources, containers share the host operating system and start faster.

👉 There are three key steps when working with containers: Create a Dockerfile to define the environment, use docker build to create an image, and run it with docker run — optionally pushing it to a registry with docker push.

And then there’s Kubernetes.

A term that comes up a lot in this context: An orchestration tool that automates container management, ensuring scalability, load balancing and fault recovery. This is particularly useful for microservices and cloud applications.

Before Docker, VMs were the go-to solution (see more in ‘Virtualization & Containers for Data Science Newbiews’.) VMs offer strong isolation, but require more resources and start slower.

So, Docker was developed in 2013 by Solomon Hykes to solve this problem. Instead of virtualizing entire operating systems, containers run independently of the environment — whether on your laptop, a server or in the cloud. They contain all the necessary dependencies so that they work consistently everywhere.

I simplify tech for curious minds🚀 If you enjoy my tech insights on Python, data science, Data Engineering, machine learning and AI, consider subscribing to my substack.

Where Can You Continue Learning?

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

Western Digital wants to ramp-up hard disk drive speeds

Most enterprises are not using SATA drives, at least not with hot data. Perhaps cold storage but not frequently accessed data. They are using PCI Express based drives and those are considerably faster than anything Western Digital can engineer in a hard disk. Capacity aside, Western Digital is also aiming

Read More »

LoRaWAN reaches 125 million devices as industrial IoT expands

Satellite integration is set to grow Terrestrial LoRaWAN networks cannot achieve complete geographic coverage. Yegin cited Swisscom’s nationwide Switzerland deployment, which covers 97.2% of the population but cannot reach remote alpine terrain. Two LoRa Alliance members, Lacuna Space and Plan-S, already operate commercial LoRaWAN services from low Earth orbit. Standard

Read More »

Insights: Venezuela – new legal frameworks vs. the inertia of history

@import url(‘https://fonts.googleapis.com/css2?family=Inter:[email protected]&display=swap’); a { color: var(–color-primary-main); } .ebm-page__main h1, .ebm-page__main h2, .ebm-page__main h3, .ebm-page__main h4, .ebm-page__main h5, .ebm-page__main h6 { font-family: Inter; } body { line-height: 150%; letter-spacing: 0.025em; font-family: Inter; } button, .ebm-button-wrapper { font-family: Inter; } .label-style { text-transform: uppercase; color: var(–color-grey); font-weight: 600; font-size: 0.75rem; } .caption-style { font-size: 0.75rem; opacity: .6; } #onetrust-pc-sdk [id*=btn-handler], #onetrust-pc-sdk [class*=btn-handler] { background-color: #c19a06 !important; border-color: #c19a06 !important; } #onetrust-policy a, #onetrust-pc-sdk a, #ot-pc-content a { color: #c19a06 !important; } #onetrust-consent-sdk #onetrust-pc-sdk .ot-active-menu { border-color: #c19a06 !important; } #onetrust-consent-sdk #onetrust-accept-btn-handler, #onetrust-banner-sdk #onetrust-reject-all-handler, #onetrust-consent-sdk #onetrust-pc-btn-handler.cookie-setting-link { background-color: #c19a06 !important; border-color: #c19a06 !important; } #onetrust-consent-sdk .onetrust-pc-btn-handler { color: #c19a06 !important; border-color: #c19a06 !important; } In this Insights episode of the Oil & Gas Journal ReEnterprised podcast, Head of Content Chris Smith updates the evolving situation in Venezuela as the industry attempts to navigate the best path forward while the two governments continue to hammer out the details. The discussion centers on the new legal frameworks being established in both countries within the context of fraught relations stretching back for decades. Want to hear more? Listen in on a January episode highlighting industry’s initial take following the removal of Nicholas Maduro from power. References Politico podcast Monaldi Substack Baker webinar Washington, Caracas open Venezuela to allow more oil sales 

Read More »

Eni makes Calao South discovery offshore Ivory Coast

@import url(‘https://fonts.googleapis.com/css2?family=Inter:[email protected]&display=swap’); a { color: var(–color-primary-main); } .ebm-page__main h1, .ebm-page__main h2, .ebm-page__main h3, .ebm-page__main h4, .ebm-page__main h5, .ebm-page__main h6 { font-family: Inter; } body { line-height: 150%; letter-spacing: 0.025em; font-family: Inter; } button, .ebm-button-wrapper { font-family: Inter; } .label-style { text-transform: uppercase; color: var(–color-grey); font-weight: 600; font-size: 0.75rem; } .caption-style { font-size: 0.75rem; opacity: .6; } #onetrust-pc-sdk [id*=btn-handler], #onetrust-pc-sdk [class*=btn-handler] { background-color: #c19a06 !important; border-color: #c19a06 !important; } #onetrust-policy a, #onetrust-pc-sdk a, #ot-pc-content a { color: #c19a06 !important; } #onetrust-consent-sdk #onetrust-pc-sdk .ot-active-menu { border-color: #c19a06 !important; } #onetrust-consent-sdk #onetrust-accept-btn-handler, #onetrust-banner-sdk #onetrust-reject-all-handler, #onetrust-consent-sdk #onetrust-pc-btn-handler.cookie-setting-link { background-color: #c19a06 !important; border-color: #c19a06 !important; } #onetrust-consent-sdk .onetrust-pc-btn-handler { color: #c19a06 !important; border-color: #c19a06 !important; } Eni SPA discovered gas and condensate in the Murene South-1X exploration well in Block CI-501, Ivory Coast. The well is the first exploration in the block and was drilled by the Saipem Santorini drilling ship about 8 km southwest of the Murene-1X discovery well in adjacent CI-205 block. The well was drilled to about 5,000 m TD in 2,200 m of water. Extensive data acquisition confirmed a main hydrocarbon bearing interval in high-quality Cenomanian sands with a gross thickness of about 50 m with excellent petrophysical properties, the operator said. Murene South-1X will undergo a full conventional drill stem test (DST) to assess the production capacity of this discovery, named Calao South. Calao South confirms the potential of the Calao channel complex that also includes the Calao discovery. It is the second largest discovery in the country after Baleine, with estimated volumes of up to 5.0 tcf of gas and 450 million bbl of condensate (about 1.4 billion bbl of oil). Eni is operator of Block CI-501 (90%) with partner Petroci Holding (10%).

Read More »

CFEnergía to supply natural gas to low-carbon methanol plant in Mexico

CFEnergía, a subsidiary of Mexico’s Federal Electricity Commission (CFE), has agreed to supply natural gas to Transition Industries LLC for its Pacifico Mexinol project near Topolobampo, Sinaloa, Mexico. Under the signed agreement, which enables the start of Pacifico Mexinol’s construction phase, CFEnergía will supply about 160 MMcfd of natural gas for an unspecified timeframe noted as “long term,” Transition Industries said in a release Feb. 16. The natural gas—to be sourced from the US and supplied at market prices via existing infrastructure—will be used as “critical input for Mexinol’s production of ultra-low carbon methanol,” the company said. Pacifico Mexinol The $3.3-billion Mexinol project, when it begins operations in late 2029 to early 2030, is expected to be the world’s largest ultra-low carbon chemicals plant with production of about 1.8 million tonnes of blue methanol and 350,000 tonnes of green methanol annually. Supply is aimed at markets in Asia, including Japan, while also boosting the development of the domestic market and the Mexican chemical industry. Mitsubishi Gas Chemical has committed to purchasing about 1 million tonnes/year of methanol from the project, about 50% of the project’s planned production. Transition Industries is jointly developing Pacifico Mexinol with the International Finance Corporation (IFC), a member of the World Bank Group. Last year, the company signed a contingent engineering, procurement, and construction (EPC) contract with the consortium of Samsung E&A Co., Ltd., Grupo Samsung E&A Mexico SA de CV, and Techint Engineering and Construction for the project. MAIRE group’s technology division NextChem, through its subsidiary KT TECH SpA, also signed a basic engineering, critical and proprietary equipment supply agreement with Samsung E&A in connection with its proprietary NX AdWinMethanol®Zero technology supply to the project.

Read More »

North Atlantic’s Gravenchon refinery scheduled for major turnaround

Canada-based North Atlantic Refining Ltd. France-based subsidiary North Atlantic France SAS is undertaking planned maintenance in March at its North Atlantic Energies-operated 230,000-b/d Notre-Dame-de-Gravenchon refinery in Port-Jérôme-sur-Seine, Normandy. Scheduled to begin on Mar. 3 with the phased shutdown of unidentified units at the refinery, the upcoming turnaround will involve thorough inspections of associated equipment designed for continuous operation, as well as unspecified works to improve energy efficiency, environmental performance, and overall competitiveness of the site, North Atlantic Energies said on Feb. 16. Part of the operator’s routine maintenance program aimed at meeting regulatory requirements to ensure the safety, compliance, and long-term performance of the refinery, North Atlantic Energies said the scheduled turnaround will not interrupt product supplies to customers during the shutdown period. While the company confirmed the phased shutdown of units slated for work during the maintenance event would last for several days, the operator did not reveal a definitive timeline for the entire duration of the turnaround. Further details regarding specific works to be carried out during the major maintenance event were not revealed. The upcoming turnaround will be the first to be executed under North Atlantic Group’s ownership, which completed its purchase of the formerly majority-owned ExxonMobil Corp. refinery and associated petrochemical assets at the site in November 2025.

Read More »

Azule Energy starts Ndungu full field production offshore Angola

Azule Energy has started full field production from Ndungu, part of the Agogo Integrated West Hub Project (IWH) in the western area of Block 15/06, offshore Angola. Ndungo full field lies about 10 km from the NGOMA FPSO in a water depth of around 1,100 m and comprises seven production wells and four injection wells, with an expected production peak of 60,000 b/d of oil. The National Agency for Petroleum, Gas and Biofuels (ANPG) and Azule Energy noted the full field start-up with first oil of three production wells. The phased integration of IWH, with Ndungu full field producing first via N’goma FPSO and later via Agogo FPSO, is expected to reach a peak output of about 175,000 b/d across the two fields. The fields have combined estimated reserves of about 450 million bbl. The Agogo IWH project is operated by Azule Energy with a 36.84% stake alongside partners Sonangol E&P (36.84%) and Sinopec International (26.32%).   

Read More »

Ovintiv to divest Anadarko assets for $3 billion

In a release Feb. 17, Brendan McCracken, Ovintiv president and chief executive officer, said the company has “built one of the deepest premium inventory positions in our industry in the two most valuable plays in North America, the Permian and the Montney,” and that the Anadarko assets sale “positions [Ovintiv] to deliver superior returns for our shareholders for many years to come.” Ovintiv in 2025 had noted plans to sell the asset to help offset the cost of its acquisition of NuVista Energy Ltd. That $2.7-billion cash and stock deal, which closed earlier this month, added about 930 net 10,000-ft equivalent well locations and about 140,000 net acres (70% undeveloped) in the core of the oil-rich Alberta Montney.  Proceeds from the Anadarko assets sale are earmarked for accelerated debt reduction, the company said.  Ovintiv’s sale of its Anadarko assets is expected to close early in this year’s second quarter, subject to customary conditions, with an effective date of Jan. 1, 2026.

Read More »

Raising the temp on liquid cooling

IBM isn’t the only one. “We’ve been doing liquid cooling since 2012 on our supercomputers,” says Scott Tease, vice president and general manager of AI and high-performance computing at Lenovo’s infrastructure solutions group. “And we’ve been improving it ever since—we’re now on the sixth generation of that technology.” And the liquid Lenovo uses in its Neptune liquid cooling solution is warm water. Or, more precisely, hot water: 45 degrees Celsius. And when the water leaves the servers, it’s even hotter, Tease says. “I don’t have to chill that water, even if I’m in a hot climate,” he says. Even at high temperatures, the water still provides enough cooling to the chips that it has real value. “Generally, a data center will use evaporation to chill water down,” Tease adds. “Since we don’t have to chill the water, we don’t have to use evaporation. That’s huge amounts of savings on the water. For us, it’s almost like a perfect solution. It delivers the highest performance possible, the highest density possible, the lowest power consumption. So, it’s the most sustainable solution possible.” So, how is the water cooled down? It gets piped up to the roof, Tease says, where there are giant radiators with massive amounts of surface area. The heat radiates away, and then all the water flows right back to the servers again. Though not always. The hot water can also be used to, say, heat campus or community swimming pools. “We have data centers in the Nordics who are giving the heat to the local communities’ water systems,” Tease says.

Read More »

Vertiv’s AI Infrastructure Surge: Record Orders, Liquid Cooling Expansion, and Grid-Scale Power Reflect Data Center Growth

2) “Units of compute”: OneCore and SmartRun On the earnings call, Albertazzi highlighted Vertiv OneCore, an end-to-end data center solution designed to accelerate “time to token,” scaling in 12.5 MW building blocks; and Vertiv SmartRun, a prefabricated white space infrastructure solution aimed at rapidly accelerating fit-out and readiness. He pointed to collaborations (including Hut 8 and Compass Data Centers) as proof points of adoption, emphasizing that SmartRun can stand alone or plug into OneCore. 3) Cooling evolution: hybrid thermal chains and the “trim cooler” Asked how cooling architectures may change (amid industry chatter about warmer-temperature operations and shifting mixes of chillers, CDUs, and other components) Albertazzi leaned into complexity as a feature, not a bug. He argued heat rejection doesn’t disappear, even if some GPU loads can run at higher temperatures. Instead, the future looks hybrid, with mixed loads and resiliency requirements forcing more nuanced thermal chains. Vertiv’s strategic product anchor here is its “trim cooler” concept: a chiller optimized for higher-temperature operation while retaining flexibility for lower-temperature requirements in the same facility, maximizing free cooling where climate and design allow. And importantly, Albertazzi dismissed the idea that CDUs are going away: “We are pretty sure that CDUs in various shapes and forms are a long-term element of the thermal chain.” 4) Edge densification: CoolPhase Ceiling + CoolPhase Row (Feb. 3) Vertiv also expanded its thermal portfolio for edge and small IT environments with the: Vertiv CoolPhase Ceiling (launching Q2 2026): ceiling-mounted, 3.5 kW to 28 kW, designed to preserve floor space. Vertiv CoolPhase Row (available now in North America) for row-based cooling up to 30 kW (300 mm width) or 40 kW (600 mm width). Vertiv Director of Edge Thermal Michal Podmaka tied the products directly to AI-driven edge densification and management consistency, saying the new systems “integrate seamlessly

Read More »

Execution, Power, and Public Trust: Rich Miller on 2026’s Data Center Reality and Why He Built Data Center Richness

DCF founder Rich Miller has spent much of his career explaining how the data center industry works. Now, with his latest venture, Data Center Richness, he’s also examining how the industry learns. That thread provided the opening for the latest episode of The DCF Show Podcast, where Miller joined present Data Center Frontier Editor in Chief Matt Vincent and Senior Editor David Chernicoff for a wide-ranging discussion that ultimately landed on a simple conclusion: after two years of unprecedented AI-driven announcements, 2026 will be the year reality asserts itself. Projects will either get built, or they won’t. Power will either materialize, or it won’t. Communities will either accept data center expansion – or they’ll stop it. In other words, the industry is entering its execution phase. Why Data Center Richness Matters Now Miller launched Data Center Richness as both a podcast and a Substack publication, an effort to experiment with formats and better understand how professionals now consume industry information. Podcasts have become a primary way many practitioners follow the business, while YouTube’s discovery advantages increasingly make video versions essential. At the same time, Miller remains committed to written analysis, using Substack as a venue for deeper dives and format experimentation. One example is his weekly newsletter distilling key industry developments into just a handful of essential links rather than overwhelming readers with volume. The approach reflects a broader recognition: the pace of change has accelerated so much that clarity matters more than quantity. The topic of how people learn about data centers isn’t separate from the industry’s trajectory; it’s becoming part of it. Public perception, regulatory scrutiny, and investor expectations are now shaped by how stories are told as much as by how facilities are built. That context sets the stage for the conversation’s core theme. Execution Defines 2026 After

Read More »

Utah’s 4 GW AI Campus Tests the Limits of Speed-to-Power

Back in September 2025, we examined an ambitious proposal from infrastructure developer Joule Capital Partners – often branding the effort as “Joule Power” – in partnership with Caterpillar. The concept is straightforward but consequential: acquire a vast rural tract in Millard County, Utah, and pair an AI-focused data center campus with large-scale, on-site “behind-the-meter” generation to bypass the interconnection queues, transmission constraints, and substation bottlenecks slowing projects nationwide. The appeal is clear: speed-to-power and greater control over delivery timelines. But that speed shifts the project’s risk profile. Instead of navigating traditional utility procurement, the development begins to resemble a distributed power plant subject to industrial permitting, fuel supply logistics, air emissions scrutiny, noise controls, and groundwater governance. These are issues communities typically associate with generation facilities, not hyperscale data centers. Our earlier coverage focused on the technical and strategic logic of pairing compute with on-site generation. Now the story has evolved. Community opposition is emerging as a material variable that could influence schedule and scope. Although groundbreaking was held in November 2025, final site plans and key conditional use permits remain pending at the time of publication. What Is Actually Being Proposed? Public records from Millard County show Joule pursuing a zone change for approximately 4,000 acres (about 6.25 square miles), converting agricultural land near 11000 N McCornick Road to Heavy Industrial use. At a July 2025 public meeting, residents raised familiar concerns that surface when a rural landscape is targeted for hyperscale development: labor influx and housing strain, water use, traffic, dust and wildfire risk, wildlife disruption, and the broader loss of farmland and local character. What has proven less clear is the precise scale and sequencing of the buildout. Local reporting describes an initial phase of six data center buildings, each supported by a substantial fleet of Caterpillar

Read More »

From Lab to Gigawatt: CoreWeave’s ARENA and the AI Validation Imperative

The Production Readiness Gap AI teams continue to confront a familiar challenge: moving from experimentation to predictable production performance. Models that train successfully on small clusters or sandbox environments often behave very differently when deployed at scale. Performance characteristics shift. Data pipelines strain under sustained load. Cost assumptions unravel. Synthetic benchmarks and reduced test sets rarely capture the complex interactions between compute, storage, networking, and orchestration that define real-world AI systems. The result can be an expensive “Day One” surprise:  unexpected infrastructure costs, bottlenecks across distributed components, and delays that ripple across product timelines. CoreWeave’s view is that benchmarking and production launch can no longer be treated as separate phases. Instead, validation must occur in environments that replicate the architectural, operational, and economic realities of live deployment. ARENA is designed around that premise. The platform allows customers to run full workloads on CoreWeave’s production-grade GPU infrastructure, using standardized compute stacks, network configurations, data paths, and service integrations that mirror actual deployment environments. Rather than approximating production behavior, the goal is to observe it directly. Key capabilities include: Running real workloads on GPU clusters that match production configurations. Benchmarking both performance and cost under realistic operational conditions. Diagnosing bottlenecks and scaling behavior across compute, storage, and networking layers. Leveraging standardized observability tools and guided engineering support. CoreWeave positions ARENA as an alternative to traditional demo or sandbox environments; one informed by its own experience operating large-scale AI infrastructure. By validating workloads under production conditions early in the lifecycle, teams gain empirical insight into performance dynamics and cost curves before committing capital and operational resources. Why Production-Scale Validation Has Become Strategic The demand for environments like ARENA reflects how fundamentally AI workloads have changed. Several structural shifts are driving the need for production-scale validation: Continuous, Multi-Layered Workloads AI systems are no longer

Read More »

GenAI Pushes Cloud to $119B Quarter as AI Networking Race Intensifies

Cisco Targets the AI Fabric Bottleneck Cisco introduced its Silicon One G300, a new switching ASIC delivering 102.4 Tbps of throughput and designed specifically for large-scale AI cluster deployments. The chip will power next-generation Cisco Nexus 9000 and 8000 systems aimed at hyperscalers, neocloud providers, sovereign cloud operators, and enterprises building AI infrastructure. The company is positioning the platform around a simple premise: at AI-factory scale, the network becomes part of the compute plane. According to Cisco, the G300 architecture enables: 33% higher network utilization 28% reduction in AI job completion time Support for emerging 1.6T Ethernet environments Integrated telemetry and path-based load balancing Martin Lund, EVP of Cisco’s Common Hardware Group, emphasized the growing centrality of data movement. “As AI training and inference continues to scale, data movement is the key to efficient AI compute; the network becomes part of the compute itself,” Lund said. The new systems also reflect another emerging trend in AI infrastructure: the spread of liquid cooling beyond servers and into the networking layer. Cisco says its fully liquid-cooled switch designs can deliver nearly 70% energy efficiency improvement compared with prior approaches, while new 800G linear pluggable optics aim to reduce optical power consumption by up to 50%. Ethernet’s Next Big Test Industry analysts increasingly view AI networking as one of the most consequential battlegrounds in the current infrastructure cycle. Alan Weckel, founder of 650 Group, noted that backend AI networks are rapidly moving toward 1.6T architectures, a shift that could push the Ethernet data center switch market above $100 billion annually. SemiAnalysis founder Dylan Patel was even more direct in framing the stakes. “Networking has been the fundamental constraint to scaling AI,” Patel said. “At this scale, networking directly determines how much AI compute can actually be utilized.” That reality is driving intense innovation

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »