➡️ Start Asking Your Data ‘Why?’ — A Gentle Intro To Causality

Stay Ahead, Stay ONMINE

➡️ Start Asking Your Data ‘Why?’ — A Gentle Intro To Causality

Correlation does not imply causation. It turns out, however, that with some simple ingenious tricks one can, potentially, unveil causal relationships within standard observational data, without having to resort to expensive randomised control trials. This post is targeted towards anyone making data driven decisions. The main takeaway message is that causality may be possible by […]

This post is targeted towards anyone making data driven decisions. The main takeaway message is that causality may be possible by understanding that the story behind the data is as important as the data itself.

By introducing Simpson’s and Berkson’s Paradoxes, situations where the outcome of a population is in conflict with that of its cohorts, I shine a light on the importance of using causal reasoning to identify these paradoxes in data and avoid misinterpretation. Specifically I introduce causal graphs as a method to visualise the story behind the data point out that by adding this to your arsenal you are likely to conduct better analyses and experiments.

My ultimate objective is to whet your appetite to explore more on causality, as I believe that by asking data “Why?” you will be able to go beyond correlation calculations and extract more insights, as well as avoid common misjudgement pitfalls.

Note that throughout this gentle intro I do not use equations but demonstrate using accessible intuitive visuals. That said I provide resources for you to take your next step in adding Causal Inference to your statistical toolbox so that you may get more value from your data.

The Era of Data Driven Decision Making

In [Deity] We Trust, All Others Bring Data! — William E. Deming

In this digital age it is common to put a lot of faith in data. But this raises an overlooked question: Should we trust data on its own?

Judea Pearl, who is considered the godfather of Causality, articulated best:

“The collection of information is as important as the information itself “ — Judea Pearl

In other words the story behind the data is as important as the data itself.

Judea Pearl is considered the Godfather of Causality. Credit: Aleksander Molak

This manifests in a growing awareness of the importance of identifying bias in datasets. By the end of this post I hope that you will appreciate that causality pertains the fundamental tools to best express, quantify and attempt to correct for these biases.

In causality introductions it is customary to demonstrate why “correlation does not imply causation” by highlighting limitations of association analysis due to spurious correlations (e.g, shark attacks 🦈 and ice-cream sales 🍦). In an attempt to reduce the length of this post I defer this aspect to an older one of mine. Here I focus on two mind boggling paradoxes 🤯 and their resolution via causal graphs to make a similar point.

Paradoxes in Analysis

To understand the importance of the story behind the data we will examine two counter-intuitive (but nonetheless true) paradoxes which are classical situations of data misinterpretation.

In the first we imagine a clinical trial in which patients are given a treatment and that results in a health score. Our objective is to assess the average impact of increased treatment to the health outcome. For pedagogical purposes in these examples we assume that samples are representative (i.e, the sample size is not an issue) and that variances in measurements are minimal.

Population outcome of imaginary clinical trial. Each dot is one patient and the red line indicates the naïve population trend.

In the figure above we learn that on average increasing the treatment appears to be beneficial since it results in a better outcome.

Now we’ll color code by age and gender groupings and examine how the treatment increases impacts each cohort.

Same data as before where each symbol represents an age-gender cohort.

Track any cohort (e.g, “Girls” representing young females) and you immediately realise that increase in treatment appears adverse.

What is the conclusion of the study? On the one hand increasing the treatment appears to be better for the population at large, but when examining gender-age cohorts it seems disadvantageous. This is Simpson’s Paradox which may be stated:

“Trends can exist in subgroups but reverse for the whole”

Below we will resolve this paradox using causality tools, but beforehand let’s explore another interesting one, which also examines made up data.

Imagine that we quantify for the general population their attractiveness and how talented they are as in this figure:

General population. Source: Wikipedia, created by Cmglee

We find no apparent correlation.

Now we’ll focus on an unusual subset — famous people:

A subset of celebrities. Source: Wikipedia created by Cmglee

Here we clearly see an anti-correlation that doesn’t exist in the general population.

Should we conclude that Talent and Attractiveness are independent variables as per the first plot of the general population or that they are correlated as per that of celebrities?

This is Berkson’s Paradox where one population has a trait trend that another lacks.

Whereas an algorithm would identify these correlations, resolving these paradoxes requires a full understanding of the context which normally is not fed to a computer. In other words without knowing the story behind the data results may be misinterpreted and wrong conclusions may be inferred.

Mastering identification and resolution these paradoxes is an important first step to elevating one’s analyses from correlations to causal inference.

Whereas these simple examples may be explained away logically, for the purposes of learning causal tools in the next section I’ll introduce Causal Graphs.

Causal Graphs— Visualising The Story Behind The Data

“[From the Simpson’s and Berkson’s Paradoxes we learn that] certain decisions cannot be made based on the basis of data alone, but instead depend on the story behind the data. … Graph Theory enables these stories to be conveyed” — Judea Pearl

Causal graph models are probabilistic graphical models used to visualise the story behind the data. They are perhaps one of the most powerful tools for analysts that is not taught in most statistics curricula. They are both elegant and highly informative. Hopefully by the end of this post you will appreciate it when Judea Pearl says that this is the missing vocabulary to communicate causality.

To understand causal graph models (or causal graphs for short) we start with the following illustration of an example undirected graph with four nodes/vertices and three edges.

An undirected graph with four nodes/vertices and three edges

Each node is a variable and the edges communicate “who is related to whom?” (i.e, correlations, joint probabilities).A directed graph is one in which we add arrows as in this figure.

A directed graph with four nodes/vertices and five directed edges

A directed edge communicates “who listens to whom?” which is the essence of causation.

In this specific example you can notice a cyclical relationship between the C and D nodes.A useful subset of directed graphs are the directed acyclic graphs (DAG), which have no cycles as in the next figure.

A directed acyclic graph with four nodes/vertices and four directed edges

Here we see that when starting from any node (e.g, A) there isn’t a path that gets back to it.

DAGs are the go-to choice in causality for simplicity as the fact that parameters do not have feedback highly simplifies the flow of information. (For mechanisms that have feedback, e.g temporal systems, one may consider rolling out nodes as a function of time, but that is beyond the scope of this intro.)

Causal graphs are powerful at conveying the cause/effect relationships between the parameter and hence how data was generated (the story behind the data).

From a practical point of view, graphs enable us to understand which parameters are confounders that need to be controlled for, and, as important, which not to control for, because doing so causes spurious correlations. This will be demonstrated below.

The practice of attempting to build a causal graph enables:

Design of better experiments.
Draw causal conclusions (go beyond correlations by means of representing interventions, counterfactuals and encoding conditional independence relationships; all beyond the scope of this post).

To further motivate the usage of causal graph models we will use them to resolve the Simpson’s and Berkson’s paradoxes introduced above.

💊 Causal Graph Resolution of Simpson’s Paradox

For simplicity we’ll examine Simpson’s paradox focusing on two cohorts, male and female adults.

Outcome of the imaginary therapeutic trial, similar to the previous but focusing on the adults. Each symbol is one patient from the respective age-gender cohort and the red line indicates the naïve population trend.

Examining this data we can make three statements about three variables of interest:

Gender is an independent variable (it does not “listen to” the other two)
Treatment depends on Gender (as we can see, in this setting the level given depends on Gender — women have been given, for some reason, a higher dosage.)
Outcome depends on both Gender and Treatment

According to these we can draw the causal graph as the following:

Simpson’s paradox Graphic Model where Gender is a confounding variable between Treatment and Outcome

Notice how each arrow contributes to communicate the statements above. As important, the lack of an arrow pointing into Gender conveys that it is an independent variable.

We also notice that by having arrows pointing from Gender to Treatment and Outcome it is considered a common cause between them.

The essence of the Simpson’s paradox is that although the Outcome is effected by changes in Treatment, as expected, there is also a backdoor path flow of information via Gender.

As you may have guessed by this stage, the solution to this paradox is that the common cause Gender is a confounding variable that needs to be controlled.

Controlling for a variable, in terms of a causal graph, means eliminating the relationship between Gender and Treatment.

This may be done in two manners:

Pre data collection: Setting up a Randomised Control Trial (RCT) in which participants will be given dosage regardless of their Gender.
Post data collection: E.g, in this made up scenario the data has already been collected and hence we need to deal with what is referred to as Observational Data.

In both pre- and post- data collection the elimination of the Treatment dependency of Gender (i.e, controlling for the Gender) may be done by modifying the graph such that the arrow between them is removed as in the following:

A modified version of the Simpson’s paradox Graphic Model. The dark node means we control for Gender.

Applying this “graphical surgery” means that the last two statements need to be modified (for convenience I’ll write all three):

Gender is an independent variable
Treatment is an independent variable
Outcome depends on Gender and Treatment (but with no backdoor path).

This enables obtaining the causal relationship of interest : we can assess the direct impact of modification Treatment on the Outcome.

The process of controlling for a confounder, i.e manipulation of the data generation process, is formally referred to as applying an intervention. That is to say we are no longer passive observers of the data, but we are taking an active role in modification it to assess the causal impact.

How is this manifested in practice?

In the case of RCTs the researcher needs to control for important confounding variables. Here we limit the discussion to Gender (but in real world settings you can imagine other variables such as Age, Social Status and anything else that might be relevant to one’s health).

RCTs are considered the golden standard for causal analysis in many experimental settings thanks to its practice of confounding variables. That said, it has many setbacks:

It may be expensive to recruit individuals and may be complicated logistically
The intervention under investigation may not be physically possible or ethical to conduct (e.g, one can’t ask randomly selected people to smoke or not for ten years)
Artificial setting of a laboratory — not a true natural habitat of the population.

Observational data on the other hand is much more readily available in the industry and academia and hence much cheaper and could be more representative of actual habits of the individuals. But as illustrated in the Simpson’s diagram it may have confounding variables that need to be controlled.

This is where ingenious solutions developed in the causal community in the past few decades are making headway. Detailing them are beyond the scope of this post, but I briefly mention how to learn more at the end.

To resolve for this Simpson’s paradox with the given observational data one

Calculates for each cohort the impact of the change of the treatment on the outcome
Calculates a weighted average contribution of each cohort on the population.

Here we will focus on intuition, but in a future post we will describe the maths behind this solution.

I am sure that many analysts, just like myself, have noticed Simpson’s at some stage in their data and hopefully have corrected for it. Now you know the name of this effect and hopefully start to appreciate how causal tools are useful.

That said … being confused at this stage is OK 😕

I’ll be the first to admit that I struggled to understand this concept and it took me three weekends of deep diving into examples to internalised it. This was the gateway drug to causality for me. Part of my process to understanding statistics is playing with data. For this purpose I created an interactive web application hosted in Streamlit which I call Simpson’s Calculator 🧮. I’ll write a separate post for this in the future.

Even if you are confused the main takeaways of Simpson’s paradox is that:

It is a situation where trends can exist in subgroups but reverse for the whole.
It may be resolved by identifying confounding variables between the treatment and the outcome variables and controlling for them.

This raises the question — should we just control for all variables except for the treatment and outcome? Let’s keep this in mind when resolving for the Berkson’s paradox.

🦚 Causal Graph Resolution of Berkson’s Paradox

As in the previous section we are going to make clear statements about how we believe the data was generated and then draw these in a causal graph.

Let’s examine the case of the general population, for convenience I’m copying the image from above:

Here we understand that:

Talent is an independent variable
Attractiveness is an independent variable

A causal graph for this is quite simple, two nodes without an edge.

In the general population ones Talent and Attractiveness are independent

Let’s examine the plot of the celebrity subset.

The cheeky insight from this mock data is that the more likely one is attractive the less they need to be talented to be a celebrity. Hence we can deduce that:

Talent is an independent variable
Attractiveness is an independent variable
Celebrity variable depends on both Talent and Attractiveness variables. (Imagine this variable is boolean as in: true for celebrities or false for not).

Hence we can draw the causal graph as:

Being a celebrity depends on Talent and Attractiveness

By having arrows pointing into it Celebrity is a collider node between Talent and Attractiveness.

Berkson’s paradox is the fact that when controlling for celebrities we see an interesting trend (anti correlation between Attractiveness and Talent) not seen in the general population.

This can be visualised in the causal graph that by confounding for the Celebrity parameter we are creating a spurious correlation between the otherwise independent variables Talent and Attractiveness. We can draw this as the following:

Berkson’s paradox Graphic Model. The dark node means we control for Celebrity. Controlling this collider variable generates a spurious correlation (dashed line) between Talent and Attractiveness.

The solution of this Berkson’s paradox should be apparent here: Talent and Attractiveness are independent variables in general, but by controlling for the collider Celebrity node causes a spurious correlation in the data.

Let’s compare the resolution of both paradoxes:

Resolving Simpson’s Paradox is by controlling for common cause (Gender)
Resolving Berkson’s Paradox is by not controlling for the collider (Celebrity)

The next figure combines both insights in the form of their causal graphs:

Graph models show how to resolve the paradoxes. Dark nodes are controlled for. Left: Modified graph to resolve Simpson’s paradox by controlling for Gender. Right: To resolve for Berkson’s paradox the collider should not be controlled.

The main takeaway from the resolution of these paradoxes is that controlling for parameters requires a justification. Common causes should be controlled for but colliders should not.

Even though this is common knowledge for those who study causality (e.g, Economics majors), it is unfortunate that most analysts and machine learning practitioners are not aware of this (including myself in 2020 after over 15 years of analysis and predictive modelling experience).

“Oddly, statisticians both over- and underrate the importance of confounders“ — Judea Pearl

Summary

The main takeaway from this post is that the story behind the data is as important as the data itself.

Appreciating this will help you avoid result misinterpretation as spurious correlations and, as demonstrated here, in Simpson’s and Berskon’s paradoxes.

Causal Graphs are an essential tool to visualise the story behind the data. By using them to solve for the paradoxes we learnt that controlling for variables requires justification (common causes ✅, colliders ⛔️).

For those interested in taking the next step in their causal journey I highly suggest mastering Simpson’s paradox. One great way is by playing with data. Feel free to do so with my interactive “Simpson-calculator” 🧮.

Loved this post? 💌 Join me on LinkedIn or ☕ Buy me a coffee!

Credits

Unless otherwise noted, all images were created by the author.

Many thanks to Jim Parr, Will Reynolds, Hedva Kazin and Betty Kazin for their useful comments.

Wondering what your next step should be in your causal journey? Check out my new article on mastering Simpson’s Paradox — you will never look at data the same way. 🔎

Useful Resources

Here I provide resources that I find useful as well as a shopping list of topics for beginners to learn.

📚 Books

The Book of Why — popular science reading (NY Times level)
Causal Inference in Statistics A Primer — excellent short technical book (site)
Causal Inference and Discovery in Python by Aleksander Molak (Packt, github) — clearly explained with python applications 🐍.
What If? — a cohesive presentation of concepts of, and methods for, causal inference (site, github)
Causal Inference The Mixtape — Social Science focused using Python, R and Strata (site, resources, mooc)
Counterfactuals and Causal Inference — Methods and Principles (Social Science focused)

This list is far from comprehensive, but I’m glad to add to it if anyone has suggestions (please mention why the book stands out from the pack).

🔏 Courses

There are probably a few courses online. I love the 🆓 one of Brady Neil bradyneal.com/causal-inference-course.

Clearly explained
Covers many aspects
Thorough
Provides memorable examples
F.R.E.E

One paid course 💰 that is targeted to practitioners is Altdeep.

💾 Software

This list is far from comprehensive because the space is rapidly growing:

Causal Wizard app also have an article about Causal Diagram tools.

🐾 Suggested Next Steps In The Causal Journey

Here I highlight a list of topics which I would have found useful when I started my learnings in the field. If I’m missing anything I’d be more than glad to get feedback and adding. I bold face the ones which were briefly discussed here.

Pearl’s Causal Hierarchy of seeing, doing, imagining and their applications. This is an approved modification of the original illustration by Maayan Harel from MaayanVisuals.com in The Book of Why.

Pearl’s Causal Hierarchy of seeing, doing and imagining (figure above)
Observational data vs. Randomised Control Trials
d-separation, common causes, colliders, mediators, instrumental variables
Causal Graphs
Structural Causal Models
Assumptions: Ignorability, SUTVA, Consistency, Positivity
“Do” Algebra — assessing impact on cohorts by intervention
Counterfactuals — assessing impact on individuals by comparing real outcomes to potential ones
The fundamental problem of causality
Estimand, Estimator, Estimate, Identifiability — relating causal definitions to observable statistics (e.g, conditional probabilities)
Causal Discovery — finding causal graphs with data (e.g, Markov Equivalence)
Causal Machine Learning (e.g, Double Machine Learning)

For completeness it is useful to know that there are different streams of causality. Although there is a lot of overlap you may find that methods differ in naming convention due to development in different fields of research: Computer Science, Social Sciences, Health, Economics

Here I used definitions mostly from the Pearlian perspective (as developed in the field of computer science).

The Story Behind This Post

This narrative is a result of two study groups that I have conducted in a previous role to get myself and colleagues to learn about causality, which I felt missing in my skill set. If there is any interest I’m glad to write a post about the study group experience.

This intro was created as the one I felt that I needed when I started my journey in causality.

In the first iteration of this post I wrote and presented the limitations of spurious correlations and Simpson’s paradox. The main reason for this revision to focus on two paradoxes is that, whereas most causality intros focus on the limitations of correlations, I feel that understanding the concept of justification of confounders is important for all analysts and machine learning practitioners to be aware of.

On September 5th 2024 I have presented this content in a contributed talk at the Royal Statistical Society Annual Conference in Brighton, England (abstract link).

Unfortunately there is no recording but there are of previous talks of mine:

The slides are available at bit.ly/start-ask-why. Presenting this material for the first time at PyData Global 2021

Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy, bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Wi-Fi 8 is coming and it’s going to make AI a lot faster

Traditional Wi-Fi optimizes for 90/10 download-to-upload ratios. AI applications push toward 50/50 symmetry. Voice assistants, edge AI processing and sensor data all require consistent uplink capacity. “AI traffic looks different,” Szymanski explained. “It’s increasingly symmetric, with heavy uplink demands from these edge devices. These devices are pushing all this data

AI certifications outpace non-certified AI skills in pay growth

A key finding is that non-certified IT skills show significantly more volatility than certifications. Over the past year, an average of 35% of non-certified IT skills changed in market value every 90 days, compared to 19% for IT certifications. Foote Partners explained how each type of skill is valued: non-certified

AWS DNS error hits DynamoDB, causing problems for multiple services and customers

By this time, it was clear that the problems were not confined to users or services on the US East Coast. “Global services or features that rely on US-EAST-1 endpoints such as IAM updates and DynamoDB Global tables may also be experiencing issues,” it said. By 2:27 a.m. Pacific time,

NextDecade names Mott as interim CFO

NextDecade Corp. has appointed company senior vice-president Michael (Mike) Mott as interim chief financial officer, effective Oct. 20, 2025. Mott will take over from Brent Wahl, who resigns from the company as chief financial officer, effective Oct. 20. Wahl was named chief financial officer of NextDecade in 2021 after having served

EU Strikes Deal to Ban Russian Gas by End-2027

European Union energy ministers agreed a joint position on plans to ban all gas supplies from Russia by the end of 2027, as the bloc looks to definitively end its reliance on energy from Moscow. A qualified majority of officials meeting in Luxembourg Monday supported the ban, which starts by prohibiting Russian supplies under existing short-term contracts by mid-June, with an exemption for landlocked countries such as Hungary and Slovakia. A prohibition on long-term deals follows 18 months later. Hungary and Slovakia did not support the ban. The deal on Monday was a procedural step on the RePowerEU regulation, which aims to permanently end Europe’s dependence on Russian fossil fuels. Negotiations with the European Parliament, which is calling for a faster exit from Russian gas and a halting of oil imports from the start of next year, can now start. The aim is to reach a final deal before the end of the year. The EU is pursuing a two-pronged strategy to finally end its addiction to Russian fossil fuels after President Vladimir Putin invaded Ukraine in 2022. The second strand involves a separate proposal to impose sanctions on imports of Russian liquefied natural gas from the start of 2027. It requires unanimity and is still being discussed. Europe has been under pressure from the US to speed up moves to sever its energy ties with Moscow, and buy more American LNG. A joint statement on EU-US trade pledged $750 billion in energy deals between the two over the next three years. The EU receives about 15% of its LNG supplies from Moscow, making Russia the second-largest provider of the fuel to Europe after the US, with the monthly bill for those imports ranging between €500 million ($584 million) and €700 million. “This is not just for the present conflict,”

Oil Dips as Surplus Signs Grow

Oil edged down slightly amid signs of easing tensions between the US and China, while traders took stock of mounting evidence that a long-anticipated surplus is finally starting to emerge. West Texas Intermediate was little changed to settle near $57 a barrel as investors rolled over positions ahead of the November contract’s expiry this week, adding to choppy trading. US President Donald Trump earlier expressed optimism about a potential deal between the world’s top oil consumers. Enthusiasm surrounding this development was limited, though, as oil stored on tankers rose to a fresh high, among the most tangible signs yet that markets are oversupplied. Oil futures have tumbled more than 20% from their summer highs as the Organization of the Petroleum Exporting countries and its allies ramp up production, while major forecasters project a flood of supplies continuing into next year. Even so, WTI earlier edged into oversold territory on the nine-day relative strength index for the first time since May, a possible indication that prices lurched lower too fast. It also suggests a reversal may be in the cards. “Crude futures continue to trade on the defensive amid ideas a looming supply surplus is near,” said Dennis Kissler, senior vice president for trading at BOK Financial. Price support for WTI rests at around $56.15, though a close below $55 risks a further price slide, he added. Geopolitical forces are also at play. Prices have been weighed down by limited progress toward a de-escalation of the war in Ukraine, a scenario that could push oil toward $50 a barrel, according to Citigroup Inc. President Donald Trump last week said he would hold a second meeting with Russia’s Vladimir Putin seeking to end the conflict, though previous talks have done little to stem the hostilities. Meanwhile, China’s economy slowed for a second

As cyber threats grow, utilities say lapsed information-sharing law stymies security

Dive Brief: Amid rising threats to operational systems and a chaotic geopolitical environment, electric utilities want Congress to cleanly reauthorize the Cybersecurity Information Sharing Act of 2015, which allows for greater information sharing between the power sector and federal government. The law lapsed October 1. A temporary extension was included in the government funding bill, which failed and resulted in the current shutdown. A bipartisan Senate bill could bring CISA’s protections back into force. It is vital that utilities are able to share threat information as the risks are rising, said Kristine Martz, a principal product advisor at cybersecurity firm Dragos. “Adversaries are becoming aware of the impact that they can achieve against easy to access industrial control systems,” or ICS, she said Friday at a conference hosted by Columbia University’s School of International and Public Affairs. Dive Insight: “We’ve seen a consistent rise in threat activity over the years,” Martz said, noting new threat adversaries are focused on operational technology and ICS environments where they can impact the delivery of services. “They get in through these internet-facing devices and just live off the land for a long time to perform reconnaissance, pulling down things like your GIS data, your network maps,” Martz said. “Living off the land” refers to cyber intruders using legitimate network tools to cover their presence and gain information. While utility regulations like the North American Electric Reliability Corp.’s Critical Infrastructure Protection standards have helped create a baseline of security and shored up obvious weaknesses, Dragos has identified new threat groups developing operational and ICS-specific malware which take advantage of the extensive knowledge of utility work environments that hackers can gain from their research, Martz said. Given the threat, and in an environment of rapidly growing electricity demand, it is vital that electric utilities are able to share information

Residential electricity prices surge ahead of C&I rates: Berkeley Lab

Residential electricity retail prices rose more rapidly than commercial and industrial prices from 2019 to 2024, according to a study released last week by the Lawrence Berkeley National Laboratory. Nationwide, average residential prices jumped 27% to 16.5 cents/kWh in the five years since 2019 while average commercial prices increased 19% to 12.8 cents/kWh and industrial prices climbed 19% to 8.1 cents/kWh, the LBNL researchers said in a summary of the report examining factors that affect electricity prices. Overall retail electricity prices fell in 37 states from 2019 to 2024 when adjusting for inflation, the researchers said. Driven by wildfire-related costs, inflation-adjusted electricity prices surged 6.2% in California — the most among states, according to the report. Real prices also increased in the Northeast in the five-year period. Optional Caption Retrieved from Lawrence Berkeley National Laboratory. Last year, retail electricity prices ranged from less than 8 cents/kWh in North Dakota to more than 27 cents/kWh in California, according to the LBNL report. The LBNL researchers found that state energy policies can contribute to rising electricity prices. “States with the largest price increases in recent years typically featured shrinking customer loads — partially linked to growth in net metered behind-the-meter solar — and had [renewable portfolio standard] programs in concert with relatively costly incremental renewable energy supplies,” the researchers said. States with RPS programs that called for new supplies in the last five years increased retail electricity prices by about 0.4 cents/kWh, according to the study. However, electricity prices appear unaffected by “market-based” utility-scale renewable energy projects built outside of RPS mandates, the LBNL researchers found. Also, while behind-the-meter solar cut net electricity load in some states — by more than 5% in California, Maine and Rhode Island, for example — it is linked to higher electricity prices, according to the study.

Oil’s Billion Barrel Sea Surplus Expands

A flotilla of crude oil on the world’s oceans expanded to a fresh high as producer nations keep adding barrels and the tankers sail further for deliveries. A total of 1.24 billion barrels of crude and condensate, a light form of oil recovered from gas fields, was moving on tankers in the week to Oct. 17, according to data from analytics firm Vortexa. That was up from a revised 1.22 billion barrels a week earlier. Oil traders warned last week that a long-anticipated surplus is finally starting to materialize and the amount of cargo at sea is one indicator of that. Production is rising from members of the OPEC+ group of nations, which are unwinding earlier output cuts — as well as countries outside the group, predominantly in the Americas, where Guyana recently started pumping from a new offshore field and US output hit a new high. The build-up comes at a time when demand growth is slowing, with forecasters predicting a surplus that could rise to as much as 4 million barrels a day in the early months of next year. Oil prices fell 0.8% on Monday, taking their decline so far this year to 18%. Eight members of the Organization of the Petroleum Exporting Countries and their allies, which together make up the OPEC+ grouping, raised their collective production target by almost 2.5 million barrels a day between March and September. While increases in actual production have lagged, the group still added more than 2 million barrels a day to supply over that period. Vortexa’s figures exclude oil in floating storage, defined as being on vessels that have been stationary for at least seven days. The biggest increases have come from Saudi Arabia, the United Arab Emirates and Russia, whose combined output has risen by 1.77 million barrels a day.

Bullish Weather Shift Sparks NatGas Revival

In an EBW Analytics Group report sent to Rigzone by the EBW team on Monday, Eli Rubin, an energy analyst at the company, noted that a “bullish weather shift spark[ed]… [a] natural gas revival”. “After testing as low as $2.893 per million British thermal units (MMBtu) intraday Friday – and with weekend Henry Hub spot prices falling to $2.66 [per MMBtu] – a cooler weekend weather shift has revived bullish fortunes for the November contract,” Rubin said in the report. “Further, weekly average LNG was at a record high over the weekend and natural gas production readings slumped in the Permian and Marcellus,” Rubin added. In the report, Rubin noted that “some meteorologists reflect a larger heating demand gain, helping to explain the pop at the front of the curve”. “Cooler weather (the next three EIA [U.S. Energy Information Administration] weeks are expected to be below long-term normals) slashes risks of bearish outcomes akin to early November 2024,” Rubin said. The energy analyst stated in the report that “shorts covering some positions likely underlies the market reaction higher”. “Still, lofty storage, mild weather, and returning production remain,” Rubin warned in the report. “The $3.22-3.24 per MMBtu level is a key technical battleground, however, and if weather models continue colder or bulls lift November above key resistance, extended near-term upside potential may occur,” he added. EBW’s report highlighted that the November natural gas contract closed at $3.008 per MMBtu on Friday. This was up 7.0 cents, or 2.4 percent, from Thursday’s close, the report outlined. In a separate EBW report sent to Rigzone by the EBW team on Friday, Rubin warned that “near to medium term natural gas weakness extend[ed]”. “Yesterday’s [Thursday] EIA-reported 80 billion cubic foot injection confirmed a lofty storage trajectory, driving the November 2025 natural gas contract to

Riverbed tackles AI data bottleneck with new Oracle-based service

“Customers are looking for faster, more secure ways to move massive datasets so they can bring AI initiatives to life,” said Sachin Menon, Oracle’s vice president of cloud engineering, in a statement. “With Riverbed Data Express Service deployed on OCI, organizations will be able to accelerate time to value, reduce costs, and help ensure that their data remains protected.” Riverbed’s Aras explains that its Data Express Service uses post-quantum cryptography (PQC) to move petabyte-scale datasets through secure VPN tunnels to ensure that customer data remains protected during the transfer process. The technology is based on Riverbed’s SteelHead acceleration platform running RiOS 10 software. “Our cloud-optimized technology design delivers much higher data retrieval, data movement across the network, and data write rates, through highly performant data mover instances, instance parallelization and matched network fabric configurations. The design is tailored for each cloud, to ensure maximal performance can be achieved using cloud-specific product adjustments,” Aras says. “The time for preventing harvest-now, decrypt-later is now,” Aras says, referring to the security threat where encrypted data is intercepted and stored for decryption once quantum computers become powerful enough. The Riverbed service addresses use cases spanning AI model training, inference operations, and emerging agentic AI applications. Data Express is initially deployed on Oracle Cloud Infrastructure, but Riverbed said the service will orchestrate data movement across AWS, Azure, and Google Cloud Platform, as well as on-premises data centers. General availability is planned for Q4 2025.

Roundup: Digital Realty Marks Major Milestones in AI, Quantum Computing, Data Center Development

Key features of the DRIL include: • High-Density AI and HPC Testing. The DRIL supports AI and high-performance computing (HPC) workloads with high-density colocation, accommodating workloads up to 150 kW per cabinet. • AI Infrastructure Optimization. The ePlus AI Experience Center lets businesses explore AI-specific power, cooling, and GPU resource requirements in an environment optimized for AI infrastructure. • Hybrid Cloud Validation. With direct cloud connectivity, users can refine hybrid strategies and onboard through cross connects. • AI Workload Orchestration. Customers can orchestrate AI workloads across Digital Realty’s Private AI Exchange (AIPx) for seamless integration and performance. • Latency Testing Across Locations. Enterprises can test latency scenarios for seamless performance across multiple locations and cloud destinations. The firm’s Northern Virginia campus is the primary DRIL location, but companies can also test latency scenarios between there and other remote locations. DRIL rollout to other global locations is already in progress, and London is scheduled to go live in early 2026. Digital Realty, Redeployable Launch Pathway for Veteran Technical Careers As new data centers are created, they need talented workers. To that end, Digital Realty has partnered with Redeployable, an AI-powered career platform for veterans, to expand access to technical careers in the United Kingdom and United States. The collaboration launched a Site Engineer Pathway, now live on the Redeployable platform. It helps veterans explore, prepare for, and transition into roles at Digital Realty. Nearly half of veterans leave their first civilian role within a year, often due to unclear expectations, poor skill translation, and limited support, according to Redeployable. The Site Engineer Pathway uses real-world relevance and replaces vague job descriptions with an experience-based view of technical careers. Veterans can engage in scenario-based “job drops” simulating real facility and system challenges so they can assess their fit for the role before applying. They

BlackRock’s $40B data center deal opens a new infrastructure battle for CIOs

Everest Group partner Yugal Joshi said, “CIOs are under significant pressure to clearly define their data center strategy beyond traditional one-off leases. Given most of the capacity is built and delivered by fewer players, CIOs need to prepare for a higher-price market with limited negotiation power.” The numbers bear this out. Global data center costs rose to $217.30 per kilowatt per month in the first quarter of 2025, with major markets seeing increases of 17-18% year-over-year, according to CBRE. Those prices are at levels last seen in 2011-2012, and analysts expect them to remain elevated. Gogia said, “The combination of AI demand, energy scarcity, and environmental regulation has permanently rewritten the economics of running workloads. Prices that once looked extraordinary have now become baseline.” Hyperscalers get first dibs The consolidation problem is compounded by the way capacity is being allocated. North America’s data center vacancy rate fell to 1.6% in the first half of 2025, with Northern Virginia posting just 0.76%, according to CBRE Research. More troubling for enterprises: 74.3% of capacity currently under construction is already preleased, primarily to cloud and AI providers. “The global compute market is no longer governed by open supply and demand,” Gogia said. “It is increasingly shaped by pre-emptive control. Hyperscalers and AI majors are reserving capacity years in advance, often before the first trench for power is dug. This has quietly created a two-tier world: one in which large players guarantee their future and everyone else competes for what remains.” That dynamic forces enterprises into longer planning cycles. “CIOs must forecast their infrastructure requirements with the same precision they apply to financial budgets and talent pipelines,” Gogia said. “The planning horizon must stretch to three or even five years.”

Nvidia, Infineon partner for AI data center power overhaul

The solution is to convert power right at the GPU on the server board and to upgrade the backbone to 800 volts. That should squeeze more reliability and efficiency out of the system while dealing with the heat, Infineon stated. Nvidia announced the 800 Volt direct current (VDC) power architecture at Computex 2025 as a much-needed replacement for the 54 Volt backbone currently in use, which is overwhelmed by the demand of AI processors and increasingly prone to failure. “This makes sense with the power needs of AI and how it is growing,” said Alvin Nguyen, senior analyst with Forrester Research. “This helps mitigate power losses seen from lower voltage and AC systems, reduces the need for materials like copper for wiring/bus bars, better reliability, and better serviceability.” Infineon says a shift to a centralized 800 VDC architecture allows for reduced power losses, higher efficiency and reliability. However, the new architecture requires new power conversion solutions and safety mechanisms to prevent potential hazards and costly server downtimes such as service and maintenance.

Meta details cutting-edge networking technologies for AI infrastructure

ESUN initiative As part of its standardization efforts, Meta said it would be a key player in the new Ethernet for Scale-Up Networking (ESUN) initiative that brings together AMD, Arista, ARM, Broadcom, Cisco, HPE Networking, Marvell, Microsoft, NVIDIA, OpenAI and Oracle to advance the networking technology to handle the growing scale-up domain for AI systems. ESUN will focus solely on open, standards-based Ethernet switching and framing for scale-up networking—excluding host-side stacks, non-Ethernet protocols, application-layer solutions, and proprietary technologies. The group will focus on the development and interoperability of XPU network interfaces and Ethernet switch ASICs for scale-up networks, the OCP wrote in a blog. ESUN will actively engage with other organizations such as Ultra-Ethernet Consortium (UEC) and long-standing IEEE 802.3 Ethernet to align open standards, incorporate best practices, and accelerate innovation, the OCP stated. Data center networking milestones The launch of ESUN is just one of the AI networking developments Meta shared at the event. Meta engineers also announced three data center networking innovations aimed at making its infrastructure more flexible, scalable, and efficient: The evolution of Meta’s Disaggregated Scheduled Fabric (DSF) to support scale-out interconnect for large AI clusters that span entire data center buildings. A new Non-Scheduled Fabric (NSF) architecture based entirely on shallow-buffer, disaggregated Ethernet switches that will support our largest AI clusters like Prometheus. The addition of Minipack3N, based on Nvidia’s Ethernet Spectrum-4 ASIC, to Meta’s portfolio of 51Tbps OCP switches that use OCP’s Switch Abstraction Interface and Meta’s Facebook Open Switching System (FBOSS) software stack. DSF is Meta’s open networking fabric that completely separates switch hardware, NICs, endpoints, and other networking components from the underlying network and uses OCP-SAI and FBOSS to achieve that, according to Meta. It supports Ethernet-based RoCE RDMA over Converged Ethernet (RoCE/RDMA)) to endpoints, accelerators and NICs from multiple vendors, such as Nvidia,

Arm joins Open Compute Project to build next-generation AI data center silicon

Keeping up with the demand comes down to performance, and more specifically, performance per watt. With power limited, OEMs have become much more involved in all aspects of the system design, rather than pulling silicon off the shelf or pulling servers or racks off the shelf. “They’re getting much more specific about what that silicon looks like, which is a big departure from where the data center was ten or 15 years ago. The point here being is that they look to create a more optimized system design to bring the acceleration closer to the compute, and get much better performance per watt,” said Awad. The Open Compute Project is a global industry organization dedicated to designing and sharing open-source hardware configurations for data center technologies and infrastructure. It covers everything from silicon products to rack and tray design. It is hosting its 2025 OCP Global Summit this week in San Jose, Calif. Arm also was part of the Ethernet for Scale-Up Networking (ESUN) initiative announced this week at the Summit that included AMD, Arista, Broadcom, Cisco, HPE Networking, Marvell, Meta, Microsoft, and Nvidia. ESUN promises to advance Ethernet networking technology to handle scale-up connectivity across accelerated AI infrastructures. Arm’s goal by joining OCP is to encourage knowledge sharing and collaboration between companies and users to share ideas, specifications and intellectual property. It is known for focusing on modular rather than monolithic designs, which is where chiplets come in. For example, customers might have multiple different companies building a 64-core CPU and then choose IO to pair it with, whether like PCIe or an NVLink. They then choose their own memory subsystem, deciding whether to go HBM, LPDDR, or DDR. It’s all mix and match like Legos, Awad said.

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs). In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle