
➡️ Start Asking Your Data ‘Why?’ — A Gentle Intro To Causality


Correlation does not imply causation. It turns out, however, that with some simple ingenious tricks one can, potentially, unveil causal relationships within standard observational data, without having to resort to expensive randomised control trials.

This post is targeted towards anyone making data-driven decisions. The main takeaway message is that causal inference may be possible by understanding that the story behind the data is as important as the data itself.

By introducing Simpson’s and Berkson’s Paradoxes, situations where the outcome of a population is in conflict with that of its cohorts, I shine a light on the importance of using causal reasoning to identify these paradoxes in data and avoid misinterpretation. Specifically, I introduce causal graphs as a method to visualise the story behind the data and point out that by adding this to your arsenal you are likely to conduct better analyses and experiments.

My ultimate objective is to whet your appetite to explore more on causality, as I believe that by asking data “Why?” you will be able to go beyond correlation calculations and extract more insights, as well as avoid common misjudgement pitfalls.

Note that throughout this gentle intro I do not use equations but demonstrate using accessible intuitive visuals. That said, I provide resources for you to take your next step in adding Causal Inference to your statistical toolbox so that you may get more value from your data.

The Era of Data Driven Decision Making

In [Deity] We Trust, All Others Bring Data! — W. Edwards Deming

In this digital age it is common to put a lot of faith in data. But this raises an overlooked question: Should we trust data on its own?

Judea Pearl, who is considered the godfather of Causality, articulated it best:

“The collection of information is as important as the information itself” — Judea Pearl

In other words the story behind the data is as important as the data itself.

Judea Pearl is considered the Godfather of Causality. Credit: Aleksander Molak

This manifests in a growing awareness of the importance of identifying bias in datasets. By the end of this post I hope that you will appreciate that causality provides the fundamental tools to best express, quantify and attempt to correct for these biases.

In causality introductions it is customary to demonstrate why “correlation does not imply causation” by highlighting limitations of association analysis due to spurious correlations (e.g., shark attacks 🦈 and ice-cream sales 🍦). In an attempt to reduce the length of this post I defer this aspect to an older one of mine. Here I focus on two mind-boggling paradoxes 🤯 and their resolution via causal graphs to make a similar point.

Paradoxes in Analysis

To understand the importance of the story behind the data we will examine two counter-intuitive (but nonetheless true) paradoxes which are classical situations of data misinterpretation.

In the first we imagine a clinical trial in which patients are given a treatment and that results in a health score. Our objective is to assess the average impact of increased treatment on the health outcome. For pedagogical purposes in these examples we assume that samples are representative (i.e., the sample size is not an issue) and that variances in measurements are minimal.

Population outcome of imaginary clinical trial. Each dot is one patient and the red line indicates the naïve population trend.

In the figure above we learn that on average increasing the treatment appears to be beneficial since it results in a better outcome.

Now we’ll color code by age and gender groupings and examine how increasing the treatment impacts each cohort.

Same data as before where each symbol represents an age-gender cohort.

Track any cohort (e.g., “Girls” representing young females) and you immediately realise that an increase in treatment appears adverse.

What is the conclusion of the study? On the one hand increasing the treatment appears to be better for the population at large, but when examining gender-age cohorts it seems disadvantageous. This is Simpson’s Paradox, which may be stated:

“Trends can exist in subgroups but reverse for the whole”
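To make the reversal concrete, here is a minimal sketch in Python with tiny made-up numbers (not the data behind the figures above): every cohort's slope is negative while the pooled slope is positive.

```python
import numpy as np
import pandas as pd

# Made-up numbers purely for illustration: within each cohort the outcome drops
# as treatment increases, but the cohort given the higher dosage also has a
# higher baseline outcome.
df = pd.DataFrame({
    "cohort":    ["girls"] * 3 + ["women"] * 3,
    "treatment": [1, 2, 3, 6, 7, 8],
    "outcome":   [70, 65, 60, 95, 90, 85],
})

# Naive population trend: slope of outcome vs. treatment over everyone
pooled_slope = np.polyfit(df["treatment"], df["outcome"], deg=1)[0]
print(f"pooled slope: {pooled_slope:+.2f}")      # positive

# Per-cohort trends: each subgroup shows the opposite sign
for name, grp in df.groupby("cohort"):
    cohort_slope = np.polyfit(grp["treatment"], grp["outcome"], deg=1)[0]
    print(f"{name} slope: {cohort_slope:+.2f}")  # negative
```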

Below we will resolve this paradox using causality tools, but beforehand let’s explore another interesting one, which also examines made-up data.

Imagine that we quantify for the general population their attractiveness and how talented they are as in this figure:

General population. Source: Wikipedia, created by Cmglee

We find no apparent correlation.

Now we’ll focus on an unusual subset — famous people:

A subset of celebrities. Source: Wikipedia, created by Cmglee

Here we clearly see an anti-correlation that doesn’t exist in the general population.

Should we conclude that Talent and Attractiveness are independent variables as per the first plot of the general population or that they are correlated as per that of celebrities?

This is Berkson’s Paradox, where one population has a trait trend that another lacks.

Whereas an algorithm would identify these correlations, resolving these paradoxes requires a full understanding of the context, which normally is not fed to a computer. In other words, without knowing the story behind the data, results may be misinterpreted and wrong conclusions may be inferred.

Mastering the identification and resolution of these paradoxes is an important first step toward elevating one’s analyses from correlations to causal inference.

Whereas these simple examples may be explained away logically, for the purpose of learning causal tools I’ll introduce Causal Graphs in the next section.

Causal Graphs — Visualising The Story Behind The Data

“[From the Simpson’s and Berkson’s Paradoxes we learn that] certain decisions cannot be made on the basis of data alone, but instead depend on the story behind the data. … Graph Theory enables these stories to be conveyed” — Judea Pearl

Causal graph models are probabilistic graphical models used to visualise the story behind the data. They are perhaps one of the most powerful tools for analysts, yet they are not taught in most statistics curricula. They are both elegant and highly informative. Hopefully by the end of this post you will appreciate it when Judea Pearl says that this is the missing vocabulary to communicate causality.

To understand causal graph models (or causal graphs for short) we start with the following illustration of an example undirected graph with four nodes/vertices and three edges.

An undirected graph with four nodes/vertices and three edges

Each node is a variable and the edges communicate “who is related to whom?” (i.e., correlations, joint probabilities). A directed graph is one in which we add arrows, as in this figure.

A directed graph with four nodes/vertices and five directed edges

A directed edge communicates “who listens to whom?”, which is the essence of causation.

In this specific example you can notice a cyclical relationship between the C and D nodes. A useful subset of directed graphs are the directed acyclic graphs (DAGs), which have no cycles, as in the next figure.

A directed acyclic graph with four nodes/vertices and four directed edges

Here we see that when starting from any node (e.g., A) there isn’t a path that gets back to it.

DAGs are the go-to choice in causality because the absence of feedback between parameters greatly simplifies the flow of information. (For mechanisms that do have feedback, e.g., temporal systems, one may consider rolling out nodes as a function of time, but that is beyond the scope of this intro.)
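As a minimal sketch (assuming, for illustration, the edge set A→B, A→C, B→D, C→D, which may differ from the exact figure), a DAG can be represented as a directed graph and checked for cycles, for example with networkx:

```python
import networkx as nx

# A four-node directed acyclic graph: A -> B, A -> C, B -> D, C -> D
dag = nx.DiGraph([("A", "B"), ("A", "C"), ("B", "D"), ("C", "D")])
print(nx.is_directed_acyclic_graph(dag))             # True: no path returns to its start

# Adding a feedback edge D -> C introduces a cycle, so it is no longer a DAG
with_feedback = dag.copy()
with_feedback.add_edge("D", "C")
print(nx.is_directed_acyclic_graph(with_feedback))   # False
```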

Causal graphs are powerful at conveying the cause/effect relationships between the parameters and hence how the data was generated (the story behind the data).

From a practical point of view, graphs enable us to understand which parameters are confounders that need to be controlled for and, just as important, which ones not to control for, because doing so creates spurious correlations. This will be demonstrated below.

The practice of attempting to build a causal graph enables:

  • Designing better experiments.
  • Drawing causal conclusions (going beyond correlations by means of representing interventions and counterfactuals and encoding conditional independence relationships; all beyond the scope of this post).

To further motivate the usage of causal graph models we will use them to resolve the Simpson’s and Berkson’s paradoxes introduced above.

💊 Causal Graph Resolution of Simpson’s Paradox

For simplicity we’ll examine Simpson’s paradox focusing on two cohorts, male and female adults.

Outcome of the imaginary therapeutic trial, similar to the previous one but focusing on the adults. Each symbol is one patient from the respective age-gender cohort and the red line indicates the naïve population trend.

Examining this data we can make three statements about three variables of interest:

  • Gender is an independent variable (it does not “listen to” the other two)
  • Treatment depends on Gender (as we can see, in this setting the level given depends on Gender — women have been given, for some reason, a higher dosage.)
  • Outcome depends on both Gender and Treatment

According to these we can draw the causal graph as the following:

Simpson’s paradox Graphic Model where Gender is a confounding variable between Treatment and Outcome

Notice how each arrow helps communicate the statements above. Just as important, the lack of an arrow pointing into Gender conveys that it is an independent variable.

We also notice that, by having arrows pointing from it to both Treatment and Outcome, Gender is considered a common cause of the two.

The essence of the Simpson’s paradox is that although the Outcome is affected by changes in Treatment, as expected, there is also a backdoor path flow of information via Gender.
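As a small sketch (using networkx again, purely for illustration), the graph above can be written down explicitly, and the confounder shows up as a shared parent of Treatment and Outcome, which is exactly what opens the backdoor path:

```python
import networkx as nx

# Simpson's paradox graph: Gender -> Treatment, Gender -> Outcome, Treatment -> Outcome
g = nx.DiGraph([
    ("Gender", "Treatment"),
    ("Gender", "Outcome"),
    ("Treatment", "Outcome"),
])

# A common cause is any shared parent of Treatment and Outcome;
# it opens the backdoor path Treatment <- Gender -> Outcome.
common_causes = set(g.predecessors("Treatment")) & set(g.predecessors("Outcome"))
print(common_causes)  # {'Gender'}
```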

As you may have guessed by this stage, the solution to this paradox is that the common cause Gender is a confounding variable that needs to be controlled.

Controlling for a variable, in terms of a causal graph, means eliminating the relationship between Gender and Treatment.

This may be done in two manners:

  • Pre data collection: Setting up a Randomised Control Trial (RCT) in which participants are given a dosage regardless of their Gender.
  • Post data collection: e.g., in this made-up scenario the data has already been collected, and hence we need to deal with what is referred to as Observational Data.

In both the pre- and post-data-collection cases, eliminating the dependency of Treatment on Gender (i.e., controlling for Gender) may be done by modifying the graph such that the arrow between them is removed, as in the following:

A modified version of the Simpson’s paradox Graphic Model. The dark node means we control for Gender.

Applying this “graphical surgery” means that the last two statements need to be modified (for convenience I’ll write all three):

  • Gender is an independent variable
  • Treatment is an independent variable
  • Outcome depends on Gender and Treatment (but with no backdoor path).

This enables us to obtain the causal relationship of interest: we can assess the direct impact of modifying Treatment on the Outcome.
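A minimal sketch of this “graphical surgery”, continuing the networkx illustration above: removing the Gender → Treatment edge leaves no shared parent, so the backdoor path is closed.

```python
import networkx as nx

g = nx.DiGraph([
    ("Gender", "Treatment"),
    ("Gender", "Outcome"),
    ("Treatment", "Outcome"),
])

# Graphical surgery: controlling for Gender removes its influence on Treatment
g.remove_edge("Gender", "Treatment")

common_causes = set(g.predecessors("Treatment")) & set(g.predecessors("Outcome"))
print(common_causes)  # set(): no confounder left between Treatment and Outcome
```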

The process of controlling for a confounder, i.e., manipulating the data generation process, is formally referred to as applying an intervention. That is to say, we are no longer passive observers of the data; we are taking an active role in modifying it to assess the causal impact.

How is this manifested in practice?

In the case of RCTs the researcher needs to control for important confounding variables. Here we limit the discussion to Gender (but in real world settings you can imagine other variables such as Age, Social Status and anything else that might be relevant to one’s health).

RCTs are considered the gold standard for causal analysis in many experimental settings thanks to their practice of controlling for confounding variables. That said, they have several drawbacks:

  • It may be expensive to recruit individuals and may be complicated logistically
  • The intervention under investigation may not be physically possible or ethical to conduct (e.g., one can’t ask randomly selected people to smoke or not for ten years)
  • Artificial setting of a laboratory — not a true natural habitat of the population.

Observational data, on the other hand, is much more readily available in industry and academia and hence much cheaper, and it could be more representative of the actual habits of the individuals. But, as illustrated in the Simpson’s paradox diagram, it may have confounding variables that need to be controlled for.

This is where ingenious solutions developed in the causal community in the past few decades are making headway. Detailing them is beyond the scope of this post, but I briefly mention how to learn more at the end.

To resolve Simpson’s paradox with the given observational data one:

  1. Calculates, for each cohort, the impact of a change in treatment on the outcome
  2. Calculates a weighted average of these cohort contributions over the population (see the sketch below).

Here we will focus on intuition, but in a future post we will describe the maths behind this solution.
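That said, for a concrete feel of the two-step recipe, here is a minimal sketch in Python (the column names and the tiny dataset are hypothetical, reused from the earlier Simpson's sketch):

```python
import numpy as np
import pandas as pd

def adjusted_treatment_effect(df: pd.DataFrame) -> float:
    """Cohort-stratified (adjusted) slope of outcome on treatment."""
    slopes, weights = [], []
    for _, grp in df.groupby("cohort"):
        # Step 1: within-cohort impact of a change in treatment on the outcome
        slopes.append(np.polyfit(grp["treatment"], grp["outcome"], deg=1)[0])
        # Step 2: weight each cohort by its share of the population
        weights.append(len(grp) / len(df))
    return float(np.dot(slopes, weights))

# Reusing the made-up dataset from the earlier sketch
df = pd.DataFrame({
    "cohort":    ["girls"] * 3 + ["women"] * 3,
    "treatment": [1, 2, 3, 6, 7, 8],
    "outcome":   [70, 65, 60, 95, 90, 85],
})
print(adjusted_treatment_effect(df))  # negative, matching the within-cohort trend
```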

I am sure that many analysts, just like myself, have noticed Simpson’s paradox at some stage in their data and hopefully have corrected for it. Now you know the name of this effect and will hopefully start to appreciate how useful causal tools are.

That said … being confused at this stage is OK 😕

I’ll be the first to admit that I struggled to understand this concept and it took me three weekends of deep diving into examples to internalise it. This was the gateway drug to causality for me. Part of my process for understanding statistics is playing with data. For this purpose I created an interactive web application hosted on Streamlit which I call the Simpson’s Calculator 🧮. I’ll write a separate post about it in the future.

Even if you are still confused, the main takeaways of Simpson’s paradox are that:

  • It is a situation where trends can exist in subgroups but reverse for the whole.
  • It may be resolved by identifying confounding variables between the treatment and the outcome variables and controlling for them.

This raises the question — should we just control for all variables except for the treatment and outcome? Let’s keep this in mind when resolving for the Berkson’s paradox.

🦚 Causal Graph Resolution of Berkson’s Paradox

As in the previous section we are going to make clear statements about how we believe the data was generated and then draw these in a causal graph.

Let’s examine the case of the general population; for convenience I’m copying the image from above:

General population. Source: Wikipedia, created by Cmglee

Here we understand that:

  • Talent is an independent variable
  • Attractiveness is an independent variable

A causal graph for this is quite simple, two nodes without an edge.

In the general population one’s Talent and Attractiveness are independent

Let’s examine the plot of the celebrity subset.

A subset of celebrities. Source: Wikipedia, created by Cmglee

The cheeky insight from this mock data is that the more attractive one is, the less talented they need to be in order to be a celebrity. Hence we can deduce that:

  • Talent is an independent variable
  • Attractiveness is an independent variable
  • The Celebrity variable depends on both the Talent and Attractiveness variables. (Imagine this variable is Boolean: true for celebrities, false otherwise.)

Hence we can draw the causal graph as:

Being a celebrity depends on Talent and Attractiveness

By having arrows pointing into it, Celebrity is a collider node between Talent and Attractiveness.

Berkson’s paradox is the fact that when controlling for celebrities we see an interesting trend (an anti-correlation between Attractiveness and Talent) not seen in the general population.

This can be visualised in the causal graph: by controlling for the Celebrity parameter we create a spurious correlation between the otherwise independent variables Talent and Attractiveness. We can draw this as follows:

Berkson’s paradox Graphic Model. The dark node means we control for Celebrity. Controlling this collider variable generates a spurious correlation (dashed line) between Talent and Attractiveness.

The solution of this Berkson’s paradox should be apparent here: Talent and Attractiveness are independent variables in general, but controlling for the collider Celebrity node creates a spurious correlation in the data.
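This is easy to reproduce in a quick simulation (the selection rule below, roughly "the top few percent of talent plus attractiveness become celebrities", is an assumption purely for illustration):

```python
import numpy as np

rng = np.random.default_rng(seed=42)
n = 100_000

# In the general population Talent and Attractiveness are independent
talent = rng.normal(size=n)
attractiveness = rng.normal(size=n)

# Hypothetical selection rule: the top ~5% of (talent + attractiveness) are celebrities
score = talent + attractiveness
celebrity = score > np.quantile(score, 0.95)

print(np.corrcoef(talent, attractiveness)[0, 1])                        # ~0.00
print(np.corrcoef(talent[celebrity], attractiveness[celebrity])[0, 1])  # clearly negative
```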

Let’s compare the resolution of both paradoxes:

  • Resolving Simpson’s Paradox is done by controlling for the common cause (Gender)
  • Resolving Berkson’s Paradox is done by not controlling for the collider (Celebrity)

The next figure combines both insights in the form of their causal graphs:

Graph models show how to resolve the paradoxes. Dark nodes are controlled for. Left: Modified graph to resolve Simpson’s paradox by controlling for Gender. Right: To resolve Berkson’s paradox the collider should not be controlled for.

The main takeaway from the resolution of these paradoxes is that controlling for parameters requires a justification. Common causes should be controlled for but colliders should not.

Even though this is common knowledge for those who study causality (e.g., Economics majors), it is unfortunate that most analysts and machine learning practitioners are not aware of this (including myself in 2020, after over 15 years of analysis and predictive modelling experience).

Oddly, statisticians both over- and underrate the importance of confounders — Judea Pearl

Summary

The main takeaway from this post is that the story behind the data is as important as the data itself.

Appreciating this will help you avoid misinterpreting results due to spurious correlations and, as demonstrated here, Simpson’s and Berkson’s paradoxes.

Causal Graphs are an essential tool to visualise the story behind the data. By using them to resolve the paradoxes we learnt that controlling for variables requires justification (common causes ✅, colliders ⛔️).

For those interested in taking the next step in their causal journey I highly suggest mastering Simpson’s paradox. One great way is by playing with data. Feel free to do so with my interactive “Simpson-calculator” 🧮.

Loved this post? 💌 Join me on LinkedIn or ☕ Buy me a coffee!

Credits

Unless otherwise noted, all images were created by the author.

Many thanks to Jim Parr, Will Reynolds, Hedva Kazin and Betty Kazin for their useful comments.

Wondering what your next step should be in your causal journey? Check out my new article on mastering Simpson’s Paradox — you will never look at data the same way. 🔎

Useful Resources

Here I provide resources that I find useful as well as a shopping list of topics for beginners to learn.

📚 Books

Credit: Gaelle Marcel
  • The Book of Why — popular science reading (NY Times level)
  • Causal Inference in Statistics: A Primer — excellent short technical book (site)
  • Causal Inference and Discovery in Python by Aleksander Molak (Packt, github) — clearly explained with python applications 🐍.
  • What If? — a cohesive presentation of concepts of, and methods for, causal inference (site, github)
  • Causal Inference: The Mixtape — Social Science focused, using Python, R and Stata (site, resources, mooc)
  • Counterfactuals and Causal Inference — Methods and Principles (Social Science focused)

This list is far from comprehensive, but I’m glad to add to it if anyone has suggestions (please mention why the book stands out from the pack).

🔏 Courses

Credit: Austrian National Library

There are probably a few courses online. I love the 🆓 one by Brady Neal: bradyneal.com/causal-inference-course.

  • Clearly explained
  • Covers many aspects
  • Thorough
  • Provides memorable examples
  • F.R.E.E

One paid course 💰 that is targeted to practitioners is Altdeep.

💾 Software

Credit: Artturi Jalli

This list is far from comprehensive because the space is rapidly growing:

The Causal Wizard app also has an article about Causal Diagram tools.

🐾 Suggested Next Steps In The Causal Journey

Here I highlight a list of topics which I would have found useful when I started my learnings in the field. If I’m missing anything I’d be more than glad to get feedback and add to it. I boldface the ones which were briefly discussed here.

Pearl’s Causal Hierarchy of seeing, doing, imagining and their applications. This is an approved modification of the original illustration by Maayan Harel from MaayanVisuals.com in The Book of Why.
  • Pearl’s Causal Hierarchy of seeing, doing and imagining (figure above)
  • Observational data vs. Randomised Control Trials
  • d-separation, common causes, colliders, mediators, instrumental variables
  • Causal Graphs
  • Structural Causal Models
  • Assumptions: Ignorability, SUTVA, Consistency, Positivity
  • “Do” Algebra — assessing impact on cohorts by intervention
  • Counterfactuals — assessing impact on individuals by comparing real outcomes to potential ones
  • The fundamental problem of causality
  • Estimand, Estimator, Estimate, Identifiability — relating causal definitions to observable statistics (e.g., conditional probabilities)
  • Causal Discovery — finding causal graphs with data (e.g., Markov Equivalence)
  • Causal Machine Learning (e.g., Double Machine Learning)

For completeness it is useful to know that there are different streams of causality. Although there is a lot of overlap, you may find that methods differ in naming convention due to development in different fields of research: Computer Science, Social Sciences, Health and Economics.

Here I used definitions mostly from the Pearlian perspective (as developed in the field of computer science).

The Story Behind This Post

This narrative is the result of two study groups that I conducted in a previous role to get myself and colleagues to learn about causality, which I felt was missing from my skill set. If there is any interest I’m glad to write a post about the study group experience.

This intro was created as the one I felt that I needed when I started my journey in causality.

In the first iteration of this post I wrote about and presented the limitations of spurious correlations and Simpson’s paradox. The main reason for revising it to focus on two paradoxes is that, whereas most causality intros focus on the limitations of correlations, I feel that understanding when controlling for a variable is justified is something all analysts and machine learning practitioners should be aware of.

On September 5th 2024 I presented this content in a contributed talk at the Royal Statistical Society Annual Conference in Brighton, England (abstract link).

Unfortunately there is no recording, but there are recordings of previous talks of mine:

The slides are available at bit.ly/start-ask-why. Presenting this material for the first time at PyData Global 2021.
