Stay Ahead, Stay ONMINE

Introduction to Minimum Cost Flow Optimization in Python

Minimum cost flow optimization minimizes the cost of moving flow through a network of nodes and edges. Nodes include sources (supply) and sinks (demand), with different costs and capacity limits. The aim is to find the least costly way to move volume from sources to sinks while adhering to all capacity limitations. Applications Applications of […]

Minimum cost flow optimization minimizes the cost of moving flow through a network of nodes and edges. Nodes include sources (supply) and sinks (demand), with different costs and capacity limits. The aim is to find the least costly way to move volume from sources to sinks while adhering to all capacity limitations.

Applications

Applications of minimum cost flow optimization are vast and varied, spanning multiple industries and sectors. This approach is crucial in logistics and supply chain management, where it is used to minimize transportation costs while ensuring timely delivery of goods. In telecommunications, it helps in optimizing the routing of data through networks to reduce latency and improve bandwidth utilization. The energy sector leverages minimum cost flow optimization to efficiently distribute electricity through power grids, reducing losses and operational costs. Urban planning and infrastructure development also benefit from this optimization technique, as it assists in designing efficient public transportation systems and water distribution networks.

Example

Below is a simple flow optimization example:

The image above illustrates a minimum cost flow optimization problem with six nodes and eight edges. Nodes A and B serve as sources, each with a supply of 50 units, while nodes E and F act as sinks, each with a demand of 40 units. Every edge has a maximum capacity of 25 units, with variable costs indicated in the image. The objective of the optimization is to allocate flow on each edge to move the required units from nodes A and B to nodes E and F, respecting the edge capacities at the lowest possible cost.

Node F can only receive supply from node B. There are two paths: directly or through node D. The direct path has a cost of 2, while the indirect path via D has a combined cost of 3. Thus, 25 units (the maximum edge capacity) are moved directly from B to F. The remaining 15 units are routed via B -D-F to meet the demand.

Currently, 40 out of 50 units have been transferred from node B, leaving a remaining supply of 10 units that can be moved to node E. The available pathways for supplying node E include: A-E and B-E with a cost of 3, A-C-E with a cost of 4, and B-C-E with a cost of 5. Consequently, 25 units are transported from A-E (limited by the edge capacity) and 10 units from B-E (limited by the remaining supply at node B). To meet the demand of 40 units at node E, an additional 5 units are moved via A-C-E, resulting in no flow being allocated to the B-C pathway.

Mathematical formulation

I introduce two mathematical formulations of minimum cost flow optimization:

1. LP (linear program) with continuous variables only

2. MILP (mixed integer linear program) with continuous and discrete variables

I am using following definitions:

Definitions

LP formulation

This formulation only contains decision variables that are continuous, meaning they can have any value as long as all constraints are fulfilled. Decision variables are in this case the flow variables x(u, v) of all edges.

The objective function describes how the costs that are supposed to be minimized are calculated. In this case it is defined as the flow multiplied with the variable cost summed up over all edges:

Constraints are conditions that must be satisfied for the solution to be valid, ensuring that the flow does not exceed capacity limitations.

First, all flows must be non-negative and not exceed to edge capacities:

Flow conservation constraints ensure that the same amount of flow that goes into a node has to come out of the node. These constraints are applied to all nodes that are neither sources nor sinks:

For source and sink nodes the difference of out flow and in flow is smaller or equal the supply of the node:

If v is a source the difference of outflow minus inflow must not exceed the supply s(v). In case v is a sink node we do not allow that more than -s(v) can flow into the node than out of the node (for sinks s(v) is negative).

MILP

Additionally, to the continuous variables of the LP formulation, the MILP formulation also contains discreate variables that can only have specific values. Discrete variables allow to restrict the number of used nodes or edges to certain values. It can also be used to introduce fixed costs for using nodes or edges. In this article I show how to add fixed costs. It is important to note that adding discrete decision variables makes it much more difficult to find an optimal solution, hence this formulation should only be used if a LP formulation is not possible.

The objective function is defined as:

With three terms: variable cost of all edges, fixed cost of all edges, and fixed cost of all nodes.

The maximum flow that can be allocated to an edge depends on the edge’s capacity, the edge selection variable, and the origin node selection variable:

This equation ensures that flow can only be assigned to edges if the edge selection variable and the origin node selection variable are 1.

The flow conservation constraints are equivalent to the LP problem.

Implementation

In this section I explain how to implement a MILP optimization in Python. You can find the code in this repo.

Libraries

To build the flow network, I used NetworkX which is an excellent library (https://networkx.org/) for working with graphs. There are many interesting articles that demonstrate how powerful and easy to use NetworkX is to work with graphs, i.a. customizing NetworkX GraphsNetworkX: Code Demo for Manipulating SubgraphsSocial Network Analysis with NetworkX: A Gentle Introduction.

One important aspect when building an optimization is to make sure that the input is correctly defined. Even one small error can make the problem infeasible or can lead to an unexpected solution. To avoid this, I used Pydantic to validate the user input and raise any issues at the earliest possible stage. This article gives an easy to understand introduction to Pydantic.

To transform the defined network into a mathematical optimization problem I used PuLP. Which allows to define all variables and constraint in an intuitive way. This library also has the advantage that it can use many different solvers in a simple pug-and-play fashion. This article provides good introduction to this library.

Defining nodes and edges

The code below shows how nodes are defined:

from pydantic import BaseModel, model_validator
from typing import Optional

# node and edge definitions
class Node(BaseModel, frozen=True):
    """
    class of network node with attributes:
    name: str - name of node
    demand: float - demand of node (if node is sink)
    supply: float - supply of node (if node is source)
    capacity: float - maximum flow out of node
    type: str - type of node
    x: float - x-coordinate of node
    y: float - y-coordinate of node
    fixed_cost: float - cost of selecting node
    """
    name: str
    demand: Optional[float] = 0.0
    supply: Optional[float] = 0.0
    capacity: Optional[float] = float('inf')
    type: Optional[str] = None
    x: Optional[float] = 0.0
    y: Optional[float] = 0.0
    fixed_cost: Optional[float] = 0.0

    @model_validator(mode='after')
    def validate(self):
        """
        validate if node definition are correct
        """
        # check that demand is non-negative
        if self.demand < 0 or self.demand == float('inf'): raise ValueError('demand must be non-negative and finite')
        # check that supply is non-negative
        if self.supply < 0: raise ValueError('supply must be non-negative')
        # check that capacity is non-negative
        if self.capacity < 0: raise ValueError('capacity must be non-negative')
        # check that fixed_cost is non-negative
        if self.fixed_cost < 0: raise ValueError('fixed_cost must be non-negative')
        return self

Nodes are defined through the Node class which is inherited from Pydantic’s BaseModel. This enables an automatic validation that ensures that all properties are defined with the correct datatype whenever a new object is created. In this case only the name is a required input, all other properties are optional, if they are not provided the specified default value is assigned to them. By setting the “frozen” parameter to True I made all properties immutable, meaning they cannot be changed after the object has been initialized.

The validate method is executed after the object has been initialized and applies more checks to ensure the provided values are as expected. Specifically it checks that demand, supply, capacity, variable cost and fixed cost are not negative. Furthermore, it also does not allow infinite demand as this would lead to an infeasible optimization problem.

These checks look trivial, however their main benefit is that they will trigger an error at the earliest possible stage when an input is incorrect. Thus, they prevent creating a optimization model that is incorrect. Exploring why a model cannot be solved would be much more time consuming as there are many factors that would need to be analyzed, while such “trivial” input error may not be the first aspect to investigate.

Edges are implemented as follows:

class Edge(BaseModel, frozen=True):
"""
class of edge between two nodes with attributes:
origin: 'Node' - origin node of edge
destination: 'Node' - destination node of edge
capacity: float - maximum flow through edge
variable_cost: float - cost per unit flow through edge
fixed_cost: float - cost of selecting edge
"""
origin: Node
destination: Node
capacity: Optional[float] = float('inf')
variable_cost: Optional[float] = 0.0
fixed_cost: Optional[float] = 0.0

@model_validator(mode='after')
def validate(self):
"""
validate of edge definition is correct
"""
# check that node names are different
if self.origin.name == self.destination.name: raise ValueError('origin and destination names must be different')
# check that capacity is non-negative
if self.capacity < 0: raise ValueError('capacity must be non-negative')
# check that variable_cost is non-negative
if self.variable_cost < 0: raise ValueError('variable_cost must be non-negative')
# check that fixed_cost is non-negative
if self.fixed_cost < 0: raise ValueError('fixed_cost must be non-negative')
return self

The required inputs are an origin node and a destination node object. Additionally, capacity, variable cost and fixed cost can be provided. The default value for capacity is infinity which means if no capacity value is provided it is assumed the edge does not have a capacity limitation. The validation ensures that the provided values are non-negative and that origin node name and the destination node name are different.

Initialization of flowgraph object

To define the flowgraph and optimize the flow I created a new class called FlowGraph that is inherited from NetworkX’s DiGraph class. By doing this I can add my own methods that are specific to the flow optimization and at the same time use all methods DiGraph provides:

from networkx import DiGraph
from pulp import LpProblem, LpVariable, LpMinimize, LpStatus

class FlowGraph(DiGraph):
    """
    class to define and solve minimum cost flow problems
    """
    def __init__(self, nodes=[], edges=[]):
        """
        initialize FlowGraph object
        :param nodes: list of nodes
        :param edges: list of edges
        """
        # initialialize digraph
        super().__init__(None)

        # add nodes and edges
        for node in nodes: self.add_node(node)
        for edge in edges: self.add_edge(edge)


    def add_node(self, node):
        """
        add node to graph
        :param node: Node object
        """
        # check if node is a Node object
        if not isinstance(node, Node): raise ValueError('node must be a Node object')
        # add node to graph
        super().add_node(node.name, demand=node.demand, supply=node.supply, capacity=node.capacity, type=node.type, 
                         fixed_cost=node.fixed_cost, x=node.x, y=node.y)
        
    
    def add_edge(self, edge):    
        """
        add edge to graph
        @param edge: Edge object
        """   
        # check if edge is an Edge object
        if not isinstance(edge, Edge): raise ValueError('edge must be an Edge object')
        # check if nodes exist
        if not edge.origin.name in super().nodes: self.add_node(edge.origin)
        if not edge.destination.name in super().nodes: self.add_node(edge.destination)

        # add edge to graph
        super().add_edge(edge.origin.name, edge.destination.name, capacity=edge.capacity, 
                         variable_cost=edge.variable_cost, fixed_cost=edge.fixed_cost)

FlowGraph is initialized by providing a list of nodes and edges. The first step is to initialize the parent class as an empty graph. Next, nodes and edges are added via the methods add_node and add_edge. These methods first check if the provided element is a Node or Edge object. If this is not the case an error will be raised. This ensures that all elements added to the graph have passed the validation of the previous section. Next, the values of these objects are added to the Digraph object. Note that the Digraph class also uses add_node and add_edge methods to do so. By using the same method name I am overwriting these methods to ensure that whenever a new element is added to the graph it must be added through the FlowGraph methods which validate the object type. Thus, it is not possible to build a graph with any element that has not passed the validation tests.

Initializing the optimization problem

The method below converts the network into an optimization model, solves it, and retrieves the optimized values.

  def min_cost_flow(self, verbose=True):
        """
        run minimum cost flow optimization
        @param verbose: bool - print optimization status (default: True)
        @return: status of optimization
        """
        self.verbose = verbose

        # get maximum flow
        self.max_flow = sum(node['demand'] for _, node in super().nodes.data() if node['demand'] > 0)

        start_time = time.time()
        # create LP problem
        self.prob = LpProblem("FlowGraph.min_cost_flow", LpMinimize)
        # assign decision variables
        self._assign_decision_variables()
        # assign objective function
        self._assign_objective_function()
        # assign constraints
        self._assign_constraints()
        if self.verbose: print(f"Model creation time: {time.time() - start_time:.2f} s")

        start_time = time.time()
        # solve LP problem
        self.prob.solve()
        solve_time = time.time() - start_time

        # get status
        status = LpStatus[self.prob.status]

        if verbose:
            # print optimization status
            if status == 'Optimal':
                # get objective value
                objective = self.prob.objective.value()
                print(f"Optimal solution found: {objective:.2f} in {solve_time:.2f} s")
            else:
                print(f"Optimization status: {status} in {solve_time:.2f} s")
        
        # assign variable values
        self._assign_variable_values(status=='Optimal')

        return status

Pulp’s LpProblem is initialized, the constant LpMinimize defines it as a minimization problem — meaning it is supposed to minimize the value of the objective function. In the following lines all decision variables are initialized, the objective function as well as all constraints are defined. These methods will be explained in the following sections.

Next, the problem is solved, in this step the optimal value of all decision variables is determined. Following the status of the optimization is retrieved. When the status is “Optimal” an optimal solution could be found other statuses are “Infeasible” (it is not possible to fulfill all constraints), “Unbounded” (the objective function can have an arbitrary low values), and “Undefined” meaning the problem definition is not complete. In case no optimal solution was found the problem definition needs to be reviewed.

Finally, the optimized values of all variables are retrieved and assigned to the respective nodes and edges.

Defining decision variables

All decision variables are initialized in the method below:

   def _assign_variable_values(self, opt_found):
        """
        assign decision variable values if optimal solution found, otherwise set to None
        @param opt_found: bool - if optimal solution was found
        """
        # assign edge values        
        for _, _, edge in super().edges.data():
            # initialize values
            edge['flow'] = None
            edge['selected'] = None
            # check if optimal solution found
            if opt_found and edge['flow_var'] is not None:                    
                edge['flow'] = edge['flow_var'].varValue                    

                if edge['selection_var'] is not None: 
                    edge['selected'] = edge['selection_var'].varValue

        # assign node values
        for _, node in super().nodes.data():
            # initialize values
            node['selected'] = None
            if opt_found:                
                # check if node has selection variable
                if node['selection_var'] is not None: 
                    node['selected'] = node['selection_var'].varValue

First it iterates through all edges and assigns continuous decision variables if the edge capacity is greater than 0. Furthermore, if fixed costs of the edge are greater than 0 a binary decision variable is defined as well. Next, it iterates through all nodes and assigns binary decision variables to nodes with fixed costs. The total number of continuous and binary decision variables is counted and printed at the end of the method.

Defining objective

After all decision variables have been initialized the objective function can be defined:

    def _assign_objective_function(self):
        """
        define objective function
        """
        objective = 0
 
        # add edge costs
        for _, _, edge in super().edges.data():
            if edge['selection_var'] is not None: objective += edge['selection_var'] * edge['fixed_cost']
            if edge['flow_var'] is not None: objective += edge['flow_var'] * edge['variable_cost']
        
        # add node costs
        for _, node in super().nodes.data():
            # add node selection costs
            if node['selection_var'] is not None: objective += node['selection_var'] * node['fixed_cost']

        self.prob += objective, 'Objective',

The objective is initialized as 0. Then for each edge fixed costs are added if the edge has a selection variable, and variable costs are added if the edge has a flow variable. For all nodes with selection variables fixed costs are added to the objective as well. At the end of the method the objective is added to the LP object.

Defining constraints

All constraints are defined in the method below:

  def _assign_constraints(self):
        """
        define constraints
        """
        # count of contraints
        constr_count = 0
        # add capacity constraints for edges with fixed costs
        for origin_name, destination_name, edge in super().edges.data():
            # get capacity
            capacity = edge['capacity'] if edge['capacity'] < float('inf') else self.max_flow
            rhs = capacity
            if edge['selection_var'] is not None: rhs *= edge['selection_var']
            self.prob += edge['flow_var'] <= rhs, f"capacity_{origin_name}-{destination_name}",
            constr_count += 1
            
            # get origin node
            origin_node = super().nodes[origin_name]
            # check if origin node has a selection variable
            if origin_node['selection_var'] is not None:
                rhs = capacity * origin_node['selection_var'] 
                self.prob += (edge['flow_var'] <= rhs, f"node_selection_{origin_name}-{destination_name}",)
                constr_count += 1

        total_demand = total_supply = 0
        # add flow conservation constraints
        for node_name, node in super().nodes.data():
            # aggregate in and out flows
            in_flow = 0
            for _, _, edge in super().in_edges(node_name, data=True):
                if edge['flow_var'] is not None: in_flow += edge['flow_var']
            
            out_flow = 0
            for _, _, edge in super().out_edges(node_name, data=True):
                if edge['flow_var'] is not None: out_flow += edge['flow_var']

            # add node capacity contraint
            if node['capacity'] < float('inf'):
                self.prob += out_flow = demand - supply
                rhs = node['demand'] - node['supply']
                self.prob += in_flow - out_flow >= rhs, f"flow_balance_{node_name}",
            constr_count += 1

            # update total demand and supply
            total_demand += node['demand']
            total_supply += node['supply']

        if self.verbose:
            print(f"Constraints: {constr_count}")
            print(f"Total supply: {total_supply}, Total demand: {total_demand}")

First, capacity constraints are defined for each edge. If the edge has a selection variable the capacity is multiplied with this variable. In case there is no capacity limitation (capacity is set to infinity) but there is a selection variable, the selection variable is multiplied with the maximum flow that has been calculated by aggregating the demand of all nodes. An additional constraint is added in case the edge’s origin node has a selection variable. This constraint means that flow can only come out of this node if the selection variable is set to 1.

Following, the flow conservation constraints for all nodes are defined. To do so the total in and outflow of the node is calculated. Getting all in and outgoing edges can easily be done by using the in_edges and out_edges methods of the DiGraph class. If the node has a capacity limitation the maximum outflow will be constraint by that value. For the flow conservation it is necessary to check if the node is either a source or sink node or a transshipment node (demand equals supply). In the first case the difference between inflow and outflow must be greater or equal the difference between demand and supply while in the latter case in and outflow must be equal.

The total number of constraints is counted and printed at the end of the method.

Retrieving optimized values

After running the optimization, the optimized variable values can be retrieved with the following method:

    def _assign_variable_values(self, opt_found):
        """
        assign decision variable values if optimal solution found, otherwise set to None
        @param opt_found: bool - if optimal solution was found
        """
        # assign edge values        
        for _, _, edge in super().edges.data():
            # initialize values
            edge['flow'] = None
            edge['selected'] = None
            # check if optimal solution found
            if opt_found and edge['flow_var'] is not None:                    
                edge['flow'] = edge['flow_var'].varValue                    

                if edge['selection_var'] is not None: 
                    edge['selected'] = edge['selection_var'].varValue

        # assign node values
        for _, node in super().nodes.data():
            # initialize values
            node['selected'] = None
            if opt_found:                
                # check if node has selection variable
                if node['selection_var'] is not None: 
                    node['selected'] = node['selection_var'].varValue 

This method iterates through all edges and nodes, checks if decision variables have been assigned and adds the decision variable value via varValue to the respective edge or node.

Demo

To demonstrate how to apply the flow optimization I created a supply chain network consisting of 2 factories, 4 distribution centers (DC), and 15 markets. All goods produced by the factories have to flow through one distribution center until they can be delivered to the markets.

Supply chain problem

Node properties were defined:

Node definitions

Ranges mean that uniformly distributed random numbers were generated to assign these properties. Since Factories and DCs have fixed costs the optimization also needs to decide which of these entities should be selected.

Edges are generated between all Factories and DCs, as well as all DCs and Markets. The variable cost of edges is calculated as the Euclidian distance between origin and destination node. Capacities of edges from Factories to DCs are set to 350 while from DCs to Markets are set to 100.

The code below shows how the network is defined and how the optimization is run:

# Define nodes
factories = [Node(name=f'Factory {i}', supply=700, type='Factory', fixed_cost=100, x=random.uniform(0, 2),
                  y=random.uniform(0, 1)) for i in range(2)]
dcs = [Node(name=f'DC {i}', fixed_cost=25, capacity=500, type='DC', x=random.uniform(0, 2), 
            y=random.uniform(0, 1)) for i in range(4)]
markets = [Node(name=f'Market {i}', demand=random.randint(1, 100), type='Market', x=random.uniform(0, 2), 
                y=random.uniform(0, 1)) for i in range(15)]

# Define edges
edges = []
# Factories to DCs
for factory in factories:
    for dc in dcs:
        distance = ((factory.x - dc.x)**2 + (factory.y - dc.y)**2)**0.5
        edges.append(Edge(origin=factory, destination=dc, capacity=350, variable_cost=distance))

# DCs to Markets
for dc in dcs:
    for market in markets:
        distance = ((dc.x - market.x)**2 + (dc.y - market.y)**2)**0.5
        edges.append(Edge(origin=dc, destination=market, capacity=100, variable_cost=distance))

# Create FlowGraph
G = FlowGraph(edges=edges)

G.min_cost_flow()

The output of flow optimization is as follows:

Variable types: 68 continuous, 6 binary
Constraints: 161
Total supply: 1400.0, Total demand: 909.0
Model creation time: 0.00 s
Optimal solution found: 1334.88 in 0.23 s

The problem consists of 68 continuous variables which are the edges’ flow variables and 6 binary decision variables which are the selection variables of the Factories and DCs. There are 161 constraints in total which consist of edge and node capacity constraints, node selection constraints (edges can only have flow if the origin node is selected), and flow conservation constraints. The next line shows that the total supply is 1400 which is higher than the total demand of 909 (if the demand was higher than the supply the problem would be infeasible). Since this is a small optimization problem, the time to define the optimization model was less than 0.01 seconds. The last line shows that an optimal solution with an objective value of 1335 could be found in 0.23 seconds.

Additionally, to the code I described in this post I also added two methods that visualize the optimized solution. The code of these methods can also be found in the repo.

Flow graph

All nodes are located by their respective x and y coordinates. The node and edge size is relative to the total volume that is flowing through. The edge color refers to its utilization (flow over capacity). Dashed lines show edges without flow allocation.

In the optimal solution both Factories were selected which is inevitable as the maximum supply of one Factory is 700 and the total demand is 909. However, only 3 of the 4 DCs are used (DC 0 has not been selected).

In general the plot shows the Factories are supplying the nearest DCs and DCs the nearest Markets. However, there are a few exceptions to this observation: Factory 0 also supplies DC 3 although Factory 1 is nearer. This is due to the capacity constraints of the edges which only allow to move at most 350 units per edge. However, the closest Markets to DC 3 have a slightly higher demand, hence Factory 0 is moving additional units to DC 3 to meet that demand. Although Market 9 is closest to DC 3 it is supplied by DC 2. This is because DC 3 would require an additional supply from Factory 0 to supply this market and since the total distance from Factory 0 over DC 3 is longer than the distance from Factory 0 through DC 2, Market 9 is supplied via the latter route.

Another way to visualize the results is via a Sankey diagram which focuses on visualizing the flows of the edges:

Sankey flow diagram

The colors represent the edges’ utilizations with lowest utilizations in green changing to yellow and red for the highest utilizations. This diagram shows very well how much flow goes through each node and edge. It highlights the flow from Factory 0 to DC 3 and also that Market 13 is supplied by DC 2 and DC 1.

Summary

Minimum cost flow optimizations can be a very helpful tool in many domains like logistics, transportation, telecommunication, energy sector and many more. To apply this optimization it is important to translate a physical system into a mathematical graph consisting of nodes and edges. This should be done in a way to have as few discrete (e.g. binary) decision variables as necessary as those make it significantly more difficult to find an optimal solution. By combining Python’s NetworkX, Pulp and Pydantic libraries I built an flow optimization class that is intuitive to initialize and at the same time follows a generalized formulation which allows to apply it in many different use cases. Graph and flow diagrams are very helpful to understand the solution found by the optimizer.

If not otherwise stated all images were created by the author.

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

Kyndryl: AI success hinges on workforce readiness

Nearly 80% of respondents said the pace of AI adoption is likely to outstrip their organization’s ability to adapt its workforce, governance structures, and operating model. As Kyndryl notes in the report, most leaders believe addressing those challenges “will prove more arduous than those involving code and compute.” Organizations are

Read More »

Netgear brings AI-driven network management to SMEs and MSPs

AI-powered operations Contextual insights to help identify issues faster Proactive recommendations for troubleshooting and optimization AI-assisted workflows designed to reduce manual effort Support for more predictive network operations Unified visibility and network intelligence Centralized visibility into network performance Monitoring of device health, connectivity, and user experience Actionable intelligence derived from

Read More »

Aggressive federal PQE timeline prompts warnings for enterprises

“The U.S. government’s EOs will likely spur accelerated intervention from all major governments and regional political blocs,” the firm said. “CISOs should be prepared for regulations to conflict and contain sovereignty requirements, which will complicate compliance.” Gartner recommends that companies build a PQC inventory and remediation program in 2026 and

Read More »

Presidential order addresses quantum computing gaps

By comparison, in AI, there are a number of benchmarks comparing AI models on everything from how well they do at math, to their ability to generate images, to whether they can find bugs in computer code. Benchmarks are important so that people can do fair comparisons of quantum systems,

Read More »

U.S. Department of Energy Meets President Trump’s Goal, Delivers Third Advanced Reactor Criticality

WASHINGTON—As part of the U.S. Department of Energy (DOE) Nuclear Energy Launch Pad initiative, Deployable Energy’s demonstration reactor, Unity, successfully completed a zero-power fueled criticality demonstration at Idaho National Laboratory. Unity, which achieved criticality late yesterday, is the third DOE-authorized advanced reactor to go critical by the July 4th deadline set by President Trump in his May 2025 executive order. This criticality marks DOE’s fulfillment of a precedent-setting directive to reignite nuclear energy innovation in the United States. Earlier this month, Antares Nuclear’s Mark-0 and Valar Atomics’ Ward 250 reactors achieved criticality under DOE’s Reactor Pilot Program, making the United States the first country in history to achieve criticality in three unique advanced microreactor designs in a single month. “Last week, I had the opportunity to see the Unity demonstration reactor firsthand and meet with the talented teams from Deployable Energy, INL and DOE whose work made this historic moment possible on the eve of our nation’s 250th anniversary,” Secretary of Energy Chris Wright said. “America’s nuclear renaissance is underway because of President Trump’s bold vision and ambitious goals. Yesterday, we accomplished a significant milestone on a timeline many thought was unachievable. Advanced nuclear technologies like Unity will help power the next generation of American industry, strengthen our energy security, and ensure the United States remains the world’s nuclear innovation leader.” Deployable Energy completed the Unity criticality experiment under the Nuclear Energy Launch Pad initiative, managed by the National Reactor Innovation Center at Idaho National Laboratory. The next evolution of the Reactor Pilot Program, Nuclear Energy Launch Pad leverages DOE authorization to expeditiously certify and construct first-of-a-kind advanced nuclear technologies for demonstration. “We are proud to be a part of this historic achievement and I want to express Deployable Energy’s gratitude to the administration for setting an audacious goal to

Read More »

Energy Secretary Secures Mid-Atlantic Grid Ahead of Period of Hot Weather

WASHINGTON—The U.S. Department of Energy (DOE) today issued two emergency orders to mitigate blackout risks in the Mid-Atlantic ahead of the region’s predicted record-breaking peak loads brought on by the forecasted hot weather conditions. The first order directs PJM Interconnection, LLC (PJM) to dispatch specified units and to order their operation as needed to maintain reliability. The second order authorizes PJM, in collaboration with its Transmission Owners and Electric Distribution Companies, to direct backup generation resources to operate as a last resort before declaring an Energy Emergency Alert (EEA) 3 or during an EEA 3. The orders were issued pursuant to applications from PJM submitted on June 27 and 29, 2026. “Maintaining affordable, reliable, and secure power in the PJM service territory is non-negotiable,” said U.S. Secretary of Energy Chris Wright. “The previous administration’s energy subtraction policies weakened the grid, leaving Americans more vulnerable during events like this. Thanks to President Trump’s leadership, we are reversing those failures and using every available tool ensuring Americans in the Mid-Atlantic have continued access to affordable, reliable, and secure energy to power and cool their homes.” DOE estimates more than 35 GW of unused backup generation remains available nationwide. On day one, President Trump declared a national energy emergency after the Biden administration’s energy subtraction agenda left behind a grid increasingly vulnerable to blackouts. According to the North American Electric Reliability Corporation’s (NERC) 2026 Summer Reliability Assessment, the peak electricity demand in PJM occurs during the summer season. It further notes that “if extreme high temperatures are experienced, PJM anticipates the need for demand-response resources to help reduce load.” Power outages cost the American people $44 billion per year, according to data from DOE’s National Laboratories. These orders will mitigate the possibility of power outages in the Mid-Atlantic and highlight the commonsense policies of the Trump Administration to ensure Americans have access to

Read More »

Equinor to invest in additional Troll development to boost European gas supply

Equinor Energy AS and partners will invest more than 4 billion krone ($400 million) in a new subsea development to increase gas production from Troll field in the North Sea. The Troll West Increased gas recovery North (TWIN) expansion—the third step of Troll Phase 3, which produces gas from the Troll West reservoir—could come online as early as 2028, said Gunnar Nakken, Equinor’s senior vice-president for projects and subsea Norway. TWIN is expected to contribute around 11 billion standard cu m of gas. “By simplifying, increasing standardization and reusing existing infrastructure and equipment, we are reducing costs and enabling faster production,” he said. Equinor aims to produce 1.3 million b/d from the Norwegian Continental Shelf (NCS) in 2035 to meet a portion of Europe’s energy needs. Troll field contains about 40% of NCS total gas reserves, with gas from Troll meeting around 10% of Europe’s gas needs. The TWIN project consists of two wells in a template and a pipeline connected to existing subsea infrastructure. The umbilical and MEG line will be extended to the new development. The second step of Troll Phase 3 is expected to come online this year, continuing production from Troll A platform, 80 km northwest of Bergen, Norway, and the Gassco-operated Kollsnes processing plant towards 2030, Equinor said. Equinor is operator of the project with 30.55% interest. Partners are Petoro AS (55.93%), A/S Norske Shell (8.19%), TotalEnergies EP Norge AS (3.69%), and ConocoPhillips Skandinavia AS (1.64%).

Read More »

Zululand Energy Terminal invites EPC expressions of interest

The proposed 7.5-million tonne/year (tpy) Zululand Energy Terminal (ZET) at the Port of Richards Bay, South Africa, has invited expressions of interest (EOI) from engineering, procurement and construction (EPC) contractors for development of planned LNG regasification infrastructure. Imported natural gas is expected to supply both industry and power generation. Phase 1 of the project will use a 170,000-cu m floating storage unit attached to 3 million tpy of onshore regasification capacity. Phase 2 will add 220,000 cu m of onshore storage (potentially replacing the FSU) and 4.5 million tpy of regasification.  ZET hopes to complete detailed engineering during 2027 to reach final investment decision in 2028 and start operations in 2030. Reuters reported last week that ExxonMobil Corp. had signed a preliminary deal to supply LNG to ZET. Developed as a joint between Vopak Terminal Durban and Transnet Pipelines, ZET project is expected to be South Africa’s first LNG terminal. The consortium will design, develop, construct, finance, operate, and maintain the terminal in the South Dunes Precinct at the Port of Richards Bay over a 25-year concession. EPC execution will be subject to ZET’s localization and economic development objectives. Successful contractors will be expected to support local supplier participation, skills development, and the use of local labor. Qualifying parties will be included in the project’s vendor database and may be shortlisted for subsequent phases as potential preferred contractors or subcontractors. The EOI submission window closes July 9, 2026. Interested contractors are invited to access the full EOI documentation here. South African utility Eskom and ZET earlier this month signed a head of agreement (HOA) establishing the framework for a long-term strategic partnership to support South Africa’s gas-to-power program, underpinning a planned 3-Gw power plant near the terminal in KwaZulu-Natal. Vopak Terminal Durban is owned by Royal Vopak and Reatile Group

Read More »

Petrobras greenlights renewables plant for RPBC refinery

REDUC’s fist soybean oil-based SAF sale Announcement of FID on the RPBC renewables plant followed Petrobras’ June 17 confirmation that its 239,000-b/d Duque de Caxias (REDUC) refinery in the Baixada Fluminense area of Rio de Janeiro had completed first production and sale of a first 3,800-cu m batch of SAF made from soybean oil certified under the CORSIA low Land Use Change (ILUC) risk standard, which verifies sustainability criteria and a lower risk of impact on new land areas. Produced via co-processing and featuring 1% renewable content, the SAF batch marked “commercialization of the world’s first SAF made from certified low-ILUC-risk soy [to demonstrate] Petrobras’s commitment to sustainability, the energy transition, and the development of products aligned with market and societal demands [for lower-carbon solutions],” said Angélica Laureano, Petrobras’ director of logistics, sales, and markets. In October 2025, the REDUC refinery secured Brazil’s first international approval to advance commercial-scale production of SAF via the hydroprocessed esters and fatty acids (HEFA) co-processing route complying with ISCC System GmbH’s International Sustainability Carbon Certification (ISCC) standards, validating that SAF produced at the site meets the highest international sustainability and lifecycle carbon emission standards. Developed under ICAO’s CORSIA, the ISCC CORSIA certification was a prerequisite for commercial-scale SAF production following rigorous assessment of the production’s lifecycle carbon emissions and traceability. Equipped to produce as much as 10,000 b/d of SAF using a blend of conventional petroleum and up to 1.2% renewable feedstock, REDUC’s integration of bio-based oils—such as vegetable oil—into existing refining infrastructure via the HEFA co-processing method allows the refinery to produce SAF alongside conventional jet fuel with minimal investment, Petrobras previously said.

Read More »

Equinor to expand Troll with TWIN subsea development

Equinor Energy AS and partners will invest about NOK 4 billion ($410 million) in the new Troll West increased gas recovery north (TWIN) subsea development in Troll field in the North Sea. The TWIN project consists of two wells in a template and a pipeline connected to existing subsea infrastructure. The umbilical and monoethylene glycol line will be extended to the new development. The project is expected to contribute about 11 billion std cu m of gas to Troll. It is the third step of Troll Phase 3, which produces gas from the Troll West reservoir. Recoverable reserves from Troll Phase 3, mainly gas, are estimated at 2.2 billion boe. In accordance with the Petroleum Act, the partnership will now send an announcement to the Ministry of Energy concerning the development. An environmental impact assessment has been carried out. Troll, which supplies as much as 10% of Europe’s daily demand for gas, contains about 40% of the total gas reserves on the Norwegian continental shelf and was developed in phases, with gas extraction from Troll Øst in Phase 1 and oil from Troll West in Phase 2. The oil in Troll West is produced from multiple subsea templates tied into Troll B and Troll C via pipelines. Production from the Troll C installation started in 1999. Troll C is also used for production from Fram, Fram H-Nord, and Byrding. Several amended development plans were approved in connection with installing multiple subsea templates on Troll West. Equinor Energy AS is operator of TWIN (30.55%) with partners Petoro AS (55.93%), A/S Norske Shell (8.19%), TotalEnergies EP Norge AS (3.69%), and ConocoPhillips Skandinavia AS (1.64%).

Read More »

Executive Roundtable: The Rise of Integrated Infrastructure

Steve Altizer, Compu Dynamics: Integration has to be foundational. It has to start at the first planning conversation, not after the equipment is selected or once the building is already designed. In previous generations of data center development, mechanical, electrical, IT, and operations teams could often work in parallel and bring the pieces together later. That worked when the load profile was more predictable and the facility had more room to absorb change. Before the introduction of ChatGPT, there was very little change to absorb. AI removes that tolerance. A change in rack density can affect electrical distribution, structural requirements, thermal strategy, commissioning, service access, and the way the site is operated. These are no longer independent decisions. They are all part of one performance system. As AI systems move toward POD-scale platforms, the boundary between IT and facility infrastructure becomes much harder to separate. The challenge is that AI workloads are too varied for a one-size-fits-all approach. Training clusters, inference nodes, enterprise AI environments, and edge sites can all have different requirements for density, cooling architecture, network connectivity, security, site conditions, and serviceability. That is why many companies are adopting a modular approach, while others are embracing hybrid models where turnkey modular AI capacity is integrated into larger campus environments.  At the campus level, that means standardizing the backbone infrastructure that serves the site (utility power feeds, central cooling capacity, and network pathways), while allowing the IT environment and the integrated critical infrastructure components to evolve as workload requirements change. The goal is not modularity for its own sake. The goal is to support the next generation of AI deployments without forcing every hardware change to become a major redesign. AI infrastructure cannot be planned as a collection of disparate systems. It has to be designed as one coordinated

Read More »

Data Center Insights 2026 Brings Industry Leaders Together for a Two-Day Look at the AI Infrastructure Era

The data center industry has never been more visible, more vital, or more challenged. Support for AI and its overall industry impact has pushed digital infrastructure into the public conversation. It has become clear that the sector is confronting unprecedented demands for everything from power to basic infrastructure. That convergence is the focus of Data Center Insights 2026, a two-day virtual event taking place July 15–16, 2026, produced by Endeavor B2B’s Data Center Frontier, Cabling Installation & Maintenance, ISE, Lightwave, and SecurityInfoWatch. Designed for data center owners, operators, engineers, IT leaders, and the people supporting the next generation of data center development, the event offers a concentrated look at the technologies and strategies shaping the future of digital infrastructure. The program arrives at a crucial moment. AI workloads are changing almost every assumption behind data center design. Rack densities are rising, liquid cooling is becoming mainstream, and fiber networks are being rethought for 400G and beyond. Power constraints are now central to site selection. Security is becoming highlighted and operators are being asked to build faster, scale larger, be more resource efficient and maintain resilience in an environment where downtime carries higher consequences than ever. Data Center Insights 2026 is structured to help attendees make sense of this moment. Rather than treating data center infrastructure as a set of separate disciplines, the event brings together experts across cooling, cabling, fiber, power distribution, modular design, AI infrastructure, and operational strategy. The result is a practical, cross-functional program built around the real-world questions now facing the industry. What will I learn at this event? The event opens with “Expert Roundup: The State of the Data Center Industry,” featuring perspectives from Steven Carlini of Schneider Electric.This session sets the stage by examining the forces driving change across the data center landscape in 2026.

Read More »

Executive Roundtable: Scaling Beyond the Prototype Phase

Steve Altizer, Compu Dynamics: The defining challenge is keeping pace with the rate of change in the IT environment. It takes time to design, permit, build, and commission a data center. AI hardware operates on a completely different timeline. New GPU families are being introduced every 12 to 18 months, and from one generation to the next, rack power densities can double or even triple. At prototype scale, you can design around a single cluster or a specific density profile. At production scale, that approach becomes a real liability. The facility has to support today’s deployment while remaining adaptable for the next compute profile. We are not just talking about adding more power. We are preparing for major architectural shifts, including the move toward DC power delivery or cooling systems that may rely on two-phase liquid to remove heat at scale. That is what becomes materially harder. You are no longer solving for a single, static deployment. You are solving for a moving target inside a live operating environment. This is where strategic modularity proves its value. It helps decouple the lifecycle of the building from the lifecycle of the IT hardware. Instead of treating the data center as one monolithic design, modularity creates a more agile framework that can absorb new power and cooling architectures without requiring a full facility retrofit every time the IT roadmap shifts. At Compu Dynamics Modular, we are seeing this play out in real time. The value of a turnkey modular approach is not simply speed. It is the agility owners need to keep pace with ever-evolving rack densities, power delivery requirements, and cooling architectures.

Read More »

Q2 Executive Roundtable Recap

Matt Vincent is Editor in Chief of Data Center Frontier, where he leads editorial strategy and coverage focused on the infrastructure powering cloud computing, artificial intelligence, and the digital economy. A veteran B2B technology journalist with more than two decades of experience, Vincent specializes in the intersection of data centers, power, cooling, and emerging AI-era infrastructure. Since assuming the EIC role in 2023, he has helped guide Data Center Frontier’s coverage of the industry’s transition into the gigawatt-scale AI era, with a focus on hyperscale development, behind-the-meter power strategies, liquid cooling architectures, and the evolving energy demands of high-density compute, while working closely with the Digital Infrastructure Group at Endeavor Business Media to expand the brand’s analytical and multimedia footprint. Vincent also hosts The Data Center Frontier Show podcast, where he interviews industry leaders across hyperscale, colocation, utilities, and the data center supply chain to examine the technologies and business models reshaping digital infrastructure. Since its inception he serves as Head of Content for the Data Center Frontier Trends Summit. Before becoming Editor in Chief, he served in multiple senior editorial roles across Endeavor Business Media’s digital infrastructure portfolio, with coverage spanning data centers and hyperscale infrastructure, structured cabling and networking, telecom and datacom, IP physical security, and wireless and Pro AV markets. He began his career in 2005 within PennWell’s Advanced Technology Division and later held senior editorial positions supporting brands such as Cabling Installation & Maintenance, Lightwave Online, Broadband Technology Report, and Smart Buildings Technology. Vincent is a frequent moderator, interviewer, and keynote speaker at industry events including the HPC Forum, where he delivers forward-looking analysis on how AI and high-performance computing are reshaping digital infrastructure. He graduated with honors from Indiana University Bloomington with a B.A. in English Literature and Creative Writing and lives in southern New Hampshire with

Read More »

Emergence Water and Nimbus: Water Joins Power as AI Infrastructure’s Next Critical Constraint

For much of the past decade, the conversation surrounding AI infrastructure has been dominated by one resource above all others: power. Utilities have become strategic partners. Natural gas generation, small modular reactors, microgrids and behind-the-meter power have become central themes across virtually every major data center conference. Developers increasingly speak about securing megawatts years before they discuss servers. But another infrastructure constraint is quietly following the same trajectory: Water. According to executives from Emergence Water and Nimbus Advanced Process Cooling Systems, water is rapidly evolving beyond its traditional role as a sustainability metric and becoming one of the primary determinants of where AI campuses can be built, how they are cooled, and how efficiently they will operate over the coming decade. Speaking with Data Center Frontier Editor in Chief Matt Vincent on the latest DCF Show podcast, Emergence Water Chief Product Officer Leif Percifield and Nimbus Technical Director Vamsi Mokkapati described an industry where water has effectively joined power and fiber as foundational infrastructure for AI development. “From a community perspective, water is absolutely the number one priority about where and why a data center gets built,” Percifield said. “From the developer, it’s pretty binary. They either have water available to them—or they don’t.” Water Is Becoming a Site Selection Constraint The shift reflects the changing realities of AI infrastructure. Traditional enterprise data centers often viewed water primarily through sustainability reporting or Power Usage Effectiveness (PUE) discussions. AI facilities operating at unprecedented rack densities have fundamentally altered that equation. Liquid cooling, hybrid cooling architectures and increasingly sophisticated thermal management strategies all place new emphasis on reliable long-term water availability. Equally important, communities are beginning to scrutinize water usage with the same intensity previously reserved for electrical demand. Percifield says those conversations are increasingly determining whether projects move forward at all.

Read More »

U.S. Open powers up AI-ready network in challenging environment

Environmental conditions add another layer of complexity. Anthony Santora, managing director of IT for the USGA, describes the championship network as a data center without the usual comforts. There’s dust, rain, wind, and wide temperature swings instead of clean, controlled air. Hardware resides in trailers and weatherproof enclosures, not in racks behind raised floor tiles. For network engineers who spend most of their time on office campuses and in colos, that’s an important reminder: Critical infrastructure increasingly sits in places that look nothing like a traditional wiring closet. User behavior is just as hostile. The U.S. Open has its own term — the “Tiger effect” (though one could argue it’s now the Scottie effect) — for what happens when tens of thousands of fans follow a single golfer. The hot spot moves with the group, and the RF design must cope with a dense, moving cluster of devices. That pattern should sound familiar to anyone who supports large conferences or festivals; it’s the same phenomenon, just under a different name. Building an AI‑ready, fault‑tolerant course network Cisco’s answer to this environment is a fully redundant, mobile core design. Instead of a single large core in a building, the network collapses into dual trailers that serve as cores on the go, typically anchored at the NBC broadcast compound and another central location. Each core hosts Cisco Secure Firewall appliances, FMCs, core Catalyst switches, DHCP, UPS, and generators, all in pairs. Rodriguez was matter-of-fact about the philosophy: “We do everything in pairs as much as we can.” If one fails, its twin picks up the load.

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »