Stay Ahead, Stay ONMINE

Russia Watchdog Halts Oil Loadings at One Berth in Novorossiysk

Russia’s transportation watchdog has ordered a 90-day halt to oil-loading operations at berth 8 of the Black Sea port of Novorossiysk, according to state infrastructure operator Transneft PJSC. The order came as security checks at the port identified some violations, Transneft, which controls the loading facilities, said in a statement on Wednesday. Tanker tracking by […]

Russia’s transportation watchdog has ordered a 90-day halt to oil-loading operations at berth 8 of the Black Sea port of Novorossiysk, according to state infrastructure operator Transneft PJSC.

The order came as security checks at the port identified some violations, Transneft, which controls the loading facilities, said in a statement on Wednesday. Tanker tracking by Bloomberg suggests that there will be no impact on crude oil flows and minimal impact on fuel flows from the step.

The oil terminal has until June 30 to rectify the violations.

The Novorossiysk port is a key export route for Russian crude oil and petroleum products and any significant disruptions to loadings may affect the nation’s production flows. However, last month, 23 crude tankers loaded there to carry Russian barrels abroad. All of them loaded at berths 1, 1a or 2, according to vessel tracking data compiled by Bloomberg and shipping industry data.

The other oil berths in Novorossiysk are used for loading petroleum products, the data show. Since late February, no product tankers moored at berth 8 either, which suggests the facility isn’t used often.

The suspension order comes just days after the transportation watchdog halted loadings at two moorings of the Caspian Pipeline Consortium located nearby. The authority identified security breaches there and ordered an indefinite halt until the violations are rectified. The order has left the CPC infrastructure with just one operational mooring. 

As the CPC link is the single-largest export route for Kazakh crude, initially expected to load about 1.5 million barrels a day in April, the halt has more of an impact for Kazakhstan. With no options for rerouting all the barrels, the nation may need to cut oil production just as the Organization of Petroleum Exporting Countries nudges it to meet the production quota. 

Apart from the Novorossiysk facilities, Russia’s transportation authority have also run security checks at the regional ports of Tuapse and Taman on President Vladimir Putin’s orders, after a massive oil spill in Black Sea last December.

What do you think? We’d love to hear from you, join the conversation on the

Rigzone Energy Network.

The Rigzone Energy Network is a new social experience created for you and all energy professionals to Speak Up about our industry, share knowledge, connect with peers and industry insiders and engage in a professional community that will empower your career in energy.


MORE FROM THIS AUTHOR



Bloomberg


Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

A look back at Microsoft’s IPO

Speaking of good fortune, Fortune magazine was granted inside access to Gates, his executive and legal teams, and their Wall Street partners in the months leading up to the IPO. That arrangement resulted in a terrific fly-on-the-wall story published four months later. A few highlights gleaned from that story and

Read More »

ServiceNow to acquire Logik.ai to boost CRM portfolio

“With CPQ more seamlessly embedded into the sales and order management capabilities, sellers can increase productivity by exponentially reducing time towards building sales quotes and recording opportunities in the system. But also, as the system learns, it can also recommend the right products and services to add to a particular

Read More »

New Relic simplifies Kubernetes performance monitoring

For customers, New Relic explains that the benefits of eAPM include: Faster troubleshooting: Debug more quickly because they can monitor metrics, transaction details, and database performance in one place. Speedy deployment without altering existing code: Enable quick setup of application performance monitoring, discover all applications and services, identify critical span

Read More »

TotalEnergies Completes Multiple RE Asset Acquisitions

TotalEnergies has announced the completion of its acquisition of VSB Group, a wind and solar developer across Europe, and SN Power, a hydropower developer in Uganda. Concurrently it announced the signing of agreements to buy several wind and solar projects in the Canadian province of Alberta from RES. TotalEnergies said it had just closed the acquisition of one of these projects, the 184-megawatt (MW) Big Sky Solar, commissioned late February. “The completion of these three acquisitions in Europe, North America and Africa will contribute to our targets of 35 GW [gigawatts] of gross renewable capacity by 2025 and over 100 TWh [terawatt hours] of electricity production by 2030”, Stéphane Michel, TotalEnergies president for gas, renewables and power, said in a company statement. In Europe TotalEnergies’ renewable energy portfolio now consists of 500 MW installed or under construction and a pipeline of over 15 GW of solar and wind. In Africa the company has 255 MW of installed renewables capacity plus 560 MW of hydro-generation projects. In Canada TotalEnergies has raised its installed RE capacity to 180 MW. It also has more than 600 MW of solar and wind projects, according to the company. “These acquisitions strengthen our operations in markets where we are deploying our Integrated Power business, like Germany and in North America, and in countries, such as Uganda, where we can leverage synergies with our exploration and production activities”, Michel added. “Furthermore, these acquisitions will contribute to cash flow growth and to our goal of reaching our 12 percent profitability target in the electricity segment”. VSB, acquired from Swiss asset manager Partners Group for EUR 1.57 billion ($1.72 billion), contributes to TotalEnergies over 15 GW of projects in the pipeline, in addition to 7 GW operational or being built. However, TotalEnergies said, “Given its targeted strategy for certain key European

Read More »

ICYMI: DOE National Lab and Nuclear Weapons Directors Voice Support for Commonsense Permitting Reforms

GOLDEN, COLORADO—U.S. Secretary of Energy Chris Wright received strong support from the Department of Energy’s (DOE) National Lab and nuclear weapons assembly plant directors after he announced expedited permitting reforms for construction projects on Energy Department lands. These common sense reforms will save at least hundreds of millions of taxpayer dollars and rapidly accelerate project completion dates, helping better unleash American innovation, restore energy dominance, and modernize America’s nuclear stockpiles.  What America’s National Lab Directors Are Saying: “The recent guidance issued by the Department of Energy related to how the Laboratory executes our construction activities is intended to provide new tools to improve our operations and increase our effectiveness. This guidance will have positive impacts on our construction scheduling, budgeting, work execution and safety. We believe this guidance will have a net positive impact on most of our construction activities now costing less than $300,000,000 including, Energetic Materials Characterization modular facilities and Pajarito Corridor Office Complexes (PCOCs). The new guidance will save the federal government tens of millions of dollars on projects at LANL like these by reducing costs and avoiding overruns due to delays. In addition, allowing the Laboratory to better utilize existing Occupational Health and Safety Administration standards (OHSA) will increase the number of construction companies that can bid on work at LANL and provide a more competitive bidding process that will assist in lowing the costs of construction.” — Los Alamos National Laboratory Director Thom Mason “I appreciate Secretary Wright’s bold action to empower the National Laboratories to more efficiently deliver transformative scientific and technological outcomes that will benefit American taxpayers.  This is the most substantive and quickest change in improving lab operations that I have seen in my many years with DOE.” —Pacific Northwest National Laboratory Director Dr. Steven Ashby “Secretary Wright’s decisive actions in easing permitting rules and regulations

Read More »

Department of Energy Awards Management and Operating (M&O) Contract for Strategic Petroleum Reserve

WASHINGTON, D.C. — Following a rigorous competitive selection process, the Department of Energy (DOE) today announced that Strategic Storage Partners, LLC has secured a $1.4 billion contract to manage and operate the Strategic Petroleum Reserve (SPR). This includes the operation and maintenance of facilities and related systems located in Louisiana and Texas over a five-year period, with an option for DOE to extend the contract for an additional five years.  After a transition period, Strategic Storage Partners, LLC, will assume responsibility for management and operation of the SPR on June 15, 2025. The SPR’s mission is to safeguard the United States from significant petroleum supply disruptions through the acquisition, storage, distribution, and management of emergency petroleum stocks, fulfilling U.S. obligations under the International Energy Program. Federally owned oil stocks are stored in underground salt caverns at four sites located in Texas and Louisiana. The SPR has a longstanding history of protecting the U.S. economy and livelihoods during emergency oil shortages.  This announcement underscores President Trump’s commitment to advancing initiatives that support American jobs, strengthen domestic supply chains, and reinforce the United States’ position as a global energy leader.                                                                                                 ###

Read More »

Russia Oil Tax Revenue Slides as Prices, Output Decline

Russia reaped less money from oil taxes last month as production fell and global crude prices declined. Oil-tax proceeds dropped by more than 15 percent from a year earlier to 956.8 billion rubles ($11.4 billion), according to Bloomberg calculations based on Finance Ministry data published Thursday.  Global crude prices have fallen amid slower demand growth in China, ample supplies from North and South America and the fallout from US President Donald Trump’s increasingly aggressive tariff policy. Expectations for weaker global demand are “putting pressure on commodity prices,” Russia’s central bank said earlier this week. In February, the bank estimated an average 2025 Urals crude price for tax purposes of $65 a barrel and $60 in the next two years. Now it says the chance of lower prices has “increased somewhat.” For the latest data, the Finance Ministry calculated taxes based on an average Urals price of $61.69 a barrel in February. That’s down almost 10% from a year earlier and compares with a drop of almost 16 percent in the global Brent benchmark.  The decline in tax proceeds also follows a reduction in oil output, with Russia saying it’s brought production into line with its OPEC+ quota. The country last year was among the group’s laggard members, and it still needs to make up for the months it overshot targets. March’s total oil and gas revenue slumped 17 percent from a year earlier to 1.08 trillion rubles, the ministry’s data showed. Almost 89 percent of that came from crude and refined products. State subsidies to Russia’s refiners have also dented the federal budget, with the government paying 100.3 billion rubles to producers of gasoline and diesel to supply the domestic market.  Proceeds from the gas industry alone fell by almost a third from a year earlier to 124.5 billion rubles in March. That was driven

Read More »

Aker BP, SLB, Stimwell Extend Alliance for Five More Years

Aker BP ASA has extended its alliance agreement with SLB (Schlumberger Limited) and Stimwell Services Ltd. for another five years. The alliance formed in 2019 has supported Aker BP’s operated assets in meeting production targets. Aker BP said in a media release the alliance will work on further accelerating and boosting oil production. The alliance has established new standards for safe, efficient, and cost-effective operations through collaboration, digitalization, and innovative technology, according to Aker BP. Key achievements include simultaneous operations with jack-up rigs, a decreased backlog of locked-in barrels, and the first-ever autonomous intervention operation globally. Currently, both planning and execution utilize digital workflows, leading to enhanced productivity, reduced risk, and higher success rates, the company said. “Strategic partnerships are essential to shaping the future of our industry. At Aker BP, we remain committed to the alliance model, which creates value through long-term collaboration. It enables us to increase productivity, maintain world-class performance, and deliver oil and gas with low cost and low emissions. This is how we position ourselves as the E&P company of the future”, Aker BP CEO Karl Johnny Hersvik said. The alliance strategy aims to transform offshore well intervention.  Over the next five years, the partners aim to deliver top-quartile performance while developing future-proof capabilities, enhancing digital integration between subsurface and operations, expanding Aker BP’s Integrated Operations Centre for remote operations, and accelerating new technology deployment, Aker BP added. Furthermore, the alliance will focus on bringing wells onstream across Aker BP’s projects, using a newly upgraded stimulation vessel to optimize the Valhall PWP wells, contributing significantly to future production. “Increasing production and recovery from maturing assets is a top priority across the industry, and this alliance demonstrates how we can drive progress together through the power of partnership. The complex challenges facing our industry will increasingly

Read More »

BP chairman Helge Lund to step down

BP chairman Helge Lund has announced his plans to step down from his position. The company’s board of directors has initiated a succession process to select a new chair, with the succession process being led by Amanda Blanc in her capacity as senior independent director. Although a timeline for Lund’s resignation was not established, BP said it would most likely be during 2026. Until then, Lund will work with the successful candidate for his replacement to ensure an orderly transition ahead of taking on the role of chair. Whilst this succession process progresses, the board’s focus will remain on overseeing management’s delivery of the new strategy and this will continue to be their key priority under the new chair. Lund said: “Having fundamentally reset our strategy, BP’s focus now is on delivering the strategy at pace, improving performance and growing shareholder value. “Now is the right time to start the process to find my successor and enable an orderly and seamless handover. The board and I are committed to supporting Murray and his team, and to overseeing bp’s delivery of its strategic and financial objectives as we set out in our recent Capital Markets Update.” Lund has held the position of BP chairman since 2019, having served with three CEOs – Bob Dudley, Bernard Looney, and the current chief executive Murray Auchincloss. His last few years as chairman have been marked by BP committing, then rowing back on its climate commitments. Amid tighter economic conditions, BP’s investors have been looking for the company to refocus on its core oil and gas operations to boost revenues and share prices. This includes selling its lubricants business Castrol and divesting 50% of its solar business Lightsource BP. Amanda Blanc added: “We are starting a comprehensive search to identify chair candidates with the credibility

Read More »

Tariff war throws building of data centers into disarray

Forrester’s bottom line? “Because of the long term planning and all of the potential policy changes, I wouldn’t change my data center plans that much,” Nguyen said. Confusion reigns Every day it seems, the tariff situation becomes muddier. For example, according to a fact sheet released Wednesday, the White House has temporarily exempted semiconductors from tariffs, but not the aluminum used to build the servers and racks that house them. Furthermore, Scott Bickley, advisory fellow at the Info-Tech Research Group, said it is important to note how the various countries match with the various components. “Just about every major cost center for the buildout of a data center will be severely impacted by the new tariffs. Servers and hardware, including semiconductors, memory, network components, cabling, construction materials are going to see prices rise overnight once the tariffs go into effect,” Bickley said. “Consider that China, which has a 54% full tariff, is a major source of raw materials and rare earth elements essential for manufacturing DC components while Taiwan, at a 32% tariff rate, is the sole-source provider country for most advanced chipsets used in AI, cell phones, and any modern application footprint requiring high performance in a small footprint. South Korea (25% tariff) is a key provider of memory chips, while Japan (24%), Germany (20% EU rate), and the Netherlands (20% EU rate) are providers of sub-components like server racks, cooling systems, and semiconductor equipment.” But, he continued: “Now factor in the offshore/nearshore contract manufacturers like Mexico and Vietnam (46%) for electronics manufacturing (assembly and distribution) and Malaysia (10%) for semiconductor packaging, and it is clear to see that the complete technology supply chain leading into the data center will be taxed at multiple touchpoints.” Put all of that together and Info-Tech anticipates a lot of enterprise data center pain.

Read More »

New MLCommons benchmarks to test AI infrastructure performance

The latest release also broadens its scope beyond chatbot benchmarks. A new graph neural network (GNN) test targets datacenter-class hardware and is designed for workloads like fraud detection, recommendation engines, and knowledge graphs. It uses the RGAT model based on a graph dataset containing over 547 million nodes and 5.8 billion edges. Judging performance Analysts suggest that these benchmarks will make it easier to judge the performance of various hardware chips and clusters based on documented models. “As every chipmaker seeks to prove that its hardware is good enough to support AI, we now have a standard benchmark that shows the quality of question support, math, and coding skills associated with hardware,” said Hyoun Park, CEO and Chief Analyst at Amalgam Insights.  Chipmakers can now compete not just on traditional speeds and feeds, but in mathematical skill and informational accuracy. This benchmark provides a rare opportunity to add new performance standards on cross-vendor hardware, Park added. “The latency in terms of how quickly tokens are delivered and the time for the user to see the response is the deciding factor,” said Neil Shah, partner and co-founder at Counterpoint Research. “This is where players such as NVIDIA, AMD, and Intel have to get the software right to help developers optimize the models and bring out the best compute performance.” Benchmarking and buying decisions Independent benchmarks like those from MLCommons play a key role in helping buyers evaluate system performance, but relying on them alone may not provide the full picture.

Read More »

Potential Nvidia chip shortage looms as Chinese customers rush to beat US sales ban

Will it lead to shortages? The US first placed export controls on chips sent to China in October 2022 as a means to slow the country’s technological advances. It blocked the sale of Nvidia’s A100 and H100 chips, leading the company to develop the less powerful A800 and H800 chips for the market; they were also subsequently banned. There was a surge in demand for the H20 following the arrival of Chinese startup DeepSeek’s ultra low-cost, open-source AI model in January. And while the H20 is reported to be 15 times slower than Nvidia’s newest Blackwell chips sold elsewhere in the world, it was designed specifically by Nvidia to comply with the further US export controls introduced in October 2023. It is being used by Chinese companies for training, although it’s billed as an inference chip, explained Matt Kimball, VP and principal analyst for datacenter compute and storage at Moor Insights & Strategy. Should Nvidia choose to focus its efforts on manufacturing more of the chips, Kimball said he doesn’t think it will impact supply in the US and Europe, as Blackwell is the main product sold in those markets and H20 is an N-1 Hopper architecture chip. “If you take this a step further and ask whether this large order slows down the production of chips destined for the US and Europe, I’d say the answer is no, as the Hopper family is built on a different process node than the Blackwell family,” he said. Still, Kimball noted, “supply chain management is difficult, especially for smaller organizations that are put to the back of the line as hyperscalers with multibillion dollar orders are first in line for the newest [chips].”

Read More »

European cloud group invests to create what it dubs “Trump-proof cloud services”

But analysts have questioned whether the Microsoft move truly addresses those European business concerns. Phil Brunkard, executive counselor at Info-Tech Research Group UK, said, commenting on last month’s announcement of the EU Data Boundary for the Microsoft Cloud,  “Microsoft says that customer data will remain stored and processed in the EU and EFTA, but doesn’t guarantee true data sovereignty.” And European companies are now rethinking what data sovereignty means to them. They are moving beyond having it refer to where the data sits to focusing on which vendors control it, and who controls them. Responding to the new Euro cloud plan, another analyst, IDC VP Dave McCarthy, saw the effort as “signaling a growing European push for data control and independence.” “US providers could face tougher competition from EU companies that leverage this tech to offer sovereignty-friendly alternatives. Although €1 million isn’t a game-changer on its own, it’s a clear sign Europe wants to build its own cloud ecosystem—potentially at the expense of US market share,” McCarthy said. “For US providers, this could mean investing in more EU-based data centers or reconfiguring systems to ensure European customers’ data stays within the region. This isn’t just a compliance checkbox. It’s a shift that could hike operational costs and complexity, especially for companies used to running centralized setups.” Adding to the potential bad news for US hyperscalers, McCarthy said that there was little reason to believe that this trend would be limited to Europe. “If Europe pulls this off, other regions might take note and push for similar sovereignty rules. US providers could find themselves adapting to a patchwork of regulations worldwide, forcing a rethink of their global strategies,” McCarthy said. “This isn’t just a European headache, it’s a preview of what could become a broader challenge.”

Read More »

Talent gap complicates cost-conscious cloud planning

The top strategy so far is what one enterprise calls the “Cloud Team.” You assemble all your people with cloud skills, and your own best software architect, and have the team examine current and proposed cloud applications, looking for a high-level approach that meets business goals. In this process, the team tries to avoid implementation specifics, focusing instead on the notion that a hybrid application has an agile cloud side and a governance-and-sovereignty data center side, and what has to be done is push functionality into the right place. The Cloud Team supporters say that an experienced application architect can deal with the cloud in abstract, without detailed knowledge of cloud tools and costs. For example, the architect can assess the value of using an event-driven versus transactional model without fixating on how either could be done. The idea is to first come up with approaches. Then, developers could work with cloud providers to map each approach to an implementation, and assess the costs, benefits, and risks. Ok, I lied about this being the top strategy—sort of, at least. It’s the only strategy that’s making much sense. The enterprises all start their cloud-reassessment journey on a different tack, but they agree it doesn’t work. The knee-jerk approach to cloud costs is to attack the implementation, not the design. What cloud features did you pick? Could you find ones that cost less? Could you perhaps shed all the special features and just host containers or VMs with no web services at all? Enterprises who try this, meaning almost all of them, report that they save less than 15% on cloud costs, a rate of savings that means roughly a five-year payback on the costs of making the application changes…if they can make them at all. Enterprises used to build all of

Read More »

Lightmatter launches photonic chips to eliminate GPU idle time in AI data centers

“Silicon photonics can transform HPC, data centers, and networking by providing greater scalability, better energy efficiency, and seamless integration with existing semiconductor manufacturing and packaging technologies,” Jagadeesan added. “Lightmatter’s recent announcement of the Passage L200 co-packaged optics and M1000 reference platform demonstrates an important step toward addressing the interconnect bandwidth and latency between accelerators in AI data centers.” The market timing appears strategic, as enterprises worldwide face increasing computational demands from AI workloads while simultaneously confronting the physical limitations of traditional semiconductor scaling. Silicon photonics offers a potential path forward as conventional approaches reach their limits. Practical applications For enterprise IT leaders, Lightmatter’s technology could impact several key areas of infrastructure planning. AI development teams could see significantly reduced training times for complex models, enabling faster iteration and deployment of AI solutions. Real-time AI applications could benefit from lower latency between processing units, improving responsiveness for time-sensitive operations. Data centers could potentially achieve higher computational density with fewer networking bottlenecks, allowing more efficient use of physical space and resources. Infrastructure costs might be optimized by more efficient utilization of expensive GPU resources, as processors spend less time waiting for data and more time computing. These benefits would be particularly valuable for financial services, healthcare, research institutions, and technology companies working with large-scale AI deployments. Organizations that rely on real-time analysis of large datasets or require rapid training and deployment of complex AI models stand to gain the most from the technology. “Silicon photonics will be a key technology for interconnects across accelerators, racks, and data center fabrics,” Jagadeesan pointed out. “Chiplets and advanced packaging will coexist and dominate intra-package communication. The key aspect is integration, that is companies who have the potential to combine photonics, chiplets, and packaging in a more efficient way will gain competitive advantage.”

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »