Stay Ahead, Stay ONMINE

Enhancing AI agents with long-term memory: Insights into LangMem SDK, Memobase and the A-MEM Framework

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More AI agents can automate many tasks that enterprises want to perform. One downside, though, is that they tend to be forgetful. Without long-term memory, agents must either finish a task in a single session or be […]

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More


AI agents can automate many tasks that enterprises want to perform. One downside, though, is that they tend to be forgetful. Without long-term memory, agents must either finish a task in a single session or be constantly re-prompted. 

So, as enterprises continue to explore use cases for AI agents and how to implement them safely, the companies enabling development of agents must consider how to make them less forgetful. Long-term memory will make agents much more valuable in a workflow, able to remember instructions even for complex tasks that require several turns to complete.

Manvinder Singh, VP of AI product management at Redis, told VentureBeat that memory makes agents more robust. 

“Agentic memory is crucial for enhancing [agents’] efficiency and capabilities since LLMs are inherently stateless — they don’t remember things like prompts, responses or chat histories,” Singh said in an email. “Memory allows AI agents to recall past interactions, retain information and maintain context to deliver more coherent, personalized responses, and more impactful autonomy.”

Companies like LangChain have begun offering options to extend agentic memory. LangChain’s LangMem SDK helps developers build agents with tools “to extract information from conversation, optimize agent behavior through prompt updates, and maintain long-term memory about behaviors, facts, and events.”

Other options include Memobase, an open-source tool launched in January to give agents “user-centric memory” so apps remember and adapt. CrewAI also has tooling around long-term agentic memory, while OpenAI’s Swarm requires users to bring their memory model. 

Mike Mason, chief AI officer at tech consultancy Thoughtworks, told VentureBeat in an email that better agentic memory changes how companies use agents.

“Memory transforms AI agents from simple, reactive tools into dynamic, adaptive assistants,” Mason said. “Without it, agents must rely entirely on what’s provided in a single session, limiting their ability to improve interactions over time.” 

Better memory 

Longer-lasting memory in agents could come in different flavors. 

LangChain works with the most common memory types: semantic and procedural. Semantic refers to facts, while procedural refers to processes or how to perform tasks. The company said agents already have good short-term memory and can respond in the current conversation thread. LangMem stores procedural memory as updated instructions in the prompt. Banking on its work on prompt optimization, LangMem identifies interaction patterns and updates “the system prompt to reinforce effective behaviors. This creates a feedback loop where the agent’s core instructions evolve based on observed performance.”

Researchers working on ways to extend the memories of AI models and, consequently, AI agents have found that agents with long-term memory can learn from mistakes and improve. A paper from October 2024 explored the concept of AI self-evolution through long-term memory, showing that models and agents actually improve the more they remember. Models and agents begin to adapt to more individual needs because they remember more custom instructions for longer. 

In another paper, researchers from Rutgers University, the Ant Group and Salesforce introduced a new memory system called A-MEM, based on the Zettelkasten note-taking method. In this system, agents create knowledge networks that enable “more adaptive and context-aware memory management.”

Redis’s Singh said that agents with long-term memory function like hard drives, “holding lots of information that persists across multiple task runs or conversations, letting agents learn from feedback and adapt to user preferences.” When agents are integrated into workflows, that kind of adaptation and self-learning allows organizations to keep the same set of agents working on a task long enough to complete it without the need to re-prompt them.

Memory considerations

But it is not enough to make agents remember more; Singh said organizations must also make decisions on what the agents need to forget. 

“There are four high-level decisions you must make as you design a memory management architecture: Which type of memories do you store? How do you store and update memories? How do you retrieve relevant memories? How do you decay memories?” Singh said. 

He stressed that enterprises must answer those questions because making sure an “agentic system maintains speed, scalability and flexibility is the key to creating a fast, efficient and accurate user experience.” 

LangChain also said organizations must be clear about which behaviors humans mujst set and which should be learned through memory; what types of knowledge agents should continually track; and what triggers memory recall. 

“At LangChain, we’ve found it useful first to identify the capabilities your agent needs to be able to learn, map these to specific memory types or approaches, and only then implement them in your agent,” the company said in a blog post.

The recent research and these new offerings represent just the start of the development of toolsets to give agents longer-lasting memory. And as enterprises plan to deploy agents at a larger scale, memory presents an opportunity for companies to differentiate their products. 

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

SolarWinds buys Squadcast to speed incident response

Squadcast customers shared their experiences with the technology. “Since implementing Squadcast, we’ve reduced incoming alerts from tens of thousands to hundreds, thanks to flexible deduplication. It has a direct impact on reducing alert fatigue and increasing awareness,” said Avner Yaacov, Senior Manager at Redis, in a statement. According to SolarWinds,

Read More »

Tariffs won’t impact IT organizations, for now anyway

The idea behind tariffs is to increase domestic manufacturing, but Almassy notes the United Stated doesn’t have the manufacturing capacity or capability that Taiwan does. TSMC, Samsung, and GlobalFoundries have some fabs here but they are not building the most leading edge technologies. “Those are all in Taiwan at the

Read More »

DOE Issues Export Approval to Golden Pass LNG, Accelerating President Trump’s Pledge to Restore American Energy Dominance

WASHINGTON—U.S. Secretary of Energy Chris Wright today approved an LNG export permit extension for Golden Pass LNG Terminal LLC (Golden Pass), marking yet another step toward meeting President Trump’s and Secretary Wright’s commitment to unleash American energy dominance and restore regular order to liquefied natural gas (LNG) export reviews. The approval will grant additional time to begin LNG exports from the Golden Pass LNG Terminal, currently under construction in Sabine Pass, Texas. “Exporting U.S. LNG supports American jobs, bolsters our national security and strengthens America’s position as a world energy leader. President Trump has pledged to restore energy dominance for the American people, and I am proud to help deliver on that agenda with today’s permit extension,” said Secretary Wright. The issuance to Golden Pass marks the third LNG-related approval from DOE since President Trump took office, following an export approval to Commonwealth LNG on February 14 and an order on rehearing removing barriers for the use of LNG as bunkering fuel announced on February 28. “Golden Pass was the first project approved for exports to non-free trade agreement countries by DOE during the first Trump Administration, and it is gratifying that this project is so close to being able to deliver its first LNG,” said Tala Goudarzi, Acting Principal Deputy Assistant Secretary of the Office of Fossil Energy and Carbon Management. Golden Pass, owned by QatarEnergy and ExxonMobil, is set to begin exporting as early as later this year, and once operational, will become the ninth large-scale export terminal operating in the United States. Once completed, Golden Pass will be able to export up to 2.57 billion cubic feet per day (Bcf/d) of natural gas as LNG and will bring unprecedented levels of LNG exports from the United States.  ###

Read More »

USA Energy Sec Says We’re at the Start of a New Manhattan Project

We’re at the start of a new Manhattan Project. That’s what U.S. Secretary of Energy Chris Wright said in a release posted on the U.S. Department of Energy (DOE) website on Saturday. Wright made the statement “after participating in the launch of an AI collaboration session involving more than 1,000 Energy Department scientists and OpenAI employees”, the release highlighted. The DOE release noted that Wright joined Senator Bill Hagerty, Chairman of House Committee on Energy and Waters Chuck Fleischmann, and OpenAI President and Co-Founder Greg Brockman,   at the ‘1,000 Scientist AI Jam Session’ on February 28. This first of its kind event co-hosted by OpenAI and nine of the U.S. Department of Energy’s national labs explored how AI can accelerate scientific discovery, the release stated. “More than 70 years ago, experts from the Department of Energy’s Oak Ridge National Lab joined with innovators from around the United States in one of the greatest scientific and engineering accomplishments in history: the Manhattan Project,” Wright said in the release. “We’re at the start of a new Manhattan Project. With President Trump’s leadership, the United States will win the global AI race, but first, we must unleash our energy dominance and restore American competitiveness,” he added. “Today’s collaboration of DOE’s national labs and technology companies is an important step in our efforts to secure America’s future,” he continued. Hagerty, Fleischmann, and Brockman also made statements following the event, the DOE release highlighted. “It was great to join Secretary Wright and Representative Chuck Fleischmann this morning in Oak Ridge,” Hagerty said in the release. “In order for the U.S. to win the Artificial Intelligence race, we need computing power and energy, and Tennessee has both,” he added. “The U.S. can lead in this space with the Volunteer State at the tip of the spear. I

Read More »

US SMR deals remain elusive for NuScale

Dive Brief: NuScale is in “advanced commercial dialogue with major technology and industrial companies, utilities, and national and local governments” on potential small modular reactor deals as it looks forward to U.S. Nuclear Regulatory Commission approval for its 77-MW reactor uprate application later this year, the nuclear technology company said in a Monday earnings release. The company ordered “long-lead materials” for six additional modules from supply chain partner Doosan Enerbility in anticipation of customer orders, and continues to advance a 462-MW power plant project in Romania as a subcontractor to Fluor Corp., CEO John Hopkins said on the company’s Monday earnings call. But NuScale has yet to finalize a deal with any U.S. data center operators or other industrial customers due to “the complexity of putting these projects together,” Hopkins said. Dive Insight: In a Q4 and full-year 2024 earnings presentation that noted a significant revenue boost toward the end of last year, NuScale detailed “broad customer interest” in its technology, which it said can support multiple industries and grid use cases. Those grid use cases include resiliency, since reactors like NuScale’s can continue operating through outages and remain available once grid connectivity is restored, and microgrid support for mission-critical facilities like hospitals, the company said. NuScale reactors can also firm an increasingly renewables-heavy grid and replace baseload capacity lost due to coal-fired power plant retirements, the company said. Last year, a Department of Energy report on coal-to-nuclear transition potential found up to 174 GW of potential nuclear electric-generating capacity at 145 existing U.S. coal sites. The company presentation also highlighted four industrial opportunities: data centers and AI, carbon capture and sequestration, water desalination and hydrogen production. New nuclear power capacity is eligible for the maximum $3/kg credit for clean hydrogen production under the Inflation Reduction Act’s Section 45V tax

Read More »

Government confirms windfall tax to go in 2030 as consultation kicks off

The UK government has confirmed that the windfall tax is scheduled to end in 2030, and has launched a consultation on what its successor will look like. The government had already announced a 2030 end date for the controversial taxation policy in the Autumn Budget when chancellor Rachel Reeves pushed back its end date. The government has said that authorities will work with industry, communities, trade unions and wider organisations to determine what the new regime could look like to ensure it can respond to any future shocks in commodity prices. The government said that the new regime will provide long-term certainty to the oil and gas industry, helping support investments. The regime aims to protect jobs in existing and future industries and deliver a fair return for the nation during times of unusually high prices. The highly controversial energy profits levy (EPL), or windfall tax, brought the headline rate of tax on operators to 78% late last year after Keir Starmer’s government hiked taxes imposed on UK operators by 3%. The oil and gas industry has long since rallied against the tax regime in the UK. Energy secretary Ed Miliband said: “The North Sea will be at the heart of Britain’s energy future. For decades, its workers, businesses and communities have helped power our country and our world. “Oil and gas production will continue to play an important role and, as the world embraces the drive to clean energy, the North Sea can power our Plan for Change and clean energy future in the decades ahead. “This consultation is about a dialogue with North Sea communities – businesses, trade unions, workers, environmental groups and communities – to develop a plan that enables us to take advantage of the tremendous opportunities of the years ahead.” Commitment to existing North Sea

Read More »

Saudi Aramco Weighs Bid for BP’s Castrol Lubricant Assets

Saudi Aramco is considering a potential offer for lubricant assets being sold by BP Plc, as the Middle Eastern company pushes for acquisitions that deepen its reach in oil consuming countries, according to people with knowledge of the matter. The world’s biggest energy company is studying whether to bid for part or all of the business, which operates under the Castrol brand name, the people said. Aramco could look to combine the Castrol assets with its Valvoline lubricants unit, which it bought in a $2.65 billion deal completed in 2023.  As part of a major corporate revamp, BP is starting a strategic review of the Castrol lubricants business, which Bloomberg News previously reported could be worth around $10 billion.  Aramco is particularly interested in Castrol’s operations in fast-growing markets like India, the people said, asking not to be identified because the information is private. Its Mumbai-listed subsidiary Castrol India Ltd. has a market value of about $2.5 billion.  Deliberations are still at an early stage, and Aramco hasn’t made a final decision the structure of a potential bid or whether it will proceed, the people said. The Castrol assets are also attracting interest from other bidders, according to the people. Representatives for Aramco and BP declined to comment.  Aramco said last year it’s looking for more refining and chemicals acquisitions in Asia and sees China, India and southeast Asia as its big growth markets. Owning businesses like filling stations and lubricant manufacturers gives oil producers more sway along the energy value chain, while taking them deeper into markets where they sell their crude.  State-owned Aramco has done a string of deals for refining and chemical assets in China, and bought a stake in a company in the Philippines last month that will give it access to the retail market. Aramco is also among bidders for Shell Plc’s service stations in

Read More »

Will Trump tariffs delay utility transmission, power plant plans?

FirstEnergy and other utilities are warning that Trump administration tariffs on Canada, Mexico and other countries could hurt them, according to risk disclosures included in annual reports filed last month with the U.S. Securities and Exchange Commission. The alerts on tariffs come as U.S. utilities have been expanding their capital expenditure plans to build transmission lines and power plants to meet rising demand growth, partly driven by data center development. “Any widespread imposition of new or increased tariffs could have an adverse effect on our results of operations, cash flow and financial condition,” FirstEnergy said in a Feb. 27 filing. “New or increased tariffs could also negatively affect U.S. national or regional economies, which also could negatively impact our business and results of operations.” Deteriorating economic conditions triggered by tariffs or other causes generally lead to reduced electric use by customers, particularly industrial customers, according to American Electric Power. “The current administration has implemented tariffs on certain imported goods and may impose additional tariffs,” AEP said in a Feb. 13 filing. “As a result, prevailing economic conditions may reduce future net income and cash flows and negatively impact [our] financial condition.” Tariffs could disrupt supply chains and delay building, maintaining and repairing infrastructure needed to support operations or are required to execute AEP’s plans for continued capital investment and to transition its generation fleet, the Akron, Ohio-based utility company said. Tariffs could also drive up the price of materials and equipment, increase the cost of capital and extend procurement lead times, according to AEP. President Trump on March 4 imposed 25% tariffs on imports from Mexico and Canada, and increased tariffs on imports from China by 10%. Those countries responded with plans to impose tariffs on U.S. imports. The tariffs could limit access to electrical equipment, such as transformers, needed

Read More »

AI driving a 165% rise in data center power demand by 2030

Goldman Sachs Research estimates the power usage by the global data center market to be around 55 gigawatts, which breaks down as 54% for cloud computing workloads, 32% for traditional line of business workloads and 14% for AI. By 2027, that number jumps to 84 GW, with AI growing to 27% of the overall market, cloud dropping to 50%, and traditional workloads falling to 23%, Schneider stated. Goldman Sachs Research estimates that there will be around 122 GW of data center capacity online by the end of 2030, and the density of power use in data centers is likely to grow as well, from 162 kilowatts per square foot to 176 KW per square foot in 2027, thanks to AI, Schneider stated.  “Data center supply — specifically the rate at which incremental supply is built — has been constrained over the past 18 months,” Schneider wrote. These constraints have arisen from the inability of utilities to expand transmission capacity because of permitting delays, supply chain bottlenecks, and infrastructure that is both costly and time-intensive to upgrade. The result is that due to power demand from data centers, there will need to be additional utility investment, to the tune of about $720 billion of grid spending through 2030. And then they are subject to the pace of public utilities, which move much slower than hyperscalers. “These transmission projects can take several years to permit, and then several more to build, creating another potential bottleneck for data center growth if the regions are not proactive about this given the lead time,” Schneider wrote.

Read More »

Top data storage certifications to sharpen your skills

Organization: Hitachi Vantara Skills acquired: Knowledge of data center infrastructure management tasks automation using Hitachi Ops Center Automator. Price: $100 Exam duration: 60 minutes How to prepare: Knowledge of all storage-related operations from an end-user perspective, including planning, allocating, and managing storage and architecting storage layouts. Read more about Hitachi Vantara’s training and certification options here. Certifications that bundle cloud, networking and storage skills AWS Certified Solutions Architect – Professional The AWS Certified Solutions Architect – Professional certification from leading cloud provider Amazon Web Services (AWS) helps individuals showcase advanced knowledge and skills in optimizing security, cost, and performance, and automating manual processes. The certification is a means for organizations to identify and develop talent with these skills for implementing cloud initiatives, according to AWS. The ideal candidate has the ability to evaluate cloud application requirements, make architectural recommendations for deployment of applications on AWS, and provide expert guidance on architectural design across multiple applications and projects within a complex organization, AWS says. Certified individuals report increased credibility with technical colleagues and customers as a result of earning this certification, it says. Organization: Amazon Web Services Skills acquired: Helps individuals showcase skills in optimizing security, cost, and performance, and automating manual processes Price: $300 Exam duration: 180 minutes How to prepare: The recommended experience prior to taking the exam is two or more years of experience in using AWS services to design and implement cloud solutions Cisco Certified Internetwork Expert (CCIE) Data Center The Cisco CCIE Data Center certification enables individuals to demonstrate advanced skills to plan, design, deploy, operate, and optimize complex data center networks. They will gain comprehensive expertise in orchestrating data center infrastructure, focusing on seamless integration of networking, compute, and storage components. Other skills gained include building scalable, low-latency, high-performance networks that are optimized to support artificial intelligence (AI)

Read More »

Netskope expands SASE footprint, bolsters AI and automation

Netskope is expanding its global presence by adding multiple regions to its NewEdge carrier-grade infrastructure, which now includes more than 75 locations to ensure processing remains close to end users. The secure access service edge (SASE) provider also enhanced its digital experience monitoring (DEM) capabilities with AI-powered root-cause analysis and automated network diagnostics. “We are announcing continued expansion of our infrastructure and our continued focus on resilience. I’m a believer that nothing gets adopted if end users don’t have a great experience,” says Netskope CEO Sanjay Beri. “We monitor traffic, we have multiple carriers in every one of our more than 75 regions, and when traffic goes from us to that destination, the path is direct.” Netskope added regions including data centers in Calgary, Helsinki, Lisbon, and Prague as well as expanded existing NewEdge regions including data centers in Bogota, Jeddah, Osaka, and New York City. Each data center offers customers a range of SASE capabilities including cloud firewalls, secure web gateway (SWG), inline cloud access security broker (CASB), zero trust network access (ZTNA), SD-WAN, secure service edge (SSE), and threat protection. The additional locations enable Netskope to provide coverage for more than 220 countries and territories with 200 NewEdge Localization Zones, which deliver a local direct-to-net digital experience for users, the company says.

Read More »

Inside the Nuclear Race for Data Center Energy with Aalo Atomics CEO Matt Loszak

The latest episode of the DCF Show podcast delves into one of the most pressing challenges facing the data center industry today: the search for sustainable, high-density power solutions. And how, as hyperscale operators like Google and Meta contend with growing energy demands—and, in some cases, resistance from utilities unwilling or unable to support their expanding footprints—the conversation around nuclear energy has intensified.  Both legacy nuclear providers and innovative startups are racing to secure the future business of data center giants, each bringing unique approaches to the table. Our guest for this podcast episode is Matt Loszak, co-founder and CEO of Aalo Atomics, an Austin-based company that’s taking a fresh approach to nuclear energy. Aalo, which secured a $29.5 million Series A funding round in 2024, stands out in the nuclear sector with its 10-megawatt sodium-cooled reactor design—eliminating the need for water, a critical advantage for siting flexibility. Inspired by the Department of Energy’s MARVEL microreactor, Aalo’s technology benefits from direct expertise, as the company’s CTO was the chief architect behind MARVEL. Beyond reactor design, Aalo’s vision extends to full-scale modular plant production. Instead of just building reactors, the company aims to manufacture entire nuclear plants using prefabricated, LEGO-style components. The fully modular plants, shipped in standard containers, are designed to match the footprint of a data center while requiring no onsite water—features that could make them particularly attractive to hyperscale operators seeking localized, high-density power.  Aalo has already made significant strides, with the Department of Energy identifying land at Idaho National Laboratory (INL) as a potential site for its first nuclear facility. The company is on an accelerated timeline, expecting to complete a non-nuclear prototype within three months and break ground on its first nuclear reactor in about a year—remarkably fast progress for the nuclear industry. In our discussion,

Read More »

Does It Matter If Microsoft Is Cancelling AI Data Center Leases?

Strategic Reallocation: Microsoft is a major owner and operator of data centers and might be reallocating resources to in-house infrastructure rather than leased spaces. Supply Chain Delays: TD Cowen noted that Microsoft used power and facility delays as justifications for voiding agreements, a tactic previously employed by Meta. Oversupply Issues: Analysts at TD Cowen speculate that Microsoft may have overestimated AI demand, leading to an excess in capacity. As it is all speculation, it could simply be that the latest information has driven Microsoft to reevaluate demand and move to more closely align projected supply with projected demand. Microsoft has reiterated their commitment to spend $80 billion on AI in the coming year. Reallocating this spending internally or wit a different set of partners remains on the table. And when you put the TD Cowen report that Microsoft has cancelled leases for “a couple hundred megawatts” into context with Microsoft’s overall leased power, which is estimated at around 20 GW, you see that more than 98% of their energy commitment remains unchanged. Investment Markets Might See the Biggest Hits Microsoft’s retreat has had ripple effects on the stock market, particularly among energy and infrastructure companies. European firms like Schneider Electric and Siemens Energy experienced a decline in stock value, indicating fears that major AI companies might scale back energy-intensive data center investments. However, at press time we have not seen any other indicators that this is an issue as despite these concerns about potential AI overcapacity, major tech firms continue to invest heavily in AI infrastructure:         Amazon: Pledged $100 billion towards AI data centers.         Alphabet (Google): Committed $75 billion.         Meta (Facebook): Planning to spend up to $65 billion.         Alibaba: Announced a $53 billion investment over the next three years. If we see a rush of announcements

Read More »

Dual Feed: Vantage Data Centers, VoltaGrid, Equinix, Bloom Energy, Constellation, Calpine

Nuclear Giant Constellation Acquires Natural Gas Stalwart Calpine, Creating the Largest U.S. Clean Energy Provider On January 10, 2025, Constellation (Nasdaq: CEG) announced a definitive agreement to acquire Calpine Corp. in a $16.4 billion cash-and-stock transaction, including the assumption of $12.7 billion in net debt.  A landmark transaction, the acquisition positions Constellation as the largest clean energy provider in the United States, significantly enhancing its generation portfolio with natural gas and geothermal assets. With an expanded coast-to-coast footprint, the combined company will provide 60 GW of power, reinforcing grid reliability and offering businesses and consumers a broader array of sustainability solutions. The move strengthens Constellation’s competitive retail electricity presence, serving 2.5 million customers across key U.S. markets, including Texas, California, and the Northeast. “This acquisition will help us better serve our customers across America, from families to businesses and utilities,” said Joe Dominguez, president and CEO of Constellation. “By combining Constellation’s unmatched expertise in zero-emission nuclear energy with Calpine’s industry-leading, low-carbon natural gas and geothermal generation, we can deliver the most comprehensive clean energy portfolio in the industry.” A Strategic Move for the Data Center Industry With skyrocketing demand for AI and cloud services, data centers are under increasing pressure to secure reliable, low-carbon energy sources. The Constellation-Calpine combination is particularly relevant for large-scale hyperscale operators and colocation providers seeking flexible energy solutions.  For the data center industry, this consolidation offers several advantages: Diverse Energy Mix: The integration of nuclear, geothermal, and low-emission natural gas provides data centers with flexible and reliable energy options. Grid Stability: Calpine’s extensive natural gas fleet enhances grid reliability, crucial for data centers operating in high-demand regions. Sustainability Initiatives: The combined entity is well-positioned to invest in clean energy infrastructure, including battery storage and carbon sequestration, aligning with the sustainability goals of hyperscale operators. The

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »