Stay Ahead, Stay ONMINE

Medical training’s AI leap: How agentic RAG, open-weight LLMs and real-time case insights are shaping a new generation of doctors at NYU Langone

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Patient data records can be convoluted and sometimes incomplete, meaning doctors don’t always have all the information they need readily available. Added to this is the fact that medical professionals can’t possibly keep up with the […]

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More


Patient data records can be convoluted and sometimes incomplete, meaning doctors don’t always have all the information they need readily available. Added to this is the fact that medical professionals can’t possibly keep up with the barrage of case studies, research papers, trials and other cutting-edge developments coming out of the industry. 

New York City-based NYU Langone Health has come up with a novel approach to tackle these challenges for the next generation of doctors. 

The academic medical center — which comprises NYU Grossman School of Medicine and NYU Grossman Long Island School of Medicine, as well as six inpatient hospitals and 375 outpatient locations — has developed a large language model (LLM) that serves as a respected research companion and medical advisor. 

Every night, the model processes electronic health records (EHR), matching them with relevant research, diagnosis tips and essential background information that it then delivers in concise, tailored emails to residents the following morning. This is an elemental part of NYU Langone’s pioneering approach to medical schooling — what it calls “precision medical education” that uses AI and data to provide highly customized student journeys. 

“This concept of ‘precision in everything’ is needed in healthcare,”  Marc Triola, associate dean for educational informatics and director of the Institute for Innovations in Medical Education at NYU Langone Health, told VentureBeat. “Clearly the evidence is emerging that AI can overcome many of the cognitive biases, errors, waste and inefficiencies in the healthcare system, that it can improve diagnostic decision-making.” 

How NYU Langone is using Llama to enhance patient care

NYU Langone is using an open-weight model built on the latest version of Llama-3.1-8B-instruct and the open-source Chroma vector database for retrieval-augmented generation (RAG). But it’s not just accessing documents — the model is going beyond RAG, actively employing search and other tools to discover the latest research documents.

Each night, the model connects to the facility’s EHR database and pulls out medical data for patients seen at Langone the previous day. It then searches for basic background information on diagnoses and medical conditions. Using a Python API, the model also performs a search of related medical literature in PubMed, which has “millions and millions of papers,” Triola explained. The LLM sifts through reviews, deep-dive papers and clinical trials, selecting a couple of the seemingly most relevant and “puts it all together in a nice email.” 

Early the following morning, medical students and internal medicine, neurosurgery and radiation oncology residents receive a personalized email with detailed patient summaries. For instance, if a patient with congestive heart failure had been in for a checkup the previous day, the email will provide a refresher on the basic pathophysiology of heart conditions and information about the latest treatments. It also offers self-study questions and AI-curated medical literature. Further, it may give pointers about steps the residents could take next or actions or details they may have overlooked.

“We’ve gotten great feedback from students, from residents and from the faculty about how this is frictionlessly keeping them up to date, how they’re incorporating this in the way they make choices about a patient’s plan of care,” said Triola. 

A key success metric for him personally was when a system outage halted the emails for a few days — and faculty members and students complained they weren’t receiving the morning nudges they had come to rely on.

“Because we’re sending these emails right before our doctors start rounds — which is among the craziest and busiest times of the day for them — and for them to notice that they weren’t getting these emails and miss them as a part of their thinking was awesome,” he said. 

Transforming the industry with precision medical education

This sophisticated AI retrieval system is fundamental to NYU Langone’s precision medical education model, which Triola explained is based on “higher density, frictionless” digital data, AI and strong algorithms.

The institution has collected vast amounts of data over the past decade about students — their performance, the environments they’re taking care of patients in, the EHR notes they’re writing, the clinical decisions they’re making and the way they reason through patient interactions and care. Further, NYU Langone has a vast catalog of all the resources available to medical students, whether those be videos, self-study or exam questions, or online learning modules.

The success of the project is also thanks to the medical facility’s streamlined architecture: It boasts centralized IT, a single data warehouse on the healthcare side and a single data warehouse for education, allowing Langone to marry its various data resources.

Chief medical information officer Paul Testa noted that great AI/ML systems aren’t possible without great data, but “it’s not the easiest thing to do if you’re sitting on unwarehoused data in silos across your system.” The medical system may be large, but it operates as “one patient, one record, one standard.”

Gen AI allowing NYU Langone to move away from ‘one-size-fits-all’ education

As Triola put it, the main question his team has been looking to address is: “How do they link the diagnosis, the context of the individual student and all of these learning materials?” 

“All of a sudden we’ve got this great key to unlock that: generative AI,” he said. 

This has enabled the school to move away from a “one-size-fits-all” model that has been the norm, whether students intended to become, for example, a neurosurgeon or a psychiatrist — vastly different disciplines that require unique approaches. 

It’s important that students get tailored education throughout their schooling, as well as “educational nudges” that adapt to their needs, he said. But you can’t just tell faculty to “spend more time with each individual student” — that’s humanly impossible. 

“Our students have been hungry for this, because they recognize that this is a high-velocity period of change in medicine and generative AI,” said Triola. “It absolutely will change…what it means to be a physician.”

Serving as a model for other medical institutions

Not that there haven’t been challenges along the way. Notably, technical teams have been working through model “immaturity.” 

As Triola noted: “It’s fascinating how expansive and accurate their embedded knowledge is, and sometimes how limited. It’ll work perfectly, predictably, 99 times in a row, and then on the 100th time it’ll make an interesting set of choices.”

For instance, early on in development, the LLMs couldn’t differentiate between an ulcer on the skin and an ulcer in the stomach, which are “not related conceptually at all,” Triola explained. His team has since focused on prompt refining and grounding, and the result has been “remarkable.” 

In fact, his team is so confident in the stack and process that they believe it can serve as a great example for others to follow. “We were favoring open source and open weight because we wanted to get to the point where we could say, ‘Hey, other medical schools, many of whom don’t have a lot of resources, you can do this on the cheap,’” Triola explained. 

Testa agreed: “Is it reproducible? Is it something we want to disseminate? Absolutely, we want to disseminate it across healthcare.”

Reassessing ‘sacrosanct’ practices in medicine

Understandably, there’s much concern across the indusry about nuanced biases that might be baked into AI systems. However, Triola pointed out that that’s not a huge concern in this use case, as it’s a relatively straightforward task for AI. “It’s searching, it’s choosing from a list, it’s summarizing,” he noted. 

Rather, one of the biggest surfaced concerns is around unskilling or deskilling. Here’s a correlation: Those of a certain vintage might remember learning cursive in elementary school — yet they likely have forgotten the skill because they’ve found rare occasion to use it in their adult life. Now, it’s near obsolete, rarely taught in today’s primary education. 

Triola pointed out that there are “sacrosanct” parts of being a physician, and that some are resistant to give those up to AI or digital systems “in any way, shape or form.” For example, there’s a perception that young doctors should be actively researching and nose-down in the latest literature whenever they’re not in a clinical setting. But the amount of medical knowledge available today and the “frenetic pace” of clinical medicine demands a different way of doing things, Triola emphasized.

When it comes to researching and retrieving information, he noted: “AI does it better, and that’s an uncomfortable truth that many people are hesitant to believe.”

Instead, he posited: “Let’s say that this is going to give superpowers to doctors and figure out the co-pilot relationship between the human and AI, not the competitive relationship of who’s going to do what.” 

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

Fortinet speeds threat detection with improved FortiAnalyzer

The package also now integrates with FortiAI, the vendor’s genAI assistant, to better support analytics and telemetry to help security teams speed threat investigation and response, the vendor stated. “FortiAI identifies the threats that need analysis from the data collected by FortiAnalyzer, primarily collected from FortiGates. By automating the collection,

Read More »

Aryaka adds AI-powered observability to SASE platform

Nadkarni explained that Aryaka runs unsupervised machine learning models on the data to identify anomalies and outliers in the data. For example, the models may detect a sudden spike in traffic to a domain that has not been seen before. This unsupervised analysis helps surface potential issues or areas of

Read More »

USA, Ukraine Ramp Up Talks on Minerals Deal During Envoy Trip

Ukrainian and US negotiators are seeking to move past the breakdown in transatlantic relations this week to finalize a deal on critical minerals, a person with knowledge of the talks said.   Two days after President Donald Trump hectored President Volodymyr Zelenskiy as a “dictator” who needed to move quickly on a peace deal, Ukrainian officials are discussing the minerals issue with US special envoy Keith Kellogg during a visit to Kyiv, the person said on condition of anonymity as talks take place behind closed doors.  Zelenskiy, who rejected an initial US offer that involved securing half the income from Ukraine’s minerals, said that his meeting with Kellogg on Thursday had “restored hope.”  The US proposal envisaged securing 50% of license sales and other proceeds from the minerals, which would violate Ukrainian laws, a person familiar with the discussions said. It also covered revenue from oil, gas and ports, ABC News reported, citing a draft document.  After Kyiv suggested changes, the US has now sent a revised and improved version, which includes language on future assistance, a person familiar with the matter said. Trump has said he wanted the equivalent of $500 billion worth of rare earths, which are mainly used in high-strength magnets. But despite reports of $10 trillion worth of mineral deposits, Ukraine has no major rare-earth reserves that are internationally recognized as economically viable. Most deposits are likely by-products of producing materials like phosphates. Some are in areas of Russian control.  A completed deal on US access to Ukrainian minerals in exchange for security guarantees would be a key element of the Trump administration’s effort to end the three-year war. Trump has said he may meet with Russian President Vladimir Putin soon.  “We need strong agreements with the United States – agreements that will truly work,” Zelenskiy said in his daily address to

Read More »

Grangemouth protesters lay 400 hard hats outside Scottish Labour conference

Grangemouth workers targeted the opening day of the Scottish Labour conference on Friday, laying 400 hard hats on the ground to represent every job set to be lost. Workers and members of the Unite union gathered outside the Scottish Event Campus in Glasgow to urge the UK Government to do more to save the oil refinery. Protesters braved wet and windy weather as they chanted “You said our refinery was your top priority” and “Keep Grangemouth working”. Petroineos announced last year that more than 400 jobs would be lost as Scotland’s last oil refinery shuts this year. The Scottish and UK governments have committed £100 million to the Falkirk and Grangemouth Growth Deal to support jobs and skills in the area along with a report – called Project Willow – to look at the future of the site. Scotland’s First Minister John Swinney also announced a further £25 million to ensure a “just transition” in the area. Grangemouth future But trade unions have accused both governments of not doing enough to save the plant, while Scottish Labour leader Anas Sarwar has described the response as “not good enough”. Sharon Graham, general secretary of the Unite union, told the PA news agency that Grangemouth could still be saved. She said: “We are taking every opportunity we can to say to the UK Government, the Scottish Government and all politicians that they have not got long now to save Grangemouth. © Andrew Milligan/PA WireUnite General Secretary Sharon Graham speaks at a demonstration to protest at Petroineos plans to close Grangemouth oil refinery, during the Scottish Labour Party conference at the Scottish Exhibition Centre (SEC) in Glasgow. Image: Andrew Milligan/PA Wire “Let’s be really clear, Grangemouth can be saved, but I’m saying to all politicians today, if they do not do that, then

Read More »

USA Crude Oil Inventories Rise by 4.6MM Barrels WoW

U.S. commercial crude oil inventories, excluding those in the Strategic Petroleum Reserve (SPR), increased by 4.6 million barrels from the week ending February 7 to the week ending February 14, the U.S. Energy Information Administration (EIA) highlighted in its latest weekly petroleum status report. The EIA’s report, which was released on February 20 and included data for the week ending February 14, showed that crude oil stocks, not including the SPR, stood at 432.5 million barrels on February 14, 427.9 million barrels on February 7, and 443.0 million barrels on February 16, 2024. Crude oil in the SPR stood at 395.3 million barrels on February 14 and February 7, and 359.5 million barrels on February 16, 2024, the report outlined. Total petroleum stocks – including crude oil, total motor gasoline, fuel ethanol, kerosene type jet fuel, distillate fuel oil, residual fuel oil, propane/propylene, and other oils – stood at 1.607 billion barrels on February 14, the report showed. Total petroleum stocks were up 0.2 million barrels week on week and up 16.4 million barrels year on year, the report outlined. “At 432.5 million barrels, U.S. crude oil inventories are about three percent below the five year average for this time of year,” the EIA said in its report. “Total motor gasoline inventories decreased by 0.2 million barrels from last week and are one percent below the five year average for this time of year. Finished gasoline inventories increased, while blending components inventories decreased last week,” it added. “Distillate fuel inventories decreased by 2.1 million barrels last week and are about 12 percent below the five year average for this time of year. Propane/propylene inventories decreased by 3.6 million barrels from last week and are slightly below the five year average for this time of year,” the EIA continued. U.S. crude

Read More »

Power Moves: SOWEC’s new co-chair and more

Claire Mack has joined the Scottish Offshore Wind Energy Council (SOWEC) as its new industry co-chair. Mack was appointed by minister for climate action Dr Alasdair Allan to represent Scotland’s offshore wind industry for a two-year term. She will provide strategic leadership to SOWEC and work alongside the partnership’s subgroups and developer forum to deliver sector growth. In addition to her new role, Mack has been chief executive of Scottish Renewables since 2017. She is responsible for leading the organisation’s work to grow Scotland’s renewable energy industry and maintaining its position as a global leader in clean power. She replaces Brian McFarlane, SSE Renewables head of offshore development, who announced his intention to step down as SOWEC industry co-chair late last year. Since SOWEC’s inception in 2019, he has acted on behalf of Scotland’s offshore wind industry to drive forward SOWEC’s mission to coordinate and grow the sector. Acting minister for climate action Allan said: “Offshore wind is at the heart of the Scottish Government’s economic development agenda and the SOWEC is vital to driving forward action in support of the sector.” Mack added: “Offshore wind is one of Scotland’s key assets and our ability to turn our 40GW pipeline into successfully delivered projects is pivotal to our environmental and economic aspirations. “As I step into the role of co-chair at SOWEC during this critical point for Scotland’s £4 billion offshore wind sector, I would like to express my sincere gratitude to my predecessor, Brian McFarlane, for his untiring dedication and contribution to the sector.” © Supplied by Offshore Energies UKOEUK operator and developer chair Doris Reiter. Doris Reiter, BP senior vice-president for the North, has taken over as operator and developer chair on the board of Offshore Energies UK (OEUK). Having previously served as vice chair to the board,

Read More »

Expansion of Corpus Christi LNG in Texas Starts Delivery

Cheniere Energy Inc. has produced the first liquefied natural gas (LNG) cargo from a project expanding the Corpus Christi LNG (CCL) terminal in the namesake Texan city. CCL Stage 3 has seven midscale trains with an expected production capacity of over 10 million metric tons per annum (MMtpa), raising CCL’s output capacity to over 25 MMtpa from 10 trains. “First LNG production from the first train of the CCL Stage 3 Project was achieved in December 2024, and the first cargo of LNG was produced in February 2025”, Houston, Texas-based Cheniere said in its quarterly report. It plans to build two more mid-scale trains adjacent to the Stage 3 project for a further capacity addition of about 3 MMtpa. “In June 2024, we received a positive Environmental Assessment from the FERC and anticipate receiving all remaining necessary regulatory approvals for the project in 2025”, the report said. Citing new capacity unlocked from the expansion, Cheniere expects to surpass last year’s earnings. Consolidated adjusted EBITDA guidance for 2025 is $6.5 billion-$7 billion, compared to the actual 2024 figure of $6.2 billion. Distributable cash flow this year is expected to be $4.1 billion-$4.6 billion, compared to the actual 2024 figure of $3.7 billion. “We expect 2025 to be another record year for LNG production as Stage 3 trains are completed, and we look forward to delivering financial results within these ranges and further enhancing the long-term value proposition of Cheniere”, commented president and chief executive Jack Fusco. Last year Cheniere exported a record 646 LNG cargoes, or 2.33 trillion British thermal units. Cheniere posted $977 million in net profit for the fourth quarter (Q4) of 2024 and $3.25 billion for the full year. The figures were down 29 percent and 67 percent by year-ago comparisons, respectively. On a diluted basis, Q4 earnings

Read More »

Leveraging surplus interconnection could unleash 800 GW of energy the US needs today

Cassady Craighill is the technical education director at GridLab. The challenges facing the grid can often feel intimidating and beyond grasp. How much of the load growth is hype and how much is real? How do we ensure the financial burden of updating our grid and leading the world in energy innovation does not disproportionally fall on residential customers? How do operators and utilities prepare for the next cataclysmic flood, fire, or storm? How do we ensure that our 21st energy grid serves everyone across the country amidst a patchwork of different state policies and resources?  Those are all big questions that will take years to get exactly right. Luckily, there is an out-of-the-box solution laying on the table today that will get hundreds of energy projects online in no time and will save consumers money. Surplus interconnection, which allows new energy projects to plug into existing interconnection infrastructure at plants with low capacity factors, could nearly double the generation in the United States by 2030 and at a fraction of the cost and time of a traditional interconnection process.  Affordable, clean, and abundant power is key to keeping businesses operating in the U.S. This sentiment dominates the rhetoric in Energy Secretary Chris Wright’s first Secretarial order issued this month and his message of “energy addition.” At its core, surplus interconnection is adding energy resources to the grid as quickly and cheaply as possible. In his order, Secretary Wright insists “we must expand energy production and reduce energy costs for American families and businesses.” Leveraging existing interconnection capacity at thermal plants is the fastest way to expand energy production by shaving years off the interconnection timeline, saving about $200 billion by avoiding costly new infrastructure and lengthy new buildouts, and adding reliable resources to the grid. We need this solution

Read More »

Do data centers threaten the water supply?

In a new report, the Royal Academy of Engineering called upon the government to ensure tech companies accurately report how much energy and water their data centers are using and reducing the use of drinking water for cooling. Without such action, warns one of the report’s authors, Professor Tom Rodden, “we face a real risk that our development, deployment and use of AI could do irreparable damage to the environment.” The situation is a little different for the US as the country has large bodies of water offering a  water supply that the UK just does not have. It’s not an accident that there are many data centers around the Chicago area: they’ve also got the Great Lakes to draw upon. Likewise, the Columbia and Klamath Rivers have become magnets for data centers for both water supply and hydroelectric power. Other than the Thames River, the UK doesn’t have these massive bodies of water. Still, the problem is not unique to the UK, says Alan Howard, senior analyst with Omdia. He notes that Microsoft took heat last year because it was draining the water supply of a small Arizona town of Goodyear with a new AI-oriented data center.  The city of Chandler, Arizona passed an ordinance in 2015 that restricted new water-intensive businesses from setting up shop which slowed data center development.   “I believe some data center operators just bowed out,” said Howard.

Read More »

Ireland says there will be no computation without generation

Stanish said that, in 2023, she wrote a paper that predicted “by 2028, more than 70% of multinational enterprises will alter their data center strategies due to limited energy supplies and data center moratoriums, up from only about 5% in 2023. It has been interesting watching this trend evolve as expected, with Ireland being a major force in this conversation since the boycotts against data center growth started a few years ago.” Fair, equitable, and stable electricity allocation, she said, “means that the availability of electricity for digital services is not guaranteed in the future, and I expect these policies, data center moratoriums, and regional rejections will only continue and expand moving forward.” Stanish pointed out that this trend is not just occurring in Ireland. “Many studies show that, globally, enterprises’ digital technologies are consuming energy at a faster rate than overall growth in energy supply (though, to be clear, these studies mostly assume a static position on energy efficiency of current technologies, and don’t take into account potential for nuclear or hydrogen to assuage some of these supply issues).” If taken at face value, she said, this means that a lack of resources could cause widespread electricity shortages in data centers over the next several years. To mitigate this, Stanish said, “so far, data center moratoriums and related constraints (including reduced tax incentives) have been enacted in the US (specifically Virginia and Georgia), Denmark, Singapore, and other countries, in response to concerns about the excessive energy consumption of IT, particularly regarding compute-intense AI workloads and concerns regarding an IT energy monopoly in certain regions. As a result, governments (federal, state, county, etc.) are working to ensure that consumption does not outpace capacity.” Changes needed In its report, the CRU stated, “a safe and secure supply of energy is essential

Read More »

Perspective: Can We Solve the AI Data Center Power Crisis with Microgrids?

President Trump announced a$500 billion private sector investment in the nation’s Artificial Intelligence (AI) infrastructure last month. The investment will come from The Stargate Project, a joint venture between OpenAI, SoftBank, Oracle and MGX, which intends to build 20 new AI data centers in the U.S in the next four to five years. The Stargate Project committed$100 billion for immediate deployment and construction has already begun on its first data center in Texas. At approximately a half a million square feet each, the partners say these new facilities will cement America’s leadership in AI, create jobs and stimulate economic growth. Stargate is not the only game in town, either. Microsoft is expected to invest$80 billion in AI data center development in 2025, with Google, AWS and Meta also spending big. While all this investment in AI infrastructure is certainly exciting, experts say there’s one lingering question that’s yet to be answered and it’s a big one: How are we going to power all these AI data centers? This will be one of the many questions tackled duringMicrogrid Knowledge’s annual conference, which will be held in Texas April 15-17 at the Sheraton Dallas. “Powering Data Centers: Collaborative Microgrid Solutions for a Growing Market” will be one of the key sessions on April 16. Industry experts will gather to discuss how private entities, developers and utilities can work together to deploy microgrids and distributed energy technologies that address the data center industry’s power needs. The panel will share solutions, technologies and strategies that will favorably position data centers in the energy queue. In advance of this session, we sat down with two microgrid experts to learn more about the challenges facing the data center industry and how microgrids can address the sector’s growing energy needs. We spoke with Michael Stadler, co-founder and

Read More »

Data Center Tours: Iron Mountain VA-1, Manassas, Virginia

Iron Mountain Northern Virginia Overview Iron Mountain’s Northern Virginia data centers VA-1 through VA-7 are situated on a 142-acre highly secure campus in Prince William County, Virginia. Located at 11680 Hayden Road in Manassas, Iron Mountain VA-1 spans 167,958 sq. ft. and harbors 12.4 MW of total capacity to meet colocation needs. The 36 MW VA-2 facility stands nearby. The total campus features a mixture of single and multi-tenant facilities which together provide more than 2,000,000 SF of highly efficient green colocation space for enterprises, federal agencies, service providers and hyperscale clouds.  The company notes that its Manassas campus offers tax savings compared to Ashburn and exceptional levels of energy-efficiency as well as a diverse and accessible ecosystem of cloud, network and other service providers.  Iron Mountain’s Virginia campus has 9 total planned data centers, with 5 operational facilities to date and two more data centers coming soon. VA-2 recently became the first data center in the United States to achieve DCOS Maturity Level 3.    As we continued the tour, Kinra led the way toward the break room, an area where customers can grab coffee or catch up on work. Unlike the high-end aesthetic of some other colocation providers, Iron Mountain’s approach is more practical and focused on functionality. At the secure shipping and receiving area, Kinra explained the process for handling customer equipment. “This is where our customers ship their equipment into,” he said. “They submit a ticket, send their shipments in, and we’ll take it, put it aside for them, and let them know when it’s here. Sometimes they ask us to take it to their environment, which we’ll do for them via a smart hands ticket.” Power Infrastructure and Security Measures The VA-1 campus is supported by a single substation, providing the necessary power for its growing

Read More »

Land and Expand: DPO, Microsoft, JLL and BlackChamber, Prologis, Core Scientific, Overwatch Capital

Land and Expand is a periodic feature at Data Center Frontier highlighting the latest data center development news, including new sites, land acquisitions and campus expansions. Here are some of the new and notable developments from hyperscale and colocation data center developers and operators about which we’ve been reading lately. DPO to Develop $200 Million AI Data Center in Wisconsin Rapids; Strategic Partnership with Billerud’s CWPCo Unlocks Hydroelectric Power for High-Density AI Compute Digital Power Optimization (DPO) is moving forward with plans to build a $200 million high-performance computing (HPC) data center in Wisconsin Rapids, Wisconsin. The project, designed to support up to 20 megawatts (MW) of artificial intelligence (AI) computing, leverages an innovative partnership with Consolidated Water Power Company (CWPCo), a subsidiary of global packaging leader Billerud. DPO specializes in developing and operating data centers optimized for power-dense computing. By partnering with utilities and independent power producers, DPO colocates its facilities at energy generation sites, ensuring direct access to sustainable power for AI, HPC, and blockchain computing. The company is privately held. Leveraging Power Infrastructure for Speed-to-Energization CWPCo, a regulated utility subsidiary, has operated hydroelectric generation assets since 1894, reliably serving industrial and commercial customers in Wisconsin Rapids, Biron, and Stevens Point. Parent company Billerud is a global leader in high-performance packaging materials, committed to sustainability and innovation. The company operates nine production facilities across Sweden, the USA, and Finland, employing 5,800 people in over 19 countries.  The data center will be powered by CWPCo’s renewable hydroelectric assets, tapping into the utility’s existing 32 megawatts of generation capacity. The partnership grants DPO a long-term land lease—extending up to 50 years—alongside interconnection rights to an already-energized substation and a firm, reliable power supply. “AI infrastructure is evolving at an unprecedented pace, and access to power-dense sites is critical,” said Andrew

Read More »

Data center spending to top $1 trillion by 2029 as AI transforms infrastructure

His projections account for recent advances in AI and data center efficiency, he says. For example, the open-source AI model from Chinese company DeepSeek seems to have shown that an LLM can produce very high-quality results at a very low cost with some clever architectural changes to how the models work. These improvements are likely to be quickly replicated by other AI companies. “A lot of these companies are trying to push out more efficient models,” says Fung. “There’s a lot of effort to reduce costs and to make it more efficient.” In addition, hyperscalers are designing and building their own chips, optimized for their AI workloads. Just the accelerator market alone is projected to reach $392 billion by 2029, Dell’Oro predicts. By that time, custom accelerators will outpace commercially available accelerators such as GPUs. The deployment of dedicated AI servers also has an impact on networking, power and cooling. As a result, spending on data center physical infrastructure (DCPI) will also increase, though at a more moderate pace, growing by 14% annually to $61 billion in 2029.  “DCPI deployments are a prerequisite to support AI workloads,” says Tam Dell’Oro, founder of Dell’Oro Group, in the report. The research firm raised its outlook in this area due to the fact that actual 2024 results exceeded its expectations, and demand is spreading from tier one to tier two cloud service providers. In addition, governments and tier one telecom operators are getting involved in data center expansion, making it a long-term trend.

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »