
Don’t Let Conda Eat Your Hard Drive


If you’re an Anaconda user, you know that conda environments help you manage package dependencies, avoid compatibility conflicts, and share your projects with others. Unfortunately, they can also take over your computer’s hard drive.

I write lots of computer tutorials and to keep them organized, each has a dedicated folder structure complete with a Conda Environment. This worked great at first, but soon my computer’s performance degraded, and I noticed that my SSD was filling up. At one point I had only 13 GB free.

C: drive memory usage (by author)

Conda helps manage this problem by storing downloaded package files in a single “cache” (pkgs_dirs). When you install a package, conda checks for it in the package cache before downloading. If not found, conda will download and extract the package and link the files to the active environment. Because the cache is “shared,” different environments can use the same downloaded files without duplication.
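
To see where this cache lives on your machine and which directories conda is using, you can query conda directly. These are standard conda commands; run them from a terminal or Anaconda Prompt:

conda info
conda config --show pkgs_dirs

The “package cache” entries in the conda info output are the pkgs_dirs locations described above.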

Because conda caches every downloaded package, pkgs_dirs can grow to many gigabytes. And while conda links to shared packages in the cache, there is still a need to store some packages in the environment folder. This is mainly to avoid version conflicts, where different environments need different versions of the same dependency (a package required to run another package).

In addition, large, compiled binaries like OpenCV may require full copies in the environment’s directory, and each environment requires a copy of the Python interpreter (at 100–200 MB). All these issues can bloat conda environments to several gigabytes.
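
If you want to see which environments are taking up the most room before deciding what to prune, you can measure the size of each folder under envs. A minimal sketch for macOS/Linux; the ~/anaconda3 path is an assumption (yours may be miniconda3 or a custom install location), and on Windows you can check the folder properties in File Explorer instead:

du -sh ~/anaconda3/envs/*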

In this Quick Success Data Science project, we’ll look at some techniques for reducing the storage requirements for conda environments, including those stored in default locations and dedicated folders.


Memory Management Techniques

Below are some memory management techniques that will help you reduce conda’s storage footprint on your machine. We’ll discuss each in turn.

  1. Cache cleaning
  2. Sharing task-based environments
  3. Archiving with environment and specifications files
  4. Archiving environments with conda-pack
  5. Storing environments on an external drive
  6. Relocating the package cache
  7. Using virtual environments (venv)

1. Cleaning the Package Cache

Cleaning the package cache is the first and easiest step for freeing up disk space. Even after you delete environments, conda keeps the related package files in the cache. You can free up space by removing these unused packages along with their associated tarballs (compressed package files), logs, index caches (metadata stored by conda), and temporary files.

Conda permits an optional “dry run” to see how much space will be reclaimed. You’ll want to run this from either the terminal or Anaconda Prompt in your base environment:

conda clean --all --dry-run

To commit, run:

conda clean --all

Here’s how this looks on my machine:

Conda dry run and clean command in Anaconda Prompt (by author)

This process trimmed a healthy 6.28 GB and took several minutes to run.
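
If you’d rather not clear everything at once, conda clean also accepts narrower flags. For example, the sketch below removes only the compressed tarballs and the index cache while leaving extracted packages in place; the same dry-run preview works here too:

conda clean --tarballs --index-cache --dry-run
conda clean --tarballs --index-cache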

2. Sharing Task-based Environments

Creating a few environments for specialized tasks — like computer vision or geospatial work — is more memory efficient than using dedicated environments for each project. These environments would include basic packages plus ones for the specific task (such as OpenCV, scikit-image, and PIL for computer vision).

An advantage of this approach is that you can easily keep all the packages up to date and link the environments to multiple projects. However, this won’t work if some projects require different versions of the shared packages.
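
As a sketch, a shared computer-vision environment might be created like this. The environment name, Python version, and package list are illustrative placeholders; ipykernel is included so the environment can be registered once and then selected from JupyterLab in any project:

conda create -n cv_env python=3.11 opencv scikit-image pillow ipykernel
conda activate cv_env
python -m ipykernel install --user --name cv_env --display-name "Python (cv_env)"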

3. Archiving with Environment and Specifications Files

If you don’t have enough storage space, or you want to preserve legacy projects efficiently, consider using environment or specifications files. These small files record an environment’s contents, allowing you to rebuild it later.

Saving conda environments in this manner reduces their size on disk from gigabytes to a few kilobytes. Of course, you’ll have to recreate the environment to use it. So, you’ll want to avoid this technique if you frequently revisit projects that link to the archived environments.

NOTE: Consider using Mamba, a drop-in replacement for conda, for faster rebuilds. As the docs say, “If you know conda, you know Mamba!”

Using Environment Files: An environment file is a small file that lists all the packages and versions installed in an environment, including those installed using Python’s package installer (pip). It helps you both restore an environment and share it with others.

The environment file is written in YAML (.yml), a human-readable data-serialization format. To generate an environment file, you must activate and then export the environment. Here’s how to make a file for an environment named my_env:

 conda activate my_env
 conda env export > my_env.yml

You can give the file any valid filename, but be careful: an existing file with the same name will be overwritten.

By default, the environment file is written to the user directory. Here’s a truncated example of the file’s contents:

name: C:\Users\hanna\quick_success\fed_hikes\fed_env
channels:
  - defaults
  - conda-forge
dependencies:
  - asttokens=2.0.5=pyhd3eb1b0_0
  - backcall=0.2.0=pyhd3eb1b0_0
  - blas=1.0=mkl
  - bottleneck=1.3.4=py310h9128911_0
  - brotli=1.0.9=ha925a31_2
  - bzip2=1.0.8=he774522_0
  - ca-certificates=2022.4.26=haa95532_0
  - certifi=2022.5.18.1=py310haa95532_0
  - colorama=0.4.4=pyhd3eb1b0_0
  - cycler=0.11.0=pyhd3eb1b0_0
  - debugpy=1.5.1=py310hd77b12b_0
  - decorator=5.1.1=pyhd3eb1b0_0
  - entrypoints=0.4=py310haa95532_0

  ------SNIP------

You can now remove your conda environment and reproduce it again with this file. To remove an environment, first deactivate it and then run the remove command (where ENVNAME is the name of your environment):

conda deactivate
conda remove -n ENVNAME --all

If the conda environment exists outside of Anaconda’s default envs folder, include the directory path to the environment, like so:

conda remove -p PATH\ENVNAME --all

Note that this archiving technique will only work perfectly if you continue to use the same operating system, such as Windows or macOS. This is because solving for dependencies can introduce packages that might not be compatible across platforms.
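
If you know the file will be used on a different operating system, one hedge (a workflow suggestion, not a conda requirement) is to export only the packages you explicitly requested, or to strip the build strings, so the solver can choose platform-appropriate builds when the environment is recreated:

conda env export --from-history > my_env.yml
conda env export --no-builds > my_env.yml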

To restore a conda environment using a file, run the following, where my_env represents your conda environment name and environment.yml represents your environment file:

 conda env create -n my_env -f directory\path\to\environment.yml

You can also use the environment file to recreate the environment on your D: drive. Just provide the new path when using the file. Here’s an example:

conda env create --prefix D:\my_envs\my_new_env --file environment.yml

For more on environment files, including how to manually produce them, visit the docs.

Using Specifications Files: If you haven’t installed any packages using pip, you can use a specifications file to reproduce a conda environment on the same operating system. To create a specification file, activate an environment, such as my_env, and enter the following command:

 conda list --explicit > exp_spec_list.txt

This produces the following output, truncated for brevity:

 # This file may be used to create an environment using:
 # $ conda create --name <env> --file <this file>
 # platform: win-64
 @EXPLICIT
 https://conda.anaconda.org/conda-forge/win-64/ca-certificates-202x.xx.x-h5b45459_0.tar.bz2
 https://conda.anaconda.org/conda-forge/noarch/tzdata-202xx-he74cb21_0.tar.bz2

------snip------

Note that the --explicit flag ensures that the targeted platform is annotated in the file, in this case, # platform: win-64 in the third line.

You can now remove the environment as described in the previous section.

To re-create my_env using this text file, run the following with a proper directory path:

conda create -n my_env --file directory\path\to\exp_spec_list.txt

4. Archiving Environments with conda-pack

The conda-pack command lets you archive a conda environment before removing it. It packs the entire environment into a compressed archive with the .tar.gz extension. It’s handy for backing up, sharing, and moving environments without the need to reinstall packages.

The following commands install conda-pack and then pack an environment into an archive (where my_env represents the name of your environment); once the archive is saved, you can remove the environment as shown in Section 3:

conda install -c conda-forge conda-pack
conda pack -n my_env -o my_env.tar.gz

To restore the environment later, run this command:

mkdir my_env && tar -xzf my_env.tar.gz -C my_env
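
After extracting, conda-pack expects one more step: activate the unpacked environment and run the bundled conda-unpack script, which rewrites the hard-coded path prefixes so the environment works from its new location. The lines below assume macOS/Linux; on Windows, run .\my_env\Scripts\activate instead of the source line:

source my_env/bin/activate
conda-unpack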

This technique won’t save as much disk space as the text file option. However, you won’t need to re-download packages when restoring an environment, which means it can be used without internet access.

5. Storing Environments on an External Drive

Conda stores all environments in a default location. On Windows, this is under the …\anaconda3\envs folder. You can see these environments by running the command conda info --envs in a prompt window or terminal. Here’s how it looks on my C: drive (this is a truncated view):

Truncated view of conda environments on my C: drive (by author)

Using a Single Environments Folder: If your system supports an external or secondary drive, you can configure conda to store environments there to free up space on your primary disk. Here’s the command; you’ll need to substitute your specific path:

conda config --add envs_dirs /path/to/external/drive

If you enter a path on your D: drive, such as D:\conda_envs, conda will create new environments at this location.
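
You can confirm the setting took effect, and see every directory conda will now search for environments, with:

conda config --show envs_dirs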

This technique works well when your external drive is a fast SSD and when you’re storing packages with large dependencies, like TensorFlow. The downside is slower performance. If your OS and notebooks remain on the primary drive, you may experience some read/write latency when running Python.

In addition, some OS settings may power down idle external drives, adding a delay when they spin back up. Tools like Jupyter may struggle to locate conda environments if the drive letter changes, so you’ll want to use a fixed drive letter and ensure that the correct kernel paths are set.

Using Multiple Environment Folders: Instead of using a single envs_dirs directory for all environments, you can store each environment inside its respective project folder. This lets you store everything related to a project in one place.

Example project file structure with embedded (1.7 GB) conda environment (opencv_env) (by author)

For example, suppose you have a project on your Windows D: drive in a folder called D:\projects\geospatial. To place the project’s conda environment in this folder, loaded with ipykernel for JupyterLab, you would run:

conda create -p D:\projects\geospatial\env ipykernel

Of course, you can call env something more descriptive, like geospatial_env.
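
Because this environment lives at a path rather than under a name, you activate it by prefix; registering the kernel afterward makes it selectable in JupyterLab. Here’s a sketch reusing the example path above (the kernel name is a placeholder):

conda activate D:\projects\geospatial\env
python -m ipykernel install --user --name geospatial_env --display-name "Python (geospatial)"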

As with the previous example, environments stored on a different disk can cause performance issues.

Special Note on JupyterLab: Depending on how you launch JupyterLab, its default behavior may be to open in your user directory (such as C:\Users\your_user_name). Since its file browser is restricted to the directory from which it is launched, you won’t see directories on other drives, like D:. There are many ways to handle this, but one of the simplest is to launch JupyterLab from the D: drive.

For example, in Anaconda Prompt, type:

D:

followed by:

jupyter lab

Now, you will be able to pick from kernels on the D: drive.

For more options on changing JupyterLab’s working directory, ask an AI about “how to change Jupyter’s default working directory” or “how to create a Symlink to D: in your user folder.”
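
For instance, one commonly suggested approach is to point JupyterLab at a starting directory explicitly when you launch it. This assumes your JupyterLab version still accepts the --notebook-dir alias, which is passed through to the underlying Jupyter server, and the path is just an example:

jupyter lab --notebook-dir=D:\projects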

Moving Existing Environments: You should never manually move a conda environment, such as by cutting and pasting to a new location. This is because conda relies on internal paths and metadata that can become invalid with location changes.

Instead, you should clone existing environments to another drive. This will duplicate the environment, so you’ll need to manually remove it from its original location.

In the following example, we use the --clone flag to produce an exact copy of a C: drive environment (called my_env) on the D: drive:

conda create -p D:\new_envs\my_env --clone C:\path\to\my_env

NOTE: Consider exporting your environment to a YAML file (as described in Section 3 above) before cloning. This allows you to recreate the environment if something goes wrong with the clone procedure.

Now, when you run conda env list, you’ll see the environment listed in both the C: and D: drives. You can remove the old environment by running the following command in the base environment:

conda remove --name my_env --all -y

Again, latency issues may affect these setups if you’re working across two disks.

You may be wondering, is it better to move a conda environment using an environment (YAML) file or to use --clone? The short answer is that --clone is the best and fastest option for moving an environment to a different drive on the same machine. An environment file is best for recreating the same environment on a different machine. While the file guarantees a consistent environment across different systems, it can take much longer to run, especially with large environments.

6. Relocating the Package Cache

If your primary drive is low on space, you can move the package cache to a larger external or secondary drive using this command:

conda config --add pkgs_dirs D:\conda_pkgs

In this example, packages are now stored on the D: drive (D:\conda_pkgs) instead of the default location.
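
Before creating or updating any environments, you can confirm that conda picked up the change; the package cache entry reported by conda info should now point at the new drive:

conda config --show pkgs_dirs
conda info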

If you’re working on your primary drive and both drives are SSDs, latency issues should not be significant. However, if one of the drives is a slower HDD, you may experience slowdowns when creating or updating environments. If D: is an external drive connected by USB, you may see significant slowdowns for large environments.

You can mitigate some of these issues by keeping the package cache (pkgs_dirs) and frequently used environments on the faster SSD, and other environments on the slower HDD.

One last thing to consider is backups. Primary drives may have routine backups scheduled but secondary or external drives may not. This puts you at risk of losing all your environments.

7. Using Virtual Environments

If your project doesn’t require conda’s extensive package management system for handling heavy dependencies (like TensorFlow or GDAL), you can significantly reduce disk usage with a Python virtual environment (venv). This represents a lightweight alternative to a conda environment.

To create a venv named my_env, run the following command:

python -m venv my_env

This type of environment has a small base installation. A minimal conda environment takes up about 200 MB and includes multiple utilities, such as conda, pip, setuptools, and so on. A venv is much lighter, with a minimum install size of only 5–10 MB.

Conda also caches package tarballs in pkgs_dirs. These tarballs can grow to several GBs over time. Because venv installs packages directly into the environment, no extra copies are preserved.

In general, you’ll want to consider venv when you only need basic Python packages like NumPy, pandas, or scikit-learn. Packages for which conda is strongly recommended, like GeoPandas, should still be placed in a conda environment. If you use lots of environments, you’ll probably want to stick with conda and benefit from its package linking.
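
Continuing the my_env example above, here is a minimal sketch of activating the venv, installing a few basic packages, and registering a Jupyter kernel. The package list is only an illustration, and on macOS/Linux the activation line becomes source my_env/bin/activate:

my_env\Scripts\activate
pip install numpy pandas ipykernel
python -m ipykernel install --user --name my_env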

You can find details on how to activate and use Python virtual environments in the venv docs.


Recap

High-impact, low-disruption memory management techniques for conda environments include cleaning the package cache and storing little-used environments as YAML or text files. These methods can save many gigabytes of disk space while retaining Anaconda’s default directory structure.

Other high-impact methods include moving the package cache and/or conda environments to a secondary or external drive. This frees up space on your primary drive but may introduce latency issues, especially if the new drive is a slow HDD or uses a USB connection.

For simple environments, you can use a Python virtual environment (venv) as a lightweight alternative to conda.
