Stay Ahead, Stay ONMINE

How I Became A Machine Learning Engineer (No CS Degree, No Bootcamp)

Machine learning and AI are among the most popular topics nowadays, especially within the tech space. I am fortunate enough to work and develop with these technologies every day as a machine learning engineer! In this article, I will walk you through my journey to becoming a machine learning engineer, shedding some light and advice […]

Machine learning and AI are among the most popular topics nowadays, especially within the tech space. I am fortunate enough to work and develop with these technologies every day as a machine learning engineer!

In this article, I will walk you through my journey to becoming a machine learning engineer, shedding some light and advice on how you can become one yourself!

My Background

In one of my previous articles, I extensively wrote about my journey from school to securing my first Data Science job. I recommend you check out that article, but I will summarise the key timeline here.

Pretty much everyone in my family studied some sort of STEM subject. My great-grandad was an engineer, both my grandparents studied physics, and my mum is a maths teacher.

So, my path was always paved for me.

Me at age 11

I chose to study physics at university after watching The Big Bang Theory at age 12; it’s fair to say everyone was very proud!

At school, I wasn’t dumb by any means. I was actually relatively bright, but I didn’t fully apply myself. I got decent grades, but definitely not what I was fully capable of.

I was very arrogant and thought I would do well with zero work.

I applied to top universities like Oxford and Imperial College, but given my work ethic, I was delusional thinking I had a chance. On results day, I ended up in clearing as I missed my offers. This was probably one of the saddest days of my life.

Clearing in the UK is where universities offer places to students on certain courses where they have space. It’s mainly for students who don’t have a university offer.

I was lucky enough to be offered a chance to study physics at the University of Surrey, and I went on to earn a first-class master’s degree in physics!

There is genuinely no substitute for hard work. It is a cringy cliche, but it is true!

My original plan was to do a PhD and be a full-time researcher or professor, but during my degree, I did a research year, and I just felt a career in research was not for me. Everything moved so slowly, and it didn’t seem there was much opportunity in the space.

During this time, DeepMind released their AlphaGo — The Movie documentary on YouTube, which popped up on my home feed.

From the video, I started to understand how AI worked and learn about neural networks, reinforcement learning, and deep learning. To be honest, to this day I am still not an expert in these areas.

Naturally, I dug deeper and found that a data scientist uses AI and machine learning algorithms to solve problems. I immediately wanted in and started applying for data science graduate roles.

I spent countless hours coding, taking courses, and working on projects. I applied to 300+ jobs and eventually landed my first data science graduate scheme in September 2021.

You can hear more about my journey from a podcast.

Data Science Journey

I started my career in an insurance company, where I built various supervised learning models, mainly using gradient boosted tree packages like CatBoost, XGBoost, and generalised linear models (GLMs).

I built models to predict:

  • Fraud — Did someone fraudulently make a claim to profit.
  • Risk Prices — What’s the premium we should give someone.
  • Number of Claims — How many claims will someone have.
  • Average Cost of Claim — What’s the average claim value someone will have.

I made around six models spanning the regression and classification space. I learned so much here, especially in statistics, as I worked very closely with Actuaries, so my maths knowledge was excellent.

However, due to the company’s structure and setup, it was difficult for my models to advance past the PoC stage, so I felt I lacked the “tech” side of my toolkit and understanding of how companies use machine learning in production.

After a year, my previous employer reached out to me asking if I wanted to apply to a junior data scientist role that specialises in time series forecasting and optimisation problems. I really liked the company, and after a few interviews, I was offered the job!

I worked at this company for about 2.5 years, where I became an expert in forecasting and combinatorial optimisation problems.

I developed many algorithms and deployed my models to production through AWS using software engineering best practices, such as unit testing, lower environment, shadow system, CI/CD pipelines, and much more.

Fair to say I learned a lot. 

I worked very closely with software engineers, so I picked up a lot of engineering knowledge and continued self-studying machine learning and statistics on the side.

I even earned a promotion from junior to mid-level in that time!

Transitioning To MLE

Over time, I realised the actual value of data science is using it to make live decisions. There is a good quote by Pau Labarta Bajo

ML models inside Jupyter notebooks have a business value of $0

There is no point in building a really complex and sophisticated model if it will not produce results. Seeking out that extra 0.1% accuracy by staking multiple models is often not worth it.

You are better off building something simple that you can deploy, and that will bring real financial benefit to the company.

With this in mind, I started thinking about the future of data science. In my head, there are two avenues:

  • Analytics -> You work primarily to gain insight into what the business should be doing and what it should be looking into to boost its performance.
  • Engineering -> You ship solutions (models, decision algorithms, etc.) that bring business value.

I feel the data scientist who analyses and builds PoC models will become extinct in the next few years because, as we said above, they don’t provide tangible value to a business.

That’s not to say they are entirely useless; you have to think of it from the business perspective of their return on investment. Ideally, the value you bring in should be more than your salary.

You want to say that you did “X that produced Y”, which the above two avenues allow you to do.

The engineering side was the most interesting and enjoyable for me. I genuinely enjoy coding and building stuff that benefits people, and that they can use, so naturally, that’s where I gravitated towards.

To move to the ML engineering side, I asked my line manager if I could deploy the algorithms and ML models I was building myself. I would get help from software engineers, but I would write all the production code, do my own system design, and set up the deployment process independently.

And that’s exactly what I did.

I basically became a Machine Learning Engineer. I was developing my algorithms and then shipping them to production.

I also took NeetCode’s data structures and algorithms course to improve my fundamentals of computer science and started blogging about software engineering concepts.

Coincidentally, my current employer contacted me around this time and asked if I wanted to apply for a machine learning engineer role that specialises in general ML and optimisation at their company!

Call it luck, but clearly, the universe was telling me something. After several interview rounds, I was offered the role, and I am now a fully fledged machine learning engineer!

Fortunately, a role kind of “fell to me,” but I created my own luck through up-skilling and documenting my learning. That is why I always tell people to show their work — you don’t know what may come from it.

My Advice

I want to share the main bits of advice that helped me transition from a machine learning engineer to a data scientist.

  • Experience — A machine learning engineer is not an entry-level position in my opinion. You need to be well-versed in data science, machine learning, software engineering, etc. You don’t need to be an expert in all of them, but have good fundamentals across the board. That’s why I recommend having a couple of years of experience as either a software engineer or data scientist and self-study other areas.
  • Production Code — If you are from data science, you must learn to write good, well-tested production code. You must know things like typing, linting, unit tests, formatting, mocking and CI/CD. It’s not too difficult, but it just requires some practice. I recommend asking your current company to work with software engineers to gain this knowledge, it worked for me!
  • Cloud Systems — Most companies nowadays deploy many of their architecture and systems on the cloud, and machine learning models are no exception. So, it’s best to get practice with these tools and understand how they enable models to go live. I learned most of this on the job, to be honest, but there are courses you can take.
  • Command Line — I am sure most of you know this already, but every tech professional should be proficient in the command line. You will use it extensively when deploying and writing production code. I have a basic guide you can checkout here.
  • Data Structures & Algorithms — Understanding the fundamental algorithms in computer science are very useful for MLE roles. Mainly because you will likely be asked about it in interviews. It’s not too hard to learn compared to machine learning; it just takes time. Any course will do the trick.
  • Git & GitHub — Again, most tech professionals should know Git, but as an MLE, it is essential. How to squash commits, do code reviews, and write outstanding pull requests are musts.
  • Specialise — Many MLE roles I saw required you to have some specialisation in a particular area. I specialise in time series forecasting, optimisation, and general ML based on my previous experience. This helps you stand out in the market, and most companies are looking for specialists nowadays.

The main theme here is that I basically up-skilled my software engineering abilities. This makes sense as I already had all the math, stats, and machine learning knowledge from being a data scientist.

If I were a software engineer, the transition would likely be the reverse. This is why securing a machine learning engineer role can be quite challenging, as it requires proficiency across a wide range of skills.

Summary & Further Thoughts

I have a free newsletter, Dishing the Data, where I share weekly tips and advice as a practising data scientist. Plus, when you subscribe, you will get my FREE data science resume and short PDF version of my AI roadmap!

Connect With Me

Shape
Shape
Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy,  bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Shape

Fluent Bit vulnerabilities could enable full cloud takeover

Attackers could flood monitoring systems with false or misleading events, hide alerts in the noise, or even hijack the telemetry stream entirely, Katz said. The issue is now tracked as CVE-2025-12969 and awaits a severity valuation. Almost equally troubling are other flaws in the “tag” mechanism, which determines how the records are

Read More »

Ivory Coast Sees Oil and Gas Spurring Growth in Next 5 Years

Ivory Coast’s economic growth is poised to accelerate in the next five years as the country sees an increase in oil and gas activity, Planning and Development Minister Kaba Niale said. “We can do a much stronger growth rate in the coming five years,” Niale said in an interview at an African Development Bank conference in Rabat, Morocco’s capital, on Wednesday. A “strong increase” in production of fossil fuels will raise oil output to at least 200,000 barrels per day in the years 2027 to 2028, she said. The world’s top cocoa producer pumped 44,000 barrels a day in 2024, according to the government. Ivory Coast has been positioning itself as a major regional energy hub, attracting companies such as Eni SpA, Houston-based Vaalco Energy Inc. and Brazil’s Petrobras in the last decade. The entry of these global players stems from a government policy to partner with the private sector in areas it thinks would contribute significantly to long-term economic expansion, Patrick Achi, minister of state and special advisor to President Alassane Ouattara, said during an online press conference.  “It’s a paradigm shift where you don’t find the administration sitting there, waiting, asking you questions instead of moving the journey with you,” Achi said. Ivory Coast aims to accelerate economic growth to 7.2% by 2030, from an average of 6.5% achieved between 2021 and 2025. The target forms part of a five-year national development plan to lift the economy to upper-middle-income status.  The energy ministry forecasts that the country could be among the top five African oil producers by 2035, when crude-oil production is expected to reach at least 500,000 barrels per day and natural gas output will account for 1 million cubic feet per day. WHAT DO YOU THINK? Generated by readers, the comments included herein do not reflect the views

Read More »

Romania Ready to Impose Oversight of Cos Hit by International Sanctions

Romania will set up a mechanism to place companies at risk of being hit by international sanctions under special oversight, such as the local unit of Russian state-owned Lukoil PJSC, a cabinet member said.  Justice Minister Radu Marinescu said a draft emergency decree published Wednesday, though it doesn’t name specific companies, would affect Lukoil. The Moscow-based oil producer operates Petrotel, a refinery that processes some 50,000 barrels of crude a day and is set to come under US sanctions announced last month.  “It’s necessary to establish the legal framework for such instances,” Marinescu told Bloomberg News. The decree is written broadly, “but one particular case to which this legislation could apply is Lukoil.”  The proposal, which must be approved by Prime Minister Ilie Bolojan’s government, is designed to shield Romania’s energy sector, where uninterrupted supply is critical to avoid price spikes in a country with the highest inflation and widest budget deficit in the European Union.  Under the decree, the government in Bucharest would be empowered to appoint special administrators to manage local entities affected by sanctions triggered by Russia’s war in Ukraine. The measure can be enacted after determining that “a significant economic” fallout will occur or if the company requests it.  The plan mirrors a decision by neighboring Bulgaria to take control of Lukoil’s Neftohim refinery this month. The moves by the two eastern EU member states underscore their efforts to balance compliance with Western sanctions against Russia while trying to safeguard energy security.  Lukoil didn’t immediately respond to a request for comment.  In addition to the Petrotel refinery, Lukoil has a network of more than 300 fueling stations affected US sanctions, which will come into force next month. Petrotel is currently closed for maintenance and is the third-largest in the Black Sea country.  The draft decree may be approved as early

Read More »

Lotte and Hyundai to Merge Some Petrochem Units

Lotte Chemical and HD Hyundai Chemical agreed to combine part of their naphtha-cracking facilities as South Korea’s top petrochemicals producers respond to a prolonged oversupply and weakening margins. Lotte Chemical will split its facility at the Daesan petrochemical complex in South Chungcheong Province and fold it into HD Hyundai Chemical’s operations, according to separate statements from Lotte. The two companies had separately operated 1.1-million-ton-a-year and 850,000-ton-a-year units at the same site before Wednesday’s announcement. HD Hyundai Co., the parent company of HD Hyundai Oilbank Co. also confirmed the consolidation plan through a regulatory filing. HD Hyundai Oilbank currently owns 60% of HD Hyundai Chemical while Lotte Chemical owns the rest. South Korean plants are designed to turn naphtha — a crude oil derived product — into petrochemicals that can go into making everything from plastic bags to pipes and even paint solvents. These units are struggling to compete with big and fully integrated Chinese complexes, many of which have sprung up in the last decade. The South Korean government has been pressing for industry reform, which resulted in 10 major chemical firms pledging to curb capacity and a year-end deadline set for consolidation. The Lotte-Hyundai combination is the first restructuring under the government plan. For Korea’s chemicals industry, the Lotte–HD Hyundai deal marks a shift from incremental cost cuts to structural consolidation. For global competitors, particularly Chinese or Middle Eastern producers, it signals that South Korea is no longer sticking with the standalone, fragmented and cost-disadvantaged model. Petrochemical is one of South Korea’s major export sectors and weakening financials of a major operator became a flashpoint for the struggling industry this year. Yeochun NCC Co., one of Korea’s largest ethylene producers, faced a near default until it secured emergency financing from major shareholders.  That partly prompted the government to summon major players

Read More »

Trump Issues EO to Launch DOE Led Genesis Mission

The U.S. Department of Energy (DOE) announced, in a statement posted on its site on Monday, that U.S. President Donald Trump has issued an executive order to launch the Genesis Mission. In the statement, the DOE dubbed the Genesis Mission as “a historic national effort led by the Department of Energy” and highlighted that it “will focus on addressing three key challenges of national importance”. The DOE pointed out that these comprise “American energy dominance”, “advancing discovery science”, and “ensuring national security”.     Under a subcategory for “American energy dominance” in the statement, the DOE said the Genesis Mission “will accelerate advanced nuclear, fusion, and grid modernization using AI to provide affordable, reliable, and secure energy for Americans”. Another subcategory for “advancing discovery science” in the statement noted that, “through DOE’s investment and collaboration with industry, America is building the quantum ecosystem that will power discoveries-and industries-for decades to come”. A subcategory for “ensuring national security” in the DOE statement said the DOE “will create advanced AI technologies for national security missions, deploy systems to ensure the safety and reliability of the U.S. nuclear stockpile, and accelerate the development of defense-ready materials”.  The DOE noted in the statement that the Genesis Mission “will transform American science and innovation through the power of artificial intelligence (AI), strengthening the nation’s technological leadership and global competitiveness”. “The ambitious mission will harness the current AI and advanced computing revolution to double the productivity and impact of American science and engineering within a decade,” the DOE added. “It will deliver decisive breakthroughs to secure American energy dominance, accelerate scientific discovery, and strengthen national security,” it continued. U.S. Energy Secretary Chris Wright has designated Under Secretary for Science Darío Gil to lead the mission, the statement revealed, adding that Genesis “will mobilize the Department of Energy’s 17 National

Read More »

USA Data Center Electricity Demand Projected to Triple

U.S. data center electricity consumption is projected to more than triple from 2021, the last full year without ChatGPT, to 2030, according to data sent to Rigzone by the International Energy Agency (IEA) recently. The IEA, which described U.S. data center electricity demand figures as the closest proxy available for the energy consumption of artificial intelligence (AI), modelled that U.S. data center electricity demand came in at 120.65 Terawatt hours (TWh) in 2021, IEA data sent to Rigzone showed. In other data sent to Rigzone, which is available in the IEA’s Energy and AI report released earlier this year, the IEA forecasts that this consumption will rise to well over 400 TWh in 2030, in its base case. In data sent to Rigzone, the IEA modelled that U.S. data center electricity consumption stood at 108.41 TWh in 2020, 134.07 TWh in 2022, 154.07 TWh in 2023, and 182.61 TWh in 2024. The IEA projected in its AI report that this demand will rise to well over 200 TWh in 2025, more than 250 TWh in 2026, over 300 TWh in 2027, around 350 TWh in 2028, and a little bit under 400 TWh in 2029. A chart included in the IEA report showing electricity generation for data centers by fuel in the U.S., in the IEA’s base case, from 2020 to 2035, outlined that natural gas has had, and will continue to have, the biggest slice of the pie. According to this chart, natural gas generated around 50 TWh of electricity for data centers in the U.S. in 2020. This figure rose slightly in 2021 and 2022, and came in a little under 100 TWh in 2024, the chart showed. The IEA chart projects that this figure will come in slightly over 100 TWh this year, go well over 100

Read More »

OEUK Says UK Budget Delivers ‘Bitter Blow’ to Workers

Industry body Offshore Energies UK (OEUK) said the UK government Budget “delivers [a] bitter blow to [the] UK’s energy workers and industry”, in a statement sent to Rigzone on Wednesday. In that statement, OEUK “condemned the government’s decision in today’s [Wednesday] Budget to reject replacement of the Energy Profits Levy (EPL) in 2026”, dubbing it “a move that will cost tens of thousands of jobs, cripple investment, and undermine Scotland and the UK’s energy security”.  OEUK revealed in the statement that it will meet its 450 member companies “for urgent talks”. The industry body noted that it is also seeking an immediate meeting with UK Chancellor of the Exchequer Rachel Reeves “to explore every option to reverse this policy and prevent further economic and industrial damage”.  OEUK Chief Executive David Whitehouse said in the statement, “today, the government turned down GBP 50 billion [$66 billion] of investment for the UK and the chance to protect the jobs and industries that keep this country running”. “Instead, they’ve chosen a path that will see 1,000 jobs continue to be lost every month, more energy imports, and a contagion across supply chains and our industrial heartlands,” he added. “This is not over. We will keep pressing for change – this industry’s people, their communities, and the value of this strategic national asset are too important to dismiss. The government was warned of the dangers of inaction – they must now own the consequences and reconsider,” he continued. Whitehouse warned in the statement that the future of North Sea energy depends on investment, which he said won’t come without urgent reform of the windfall tax. “If the levy stays in place beyond 2026, projects will stall and jobs will vanish, no matter how pragmatic licensing policy becomes. Fixing this outdated tax is the key to unlocking billions in investment across the UK’s entire energy mix,” he added.  “Waiting four years for reform of this tax is too late. The North Sea continues to be

Read More »

Microsoft loses two senior AI infrastructure leaders as data center pressures mount

Microsoft did not immediately respond to a request for comment. Microsoft’s constraints Analysts say the twin departures mark a significant setback for Microsoft at a critical moment in the AI data center race, with pressure mounting from both OpenAI’s model demands and Google’s infrastructure scale. “Losing some of the best professionals working on this challenge could set Microsoft back,” said Neil Shah, partner and co-founder at Counterpoint Research. “Solving the energy wall is not trivial, and there may have been friction or strategic differences that contributed to their decision to move on, especially if they saw an opportunity to make a broader impact and do so more lucratively at a company like Nvidia.” Even so, Microsoft has the depth and ecosystem strength to continue doubling down on AI data centers, said Prabhu Ram, VP for industry research at Cybermedia Research. According to Sanchit Gogia, chief analyst at Greyhound Research, the departures come at a sensitive moment because Microsoft is trying to expand its AI infrastructure faster than physical constraints allow. “The executives who have left were central to GPU cluster design, data center engineering, energy procurement, and the experimental power and cooling approaches Microsoft has been pursuing to support dense AI workloads,” Gogia said. “Their exit coincides with pressures the company has already acknowledged publicly. GPUs are arriving faster than the company can energize the facilities that will house them, and power availability has overtaken chip availability as the real bottleneck.”

Read More »

What is Edge AI? When the cloud isn’t close enough

Many edge devices can periodically send summarized or selected inference output data back to a central system for model retraining or refinement. That feedback loop helps the model improve over time while still keeping most decisions local. And to run efficiently on constrained edge hardware, the AI model is often pre-processed by techniques such as quantization (which reduces precision), pruning (which removes redundant parameters), or knowledge distillation (which trains a smaller model to mimic a larger one). These optimizations reduce the model’s memory, compute, and power demands so it can run more easily on an edge device. What technologies make edge AI possible? The concept of the “edge” always assumes that edge devices are less computationally powerful than data centers and cloud platforms. While that remains true, overall improvements in computational hardware have made today’s edge devices much more capable than those designed just a few years ago. In fact, a whole host of technological developments have come together to make edge AI a reality. Specialized hardware acceleration. Edge devices now ship with dedicated AI-accelerators (NPUs, TPUs, GPU cores) and system-on-chip units tailored for on-device inference. For example, companies like Arm have integrated AI-acceleration libraries into standard frameworks so models can run efficiently on Arm-based CPUs. Connectivity and data architecture. Edge AI often depends on durable, low-latency links (e.g., 5G, WiFi 6, LPWAN) and architectures that move compute closer to data. Merging edge nodes, gateways, and local servers means less reliance on distant clouds. And technologies like Kubernetes can provide a consistent management plane from the data center to remote locations. Deployment, orchestration, and model lifecycle tooling. Edge AI deployments must support model-update delivery, device and fleet monitoring, versioning, rollback and secure inference — especially when orchestrated across hundreds or thousands of locations. VMware, for instance, is offering traffic management

Read More »

Networks, AI, and metaversing

Our first, conservative, view says that AI’s network impact is largely confined to the data center, to connect clusters of GPU servers and the data they use as they crunch large language models. It’s all “horizontal” traffic; one TikTok challenge would generate way more traffic in the wide area. WAN costs won’t rise for you as an enterprise, and if you’re a carrier you won’t be carrying much new, so you don’t have much service revenue upside. If you don’t host AI on premises, you can pretty much dismiss its impact on your network. Contrast that with the radical metaverse view, our third view. Metaverses and AR/VR transform AI missions, and network services, from transaction processing to event processing, because the real world is a bunch of events pushing on you. They also let you visualize the way that process control models (digital twins) relate to the real world, which is critical if the processes you’re modeling involve human workers who rely on their visual sense. Could it be that the reason Meta is willing to spend on AI, is that the most credible application of AI, and the most impactful for networks, is the metaverse concept? In any event, this model of AI, by driving the users’ experiences and activities directly, demands significant edge connectivity, so you could expect it to have a major impact on network requirements. In fact, just dipping your toes into a metaverse could require a major up-front network upgrade. Networks carry traffic. Traffic is messages. More messages, more traffic, more infrastructure, more service revenue…you get the picture. Door number one, to the AI giant future, leads to nothing much in terms of messages. Door number three, metaverses and AR/VR, leads to a message, traffic, and network revolution. I’ll bet that most enterprises would doubt

Read More »

Microsoft’s Fairwater Atlanta and the Rise of the Distributed AI Supercomputer

Microsoft’s second Fairwater data center in Atlanta isn’t just “another big GPU shed.” It represents the other half of a deliberate architectural experiment: proving that two massive AI campuses, separated by roughly 700 miles, can operate as one coherent, distributed supercomputer. The Atlanta installation is the latest expression of Microsoft’s AI-first data center design: purpose-built for training and serving frontier models rather than supporting mixed cloud workloads. It links directly to the original Fairwater campus in Wisconsin, as well as to earlier generations of Azure AI supercomputers, through a dedicated AI WAN backbone that Microsoft describes as the foundation of a “planet-scale AI superfactory.” Inside a Fairwater Site: Preparing for Multi-Site Distribution Efficient multi-site training only works if each individual site behaves as a clean, well-structured unit. Microsoft’s intra-site design is deliberately simplified so that cross-site coordination has a predictable abstraction boundary—essential for treating multiple campuses as one distributed AI system. Each Fairwater installation presents itself as a single, flat, high-regularity cluster: Up to 72 NVIDIA Blackwell GPUs per rack, using GB200 NVL72 rack-scale systems. NVLink provides the ultra-low-latency, high-bandwidth scale-up fabric within the rack, while the Spectrum-X Ethernet stack handles scale-out. Each rack delivers roughly 1.8 TB/s of GPU-to-GPU bandwidth and exposes a multi-terabyte pooled memory space addressable via NVLink—critical for large-model sharding, activation checkpointing, and parallelism strategies. Racks feed into a two-tier Ethernet scale-out network offering 800 Gbps GPU-to-GPU connectivity with very low hop counts, engineered to scale to hundreds of thousands of GPUs without encountering the classic port-count and topology constraints of traditional Clos fabrics. Microsoft confirms that the fabric relies heavily on: SONiC-based switching and a broad commodity Ethernet ecosystem to avoid vendor lock-in and accelerate architectural iteration. Custom network optimizations, such as packet trimming, packet spray, high-frequency telemetry, and advanced congestion-control mechanisms, to prevent collective

Read More »

Land & Expand: Hyperscale, AI Factory, Megascale

Land & Expand is Data Center Frontier’s periodic roundup of notable North American data center development activity, tracking the newest sites, land plays, retrofits, and hyperscale campus expansions shaping the industry’s build cycle. October delivered a steady cadence of announcements, with several megascale projects advancing from concept to commitment. The month was defined by continued momentum in OpenAI and Oracle’s Stargate initiative (now spanning multiple U.S. regions) as well as major new investments from Google, Meta, DataBank, and emerging AI cloud players accelerating high-density reuse strategies. The result is a clearer picture of how the next wave of AI-first infrastructure is taking shape across the country. Google Begins $4B West Memphis Hyperscale Buildout Google formally broke ground on its $4 billion hyperscale campus in West Memphis, Arkansas, marking the company’s first data center in the state and the anchor for a new Mid-South operational hub. The project spans just over 1,000 acres, with initial site preparation and utility coordination already underway. Google and Entergy Arkansas confirmed a 600 MW solar generation partnership, structured to add dedicated renewable supply to the regional grid. As part of the launch, Google announced a $25 million Energy Impact Fund for local community affordability programs and energy-resilience improvements—an unusually early community-benefit commitment for a first-phase hyperscale project. Cooling specifics have not yet been made public. Water sourcing—whether reclaimed, potable, or hybrid seasonal mode—remains under review, as the company finalizes environmental permits. Public filings reference a large-scale onsite water treatment facility, similar to Google’s deployments in The Dalles and Council Bluffs. Local governance documents show that prior to the October announcement, West Memphis approved a 30-year PILOT via Groot LLC (Google’s land assembly entity), with early filings referencing a typical placeholder of ~50 direct jobs. At launch, officials emphasized hundreds of full-time operations roles and thousands

Read More »

The New Digital Infrastructure Geography: Green Street’s David Guarino on AI Demand, Power Scarcity, and the Next Phase of Data Center Growth

As the global data center industry races through its most frenetic build cycle in history, one question continues to define the market’s mood: is this the peak of an AI-fueled supercycle, or the beginning of a structurally different era for digital infrastructure? For Green Street Managing Director and Head of Global Data Center and Tower Research David Guarino, the answer—based firmly on observable fundamentals—is increasingly clear. Demand remains blisteringly strong. Capital appetite is deepening. And the very definition of a “data center market” is shifting beneath the industry’s feet. In a wide-ranging discussion with Data Center Frontier, Guarino outlined why data centers continue to stand out in the commercial real estate landscape, how AI is reshaping underwriting and development models, why behind-the-meter power is quietly reorganizing the U.S. map, and what Green Street sees ahead for rents, REITs, and the next wave of hyperscale expansion. A ‘Safe’ Asset in an Uncertain CRE Landscape Among institutional investors, the post-COVID era was the moment data centers stepped decisively out of “niche” territory. Guarino notes that pandemic-era reliance on digital services crystallized a structural recognition: data centers deliver stable, predictable cash flows, anchored by the highest-credit tenants in global real estate. Hyperscalers today dominate new leasing and routinely sign 15-year (or longer) contracts, a duration largely unmatched across CRE categories. When compared with one-year apartment leases, five-year office leases, or mall anchor terms, the stability story becomes plain. “These are AAA-caliber companies signing the longest leases in the sector’s history,” Guarino said. “From a real estate point of view, that combination of tenant quality and lease duration continues to position the asset class as uniquely durable.” And development returns remain exceptional. Even without assuming endless AI growth, the math works: strong demand, rising rents, and high-credit tenants create unusually predictable performance relative to

Read More »

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs).  In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

Read More »

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

Read More »

2025 playbook for enterprise AI success, from agents to evals

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More 2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

Read More »

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find. What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle

Read More »