
How I Became A Machine Learning Engineer (No CS Degree, No Bootcamp)


Machine learning and AI are among the most popular topics nowadays, especially within the tech space. I am fortunate enough to work and develop with these technologies every day as a machine learning engineer!

In this article, I will walk you through my journey to becoming a machine learning engineer, shedding some light and advice on how you can become one yourself!

My Background

In one of my previous articles, I wrote extensively about my journey from school to securing my first data science job. I recommend checking out that article, but I will summarise the key timeline here.

Pretty much everyone in my family studied some sort of STEM subject. My great-grandad was an engineer, both my grandparents studied physics, and my mum is a maths teacher.

So, my path was always paved for me.

Me at age 11

I chose to study physics at university after watching The Big Bang Theory at age 12; it’s fair to say everyone was very proud!

At school, I wasn’t dumb by any means. I was actually relatively bright, but I didn’t fully apply myself. I got decent grades, but definitely not what I was fully capable of.

I was very arrogant and thought I would do well with zero work.

I applied to top universities like Oxford and Imperial College, but given my work ethic, I was delusional to think I had a chance. On results day, I ended up in clearing as I missed my offers. This was probably one of the saddest days of my life.

Clearing in the UK is where universities offer places to students on certain courses where they have space. It’s mainly for students who don’t have a university offer.

I was lucky enough to be offered a chance to study physics at the University of Surrey, and I went on to earn a first-class master’s degree in physics!

There is genuinely no substitute for hard work. It is a cringy cliche, but it is true!

My original plan was to do a PhD and be a full-time researcher or professor, but during my degree, I did a research year, and I just felt a career in research was not for me. Everything moved so slowly, and it didn’t seem there was much opportunity in the space.

During this time, DeepMind released their AlphaGo — The Movie documentary on YouTube, which popped up on my home feed.

From the video, I started to understand how AI worked and learn about neural networks, reinforcement learning, and deep learning. To be honest, to this day I am still not an expert in these areas.

Naturally, I dug deeper and found that a data scientist uses AI and machine learning algorithms to solve problems. I immediately wanted in and started applying for data science graduate roles.

I spent countless hours coding, taking courses, and working on projects. I applied to 300+ jobs and eventually landed my first data science graduate scheme in September 2021.

You can hear more about my journey on a podcast.

Data Science Journey

I started my career in an insurance company, where I built various supervised learning models, mainly using gradient boosted tree packages like CatBoost and XGBoost, as well as generalised linear models (GLMs).

I built models to predict:

  • Fraud — Did someone make a fraudulent claim to profit?
  • Risk Prices — What premium should we charge someone?
  • Number of Claims — How many claims will someone make?
  • Average Cost of Claim — What is the average value of someone’s claims?

I built around six models spanning regression and classification. I learned a huge amount here, especially in statistics, as I worked very closely with actuaries, which kept my maths knowledge sharp.
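
To make that concrete, here is a minimal, purely illustrative sketch of the kind of gradient boosted classifier I mean, using XGBoost's scikit-learn API on synthetic data. The features, class balance, and hyperparameters are invented for demonstration and are not my employer's actual fraud model.

```python
# Illustrative sketch only: a binary "fraud" classifier with XGBoost's
# scikit-learn API on synthetic, imbalanced data.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.metrics import roc_auc_score
from xgboost import XGBClassifier

# Synthetic stand-in for claims data: 20 numeric features, ~5% positive class.
X, y = make_classification(
    n_samples=10_000, n_features=20, weights=[0.95, 0.05], random_state=42
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

model = XGBClassifier(
    n_estimators=300,
    max_depth=4,
    learning_rate=0.05,
    scale_pos_weight=19,  # rough imbalance correction (0.95 / 0.05)
    eval_metric="auc",
)
model.fit(X_train, y_train)

probs = model.predict_proba(X_test)[:, 1]
print(f"Test ROC AUC: {roc_auc_score(y_test, probs):.3f}")
```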

However, due to the company’s structure and setup, it was difficult for my models to advance past the PoC stage, so I felt my toolkit lacked the “tech” side and an understanding of how companies use machine learning in production.

After a year, my previous employer reached out to me asking if I wanted to apply for a junior data scientist role specialising in time series forecasting and optimisation problems. I really liked the company, and after a few interviews, I was offered the job!

I worked at this company for about 2.5 years, where I became an expert in forecasting and combinatorial optimisation problems.

I developed many algorithms and deployed my models to production through AWS using software engineering best practices, such as unit testing, lower environments, shadow systems, CI/CD pipelines, and much more.
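
As a small, hypothetical flavour of what those practices look like in code, here is a typed helper and a few pytest-style unit tests. The names (clip_forecast, run_pipeline) and the scenario are invented for illustration, not taken from that codebase.

```python
# Hypothetical example: a small typed helper plus pytest-style unit tests.
# Run with `pytest`. All names and logic are invented for illustration.


def clip_forecast(values: list[float], floor: float = 0.0) -> list[float]:
    """Clip forecasted values at a lower bound (e.g. demand cannot be negative)."""
    return [max(v, floor) for v in values]


def run_pipeline(loader) -> list[float]:
    """Load raw forecasts via an injected loader, then post-process them."""
    return clip_forecast(loader())


def test_clip_forecast_replaces_negatives_with_floor() -> None:
    assert clip_forecast([-1.5, 0.0, 2.3]) == [0.0, 0.0, 2.3]


def test_clip_forecast_respects_custom_floor() -> None:
    assert clip_forecast([0.5, 3.0], floor=1.0) == [1.0, 3.0]


def test_run_pipeline_with_stubbed_loader() -> None:
    # Stub out the data source so the test needs no external dependencies.
    assert run_pipeline(lambda: [-2.0, 4.0]) == [0.0, 4.0]
```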

Fair to say I learned a lot. 

I worked very closely with software engineers, so I picked up a lot of engineering knowledge and continued self-studying machine learning and statistics on the side.

I even earned a promotion from junior to mid-level in that time!

Transitioning To MLE

Over time, I realised the actual value of data science comes from using it to make live decisions. There is a good quote by Pau Labarta Bajo:

ML models inside Jupyter notebooks have a business value of $0

There is no point in building a really complex and sophisticated model if it will not produce results. Seeking out that extra 0.1% accuracy by stacking multiple models is often not worth it.

You are better off building something simple that you can deploy, and that will bring real financial benefit to the company.

With this in mind, I started thinking about the future of data science. In my head, there are two avenues:

  • Analytics -> You work primarily to generate insights into what the business should be doing and where it should focus to boost performance.
  • Engineering -> You ship solutions (models, decision algorithms, etc.) that bring business value.

I feel data scientists who only analyse and build PoC models will become extinct in the next few years because, as mentioned above, they don’t provide tangible value to a business.

That’s not to say they are entirely useless; you have to think of it from the business’s perspective in terms of return on investment. Ideally, the value you bring in should be more than your salary.

You want to say that you did “X that produced Y”, which the above two avenues allow you to do.

The engineering side was the most interesting and enjoyable for me. I genuinely enjoy coding and building things that people can actually use and benefit from, so naturally that’s where I gravitated.

To move to the ML engineering side, I asked my line manager if I could personally deploy the algorithms and ML models I was building. I would get help from software engineers, but I would write all the production code, do my own system design, and set up the deployment process independently.

And that’s exactly what I did.

I basically became a Machine Learning Engineer. I was developing my algorithms and then shipping them to production.

I also took NeetCode’s data structures and algorithms course to improve my computer science fundamentals and started blogging about software engineering concepts.

Coincidentally, my current employer contacted me around this time and asked if I wanted to apply for a machine learning engineer role that specialises in general ML and optimisation at their company!

Call it luck, but clearly, the universe was telling me something. After several interview rounds, I was offered the role, and I am now a fully fledged machine learning engineer!

Fortunately, a role kind of “fell into my lap,” but I created my own luck through up-skilling and documenting my learning. That is why I always tell people to show their work — you don’t know what may come from it.

My Advice

I want to share the main bits of advice that helped me transition from a data scientist to a machine learning engineer.

  • Experience — A machine learning engineer is not an entry-level position in my opinion. You need to be well-versed in data science, machine learning, software engineering, etc. You don’t need to be an expert in all of them, but you should have good fundamentals across the board. That’s why I recommend getting a couple of years of experience as either a software engineer or a data scientist while self-studying the other areas.
  • Production Code — If you are coming from data science, you must learn to write good, well-tested production code. You must know things like typing, linting, unit tests, formatting, mocking and CI/CD. It’s not too difficult; it just requires some practice. I recommend asking your current company if you can work with software engineers to gain this knowledge; it worked for me!
  • Cloud Systems — Most companies nowadays deploy much of their architecture and systems in the cloud, and machine learning models are no exception. So, it’s best to get hands-on practice with these tools and understand how they enable models to go live. I learned most of this on the job, to be honest, but there are courses you can take.
  • Command Line — I am sure most of you know this already, but every tech professional should be proficient in the command line. You will use it extensively when deploying and writing production code. I have a basic guide you can check out here.
  • Data Structures & Algorithms — Understanding the fundamental algorithms in computer science is very useful for MLE roles, mainly because you will likely be asked about them in interviews. It’s not too hard to learn compared to machine learning; it just takes time. Any course will do the trick (see the short example after this list for the style of problem you will practise).
  • Git & GitHub — Again, most tech professionals should know Git, but as an MLE, it is essential. Knowing how to squash commits, do code reviews, and write outstanding pull requests is a must.
  • Specialise — Many MLE roles I saw required you to have some specialisation in a particular area. I specialise in time series forecasting, optimisation, and general ML based on my previous experience. This helps you stand out in the market, and most companies are looking for specialists nowadays.
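
As referenced in the data structures and algorithms point above, here is the style of problem those courses drill. This is a generic interview-style exercise (the classic two-sum), not a question from any specific interview of mine.

```python
# Classic interview-style exercise: find indices of two numbers that sum to a target.
# A hash map gives O(n) time versus the O(n^2) brute-force check of every pair.


def two_sum(nums: list[int], target: int) -> tuple[int, int] | None:
    """Return the indices of two numbers adding up to target, or None if no pair exists."""
    seen: dict[int, int] = {}  # value -> index where it was first seen
    for i, value in enumerate(nums):
        complement = target - value
        if complement in seen:
            return seen[complement], i
        seen[value] = i
    return None


if __name__ == "__main__":
    print(two_sum([2, 7, 11, 15], 9))  # (0, 1)
    print(two_sum([3, 2, 4], 6))       # (1, 2)
```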

The main theme here is that I basically up-skilled on the software engineering side. This makes sense, as I already had the maths, stats, and machine learning knowledge from being a data scientist.

If I were a software engineer, the transition would likely be the reverse. This is why securing a machine learning engineer role can be quite challenging, as it requires proficiency across a wide range of skills.

Summary & Further Thoughts

I have a free newsletter, Dishing the Data, where I share weekly tips and advice as a practising data scientist. Plus, when you subscribe, you will get my FREE data science resume and a short PDF version of my AI roadmap!

