Google’s Jules aims to out-code Codex in battle for the AI developer stack

Vibe coding and the growth of AI-powered coding platforms have given rise to yet another battleground among tech companies.

In December, Google released Jules, an autonomous coding agent that can fix bugs asynchronously, as an experiment. However, during Google I/O, Google announced that Jules will now be available in beta.

With the broader release of Jules, Google positions itself as a strong competitor against a rising number of AI coding assistants designed to write, check and fix code autonomously.

Josh Woodward, vice president of Google Labs, told reporters in a briefing that Jules “will be available to help developers fix bugs, create tests, consult documentation all happening in the background.”

“People are describing apps into existence,” Woodward said. “This started out as an asynchronous coding agent with the idea that, what if you created a way where you could assign tasks to this agent for the things you didn’t want to do?”

Jules will be integrated into GitHub and uses Google’s Gemini 2.5 Pro. During the public beta phase, developers can access Jules for free but with usage limits.

Asynchronous and parallel

Jules works asynchronously, allowing developers to assign it a task while they work separately on something else. It runs tasks inside a virtual machine, shows tasks and their reasoning and even offers audio summaries.

But Jules is not the only asynchronous and parallel task coding agent around, nor is it the only one announced in May.

OpenAI surprised the industry by releasing a research preview of its coding agent Codex, after rumors circulated that the company would buy the coding startup Windsurf. Codex began life as a coding model but has since transformed into a coding agent able to write, fix bugs, and answer codebase questions in a separate sandbox.

Codex was also behind one of the first code completion assistants, GitHub Copilot. During Microsoft Build this week, GitHub announced GitHub Copilot Agent, which does much of the same asynchronous work as Codex and Jules.

The emerging arms race around coding agents is drawing interest on social media, even before Jules and Codex are fully released to the public.

Yeah, I think Jules beats Codex by a lot. Only tested on a my lazy prompt so far “Analyze the project and write unit tests to cover 100%”.
– Jules plans first and creates its own tasks. Codex does not. That’s major.
– Jules VMs have internet pic.twitter.com/DCGPKwiNiP
— Daniel Nakov (@dnak0v) May 19, 2025

@Google ai agent Jules just made her first contribution to a project I’m working on
Feedback: I really wish there was a way where I could select files or directories where I would want the AI to focus on pic.twitter.com/z5yMaF2ERb
— Nicolas (@NicolasSerna314) May 20, 2025

Seems like Coding agents that can submit PRs are the new shiny objects. Codex from OpenAI, Copilot coding agent from GitHub/Microsoft, Jules from Google, Claude and xAI when?
— Samuel (@SamuelSurfboard) May 20, 2025

These more autonomous coding platforms follow the growth of “vibe coding,” where code and applications are generated mostly through prompting rather than hard coding written by humans. The entrance of Big Tech companies like Google and OpenAI into this arena brings coding agents even more to the forefront of the AI arms race.

More AI-powered code

Even inside Google, Jules is not the only AI coding platform to build applications. Google offers Code Assist, AI Studio, Jules and Firebase.

Firebase Studio, announced in April, allows non-coders to build applications and add AI features. Google updated the platform, adding a new AI Workspace for Firebase Studio and Firebase AI Logic for monitoring AI usage.

Firebase Studio is powered by Gemini 2.5 Pro, so that people can build more sophisticated applications. Firebase AI Logic offers developers the means to add features to the app’s backend, like authentication and identity. It also allows people to check token usage or resolve latency issues without needing a third-party orchestration program.

Jeanine Banks, vice president and general manager for Developer X and head of Developer Relations at Google, told VentureBeat that Firebase differentiates itself from Jules and other Google coding products by being the first place people new to coding can experiment with making their own AI applications.

“Google offers many wonderful tools to help you with specialized parts of your stack. So, for example, you can use Google AI Studio, which helps in experimenting with your AI inference to figure out the best optimized prompts,” Banks said. “But Firebase is the single place that integrates all of those things together, and it’s a single place for full-stack developers and professionals, but also creators who are vibe coding.”


Stay Ahead

Explore More Insights

Stay ahead with more perspectives on cutting-edge power, infrastructure, energy, bitcoin and AI solutions. Explore these articles to uncover strategies and insights shaping the future of industries.

Broadcom resets private cloud strategy with VMware Cloud Foundation 9.0

“Private cloud is not a location, but an operating model for our customers. We’re resetting the bar on what a good private cloud platform needs to look like in the industry. We’re catering to two key personas: the cloud admins who build and operate infrastructure, and the developers who need

Essential commands for Linux server management

Any Linux systems administrator needs to be proficient with a wide range of commands for user management, file handling, system monitoring, networking, security and more. This article covers a range of commands that are essential for managing a Linux server. Keep in mind that some commands will depend on the

Multicloud explained: Why it pays to diversify your cloud strategy

Flexibility. While most cloud vendors pitch themselves as a total cloud solution, the truth is that each major offering has strengths and weaknesses, and companies may not want to commit to one vendor if they have multiple cloud use cases. For instance, an organization might use Microsoft’s Azure cloud for its

Understanding how data fabric enhances data security and governance

“If security is already inconsistent across hybrid or multi-cloud setups, teams will subsequently struggle to get their data fabric architecture as secure as it needs to be,” Inamdar said. How data fabric enhances security While there are some challenges, the reason why so many organizations choose to deploy data fabric

Energy Secretary Wright Testifies Before Senate Energy and Natural Resources Committee on FY2026 Budget Request

WASHINGTON— U.S. Secretary of Energy Chris Wright testified today before the U.S. Senate Committee on Energy and Natural Resources on the Department of Energy’s Fiscal Year 2026 budget request. Earlier this month, Secretary Wright testified before the U.S. House Energy Subcommittee to outline the department’s FY2026 request. He also appeared last month before both the U.S. Senate and U.S. House Appropriations Subcommittees on Energy and Water Development to outline department priorities and provide a comprehensive overview of the budget. The FY2026 Budget delivers on President Trump’s directive to restore American energy dominance, unleash every American energy advantage, and bring commonsense back to Washington. It returns non-defense discretionary spending to the most disciplined levels since 2017 and redirects over $15 billion away from the Green New Scam— a reckless Biden-era agenda that drives up costs, weakens reliability, and undermines U.S. energy strength. The department remains committed to being responsible stewards of American taxpayer dollars while protecting the affordable, abundant, and reliable energy our nation depends on. For more details, view the budget toplines here. Secretary Wright’s opening remarks: Thank you, Chairman Lee, Ranking Member Heinrich, and Members of the Committee, it is an honor to appear before you today as Secretary of Energy to discuss the President’s Fiscal Year 2026 Budget request for the Department of Energy. Under President Trump’s leadership, our priorities for the Department are clear—to achieve American energy dominance, bolster our national security, meet our Cold War legacy cleanup commitments and unleash historic innovation, including AI, for our nation and world. We are driven by a bedrock conviction that an affordable, reliable, secure energy supply is the foundation of a strong and prosperous nation. When America leads in energy, we lead in prosperity, security and human flourishing. 
We are committed to advancing our critical missions while cutting red

Energy Department Announces New Pathway to Test Advanced Reactors

WASHINGTON— The U.S. Department of Energy (DOE) today announced the start of a new pilot program to expedite the testing of advanced nuclear reactor designs under DOE authority outside of the national laboratories. In accordance with President Trump’s Executive Order, Reforming Nuclear Reactor Testing at the Department of Energy, DOE issued a Request for Application (RFA) and is seeking qualified U.S. reactor companies interested in constructing and operating their test reactors outside of the national laboratories using the DOE authorization process. Today’s action represents an important step toward streamlining nuclear reactor testing and ensuring at least three reactors achieve criticality by July 4, 2026. “For too long, the federal government has stymied the development and deployment of advanced civil nuclear reactors in the United States,” said Energy Secretary Chris Wright. “Thanks to President Trump’s leadership, we are expediting the development of next-generation nuclear technologies and giving American innovators a new path forward to advance their designs, propelling our economic prosperity and bolstering our national security.” President Trump is committed to re-establishing the United States as a global leader in nuclear energy and securing a reliable, diversified, and affordable energy supply to drive American prosperity and technological advancement. The new reactor pilot program will help to unleash American nuclear energy capabilities, support U.S. jobs and strengthen American innovation. The pilot program builds on current efforts to demonstrate advanced reactors on DOE sites through microreactor testbeds and other projects led by the Department of Defense and private industry. It is specifically designed to foster research and development of nuclear reactors and not demonstrate reactors for commercial suitability. 
Seeking DOE authorization provided under the Atomic Energy Act will help unlock private funding and provide a fast-tracked approach to enable future commercial licensing activities for potential applicants. DOE will consider advanced reactors that have

Middle East Tensions Keep Oil Volatile

Oil edged higher after a volatile session as President Donald Trump fanned speculation that the US may join the Middle East conflict. West Texas Intermediate rose 0.4% to settle above $75 a barrel, the highest closing price since January. Markets swung between gains and losses in a $3 range amid sharp reactions to developments in the Israel-Iran conflict. “Implied volatility continues to climb, signaling that underlying market anxiety remains elevated — even if that’s not fully reflected in price action,” said Rebecca Babin, a senior energy trader at CIBC Private Wealth Group. Trump said Iran squandered the chance to make a deal over its nuclear enrichment, but declined to say whether the US plans to join Israel’s offensive aimed at destroying the program. “I may do it. I may not do it,” Trump told reporters Wednesday at the White House when asked if he is moving closer to bombing Iran. “I mean, nobody knows what I’m going to do.” Earlier, Iran Supreme Leader Ayatollah Ali Khamenei said his country won’t surrender to Israel after Trump called for the Islamic Republic’s capitulation as the conflict enters its fifth day. Trump had demanded Iran’s “UNCONDITIONAL SURRENDER” and warned of a possible strike against its leader in social media posts Tuesday. The US is also moving more military assets into the region, including the USS Nimitz aircraft carrier strike group, which is sailing there ahead of schedule. The oil market’s main concerns are flows from Iran and the threat to vessel traffic in the Strait of Hormuz, through which about a quarter of the world’s crude shipments flow. Early data from TankerTrackers.com Inc. show that Iran increased its exports significantly since the attacks began, and there has been no major disruption to the strait. The risks to prices have permeated the oil derivatives

Groups appeal DOE ‘emergency’ order keeping Michigan plant online

The Department of Energy’s May order directing Consumers Energy to delay retiring a 1,560-MW, coal-fired power plant in Michigan was illegal and based on a nonexistent emergency, Earthjustice and other groups said in a rehearing request filed on Wednesday at DOE. “The order represents an effort to replace the market- and state-led planning process provided by statute with an ill-advised and misinformed exercise in federal command-and-control,” the public interest groups said in their request that DOE rescind its decision. If DOE does not respond to the request within a required 30 days, the groups plan to challenge the 90-day emergency order in court, they said. The Federal Power Act’s section 202(c) gives the DOE secretary the authority to temporarily order power plants to operate during wars and emergencies. In a May 23 order, DOE Secretary Chris Wright said parts of the Midwest faced an “energy emergency” and that Consumers’ J.H. Campbell power plant in West Olive, Michigan, should run until Aug. 21, past its planned May 31 shutdown. DOE cited two reports in finding that an emergency exists in MISO: NERC’s 2025 Summer Reliability Assessment issued on May 14 and the grid operator’s capacity auction results released in late April. Those reports fail to show Michigan faces an emergency that requires the Campbell power plant to keep running, according to the public interest groups. MISO and the Michigan Public Service Commission have said the state and broader Midcontinent system have adequate power supplies this summer, the groups said. Also, MISO and the PSC approved retiring the Campbell power plant, and Consumers acquired replacement power supplies, they noted. “MISO has made clear time and again that the vast region over which it has balancing authority is resource adequate for summer 2025,” the groups said. “This means that MISO is not facing

Canada Business Group Pushes for Pipeline Expansion in Mexico

Canadian companies could help Mexico reduce its lopsided dependency on imported US natural gas by maximizing its own domestic supplies of the fuel, according to the head of Canada’s top business group. While Mexico has for decades been a major oil producer, local natural gas output has failed to keep up with demand as it instead favored imports from US suppliers, mostly across the border in Texas. But Canadian firms see opportunities to increase investment in Mexican energy, said Business Council of Canada CEO Goldy Hyder after meeting with President Claudia Sheinbaum. Executives from major pipeline builders ATCO Ltd and TC Energy Corp were present at the sit-down with the Mexican president at the Group of Seven summit in Kananaskis, Canada. “There was a general feeling that it’s in Mexico’s interest to diversify more its sources of energy. It’s dependent on natural gas in the United States. And so obviously Canada can be very helpful in that regard,” Hyder said. “We have projects that are already taking place there that are going to allow Mexicans to have energy security because the gas is in Mexico and it’s being extracted.” State-owned Petroleos Mexicanos, known locally as Pemex, has struggled for years to boost its natural gas output at home. But due to a growing network of pipelines, Mexico’s reliance on gas from Texas sharply scaled up beginning around 15 years ago as US shale projects took off. More than 70% of the Latin American economy’s demand for the fuel is now satisfied via cross-border imports. Sheinbaum met with the business council on Monday prior to her meeting with Canadian Prime Minister Mark Carney on Tuesday. US President Donald Trump’s unexpected return to Washington late on Monday led to the cancelation of what would have been his first in-person meeting with Sheinbaum.

US electric vehicle sales are slowing amid policy shifts: BNEF

Sales are still growing, but policy changes in the United States are significantly slowing the country’s adoption of electric vehicles, BloombergNEF said Wednesday in its annual global outlook for the sector. This “is the first year where we have reduced both our near-term and long-term passenger EV adoption outlook,” BNEF said in its 2025 Electric Vehicles Outlook. “Policy changes in the US are the biggest factor, with national fuel-economy targets being rolled back, supportive elements of the Inflation Reduction Act either being removed or under threat, and the potential removal of California’s ability to set its own air quality standards.” Electric vehicles set global sales records last year, and adoption rapidly increased in emerging markets across Asia and Latin America, according to Colin McKerracher, lead author of the report and BNEF’s head of clean transport and energy storage. “Despite these positive tailwinds, we see slower EV adoption in the short and long-term due in large part to the changing landscape in the US,” McKerracher said in a statement. “This shift in global adoption will also have major impacts on the battery industry, leading to overcapacity in manufacturing.” BNEF now expects passenger EV sales in the United States to rise from 1.6 million this year to 4.1 million in 2030, to make up 27% of total passenger car sales by the end of the decade. In last year’s report, the firm had anticipated EVs would make up 48% of sales by that time. “It results in cumulative EV sales between now and 2030 being 14 million units lower,” according to the report. The impact on battery supply is significant, according to BNEF. The firm’s global battery demand outlook between 2025 and 2035 “fell 8% compared to last year’s, equating to 3.4 [TWh] fewer batteries — a majority of which (2.8 TWh) can be attributed

Can Intel cut its way to profit with factory layoffs?

Matt Kimball, principal analyst at Moor Insights & Strategy, said, “While I’m sure tariffs have some impact on Intel’s layoffs, this is actually pretty simple — these layoffs are largely due to the financial challenges Intel is facing in terms of declining revenues.” The move, he said, “aligns with what the company had announced some time back, to bring expenses in line with revenues. While it is painful, I am confident that Intel will be able to meet these demands, as being able to produce quality chips in a timely fashion is critical to their comeback in the market.” Intel, said Kimball, “started its turnaround a few years back when ex-CEO Pat Gelsinger announced its five nodes in four years plan. While this was an impressive vision to articulate, its purpose was to rebuild trust with customers, and to rebuild an execution discipline. I think the company has largely succeeded, but of course the results trail a bit.” Asked if a combination of layoffs and the moving around of jobs will affect the cost of importing chips, Kimball predicted it will likely not have an impact: “Intel (like any responsible company) is extremely focused on cost and supply chain management. They have this down to a science and it is so critical to margins. Also, while I don’t have insights, I would expect Intel is employing AI and/or analytics to help drive supply chain and manufacturing optimization.” The company’s number one job, he said, “is to deliver the highest quality chips to its customers — from the client to the data center. I have every confidence it will not put this mandate at risk as it considers where/how to make the appropriate resourcing decisions. I think everybody who has been through corporate restructuring (I’ve been through too many to count)

Intel appears stuck between ‘a rock and a hard place’

Intel, said Kimball, “started its turnaround a few years back when ex-CEO Pat Gelsinger announced its five nodes in four years plan. While this was an impressive vision to articulate, its purpose was to rebuild trust with customers, and to rebuild an execution discipline. I think the company has largely succeeded, but of course the results trail a bit.” Asked if a combination of layoffs and the moving around of jobs will affect the cost of importing chips, Kimball predicted it will likely not have an impact: “Intel (like any responsible company) is extremely focused on cost and supply chain management. They have this down to a science and it is so critical to margins. Also, while I don’t have insights, I would expect Intel is employing AI and/or analytics to help drive supply chain and manufacturing optimization.” The company’s number one job, he said, “is to deliver the highest quality chips to its customers — from the client to the data center. I have every confidence it will not put this mandate at risk as it considers where/how to make the appropriate resourcing decisions. I think everybody who has been through corporate restructuring (I’ve been through too many to count) realizes that, when planning for these, ensuring the resilience of these mission critical functions is priority one.” Added Bickley, “trimming the workforce, delaying construction of the US fab plants, and flattening the decision structure of the organization are prudent moves meant to buy time in the hopes that their new chip designs and foundry processes attract new business.”

Next-gen AI chips will draw 15,000W each, redefining power, cooling, and data center design

“Dublin imposed a 2023 moratorium on new data centers, Frankfurt has no new capacity expected before 2030, and Singapore has just 7.2 MW available,” said Kasthuri Jagadeesan, Research Director at Everest Group, highlighting the dire situation. Electricity: the new bottleneck in AI RoI As AI modules push infrastructure to its limits, electricity is becoming a critical driver of return on investment. “Electricity has shifted from a line item in operational overhead to the defining factor in AI project feasibility,” Gogia noted. “Electricity costs now constitute between 40–60% of total Opex in modern AI infrastructure, both cloud and on-prem.” Enterprises are now forced to rethink deployment strategies—balancing control, compliance, and location-specific power rates. Cloud hyperscalers may gain further advantage due to better PUE, renewable access, and energy procurement models. “A single 15,000-watt module running continuously can cost up to $20,000 annually in electricity alone, excluding cooling,” said Manish Rawat, analyst at TechInsights. “That cost structure forces enterprises to evaluate location, usage models, and platform efficiency like never before.” The silicon arms race meets the power ceiling AI chip innovation is hitting new milestones, but the cost of that performance is no longer just measured in dollars or FLOPS — it’s in kilowatts. The KAIST TeraLab roadmap demonstrates that power and heat are becoming dominant factors in compute system design. The geography of AI, as several experts warn, is shifting. Power-abundant regions such as the Nordics, the Midwest US, and the Gulf states are becoming magnets for data center investments. Regions with limited grid capacity face a growing risk of becoming “AI deserts.”

Edge reality check: What we’ve learned about scaling secure, smart infrastructure

Enterprises are pushing cloud resources back to the edge after years of centralization. Even as major incumbents such as Google, Microsoft, and AWS pull more enterprise workloads into massive, centralized hyperscalers, use cases at the edge increasingly require nearby infrastructure—not a long hop to a centralized data center—to take advantage of the torrents of real-time data generated by IoT devices, sensor networks, smart vehicles, and a panoply of newly connected hardware. Not long ago, the enterprise edge was a physical one. The central data center was typically located in or very near the organization’s headquarters. When organizations sought to expand their reach, they wanted to establish secure, speedy connections to other office locations, such as branches, providing them with fast and reliable access to centralized computing resources. Vendors initially sold MPLS, WAN optimization, and SD-WAN as “branch office solutions,” after all. Lesson one: Understand your legacy before locking in your future The networking model that connects centralized cloud resources to the edge via some combination of SD-WAN, MPLS, or 4G reflects a legacy HQ-branch design. However, for use cases such as facial recognition, gaming, or video streaming, old problems are new again. Latency, middle-mile congestion, and the high cost of bandwidth all undermine these real-time edge use cases.

Cisco capitalizes on Isovalent buy, unveils new load balancer

The customer deploys the Isovalent Load Balancer control plane via automation and configures the desired number of virtual load-balancer appliances, Graf said. “The control plane automatically deploys virtual load-balancing appliances via the virtualization or Kubernetes platform. The load-balancing layer is self-healing and supports auto-scaling, which means that I can replace unhealthy instances and scale out as needed. The load balancer supports powerful L3-L7 load balancing with enterprise capabilities,” he said. Depending on the infrastructure the load balancer is deployed into, the operator will deploy the load balancer using familiar deployment methods. In a data center, this will be done using a standard virtualization automation installation such as Terraform or Ansible. In the public cloud, the load balancer is deployed as a public cloud service. In Kubernetes and OpenShift, the load balancer is deployed as a Kubernetes Deployment/Operator, Graf said. “In the future, the Isovalent Load Balancer will also be able to run on top of Cisco Nexus smart switches,” Graf said. “This means that the Isovalent Load Balancer can run in any environment, from data center, public cloud, to Kubernetes while providing a consistent load-balancing layer with a frictionless cloud-native developer experience.” Cisco has announced a variety of smart switches over the past couple of months on the vendor’s 4.8T capacity Silicon One chip. But the N9300, where Isovalent would run, includes a built-in programmable data processing unit (DPU) from AMD to offload complex data processing work and free up the switches for AI and large workload processing. For customers, the Isovalent Load Balancer provides consistent load balancing across infrastructure while being aligned with Kubernetes as the future for infrastructure. “A single load-balancing solution that can run in the data center, in public cloud, and modern Kubernetes environments. 
This removes operational complexity, lowers cost, while modernizing the load-balancing infrastructure in preparation

Oracle’s struggle with capacity meant they made the difficult but responsible decisions

IDC President Crawford Del Prete agreed, and said that Oracle senior management made the right move, despite how difficult the situation is today. “Oracle is being incredibly responsible here. They don’t want to have a lot of idle capacity. That capacity does have a shelf life,” Del Prete said. CEO Katz “is trying to be extremely precise about how much capacity she puts on.” Del Prete said that, for the moment, Oracle’s capacity situation is unique to the company, and has not been a factor with key rivals AWS, Microsoft, and Google. During the investor call, Katz said that her team “made engineering decisions that were much different from the other hyperscalers and that were better suited to the needs of enterprise customers, resulting in lower costs to them and giving them deployment flexibility.” Oracle management certainly anticipated a flurry of orders, but Katz said that she chose to not pay for expanded capacity until she saw finalized “contracted noncancelable bookings.” She pointed to a huge capex line of $9.1 billion and said, “the vast majority of our capex investments are for revenue generating equipment that is going into data centers and not for land or buildings.”

Microsoft will invest $80B in AI data centers in fiscal 2025

And Microsoft isn’t the only one that is ramping up its investments into AI-enabled data centers. Rival cloud service providers are all investing in either upgrading or opening new data centers to capture a larger chunk of business from developers and users of large language models (LLMs). In a report published in October 2024, Bloomberg Intelligence estimated that demand for generative AI would push Microsoft, AWS, Google, Oracle, Meta, and Apple would between them devote $200 billion to capex in 2025, up from $110 billion in 2023. Microsoft is one of the biggest spenders, followed closely by Google and AWS, Bloomberg Intelligence said. Its estimate of Microsoft’s capital spending on AI, at $62.4 billion for calendar 2025, is lower than Smith’s claim that the company will invest $80 billion in the fiscal year to June 30, 2025. Both figures, though, are way higher than Microsoft’s 2020 capital expenditure of “just” $17.6 billion. The majority of the increased spending is tied to cloud services and the expansion of AI infrastructure needed to provide compute capacity for OpenAI workloads. Separately, last October Amazon CEO Andy Jassy said his company planned total capex spend of $75 billion in 2024 and even more in 2025, with much of it going to AWS, its cloud computing division.

John Deere unveils more autonomous farm machines to address skill labor shortage

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Self-driving tractors might be the path to self-driving cars. John Deere has revealed a new line of autonomous machines and tech across agriculture, construction and commercial landscaping. The Moline, Illinois-based John Deere has been in business for 187 years, yet it’s been a regular as a non-tech company showing off technology at the big tech trade show in Las Vegas and is back at CES 2025 with more autonomous tractors and other vehicles. This is not something we usually cover, but John Deere has a lot of data that is interesting in the big picture of tech. The message from the company is that there aren’t enough skilled farm laborers to do the work that its customers need. It’s been a challenge for most of the last two decades, said Jahmy Hindman, CTO at John Deere, in a briefing. Much of the tech will come this fall and after that. He noted that the average farmer in the U.S. is over 58 and works 12 to 18 hours a day to grow food for us. And he said the American Farm Bureau Federation estimates there are roughly 2.4 million farm jobs that need to be filled annually; and the agricultural work force continues to shrink. (This is my hint to the anti-immigration crowd). John Deere’s autonomous 9RX Tractor. Farmers can oversee it using an app. While each of these industries experiences their own set of challenges, a commonality across all is skilled labor availability. In construction, about 80% percent of contractors struggle to find skilled labor. And in commercial landscaping, 86% of landscaping business owners can’t find labor to fill open positions, he said. “They have to figure out how to do

2025 playbook for enterprise AI success, from agents to evals

2025 is poised to be a pivotal year for enterprise AI. The past year has seen rapid innovation, and this year will see the same. This has made it more critical than ever to revisit your AI strategy to stay competitive and create value for your customers. From scaling AI agents to optimizing costs, here are the five critical areas enterprises should prioritize for their AI strategy this year. 1. Agents: the next generation of automation AI agents are no longer theoretical. In 2025, they’re indispensable tools for enterprises looking to streamline operations and enhance customer interactions. Unlike traditional software, agents powered by large language models (LLMs) can make nuanced decisions, navigate complex multi-step tasks, and integrate seamlessly with tools and APIs. At the start of 2024, agents were not ready for prime time, making frustrating mistakes like hallucinating URLs. They started getting better as frontier large language models themselves improved. “Let me put it this way,” said Sam Witteveen, cofounder of Red Dragon, a company that develops agents for companies, and that recently reviewed the 48 agents it built last year. “Interestingly, the ones that we built at the start of the year, a lot of those worked way better at the end of the year just because the models got better.” Witteveen shared this in the video podcast we filmed to discuss these five big trends in detail. Models are getting better and hallucinating less, and they’re also being trained to do agentic tasks. Another feature that the model providers are researching is a way to use the LLM as a judge, and as models get cheaper (something we’ll cover below), companies can use three or more models to

OpenAI’s red teaming innovations define new essentials for security leaders in the AI era

OpenAI has taken a more aggressive approach to red teaming than its AI competitors, demonstrating its security teams’ advanced capabilities in two areas: multi-step reinforcement and external red teaming. OpenAI recently released two papers that set a new competitive standard for improving the quality, reliability and safety of AI models in these two techniques and more. The first paper, “OpenAI’s Approach to External Red Teaming for AI Models and Systems,” reports that specialized teams outside the company have proven effective in uncovering vulnerabilities that might otherwise have made it into a released model because in-house testing techniques may have missed them. In the second paper, “Diverse and Effective Red Teaming with Auto-Generated Rewards and Multi-Step Reinforcement Learning,” OpenAI introduces an automated framework that relies on iterative reinforcement learning to generate a broad spectrum of novel, wide-ranging attacks. Going all-in on red teaming pays practical, competitive dividends It’s encouraging to see competitive intensity in red teaming growing among AI companies. When Anthropic released its AI red team guidelines in June of last year, it joined AI providers including Google, Microsoft, Nvidia, OpenAI, and even the U.S.’s National Institute of Standards and Technology (NIST), which all had released red teaming frameworks. Investing heavily in red teaming yields tangible benefits for security leaders in any organization. OpenAI’s paper on external red teaming provides a detailed analysis of how the company strives to create specialized external teams that include cybersecurity and subject matter experts. The goal is to see if knowledgeable external teams can defeat models’ security perimeters and find gaps in their security, biases and controls that prompt-based testing couldn’t find.
What makes OpenAI’s recent papers noteworthy is how well they define using human-in-the-middle
