Midjourney’s surprise: new research on making LLMs write more creatively

Midjourney is best known as one of the leading AI image generators — with nearly 20 million users on its Discord channel, according to third-party trackers, and presumably more atop that on its website — but its ambitions are beginning to expand.

Following the news in late summer 2024 that it was building its own computing and AI hardware, the company this week released a new research paper, written alongside machine learning experts at New York University (NYU), on training text-based large language models (LLMs) such as Meta’s open-source Llama and Mistral’s eponymous models to write more creatively.

The collaboration, documented in a new research paper published on AI code community Hugging Face, introduces two new techniques — Diversified Direct Preference Optimization (DDPO) and Diversified Odds Ratio Preference Optimization (DORPO) — designed to expand the range of possible outputs while maintaining coherence and readability.

For a company that is best known for its diffusion AI image generating models, Midjourney’s new approach to rethinking creativity in text-based LLMs shows that it is not limiting its ambitions to visuals, and that a picture may not actually be worth a thousand words.

Could a Midjourney-native LLM or fine-tuned version of an existing LLM be in the cards from the small, bootstrapped startup? I reached out to Midjourney founder David Holz but have yet to hear back.

Whether or not a first-party Midjourney LLM materializes, the implications of this new research extend beyond academic exercises and could help fuel a new wave of LLM training among enterprise AI teams, product developers, and content creators looking to improve AI-generated text.

It also shows that despite recent interest and investment among AI model providers in new multimodal and reasoning language models, there’s still a lot of juice left to be squeezed, cognitively and performance-wise, from classic Transformer-based, text-focused LLMs.

The problem: AI-generated writing collapses around homogenous outputs

In domains like fact-based Q&A or coding assistance, LLMs are expected to generate a single best response.

However, creative writing is inherently open-ended, meaning there are many valid responses to a single prompt.

In an example provided by the Midjourney researchers, given a prompt like “Write a story about a dog on the moon,” an LLM could explore multiple diverse paths, such as:

  • An astronaut’s pet dog accidentally left behind after a lunar mission.
  • A dog who finds itself in a futuristic canine space colony.
  • A stranded dog that befriends an alien species.

Despite this range of possibilities, instruction-tuned LLMs often converge on similar storylines and themes. This happens because:

  1. Post-training techniques prioritize user preference over originality, reinforcing popular but repetitive responses.
  2. Instruction tuning often smooths out variation, making models favor “safe” responses over unique ones.
  3. Existing diversity-promoting techniques (like temperature tuning) operate only at inference time, rather than being baked into the model’s learning process.

This leads to homogenized storytelling, where AI-generated creative writing feels repetitive and lacks surprise or depth.
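
To make the third point above concrete, here is a minimal sketch of what inference-time diversity tuning looks like with the Hugging Face transformers library. The model name and sampling values are illustrative, not from the paper. Knobs like temperature and top-p only reshape how tokens are sampled from an already-trained model, which is why they cannot undo preferences the model absorbed during post-training.

```python
# Minimal sketch: temperature and top-p reshape sampling from a *fixed*
# trained model; they do not change what the model has learned to prefer.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-3.1-8B"  # illustrative; any causal LM works
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)

prompt = "Write a story about a dog on the moon."
inputs = tokenizer(prompt, return_tensors="pt")

# Higher temperature flattens the token distribution, raising surface-level
# variety without teaching the model any new storylines.
outputs = model.generate(
    **inputs,
    do_sample=True,
    temperature=1.2,   # > 1.0 flattens the distribution
    top_p=0.95,        # nucleus sampling
    max_new_tokens=300,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```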

The solution: modifying post-training methods to prioritize diversity

To overcome these limitations, the researchers introduced DDPO and DORPO, two extensions of existing preference optimization methods. The core innovation in these approaches is the use of deviation—a measure of how much a response differs from others—to guide training.

Here’s how it works:

  1. During training, the model is given a writing prompt and multiple possible responses.
  2. Each response is compared to others for the same prompt, and a deviation score is calculated.
  3. Rare but high-quality responses are weighted more heavily in training, encouraging the model to learn from diverse examples.

By incorporating deviation into Direct Preference Optimization (DPO) and Odds Ratio Preference Optimization (ORPO), the model learns to produce high-quality but more varied responses.

This method ensures that AI-generated stories do not converge on a single predictable structure, but instead explore a wider range of characters, settings, and themes—just as a human writer might.
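
As a rough illustration of the idea (not the paper’s exact formulation), the sketch below derives a deviation score for each response from sentence-embedding distances, then scales a standard DPO-style pairwise loss by that score so that rare but preferred responses contribute more gradient. The embedding model is an arbitrary stand-in, and the log-probability tensors are assumed to come from an ordinary DPO training loop.

```python
# Hedged sketch of a deviation-weighted DPO ("DDPO"-style) training signal.
import torch
import torch.nn.functional as F
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # arbitrary stand-in

def deviation_scores(responses: list[str]) -> torch.Tensor:
    """Score each response by its mean embedding distance to the other
    responses written for the same prompt (higher = more unusual)."""
    emb = torch.tensor(embedder.encode(responses, normalize_embeddings=True))
    sim = emb @ emb.T                                   # pairwise cosine sim
    mean_sim = (sim.sum(dim=1) - 1.0) / (len(responses) - 1)  # drop self-sim
    return 1.0 - mean_sim

def ddpo_style_loss(policy_chosen_logps, policy_rejected_logps,
                    ref_chosen_logps, ref_rejected_logps,
                    deviation, beta=0.1):
    """Standard DPO pairwise loss, re-weighted per pair by the deviation of
    the chosen response, so diverse high-quality examples are emphasized."""
    logits = beta * ((policy_chosen_logps - ref_chosen_logps)
                     - (policy_rejected_logps - ref_rejected_logps))
    return (deviation * -F.logsigmoid(logits)).mean()
```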

What Midjourney’s researchers did to achieve this

The study involved training LLMs on creative writing tasks using a dataset from the subreddit r/WritingPrompts, a Reddit community where users post prompts and respond with short stories.

The researchers used two base models for their training:

  • Meta’s Llama-3.1-8B (an 8-billion-parameter model from the Llama 3.1 series).
  • Mistral-7B-v0.3 (a 7-billion-parameter model from Mistral AI).

Then, they took these models through the following processes:

  1. Supervised Fine-Tuning (SFT): The models were first fine-tuned using LoRA (Low-Rank Adaptation) to adjust parameters efficiently (a minimal configuration sketch follows this list).
  2. Preference Optimization:
    • DPO and ORPO were used as baselines—these standard methods focus on improving response quality based on user preference signals.
    • DDPO and DORPO were then applied, introducing deviation-based weighting to encourage more unique responses.
  3. Evaluation:
    • Automatic evaluation: Measured semantic and stylistic diversity using embedding-based techniques (sketched after the key findings below).
    • Human evaluation: Judges assessed whether outputs were diverse and engaging compared to GPT-4o and Claude 3.5.
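
As a rough sketch of step 1, here is what LoRA fine-tuning might look like with the widely used peft and trl libraries. The paper does not specify its training stack, so the dataset identifier, hyperparameters, and output path below are all illustrative placeholders.

```python
# Hedged sketch of LoRA supervised fine-tuning (recent trl/peft versions).
from datasets import load_dataset
from peft import LoraConfig
from trl import SFTConfig, SFTTrainer

# Placeholder dataset ID; substitute a prompt/story dataset built from
# r/WritingPrompts or your own corpus.
dataset = load_dataset("your-org/writing-prompts", split="train")

peft_config = LoraConfig(
    r=16,                                 # low-rank adapter dimension
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # adapt attention projections only
    task_type="CAUSAL_LM",
)

trainer = SFTTrainer(
    model="meta-llama/Llama-3.1-8B",      # one of the two base models used
    train_dataset=dataset,
    peft_config=peft_config,
    args=SFTConfig(output_dir="writing-sft", per_device_train_batch_size=2),
)
trainer.train()
```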

Key Training Findings:

  • DDPO significantly outperformed standard DPO in terms of output diversity while maintaining quality.
  • Llama-3.1-8B with DDPO achieved the best balance of quality and diversity, producing responses that were more varied than GPT-4o while maintaining coherence.
  • When dataset size was reduced, DDPO models still maintained diversity, though they required a certain number of diverse training samples to be fully effective.
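
To make the embedding-based automatic evaluation concrete, one plausible metric is the mean pairwise cosine distance between embeddings of stories generated for the same prompt; higher scores mean a more varied set of outputs. This is a hedged reconstruction, and the paper’s exact embedding model and aggregation may differ.

```python
# Hedged sketch of an embedding-based diversity metric.
from itertools import combinations
from sentence_transformers import SentenceTransformer
from sentence_transformers.util import cos_sim

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative choice

def corpus_diversity(stories: list[str]) -> float:
    """Mean pairwise cosine *distance* between story embeddings;
    higher means the set of outputs is semantically more varied."""
    emb = embedder.encode(stories, normalize_embeddings=True)
    dists = [1 - cos_sim(emb[i], emb[j]).item()
             for i, j in combinations(range(len(stories)), 2)]
    return sum(dists) / len(dists)

# Usage: generate N stories per prompt with each model, then compare scores.
print(corpus_diversity([
    "An astronaut's pet dog is accidentally left behind on the moon.",
    "A dog wakes in a futuristic canine space colony.",
    "A stranded dog befriends a curious alien species.",
]))
```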

Enterprise implications: what does it mean for those using AI to produce creative responses — such as in marketing copywriting, corporate storytelling, and film/TV/video game scripting?

For AI teams managing LLM deployment, enhancing output diversity while maintaining quality is a critical challenge. These findings have significant implications for organizations that rely on AI-generated content in applications such as:

  • Conversational AI and chatbots (ensuring varied and engaging responses).
  • Content marketing and storytelling tools (preventing repetitive AI-generated copy).
  • Game development and narrative design (creating diverse dialogue and branching storylines).

For professionals responsible for fine-tuning and deploying models in an enterprise setting, this research provides:

  • A new approach to LLM post-training that enhances creativity without sacrificing quality.
  • A practical alternative to inference-time diversity tuning (such as temperature adjustments) by integrating diversity into the learning process itself.
  • The potential to develop more engaging AI applications, from AI-assisted writing tools to virtual assistants that can adapt their responses dynamically.

For those handling AI model orchestration and automation, this research highlights:

  • The importance of tuning models at the training stage, reducing the need for post-processing adjustments at deployment.
  • A way to introduce adaptive storytelling into AI-driven applications, ensuring variability while keeping content quality high.
  • A method for making LLM outputs more human-like, which is crucial for applications requiring interactive storytelling, customer engagement, or dynamic content creation.

The future of AI-generated creative projects looks bright

The success of DDPO and DORPO demonstrates that training LLMs with diversity-focused objectives can yield significant improvements in creative writing. Some ideas include:

  1. Integrating deviation-based learning into enterprise AI models to enhance response diversity in customer-facing applications.
  2. Exploring how these methods apply to other generative tasks, such as AI-powered poetry, screenwriting, or game storytelling.
  3. Developing hybrid training approaches that balance diversity and instruction-following capabilities for AI assistants.

For those interested in applying these techniques, the researchers plan to make their code publicly available in a GitHub repository.

Whether you are fine-tuning LLMs for business applications or optimizing large-scale AI orchestration, this study provides actionable insights into how models can be more dynamic, engaging, and responsive to creative tasks.

By adopting these techniques, AI teams can move beyond rigid, formulaic outputs—building AI systems that are not only smart but also truly imaginative.
