Newcomer

Newcomer

DATABRICKS CEO ALI GHODSI: A Costly AI Training Foray, Sidestepping the Model Wars & the Push to Kill Tokenmaxxing Bloat

Databricks releases products to compete with ClickHouse, Splunk & Oracle

Eric Newcomer's avatar
Eric Newcomer
Jun 16, 2026
∙ Paid

Back in 2024, when Databricks released its open-source DBRX model, the data warehousing company bragged that it had spent $10 million on the training run that made that model possible.

Last week, in an expansive interview ahead of the release of a suite of new Databricks AI products, the company’s boisterous CEO, Ali Ghodsi, came clean: Databricks had actually spent about $20 million on the cluster to train the model. Only $4 million went to the training run that actually produced DBRX. The remaining roughly $16 million went to stumbling through the wilderness, downtime, and other mistakes of inexperience. They’d either underestimated or overestimated the cost depending on how you slice it.

The waste felt unavoidable.

“First of all the cluster is not running all the time — it’s idle,” Ghodsi told me. “You don’t just yolo and say hey let’s run the thing and then it comes out and it’s like after a month of training it’s like oh it was shit. So you do smaller training runs — so you work your way up to it.”

This post is for paid subscribers

Already a paid subscriber? Sign in
© 2026 Eric Newcomer · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture