A 136B-parameter model developed by Databricks and MosaicML, with an estimated $10M training cost:contentReference[oaicite:37]{index=37}.