A 13B-parameter model trained on the Fugaku supercomputer using only CPUs, a milestone for non-GPU training:contentReference[oaicite:38]{index=38}.