OLMix
paperFramework for data mixing across the full LM development lifecycle. Solves the "evolving domain problem" where domains are added, removed, or revised over time. Mixture reuse matches full recomputation with 74% less compute and improves 11.6% over no-mixing baselines. Used in OLMo 3 pre-training.
Paper
arXiv: 2602.12237