publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
- MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources2025
-
-