Bolmo: Byteifying the Next Generation of Language Models Paper • 2512.15586 • Published 12 days ago • 11
Bolmo: Byteifying the Next Generation of Language Models Paper • 2512.15586 • Published 12 days ago • 11
Retrofitting (Large) Language Models with Dynamic Tokenization Paper • 2411.18553 • Published Nov 27, 2024 • 2
Cross-Tokenizer Distillation via Approximate Likelihood Matching Paper • 2503.20083 • Published Mar 25 • 1