tkocmathla/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-beaked_mammalian_clam Text Generation • 0.5B • Updated 27 days ago • 25
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing Paper • 2509.08721 • Published Sep 10 • 660