The Multiple Ticket Hypothesis: Random Sparse Subnetworks Suffice for RLVR Paper • 2602.01599 • Published Feb 2 • 1
Every Question Has Its Own Value: Reinforcement Learning with Explicit Human Values Paper • 2510.20187 • Published Oct 23, 2025 • 20
Nigeria Energy Sector Collection A collection of datasets across Nigeria's energy sector. • 35 items • Updated Oct 11, 2025 • 9
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models +1 Mar 20, 2024 • 113
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published Feb 20, 2025 • 175