FlashHead: Accelerating Language Model Inference ~ *Efficient drop-in replacement for the classification head* about 14 hours ago
Geometric Memory II: Sequence Reconstruction, Diffusion Integration, and the Numerical Topology of Alignment about 14 hours ago
Code Concepts: A Large-Scale Synthetic Dataset Generated from Programming Concept Seeds about 21 hours ago • 1
Geometric Memory: Context Extension and Cross-Model Alignment Through Pentachoron Regularization 2 days ago
🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do 2 days ago • 33
ThermoGFN-IF for Catalysis: A Protein Sequence Design Model Tuned with GFlowNets for Stable Protein Design and Kinetic-Aware Enzyme Engineering 2 days ago • 1