Aryabhata: An exam-focused language model for JEE Math Paper • 2508.08665 • Published Aug 12, 2025 • 16
view post Post 185 Reinforcement learning can sometimes lead to emergent behavior through much simpler training setups compared to large scale pre-training. I explored this idea by running a small GRPO experiment on Qwen3.5 4B, and the results were pretty exciting.Hypothesis: improving visual mathematical reasoning may also improve the model’s ability to transcribe LaTeX from images.I wrote a short breakdown of the experiment here:https://hanzlajavaid.github.io/blog/grpo-experiment-exploring-emergent-properties/ See translation 👀 1 1 + Reply
Selectivity and Shape in the Design of Forward-Forward Goodness Functions Paper • 2604.13081 • Published Apr 16
Diffutron: A Masked Diffusion Language Model for Turkish Language Paper • 2603.20466 • Published Mar 20 • 9
LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis Paper • 2603.20176 • Published Mar 20 • 11
LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis Paper • 2603.20176 • Published Mar 20 • 11
Diffutron: A Masked Diffusion Language Model for Turkish Language Paper • 2603.20466 • Published Mar 20 • 9
Dynamic Model Routing and Cascading for Efficient LLM Inference: A Survey Paper • 2603.04445 • Published 27 days ago • 5
Dynamic Model Routing and Cascading for Efficient LLM Inference: A Survey Paper • 2603.04445 • Published 27 days ago • 5
AfriNLLB: Efficient Translation Models for African Languages Paper • 2602.09373 • Published Feb 10 • 3
iFSQ: Improving FSQ for Image Generation with 1 Line of Code Paper • 2601.17124 • Published Jan 23 • 33
Masking Teacher and Reinforcing Student for Distilling Vision-Language Models Paper • 2512.22238 • Published Dec 23, 2025 • 30
Masking Teacher and Reinforcing Student for Distilling Vision-Language Models Paper • 2512.22238 • Published Dec 23, 2025 • 30
HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming Paper • 2512.21338 • Published Dec 24, 2025 • 23
In Pursuit of Pixel Supervision for Visual Pre-training Paper • 2512.15715 • Published Dec 17, 2025 • 11
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics Paper • 2512.12602 • Published Dec 14, 2025 • 44