-
Physics of Language Models: Part 1, Context-Free Grammar
Paper • 2305.13673 • Published • 7 -
Physics of Language Models: Part 3.2, Knowledge Manipulation
Paper • 2309.14402 • Published • 7 -
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws
Paper • 2404.05405 • Published • 10 -
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
Paper • 2309.14316 • Published • 8
Collections
Discover the best community collections!
Collections including paper arxiv:2512.17351
-
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper • 2512.17351 • Published • 22 -
facebook/PhysicsLM4.2__LlamaCanon-8B-Nemo-1T-lr0.003
Updated • 1 • 4 -
facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.002
Updated • 1 • 2 -
facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.003
Updated • 1 • 1
-
Guided Self-Evolving LLMs with Minimal Human Supervision
Paper • 2512.02472 • Published • 50 -
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
Paper • 2509.25454 • Published • 140 -
Video Reasoning without Training
Paper • 2510.17045 • Published • 7 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 269
-
LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios
Paper • 2509.09926 • Published • 13 -
What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning under Incomplete Knowledge
Paper • 2508.08344 • Published -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 72 -
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper • 2510.07499 • Published • 48
-
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 88 -
QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models
Paper • 2512.19526 • Published • 10 -
MatSpray: Fusing 2D Material World Knowledge on 3D Geometry
Paper • 2512.18314 • Published • 8 -
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper • 2512.17351 • Published • 22
-
The Art of Scaling Reinforcement Learning Compute for LLMs
Paper • 2510.13786 • Published • 31 -
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper • 2510.14973 • Published • 40 -
BitNet Distillation
Paper • 2510.13998 • Published • 55 -
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
Paper • 2510.19430 • Published • 49
-
Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities
Paper • 2507.13158 • Published • 23 -
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering
Paper • 2507.11527 • Published • 32 -
Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models
Paper • 2507.14241 • Published • 17 -
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper • 2406.06608 • Published • 68
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 447 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
Physics of Language Models: Part 1, Context-Free Grammar
Paper • 2305.13673 • Published • 7 -
Physics of Language Models: Part 3.2, Knowledge Manipulation
Paper • 2309.14402 • Published • 7 -
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws
Paper • 2404.05405 • Published • 10 -
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
Paper • 2309.14316 • Published • 8
-
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 88 -
QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models
Paper • 2512.19526 • Published • 10 -
MatSpray: Fusing 2D Material World Knowledge on 3D Geometry
Paper • 2512.18314 • Published • 8 -
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper • 2512.17351 • Published • 22
-
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper • 2512.17351 • Published • 22 -
facebook/PhysicsLM4.2__LlamaCanon-8B-Nemo-1T-lr0.003
Updated • 1 • 4 -
facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.002
Updated • 1 • 2 -
facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.003
Updated • 1 • 1
-
Guided Self-Evolving LLMs with Minimal Human Supervision
Paper • 2512.02472 • Published • 50 -
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
Paper • 2509.25454 • Published • 140 -
Video Reasoning without Training
Paper • 2510.17045 • Published • 7 -
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 269
-
The Art of Scaling Reinforcement Learning Compute for LLMs
Paper • 2510.13786 • Published • 31 -
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper • 2510.14973 • Published • 40 -
BitNet Distillation
Paper • 2510.13998 • Published • 55 -
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
Paper • 2510.19430 • Published • 49
-
LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios
Paper • 2509.09926 • Published • 13 -
What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning under Incomplete Knowledge
Paper • 2508.08344 • Published -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 72 -
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper • 2510.07499 • Published • 48
-
Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities
Paper • 2507.13158 • Published • 23 -
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering
Paper • 2507.11527 • Published • 32 -
Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models
Paper • 2507.14241 • Published • 17 -
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper • 2406.06608 • Published • 68
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 447 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88