Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2512.17351

"Physics of Language Models" series

Physics of Language Models: Part 1, Context-Free Grammar

Paper • 2305.13673 • Published May 23, 2023 • 7
Physics of Language Models: Part 3.2, Knowledge Manipulation

Paper • 2309.14402 • Published Sep 25, 2023 • 7
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws

Paper • 2404.05405 • Published Apr 8, 2024 • 10
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction

Paper • 2309.14316 • Published Sep 25, 2023 • 8

Physics of Language Models: Part 4.2

Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 10 days ago • 22
facebook/PhysicsLM4.2__LlamaCanon-8B-Nemo-1T-lr0.003

Updated 6 days ago • 1 • 4
facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.002

Updated 6 days ago • 1 • 2
facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.003

Updated 6 days ago • 1 • 1

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published 27 days ago • 50
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29 • 140
Video Reasoning without Training

Paper • 2510.17045 • Published Oct 19 • 7
Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 269

LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios

Paper • 2509.09926 • Published Sep 12 • 13
What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning under Incomplete Knowledge

Paper • 2508.08344 • Published Aug 11
MemMamba: Rethinking Memory Patterns in State Space Model

Paper • 2510.03279 • Published Sep 28 • 72
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Paper • 2510.07499 • Published Oct 8 • 48

Physics and operators

Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation

Paper • 2507.02608 • Published Jul 3 • 21
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 10 days ago • 22

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published 6 days ago • 88
QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models

Paper • 2512.19526 • Published 7 days ago • 10
MatSpray: Fusing 2D Material World Knowledge on 3D Geometry

Paper • 2512.18314 • Published 9 days ago • 8
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 10 days ago • 22

Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 10 days ago • 22

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published Oct 15 • 31
Attention Is All You Need for KV Cache in Diffusion LLMs

Paper • 2510.14973 • Published Oct 16 • 40
BitNet Distillation

Paper • 2510.13998 • Published Oct 15 • 55
GigaBrain-0: A World Model-Powered Vision-Language-Action Model

Paper • 2510.19430 • Published Oct 22 • 49

Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities

Paper • 2507.13158 • Published Jul 17 • 23
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering

Paper • 2507.11527 • Published Jul 15 • 32
Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models

Paper • 2507.14241 • Published Jul 17 • 17
The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6, 2024 • 68

lusxvr/nanoVLM-222M

Image-Text-to-Text • 0.2B • Updated May 8 • 447 • 98
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12 • 36
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30 • 97
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23 • 88

"Physics of Language Models" series

Physics of Language Models: Part 1, Context-Free Grammar

Paper • 2305.13673 • Published May 23, 2023 • 7
Physics of Language Models: Part 3.2, Knowledge Manipulation

Paper • 2309.14402 • Published Sep 25, 2023 • 7
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws

Paper • 2404.05405 • Published Apr 8, 2024 • 10
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction

Paper • 2309.14316 • Published Sep 25, 2023 • 8

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published 6 days ago • 88
QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models

Paper • 2512.19526 • Published 7 days ago • 10
MatSpray: Fusing 2D Material World Knowledge on 3D Geometry

Paper • 2512.18314 • Published 9 days ago • 8
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 10 days ago • 22

Physics of Language Models: Part 4.2

Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 10 days ago • 22
facebook/PhysicsLM4.2__LlamaCanon-8B-Nemo-1T-lr0.003

Updated 6 days ago • 1 • 4
facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.002

Updated 6 days ago • 1 • 2
facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.003

Updated 6 days ago • 1 • 1

Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 10 days ago • 22

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published 27 days ago • 50
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29 • 140
Video Reasoning without Training

Paper • 2510.17045 • Published Oct 19 • 7
Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 269

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published Oct 15 • 31
Attention Is All You Need for KV Cache in Diffusion LLMs

Paper • 2510.14973 • Published Oct 16 • 40
BitNet Distillation

Paper • 2510.13998 • Published Oct 15 • 55
GigaBrain-0: A World Model-Powered Vision-Language-Action Model

Paper • 2510.19430 • Published Oct 22 • 49

LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios

Paper • 2509.09926 • Published Sep 12 • 13
What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning under Incomplete Knowledge

Paper • 2508.08344 • Published Aug 11
MemMamba: Rethinking Memory Patterns in State Space Model

Paper • 2510.03279 • Published Sep 28 • 72
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Paper • 2510.07499 • Published Oct 8 • 48

Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities

Paper • 2507.13158 • Published Jul 17 • 23
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering

Paper • 2507.11527 • Published Jul 15 • 32
Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models

Paper • 2507.14241 • Published Jul 17 • 17
The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6, 2024 • 68

Physics and operators

Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation

Paper • 2507.02608 • Published Jul 3 • 21
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 10 days ago • 22

lusxvr/nanoVLM-222M

Image-Text-to-Text • 0.2B • Updated May 8 • 447 • 98
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12 • 36
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30 • 97
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23 • 88

Previous
1
2
Next

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs