- Can Large Language Models Understand Context?
  Paper • 2402.00858 • Published • 23
- OLMo: Accelerating the Science of Language Models
  Paper • 2402.00838 • Published • 85
- Self-Rewarding Language Models
  Paper • 2401.10020 • Published • 151
- SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
  Paper • 2401.17072 • Published • 25
Collections
Collections including paper arxiv:2510.26583
- LightMem: Lightweight and Efficient Memory-Augmented Generation
  Paper • 2510.18866 • Published • 110
- AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders
  Paper • 2510.19779 • Published • 60
- Emu3.5: Native Multimodal Models are World Learners
  Paper • 2510.26583 • Published • 106
- Emu3.5: Native Multimodal Models are World Learners
  Paper • 2510.26583 • Published • 106
- RECALL: REpresentation-aligned Catastrophic-forgetting ALLeviation via Hierarchical Model Merging
  Paper • 2510.20479 • Published • 11
- A Definition of AGI
  Paper • 2510.18212 • Published • 34
- Video-As-Prompt: Unified Semantic Control for Video Generation
  Paper • 2510.20888 • Published • 45
- WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent
  Paper • 2508.05748 • Published • 141
- TTT3R: 3D Reconstruction as Test-Time Training
  Paper • 2509.26645 • Published • 14
- Human3R: Everyone Everywhere All at Once
  Paper • 2510.06219 • Published • 10
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 493