DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published 5 days ago • 141
Focusing by Contrastive Attention: Enhancing VLMs' Visual Reasoning Paper • 2509.06461 • Published Sep 8 • 19
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR Paper • 2508.14029 • Published Aug 19 • 118
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning Paper • 2507.16812 • Published Jul 22 • 63
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17 • 259
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs Paper • 2507.03253 • Published Jul 4 • 18
Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models Paper • 2503.15888 • Published Mar 20 • 1
Training a Utility-based Retriever Through Shared Context Attribution for Retrieval-Augmented Language Models Paper • 2504.00573 • Published Apr 1 • 2
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation Paper • 2503.19622 • Published Mar 25 • 31
Context-Faithful LLMs Collection Usage Instructions can be found at https://github.com/byronBBL/Context-DPO?tab=readme-ov-file#context-faithful-models • 4 items • Updated Feb 17 • 1