Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A Survey Paper • 2502.06872 • Published Feb 8, 2025 • 8
Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents? Paper • 2605.19196 • Published 29 days ago
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published Aug 2, 2025 • 240