BioMedArena: An Open-source Toolkit for Building and Evaluating Biomedical Deep Research Agents Paper • 2605.06177 • Published May 7 • 3
Measuring Epistemic Resilience of LLMs Under Misleading Medical Context Paper • 2606.12291 • Published 21 days ago • 60