Doppler-Enhanced Deep Learning: Improving Thyroid Nodule Segmentation with YOLOv5 Instance Segmentation Paper • 2512.00639 • Published 8 days ago
Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs Paper • 2511.17220 • Published 17 days ago • 16
TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval Paper • 2511.16528 • Published 17 days ago • 16
Mask-to-Height: A YOLOv11-Based Architecture for Joint Building Instance Segmentation and Height Classification from Satellite Imagery Paper • 2510.27224 • Published Oct 31 • 2
Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications Paper • 2509.17671 • Published Sep 22 • 9
EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI Paper • 2509.11648 • Published Sep 15 • 1
D-HUMOR: Dark Humor Understanding via Multimodal Open-ended Reasoning Paper • 2509.06771 • Published Sep 8 • 5
Guided Decoding and Its Critical Role in Retrieval-Augmented Generation Paper • 2509.06631 • Published Sep 8 • 10
NeurIPS 2025 E2LM Competition : Early Training Evaluation of Language Models Paper • 2506.07731 • Published Jun 9 • 2
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance Paper • 2507.22448 • Published Jul 30 • 66
Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees Paper • 2506.14606 • Published Jun 17 • 11
Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home Paper • 2501.12835 • Published Jan 22 • 4
LLM-Independent Adaptive RAG: Let the Question Speak for Itself Paper • 2505.04253 • Published May 7 • 14
Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA Paper • 2505.21115 • Published May 27 • 140
CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark Paper • 2505.16968 • Published May 22 • 40
LLM Post-Training: A Deep Dive into Reasoning Large Language Models Paper • 2502.21321 • Published Feb 28 • 1
Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions Paper • 2503.22678 • Published Mar 28 • 1
Traffic Scene Generation from Natural Language Description for Autonomous Vehicles with Large Language Model Paper • 2409.09575 • Published Sep 15, 2024 • 1
Language and Planning in Robotic Navigation: A Multilingual Evaluation of State-of-the-Art Models Paper • 2501.05478 • Published Jan 7 • 1
SALT: Singular Value Adaptation with Low-Rank Transformation Paper • 2503.16055 • Published Mar 20 • 8