e2lmc-competition (NeurIPS 2025 E2LMC competition)

MElHuseyni

authored a paper 6 days ago

Doppler-Enhanced Deep Learning: Improving Thyroid Nodule Segmentation with YOLOv5 Instance Segmentation

Paper • 2512.00639 • Published 8 days ago

MElHuseyni

authored a paper 14 days ago

Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs

Paper • 2511.17220 • Published 17 days ago • 16

MElHuseyni

authored a paper 17 days ago

TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval

Paper • 2511.16528 • Published 17 days ago • 16

MElHuseyni

authored a paper about 1 month ago

Mask-to-Height: A YOLOv11-Based Architecture for Joint Building Instance Segmentation and Height Classification from Satellite Imagery

Paper • 2510.27224 • Published Oct 31 • 2

MElHuseyni

authored a paper 3 months ago

Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications

Paper • 2509.17671 • Published Sep 22 • 9

UVSKKR

authored 2 papers 3 months ago

EthicsMH: A Pilot Benchmark for Ethical Reasoning in Mental Health AI

Paper • 2509.11648 • Published Sep 15 • 1

D-HUMOR: Dark Humor Understanding via Multimodal Open-ended Reasoning

Paper • 2509.06771 • Published Sep 8 • 5

MElHuseyni

authored a paper 3 months ago

Guided Decoding and Its Critical Role in Retrieval-Augmented Generation

Paper • 2509.06631 • Published Sep 8 • 10

ybelkada

authored 2 papers 4 months ago

NeurIPS 2025 E2LM Competition : Early Training Evaluation of Language Models

Paper • 2506.07731 • Published Jun 9 • 2

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper • 2507.22448 • Published Jul 30 • 66

Sarim-Hash

authored a paper 6 months ago

Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees

Paper • 2506.14606 • Published Jun 17 • 11

zlatamaria

authored 3 papers 6 months ago

Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home

Paper • 2501.12835 • Published Jan 22 • 4

LLM-Independent Adaptive RAG: Let the Question Speak for Itself

Paper • 2505.04253 • Published May 7 • 14

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27 • 140

Sarim-Hash

authored a paper 6 months ago

CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark

Paper • 2505.16968 • Published May 22 • 40

ItsMaxNorm

authored 2 papers 8 months ago

LLM Post-Training: A Deep Dive into Reasoning Large Language Models

Paper • 2502.21321 • Published Feb 28 • 1

Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions

Paper • 2503.22678 • Published Mar 28 • 1

Justin900

authored a paper 9 months ago

Traffic Scene Generation from Natural Language Description for Autonomous Vehicles with Large Language Model

Paper • 2409.09575 • Published Sep 15, 2024 • 1

Sarim-Hash

authored 2 papers 9 months ago

Language and Planning in Robotic Navigation: A Multilingual Evaluation of State-of-the-Art Models

Paper • 2501.05478 • Published Jan 7 • 1

SALT: Singular Value Adaptation with Low-Rank Transformation

Paper • 2503.16055 • Published Mar 20 • 8

AI & ML interests

Team members 156

e2lmc-competition's activity