Models That Know How Evaluations Are Designed Score Safer
Paper • 2605.28591 • Published • 8
The Ubiquitous Knowledge Processing Lab researches natural language processing, text mining, eLearning, and digital humanities.
SciCoQA: Quality Assurance for Scientific Paper–Code Alignment
PeerQA: A Scientific Question Answering Dataset from Peer Reviews