D-REX: A Benchmark for Detecting Deceptive Reasoning in Large Language Models Paper • 2509.17938 • Published Sep 22, 2025 • 4
D-REX: A Benchmark for Detecting Deceptive Reasoning in Large Language Models Paper • 2509.17938 • Published Sep 22, 2025 • 4 • 2
Evaluating the Critical Risks of Amazon's Nova Premier under the Frontier Model Safety Framework Paper • 2507.06260 • Published Jul 7, 2025 • 5
Evaluating the Critical Risks of Amazon's Nova Premier under the Frontier Model Safety Framework Paper • 2507.06260 • Published Jul 7, 2025 • 5 • 1