opencompass/NeedleBench
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM