FINAL_Bench

community

AI & ML interests

None defined yet.

Recent Activity

SeaWolf-AI published an article about 1 hour ago

"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"

SeaWolf-AI updated a Space about 6 hours ago

FINAL-Bench/all-bench-leaderboard

SeaWolf-AI updated a Space about 8 hours ago

FINAL-Bench/Darwin-35B-A3B-Opus

View all activity

Articles

"The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge"

about 1 hour ago

Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models

🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do

MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning

Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework

Do Bubbles Form When Tens of Thousands of AIs Simulate Capitalism?

FINAL Bench: The Real Bottleneck to AGI Is Self-Correction

View all articles

Collections 1

spaces 7

PROMETHEUS v1.0 — World Model Interactive Demo

World-first embodied AI world model

Darwin 35B A3B Opus

Darwin-35B-A3B-Opus

ALL Bench Leaderboard

ALL Bench Leaderboard

WORLD MODEL Leaderboard

WORLD MODEL Bench

World Model

World-Model

Leaderboard - FINAL Bench 'Metacognitive'

Metacognitive

models 1

FINAL-Bench/Darwin-35B-A3B-Opus

Text Generation • 36B • Updated about 5 hours ago • 6

datasets 3

FINAL-Bench/World-Model

Viewer • Updated 1 day ago • 100 • 495 • 20

FINAL-Bench/ALL-Bench-Leaderboard

Viewer • Updated 21 days ago • 90 • 2.33k • 22

FINAL-Bench/Metacognitive

Viewer • Updated Feb 27 • 100 • 1.88k • 76