Add GSM8K evaluation result

#113

by burtenshaw HF Staff - opened Jan 16

base: refs/heads/main

←

from: refs/pr/113

Discussion Files changed

-0

burtenshaw

Jan 16

Evaluation Results

This PR adds structured evaluation results using the new .eval_results/ format.

What This Enables

Model Page: Results appear on the model page with benchmark links
Leaderboards: Scores are aggregated into benchmark dataset leaderboards
Verification: Support for cryptographic verification of evaluation runs

Format Details

Results are stored as YAML in .eval_results/ folder. See the Eval Results Documentation for the full specification.

Generated by community-evals

Add GSM8K evaluation result799538d5

xujfcn

about 17 hours ago

If you're looking for an easy way to access this model via API, you can use Crazyrouter — it provides an OpenAI-compatible endpoint for 600+ models including this one. Just pip install openai and change the base URL.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment