Saurabh Shah
saurabh5
4 followers · 2 following
saurabh_shah2
saurabh111233212
AI & ML interests
None yet
Recent Activity
updated a model 5 days ago • allenai/Olmo-3-7B-RL-Zero-General
updated a dataset 5 days ago • allenai/Dolci-RL-Zero-General-7B
updated a dataset 5 days ago • allenai/Dolci-RL-Zero-General-7B
models: 4 (sorted by recently updated)
saurabh5/olmo_25_RL0_base_math_template • 7B • Updated Oct 1 • 1
saurabh5/olmo_25_RL0_base • 7B • Updated Sep 30 • 3
saurabh5/qwen3.2-8b-nothink • Text Generation • 8B • Updated Sep 8 • 9
saurabh5/qwen-2.5-7B-OT3 • Updated Jul 15 • 2
datasets: 151 (sorted by recently updated)
saurabh5/code_rlvr_mixture_dpo • Viewer • Updated 24 days ago • 21.3k • 96
saurabh5/too-big-stuff • Viewer • Updated 24 days ago • 214 • 35
saurabh5/hard-coded-olmo-qwen3-vl-32b-thinking-traces-hand-filtered • Viewer • Updated Nov 3 • 58 • 25
saurabh5/hard-coded-olmo-qwen3-vl-32b-thinking-traces • Viewer • Updated Oct 30 • 60 • 23
saurabh5/hard-coded-olmo-DPO-qwen3-vl-32b-thinking • Viewer • Updated Oct 29 • 168 • 19
saurabh5/hard-coded-olmo-DPO-qwen3-vl-32b-instruct • Viewer • Updated Oct 29 • 168 • 19
saurabh5/hard-coded-olmo-qwq-32b-traces • Viewer • Updated Oct 27 • 60 • 25
saurabh5/coding-agent-synth-data • Viewer • Updated Oct 21 • 8.09k • 35
saurabh5/RL0-General-Data • Viewer • Updated Oct 20 • 12.8k • 15
saurabh5/RL0-IF-Data • Viewer • Updated Oct 20 • 13.2k • 25