Nathan Lambert
natolambert
266 followers · 36 following
https://www.natolambert.com/
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Recent Activity
upvoted a collection 13 days ago: NVIDIA Nemotron v3
upvoted a collection 13 days ago: Nemotron-Post-Training-v3
liked a model 13 days ago: nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8
Organizations
natolambert's datasets (66)
Sort: Recently updated
natolambert/rlhf-library • Viewer • Updated Sep 17, 2025 • 864 • 6 • 3
natolambert/rlhf-library-Llama-3.1-Tulu-3-70B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 10
natolambert/rlhf-library-Llama-3.1-Tulu-3-70B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 25
natolambert/rlhf-library-tulu-2-dpo-7b • Viewer • Updated Sep 15, 2025 • 48 • 13
natolambert/rlhf-library-OLMo-2-0425-1B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 12
natolambert/rlhf-library-OLMo-2-0425-1B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 9
natolambert/rlhf-library-Llama-3.1-Tulu-3-8B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 13
natolambert/rlhf-library-tulu-2-7b • Viewer • Updated Sep 15, 2025 • 48 • 32
natolambert/rlhf-library-OLMo-7B-0424-Instruct-hf • Viewer • Updated Sep 15, 2025 • 48 • 11
natolambert/rlhf-library-OLMo-7B-0424-SFT-hf • Viewer • Updated Sep 15, 2025 • 48 • 12
natolambert/rlhf-library-OLMo-7B-Instruct-hf • Viewer • Updated Sep 15, 2025 • 48 • 8
natolambert/rlhf-library-OLMo-7B-SFT-hf • Viewer • Updated Sep 15, 2025 • 48 • 8
natolambert/rlhf-library-OLMo-2-0325-32B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 11
natolambert/rlhf-library-OLMo-2-0325-32B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 6
natolambert/rlhf-library-OLMo-2-1124-13B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 6
natolambert/rlhf-library-OLMo-2-1124-13B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 4
natolambert/rlhf-library-OLMo-2-1124-7B-DPO • Viewer • Updated Sep 15, 2025 • 48 • 5
natolambert/rlhf-library-Llama-3.1-Tulu-3-8B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 6
natolambert/rlhf-library-OLMo-2-1124-7B-SFT • Viewer • Updated Sep 15, 2025 • 48 • 7
natolambert/rlhf-book-prompts-v2 • Viewer • Updated Sep 14, 2025 • 16 • 5
natolambert/coconot-r1-debug-debug • Viewer • Updated Jun 30, 2025 • 10 • 6
natolambert/tulu_v3.9_wildchat_100k_english-r1 • Viewer • Updated Jun 30, 2025 • 57.4k • 6
natolambert/acecoder-r1 • Viewer • Updated Jun 29, 2025 • 63.6k • 6
natolambert/rlvr-code-data-python-r1 • Viewer • Updated Jun 29, 2025 • 80k • 14
natolambert/tulu_v3.9_wildchat_100k_english-r1-debug • Viewer • Updated Jun 29, 2025 • 9 • 2
natolambert/hardcoded-test • Viewer • Updated Jun 29, 2025 • 24 • 4
natolambert/rlvr_acecoder_filtered-r1 • Updated Jun 28, 2025 • 4
natolambert/the-algorithm-python-r1 • Viewer • Updated Jun 28, 2025 • 608 • 17
natolambert/the-algorithm-python-r1-debug • Viewer • Updated Jun 28, 2025 • 10 • 21
natolambert/GeneralThought-430K-filtered • Viewer • Updated Mar 26, 2025 • 338k • 191 • 35