Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
20
Xu Zhihao
naiweizi
Follow
didiforhugface's profile picture
1 follower
·
0 following
AI & ML interests
Trustworthy AI
Recent Activity
upvoted
a
paper
about 1 month ago
AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning
updated
a model
about 1 month ago
naiweizi/r1-qwen-7b-sft_meta
published
a model
about 1 month ago
naiweizi/r1-qwen-7b-sft_meta
View all activity
Organizations
None yet
naiweizi
's models
12
Sort: Recently updated
naiweizi/r1-qwen-7b-sft_meta
8B
•
Updated
Nov 21, 2025
•
1
naiweizi/R1-Qwen-7B-SFT-Meta
Updated
Nov 21, 2025
naiweizi/R1-Qwen-1_5B-Cold_Start-OpenR1_Math-priority
2B
•
Updated
Jul 18, 2025
•
9
naiweizi/dpo-harmless_saferlhf
Updated
Jun 18, 2025
•
4
naiweizi/mistral-dpo-helpful-vanilla-1e-4
Updated
May 6, 2025
•
3
naiweizi/mistral-dpo-harmless-vanilla-2e-4
Updated
May 6, 2025
•
2
naiweizi/test
Text Generation
•
8B
•
Updated
Apr 21, 2025
•
5
naiweizi/dpo-harmless_helpful-vanilla
Updated
Apr 14, 2025
naiweizi/dpo-harmless_helpful-rc_armo
Updated
Apr 14, 2025
naiweizi/dpo-harmless_helpful-mixed
Updated
Apr 14, 2025
naiweizi/dpo-harmless_helpful-rc_armo_mistral
Updated
Apr 14, 2025
naiweizi/qwen2.5-instruct-sft_helpsteer2
8B
•
Updated
Mar 14, 2025
•
6