NYCU-RL-Bandits-Lab
rl-bandits-lab
AI & ML interests
Reinforcement learning
Organizations
None yet
models
6
Sort: Recently updated
rl-bandits-lab/AskR-Qwen2.5-VL-7B-Instruct-LoRA • Updated Apr 16
rl-bandits-lab/AskR-Qwen2.5-7B-Instruct-LoRA • Text Generation • Updated Apr 15 • 195
rl-bandits-lab/ultrafeedback_rm • 8B • Updated Jul 30, 2025 • 2
rl-bandits-lab/helpsteer_rm • 8B • Updated Jun 10, 2025 • 6
rl-bandits-lab/hhrlhf_rm • 8B • Updated May 21, 2025 • 43
rl-bandits-lab/translation_rm • 8B • Updated May 21, 2025 • 12
datasets
2
Sort: Recently updated
rl-bandits-lab/SEGALE-WMT24 • Viewer • Updated Nov 5, 2025 • 137k • 14
rl-bandits-lab/SEGALE-WMT24-Human-Eval • Viewer • Updated Nov 5, 2025 • 27k • 152