Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Hanqing Zhu's picture
2 4 3

Hanqing Zhu

hanqing666
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 25 days ago
The Path Not Taken: RLVR Provably Learns Off the Principals
commented on a paper 25 days ago
The Path Not Taken: RLVR Provably Learns Off the Principals
published a model 3 months ago
hanqing666/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
View all activity

Organizations

University of Texas at Austin's profile picture

upvoted a paper 25 days ago

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published 26 days ago • 31
upvoted a paper 5 months ago

Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR

Paper • 2507.15778 • Published Jul 21 • 20
upvoted a collection 8 months ago

DeepSeek-R1-Distill Quantized

Collection
18 items • Updated Feb 7 • 16
upvoted a paper 12 months ago

APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published Dec 6, 2024 • 38
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs