Hanqing Zhu

hanqing666

AI & ML interests

None yet

Recent Activity

upvoted a paper 25 days ago

The Path Not Taken: RLVR Provably Learns Off the Principals

commented on a paper 25 days ago

The Path Not Taken: RLVR Provably Learns Off the Principals

published a model 3 months ago

hanqing666/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

upvoted a paper 25 days ago

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published 26 days ago • 31

upvoted a paper 5 months ago

Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR

Paper • 2507.15778 • Published Jul 21 • 20

upvoted a collection 8 months ago

DeepSeek-R1-Distill Quantized

18 items • Updated Feb 7 • 16

upvoted a paper 12 months ago

APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published Dec 6, 2024 • 38