sian cao
sonald
AI & ML interests
AI, big data, OS
Recent Activity
upvoted an article 12 days ago
Deriving the DPO Loss from First Principles
upvoted an article 14 days ago
Deriving the PPO Loss from First Principles
upvoted an article 17 days ago
From GRPO to DAPO and GSPO: What, Why, and How