Varad Pimpalkhute

DaoistKalki

https://nightlessbaron.github.io/

AI & ML interests

Few-shot learning, generalization, multi-modality

Recent Activity

upvoted a paper about 19 hours ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 8 days ago • 83

upvoted a paper 16 days ago

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published 28 days ago • 32

liked a model 3 months ago

LLM360/K2-Think

upvoted a paper 5 months ago

Critiques of World Models

Paper • 2507.05169 • Published Jul 7 • 25

upvoted a paper 6 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

upvoted an article 9 months ago

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Article • Mar 17 • 344