AI & ML interests
None yet
Organizations
kangdawei/MMR-Sigmoid-DAPO-7B
Text Generation
• 8B • Updated • 50
kangdawei/MMR-Sigmoid-DR-GRPO-8B
Text Generation
• 8B • Updated • 1
kangdawei/MMR-Sigmoid-DAPO-8B
Text Generation
• 8B • Updated • 4
kangdawei/MMR-Sigmoid-DAPO
Text Generation
• 2B • Updated • 8
kangdawei/MMR-Sigmoid-GRPO-8B
Text Generation
• 8B • Updated • 5
• 1
kangdawei/MMR-Sigmoid-GRPO-7B
Text Generation
• 8B • Updated • 6
kangdawei/MMR-Sigmoid-DR-GRPO-7B
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 111
Text Generation
• 8B • Updated • 3
• 1
Text Generation
• 8B • Updated • 178
• 1
Text Generation
• 8B • Updated • 3
Text Generation
• 8B • Updated • 162
Text Generation
• 2B • Updated • 160
Text Generation
• 8B • Updated • 56
Text Generation
• 2B • Updated • 3
Text Generation
• 2B • Updated • 13
Text Generation
• 8B • Updated • 1
Text Generation
• 8B • Updated • 3
kangdawei/Open-RS-DR_GRPO-8B
Text Generation
• 8B • Updated • 6
• 1
Text Generation
• 8B • Updated • 1
Text Generation
• 8B • Updated • 1
Text Generation
• 8B • Updated • 1
kangdawei/Open-RS-DR_GRPO-7B
Text Generation
• 8B • Updated • 4
• 1
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 5
Text Generation
• 8B • Updated • 2
Text Generation
• 8B • Updated • 3
kangdawei/MMR-DR_GRPO-lambda-0.9
Text Generation
• 2B • Updated • 1
kangdawei/MMR-DR_GRPO-lambda-0.8
Text Generation
• 2B • Updated • 1