arxiv:2503.24115
Peidong Wang
WDong
Models: 9
WDong/lora_06072000
Updated • 2
WDong/7B_lora_06051615
Updated • 2
WDong/7B-0428
Text Generation • 8B • Updated • 3
WDong/qwen1.5-1.8B-seed-sft
Text Generation • 2B • Updated • 4
WDong/CartPole
Reinforcement Learning • Updated
WDong/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning • Updated • 22
WDong/Taxi-v3
Reinforcement Learning • Updated
WDong/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning • Updated
WDong/ppo-LunarLander-v2
Reinforcement Learning • Updated • 3