arxiv:2410.16184
Zijun
TranSirius
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 months ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards upvoted a paper 5 months ago
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world
Markets? upvoted a paper 5 months ago
SIRI: Scaling Iterative Reinforcement Learning with Interleaved
Compression