PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published Oct 16 • 104
G-CUT3R: Guided 3D Reconstruction with Camera and Depth Prior Integration Paper • 2508.11379 • Published Aug 15 • 12
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success Paper • 2508.04280 • Published Aug 6 • 35
Reinforcement Learning for Long-Horizon Interactive LLM Agents Paper • 2502.01600 • Published Feb 3 • 1
Article LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! Mar 7 • 88