PromptRL: Prompt Matters in RL for Flow-Based Image Generation Paper • 2602.01382 • Published 1 day ago • 5
FSVideo: Fast Speed Video Diffusion Model in a Highly-Compressed Latent Space Paper • 2602.02092 • Published about 22 hours ago • 12
PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss Paper • 2602.02493 • Published about 16 hours ago • 21
Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation Paper • 2602.02214 • Published about 20 hours ago • 16
PISCES: Annotation-free Text-to-Video Post-Training via Optimal Transport-Aligned Rewards Paper • 2602.01624 • Published 1 day ago • 20
PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing Paper • 2601.21957 • Published 5 days ago • 10
NativeTok: Native Visual Tokenization for Improved Image Generation Paper • 2601.22837 • Published 4 days ago • 9
DINO-SAE: DINO Spherical Autoencoder for High-Fidelity Image Reconstruction and Generation Paper • 2601.22904 • Published 4 days ago • 11
DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment Paper • 2601.20218 • Published 6 days ago • 14
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published 4 days ago • 43
One-step Latent-free Image Generation with Pixel Mean Flows Paper • 2601.22158 • Published 5 days ago • 14
LoL: Longer than Longer, Scaling Video Generation to Hour Paper • 2601.16914 • Published 11 days ago • 19
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation Paper • 2601.21420 • Published 5 days ago • 40
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models Paper • 2601.21639 • Published 5 days ago • 46
HyperAlign: Hypernetwork for Efficient Test-Time Alignment of Diffusion Models Paper • 2601.15968 • Published 12 days ago • 6
AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation Paper • 2601.17761 • Published 9 days ago • 14
iFSQ: Improving FSQ for Image Generation with 1 Line of Code Paper • 2601.17124 • Published 11 days ago • 31