GEditBench v2: A Human-Aligned Benchmark for General Image Editing • Paper • 2603.28547 • Published 6 days ago • 32
PEARL: Personalized Streaming Video Understanding Model • Paper • 2603.20422 • Published 16 days ago • 40
WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics • Paper • 2603.13391 • Published 26 days ago • 19
CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation • Paper • 2603.08652 • Published 27 days ago • 39
Proact-VL: A Proactive VideoLLM for Real-Time AI Companions • Paper • 2603.03447 • Published Mar 3 • 37
GEBench: Benchmarking Image Generation Models as GUI Environments • Paper • 2602.09007 • Published Feb 9 • 39
PCC-Pretrained • Collection • Pretraining Context Compressor for Large Language Models with Embedding-Based Memory • 13 items • Updated Mar 2 • 1