NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published 9 days ago • 106
GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction Paper • 2512.25073 • Published 10 days ago • 38
Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Article • Apr 15, 2024 • 191
TripoSR: Fast 3D Object Reconstruction from a Single Image Paper • 2403.02151 • Published Mar 4, 2024 • 16
Image Arena Leaderboard: Image Generation and Image Editing Arena & Leaderboard • Running • Featured • 565
VideoBooth: Diffusion-based Video Generation with Image Prompts Paper • 2312.00777 • Published Dec 1, 2023 • 24
3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features Paper • 2311.04391 • Published Nov 7, 2023 • 14
LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery Paper • 2310.18356 • Published Oct 24, 2023 • 24
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation Paper • 2310.19512 • Published Oct 30, 2023 • 16