HumanNet: Scaling Human-centric Video Learning to One Million Hours Paper β’ 2605.06747 β’ Published 7 days ago β’ 48
Representation FrΓ©chet Loss for Visual Generation Paper β’ 2604.28190 β’ Published 14 days ago β’ 28
Seeing Fast and Slow: Learning the Flow of Time in Videos Paper β’ 2604.21931 β’ Published 21 days ago β’ 19
view article Article Nucleus-Image: Scaling Text-to-Image with Sparse Mixture of Experts NucleusAI β’ 30 days ago β’ 11
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper β’ 2604.11804 β’ Published Apr 13 β’ 72