VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs Paper β’ 2406.07476 β’ Published Jun 11, 2024 β’ 36
DAMO-NLP-SG/VideoLLaMA2.1-7B-AV Visual Question Answering β’ 9B β’ Updated Oct 25, 2024 β’ 3.19k β’ 16
DAMO-NLP-SG/VideoLLaMA2-7B-16F Visual Question Answering β’ 8B β’ Updated Aug 13, 2024 β’ 55 β’ 14