DaoanZhang's picture

DaoanZhang

DwanZhang

·

AI & ML interests

None yet

Recent Activity

liked a dataset 7 days ago

LeeLi4704/VEU-Bench

upvoted a paper 14 days ago

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

upvoted a paper 19 days ago

VIDEOP2R: Video Understanding from Perception to Reasoning

View all activity

Organizations

liked a dataset 7 days ago

LeeLi4704/VEU-Bench

Preview • Updated Jun 14 • 100 • 8

upvoted a paper 14 days ago

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

Paper • 2511.18050 • Published 16 days ago • 37

upvoted a paper 19 days ago

VIDEOP2R: Video Understanding from Perception to Reasoning

Paper • 2511.11113 • Published 24 days ago • 111

upvoted a paper 24 days ago

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

Paper • 2511.08521 • Published 27 days ago • 37

published a dataset 4 months ago

worldrl/Uni-Janus

Viewer • Updated Aug 9 • 19.2k • 74

upvoted a paper 4 months ago

MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27 • 14

updated a dataset 5 months ago

Proactive-lmm-2/video_2

Updated Jul 26 • 10

published a dataset 5 months ago

Proactive-lmm-2/video_2

Updated Jul 26 • 10

updated a dataset 5 months ago

DwanZhang/useless_store

Updated Jul 9 • 15

New activity in rghermi/sf20k 6 months ago

Request for Alternative Access to SF20K Videos for Academic Research

#2 opened 6 months ago by

upvoted a paper 6 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 303

published a dataset 6 months ago

DwanZhang/useless_store

Updated Jul 9 • 15

upvoted 2 papers 7 months ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published Nov 21, 2024 • 25

Emerging Properties in Unified Multimodal Pretraining

Paper • 2505.14683 • Published May 20 • 134

updated 2 datasets 7 months ago

DwanZhang/useless

Updated May 16 • 90

Proactive-LMM/VisualActBench

Viewer • Updated May 13 • 1 • 16

published a dataset 7 months ago

Proactive-LMM/VisualActBench

Viewer • Updated May 13 • 1 • 16

New activity in worldrl/WorldGenBench 7 months ago

Add link to paper

#2 opened 7 months ago by

published a dataset 7 months ago

DwanZhang/useless

Updated May 16 • 90

upvoted a paper 7 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7 • 82