Fu-En Yang

FuEnYang

https://fuenyang1127.github.io/

AI & ML interests

Computer Vision, Deep Learning, Vision-Language Models (VLMs), Vision-Language-Action Models (VLAs), Reasoning Models, Embodied AI

Recent Activity

authored a paper 1 day ago

3AM: Segment Anything with Geometric Consistency in Videos

upvoted a paper 1 day ago

Transition Matching Distillation for Fast Video Generation

upvoted a paper 1 day ago

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

View all activity

Organizations

upvoted 8 papers 1 day ago

Transition Matching Distillation for Fast Video Generation

Paper • 2601.09881 • Published 3 days ago • 14

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published 2 days ago • 16

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

Paper • 2601.10061 • Published 2 days ago • 25

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published 3 days ago • 141

RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes

Paper • 2601.05249 • Published 9 days ago • 44

3AM: Segment Anything with Geometric Consistency in Videos

Paper • 2601.08831 • Published 4 days ago • 31

Motion Attribution for Video Generation

Paper • 2601.08828 • Published 4 days ago • 65

Flow Equivariant World Models: Memory for Partially Observed Dynamic Environments

Paper • 2601.01075 • Published 14 days ago • 4

upvoted 4 papers 2 days ago

Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

Paper • 2601.08955 • Published 4 days ago • 10

Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering

Paper • 2601.09697 • Published 3 days ago • 6

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

Paper • 2601.09575 • Published 3 days ago • 24

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published 3 days ago • 44

upvoted 8 papers 3 days ago

MemoBrain: Executive Memory as an Agentic Brain for Reasoning

Paper • 2601.08079 • Published 4 days ago • 34

MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences

Paper • 2601.06789 • Published 6 days ago • 73

ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs

Paper • 2506.15211 • Published Jun 18, 2025 • 39

All is Not Lost: LLM Recovery without Checkpoints

Paper • 2506.15461 • Published Jun 18, 2025 • 39

Sekai: A Video Dataset towards World Exploration

Paper • 2506.15675 • Published Jun 18, 2025 • 66

Fu-En Yang

AI & ML interests

Recent Activity

Organizations

FuEnYang's activity