alex long
xlalex
·
AI & ML interests
None yet
Recent Activity
updated
a collection
2 months ago
data
updated
a collection
2 months ago
data
updated
a collection
2 months ago
data
Organizations
None yet
svg
-
OmniSVG: A Unified Scalable Vector Graphics Generation Model
Paper • 2504.06263 • Published • 182 -
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
Paper • 2510.11341 • Published • 34 -
SVGThinker: Instruction-Aligned and Reasoning-Driven Text-to-SVG Generation
Paper • 2509.24299 • Published -
VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation
Paper • 2511.02778 • Published • 101
interleaved
-
Interleaved Reasoning for Large Language Models via Reinforcement Learning
Paper • 2505.19640 • Published • 15 -
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Paper • 2510.27492 • Published • 82 -
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
Paper • 2508.21112 • Published • 77 -
SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents
Paper • 2509.06283 • Published • 17
3d
omni
synthesis
-
Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis
Paper • 2503.08741 • Published • 1 -
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
Paper • 2406.17294 • Published • 11 -
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis
Paper • 2506.02096 • Published • 52 -
VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL
Paper • 2505.23977 • Published • 10
survey
critic
agent
-
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 271 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 228 -
Scaling Agents via Continual Pre-training
Paper • 2509.13310 • Published • 117 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 122
data
-
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 50 -
Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models
Paper • 2412.05939 • Published • 15 -
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper • 2411.15124 • Published • 67 -
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
Paper • 2504.13180 • Published • 19
video
ocr
world model
infra
-
LightMem: Lightweight and Efficient Memory-Augmented Generation
Paper • 2510.18866 • Published • 111 -
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders
Paper • 2510.19779 • Published • 60 -
Emu3.5: Native Multimodal Models are World Learners
Paper • 2510.26583 • Published • 108
perception
-
Perception-Aware Policy Optimization for Multimodal Reasoning
Paper • 2507.06448 • Published • 47 -
Perception Encoder: The best visual embeddings are not at the output of the network
Paper • 2504.13181 • Published • 34 -
Perception Tokens Enhance Visual Reasoning in Multimodal Language Models
Paper • 2412.03548 • Published • 17 -
Slow Perception: Let's Perceive Geometric Figures Step-by-step
Paper • 2412.20631 • Published • 15
RL
speech full duplex
self-paly
-
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper • 2505.03335 • Published • 188 -
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
Paper • 2408.06195 • Published • 73 -
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
Paper • 2508.14029 • Published • 118 -
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
Paper • 2509.25541 • Published • 140
encoder
data
-
OpenThoughts: Data Recipes for Reasoning Models
Paper • 2506.04178 • Published • 50 -
Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models
Paper • 2412.05939 • Published • 15 -
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper • 2411.15124 • Published • 67 -
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
Paper • 2504.13180 • Published • 19
svg
-
OmniSVG: A Unified Scalable Vector Graphics Generation Model
Paper • 2504.06263 • Published • 182 -
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
Paper • 2510.11341 • Published • 34 -
SVGThinker: Instruction-Aligned and Reasoning-Driven Text-to-SVG Generation
Paper • 2509.24299 • Published -
VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation
Paper • 2511.02778 • Published • 101
video
interleaved
-
Interleaved Reasoning for Large Language Models via Reinforcement Learning
Paper • 2505.19640 • Published • 15 -
ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning
Paper • 2510.27492 • Published • 82 -
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
Paper • 2508.21112 • Published • 77 -
SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents
Paper • 2509.06283 • Published • 17
ocr
3d
world model
omni
infra
-
LightMem: Lightweight and Efficient Memory-Augmented Generation
Paper • 2510.18866 • Published • 111 -
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders
Paper • 2510.19779 • Published • 60 -
Emu3.5: Native Multimodal Models are World Learners
Paper • 2510.26583 • Published • 108
synthesis
-
Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis
Paper • 2503.08741 • Published • 1 -
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
Paper • 2406.17294 • Published • 11 -
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis
Paper • 2506.02096 • Published • 52 -
VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL
Paper • 2505.23977 • Published • 10
perception
-
Perception-Aware Policy Optimization for Multimodal Reasoning
Paper • 2507.06448 • Published • 47 -
Perception Encoder: The best visual embeddings are not at the output of the network
Paper • 2504.13181 • Published • 34 -
Perception Tokens Enhance Visual Reasoning in Multimodal Language Models
Paper • 2412.03548 • Published • 17 -
Slow Perception: Let's Perceive Geometric Figures Step-by-step
Paper • 2412.20631 • Published • 15
survey
RL
critic
speech full duplex
agent
-
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 271 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 228 -
Scaling Agents via Continual Pre-training
Paper • 2509.13310 • Published • 117 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 122
self-paly
-
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper • 2505.03335 • Published • 188 -
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
Paper • 2408.06195 • Published • 73 -
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
Paper • 2508.14029 • Published • 118 -
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
Paper • 2509.25541 • Published • 140