Training Foundation Models on a Full-Stack AMD Platform: Compute, Networking, and System Design • arXiv:2511.17127 • Published 17 days ago • 1 upvote
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models • arXiv:2511.08577 • Published 27 days ago • 104 upvotes
RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments • arXiv:2511.07317 • Published 28 days ago • 13 upvotes
Revisiting Multimodal Positional Encoding in Vision-Language Models • arXiv:2510.23095 • Published Oct 27 • 20 upvotes
When "Correct" Is Not Safe: Can We Trust Functionally Correct Patches Generated by Code Agents? • arXiv:2510.17862 • Published Oct 15 • 6 upvotes
Glyph: Scaling Context Windows via Visual-Text Compression • arXiv:2510.17800 • Published Oct 20 • 67 upvotes
Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models • arXiv:2510.14961 • Published Oct 16 • 7 upvotes
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM • arXiv:2510.15870 • Published Oct 17 • 89 upvotes
Cache-to-Cache: Direct Semantic Communication Between Large Language Models • arXiv:2510.03215 • Published Oct 3 • 97 upvotes
BroRL: Scaling Reinforcement Learning via Broadened Exploration • arXiv:2510.01180 • Published Oct 1 • 18 upvotes