free-bit

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

Pointcept/Concerto

upvoted a paper 5 days ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

liked a model 14 days ago

Snowflake/snowflake-arctic-embed-l-v2.0

Organizations

None yet

upvoted a paper 5 days ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 225

upvoted 2 papers 19 days ago

Point Transformer V3: Simpler, Faster, Stronger

Paper • 2312.10035 • Published Dec 15, 2023 • 22

Sonata: Self-Supervised Learning of Reliable Point Representations

Paper • 2503.16429 • Published Mar 20, 2025 • 13

upvoted 17 papers 23 days ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published 26 days ago • 134

Self-Adapting Language Models

Paper • 2506.10943 • Published Jun 12, 2025 • 7

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 283

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24, 2025 • 77

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22, 2025 • 126

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 434

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20, 2025 • 157

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19, 2025 • 212

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4, 2025 • 253

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26, 2025 • 168

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10, 2025 • 133

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 203

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 306

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14, 2025 • 98

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11, 2025 • 154

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 321

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5, 2025 • 77