Article Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality ibm-granite • 4 days ago • 21
MiniCPM-V 4.6 Collection MLX variants of MiniCPM-V 4.6, 1.3B parameters (SigLIP2 400M vision encoder + Qwen3.5-0.8B LLM), repo: https://huggingface.co/openbmb/MiniCPM-V-4.6 • 7 items • Updated 7 days ago • 1
Article EMO: Pretraining mixture of experts for emergent modularity allenai • 10 days ago • 35
Article Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents nvidia • 20 days ago • 55
TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment Paper • 2604.12012 • Published Apr 13 • 12
Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • 25 days ago • 45
Article DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models lightonai • 27 days ago • 38
WildDet3D Collection This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 8 items • Updated Apr 13 • 18
Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs nielsr • Apr 7 • 61
Falcon Perception Collection Falcon-Perception and Falcon-OCR models: early-fusion, natively multimodal, dense Autoregressive Transformer models. • 5 items • Updated Apr 6 • 14
Article SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation OpenMed • Mar 23 • 17