Daniel Bourke's picture

Building on HF

Daniel Bourke PRO

mrdbourke

·

https://www.mrdbourke.com

AI & ML interests

Computer vision. Small on-device models. VLMs. High-quality tutorials.

Recent Activity

liked a model about 12 hours ago

mlx-community/cohere-transcribe-03-2026-mlx-8bit

upvoted an article about 14 hours ago

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

liked a model 6 days ago

XiaomiMiMo/MiMo-V2.5-ASR

View all activity

Organizations

upvoted an article about 14 hours ago

Article

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

ibm-granite

•

4 days ago

• 21

upvoted a collection 7 days ago

MiniCPM-V 4.6

MLX variants of MiniCPM-V 4.6, 1.3B parameters (SigLIP2 400M vision encoder + Qwen3.5-0.8B LLM), repo: https://huggingface.co/openbmb/MiniCPM-V-4.6 • 7 items • Updated 7 days ago • 1

upvoted an article 7 days ago

Article

EMO: Pretraining mixture of experts for emergent modularity

allenai

•

10 days ago

• 35

upvoted an article 17 days ago

Article

Build a Domain-Specific Embedding Model in Under a Day

nvidia

•

Mar 20

• 73

upvoted an article 19 days ago

Article

Granite 4.1 LLMs: How They’re Built

ibm-granite

•

19 days ago

• 70

upvoted an article 20 days ago

Article

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

nvidia

•

20 days ago

• 55

upvoted a paper 21 days ago

TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment

Paper • 2604.12012 • Published Apr 13 • 12

upvoted an article 24 days ago

Article

DeepSeek-V4: a million-token context that agents can actually use

burtenshaw

•

25 days ago

• 45

upvoted a collection 24 days ago

DeepSeek-V4

4 items • Updated 24 days ago • 645

upvoted an article 24 days ago

Article

ML Intern Takes Our Post-Training Internship Test

cmpatino

•

25 days ago

• 31

upvoted an article 26 days ago

Article

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

lightonai

•

27 days ago

• 38

upvoted an article 27 days ago

Article

The PR you would have opened yourself

pcuenq, awni

•

Apr 16

• 71

upvoted 2 collections about 1 month ago

Qwen3.6

4 items • Updated 26 days ago • 359

NVIDIA EGM

Efficient Grounding Models • 4 items • Updated 10 days ago • 8

upvoted a paper about 1 month ago

Falcon Perception

Paper • 2603.27365 • Published Mar 28 • 15

upvoted a collection about 1 month ago

WildDet3D

This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 8 items • Updated Apr 13 • 18

upvoted an article about 1 month ago

Article

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

nielsr

•

Apr 7

• 61

upvoted 2 collections about 1 month ago

EUPE

6 items • Updated Mar 30 • 28

Falcon Perception

Falcon-Perception and Falcon-OCR model: early-fusion, natively multimodal, dense Autoregressive Transformer models. • 5 items • Updated Apr 6 • 14

upvoted an article about 1 month ago

Article

SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation

OpenMed

•

Mar 23

• 17