joaogante (Joao Gante)

liked a model 3 months ago

deepseek-ai/DeepSeek-V3.2-Speciale

Text Generation • Updated Dec 1, 2025 • 11.6k • 685

liked a Space 4 months ago

The Smol Training Playbook

📚

3.05k

The secrets to building world-class LLMs

liked a Space 5 months ago

Maintain the unmaintainable

📚

79

Explore the complex relationships between 400+ machine learning models

liked a model 7 months ago

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 7.47M • 4.46k

liked 2 models 8 months ago

transformers-community/sep_cache

8B • Updated Aug 4, 2025 • 9 • 9

mistralai/Voxtral-Mini-3B-2507

Updated Jul 28, 2025 • 464k • 627

liked a model 11 months ago

Qwen/Qwen3-0.6B

Text Generation • 0.8B • Updated Jul 26, 2025 • 12.2M • 1.14k

liked a model about 1 year ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • 8B • Updated Feb 24, 2025 • 618k • 804

liked a model over 1 year ago

Qwen/Qwen2.5-0.5B-Instruct

Text Generation • 0.5B • Updated Sep 25, 2024 • 6.16M • 481

liked a Space over 1 year ago

SynthID Text

🏃

68

Watermarking LLM-generated text with SynthID Text

liked a model over 1 year ago

meta-llama/Llama-3.2-1B

Text Generation • 1B • Updated Oct 24, 2024 • 1.87M • 2.33k

liked a Space over 1 year ago

Repository statistics

📊

15

liked 2 models over 1 year ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 7.33M • 5.56k

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 286 • 1.71k

liked a Space over 1 year ago

FLUX.1 [dev]

🖥

9.4k

Generate images from your text prompt

liked a model over 1 year ago

google/gemma-2-2b-it

Text Generation • Updated Aug 27, 2024 • 446k • 1.3k

liked 3 Spaces over 1 year ago

Hf Co Docs Chat

🚀

8

Open-LLM performances are plateauing, let’s make the leaderboard steep again

🏔

126

Explore and compare advanced language models on a new leaderboard

Omni-Zero

🧛

462

Restylize & repose person ID

liked a Space almost 2 years ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.31k

Generate a curated web‑text dataset for LLM training
