mishig's Collections

most ducked models 🦆🦆🦆

updated Apr 28, 2025

https://x.com/jeremyphoward/status/1881264223646576786

  • answerdotai/ModernBERT-large

    Fill-Mask • 0.4B • Updated Jan 15, 2025 • 502k • 470

  • Qwen/Qwen2-VL-72B-Instruct

    Image-Text-to-Text • 73B • Updated Feb 6, 2025 • 28.7k • 310

  • Qwen/Qwen2.5-72B-Instruct

    Text Generation • 73B • Updated Jan 12, 2025 • 881k • 941

  • answerdotai/ModernBERT-base

    Fill-Mask • 0.1B • Updated Jan 15, 2025 • 2.13M • 1.05k

  • Qwen/Qwen2.5-Coder-32B-Instruct

    Text Generation • 33B • Updated Jan 12, 2025 • 1.08M • 2.02k