Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
glab-caltech 's Collections
TWIN
VALOR

TWIN

updated 1 day ago

Datasets and models from the paper "Same or Not? Enhancing Visual Perception in Vision-Language Models"

Upvote
2

  • glab-caltech/TWIN-Qwen2.5-VL-3B

    Image-Text-to-Text • 4B • Updated 29 days ago • 67 • 2

  • glab-caltech/TWIN-InternVL3_5-1B

    Image-Text-to-Text • 1B • Updated 29 days ago • 25 • 1

  • glab-caltech/FGVQA

    Viewer • Updated 30 days ago • 12k • 96 • 1

  • glab-caltech/TWIN

    Viewer • Updated 30 days ago • 562k • 131 • 3

  • Same or Not? Enhancing Visual Perception in Vision-Language Models

    Paper • 2512.23592 • Published about 1 month ago • 1
Upvote
2
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs