Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
riddhimanrana
/
fastvlm-0.5b-captions
like
3
Image-Text-to-Text
Transformers
Core ML
Safetensors
MLX
riddhimanrana/coco-fastvlm-2k-val2017
English
llava_qwen2
text-generation
finetuned
4bit
multimodal
conversational
arxiv:
2412.13303
arxiv:
1910.09700
License:
apple-amlr
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
fastvlm-0.5b-captions
864 MB
1 contributor
History:
19 commits
riddhimanrana
Update README.md
ce37ded
verified
4 months ago
demo
Upload demo.gif
7 months ago
fastvithd.mlpackage
Upload 3 files
7 months ago
.gitattributes
1.67 kB
Upload demo.gif
7 months ago
README.md
9.92 kB
Update README.md
4 months ago
added_tokens.json
101 Bytes
Upload model
7 months ago
config.json
1.45 kB
Upload model
7 months ago
merges.txt
1.67 MB
Upload model
7 months ago
model.safetensors
357 MB
xet
Upload model
7 months ago
model.safetensors.index.json
54.4 kB
Upload model
7 months ago
predict.py
3.68 kB
Create predict.py
7 months ago
preprocessor_config.json
467 Bytes
Upload model
7 months ago
processor_config.json
168 Bytes
Upload model
7 months ago
special_tokens_map.json
367 Bytes
Upload model
7 months ago
tokenizer.json
11.4 MB
xet
Upload model
7 months ago
tokenizer_config.json
1.64 kB
Upload model
7 months ago
vocab.json
2.78 MB
Upload model
7 months ago