Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
sentence-transformers
Safetensors
ONNX
GGUF
Transformers.js
MLX
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 12
Inference Providers
Groq
Novita
Nebius AI
Cerebras
SambaNova
Nscale
fal
Hyperbolic
+ 11
Apply filters
Models
9,615
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
image-to-text
Clear all
stepfun-ai/GELab-Zero-4B-preview
Image-to-Text
•
4B
•
Updated
9 days ago
•
860
•
92
datalab-to/chandra
Image-to-Text
•
9B
•
Updated
Oct 21
•
87.1k
•
409
Salesforce/blip-image-captioning-base
Image-to-Text
•
Updated
Feb 3
•
2.23M
•
823
allenai/olmOCR-2-7B-1025-FP8
Image-to-Text
•
8B
•
Updated
1 day ago
•
737k
•
157
monkt/paddleocr-onnx
Image-to-Text
•
Updated
Oct 7
•
25
lightonai/LightOnOCR-1B-1025
Image-to-Text
•
Updated
17 days ago
•
15.1k
•
181
nvidia/nemotron-ocr-v1
Image-to-Text
•
Updated
about 1 month ago
•
395
•
44
allenai/olmOCR-2-7B-1025
Image-to-Text
•
8B
•
Updated
Oct 22
•
32.5k
•
90
thesby/Qwen3-VL-8B-NSFW-Caption-V4.5
Image-to-Text
•
9B
•
Updated
Nov 7
•
16.9k
•
48
xtuner/llava-llama-3-8b-v1_1-gguf
Image-to-Text
•
8B
•
Updated
Apr 30, 2024
•
3.51k
•
221
VLM2Vec/VLM2Vec-V2.0
Image-to-Text
•
Updated
Jul 13
•
10.4k
•
21
asmud/EasyOCR-onnx
Image-to-Text
•
Updated
Sep 2
•
2
XiaomiMiMo/MiMo-Embodied-7B
Image-to-Text
•
8B
•
Updated
19 days ago
•
1.06k
•
47
shkb/MemeLeak
Image-to-Text
•
9B
•
Updated
8 days ago
•
103
•
2
prithivMLmods/LightOnOCR-1B-1025-AIO-GGUF
Image-to-Text
•
0.8B
•
Updated
3 days ago
•
223
•
2
Salesforce/blip-image-captioning-large
Image-to-Text
•
0.5B
•
Updated
Feb 3
•
1.1M
•
1.44k
team-lucid/trocr-small-korean
Image-to-Text
•
54.5M
•
Updated
Jul 1, 2023
•
514
•
18
hezarai/trocr-base-fa-v2
Image-to-Text
•
Updated
Nov 14, 2024
•
89
•
4
microsoft/kosmos-2-patch14-224
Image-to-Text
•
2B
•
Updated
Nov 28, 2023
•
196k
•
180
deepghs/paddleocr
Image-to-Text
•
Updated
23 days ago
•
12
OleehyO/TexTeller
Image-to-Text
•
0.3B
•
Updated
Jun 22, 2024
•
7.76k
•
38
breezedeus/pix2text-mfr
Image-to-Text
•
Updated
May 5, 2024
•
167k
•
47
GnanaPrasath/ocr_tamil
Image-to-Text
•
Updated
Feb 14, 2024
•
19
techietrader/captcha_ocr
Image-to-Text
•
Updated
Jun 6, 2024
•
21
unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit
Image-to-Text
•
6B
•
Updated
Dec 10, 2024
•
518k
•
80
unsloth/Llama-3.2-11B-Vision-Instruct
Image-to-Text
•
11B
•
Updated
Dec 10, 2024
•
19k
•
86
Vikhrmodels/Vikhr-2-VL-2b-Instruct-experimental
Image-to-Text
•
2B
•
Updated
Nov 3, 2024
•
51
•
20
HuggingFaceTB/SmolVLM-256M-Base
Image-to-Text
•
0.3B
•
Updated
Jan 20
•
4.33k
•
18
allenai/olmOCR-7B-0225-preview
Image-to-Text
•
8B
•
Updated
Aug 19
•
7.05k
•
704
ibm-granite/granite-vision-3.1-2b-preview
Image-to-Text
•
3B
•
Updated
Jun 12
•
805
•
110
Previous
1
2
3
...
100
Next