facebook/vjepa2-vitl-fpc64-256 Video Classification β’ 0.3B β’ Updated Aug 11, 2025 β’ 57.2k β’ 172
ibm-granite/granite-docling-258M Image-Text-to-Text β’ 0.3B β’ Updated Sep 23, 2025 β’ 214k β’ 1.08k
Runtime error 36 Multimodal RAG with Granite Vision π 36 RAG example using Granite [vision, embedding, instruct]
Running on Zero Featured 261 granite-docling-258M demo π 261 Convert images to structured text and answer questions
docling-project/SmolDocling-256M-preview Image-Text-to-Text β’ 0.3B β’ Updated Sep 17, 2025 β’ 47.2k β’ 1.6k
Running on A100 224 Omnilingual ASR Media Transcription π 224 Transcribe audio or video into text in any language