Mahmud ElHuseyni 🇵🇸
MElHuseyni
AI & ML interests
Computer Vision
NLP
Machine Learning
Recent Activity
liked a model 1 day ago
moussaKam/mbarthez
liked a model 2 days ago
LiquidAI/LFM2.5-ColBERT-350M
liked a Space 3 days ago
treble-technologies/ffasr
Organizations
SmolVLM 🚐
OCR Models 👀️📃
Visual Embedding Models 🖼️
- jinaai/jina-embeddings-v4
  Visual Document Retrieval • 4B • Updated • 598k • 526
- vidore/colqwen2.5-v0.2
  Visual Document Retrieval • Updated • 122k • 99
- nomic-ai/colnomic-embed-multimodal-7b
  Visual Document Retrieval • Updated • 30.4k • 105
- nvidia/llama-nemoretriever-colembed-3b-v1
  Visual Document Retrieval • 4B • Updated • 176 • 75
Speech Models 🎧
Arabic Models (LLM, VLM, Multimodel)
Image Segmentation Models 🍪
- nvidia/segformer-b5-finetuned-cityscapes-1024-1024
  Image Segmentation • Updated • 121k • 43
- nvidia/segformer-b0-finetuned-ade-512-512
  Image Segmentation • 3.75M • Updated • 284k • 190
- facebook/maskformer-swin-base-ade
  Image Segmentation • Updated • 1.03k • 13
- facebook/maskformer-swin-base-coco
  Image Segmentation • 0.1B • Updated • 1.09k • 26
Object Detection Models 🍉
VLM Leaderboards 📈
- OCRBenchv2 Leaderboard 🏆
  Running • Agents • 46
  Display OCRBench leaderboard for text recognition models
- Vidore Leaderboard 🥇
  Running • Agents • 208
  Browse and compare visual document retrieval model scores
- Open VLM Leaderboard 🌎
  Running on CPU • Agents • 1.02k
  VLMEvalKit Evaluation Results Collection
- Vision Arena (Testing VLMs side-by-side) 🖼
  Running • Featured • 561
  Explore Vision Arena visual AI demo online
Emotion Detection