Atif Saleem
atifsal
AI & ML interests
AI safety and security research and its super-alignment with human race by using sup-intelligence that follows ethics, compliance and mimics human emotions in real time with empathy. I recently have been interested in Quantum computing and Molecular computing for AI where efficient low energy computing is leveraged to develop AI agents and Robots for our everyday use.
Recent Activity
updated
a collection
3 days ago
VTON_Models
updated
a collection
4 days ago
Text-to-Image
updated
a collection
4 days ago
Text-to-Image
Organizations
Text-to-Image
ComfyUI-Models-Workflows
Fashion-Models
Image-to-Image_Models
Video-Text_to_Text_Models
Image-to-Video_Models
Vision-Models
VTON_Models
AI-Datasets
Research-Papers
-
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens
Paper • 2404.03413 • Published • 28 -
RepVideo: Rethinking Cross-Layer Representation for Video Generation
Paper • 2501.08994 • Published • 15 -
Hierarchical Cross-modal Prompt Learning for Vision-Language Models
Paper • 2507.14976 • Published • 2
Rerankers
STT-TTS-Audio-Models
Text-to-Video_Models
Graph-Learning_Models
Audio-Text-to-Text_Models
Text-Gen_Models
Any-to-Any_Models
Embedding-Models
AI-Models
-
microsoft/Orca-2-13b
Text Generation • Updated • 11.1k • 666 -
SG161222/Realistic_Vision_V6.0_B1_noVAE
Text-to-Image • Updated • 37.6k • 287 -
Runtime errorFeatured84
UDOP
🏃84Generate text from document images
-
Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca
Paper • 2304.08177 • Published • 2
Prompt-Engineering
Embeddings
Rerankers
Text-to-Image
STT-TTS-Audio-Models
ComfyUI-Models-Workflows
Text-to-Video_Models
Fashion-Models
Graph-Learning_Models
Image-to-Image_Models
Audio-Text-to-Text_Models
Video-Text_to_Text_Models
Text-Gen_Models
Image-to-Video_Models
Any-to-Any_Models
Vision-Models
Embedding-Models
VTON_Models
AI-Models
-
microsoft/Orca-2-13b
Text Generation • Updated • 11.1k • 666 -
SG161222/Realistic_Vision_V6.0_B1_noVAE
Text-to-Image • Updated • 37.6k • 287 -
Runtime errorFeatured84
UDOP
🏃84Generate text from document images
-
Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca
Paper • 2304.08177 • Published • 2
AI-Datasets
Prompt-Engineering
Research-Papers
-
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens
Paper • 2404.03413 • Published • 28 -
RepVideo: Rethinking Cross-Layer Representation for Video Generation
Paper • 2501.08994 • Published • 15 -
Hierarchical Cross-modal Prompt Learning for Vision-Language Models
Paper • 2507.14976 • Published • 2