Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Mohamed Elhoseiny's picture
1

Mohamed Elhoseiny

mhelhoseiny
https://www.mohamed-elhoseiny.com/
  • moElhoseiny
  • mhelhoseiny

AI & ML interests

Computer Vision

Organizations

None yet

authored 2 papers 9 months ago

From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Paper • 2504.16080 • Published Apr 22, 2025 • 15

4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding

Paper • 2503.17827 • Published Mar 22, 2025 • 8
authored a paper over 1 year ago

Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling

Paper • 2408.03695 • Published Aug 7, 2024 • 13
authored a paper about 2 years ago

MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning

Paper • 2310.09478 • Published Oct 14, 2023 • 21
authored a paper over 2 years ago

MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models

Paper • 2304.10592 • Published Apr 20, 2023 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs