MobileVLM : A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices
Paper
•
2312.16886
•
Published
•
22
MobileLLaMA-2.7B-Chat is fine-tuned from MobileLLaMA-2.7B-Base with supervised instruction fine-tuning on ShareGPT dataset.
Model weights can be loaded with Hugging Face Transformers. Examples can be found at Github.
please refer to our paper in section 4.1: MobileVLM: A Fast, Strong and Open Vision Language Assistant for Mobile Devices.