Paper: MobileVLM: A Fast, Reproducible and Strong Vision Language Assistant for Mobile Devices (arXiv:2312.16886)
MobileLLaMA-1.4B-Chat is fine-tuned from MobileLLaMA-1.4B-Base using supervised instruction fine-tuning on the ShareGPT dataset.
The model weights can be loaded with Hugging Face Transformers. Examples can be found on GitHub.
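As a rough illustration of loading the weights with Transformers, here is a minimal sketch. It is not taken from the official examples. The Hub repository id `mtgv/MobileLLaMA-1.4B-Chat` is an assumption (this card does not state it), and `build_prompt` is a hypothetical helper whose single-turn template is only a placeholder; check the official examples for the prompt format actually used during fine-tuning.

```python
# Assumed Hub repository id; it is not stated in this card.
MODEL_ID = "mtgv/MobileLLaMA-1.4B-Chat"


def build_prompt(user_message: str) -> str:
    """Hypothetical single-turn template (an assumption, not the official format)."""
    return f"USER: {user_message.strip()} ASSISTANT:"


def generate_reply(user_message: str, model_id: str = MODEL_ID,
                   max_new_tokens: int = 64) -> str:
    """Load the tokenizer and model, then generate one reply."""
    # Imported lazily so build_prompt can be used without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    model.eval()

    inputs = tokenizer(build_prompt(user_message), return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the newly generated tokens, dropping the prompt tokens.
    prompt_length = inputs["input_ids"].shape[1]
    return tokenizer.decode(output_ids[0, prompt_length:], skip_special_tokens=True)
```

Calling `generate_reply("What is a vision language model?")` downloads the weights on first use and returns the decoded continuation. The sketch runs on CPU with default precision; moving the model and inputs to a GPU is left out to keep it short.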
For details, please refer to Section 4.1 of our paper, MobileVLM: A Fast, Strong and Open Vision Language Assistant for Mobile Devices.