Florence-2: Mastering Multiple Vision Tasks with a Single VLM Model

Learning Generalist Models for Anomaly Detection

Exploring “Small” Vision-Language Models with TinyGPT-V