Florence-2: Mastering Multiple Vision Tasks with a Single VLM Model

How to Perform Computer Vision Tasks with Florence-2