LLaVA

Definition

A widely used open vision-language model that connects an image encoder to an open LLM, a common base for self-hosted multimodal features.

LLaVA pairs a visual encoder with an open language model so it can describe images and answer visual questions, and its open weights make it a frequent starting point for teams building or fine-tuning their own VLM features.

Also known as

LLaVA-NeXT

Specialist software house for video, real-time and AI products. Founded 2005. 50 in-house engineers.

Knowledge base

Blog Guides Courses Glossary Downloads

Company

Services Projects Demos Calculator Contacts

+852-8193-2621

Hong Kong

+1 (914) 775-5855

New York · USA

eager2develop@forasoft.com

Your message has been sent successfully

We will contact you soon

Message not sent. Please try again.

LLaVA

Related terms

VLM (Vision-language model)

Qwen-VL

Open-weights model

Fine-tuning