Basic VLM - Workflow

Run this workflow on InstaSD

Get started in minutes! Run this ComfyUI workflow online - no setup required.

Use vision language models (VLMs) in your ComfyUI workflows. This is a basic example of how to use a VLM for simple image-to-text.

File	Destination	Source
mmproj-model-f16.gguf	/ComfyUI/models/LLavacheckpoints	Download
ggml-model-q4_k.gguf	/ComfyUI/models/LLavacheckpoints	Download
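
If you run the workflow on your own machine, it can help to confirm that both files from the table are in the destination folder before loading the workflow. The following is a minimal sketch, not part of the workflow itself. The helper name `missing_model_files` is hypothetical, and the install root (`/`, so that the folder resolves to `/ComfyUI/models/LLavacheckpoints` as in the table) is an assumption you may need to change.

```python
from pathlib import Path

# File names and destination folder are copied from the table above.
REQUIRED_FILES = ("mmproj-model-f16.gguf", "ggml-model-q4_k.gguf")
DESTINATION = Path("ComfyUI") / "models" / "LLavacheckpoints"


def missing_model_files(install_root: Path) -> list[str]:
    """Return the required file names that are not present in the destination folder.

    Hypothetical helper: install_root is the directory that contains ComfyUI.
    """
    target = install_root / DESTINATION
    return [name for name in REQUIRED_FILES if not (target / name).is_file()]


if __name__ == "__main__":
    # Assumption: ComfyUI is installed at /ComfyUI, matching the table.
    missing = missing_model_files(Path("/"))
    if missing:
        print("Missing model files:", ", ".join(missing))
    else:
        print("All model files are in place.")
```

The check only looks at file names, not file contents, so it will not catch a partial or corrupted download.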

LLavaSamplerSimple, SimpleText, LlavaClipLoader, LLava Loader Simple, ViewText, LoadImage