Preview for Turn Images into Prompts Using DeepSeek Janus Pro
Turn Images into Prompts Using DeepSeek Janus Pro workflow diagram

Run this workflow on InstaSD

Get started in minutes! Run this ComfyUI workflow online - no setup required.

Description

Effortlessly turn visual content into creative inspiration using the Janus Prompt from Image + Flux txt2img workflow in ComfyUI. This setup extracts detailed prompts from any image using Janus, then feeds that description into a Flux-based generation pipeline to create stunning new variationsβ€”all automatically.


🎯 Features

  • Janus Vision-to-Text – Uses JanusImageUnderstanding to analyze and describe any image.
  • Prompt Switching – Choose between the generated Janus prompt or your own custom prompt via Any Switch.
  • Flux-Optimized Generation – Leverages FluxGuidance for fine-tuned output fidelity based on the prompt.
  • SDXL-Compatible – Works with SD3 latent image initialization for sharp, high-res generation.
  • Flexible Prompt Flow – Integrates easy showAnything and easy positive nodes for live prompt control and inspection.

πŸ’‘ Use Cases

  • Art Remixing – Feed in an artwork or concept image and create new versions based on its description.
  • Prompt Engineering – Learn how Janus sees your images and use that data to refine custom prompts.
  • Visual Brainstorming – Instantly turn sketches or references into evolved AI concepts.
  • Automatic Inspiration – Let the system suggest creative directions based on image context.
  • Character Expansion – Generate alternate looks of a character portrait using its own style and mood.

βš™οΈ How It Works

  1. Load an Image – Use LoadImage to input a visual reference (e.g., character or concept art).
  2. Describe with Janus – JanusModelLoader and JanusImageUnderstanding extract a natural-language prompt from the image.
  3. View the Prompt – Use easy showAnything to preview Janus' description, or mix with your custom input using Any Switch.
  4. Encode & Guide – CLIPTextEncode processes the prompt and FluxGuidance enhances generation control.
  5. Generate the Image – Sample new variations with KSampler using Flux and SD3 latent space.
  6. Decode & Save – Use VAEDecode and SaveImage to output the final render.

Credits: pixaroma

Models

FileDestinationSource
ae.safetensors/ComfyUI/models/vaeDownload
t5-v1_1-xxl-encoder-Q8_0.gguf/ComfyUI/models/clipDownload
clip_l.safetensors/ComfyUI/models/clipDownload
flux1-dev-Q8_0.gguf/ComfyUI/models/unetDownload
pytorch_model.bin/ComfyUI/models/Janus-Pro/Janus-Pro-1BDownload
preprocessor_config.json/ComfyUI/models/Janus-Pro/Janus-Pro-1BDownload
config.json/ComfyUI/models/Janus-Pro/Janus-Pro-1BDownload
processor_config.json/ComfyUI/models/Janus-Pro/Janus-Pro-1BDownload
special_tokens_map.json/ComfyUI/models/Janus-Pro/Janus-Pro-1BDownload
tokenizer.json/ComfyUI/models/Janus-Pro/Janus-Pro-1BDownload
tokenizer_config.json/ComfyUI/models/Janus-Pro/Janus-Pro-1BDownload

Nodes

CLIPTextEncodeFluxGuidanceKSamplerVAEDecodeVAELoaderDualCLIPLoaderGGUFEmptySD3LatentImageSaveImageUnetLoaderGGUFJanusImageUnderstandingeasy showAnythingeasy positiveAny Switch (rgthree)Fast Groups Muter (rgthree)LoadImageJanusModelLoader