
This workflow combines the power of LLM text models served by Ollama with Flux image generation. It takes an image as input, improves the prompt, and changes it according to your instructions.
Information:
First of all, you need to download and install Ollama.
The current workflow uses 2 LLM models:
LLaVA for image tagging and Mistral for prompt manipulation.
Before running ComfyUI, you need to download the models:
Open a command prompt in the Ollama folder (the one containing ollama.exe) and run
ollama pull llava:7b (if you have 8-12 GB of VRAM)
or
ollama pull llava:13b (for 16 GB of VRAM or more)
Wait for the model to finish downloading, then run
ollama pull mistral-small
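
The choice between the two LLaVA sizes can be sketched as a small Python helper. This is only an illustration: the function names pick_llava_tag and pull_commands are hypothetical, and the handling of cards outside the ranges given above (below 8 GB, or between 12 and 16 GB) is an assumption, not something the workflow specifies.

```python
from typing import List


def pick_llava_tag(vram_gb: float) -> str:
    """Return the LLaVA tag suggested above for a given amount of VRAM in GB."""
    # 16 GB or more: the larger 13b model.
    # Assumption: anything smaller (including the uncovered 12-16 GB gap
    # and cards under 8 GB) falls back to the 7b model.
    return "llava:13b" if vram_gb >= 16 else "llava:7b"


def pull_commands(vram_gb: float) -> List[str]:
    """Build the two 'ollama pull' commands to type into the command prompt."""
    return [
        f"ollama pull {pick_llava_tag(vram_gb)}",
        "ollama pull mistral-small",
    ]
```

For example, pull_commands(12) gives the llava:7b command followed by the mistral-small command, which you can then paste into the command prompt one at a time.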
After the download has finished, start ollama app.exe, wait for the tray icon to appear, then start ComfyUI and install any missing custom nodes.
If they are not already set, select llava in the Ollama Vision node and mistral in the Translate and Ollama Generate Advance nodes.
If you plan to give instructions in another language, turn on and use the Translate node.
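
If the nodes show no models to select, it can help to confirm that Ollama is running and that both models were downloaded. The following is a minimal Python sketch, assuming Ollama listens on its default local address (http://localhost:11434) and using its /api/tags endpoint, which lists the locally installed models. The helper names installed_models and missing_models are hypothetical.

```python
import json
import urllib.request
from typing import Iterable, List

# Assumption: Ollama is running on its default address and port.
OLLAMA_URL = "http://localhost:11434"


def installed_models(base_url: str = OLLAMA_URL, timeout: float = 5.0) -> List[str]:
    """Ask the local Ollama server which models are installed."""
    # Raises URLError if Ollama is not running (no tray icon yet).
    with urllib.request.urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
        data = json.load(resp)
    return [model["name"] for model in data.get("models", [])]


def missing_models(required: Iterable[str], installed: Iterable[str]) -> List[str]:
    """Return the required models that are not in the installed list."""
    installed = list(installed)

    def is_installed(req: str) -> bool:
        for name in installed:
            if name == req:
                return True
            # A model pulled without a tag (e.g. mistral-small) is listed
            # with one (e.g. mistral-small:latest), so match on the prefix.
            if ":" not in req and name.startswith(req + ":"):
                return True
        return False

    return [req for req in required if not is_installed(req)]
```

Calling missing_models(["llava:7b", "mistral-small"], installed_models()) while Ollama is running should return an empty list when both models are in place. Any name it returns still needs an ollama pull.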