Does Ollama support vision features?

by YarvixPA - opened about 20 hours ago

about 20 hours ago

I'd like to ask if Ollama supports vision features, or what exactly I need to do... I've read the details, but I don't quite understand what to do.

bartowski

Owner about 19 hours ago

It does but I'm not sure what their process is.. I've never tried it, if you CAN us lmstudio I'd recommend it, but otherwise hopefully someone else can chime in to comment on the support!

Hopefully it's as easy as pulling the HF model and letting it figure everything out?

ollama

about 12 hours ago

Yes, Ollama supports vision features.

Text Only:
ollama run gemma3:1b

Text+Vision:
ollama run gemma3:4b
ollama run gemma3:12b
ollama run gemma3:27b

FrenzyXL

about 8 hours ago

Yes, but the Vision support seems broken on Ollama with your quants @bartowski . I generally prefer your quants so I can use KL variants, but each and every attempt to use them on Ollama results in an immediate EOF error. @ollama could this be related to a differing vision implementation from llama.cpp?

FrenzyXL

about 8 hours ago

@YarvixPA it depends on if you are using the API or the CLI, but if we assume the CLI you just paste the path to your image at the end of your query, and it should analyze the supplied image

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment