What can I use for an offline, selfhosted LLM client, pref with images,charts, python code execution

catty@lemmy.world · edit-2 13 days ago

What can I use for an offline, selfhosted LLM client, pref with images,charts, python code execution

andrew0@lemmy.dbzer0.com · 13 days ago

Ollama for API, which you can integrate into Open WebUI. You can also integrate image generation with ComfyUI I believe.

It’s less of a hassle to use Docker for Open WebUI, but ollama works as a regular CLI tool.

O_R_I_O_N@lemm.ee · edit-2 13 days ago

ChainLit is a super ez UI too. Ollama works well with Semantic Kernal (for integration with existing code) and langChain (for agent orchestration). I’m working on building MCP interaction with ComfyUI’s API, it’s a pain in the ass.

catty@lemmy.world · edit-2 12 days ago

But won’t this be a mish-mash of different docker containers and projects creating an installation, dependency, upgrade nightmare?

andrew0@lemmy.dbzer0.com · 11 days ago

All the ones I mentioned can be installed with pip or uv if I am not mistaken. It would probably be more finicky than containers that you can put behind a reverse proxy, but it is possible if you wish to go that route. Ollama will also run system-wide, so any project will be able to use its API without you having to create a separate environment and download the same model twice in order to use it.

12 days ago

This is what I do its excellent.

Björn Tantau@swg-empire.de · 13 days ago

!localllama@sh.itjust.works

hendrik@palaver.p3x.de · 13 days ago

Maybe LocalAI? It doesn’t do python code execution, but pretty much all of the rest.

catty@lemmy.world · 12 days ago

This looks interesting - do you have experience of it? How reliable / efficient is it?

Mitex leo@buddyverse.one · 12 days ago

LocalAI is pretty good but resource-intensive. I ran it on a vps in the past.

catty@lemmy.world · edit-2 13 days ago

I’ve discovered jan.ai which is far faster than GPT4All, and visually a little nicer.

EDIT: After using it for an hour or so, it seems to crash all the time, I keep on having to reset it, and currently am facing it freezing for no reason.

otacon239@lemmy.world · edit-2 13 days ago

I also started using this recently and it’s very plug and play. Just open and run. It’s the only client so far that feels like I could recommend to non-geeks.

breadsmasher@lemmy.world · 13 days ago

AUTOMATIC1111?

https://github.com/AUTOMATIC1111/stable-diffusion-webui

ViatorOmnium@piefed.social · 13 days ago

The main limitation is the VRAM, but I doubt any model is going to be particularly fast.

I think phi3:mini on ollama might be an okish fit for python, since it’s a small model, but was trained on python codebases.

catty@lemmy.world · 13 days ago

I’m getting very-near real-time on my old laptop. Maybe a delay of 1-2s whilst it creates the response

Mitex leo@buddyverse.one · 12 days ago

You should try https://cherry-ai.com/ … It’s the most advanced client out there. I personally use Ollama for running the models and Mistral API for advnaced tasks.

catty@lemmy.world · 12 days ago

But its website is Chinese. Also what’s the github?

happinessattack@lemmy.world · 11 days ago

https://github.com/CherryHQ/cherry-studio

Mitex leo@buddyverse.one · 12 days ago

It’s fully open source and free (as in beer).

just_another_person@lemmy.world · 13 days ago

Would like fries or a jetpack with that?

Mitex leo@buddyverse.one · 12 days ago

You should try https://cherry-ai.com/ … It’s the most advanced client out there. I personally use Ollama for running the models and Mistral API for advnaced tasks.