The local-first AI stack, in one trackable index
Ollama, LM Studio, Mistral, llama.cpp + every other self-hosted runtime, UI, framework, and open-weight model family we cover. Follow a platform to surface release updates in your feed.
Inference runtime
3 platforms
Ollama
One-line local model serving with smart quant manifests
macOS · Linux · Windows· MITllama.cpp
The C++ inference engine under most of the local stack
macOS · Linux · Windows · iOS · Android· MITvLLM
High-throughput inference server with PagedAttention
Linux + NVIDIA GPU primary; AMD ROCm + CPU also supported· Apache 2.0
Desktop / web UI
6 platforms
LM Studio
Desktop UI for downloading + running open-weight models
macOS (Apple Silicon + Intel) · Linux · Windows· Proprietary (free for personal use)KoboldCPP
Self-contained local chat + story-writing UI on llama.cpp
macOS · Linux · Windows· AGPL-3.0text-generation-webui
Oobabooga's all-in-one local chat web UI
macOS · Linux · Windows· AGPL-3.0GPT4All
Cross-platform desktop chat client + local model hub
macOS · Linux · Windows· MIT (app) · varies (bundled models)Open WebUI
Self-hosted ChatGPT-style web UI on top of Ollama
Docker / Python · runs anywhere· MITJan
Privacy-first offline AI assistant desktop app
macOS · Linux · Windows· AGPL-3.0
Library / framework
1 platform
Model family
1 platform