Ollama
One-line local model serving with smart quant manifests
0.4.1
Apr 22, 2026
Streaming tool-call deltas for compatible models.
0.4.0
Apr 8, 2026
Native tool-calling support, expanded vision-model catalogue.
0.3.14
Mar 19, 2026
Manifest fallback for Apple Silicon AMX dispatch.
Ollama bundles popular open-weight models behind a single CLI — `ollama run llama3` and the model is downloaded, quantized for your hardware, and exposed on a local API. Built atop llama.cpp with a manifest format that picks the right GGUF variant per machine.
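As a rough illustration of the "exposed on a local API" part, the sketch below sends a single prompt to a locally running Ollama server from Python. It assumes the server listens on the usual default address `http://localhost:11434`, that the `llama3` model has already been pulled, and that the non-streaming `/api/generate` endpoint returns its text in a `response` field. The helper names `build_generate_request` and `generate` are hypothetical and exist only for this example.

```python
import json
import urllib.request

# Assumed default address of a locally running Ollama server.
OLLAMA_HOST = "http://localhost:11434"


def build_generate_request(model, prompt, host=OLLAMA_HOST):
    """Build (but do not send) a POST request for the /api/generate endpoint."""
    # stream=False asks for one JSON object instead of a stream of chunks.
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        f"{host}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model, prompt, host=OLLAMA_HOST):
    """Send the prompt to the local server and return the generated text."""
    req = build_generate_request(model, prompt, host)
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    # Assumption: the generated text is in the "response" field.
    return body["response"]
```

With the server running, `print(generate("llama3", "Say hello in five words."))` would print the model's reply. Building the request separately from sending it keeps the payload easy to inspect without a live server.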