Ollama v0.23.3
What's Changed mlx: refined model push behavior by @dhiltgen in https://github.com/ollama/ollama/pull/15431 test: integration test hardening by @dhiltgen in https://github.com/ollama/ollama/pull/13532 app: harden…
https://github.com/ollama/ollama/releases·tool·24 items·last fetched
What's Changed mlx: refined model push behavior by @dhiltgen in https://github.com/ollama/ollama/pull/15431 test: integration test hardening by @dhiltgen in https://github.com/ollama/ollama/pull/13532 app: harden…
What's Changed mlx: refined model push behavior by @dhiltgen in https://github.com/ollama/ollama/pull/15431 test: integration test hardening by @dhiltgen in https://github.com/ollama/ollama/pull/13532 app: harden…
What's Changed mlx: partial cleanup of imagegen layout by @dhiltgen in https://github.com/ollama/ollama/pull/15435 server: cache show responses by @ParthSareen in https://github.com/ollama/ollama/pull/15967 cmd/launch…
What's Changed `ollama launch` no longer includes Claude Desktop due to the third-party integration being limited to Anthropic models. `ollama launch claude-desktop --restore` still works to restore Claude to it's…
Gemma 4 MTP (Multi-token Processing) for the MLX runner Gemma 4 MTP speculative decoding is now supported on Macs. This can give over a 2x speed increase for the Gemma 4 31B model on coding tasks. ``` ollama run…
Claude Desktop Claude Desktop is now supported with Ollama Launch. Claude Cowork and Claude Code are supported within the Claude Desktop App. ``` ollama launch claude-desktop ``` Claude Cowork Claude Code Claude Code…
What's Changed Updated the Gemma 4 renderer for thinking and tool calling improvements Model recommendations are now updated without updating Ollama Aligned the desktop app's launch page with `ollama launch`…
New models NVIDIA's Nemotron 3 Omni Poolside's first open-weight coding model - Laguna XS.2 Full Changelog: https://github.com/ollama/ollama/compare/v0.21.2...v0.22.0
What's Changed api: accept "max" as a think value by @ParthSareen in https://github.com/ollama/ollama/pull/15787 openai: map responses reasoning effort to think by @ParthSareen in…
What's Changed Improved reliability of the OpenClaw onboarding flow in `ollama launch` Recommended models in `ollama launch` now appear in a fixed, canonical order OpenClaw integration now bundles Ollama's web search…
What's Changed Kimi CLI You can now install and run the Kimi CLI through Ollama. ``` ollama launch kimi --model kimi-k2.6:cloud ``` Kimi CLI with Kimi K2.6 excels at long horizon agentic execution tasks through a…
Hermes Agent ``` ollama launch hermes ``` Hermes learns with you, automatically creating skills to better serve your workflows. Great for research and engineering tasks. What's Changed Gemma 4 on MLX. Added support for…
What's Changed Fix quality of gemma:e2b and gemma:e4b when thinking is disabled ROCm: Update to ROCm 7.2.1 on Linux by @saman-amd in https://github.com/ollama/ollama/pull/15483 Full Changelog…
What's Changed Gemma 4 tool calling ability is improved and updated to use Google's latest post-launch fixes Parallel tool calling improved for streaming responses Hermes agent Ollama integration guide is now available…
OpenClaw channel setup with `ollama launch` What's Changed OpenClaw channel setup: connect WhatsApp, Telegram, Discord, and other messaging channels through `ollama launch openclaw` Enable flash attention for Gemma 4…
What's Changed mlx: Improve M5 performance with NAX gemma4: enable flash attention Full Changelog: https://github.com/ollama/ollama/compare/v0.20.3...v0.20.4
What's Changed Gemma 4 Tool Calling improvements Added latest models to Ollama App OpenClaw fixes for launching TUI Full Changelog: https://github.com/ollama/ollama/compare/v0.20.2...v0.20.3
What's Changed app: default app home view to new chat instead of launch by @jmorganca in https://github.com/ollama/ollama/pull/15312 Full Changelog: https://github.com/ollama/ollama/compare/v0.20.1...v0.20.2
What's Changed bench: add prompt calibration, context size flag, and NumCtx reporting by @dhiltgen in https://github.com/ollama/ollama/pull/15158 model/parsers: fix gemma4 arg parsing when quoted strings contain " by…
Gemma 4 Effective 2B (E2B) ``` ollama run gemma4:e2b ``` Effective 4B (E4B) ``` ollama run gemma4:e4b ``` 26B (Mixture of Experts model with 4B active parameters) ``` ollama run gemma4:26b ``` 31B (Dense) ``` ollama…
Ollama is now powered by MLX on Apple Silicon in preview Ollama on Apple silicon is now built on top of Apple’s machine learning framework, MLX, to take advantage of its unified memory architecture…
What's Changed ggml: force flash attention off for grok by @rick-github in https://github.com/ollama/ollama/pull/15050 mlx: fix KV cache snapshot memory leak by @jessegross in…
Visual Studio Code Microsoft Visual Studio Code now directly integrates with Ollama via GitHub Copilot. If you have Ollama installed, any local or cloud model from Ollama can be selected for use within visual studio…
What's Changed Add extra check to ensure `npm` and `git` are installed before installing OpenClaw Claude Code will now be faster when run locally, due to preventing cache breakages Fix to correctly support `ollama…