Changelog
All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
Unreleased
0.8.41 - 2026-05-23
Changed
- Project renamed to codewhale. The canonical CLI dispatcher is now
codewhale(wasdeepseek) and the TUI runtime iscodewhale-tui(wasdeepseek-tui). The 14 workspace crates are renamed fromdeepseek-*/deepseek-tui-*tocodewhale-*/codewhale-tui-*. The npm wrapper package is nowcodewhale(wasdeepseek-tui). See docs/REBRAND.md for migration notes. - DeepSeek provider integration is unchanged.
DEEPSEEK_*env vars, model IDs (deepseek-v4-pro,deepseek-v4-flash, the legacydeepseek-chat/deepseek-reasoneraliases), thehttps://api.deepseek.comhost, and the~/.deepseek/config directory are all preserved.
Deprecated
- The
deepseekanddeepseek-tuibinary names continue to ship as tiny shims that print a one-line warning and forward argv to the renamed binaries. They will be removed in v0.9.0. - The
deepseek-tuinpm package continues to publish for one release cycle as a no-bindeprecation shim whose postinstall directs users tonpm install -g codewhale. It will be removed in v0.9.0.
Fixed
- Windows CI spillover tests are isolated. Tool-result deduplication tests now use a temporary spillover root guarded by the existing global spillover mutex, removing the shared-state race that made Windows CI fail unrelated PRs (#1943).
- Terminated sub-agents keep
agent_evalrecoverable. Evaluating a completed child session now returns the available transcript result instead of losing the final output (#1738, #1928). - Bare
@/completions no longer freeze the TUI. File-mention completion skips bare separator and dot tokens so Windows/WSL2 workspaces do not trigger an eager 4096-entry filesystem walk on the UI thread (#1921, #1929). - Enter paths avoid synchronous UI-thread waits. Composer history writes, offline queue persistence, feedback URL launching, and clipboard fallback helpers now run off the hot Enter path where appropriate (#1927, #1931, #1940, #1941, #1944).
- tmux and screen sessions stop idling as terminal activity. Terminal multiplexers now force low-motion behavior and pin the fallback footer label so passive animations do not trip activity monitors (#1925, #1942).
- Composer sanitization catches OSC 8 and Kitty fragments. The input sanitizer now strips common hyperlink and keyboard-protocol fragments that leaked into drafts while preserving ordinary prose (#1915, #1933).
- The Work sidebar hides stale completed tasks. Terminal task records older than the current session and outside the recent-completion window no longer crowd active Work sidebar rows (#1913, #1930).
- V4 Pro pricing docs reflect permanent rates. The English, Simplified Chinese, and Japanese READMEs now describe the V4 Pro pricing change as permanent instead of temporary (#1923, #1932).
Thanks
Thanks to OpenWarp (@zerx-lab) for prioritizing codewhale support and collaborating on terminal-agent UX. Thanks to @leo119 for the update-command documentation lineage now preserved through the rename.
0.8.40 - 2026-05-21
Added
- Configurable sub-agent per-step API timeout. A new
[subagents] api_timeout_secssetting in~/.deepseek/config.tomlcontrols how long each sub-agent step will wait on a DeepSeekcreate_messageresponse before falling back. The value is clamped to1..=1800;0or unset preserves the legacy 120-second default, so existing installs see no behavior change. Long-thinking children (e.g. heavy plan or review work behindagent_open) can extend the timeout without recompiling (#1806, #1808). - Delegated file-write permissions for write-capable sub-agent roles.
implementerandcustomsub-agents may now runSuggest-level write tools (write_file,edit_file,apply_patch) without the parent runtime being auto-approved. Read-only stances (explore,plan,review,verifier) and the defaultgeneralrole still bounce approval-gated tools so they can't quietly mutate the workspace, andRequired-level tools (shell, etc.) still need parent auto-approve regardless of role. Pickimplementer(or pass an explicitcustomallowlist) when the delegated task needs to land file changes (#1828, #1833). - Experimental Fin fast-lane tool agents.
tool_agentopens a durable child session on DeepSeek V4 Flash with thinking forced off for simple tool-bound work such as OCR, file/search lookups, fetches, and command probes. It uses the existingagent_eval/agent_closelifecycle and mailbox token-usage stream, so sub-agent cost accounting stays on the same path as normalagent_opensessions.
Fixed
- WSL2 and headless Linux startup no longer blocks on clipboard init. The TUI now defers clipboard initialization so machines without an X server can reach the first frame instead of hanging on a blank screen (#1773, #1772).
- Windows alt-screen output stays clean when
RUST_LOGis set. Runtime tracing is routed away from the interactive buffer so logs no longer leak into the TUI display (#1774, #1776). - OpenAI-compatible custom model names are preserved. Non-DeepSeek providers now pass explicit model names through instead of rewriting them to a DeepSeek default (#1714, #1740).
- Wanjie Ark is a first-class provider.
--provider wanjie-ark, the TUI provider picker,deepseek auth, doctor, and config files now target Wanjie's OpenAI-compatible MaaS endpoint with pass-through model IDs and Wanjie-specific env vars. - DeepSeek reasoning replay works through OpenAI-compatible endpoints.
DeepSeek models selected under the generic
openaiprovider now replay priorreasoning_contentconsistently and classify streamed reasoning the same way the replay path does (#1694, #1739, #1743). - Thinking-only turns no longer disappear. If a clean turn ends with thinking but no final answer text, the UI now surfaces a clear status instead of silently ending the turn (#1727, #1742).
- Windows
cmd /Cpreserves quoted shell arguments. Commands such asgit commit -m "feat: complete sub-pages"now round-trip through the Windows shell wrapper without losing the quoted message (#1691, #1744). - Home/End are line-local inside multiline composer drafts. The keys now jump to the current input line boundary before falling back to transcript navigation (#1748, #1749).
- Ctrl+C restores the canceled prompt reliably. Canceling a streaming turn puts the submitted prompt back in the composer and suppresses late stream events from drawing stale output (#1757, #1764).
- Compaction recovers from cache-aligned summary context overflow. When a cache-preserving summary request itself exceeds the provider context window, compaction retries with the bounded formatted summary path instead of failing with a 400 "compression command failed" style error.
- Terminal sub-agent sessions expose full transcript handles. Completed
and canceled child agents now store the full child message transcript behind
transcript_handle, so the parent can inspect details withhandle_readinstead of relying only on a lossy summary (#1738). - Forked saved sessions now keep visible lineage.
deepseek forkrecords the parent session id and fork-time message count in additive metadata, and session listings mark forked paths with their source id. This gives users a bounded branchable-conversation workflow while the larger visual tree browser stays scoped for a future release. - Repeated shell wait rows collapse in the Tasks sidebar. Multiple live
task_shell_waitpolls for the same background job now render as one row with an explicit collapsed-wait count, reducing the stuck-task appearance tracked for v0.8.40 (#1737). - Leaked mouse scroll reports no longer erase composer draft suffixes. If
a terminal delivers raw SGR mouse bytes into the input stream, the sanitizer
now strips only the mouse report and adjacent coordinate fragments instead
of deleting legitimate draft text such as
commit -mor numeric prompts (#1778). - TUI runtime logs are separated per process and pruned on startup. Each
session now writes
~/.deepseek/logs/tui-YYYY-MM-DD-PID.log, and startup removes stale TUI logs older than seven days by default. SetDEEPSEEK_LOG_RETENTION_DAYSto a positive day count to adjust retention (#1782, #1784). - The offline eval harness preserves quoted Windows shell payloads. Its
exec_shellstep now uses the same single-payload shape as the runtime shell path, with rawcmd /Carguments on Windows so quoted commands remain intact (#1779). - The Feishu/Lark bridge recovers better after restarts. It now reattaches to persisted active turns after the long-connection client starts, and text chunking no longer splits emoji or other multi-code-unit characters.
- RLM survives non-UTF-8 stdout.
rlm_evalnow decodes REPL stdout lossily instead of treating a single invalid byte as a fatal crash, so binary-adjacent diagnostics can still return a bounded result (#1815, #1819). - Small UI/review reliability fixes landed with the stability branch.
/clearnow resets all displayed cost state, grayscale theme previews avoid luma overflow,/themepicker arrow navigation wraps at the list edges, and encoded JSON review output is parsed before display. - New-file writes execute on the first Agent-mode call.
write_filenow stays preloaded in Agent mode, so creating a file no longer stops at the deferred-tool schema hydration message before the normal approval/execution path (#1825, #1841). - Saved sessions keep the selected model mode. Changing from
autoto a concrete model now updates existing session metadata, and resumed sessions recompute theautoflag from the saved model instead of falling back to the startup default. - The
/modelpicker persists thinking effort across restarts. Selecting Pro/Flash plushigh/max/autonow writes bothdefault_modelandreasoning_efforttosettings.toml, and startup restores the saved effort before falling back toconfig.toml. - The footer water strip is visible by default again.
fancy_animationsnow defaults totrue, whileNO_ANIMATIONS, SSH/Termius, VS Code, Ghostty, and legacy terminal overrides still disable the animated strip where it is known to flicker. - Screenshots are readable without extra setup on macOS.
image_ocrnow uses the native Vision framework on macOS when Tesseract is absent, andread_fileroutes screenshot/image reads through the same OCR path. Pasted clipboard screenshots saved under~/.deepseek/clipboard-imagesare trusted automatically for read-only tools. - Auto-routing context no longer leaks hidden thinking. The model/router
context summary now excludes
ContentBlock::Thinking, so prior internal reasoning is not reintroduced as if it were visible user or assistant text.
Changed
- Slash-command autocomplete ranks exact alias matches first. Typing
/qnow surfaces/exit(whose aliasqis an exact match) above/clear(which only matches by the longer pinyin aliasqingping). Within each rank tier the menu still falls back to alphabetical name order for deterministic display (#1811). - CNB mirror preflight covers stability-release branches. The CNB sync path now recognizes the v0.8.40 stability branch shape before release tags exist, making the Tencent Lighthouse/Lark deployment path easier to verify before publishing.
Thanks
Thanks to jayzhu (@zlh124) for the WSL2
startup report and clipboard-init fix in #1772/#1773. Thanks to Paulo Aboim
Pinto (@aboimpinto) for the Windows
alt-screen logging report and fix in #1774/#1776, and for the Home/End
composer work in #1748/#1749, plus the per-process log filename follow-up in
#1782/#1783. Thanks to Zhongyue Lin
(@LeoLin990405) for the provider model
passthrough, reasoning replay, thinking-only turn, and Windows quoting fixes
in #1740, #1743, #1742, and #1744. Thanks to Nightt
(@nightt5879) for the Ctrl+C prompt restore
fix in #1764. Thanks to Ling (@LING71671;
commits as www17 <ivonrust@gmail.com>) for the configurable sub-agent API
timeout in #1808 and the Agent-mode write_file preload fix in #1841,
harvested with 1..=1800 clamping and a fail-fast guard so a stray
api_timeout_secs = 0 keeps the legacy 120-second default.
Thanks to @knqiufan for the sub-agent
file-write delegation work in #1833, harvested with structured approval-
gate semantics (Implementer and Custom only, never Required-level
tools) so write-capable children can actually land code without bypassing
the Required approval class. Thanks to @IIzzaya
for the exact-alias-first slash-completion ordering idea in #1811, landed
with a focused regression test. Thanks to Bevis and the community reports
that surfaced the compaction failure mode addressed in this release. Thanks to
Reid (@reidliu41) for the grayscale theme
overflow report and /theme picker edge-wrapping patch in #1814.
0.8.39 - 2026-05-17
Fixed
- Feishu/Lark bridge startup order is guarded. The bridge now keeps
ThreadStoreinitialized before startup opens persisted thread state, with a regression test to prevent moving it below its first use. /modelpicker opens instantly with the curated list again. Reverted the v0.8.38 live-catalog rework: the picker no longer makes a blocking network call on open and once again shows the curatedauto/deepseek-v4-pro/deepseek-v4-flashrows. The/modelscommand still lists the live provider catalog.- "Approve for session" groups by command family again. Session approvals
are keyed by a lossy, arity-aware fingerprint once more, so approving
cargo buildalso coverscargo build --release. Denials keep the exact per-call fingerprint from #1617, so denying one call no longer over-blocks later, different calls to the same tool. - Docker first-run state directories are writable. The image now
pre-creates
/home/deepseek/.deepseekwithdeepseekownership so the documented named-volume launch can create runtime thread state on first use (#1684). - Runtime API system prompt overrides survive the first turn. Threads
created with a
system_promptoverride now keep that prompt through mode/context refreshes before the model request is built (#1688). - Compaction keeps a user text query in tool-heavy histories. Automatic compaction now pins the latest user text message when the retained tail only contains tool calls/results, avoiding OpenAI-compatible Jinja template failures on the next request (#1704).
- Pager jumps land at the visible bottom. Pressing
Gor End in the pager no longer overshoots the render clamp, sok/Up scrolls upward immediately afterward, and mouse wheels now scroll pager overlays directly (#1706, #1716). - Mouse-wheel-as-arrow scrolling preserves composer drafts. When
composer_arrows_scrollis enabled, Up/Down now scroll the transcript even with text in the composer instead of replacing the draft with input history (#1677). - Multiline composer arrows move between input lines. Plain Up/Down now move the cursor within multiline drafts before falling back to input history, while single-line mouse-wheel-as-arrow scrolling remains unchanged (#1721).
- Third-party
reasoning_contentstreams no longer corrupt text output. Generic OpenAI-compatible providers that stream answer text inreasoning_contentnow render it as normal text unless the selected provider is one whose reasoning-content semantics are supported (#1673). - macOS system theme detection recognizes Light mode. When
COLORFGBGis missing or unusable,theme = "system"now falls back to macOS appearance detection and treats a missingAppleInterfaceStylekey as Light mode (#1670). rlm_openaccepts schema-filled blank source fields. Emptyfile_path,content, andurlstrings now count as absent, so calls that provide one real source no longer fail the exactly-one-source validator (#1712).- Resize keeps transcript paging usable immediately. After a terminal resize, PageUp/PageDown now use the resized viewport height instead of falling back to one-line jumps before the next render (#1724).
- ACP responses stringify JSON-RPC ids.
serve --acpnow returns string ids even when clients send numeric ids, matching Zed's stricter ACP client expectations (#1696).
Thanks
Thanks to Matt Van Horn (@mvanhorn) for the
Docker first-run permission fix in #1699 and the runtime system-prompt
regression tests harvested from #1702. Thanks to Kristopher Clark
(@krisclarkdev) for the compaction
user-query preservation fix in #1704. Thanks to Stephen Xu
(@wlon) for the pager jump-bottom fix in #1706.
Thanks to tdccccc (@tdccccc) for the
composer scroll fix in #1715 and pager mouse-wheel support in #1716.
Thanks to Paulo Aboim Pinto
(@aboimpinto) for the multiline composer
arrow navigation tests harvested from #1719. Thanks to LittleBlacky
(@LittleBlacky) for the provider-gated
reasoning_content stream fix in #1680.
Thanks to Eosin Ai (@Aitensa) for the macOS
system appearance fallback in #1674.
Thanks to Anaheim (@AnaheimEX) for the
rlm_open schema validation report in #1712.
Thanks to THatch26 (@THatch26) for the
terminal resize paging fix in #1724.
Thanks to Alvin (@alvin1) for the Zed ACP id
compatibility report in #1696.
0.8.38 - 2026-05-15
Changed
- Update guidance is clearer on the website. The homepage and install page
now surface
deepseek updatewhile keeping package-manager update paths visible for Homebrew, npm, and Cargo installs. - README setup docs are current again. The English, Simplified Chinese, and Japanese READMEs now use the current Docker volume/workspace invocation, document update paths, list the current provider/model switching surface, and send release-specific feature notes back to the changelog.
Fixed
- OpenAI-compatible providers receive stricter request bodies. Fireworks
requests now use
reasoning_effortwithout the DeepSeek/Anthropic-style top-levelthinkingfield, and chat tool schemas no longer include Anthropic-only metadata such asallowed_callers,defer_loading, orinput_examples(#1592). - pnpm global installs no longer hang in optional postinstall. pnpm postinstall now skips install-time binary downloads and leaves the existing runtime downloader to verify or fetch binaries on first run (#1637).
- Terminal modes are restored on early TUI exits. A cleanup guard now restores raw mode, alternate screen, focus events, mouse capture, bracketed paste, keyboard flags, and cursor visibility if startup returns early after terminal initialization (#1593, #1582).
- Wrapped OSC 8 links keep their full target. Long clickable URLs now reopen the original full link target on each wrapped visual chunk instead of exposing truncated hyperlink targets (#1577).
- Provider-selected models survive startup and picker reselects. The
/modelpicker now uses live provider model catalogs when available, saved default providers sync into the runtime config before the first request, and reselecting the active provider from the picker keeps the current model instead of falling back to the provider default (#1632). - OpenAI-compatible batch tool calls keep all start events. Streaming
responses with multiple
tool_callsin one assistant message now preserve every tool-use block instead of pairing many tool results with only the last tool start event (#1686). - Diagnostics tool schemas include an empty
requiredlist. The built-indiagnosticstool now sendsrequired: []with its empty object schema so DeepSeek no longer rejects it as a null required array (#1685). - Windows wheel-as-arrow scrolling works with mouse capture enabled.
composer_arrows_scrollnow defaults on for Windows terminals even when mouse capture is enabled, so wheel events that arrive as arrow keys scroll the transcript instead of cycling composer history (#1578). - Plain Windows PowerShell / ConHost uses calmer rendering. Unmarked
legacy Windows console hosts now automatically enable low-motion rendering,
disable fancy animations, and resolve
synchronized_output = "auto"to off so streaming redraws do not overlap or visibly flicker (#1590). - LoopGuard blocks now count as failed tool calls. Identical tool-call blocks now return a failed tool result instead of a success, so repeated blocked checklist/tool retries can trip the existing failure warning and halt path instead of spinning indefinitely (#1574).
- Denied tool approvals are scoped to the exact call. Denying one write/shell approval now caches the canonical argument fingerprint instead of a lossy tool/prefix key, so later calls to the same tool with different arguments can still be reviewed and approved (#1617).
Thanks
Thanks to DC (@duanchao-lab) for the
terminal cleanup-guard idea harvested from #1630, and imkingjh999
(@imkingjh999) for the provider/model
switching fixes harvested from #1642. Thanks to Photo
(@eng2007) for the provider-aware /model
picker catalog work harvested from #1201. Thanks to hexin
(@h3c-hexin) for the OpenAI batch tool-call
streaming fix in #1686. Thanks to chennest
(@chennest) for the diagnostics schema report
in #1685. Thanks to
@kunpeng-ai-lab for the Windows
composer scroll fix harvested from #1578, and WuMing
(@asdfg314284230) for the Windows
PowerShell flicker fix harvested from #1591. Thanks to
@maker316 for the LoopGuard/checklist loop
report in #1574. Thanks to lalala
(@lalala-233) for the approval denial
regression report in #1617, and Nightt
(@nightt5879) for the exact-call approval
key work harvested from #1624.
0.8.37 - 2026-05-14
Added
- Tencent Lighthouse + Feishu/Lark bridge setup. Added a
/opt/whalebroLighthouse runbook, systemd deploy assets, a long-connection Feishu/Lark bridge, a bridge config validator, and a VPS doctor for runtime, Node, binaries, env, systemd, and localhost health checks. - Tencent Cloud remote-first onboarding. Documented the CNB + Lighthouse + Feishu/Lark + optional EdgeOne teaching path and added non-active CNB deploy templates for a future Lighthouse deploy button. Feishu/Lighthouse branches are now mirrored to CNB for Tencent-first bootstrap.
- Homebrew tap automation is release-gated. The release workflow can update
Hmbown/homebrew-deepseek-tuifrom the checksum manifest when a tap token is configured, and skips cleanly before downloading release assets when no tap token exists.
Changed
- Bing is the default
web_searchbackend. DuckDuckGo remains selectable with[search] provider = "duckduckgo"and keeps its Bing fallback path.
Fixed
- First-run onboarding stays usable without an API key. Missing-key startup no longer aborts the TUI before onboarding can collect provider settings.
- Streamable HTTP MCP sessions keep their server-issued session ID. Custom headers also apply to GET preflight requests, fixing authenticated MCP servers that require both.
- DeepSeek model completions use canonical IDs. Alias completions now resolve to stable DeepSeek model names before being written to config.
- Terminal and child-process reliability is tighter. Signal shutdown now restores the terminal, child tasks preserve proxy environment variables, and Windows Enter / CSI-u input handling avoids the prior event mismatch.
- Long terminal text wraps instead of overflowing. Streaming output, diff rendering, and the pager now hard-wrap overlong no-whitespace and CJK runs.
- Release and platform edges are safer. The TUI no longer trips the Windows Instant-underflow test path, unsupported desktop targets compile the external URL opener, and legacy DeepSeek CN provider aliases deserialize to the canonical DeepSeek provider.
- Footer diagnostics are less cryptic. Prefix-cache stability is no longer
shown in the default footer, and the opt-in
/statuslinechip now sayscache prefix 100%instead of the ambiguousP 100%. - Feishu/Lark bridge dependency installs are locked and audited. The
bridge now ships a package lock, installs with
npm cion Lighthouse when available, and overrides the Lark SDK's transitiveaxiosdependency to a patched line. - China-friendly update fallback.
deepseek updatenow supports mirrored release assets throughDEEPSEEK_TUI_RELEASE_BASE_URLplusDEEPSEEK_TUI_VERSION, and its network-failure hints point users behind GitHub-blocking networks to the CNBcargo install --gitpath for both shipped binaries. - CNB is the default Tencent release-candidate mirror. The CNB sync workflow now mirrors Feishu/Lighthouse release branches, so Tencent Lighthouse bootstrap can use CNB before the release branch merges.
Thanks
Thanks to ZzzPL (@Oliver-ZPLiu) for the MCP Streamable HTTP and Homebrew automation fixes (#1643, #1631), Reid (@reidliu41) for CI, streaming wrap, and model-completion fixes (#1603, #1628, #1601), MidoriKurage (@mdrkrg) for the onboarding crash fix (#1598), Gordon (@gordonlu) for the Windows Enter / CSI-u fix (#1612), Aitensa (@Aitensa) for the CJK diff/pager wrap fix (#1622), qiyan233 (@qiyan233) for legacy DeepSeek CN provider aliases (#1645), jieshu666 (@jieshu666) for the repaint-flicker reduction (#1563), Vishnu (@Vishnu1837) for terminal restoration on signals (#1586), and axobase001 (@axobase001) for proxy environment preservation in child tasks (#1608).
0.8.36 - 2026-05-14
Added
- The right sidebar can be hidden for copy-friendly terminals.
sidebar_focus = "hidden"(orCtrl+Alt+0for the current session) removes the Work/Tasks/Agents/Context rail so raw terminal selection cannot copy sidebar borders alongside transcript text.
Changed
- Sub-agent completion handoffs are leaner and more cache-friendly.
Internal
<deepseek:subagent.done>sentinels now point to the preceding human summary line instead of duplicating the summary, elapsed time, and step count inside JSON sent to the parent model. - Prefix stability is visible beside cache telemetry by default. The footer now includes the prefix-stability chip in the default status layout, and low last-request cache hit rates are no longer colored as hard errors when the system/tool prefix itself is stable.
- RLM batch helpers now require an explicit independence assertion.
sub_query_batch,sub_query_map, and low-level*_batchedhelpers refuse dependency-unsafe parallel fanout unless callers passdependency_mode="independent", and RLM now exposessub_query_sequencefor A-to-B dependent work.
0.8.35 - 2026-05-13
A post-0.8.34 cleanup release focused on prompt hygiene, context-pressure guidance, and keeping the next release branch clearly separated from the already-published v0.8.34 tag.
Note on v0.8.34 contributor credits: Horace Liu (@liuhq) contributed Nix package support and install documentation in the v0.8.34 cycle but was inadvertently omitted from that release's changelog. The README contributor list and this note correct the record.
Changed
- First-turn prompt context is leaner and easier to audit. The
generated project context pack now ignores hidden tool/cache state,
balances top-level directories before descending, and
/contextshows named prompt layers instead of a single opaque system blob. - Model-visible prompt policy de-conflicted. The base and mode
prompts no longer forbid useful
deepseekCLI diagnostics, no longer require checklists for simple one-step work, and align long-session compaction guidance around the 60% suggestion threshold. - Context-pressure guidance now has one split rule. Manual
/compactsuggestions start around 60% during sustained work, while automatic replacement compaction remains an opt-in hard guardrail near 80% so DeepSeek V4 prefix-cache economics stay intact. - The Tasks sidebar now ages out stale live-tool noise. Completed active tool rows linger briefly and then leave the right rail; very old running shell rows collapse to a single row instead of occupying the whole Tasks panel.
Fixed
auto_compactsettings help now reports the real default, which has been off since v0.8.11 to avoid unnecessary cache-prefix rewrites.
0.8.34 - 2026-05-13
A polish, terminal-protocol, and internal-cleanup release. The model-facing surface is stable; this cycle focused on prefix-cache stability metrics, broader terminal protocol coverage, bundled skills, and shrinking the mega-files that had grown around the agent loop and TUI.
Added
- Prefix-cache stability tracking. A footer chip surfaces how stable the cached prefix has been across recent turns (inspired by Reasonix), so users can spot cache-busting edits before cost climbs.
- Bundled DeepSeek-native workflow skills. A starter set of skills
ships in-binary so a fresh install has a usable
/skillscatalog without external assets. - Native Kitty + Ghostty notification protocols.
OSC 99(Kitty) andOSC 777(Ghostty) are now first-class alongside the existing desktop notification fallback. - Theme picker with more presets. Catppuccin, Tokyo Night, Dracula,
and Gruvbox join the built-in palette set;
/themenow shows a live picker. - Chunked parallel-safe tool execution. The engine batches side-effect-free tool calls into a chunked dispatch so independent reads/searches finish in one turn instead of serialising round-trip by round-trip.
- Cancel-all shell jobs. A single action stops every running background shell command instead of cancelling them one-by-one.
edit_filetolerates typographic punctuation drift. When the exact-match and leading-whitespace-fuzzy passes both fail andfuzz: trueis set, the tool retries with smart quotes ("/"→",'/'→'), en/em-dashes (–/—→-), and non-breaking spaces (U+00A0 → space) normalized to ASCII. Catches the copy-paste failure mode where a browser or chat client substituted Unicode punctuation for the ASCII the file actually contains.
Changed
crates/tui/src/tui/ui.rssplit into focused modules. The former 10k-line single-file TUI dispatcher is decomposed into smaller modules with clearer responsibilities so reviewing a UI change does not require holding the entire surface in head.crates/tui/src/core/engine.rsreduced. Helper clusters moved into the existingcore/engine/submodule directory next to the turn loop and tool execution code, making the agent-loop core easier to read end-to-end.- Structured tracing on tool dispatch. Tool entry, exit, duration,
and result/error are emitted through
tracingevents soRUST_LOG=engine.tool_execution=debugproduces a coherent timeline instead of scattered ad-hoc prints. /initupdatesAGENTS.mdin place instead of refusing when the file already exists, so adding new project guidance does not require manual stitching.- Reasoning tokens included in cost calculations, and the cost
display auto-switches to CNY when the session locale is
zh-Hans. - Stale repo-root development docs removed.
TAKEOVER_PROMPT.md(v0.8.6 era),PROMPT_ANALYSIS.md, and the redundantDEPENDENCY_GRAPH.mdno longer ship in releases;docs/ARCHITECTURE.mdremains the canonical crate-layout reference.
Fixed
- Auth keys checked against the saved provider on startup, so a stored DeepSeek key is no longer rejected after switching providers mid-session.
- Auto router skipped for decisive local routes, removing an extra model round-trip on prompts the dispatcher can route directly.
- Reasoning content stripped for generic providers that do not
understand the
reasoning_contentfield, preventing HTTP 400s when pointing at an OpenAI-compatible gateway that lacks DeepSeek thinking semantics. FocusGaineddebounced so terminals (Tabby) that emit rapid focus events no longer trigger a repaint flicker loop.- MCP HTTP transport defaults
Accept: application/json, text/event-streamand persistsMcp-Session-Idacross requests, matching the spec for resumable streams. - Shell output tail preserved when truncating, so the last lines of a long command output (usually the error trailer) survive the in-transcript summary.
- Prefix cache preserved while pruning tool results. Old side-effect tool payloads no longer invalidate the prefix that the next turn would otherwise reuse.
- Review sub-agents prevented from spawning further sub-agents (#1489), keeping recursive depth bounded.
- Help overlay closes cleanly and repaints without a stale frame.
- Pinyin
/skillsalias dispatched correctly so Chinese-locale users reach the same surface. - VTE flicker terminals get reduced motion by default to avoid thrashing on terminals that mishandle frequent partial redraws.
- Composer border no longer shows the derived session title, keeping the composer chrome reserved for editor and mode state.
0.8.33 - 2026-05-12
A sub-agent and RLM renovation release. The model-facing delegation
surface is now session-oriented instead of one-shot: RLM work happens
through rlm_open / rlm_eval / rlm_configure / rlm_close,
sub-agent work happens through agent_open / agent_eval /
agent_close, and large outputs can be parked behind typed handles
that the model reads back explicitly with handle_read.
Added
- Persistent RLM sessions with bounded REPL helpers. RLM prompts now
use
peek,search,chunk,context_meta,sub_query,sub_query_batch,sub_query_map,sub_rlm, andfinalize(value, confidence)instead of exposing the full parent context as an ambient variable. - Fork-aware sub-agent sessions.
agent_opensupports named sessions,fork_context, and bounded recursive depth so the parent can ask for multiple perspectives while preserving prompt-cache-friendly prefix context where available. - Shared
handle_readstorage. RLM finals, sub-agent transcripts, and other large structured results can returnvar_handlereferences with slice, range, count, and JSONPath projections. - Slash-command routing for the new surface.
/rlm [N] ...and/agent [N] ...now prompt the assistant to use the persistent tools instead of the removed foreground RLM operation. - Harness-friendly non-interactive exec sessions.
deepseek execnow supports--resume,--session-id,--continue, and--output-format stream-jsonso backend wrappers such as ClawBench can keep conversation state and parse one JSON event per line without running a long-lived server. /relayslash command with CJK aliases (/接力). Hands the assistant a structured handoff prompt for coordinated multi-turn continuation across sessions.checklist_writesidebar rename. The sidebar focus tab formerly known as "Plan" / "Todos" is now "Work" — one panel for the active checklist and optional plan, consistent across all three modes.- Grayscale theme.
/theme grayscaleand/set theme grayscale --saveprovide a low-opinion black/white palette for users who want less brand color in the terminal.
Changed
- Prompts and docs now teach only the new tool names. Legacy RLM/sub-agent helpers remain internally where needed for durable transcript compatibility, but the registry exposes the session tools.
- Large or noisy tool results are easier to keep out of context.
Tool output summaries, sub-agent results, and transcript snapshots now
point the model toward
handle_readwhen it needs raw detail. - Tool-surface smoke guidance is explicit. Release checks now document
the exact version commands and registry-name searches for
handle_read, persistent RLM tools, and persistent sub-agent tools. - README acknowledgements expanded. The project thanks OpenWarp and Open Design for support and collaboration around terminal-agent and design-forward workflows.
- Light theme tuned for calmer contrast. The canvas, panel, elevated, border, and selection tokens now separate surfaces without the washed-out white-on-white feel.
- Session picker is history-first.
/sessionsandCtrl+Rnow show the full selected session history on the left with the session list on the right; number keys1-9open visible session histories,PgUp/PgDnscroll that history, andEnterstill resumes. - Foreground RLM operation removed. The old
Op::Rlmpath and itshandle_rlmengine method are gone; all RLM work now flows through the persistent-session tools. - Stale competitive-analysis doc removed. The old cross-agent matrix had become an unreliable inventory of tool names rather than useful release guidance.
Fixed
- Local/custom endpoints stay prompt-free when auth is optional. The dispatcher no longer reads the secret store for SGLang, vLLM, Ollama, or loopback custom URLs unless API-key auth is explicitly requested, and the direct TUI treats loopback model endpoints as no-key by default. This avoids macOS Keychain prompts and stale DeepSeek keys when users point the app at local OpenAI-compatible servers.
- Transcript browsing stays put across resizes. If the user is reading older chat history, terminal resize events preserve the current transcript position instead of jumping back to the live tail; the scrollbar and jump-to-latest affordance now follow the active theme.
- Backtrack preview opens near the selected turn. Pressing Esc twice no longer opens the live transcript preview at the oldest conversation line; the highlighted recent user turn is pinned into view, and changing the backtrack target re-pins only that selection.
- Completed thinking no longer masquerades as prompt text. Collapsed
completed reasoning now shows only explicit
Summary:content inline; raw reasoning remains available through Ctrl+O/transcript instead of appearing as assistant self-talk in the main flow. When Ctrl+O starts from a reasoning block, it opens a full-session reasoning timeline instead of a single isolated chunk. - Transcript selection keeps working while the agent is streaming. The loading-state mouse filter now drops inert move events but allows active transcript and scrollbar drags to continue (reported as a known issue in v0.8.32).
- Empty-composer arrow scrolling feels less twitchy. When configured to scroll the transcript, plain Up/Down now move by a small wheel-like step instead of a single-line flick.
- Mouse and trackpad scrolling feel less sticky in long logs. Rapid same-direction transcript scrolls now get bounded acceleration while direction changes reset to precise single-line movement.
- RLM smoke-test papercuts fixed.
rlm_evalnow bindscontentas a convenience alias for_context, tolerates commontimeout_secskeyword guesses on child-query helpers while preserving session-level timeout policy, and stores JSON-serializablefinalize(...)values as JSON handles sohandle_readcan project them directly. - RLM REPL uses the shared Python resolver. RLM startup now tries
python3,python, andpy -3, matching the dependency resolver used by code execution and avoiding Windows failures wherepython3is absent (harvested from PR #1540). - Session titles and history previews hide metadata noise. Saved
session titles and the picker history strip leading
<turn_meta>envelopes and thinking-tag blocks so historical conversations read like user-visible chat rather than prompt plumbing (harvested from PR #1510). - Companion binary version smoke is unambiguous.
deepseek-tui --versionnow reports thedeepseek-tuibinary name instead of the dispatcher label. - Vision path boundary test is platform-native. The absolute-path
rejection smoke uses a Windows absolute path on Windows and
/etc/hostselsewhere (harvested from PR #1526). - Tool papercuts:
file_searchhas safer default excludes and an explicitexcludeoption;grep_filesreturns single-line context as strings;fetch_urlcan project JSON fields and returns headers;edit_filecan opt into leading-indentation fuzz;exec_shellcan merge stdout/stderr in chronological order;revert_turnrejects no-op snapshot boundaries. - CLI reasoning-effort honoured on non-auto exec routes (PR #1511
from @h3c-hexin).
deepseek -p "..." --reasoning-effort highnow applies the flag correctly instead of falling back to the config-file default. - Edit-file replacement boundaries clarified (PR #1516). The tool
description and error messages now make it unambiguous that
edit_fileis for one clear replacement in one file. - Pandoc output validated before probing (PR #1523). Binary-format conversions that produce empty or invalid output now surface a clear error instead of a confusing pandoc stack trace.
- Running turns can be steered and repainted (PR #1533, #1537). Composer input during an active turn no longer stalls; the TUI redraws the transcript as the agent streams.
- Tasks and Activity Detail are calmer under load. The Tasks panel now keeps live/background/recent activity from double-counting the same shell or RLM work, groups repeated read/search/checklist noise, and keeps failures, status, command summaries, and durations visible. Ctrl+O now opens Activity Detail for selected/live/recent tool work and the reasoning timeline for thinking blocks, while Alt+V remains the direct tool-detail pager; the idle footer now advertises that split for the visible activity.
- npm retry shows timeout hint on first failure (PR #1538). Installations behind slow proxies now see a clear "retrying" message instead of a silent hang.
- Issue templates improved (PR #1525 from @reidliu41). Bug and feature-request templates are clearer and easier for new contributors.
Credits
Thanks to @reidliu41 (#1525/#1526), @h3c-hexin (#1511), @xulongzhe (#1530/#1544), @tyouter (#1510), and @Duducoco (#1540) for community contributions in this release.
0.8.32 - 2026-05-12
A "more useful tools" release. v0.8.31 made the tool surface
reliable on every host; v0.8.32 expands it. Anchor is the question
every new contributor asks: "what does the model actually have to
work with?" — and the answer is now closer to "everything you'd
reach for from a shell, including the document formats the real
world uses." Five new tools (pdf-extract swap, js_execution,
pandoc_convert, image_ocr, image_analyze), ten community PR
harvests targeting model-protocol bugs (vLLM thinking) and UX
papercuts (Shift+Enter on Windows VSCode, mention truncation
splitting CJK codepoints, approval modal hiding the transcript),
and a snapshot-self-disable on workspaces over 2 GB of
non-excluded content so first-turn git add -A no longer hangs
the TUI on multi-hundred-GB project directories.
Performance
- Move
instructions = [...], user memory, and session goal below the prompt's volatile-content boundary so DeepSeek's KV prefix cache survives mid-session edits (harvested from PR #1345 by @Duducoco). Before this change, the per-workspaceinstructionsblock, the user memory file (/memory), and the current session goal (/goal) were rendered at position 2.5 in the system prompt — inside the static prefix layer that the cache hits. Any edit to those files (or any# fooquick-add to memory) busted the cached prefix from that byte onwards, forcing the next turn to re-tokenize the rest of the static layer. Relocating them to position 6 (immediately above the previous-session handoff block) means the cache hit covers the entire static prefix — mode, project context, env, skills, context management, compact template — regardless of how often the user edits their memory file. Skills, context management, and the compact template stay always-cacheable in the static layer where they belong.
Removed
- Shift-to-bypass-mouse-capture is gone. The #376 escape-hatch
feature (hold Shift while moving the mouse → temporarily disable
alt-screen mouse capture so terminal-native text selection works,
then re-enable on release) was causing visible scroll/redraw
thrash: every Shift transition flipped the mouse-capture mode AND
pushed a status toast ("Native selection — release Shift to
return" / "Mouse capture restored"). On modern terminals that
honor
xterm-modifyOtherKeysthe toast cycle fired on stray Shift events and produced what users described as a "scroll demon." Removing the bypass path entirely: text selection in alt-screen sessions now goes through the same path as any other TUI (your terminal's modifier-bypass — typically Option/Alt on macOS, Shift in some Linux terminals — still works at the terminal level, this just stops us from second-guessing it).
Fixed
- Tool-result spillover and wire-dedup now share a retrieval
namespace, so
retrieve_tool_resultfinds what the model was pointed at. Two systems used to mint reference blocks independently — disk spillover keyed by tool-call id (~/.deepseek/tool_outputs/<id>.txt, only above 100 KB) and the Chat-Completions wire compactor that replaced repeated tool results with<TOOL_RESULT_REF sha="…"/>(any size, keyed by SHA256 of the content). The SHA refs were impossible to retrieve:retrieve_tool_result ref=<sha>looked in the tool-call-id directory and 404'd. Worse, the wire dedup fired on tiny outputs (a 65-bytegh run view --jsoncould turn into a ref the model then chased through three guesses), and the[artifact: …]block emitted byapply_spillover_with_artifactshowedid:(withart_prefix),path:(with a slash), andtool_call_id:separately with no indication of which oneretrieve_tool_resultaccepted. Reported by users hitting "spilled tool result was not found" 4–5 times per session while polling CI runs.- The wire compactor now persists deduped content to
~/.deepseek/tool_outputs/sha_<sha>.txton first sighting, and only dedupes outputs ≥ 1 KiB — tiny results stay inline on both occurrences instead of becoming a ref the model has to chase. The<TOOL_RESULT_REF>block grew a literalretrieve: retrieve_tool_result ref=sha:<sha>line. retrieve_tool_resultlearns five new ref shapes:sha:<64-hex>, bare 64-hex,art_<tool_call_id>,artifacts/art_<id>.txt, and absolute paths anywhere under the session-artifact root. The lookup tries the legacy spillover dir, the current session's artifact dir, and theart_→ legacy fallback in one call. When everything misses, the error enumerates every candidate path tried and lists every accepted ref form so the model can correct on the next attempt instead of guessing blind.- The
[artifact: …]block emitted alongside spilled outputs now includes a literalretrieve: retrieve_tool_result ref=art_<id>line so the model never has to guess betweenid:,path:, andtool_call_id:.
- The wire compactor now persists deduped content to
<TURN_META_REF sha="…" original_chars="…" />is gone. The legacy wire-level dedup for identical per-turn metadata blocks emitted an opaque SHA-tagged reference that periodically leaked into model reasoning and user-visible debug dumps with no retrieval mechanism — the SHA was an artifact of the cache optimization, not a content address the model could resolve. The compactor now emits a self-explanatory<turn_meta_unchanged />marker instead. Same KV-cache friendliness (bytes in the same prompt position when nothing changed), zero ambiguity if the marker ever surfaces.- "Request cancelled while awaiting approval" now says
why. The approval and user-input handlers used to emit the
same opaque string regardless of source — user Esc, runtime-API
DELETE, parent-agent cancel, or a torn-down channel all
produced an identical error. A new
CancelReasonenum is latched alongside theCancellationTokenand surfaced as a(reason: …)suffix:user cancelled the request,request cancelled by external caller,request was preempted by a new turn,engine torn down before approval resolved. A closed approval channel reports the teardown race explicitly instead of just "Approval channel closed." Remaining non-user cancellation call sites are tracked in #1541. pandoc_convertvalidates binary-output requests before resolving thepandocbinary (harvested from PR #1523 by @muyuliyan). Hosts without pandoc installed now still get the intended validation error fortarget_format = "docx"withoutoutput_path, which keeps the CI test independent of runner packages.- Mouse movement no longer leaks xterm tracking bytes into the composer while the model is streaming (harvested from PR #1533 by @Oliver-ZPLiu, fixes #1529). When mouse capture is enabled, move/drag events can arrive faster than the TUI needs them during a streaming turn; those stale movement events are now discarded while loading, while click/scroll handling remains active when the TUI is idle.
- Resize and successful turn completion avoid blank-frame
flicker (harvested from PR #1537 by @czf0718, addresses
#1515 and #1539). Resize now performs the viewport reset, clear,
and redraw inside a single synchronized-output batch. Successful
TurnCompleteevents use the incremental renderer instead of forcing a clear+redraw every time; full repaint remains reserved for interrupted/failed turns and periodic resyncs. - The npm wrapper shows binary-download remediation on the first
retryable timeout (harvested from PR #1538 by @jieshu666,
addresses #1532). Users behind networks that time out when
fetching GitHub Release assets now see the mirror/proxy/Cargo
alternatives immediately, and the installer points at the fixed
docs/INSTALL.md#npm-binary-download-times-outanchor. edit_filestates its exact-replacement boundary and warns on multi-match replacements (from #1516). The tool description and base prompt now steer structural or intertwined edits towardapply_patch/write_file, while keeping legitimate repeated replacements compatible with an advisory "verify with read_file" hint.- Shift+Enter explicitly steers a running turn. Plain Enter
while busy continues to queue (the agreed default since
#1331). Pressing Shift+Enter during an in-flight turn
routes the draft directly through
engine.steer()— the same path Ctrl+Enter already drove — without going through the queue at all. When the agent is idle, Shift+Enter still inserts a composer newline as before. Ctrl+Enter remains bound for terminals that swallow the Shift modifier on Enter at the protocol level. (Pattern cross-checked against pi-agent'sstreamingBehavior: "steer"API: seeagent-session.ts:156— sameShift+Enter → steer / Plain Enter → queuesplit, just surfaced as a keypress instead of a programmatic flag.) exec_shell_waitdefault timeout raised from 5 s → 30 s. The 5 s default was fine for "did the command exit yet?" polls but forced models into hand-rolledwhile true; do exec_shell_wait …; sleep 30; doneloops forgh run watch, longcargo builds, andcargo login-style interactives. 30 s covers the common case in a single tool call without burning a whole turn on the timeout boundary; the model can still pass any value up to 600 s.- Snapshots no longer try to index a multi-hundred-GB workspace
on first turn. Reported by users running
deepseek-tuiinside project directories with hundreds of GB of content — datasets, model weights (.safetensors,.gguf,.pt), Docker image dumps, parquet / arrow caches — where the side-git snapshot initialization would hang the TUI for minutes or hours whilegit add -Awalked the workspace. v0.8.32 adds a default 2 GB ceiling on non-excluded workspace content (measured before any git work, walking the same excludes the snapshot path already honors). When the cap is exceeded the side repo isn't initialized; subsequent snapshots are skipped with a clear WARN-level log line referencing the new[snapshots] max_workspace_gbconfig knob users can raise (or set to0to disable the cap entirely and restore v0.8.31 behaviour). The bounded estimator also early-exits past 200k file entries, so a workspace full of tiny files trips the cap before paying for a full walk. Pre-existing v0.8.27 fixes for the growth-over-time angle (#1112: retention cap, mid-session prune, expanded built-in excludes) continue to apply; this closes the orthogonal "snapshots-too-big-to-start" path. - Toast stack overlay no longer renders on top of the composer
input (harvested from PR #1485 by @MeAiRobot). When a
deferred tool's schema auto-loaded after the model requested
it, the resulting status toast ("Auto-loaded deferred tool
'edit_file' after model request.") could render at
footer_area.y - 1— which on tight layouts is the bottom row of the composer area, visibly overwriting the start of the user's typed text.render_toast_stack_overlaynow clampsmax_aboveto the gap betweencomposer_area.y + composer_area.heightandfooter_area.y, so when the composer and footer are adjacent the overlay collapses to zero rows and the toast is suppressed rather than drawn on top. /sessionspicker highlights the selected row more strongly in dark terminals (harvested from PR #1493 by @reidliu41). Previously the selection background was subtle enough to lose in low-contrast dark themes; keyboard navigation up/down didn't obviously change which row was active. The selected row now uses a bolded label on a stronger background so the focused row reads cleanly across the dark palettes the TUI ships with.- TUI input no longer freezes while long-running shell jobs
flood stdout (#1299, harvested from PR #1494 by
@CrepuscularIRIS / autoghclaw). The job-panel refresh path
was calling
full_output()from inside theShellManagermutex, which cloned the entire accumulated stdout/stderr buffer every 2.5 seconds. For browser-automation or large-build jobs the buffer grew unboundedly; cloning held the mutex for O(total_bytes) time, starving thecrossterm::event::pollloop and dropping keystrokes. The refresh now reads only the lastmax_tail_chars * 4bytes under the lock (lock hold time is O(1) regardless of total output volume) and decodes those into a tail string for display.stdout_len/stderr_lenstill report the true total byte counts so no caller invariant breaks. Also tightenstake_delta_from_bufferto slice[cursor..total]inside the lock guard instead of cloning the whole buffer first, and skips UTF-8 continuation bytes attail_startsofrom_utf8_lossynever emits a leading U+FFFD in the job panel. @-mention truncation no longer splits multi-byte UTF-8 sequences (#1441, harvested from PR #1495 by @CrepuscularIRIS / autoghclaw). When@-mentioning a file larger than 128 KB the composer truncated the buffer at exactlyMAX_MENTION_FILE_BYTES, which on CJK / emoji content landed mid-codepoint and produced a stray U+FFFD at the cut point. The truncator now usesstr::from_utf8(...).error_len()to detect the incomplete-tail case and rounds down to the last valid codepoint boundary before decoding. Genuinely invalid UTF-8 files still surface the "file is not UTF-8" error (the rounding is only applied when the error is an incomplete tail, not a real decoding failure mid-buffer).- vLLM provider:
reasoning_effort = "off"now actually disables thinking on Qwen3 / DeepSeek-R1 servers, cutting TTFT from ~13s to ~270ms (harvested from PR #1480 by @h3c-hexin). The vLLM branch ofapply_reasoning_effortwas injectingthinking: {type: "disabled"}at the top of the request body — but vLLM speaks OpenAI's chat-completions protocol, not Anthropic-native fields, and silently ignored the directive. The model then emitted a full hidden reasoning trace into the non-standardreasoningfield (which this client doesn't surface), so users saw a multi-second freeze before any content token arrived. The vLLM branch now emits the OpenAI extensionchat_template_kwargs.enable_thinking(which vLLM forwards into the model's chat template — the canonical way to toggle Qwen3's<think>...</think>mode). Measurement against vLLM + Qwen3.6-35B-A3B-FP8: TTFT 13039ms → 274ms, total LLM call 13s → 5.7s. Thehigh/maxeffort levels likewise switch to the OpenAI extension. No change for non-vLLM providers. /sessionspicker no longer shows<turn_meta>as the session title (harvested from PR #1498 by @wdw8276).session_manager::create_saved_session_with_id_and_modepicked the first text content block off the user message viafind_map; the engine prepends an internal<turn_meta>block ahead of the real user text, so the picker rendered that metadata blob as the session name. Guard added so titles fall through to the actual user input. Existing sessions without the prefix block are unaffected.- Kitty keyboard protocol now activates on Windows (VSCode +
Windows Terminal), so
Shift+Enterinserts a newline instead of submitting (#1359, harvested from PR #1483 by @CrepuscularIRIS / autoghclaw). Root cause: crossterm'sPushKeyboardEnhancementFlagsgates the escape sequence onis_ansi_code_supported(), which on Windows queries the console mode rather than the VT capability and unconditionally returns false — so the Kitty push (\x1b[>1u) was never written, leaving xterm.js in legacy mode whereShift+EnterandEnterboth produce\rand are indistinguishable.Alt+Enter/Ctrl+Jwere affected the same way. The fix writes the push and pop escapes directly under#[cfg(windows)], bypassing the capability gate; terminals that don't speak the protocol silently discard the sequences. Also extends the pop-on-exit path to two missed call sites (themain.rspanic hook andexternal_editor.rs::spawn_editor_for_input) so a crash or$EDITORinvocation can no longer leave the parent shell's keyboard state corrupted. - Approval modal can be collapsed to a one-line banner with Tab (harvested from PR #1455 by @tiger-dog). Previously the approval prompt rendered as a full-screen takeover that hid the transcript behind it, so users had to dismiss the modal just to remember which tool call they were being asked to approve. Tab now toggles between the takeover card and a single-line bottom banner — the rest of the conversation stays visible while the decision is pending. Tab again restores the full card; the selection state is preserved across the toggle.
- Markdown renderer no longer eats underscores inside
identifiers (harvested from PR #1455 by @tiger-dog). The
inline parser was matching
_italic_against the underscore indeepseek_tui/foo_bar_bazand rendering the second half of the identifier in italic, which made transcript snippets that named code symbols read as garbled prose. Both_italic_and*italic*now apply a CommonMark-style boundary check on the closing delimiter — when the next character is a letter, digit, or underscore, the delimiter is treated as literal text instead of markup. Regression-pinned with cases likecrate deepseek_tui handles approvalsandlook at *not_emphasised*tail.
Added
- npm wrapper installs cleanly on OpenHarmony / HarmonyPC
(#1072, harvested from PR #1499 by @CrepuscularIRIS /
autoghclaw).
os.platform()returnsopenharmonyon HarmonyPC and on OpenHarmony's Linux ABI-compatible userspace, but the npm wrapper's platform-asset matrix only coveredlinux/darwin/win32, sonpm i -g deepseek-tuiwould abort withUnsupported platform: openharmonyeven though the Linux x64 / arm64 binaries run unchanged on that environment. Added aPLATFORM_ALIASESmapping that resolvesopenharmonyto thelinuxasset family before lookup so install succeeds on those hosts. The error message for genuinely unsupported platforms still reports the raw platform name (freebsd, etc.) so OS-mismatch reports stay diagnostic. - Startup empty-state shows useful context instead of
repeating the header (harvested from PR #1444 by
@reidliu41). The center of the welcome view used to repeat
information already displayed in the header and footer. It now
shows the build version, the active model with a
/modelhint, and the current working directory so first-time users have somewhere to look while they decide what to type. - Opt-in
v4-best-practicesbundled skill (harvested from PR #1448 by @SamhandsomeLee). A single 50-lineSKILL.mdencoding three V4-specific workflow rules for multi-step thinking-mode tasks. Each rule maps to a concrete observable failure class. Discovered through the existingcrates/tui/assets/skills/...mechanism alongside theskill-creatorskill — not enabled by default; users opt in via the standard/skillsUI. image_analyzetool — vision-model image understanding (harvested from PR #1467 by @MMMarcinho). Sends an image file to an OpenAI-compatible vision endpoint and returns the model's natural-language description. Complementsimage_ocr: useimage_ocrfor "what text is on this image",image_analyzefor "what is this image about". Opt-in only — gated by both the[features] vision_model = trueflag and a[vision_model]config block specifyingmodel(and optionallyapi_key/base_url). Default configuration ships the feature flag atfalse, so no install sees vision API calls fire without an explicit two-step opt-in. Billing: each call hits the configured vision endpoint (OpenAI by default), so usage is billed by the third-party provider; calls are stateless (no conversation context attached). Workspace-boundary check: the tool rejects absolute paths and any..parent-dir traversal before any base64 encoding or API call. To disable later: set[features] vision_model = false(or omit[vision_model]). Supports PNG, JPEG, GIF, WebP, and BMP inputs.image_ocrtool — extract text from images via local tesseract. Lets the model OCR a screenshot, scanned receipt, whiteboard photo, or image-only PDF the user drops into the workspace, without bouncing throughexec_shell. Spawnstesseract <image> -and returns the recognised text inline; no file is written. PNG / JPEG / TIFF inputs supported. Registration is gated ondependencies::resolve_tesseract(); when tesseract is missing the tool isn't advertised, so the model never tries to call an OCR engine the host can't run.deepseek doctorreports tesseract status alongside the other external-binary dependencies with platform-aware install hints (brew install tesseract/apt install tesseract-ocr/winget install UB-Mannheim.TesseractOCR). For non-default language packs or PSM modes, users can still drop intoexec_shellwith the full tesseract CLI surface.pandoc_converttool — convert documents between formats via the local pandoc binary. Pandoc is the Swiss Army knife the real world uses for moving prose around — Markdown to HTML, HTML to Markdown, reST to anything, anything to DOCX / EPUB / LaTeX — and surfacing it as a model-callable tool unblocks "rewrite this report as ..." / "publish this changelog as ..." workflows that previously needed the user to drop into a terminal between turns. Curated target whitelist of 11 formats (markdown, gfm, commonmark, html, rst, latex, docx, odt, epub, plain, asciidoc) so the model can't ask forpdf(would need LaTeX) or typos likemarkown. Binary targets (docx, odt, epub) require anoutput_path; text targets can return the converted text inline. Approval routes through the WritesFiles / Suggest tier on every call. Registration is gated ondependencies::resolve_pandoc();deepseek doctorsurfaces the binary's status with platform-aware install hints.js_executiontool — execute model-provided JavaScript via a local Node.js runtime. Mirrorscode_execution(Python) so the model has a single consistent surface for "run this snippet locally and tell me what it printed" across both interpreters. Same tempfile-spawn pattern, same 120-second timeout, same stdout/stderr/return_code result shape — so prompt-cache layouts that cover one tool also cover the other. Registration is gated oncrate::dependencies::resolve_node(): when Node is missing the tool is simply not advertised, so the model never sees a runtime it can't actually use.deepseek doctorreports Node availability under "Tool Dependencies" with platform-aware install hints (brew install node/apt install nodejs/winget install OpenJS.NodeJS). Approval routes through the same Suggest tier ascode_execution./translateopt-in: respond in the user's UI locale, with a post-hoc fallback for English that leaks through (harvested from PR #1462 by @YaYII). Two-layer design: when the user enables translation via the/translateslash command, a## Language Output Requirementblock is appended to the system prompt instructing the model to reply in the resolved session locale (Simplified Chinese, Traditional Chinese, Japanese, or Brazilian Portuguese — code identifiers and user-requested English code blocks are exempt). For replies that still surface English despite the directive, a heuristic intui::translation(Latin-vs-CJK character ratio with weighting for CJK information density) detects the leak and invokes a focused per-message translation API call to render the localised version before display. Both layers are off by default and have no effect on installs that don't enable them. Trust-boundary scope: opt-in only, system prompt addition is conditional on the runtime flag, no model behaviour change for English-locale users.- AtlasCloud is now a first-class provider (harvested from
PR #1436 by @lucaszhu-hue). AtlasCloud hosts the V4 family
(and other DeepSeek-compatible models) on its own endpoint at
https://api.atlascloud.ai/v1, and several contributors had been running it through the OpenAI-compatible passthrough with manualbase_url/ model overrides. Selectingprovider = "atlascloud"in~/.deepseek/config.toml(or viaDEEPSEEK_PROVIDER=atlascloud) now wires up the documented defaults, a[providers.atlascloud]config block for per-user api_key / base_url / model / http_headers overrides, theATLASCLOUD_API_KEYenv var path, and the provider-picker //providerslash command entries — same shape as the existing NVIDIA NIM / Fireworks / OpenAI provider rows. Default remains DeepSeek; nothing changes for installs that don't opt in. web_searchsupports Tavily and Bocha as configurable backends (harvested from PR #1294 by @sandofree). DuckDuckGo with Bing fallback remains the default — no API key required — but users in regions where those scrapers are unreliable can now set[search] provider = "tavily" | "bocha"plusapi_key = "..."inconfig.toml(or via theDEEPSEEK_SEARCH_PROVIDER/DEEPSEEK_SEARCH_API_KEYenv vars) to route everyweb_searchcall through the chosen API. Tavily is an AI-search API targeted at general use; Bocha is the mainland-China-friendly equivalent. Trust-boundary pins: an unsetapi_keyon an opted-in provider surfaces a clearToolErrornaming the missing key rather than silently falling through to a different provider, the network policy gate ([network]) is consulted for the provider host on every call, and the default path is unchanged so no install sees provider behaviour change unless they explicitly opt in./changeslash command displays the most recent CHANGELOG.md version section from inside the TUI, so users can see what they just upgraded into without leaving the chat (harvested from PR #1416 by @zhuangbiaowei). The command works against the bundled release-notes copy when no workspace CHANGELOG is available, and on non-English locales it requests a model-side translation of the section so localised users see the changelog in their UI language. Pure offline fallback when no API key is configured.
Fixed
deepseek updatenow refreshes the companion TUI binary alongside the dispatcher (harvested from PR #1492 by @NorethSea). Closes the documented two-binary footgun:~/.cargo/bin/deepseekwould update to the latest dispatcher, but~/.cargo/bin/deepseek-tuiwould stay at the previously installed version, so users saw the dispatcher report a new release while the TUI runtime they actually interacted with reported the old version. Most painful for Volta-managed npm installs and any maintainer flow that callsupdateinstead of re-running bothcargo install --path crates/{cli,tui}. The updater now enumerates colocated binaries up front, downloads and verifies every release asset before replacing anything, then swaps the sibling first and the running dispatcher last so a partial network failure cannot leave the launcher updated while the TUI remains stale.
Changed
read_filenow extracts PDFs in pure Rust by default — no Poppler install required. Before v0.8.32 the PDF path shelled out topdftotext(Poppler), so first-time users on hosts without it sawread_filereturn abinary_unavailablesentinel and had tobrew install poppler/apt install poppler-utilsbefore the model could open a PDF. The bundledpdf-extractcrate (which already powered URL-fetched PDFs inweb_run) now drives the localread_filepath too. Thepagesparameter still filters by 1-indexed inclusive page range; both the whole-file and per-page variants run with no system dependency. Users with column-heavy or complex-table PDFs (academic papers, financial filings) wherepdftotext -layoutstill wins can opt into the external path withprefer_external_pdftotext = truein~/.config/deepseek/settings.toml— when set, the previous Poppler dispatch (and thebinary_unavailableinstall hint when the binary is missing) returns.deepseek doctornow reportspdftotextas optional and explains how to opt in instead of framing it as a missing dependency.
Known issues
- Terminal-native text selection can still be blocked while the agent is thinking or streaming a response. v0.8.32 removed the noisy Shift-to-bypass-mouse-capture path that caused visible scroll/redraw thrash, but the replacement selection path is not complete yet. v0.8.33 is planned to ship the text-selection fix alongside the sub-agent and RLM renovation.
0.8.31 - 2026-05-12
A "tools that actually work" release. code_execution no longer
fails on Windows hosts where python3 isn't on PATH — we probe
for the interpreter at catalog-build time and only advertise the
tool when one resolves, so the model never sees a runtime it can't
actually use. The new deepseek doctor "Tool Dependencies" and
"Terminal Quirks" sections surface external-binary status and
active env-driven overrides so flicker / motion / missing-tool
puzzles answer themselves before a bug report gets filed. Ptyxis
50.x users on Ubuntu 26.04 get a manual synchronized_output = off
knob plus auto-detection that opts them out of the DEC 2026
synchronized-output wrap their VTE 0.84 mishandles. The CNB Cool
mirror workflow is rewritten with concurrency and scoped pushes so
release tags reliably reach cnb.cool/deepseek-tui.com/DeepSeek-TUI
for users behind GitHub-blocking networks. Plus a new auto-close
workflow that closes contributor PRs whose code has been harvested
into main, so credit lands at the same moment the fix does.
Fixed
- Windows
exec_shellpreserves MSVC toolchain env (harvested from PR #1487 by @Jianfengwu2024). When the parent shell has already loadedVsDevCmd/vcvars(Developer Command Prompt, the standard way to run Rust + MSVC on Windows),exec_shellwas strippingLIB/LIBPATH/INCLUDEand the related VS / SDK / CRT root variables on its way to the child. That made model-drivencargo buildcalls fail to resolvekernel32.libeven thoughlink.exewas reachable viaPATH. The allowlist inchild_env.rsnow preserves the 13 MSVC env vars so the toolchain context survives the sanitisation pass. code_executionno longer fails with "program not found" on Windows (and any other host withoutpython3onPATH). Before v0.8.31 the tool hardcodedpython3and was unconditionally advertised in Agent / YOLO modes — so the model would call it, spawn would fail, and the error surfaced as a generic tool failure with no upstream hint. The fix probes for a Python interpreter (python3→python→py -3) at catalog-build time, caches the resolved interpreter, and only advertisescode_executionwhen one resolves. On hosts with no Python the tool is not registered at all — the model never sees a tool it can't actually run. Reported by a Windows contributor; resolver lives atcrates/tui/src/dependencies.rsand is also surfaced bydeepseek doctor. Folds in the contributor's "write code to a tempfile and run the file" suggestion at the same time, so multiline code with quote nesting no longer round-trips throughpython3 -c.- Termius and every SSH session auto-enable low-motion
(#1433, harvested from PR #1479 by @CrepuscularIRIS / autoghclaw).
Termius desktop sets
TERM_PROGRAM=Termius; sshd exportsSSH_CLIENTfor every TCP session andSSH_TTYfor interactive PTY logins. Any of those signals now flipslow_motionandfancy_animationslike the existing VS Code / Ghostty path, so the 120 FPS cursor-repositioning that races the SSH round-trip no longer flickers a remote TUI. Disk-loadedfancy_animations = trueis unconditionally overridden under these signals, matching the existing env-precedence contract. - DEC 2026 synchronized output is auto-disabled on Ptyxis (the new
default terminal on Ubuntu 26.04 and an increasingly common Linux
TUI host). Ptyxis 50.x ships on VTE 0.84.x, which parses the
\x1b[?2026h/\x1b[?2026lbegin/end pair but still flashes the entire viewport on every wrapped frame instead of deferring rendering — so a TUI that uses DEC 2026 to avoid tearing experiences visible flicker on every redraw. gnome-terminal 3.58 on the same VTE renders cleanly, so the heuristic must stay narrow: we trigger only onTERM_PROGRAMmatchingptyxiscase-insensitively, orPTYXIS_VERSIONset to any non-empty value. Either signal flips the newsynchronized_outputsetting fromautotooff; the renderer then skips the begin/end pair on every draw, inreset_terminal_viewport, and inresume_terminal. Users on Ptyxis who upgrade past the upstream fix (or who want to confirm a fix landed) can override with/set synchronized_output onor by addingsynchronized_output = "on"to~/.config/deepseek/settings.toml.
Added
deepseek doctornow reports tool-dependency status. A new "Tool Dependencies" section lists which external binaries the registered tools rely on, with ✓ when present and ✗ + an install hint when missing. Today this covers the Python interpreter (code_execution) andpdftotext(read_filePDF path). A separate "Terminal Quirks" section shows which env-driven auto-overrides (VS Code / Ghostty / Termius / SSH / Ptyxis) are currently active so users can see at a glance why a particular rendering compromise is in effect. Foundation for surfacing future tool dependencies as the toolset grows.- New
synchronized_outputsetting controls whether the renderer wraps each frame in DEC mode 2026 synchronized output. Acceptsauto(default; respect the Ptyxis env opt-out),on(always emit DEC 2026, override the heuristic), oroff(never emit DEC 2026). The cost ofoffis brief tearing on terminals that handle DEC 2026 cleanly; it is purely a rendering-quality knob, not a correctness one. Set via/set synchronized_output <auto|on|off>or in~/.config/deepseek/settings.toml. read_fileacceptsstart_lineandmax_linesfor chunked, bounded reads of large files (#1450, harvested from PR #1451 by @Oliver-ZPLiu). Default window is 200 lines / ~16 KB; the hard cap is 500 lines. Small files (≤ 200 lines AND ≤ 16 KB) still return their contents unchanged, so existing prompts that read config files / single source files see no behavior change. Large files now return a<file …>-wrapped, line-numbered window withshown_lines,truncated, andnext_start_lineattributes plus a[TRUNCATED]continuation hint — so the model can page through a 50 KB file in 16 KB slices instead of dragging the whole thing into the conversation context on every turn. PDFs continue to usepages;start_line/max_linesapply to text files only.web/dependency security updates. Bumps:next15.5.16 → 15.5.18 (GHSA-26hh-7cqf-hhc6 — App Router middleware/proxy bypass via segment-prefetch routes; high severity).mermaid11.14.0 → 11.15.0 (GHSA family: Gantt-chart infinite- loop DoS,classDefHTML injection,classDefs/ configuration CSS injection; all medium severity).eslint-config-next15.5.16 → 15.5.18 (matches Next.js).npm run buildconfirmed clean on the bumped lockfile. None of these affect the Rust TUI binary; the bumps are for the separately-deployeddeepseek-tui.comsite.
- MCP HTTP servers accept custom headers for authentication
(#1454, harvested from PR #1456 by @Oliver-ZPLiu). Mirrors the
headersfield that Claude Code, Codex, and OpenCode already accept in their MCP config — add e.g."headers": { "Authorization": "Bearer ${HF_TOKEN}" }under any HTTP server entry in~/.deepseek/mcp.jsonand the headers are sent on every Streamable HTTP request. Headers are sent literally — env-var interpolation is a follow-up, so tokens pasted directly into mcp.json live there as plain text. The Streamable HTTP transport filters out empty keys, framing overrides (Accept,Content-Type), and CR/LF in values (response-splitting defense) so a single bad entry can't break protocol negotiation or smuggle a header through a misbehaving proxy. Stdio servers (command-based) and the legacy SSE transport ignore the field; SSE coverage is a follow-up.
0.8.30 - 2026-05-11
A "tighten what we shipped" release. Bare single-letter keystrokes
(g, G, [, ], ?, l, v) no longer get eaten as transcript-
nav shortcuts when the composer is empty — every one of them is now
freely usable as the first character of a message. The water-spout
animation in the footer is decoupled from low_motion so typewriter
mode no longer hides the wave, and the v0.3.5-era 🐳→🐋 cycling
indicator is back next to the effort chip after a long detour through
geometric dots. Plus a handful of provider, shell, and config fixes
that surfaced during v0.8.29 testing.
Added
-
The whale is back. Restored the
🐳 → 🐳. → 🐳.. → 🐳... → 🐋 → 🐋. → 🐋.. → 🐋... → 🐋.. → 🐋. → 🐳..cycling status indicator that originally shipped in v0.3.5 and silently disappeared in commit1a04659a9(the "smoother TUI streaming" pass, which swapped the 12-frame whale sequence for a 6-frame geometric◍ ◉ ◌ring) and then was deleted outright inf4dbf828c(footer-polish commit). The chip renders in the header status cluster, immediately before the reasoning-effort chip — exactly where long-time users remember it. Idle frame is a steady 🐳; the cycle advances every 420 ms keyed offApp::turn_started_at, so the breaching whale shows up halfway through any active turn.Configurable via the new
status_indicatorsetting:whale(default) — the historical cycling whale.dots— the geometric◍ ◉ ◌frames from the dots era.off— hide the chip entirely.
Set via
/config status_indicator <whale|dots|off>or insettings.toml.
Changed
-
Transcript-nav single-letter shortcuts now require
Alt. Before v0.8.30, pressing a bareg,G,[,],?,l, orvwith an empty composer hijacked the keystroke for transcript navigation — so typing "good morning" produced "ood morning" with no warning, and the v0.8.29 spot-fix atc13ddb04d(gg double-tap) only suppressed the scroll, not the lost character. The bindings are now uniformlyAlt+<key>, mirroringAlt+R(history search) andAlt+V(tool details) which already followed this pattern:Old (bare) New ( Alt+…)What it does gg(double-tap)Alt+Gscroll transcript to top GorShift+GAlt+Shift+Gscroll transcript to bottom [Alt+[jump to previous tool output ]Alt+]jump to next tool output ?Alt+?open the searchable help overlay (F1 / Ctrl+/also bound)lAlt+Lopen pager for last message v/VAlt+Vopen tool-details pager Plain letters are now always inserted into the composer as text. The
App::transcript_pending_gfield from the v0.8.29 half-fix is removed; the unifiedalt_nav_modifierspredicate replaces the per-keyis_empty()checks.
Fixed
low_motion = trueno longer hides the footer water-spout whenfancy_animations = true. The spout-strip animation in the footer was hard-gated on!low_motion, collapsing two unrelated concerns — streaming pacing and footer animation — onto one flag. The two are now orthogonal:low_motiongoverns only streaming pacing (typewriter vs. upstream cadence), andfancy_animationsalone decides whether the water-spout strip renders. The wave itself is unchanged from prior releases (wall-clock-driven sine, same cadence as v0.8.29).- Custom-base-URL providers preserve the user's model name (#857
class). Only OpenRouter was previously whitelisted; Sglang, Novita,
Fireworks, Vllm, Ollama, and NvidiaNim users hitting custom gateways
with a bare model name were getting HTTP 400s because the dispatcher
rewrote the model identifier. Now any provider with a user-set
base_urlis treated as a custom endpoint and passes the model name through unchanged. exec_shellno longer freezes the TUI when a background subprocess outlives its parent shell (#828, cherry-picked from PR #1475 by @CrepuscularIRIS / autoghclaw). Orphaned children that kept the pipe write-end open madehandle.join()incollect_outputblock indefinitely; every transcript-rendering tick that calledlist_jobs()then hung the UI. The collector now kills the process group before joining the reader threads, and the previously deadcleanup()is now wired to drop completed jobs older than an hour.
0.8.29 - 2026-05-11
A maintenance release anchored by a regression fix for the "scroll demon" (#1085 class, re-introduced by v0.8.27's flicker patch) and a wrong-project session-restore bug (#1395). Plus 25 community PRs covering MCP transport, prompt steering, auto-routing language coverage, web-search SERP filtering, and broad test coverage additions.
Fixed
- Scroll demon — alt-screen no longer drifts under parallel
sub-agent load (#1085 regression). The v0.8.27 flicker fix
dropped the
\x1b[2J\x1b[3Jdeep-clear from the viewport-reset path, which had been silently masking threeeprintln!sites inside the sub-agent and network-policy modules. Each leak scrolled the alt-screen up by one row while ratatui's diff renderer remained convinced its model matched reality. Three layers of defence now ship together: atracing-subscriberwriting to~/.deepseek/logs/tui-YYYY-MM-DD.log, an fd-leveldup2stderr redirect for the alt-screen lifetime (Unix only; Windows follow-up tracked), and module-level#![deny(clippy::print_stdout, clippy::print_stderr)]ontools/,core/,tui/,runtime_threads.rs, andnetwork_policy.rs. The three known leak sites (subagent::persist_state_best_effort,subagent::new_shared_subagent_manager,network_policy::record) now route throughtracing::warn!with structured fields. Ctrl+Rsession-restore picker is workspace-scoped (#1395, PR #1397 from @linzhiqin2003).SessionPickerView::newpreviously listed every saved session on disk sorted globally — so opening DeepSeek-TUI in Project B and pressingCtrl+Rcould hand back Project A's last conversation. The picker now filters by current workspace, with a fallback hint when no in-workspace sessions exist.- MCP discovery survives malformed items (PR #1410 from
@Liu-Vince). The
tools/list,resources/list,resources/templates/list, andprompts/listwalks previously didserde_json::from_value::<Vec<…>>(…).unwrap_or_default(), which silently discarded the entire page when any single entry was misshapen. Each list now iterates per-item, skipping malformed entries with atracing::debug!instead of dropping the rest of the catalogue. Composes with the v0.8.x pagination loop landed for #1256. - MCP SSE transport accepts CRLF-framed endpoint events (#1309,
PR #1358 from @reidliu41). FastMCP / uvicorn-style SSE
streams using
\r\n\r\nseparators now discover the endpoint and send initialization requests instead of timing out while waiting for an LF-only event boundary. - Composer ignores leaked SGR mouse-report bursts (#1418,
PR #1421 from @reidliu41). Some SSH / IDE terminal chains
leak fragments like
[<35;44;18Minto stdin while mouse capture is enabled; the composer now filters those bursts at the insertion boundary without stripping ordinary coordinate-like typed text. - Footer right-cluster chips can no longer crowd the left status line (#1357, PR #1417 from @Wenjunyun123). The footer now reserves visible space for the left status before selecting cache / aux chips, dropping oversized right-side chips instead of pushing the row over the available terminal width.
- Web search drops spam-stuffed SERPs (#964, PR #1396 from @linzhiqin2003). The Bing / DDG fallback paths now filter the SEO-farm domains that were poisoning quick lookups.
- Language directive:
reasoning_contentfollows the user's message language (#1118, PR #1398 from @linzhiqin2003) — previously the project context's inferredlangcould override the latest user message, leading to English thinking for a Chinese turn. - Deferred tools hydrate their schema before first execution
(#1419, PR #1429 from @SamhandsomeLee). When the model asks
for a deferred tool such as
edit_filebefore seeing its schema, the engine now loads the tool, returns a non-executed hydration result with the expected fields, and requires a retry instead of executing guessed argument names. Commonedit_filealiases such asold_string -> searchandnew_string -> replaceare called out in the retry hint. - DeepSeek public aliases replay thinking-mode tool turns
(PR #1428 from @Beltran12138).
deepseek-chatanddeepseek-reasonernow classify as V4 reasoning models forreasoning_contentreplay, preventing second-turn HTTP 400s after tool calls when users keep the onboarding default model alias. Ctrl+Oexpands thinking blocks still in flight. Two compounding bugs were making the "thinking collapsed; press Ctrl+O for full text" affordance a lie. (1)open_thinking_pageronly searchedapp.history, but afterThinkingCompletethe finalized thinking entry sits inapp.active_cellwithstreaming = falseuntil the active cell flushes at end-of-turn; during that window the handler surfaced "No thinking blocks to expand" while the affordance pointed at the live entry. Routed through the existingcell_at_virtual_index/virtual_cell_countresolver thatopen_tool_details_pageralready uses, so selection-based and most-recent lookups both reach in-flight entries. (2) The keybinding guard requiredkey.modifiers == KeyModifiers::CONTROL(exact match), so any extra modifier bit set by the terminal — Shift while a native-selection bypass was active, Caps Lock indicator on some keyboard layouts — silently fell through to the$EDITORarm and did nothing visible on an empty composer. Relaxed tocontains(KeyModifiers::CONTROL)to match the existing Ctrl+P / Ctrl+B pattern. Regression-guarded byopen_thinking_pager_finds_thinking_in_active_cell.- Skill completions no longer flood the top-level slash menu
(#1437, PR #1442 from @reidliu41). Installed skills now
complete under
/skill <name>while the root/menu stays focused on built-in commands. edit_filerejects no-op replacements (PR #1460 from @xiluoduyu). Identicalsearch/replacearguments now fail fast with a clear validation error instead of producing an empty diff that can trap the model in retry loops.- Windows-terminal glyph widths are stable (#1314, PR #1465 from @CrepuscularIRIS). SMP emoji in the header and file tree were replaced with BMP-width-safe symbols / text so cmd, PowerShell, WezTerm, and Alacritty do not mismeasure rows.
- Ghostty defaults to low-motion rendering (#1445, PR #1468
from @CrepuscularIRIS).
TERM_PROGRAM=ghosttynow receives the same animation cap as VS Code terminals to avoid redraw flicker on affected setups. - Docker buildx provenance permission failures get an actionable hint (#1449, PR #1469 from @CrepuscularIRIS). macOS shell outputs matching the restricted provenance metadata failure now include guidance to disable provenance for that build.
- Windows CMD mouse-wheel fallback scrolls the transcript (#1443, PR #1471 from @CrepuscularIRIS). When mouse capture is off, composer arrow-scroll defaults on so terminal wheel events mapped to Up / Down do not cycle composer history.
Added
- MCP HTTP transport honors
HTTP(S)_PROXY/NO_PROXY(#1408 from @hlx98007). Reqwest 0.13 does not auto-detect proxy env vars by default, so MCP HTTP connections were bypassing the proxy that every other tool on the box (curl, npm, git, …) was using. Connections behind corporate egress proxies and China-mainland Clash / Shadowsocks tunnels now work transparently. MalformedHTTPS_PROXYvalues log atracing::warn!and the connection proceeds without a proxy rather than failing the MCP attach. - Note management slash commands (PR #1407 from
@reidliu41).
/note add,/note list, and friends for persistent maintainer-style notes inside the TUI, backed by~/.deepseek/notes/. - Header surfaces the runtime version chip. A
v0.8.29tag sits in the header's right cluster after the provider / effort / Live / context chips. Styled withpalette::TEXT_HINTso it reads behind the streaming indicators. Drops first under tight terminal width. - Global
~/.deepseek/AGENTS.mdnow merges with project AGENTS.md (#1157, PR #1399 from @linzhiqin2003) instead of being shadowed when a workspace ships its own. - Auto-routing recognises CJK debug / search keywords (PRs
#1401 and #1402 from @linzhiqin2003) —
--model autoand the reasoning-effort picker correctly route Chinese / Japanese technical queries that previously fell through to the generic baseline.
Security
sync-cnb.ymlworkflow hardened (CodeQL finding from v0.8.28). Adds explicitpermissions: contents: read(least-privilege), bumpsactions/checkoutv3 → v4, and narrows the trigger fromon: [push]toon: push.branches: [main]+tags: ['v*']. Feature branches no longer mirror to CNB; onlymainand tagged releases do.- Post-exit resume hint avoids session-id taint. The TUI now
checks whether a session exists separately from the constant
resume-hint text it prints after leaving the alt-screen, resolving
the
rust/cleartext-loggingCodeQL alert without reintroducing scroll-demon stdout writes.
Internal
- +438 LOC of new test coverage across four PRs from
@linzhiqin2003:
error_taxonomy::classify_error_messageand Display impls (#1403),parse_pages_argedge cases (#1404),optional_search_max_resultsprecedence (#1405), andsanitize_stream_chunkcontrol-byte filtering (#1406). runtime_logmodule ships with a regression test pinning theHOME/USERPROFILE/dirs::home_dir()resolution order, holding the process-widetest_support::lock_test_env()lock for env-mutation safety.- Header rendering gains two regression tests
(
header_renders_version_chip_when_width_allowsandnarrow_header_drops_version_chip_before_dropping_mode) pinning the version chip's cascade priority. - Workspace/session test isolation tightened (PR #1431 from
@reidliu41). Git-root detection ignores invalid parent
.gitmarkers, env-mutating tests share the crate-wide test lock, and the streamable HTTP MCP mock server stays alive for the full test. - Config-mutating smoke tests now isolate
DEEPSEEK_CONFIG_PATH. The command registry and web-config commit tests no longer rewrite the developer's real~/.deepseek/config.tomlwhile validating release candidates locally.
0.8.28 - 2026-05-10
A maintenance release bundling four streaming / approvals / cache bug-fix cherry-picks, six smaller community fixes, a Cmux notification probe, GPU-terminal flicker hardening via DEC 2026 synchronized output, VS Code low-motion auto-detection, a CNB mirror workflow, V4-steered tool descriptions, and test-suite stabilization for parallel-test environment races.
Added
- CNB mirror workflow (PR #1373 from @Anyexyz) — a
GitHub Actions workflow (
sync-cnb.yml) mirrors every push to thecnb.cool/deepseek-tui.com/DeepSeek-TUIrepository, closing out the long-standing China-mirror request. Requires theCNB_GIT_TOKENrepo secret. - Cmux desktop notification support via
LC_TERMINAL(#1281, PR #1340 from @CrepuscularIRIS) — Cmux setsLC_TERMINAL=Cmuxrather thanTERM_PROGRAM, so the previous notification probe fell back toBELinstead of using OSC 9.resolve_method()now checksLC_TERMINALas a secondary probe and adds Cmux to the OSC 9 allowlist. Terminals that set neither env var can still force OSC 9 via[notifications].method = "osc9". Two regression tests pin the Cmux and WezTermLC_TERMINALpaths; the existing unknown-terminal-on-Unix test now clearsLC_TERMINALbefore asserting fallback so it doesn't flake on CI hosts that set it. - DEC 2026 synchronized output around terminal repaints (PR
#1361 from @xuezhaoyu) — the viewport-reset path now wraps
terminal.clear()in\x1b[?2026h/\x1b[?2026lso GPU-accelerated terminals (Ghostty, VSCode Terminal, Kitty, WezTerm) defer rendering until the whole frame is staged, eliminating mid-frame flicker on resize / focus / TurnComplete. The earlier "drop destructive 2J/3J" fix from v0.8.27 stays; this PR is complementary, batching the same lighter reset sequence into a single synchronized frame. Terminals without DEC 2026 support silently ignore the sequence. low_motionauto-enables under VS Code integrated terminal (PR #1365 from @CrepuscularIRIS) —apply_env_overrides()now treatsTERM_PROGRAM=vscodethe same way it treatsNO_ANIMATIONS=1: forcelow_motion = trueandfancy_animations = false. The VS Code terminal compositor cannot keep up with 120 fps redraws and produces rapid flicker (#1356); the 30 fps low-motion cap is the right default there. Env overlays always win over the disk-loaded value, matching the existing precedence forNO_ANIMATIONS.
Fixed
- Cache usage shows 0 when API omits cache data (#1391, PR #1392
from @Oliver-ZPLiu) —
SessionUsage.cache_creation_input_tokens/cache_read_input_tokensare nowOption<u64>instead ofu64defaulting to 0. When the upstream API doesn't report cache hit/miss, the model seesnullinstead of misleading zeros, and reasoning about cache utilization is accurate. - Deny of one tool call no longer blocks all future calls of the
same tool (#1377, PR #1388 from @Oliver-ZPLiu) — denying a
tool call now only caches the per-call
approval_key, not the tool type. Subsequent invocations of the same tool prompt for approval again instead of being silently auto-denied. - Streaming thinking blocks no longer drop their tail on
MessageComplete (#861 RC3, PR #1389 from @linzhiqin2003) —
the active streaming entry is now drained into the finalized
cell on
MessageComplete, eliminating a data-loss path where the last chunk(s) of a streaming "thinking" reply could be discarded whenMessageCompletearrived ahead ofThinkingCompletein a bursty event stream. Also closes a related HTTP 400 on the next turn (DeepSeek V4 requiresreasoning_contentreplay for assistant messages that carry tool calls). - Streaming thinking renders live in collapsed view (#861 RC4, #1324, PR #1390 from @linzhiqin2003) — collapsed thinking cells now stream their content as it arrives instead of staying at a static "thinking..." placeholder until streaming ends. When the live body exceeds the collapsed budget, the truncation affordance ("thinking continues; press Ctrl+O for full text") now fires during streaming with head lines dropped so the visible window tracks the live cursor at the bottom.
- First-turn latency bounded on large workspaces (#697, PR #1386
from @linzhiqin2003) — the working-set file walker now caps
the number of entries it visits during initial indexing, so
starting a session in a workspace with a deep
node_modules,target, or.venvno longer stalls the first response on filesystem traversal. - Duplicate error toast on transcript-rendered turn errors (PR #1368 from @douglarek) — when a turn error is already in the transcript as a system/error cell, the status-line toast is suppressed so the user doesn't see the same failure twice.
- Clearer continue tip on idle prompts (PR #1370 from @nightfallsad) — the "press Tab to continue" affordance now uses concrete language instead of a vague hint.
- Ctrl+Enter content lost when engine is idle (#1331, PR #1347
from @Oliver-ZPLiu) — when no turn was active,
Ctrl+Enterrouted the message torx_steer(only monitored insidehandle_deepseek_turn), so the user saw their message in the transcript via the local mirror but the LLM never received it — the next regular Enter would drain it as a "stale steer". The idle path now sends through the standardhandle_send_messageflow so the submission reaches the engine. - Explicit hidden / ignored
@-mention completions work (#1270 follow-up) — PR #1270 from @SamhandsomeLee landed theadd_local_reference_completionshelper and tests in v0.8.27 but never wired it intoWorkspace::completions()orbuild_file_index. The two regression tests were ignored with a "v0.8.28 follow-up" marker. This release wires the helper into both entry points so@.deepseek/commands/start-task.mdand@.generated/specs/device-layout.md(and the basename fuzzy-resolve equivalent) now surface from gitignored user-folders while.deepseekignoreentries stay blocked. Both tests un-ignored.
Changed
- Prompt-side reliability guidance (PR #1393 from
@Oliver-ZPLiu) —
prompts/base.mdgains three Verification Principle bullets steering the model to verify before reporting complete, preserve only key facts from tool results, and inspect errors before retrying. Combined with the truthful- reporting addition from #1392, the model is less likely to claim unverified successes or repeat the identical failing tool call. - V4-steered tool descriptions (#711, PR #1379 from
@linzhiqin2003) — every model-visible tool description
(
read_file,write_file,edit_file,list_dir,grep_files,file_search,web_search,apply_patch,fetch_url) now opens with a short "use this instead of X in exec_shell" steering line, the return shape, and the limits. Routes V4 toward our typed tools and away from shell footguns. All description strings stay under 1024 chars (max: 350) with no embedded newlines so the cached tool catalogue stays prefix-stable for V4's KV cache. Removes the unused legacynormal.txt/plan.txt/yolo.txtprompt templates (referenced only by their own self-tests).
Internal
- Test-suite parallelism stabilization (commit
test: stabilize parallel test execution). Folds three local test-mutex implementations into the process-widetest_support::lock_test_env, eliminating a class of intermittent failures (refresh_system_prompt_is_noop_when_unchanged,save_api_key_for_openrouter_writes_provider_table,list_archives_sorts_by_cycle_number) observed during the v0.8.27 release cycle. - Windows
task_managertimeout bumped 3s → 10s on four tests exercising durable task recovery, addressing an intermittent CI timeout on Windows under file-I/O load. provider_switch_clears_turn_cache_historynow isolatesHOME/USERPROFILEto a tempdir for its lifetime. The test was silently writingdefault_provider = "ollama"to the developer's real~/Library/Application Support/deepseek/settings.tomlon every run, which then contaminated parallel-running picker tests because Ollama is a pass-through provider that hides the DeepSeek model rows.settings::tests::no_animations_test_guardandterm_program_test_guardboth now returncrate::test_support::lock_test_env()instead of their own module-local mutexes — folding them into the same process-wide test env lock the v0.8.27 EnvGuard family was migrated to. Without this, aNO_ANIMATIONS=1write from one test family could race aTERM_PROGRAM=iTerm.appwrite from the other through the sharedapply_env_overridespath and fliplow_motiontotrueon the assertion side.
0.8.27 - 2026-05-10
A polish release bundling 17 community PRs plus a focused user-issue
sweep over the 24–48 hours after v0.8.26 shipped. Headline fixes:
cross-terminal flicker on Ghostty / VSCode / Win10 conhost (most-
reported v0.8.26 regression), long-text right-edge overflow, an
in-app pager copy-out, context-sensitive Ctrl+C, an MCP pool that
auto-reloads on config changes, and a model-callable notify tool.
Big thanks to every contributor below.
Added
- Unified
/modecommand (#1247) —/mode [agent|plan|yolo|1|2|3]replaces the separate/agent,/plan, and/yolocommands. Running/modewithout arguments opens a picker modal. The legacy aliases (/yolo,/agent,/plan) are kept as compatibility shorthands. Thanks @reidliu41. /statusruntime diagnostics (#1223) — shows version, provider, model, workspace, mode, permissions, context-window usage, cache hit/miss, and session cost. Previously/statuswas an alias for/statusline(footer config); that alias is now/statuslineonly. Thanks @reidliu41./feedbackcommand (#1185) — opens the matching GitHub issue template (bug report, feature request) in the browser. Security vulnerability reports route through the project's security policy page first. Thanks @reidliu41.- Session artifact metadata (#1220) — large tool outputs spilled to the session artifacts directory are now tracked in a durable metadata index, so saved sessions retain references across save/restore cycles. Thanks @THINKER-ONLY.
- Subagent results are self-reports (#1140) — the compacted result
summary now notes that child-agent outputs are unverified self-reports.
The parent model should verify side effects with tools like
read_fileorlist_dirbefore claiming success. Thanks @THINKER-ONLY. - Global AGENTS.md fallback (#1197) — when the workspace and its
parents don't provide project instructions, the TUI now loads
~/.deepseek/AGENTS.mdbefore falling back to auto-generated instructions. Repo-local context still takes priority. Thanks @manaskarra. --yoloforwarded from CLI to TUI (#1233) — thedeepseek --yoloflag now propagates through the dispatcher to the TUI binary viaDEEPSEEK_YOLO=true. Previously the flag setyoloin the CLI process but the TUI session started in its default mode. Thanks @fuleinist.composer_arrows_scrollconfig (#1211) — a newtui.composer_arrows_scrolloption (defaultfalse) makes plain Up/Down arrow keys scroll the transcript when the composer is empty, instead of navigating input history. Helpful for terminals that map trackpad gestures to arrow keys. Thanks @lbcheng888.- Session cost persistence (#1192) — accumulated costs (session + sub-agents, both USD and CNY) and the displayed-cost high-water mark now survive session save/restore, so the monotonic cost guarantee (#244) holds across restarts. Thanks @lbcheng888.
- Provider-aware model picker and provider persistence (#1320) —
switching providers now persists the choice to
~/.deepseek/settings.tomlso it survives restarts. The model picker hides DeepSeek-specific models when a non-DeepSeek provider is active.OPENAI_MODELenv var now overrides the per-provider model rather than the globaldefault_text_model. Bailian / ZhiPu Coding Plan endpoints are now supported. Thanks @imkingjh999. - HTTP User-Agent header (#1320) — all outbound API requests now
carry
deepseek-tui/{version}in the User-Agent, matching the formatfetch_urlalready uses. Thanks @imkingjh999.
Fixed
- Cross-terminal flicker on TurnComplete / focus / resize (#1119,
#1260, #1295, #1352, #1356, #1363, #1366) — the viewport-reset
sequence emitted before each forced repaint no longer includes
\x1b[2J\x1b[3J. Combined with the immediately-following ratatuiterminal.clear(), the destructive pair produced a double-clear that Ghostty, the VSCode integrated terminal, and Win10 conhost rendered as a visible blank-then-repaint flicker. The lighter sequence (\x1b[r\x1b[?6l\x1b[H) plus the alt-screen buffer's double-buffering handles viewport correctness without flicker. macOS Terminal.app / iTerm2 / alacritty users were already unaffected and remain so. /skills --remoteand/skills syncdiagnostics (#1329) — the underlying anyhow chain has always been formatted with{err:#}, but the chain alone is often opaque (e.g. "error sending request"). The error message now appends a one-line hint when the chain matches a common failure pattern: DNS / connection refused / TLS / 4xx / 429 / timeout. Each hint points at the most likely cause and a concrete next step.
Added
- Pager copy-out (#1354) — full-screen pagers (
Alt+Vtool details,Ctrl+Othinking content, shell-job / task / MCP-manager pagers, and the selection pager) now acceptcoryto copy the entire body to the system clipboard. The pager intercepts mouse capture so terminal- native selection isn't available inside it; this restores the copy-out path that users on macOS / Windows / WSL expect. The footer hint now reads… / search c copy q/Esc close. A status toast confirms success ("Pager content copied"), empty-body, or failure. notifytool (#1322) — model-callable desktop notification. Always-loaded (no ToolSearch round-trip). Routes through the existingtui::notificationsinfrastructure: OSC 9 on iTerm2 / Ghostty / WezTerm, BEL fallback on macOS / Linux,MessageBeepon Windows when explicitly opted in. Honours the user's[notifications].methodconfig — when set tooff, the tool is a silent no-op. Title and body are length-capped (80 / 200 chars) on character (not byte) boundaries to keep the OSC 9 escape clean and avoid mid-grapheme truncation. The tool description steers the model away from chatter: use only when a long-running task completes or genuinely needs the user's attention.
Fixed (cont.)
- Long output text overflowed the right edge (#1344, #1351) —
paragraph rendering (
render_line_with_links) and code-block wrapping (wrap_textforBlock::Code) were word-based: a single word wider than the available column was placed alone on a line and silently overflowed. Long URLs, paths, hashes, and no-whitespace CJK runs all hit this. Both paths now hard-break overlong words at the character level, matching the v0.8.25 fix for table cells. The rendered width is capped at the budget for every line; full content is preserved across wrapped segments. Snapshot-style tests pin the invariant at widths 40, 60, 80, and 120.
Changed
Ctrl+Cnow copies an active transcript selection (#1337) — on Windows, plainCtrl+Cis the OS-wide copy chord, and treating it as "exit" stole work whenever a user copy-pasted from the transcript.Ctrl+Cis now a four-stage decision: 1) selection active → copy + clear (matches the OS convention); 2) turn in flight → cancel (unchanged); 3) quit-armed within 2s → exit cleanly (unchanged); 4) idle, no selection → arm the 2-second "press Ctrl+C again to quit" prompt (unchanged). The decision is factored into aCtrlCDispositionhelper with a unit-tested priority table.Cmd+C(macOS) andCtrl+Shift+Ccontinue to copy unchanged.- Cancel-key discoverability hint on turn start (#1367) — when a turn begins, the status-message slot now surfaces "Press Esc or Ctrl+C to cancel" if the slot is otherwise empty. Real transient status messages still take precedence; the hint clears as soon as any other update fires. Closes the loop on users who didn't know how to interrupt a long-running turn.
- Lazy auto-reload of MCP pool on config-file change (#1267 part 2) —
v0.8.26 surfaced the underlying spawn errors; v0.8.27 closes the
loop on the second half of the report (manual
/mcp reloadafter~/.deepseek/mcp.jsonedits).McpPool::get_or_connectnow does a cheapstat+ content-hash check before each connection lookup. If the on-disk file's mtime moved AND its content hash changed since the pool was loaded, all live connections are dropped so the nextget_or_connectreattaches under the new config. Pool-construction viaMcpPool::new(tests, ad-hoc snapshots) is unaffected — only pools built withfrom_config_pathwatch the source file. No file watcher; no long-lived task. mtime-only churn (touched but byte-unchanged content) does not trigger a reload, so networked filesystems with coarse mtime granularity won't churn the pool. - Paste consolidation now happens at paste time, not submit time —
large bracketed pastes that exceed the 16 000-char safety cap are
now folded into a workspace
.deepseek/pastes/paste-…mdfile and swapped for an@-mention immediately on paste, instead of waiting until the user presses Enter. The user sees the@-mention in the composer (and the "consolidated → @mention" toast) before deciding whether to send, eliminating the "I pressed Enter and an@-mention appeared in the chat I didn't authorise" surprise. The submit-time consolidation remains as a safety net for any other code path that fills the buffer above the cap, so the cap is still enforced exactly once. - Auto-disable paste-burst once bracketed paste verified — the
rapid-keystroke paste-burst heuristic (default-on for terminals
without bracketed paste) used to keep running on every session.
Once a real
Event::Pastearrives in a session, paste-burst now short-circuits — bracketed paste is verified working, and running the heuristic alongside it just creates false positives on fast typing / IME commits / autocomplete bursts. Terminals that never deliver bracketed paste (the original target audience) are unaffected; the heuristic still fires there. - Short CJK multi-line paste no longer auto-submits first line
(#1302) — pasting
请联网搜索:\nSTM32 …(short non-ASCII first line followed by a newline) used to fail the paste-burst detection heuristic because the first line had no whitespace and was under the 16-char threshold; the trailing pasted newline then fell through as a real Enter and submitted the first line on its own. The heuristic now treats any non-ASCII run as paste-like, so the Enter is absorbed into the burst buffer. Thanks @reidliu41 (PR #1342). - Onboarding screens render in the selected language — when a
user picked 简体中文 / 日本語 / Português (Brasil) at the language
step, every subsequent screen (API key entry, workspace trust
prompt, final tips) used to remain in English. The
set_locale_from_onboardingpath now drives the title, body copy, hints, and footer of each onboarding screen through the localization table, so once you pick your language the rest of the flow is in that language. Particularly nice for users on CJK input methods who want to avoid IME juggling during setup. /skills <prefix>filters the local skills list (#1318) — on top of the v0.8.26 inter-row spacing (#1328 from @reidliu41), the list now narrows to skills whose names start with the typed prefix. Case-insensitive. The header reflects matched count vs registry total; an empty match set says so explicitly and points back at unfiltered/skills.--remoteandsyncstay reserved as subcommands; any---prefixed argument is rejected rather than being silently treated as a no-match prefix.- HTTP 400 quota errors retried (#1203) — some OpenAI-compatible
gateways return quota/rate-limit errors as HTTP 400 instead of 429.
These are now classified as retryable
RateLimitederrors. Thanks @dst1213. - Explicit hidden/ignored file completions (#1270) — when the user
types an explicit path starting with
.(e.g.,.deepseek/commands/), the file-completion system now surfaces hidden and gitignored entries while still respecting.deepseekignore. Thanks @SamhandsomeLee.
Changed
- Windows mouse capture docs (#1181) — the
--mouse-capturehelp text and the configuration docs now mention scrollbar dragging and note that raw terminal selection on Windows may cross the sidebar. Thanks @Oliver-ZPLiu. - README zh-CN sync (#1235) — the Chinese README's quickstart section
now shows
deepseek run pr <N>instead of the outdateddeepseek pr <N>. Thanks @whtis. - Tool output render perf (#1098) — tool output summaries and the
"is this a diff?" check are now pre-computed once at cell creation
instead of re-parsed every frame. Tool output cells also got a visual
card-rail (
╭ │ ╰) for clearer grouping. Thanks @lbcheng888.
Internal
- Test coverage for approval decision branches (@tuohai666, #1316)
- Test coverage for hook event dispatch paths (@tuohai666, #1317)
0.8.26 - 2026-05-09
A security + polish release. Two responsibly-disclosed issues were patched, plus a small batch of internal release-pipeline fixes. Big thanks to @JafarAkhondali and @47Cid for the disclosures.
Security
- Hardened the
fetch_urltool's network-target validation (GHSA-88gh-2526-gfrr). Thanks to @JafarAkhondali. - Tightened the default privileges of sub-agents created through
task_create(GHSA-72w5-pf8h-xfp4). Thanks to @47Cid.
Both items will have full advisory text once the GHSA entries are published.
Fixed
- Hint when root
base_urlis set with a non-DeepSeek provider (#1308) — config load now logs a warning telling the user to move the URL under the matching[providers.<name>]table or use the*_BASE_URLenv var. Closes the silent-ignore footgun for Ollama / vLLM / OpenAI-compatible setups. - Insecure base-URL error message is more discoverable (#1303) —
the rejection now spells out which env var to set (with underscores
visible), notes that loopback hosts are auto-allowed, and shows a
one-line
DEEPSEEK_ALLOW_INSECURE_HTTP=1 deepseekexample. - Workspace skills survive prompt truncation — when the skill catalog needs trimming to fit the prompt budget, workspace-local skills now keep precedence over global ones rather than being truncated indiscriminately. Thanks @hhhaiai.
/skillslisting has visual spacing between entries so long skill descriptions don't run together. Thanks @reidliu41.- Provider base-URL overrides reach the active provider — the
per-provider
*_BASE_URLenv vars (e.g.OPENAI_BASE_URL,OPENROUTER_BASE_URL) now propagate into the active provider's config entry consistently. Closes a gap where the override was parsed but never applied. Thanks @reidliu41. - WSL2 turn-start timeout —
TurnStartedis now emitted before the snapshot step so a slow snapshot on WSL2's/mnt/*volumes doesn't push past the runtime watchdog and surface a spurious "engine may have stopped" error. Thanks @michaeltse321. /initauto-adds.deepseek/to.gitignore(#1326) when the workspace is a git repo, so workspace-local snapshots, instructions, and pastes don't get accidentally committed. Idempotent on repeated runs. Thanks @Giggitycountless.- MCP tool ordering is deterministic — discovered tools and the resulting API tool block are now sorted by name so the prompt prefix the model sees is stable across runs, regardless of server-side pagination order. Improves prompt-cache hit rates with multi-server MCP setups. Thanks @hxy91819.
- Error cells render as plain text so env-var names (
API_KEY_FOO) in error messages keep their underscores instead of being parsed as markdown emphasis. Thanks @douglarek. /clearresets the Todos sidebar (#1258) — previously/clearonly reset the Plan panel; the Todos checklist persisted across clears. Thanks @Giggitycountless.- Drag-select past the viewport edge auto-scrolls (#1163, #1255,
#1292, #1298) — when the mouse drag reaches the top or bottom of
the transcript area the viewport now scrolls to follow the
selection, the way text editors do. Copy strips every visual-only
decoration glyph — tool-card rails (
╭│╰), transcript rails (▏), reasoning rails (╎), tool-status symbols (·•◦), and tool-family glyphs no longer leak into clipboard output. Thanks @Oliver-ZPLiu. - MCP stdio servers no longer discard stderr. The spawn site now pipes
stderr through a bounded ring buffer; when a server crashes
mid-session, the transport-closed error includes the captured stderr
tail instead of disappearing into
Stdio::null. Useful for debugging Node/Python MCP servers that fail well afterinitialize. - Mouse capture now defaults on inside Windows Terminal (#1169, #1298,
#1331). When
WT_SESSIONis set, in-app text selection is enabled by default and the wheel scrolls the transcript again (rather than the terminal interpreting wheel events as input-history keys). Legacy conhost stays opt-in via--mouse-captureor[tui] mouse_capture = trueto preserve the protections from #878 / #898. Selection now clamps to the transcript region instead of the terminal painting native selection across the sidebar. - The build script now invalidates its cache on
.git/HEADchanges, so the embedded short-SHA indeepseek --versionstays current after commits and branch switches without needingcargo clean. Both regular checkouts andgit worktreelayouts are handled. - The release-time
changelog_entry_exists_for_current_package_versiongate walks up from the crate manifest to findCHANGELOG.mdinstead of assuming a fixed../../CHANGELOG.mdlayout. The workspace path still resolves; running the suite from a packaged crate skips the gate quietly instead of panicking.
0.8.25 - 2026-05-09
A stabilization + drift-fixes release. Headline work hardens the
self-update path (no more curl shellout, real SHA-256 verification),
fixes long-cell truncation in markdown tables, centralizes the MCP
JSON-RPC framing, and unifies terminal-mode recovery on focus events.
Big thanks to Reid Liu (@reidliu41) (Streamable HTTP MCP transport,
/config column alignment), Duducoco (@Duducoco) (cache-stable
reasoning_content replay), jinpengxuan (@jinpengxuan) (provider
credentials during onboarding), heloanc (@heloanc) (Home/End cursor
keys), Wenjunyun123 (@Wenjunyun123) (docs anchor scroll), and
Liu-Vince (@Liu-Vince) (zh-Hans approval-dialog wording) for the
contributions below.
Added
- Streamable HTTP MCP endpoints with SSE fallback (#1300) — adds the third MCP transport alongside stdio and SSE. The new transport posts JSON-RPC over plain HTTP with optional SSE upgrade for servers that prefer streaming responses. Thanks Reid Liu (@reidliu41).
recall_archiveexposed in the parent agent registry — the read-only BM25 archive search tool was previously only available to sub-agents; it is now callable from Plan, Agent, and YOLO parent registries. Plan mode's read-only contract is preserved (the existing registry test was updated to assert membership while still rejecting write/exec tools).
Changed
- Markdown tables wrap long cells instead of truncating (#1163-adjacent)
— long cell content is word-wrapped within the column instead of
collapsing to
…. Column separators are preserved on every wrapped line so the table grid stays readable. - MCP JSON-RPC framing centralized — request/response correlation, timeout handling, and message framing now live above the byte-level transports. Stdio, SSE, and the new Streamable HTTP transport share a single protocol layer instead of each maintaining its own copy of the framing code.
- Self-update is curl-free and verifies SHA-256 —
deepseek updateno longer shells out to systemcurl(and no longer needs the Schannel--ssl-no-revokeWindows hack from v0.8.23). Downloads now usereqwest::blockingwith rustls, and the aggregateddeepseek-artifacts-sha256.txtmanifest is parsed and checked against each downloaded asset before it is installed. Verification status is surfaced in the update output. - Terminal-mode recovery unified in
recover_terminal_modes()— startup,FocusGained, andresume_terminalall route through one idempotent helper that re-establishes keyboard enhancement flags, mouse capture, bracketed paste, and focus events. Adding a new mode flag now only has to happen in one place.
Fixed
reasoning_contentreplay stable for prompt cache (#1297) — reasoning text replayed from saved sessions now hashes consistently across turns so the cache-aware prompt builder's static-prefix stability isn't broken by replays. Thanks Duducoco (@Duducoco).- Active provider credentials respected during onboarding (#1265) — the onboarding flow now reads credentials from the active provider instead of falling back to the default DeepSeek path when another provider is selected. Thanks jinpengxuan (@jinpengxuan).
- Home/End keys move the input cursor (#1246) — Home and End now jump the composer cursor to line start/end instead of being swallowed. Thanks heloanc (@heloanc).
- Docs anchor scroll-margin overrideable (#1282) — the scroll-margin offset on docs anchors is now overrideable so embedded contexts can adjust it without forking the stylesheet. Thanks Wenjunyun123 (@Wenjunyun123).
/configview columns aligned (#1290) — the/configtable now sizes the key column from the actual data instead of a fixed width, so long keys no longer overflow into the value column. Thanks Reid Liu (@reidliu41).- zh-Hans approval dialog wording (#1274) — uses 终止 (terminate) instead of 中止 (abort) in the Chinese approval dialog, matching the English semantics. Thanks Liu-Vince (@Liu-Vince).
Removed
- Unwired
[context.per_model]config field — the field had no runtime consumer and was only present in the config schema. Removed to keep the schema honest. Existing configs that still contain a[context.per_model.*]table continue to load (serde ignores unknown keys; covered by a regression test). - Stale aspirational
[cycle.per_model]comments — reference to a config table that was never wired. No behavior change.
Documentation
.claude/CODEMAP_v0.8.25_dead_code.md— committed the cycle/seam/coherence/capacity codemap with a softenedcycle_managerclassification: live by code trace, design load-bearing, practical load-bearing unproven. Use this to decide the v0.8.26+ product direction for the cycle/seam/capacity subsystems.
Known issues
- Windows 10 conhost flicker regression (#1260, #1251) — v0.8.22-and-later content flickering on Windows 10 is still present. The viewport-reset escape sequence added in v0.8.22 needs a Windows guard. Deferred to v0.8.26.
- Snapshot system still snapshots every turn — the v0.8.24 500 MB hard cap protects against blowups, but the underlying design still snapshots on every turn regardless of whether the workspace changed. A write-aware skip is planned for v0.8.26.
▏glyph leak in code blocks (#1212), mouse selection crossing the sidebar (#1169), drag-select edge auto-scroll (#1163), mid-run MCP server stderr capture — all deferred to v0.8.26.
0.8.24 - 2026-05-09
A bugfix + refactor release picking up the backlog after the v0.8.23 security
release. Big thanks to wplll (cache-aware prompt + /cache inspect),
Liu-Vince (MCP pagination diagnosis), @Giggitycountless (snapshot cap
proposal), and to issue reporters @SamhandsomeLee,
@barjatiyasaurabh, @tyculw, @hongyuatcufe, and @ljlbit for
the bugs fixed below.
Fixed
- Mouse-wheel scroll survives focus toggles — on macOS, switching away
(Cmd+Tab, opening the screenshot tool, etc.) and back can drop the
terminal's mouse-tracking mode, leaving wheel scroll dead until restart.
The TUI now re-arms
EnableMouseCaptureonFocusGainedalongside the existing keyboard-mode recapture, so wheel events keep flowing after a focus round-trip. - Workspace-local slash commands are now loaded (#1259) — user command
files placed in
<workspace>/.deepseek/commands/,<workspace>/.claude/commands/, and<workspace>/.cursor/commands/are now discovered alongside the existing global~/.deepseek/commands/. Workspace-local commands shadow global by name, matching the precedence model already used for skills. Reported by @SamhandsomeLee. @-mention completion finds AI-tool dot-directories — files inside.deepseek/,.cursor/,.claude/, and.agents/are now discoverable in@-mention Tab-completion even when those directories are excluded by.gitignore. The fix also applies to the Ctrl+P file picker and fuzzy file resolution.- MCP paginated discovery (#1250, #1256) — tools, resources, resource
templates, and prompts from MCP servers that paginate their responses
(e.g., gbrain at 5 items per page) are now fully discovered by following
the MCP spec's
nextCursoracross all pages. Reported by @hongyuatcufe; thanks to Liu-Vince for the diagnosis and PR #1256 with the same fix shape. - Snapshot storage has a disk-space cap (#1112) — the snapshot side repo now enforces a 500 MB hard limit. When the limit is exceeded at snapshot time, the oldest snapshots are pruned aggressively to stay under a 400 MB target. Guards against the reported 1.2 TB snapshot blowup during high-churn sessions. Reported by @tyculw; thanks to @Giggitycountless for the PR #1131 proposal that informed the hard-cap approach.
/clearnow resets the Todos sidebar (#1258) — previously/clearonly reset the Plan panel; the Todos checklist persisted across clears until app restart. The fix ensuresclear_todos()clears theSharedTodoListinner state. Reported by @barjatiyasaurabh.
Added
-
Cache-aware prompt diagnostics + payload optimization (#1196) — adds a
PromptBuilderthat classifies the system prompt intostatic/history/dynamiclayers for cache-prefix stability, plus:/cache inspect— shows SHA-256 hashes per layer, base static prefix hash vs full request prefix hash, static-prefix stability across turns, and first-divergence tracking. Does not print prompt text./cache warmup— prefetches the stable prefix to seed the DeepSeek context cache.- Project Context Pack injected into the stable prefix by default
— a structured workspace summary (directory listing up to 4 levels /
400 entries, README excerpt up to 4 KB, config + key source file
lists). Adds ~1–10 KB to every prompt depending on repo size, in
exchange for a much more cacheable prefix. Default ON; disable
with
[context] project_pack = falsein~/.deepseek/config.tomlif you'd rather keep prompts minimal. - Wire-payload optimization: large tool outputs are budgeted, repeated
identical tool outputs and
<turn_meta>blocks are deduplicated with stable refs (wire-only — local session messages stay intact). - Footer cache-hit % chip from
prompt_cache_hit_tokens/prompt_cache_miss_tokensin the API response.
Thanks wplll for the design and implementation.
Changed
- Language directive strengthened against project-context bias (#1118) — the system prompt now explicitly instructs the model that project context (AGENTS.md, auto-generated instructions, file trees) is NOT a language signal. Chinese filenames in a repo no longer bias the model toward Chinese replies when the user writes in English. Reported by @ljlbit.
Known issues
- Windows flicker/shake regression (#1260, #1251) — v0.8.22 and v0.8.23
exhibit content flickering on Windows 10 (v0.8.20 works correctly). The
issue is likely caused by the viewport-reset escape sequence
(
\x1b[r\x1b[?6l\x1b[H\x1b[2J\x1b[3J) added in v0.8.22 to fix viewport drift. On Windows conhost, this sequence may trigger a full screen clear on every repaint. A platform guard or less aggressive sequence is needed.
0.8.23 - 2026-05-08
A security-focused follow-up to v0.8.22. The bulk of the diff is hardening of the child-process surface — shells, MCP stdio servers, and other spawned subprocesses — plus a related set of MCP, secret-store, and tool-policy fixes uncovered during follow-up review.
Security
- Sanitized child-process environments - shells, MCP stdio servers, hooks,
and other child processes spawned from the TUI now start from an explicit
allowlist of parent environment variables rather than inheriting every
parent var. The base allowlist covers
PATH,HOME,USER,LANG/LC_*,TERM/COLORTERM,SHELL,TMPDIR/TMP/TEMP, and the corresponding Windows variables. Stops casual exfiltration of*_API_KEY,AWS_*,GITHUB_TOKEN, and similar through a spawned subprocess. - Tighter shell safety classification - the
exec_shelldeny-list was reviewed and broadened to cover additional dangerous command patterns. - Plan mode tool surface narrowed - planning sub-agents see a smaller, read-only tool surface so a plan-mode call can no longer mutate workspace state.
- Sub-agent approval boundaries preserved - sub-agents inherit the parent's approval policy and cannot escalate beyond it.
- Symlinked workspace walks no longer followed - workspace-relative walkers (file-search, project context) now refuse to traverse symlinks pointing outside the workspace root.
- Path and output handling tightened - several tools that build paths
from model output now reject
..segments and absolute paths outside the workspace. - Runtime API requires authentication by default -
deepseek serve --httpno longer accepts unauthenticated requests in its default configuration. - Security-sensitive dependencies bumped - routine bump pass for crates with recent advisories.
- MCP config paths reject traversal -
load_config/save_confignow refuse paths containing..components. - Hardened
run_testsapproval policy. Thanks to @47Cid for the responsible disclosure.
Fixed
- macOS Keychain prompt at startup - the file-backed secret store is now
the default. The OS keyring is opt-in via
DEEPSEEK_SECRET_BACKEND=system|keyring, and the auth status surface refers to "secret store" rather than "keyring" where appropriate. - MCP stdio spawn errors are now visible (#1244) - when spawning a stdio
MCP server fails (e.g.,
npxnot onPATH), the underlying OS error is now shown ("No such file or directory (os error 2)") instead of the opaque wrapper "MCP stdio spawn failed (...)". The fix applies to the snapshot, themcp connect/mcp validateCLI commands, and the in-TUI status events. - MCP servers no longer break under env scrub (#1244) - MCP stdio launches
now inherit a wider env allowlist than arbitrary shell tools, so common
npx ...,uvx ...,python -m mcp_server_*, and proxy-bound corporate setups keep working under the new child-env scrub. Pass-through includesNVM_DIR,NODE_OPTIONS,NODE_PATH,NODE_EXTRA_CA_CERTS,NPM_CONFIG_*,VOLTA_HOME,COREPACK_HOME,PYTHONPATH,PYTHONHOME,VIRTUAL_ENV,PIPX_*,POETRY_HOME,UV_*,GEM_*,BUNDLE_*,JAVA_HOME,HTTP_PROXY/HTTPS_PROXY/NO_PROXY/ALL_PROXY/FTP_PROXY(case-insensitive),SSL_CERT_FILE,SSL_CERT_DIR,REQUESTS_CA_BUNDLE,CURL_CA_BUNDLE. Secret-bearing parent env stays scrubbed.
Changed
- Live thinking is compact by default - the streaming "thinking" panel collapses by default; expand via the existing details toggle.
Added
docs/RELEASE_CHECKLIST.md- explicit pre-tag checklist (CHANGELOG, versions, preflight, npm wrapper smoke) so the v0.8.21/v0.8.22 CHANGELOG gap does not recur.
Known issues
- Mid-run MCP server stderr is still suppressed - if a stdio MCP server
spawns successfully but exits later (e.g., crashes during
initialize), its stderr is not yet captured. Spawn-time OS errors (the most common case from #1244) are visible. Full mid-run stderr capture is planned for v0.8.24.
0.8.22 - 2026-05-08
A focused security release.
Security
- Hardened
fetch_urlredirect handling. Thanks to @47Cid for the responsible disclosure.
0.8.21 - 2026-05-08
A community-heavy release rolling up two weeks of contributor PRs across the TUI, runtime, and docs. Big thanks to Reid (@reidliu41), jiaren wang (@JiarenWang), Friende (@pengyou200902), ZzzPL (@Oliver-ZPLiu), Sun, Liu-Vince, kitty, and Aqil Aziz for the contributions below.
Added
- Distinct user-message body color (#1168) - user turns now render in a green body color so the conversation flow is easier to scan at a glance.
Fixed
- Plan mode enforces read-only tool boundaries (#1114) - planning calls can no longer reach into write-side tools. Thanks jiaren wang.
- Composer arrow keys navigate input history (#1117) - up/down in the composer cycles through prior prompts when the cursor is on the first/last line. Thanks Reid.
- RLM preserves prompt cache usage (#1127) - the RLM batch path no longer resets prompt-cache hits between calls. Thanks Sun.
fetch_urlproxy DNS opt-in (#1103) - the proxy DNS path is now opt-in rather than always forced, fixing breakage in environments where the proxy cannot resolve the target host. Thanks Sun.- Undo syncs session context after snapshot restore (#1150, fixes #1139) - rolling back a turn now correctly resyncs the in-memory session so a follow-up turn doesn't see stale context. Thanks jiaren wang.
- Stale busy-state watchdog (#1170) - the TUI now recovers if the busy indicator gets stuck after an aborted turn. Thanks ZzzPL.
ghdiscovered across common install paths - theghtool is found whether installed via Homebrew, apt, the Windows MSI, or the GitHub CLI installer. Thanks kitty.- Code block indentation preserved in transcript - leading whitespace inside fenced code blocks is no longer collapsed during rendering. Thanks Liu-Vince.
- Stream pacing preserves upstream cadence - long streaming responses no longer chunk together when the upstream is bursty. Thanks Sun.
- Task list output gets headers - the long-form
/tasksoutput now has group headers so it scans cleanly. Thanks Reid. - macOS option-V details shortcut - the details toggle now works correctly
on US Mac keyboards where Option+V produces
√. - Uppercase approval shortcuts accepted -
[A]/[D]/[V]work in either case in the approval dialog. - Transcript scrollbar inert - the transcript scrollbar no longer captures clicks intended for content below it.
- Hide transcript rail before code blocks - the rail glyph no longer bleeds onto the line just above a fenced code block.
- Pager exit hint prominent - the "press q to exit" hint is now visible on the pager footer.
- Empty tool call names fall back to a placeholder - a model that returns
an empty
function.namein a tool call no longer hangs the turn. - MCP SSE waits for endpoint before connect returns (#1225) - the SSE transport no longer reports "connected" before the endpoint event has been received, fixing a race where the first request was lost.
- Git branch status item renders (#1226, fixes #1217) - the
StatusItem::GitBranchtoggle now produces a footer entry instead of a blank slot. - Beta endpoint routes non-beta paths to v1 (#1174) - paths that aren't available on the DeepSeek beta host are transparently redirected to the v1 host instead of failing.
- Skill packs accept workflow-pack archive layouts (#1164) - skill archives produced by the workflow pack tool now install correctly.
- Interactive sessions stay in alternate screen (#1158) - returning from a sub-process no longer kicks the TUI back to the primary screen mid-turn.
- Slash-menu arrow navigation wraps (#1152) - up at the top / down at the bottom of the slash menu wraps to the other end.
- CLI preserves split prompt words from Windows shims (#1160) - prompt arguments forwarded by the npm wrapper on Windows are no longer joined into one giant token.
libcextended to all Unix targets (#1173) - improves FreeBSD build compatibility.- Memory truncation marker reports omitted bytes - the
[…N bytes omitted]marker now shows an accurate count. Thanks Friende.
Docs
- Memory skill link (#1096) - corrected. Thanks Aqil Aziz.
- Help keybinding reference (#1095) - corrected. Thanks Friende.
- Additional environment variables documented in the config reference. Thanks Liu-Vince.
- Docker volume guidance - the install snippet now uses a writable named data volume rather than a bind mount that may be read-only on some hosts.
- Competitive analysis reflects LSP diagnostics (#1171) - the doc now matches the shipping LSP diagnostics implementation.
- Dispatcher path for
/run-pr(#1227) - the README now points at the dispatcher binary.
0.8.20 - 2026-05-08
Added
- Global AGENTS.md fallback - when a workspace and its parents do not
provide project instructions, DeepSeek TUI now loads
~/.deepseek/AGENTS.mdbefore falling back to auto-generated.deepseek/instructions.md, keeping repo-local instructions higher priority while supporting shared defaults.
Fixed
- Chinese reasoning stays Chinese - restore the #588 language contract after
the deterministic environment prompt regressed it. The latest user message now
chooses the natural language for both
reasoning_contentand the final reply; the resolvedlangfield is only a fallback when the user turn is ambiguous.
0.8.19 - 2026-05-08
Fixed
- DeepSeek beta endpoint stays default for Chinese locales - the legacy
deepseek-cnruntime path no longer routes users to the non-betahttps://api.deepseek.combase URL. It is now a backwards-compatible alias for the normaldeepseekprovider default,https://api.deepseek.com/beta, so strict tool mode and other beta-gated features stay available worldwide. - Provider docs stop advertising
deepseek-cnas a separate provider - runtime docs now describe it only as a legacy config alias. DeepSeek uses the same official host worldwide; users with private mirrors should setbase_urlexplicitly.
0.8.18 - 2026-05-07
This is the v0.8.17 follow-up release: a tighter TUI/runtime/install pass with safer session startup semantics, Docker images promoted to a supported install path, and several community PRs harvested into the release branch. VS Code and Feishu/Lark/mobile companion work remain out of scope for this release.
Added
- Prebuilt Docker images on GHCR - release builds now publish
ghcr.io/hmbown/deepseek-tuiwithlatest, semver, andvX.Y.Ztags, and the GitHub release notes include a Docker install snippet. Docker publishing is now a release gate rather than a best-effort check. - Draggable transcript scrollbar (#1075, #1076) - when mouse capture is enabled, drag the transcript scrollbar thumb to move through long sessions. The implementation also clears stale drag state on resize and new left-clicks. Thanks @Oliver-ZPLiu.
- PTY regression for viewport drift (#1085) - the QA harness now covers the blank-top-rows failure after a failed/long turn so future layout changes catch terminal viewport drift.
Changed
- Plain
deepseekstarts a fresh session - opening a seconddeepseekin the same folder no longer silently attaches to the same in-flight checkpoint. Crash/interrupted checkpoints are preserved as saved sessions and recovered explicitly throughdeepseek --continue. - npm postinstall is recoverable for transient download failures (#1059) - install-time GitHub download/extract failures are non-blocking and documented, while unsupported platforms, checksum mismatches, glibc preflight failures, and runtime wrapper failures remain fatal. Thanks @Fire-dtx.
- Docker Buildx cargo caches are platform-isolated and locked - registry,
git, and target caches now use platform-specific cache IDs plus locked
sharing to avoid the
.cargo-ok File existsunpack race in release checks. - Long-session palette is easier to read (#1070, #936 partial) - default
body text is slightly softer, reasoning/thinking text uses a warmer accent,
and
/themenow updates the terminal color adapter so light mode keeps those contrasts coherent after an in-session toggle. Thanks @bevis-wong and @oooyuy92 for the readability reports. - Install docs add a second rustup mirror fallback (#1011) -
rsproxy.cnis documented as an alternate rustup mirror, and old Debian/Ubuntu Cargoedition2024failures now point users to rustup stable. Thanks @wuwuzhijing.
Fixed
- Chinese destructive approval dialogs keep explicit risk wording (#1087, #1091) - zh-Hans destructive approval copy now localizes the operation label, title, prompt, and destructive-risk warning without changing English default behavior. Thanks @qinxianyuzou and @axobase001.
- Terminal viewport is reset before repaint (#1085) - the TUI now clears scroll margins/origin mode before key repaints after resume, resize, and turn completion, preventing alt-screen content from drifting downward and leaving blank rows at the top.
- Interactive subprocesses wait for terminal release (#1085) - shell/editor handoff now waits until the UI has actually left alt-screen/raw mode before launching the child process, preventing the TUI from repainting into host scrollback after interactive tool use.
- Light theme reasoning blocks stay light (#1070, #936 partial) -
thinking/reasoning background tints now map to the light reasoning surface
instead of keeping the dark-mode tint after
/theme light. - FreeBSD can compile the secrets crate (#1089) - platforms without a native
keyringdependency now fail the OS-keyring probe cleanly and fall back to the file-backed secret store instead of referencing a missing crate. Thanks @avysk for the FreeBSD report. - Windows sandbox docs no longer overstate guarantees (#1015, #1058) - the docs and code comments now describe the future Windows helper as process-tree containment only until filesystem, network, registry, or AppContainer isolation is actually implemented. Thanks @axobase001.
0.8.17 - 2026-05-07
A focused reliability release built almost entirely from community contributions.
Fixes Plan-mode safety, paste-Enter auto-submit, slash-menu skills coverage, the
deepseek-cn endpoint preset, and a handful of platform / streaming /
gateway-compatibility issues. Also lands a small PTY-driven QA harness so the
next round of TUI fixes can be verified against real terminal behaviour.
Added
/themecommand (#1057) — toggle between dark and light themes inline, without round-tripping through/config. Thanks @MengZ-super.- PTY/frame-capture TUI QA harness — new
crates/tui/tests/support/qa_harness/lets integration tests spawndeepseek-tuiin a real pseudo-terminal, send scripted keys / paste / resize, and assert on the parsed terminal frame plus the workspace filesystem. Initial scenarios cover boot smoke and the #1073 paste regression. Adding-a-scenario walkthrough lives incrates/tui/tests/support/qa_harness/README.md. - Whalescale desktop runtime bridge — the local runtime API now exposes
POST /v1/approvals/{id},GET /v1/runtime/info,enabledflags onGET /v1/skills, andPOST /v1/skills/{name}toggles. Runtime thread events also carryagent_reasoningitems so desktop clients can render thinking separately from assistant text.
Changed
deepseek-cnprovider preset now defaults to the officialhttps://api.deepseek.comhost (#1079, #1084) — matches api-docs.deepseek.com. The legacy typo hostapi.deepseeki.comis still recognized in URL heuristics and chat-client normalization so existing user configs keep working. Thanks @Jefsky.- Plan mode runs shell commands in a read-only sandbox (#1077) — was
WorkspaceWritewith the workspace as a writable root, which letpython -c "open('f','w').write('x')"mutate files inside the workspace. NowSandboxPolicy::ReadOnly: no writes anywhere on the filesystem, no network. Read-only inspection commands (ls,git log,grep,cargo metadata, …) keep working through the per-platform sandbox; for anything that creates or modifies files, switch to Agent mode (/agent). Thanks @DI-HUO-MING-YI.
Fixed
- Pasting multi-line text with a trailing newline no longer auto-submits
(#1073) — the composer's Enter handler now consults the paste-burst
suppression state and either appends
\nto the in-flight burst buffer or inserts it into the composer text directly, instead of falling through tosubmit_input(). Reproduced from the original Windows / PowerShell symptom; fix covers both the bracketed-paste and rapid-keystroke detection paths. Thanks @bevis-wong for the precise reproducer. - Slash menu,
/skills, and/skill <name>show project-local AND global skills (#1068, #1083) — switched the cache todiscover_in_workspace, so the UI surfaces stay in sync with the system-prompt skills block. Bonus fix:SKILL.mdfrontmatter values are now stripped of surrounding YAML quotes, soname: "hud"registers ashudand matches prefix lookup. Thanks @AlphaGogoo / @Duducoco. - Windows shell output is decoded as UTF-8 even on non-UTF-8 system code
pages (#982, #1018) — Windows shell commands are now wrapped with
chcp 65001 >NUL &so subprocesses output UTF-8 instead of GBK / other ANSI code pages.display_commandstrips the prefix so transcripts and approval prompts stay clean. Thanks @chnjames. - Stale snapshot
tmp_pack_*files are cleaned up on startup (#975, #1055) — interrupted side-repo git pack operations no longer leak orphaned temp files;prune_unreachable_objectsruns during the regular prune cycle to drop loose objects from rolled-back snapshots. Closes the ~30 GB+ disk-usage report. Thanks @axobase001. - Window-resize artifacts on macOS Terminal.app and Windows ConHost are
gone (#993) — forces the resize-event size during the post-resize draw
so ratatui's internal
autoresize()cannot shrink the viewport back to a stale dimension and leave the newly-expanded area filled with stale content. Same class as #582 for additional emulator families. Thanks @ArronAI007. - Streaming thinking blocks finalize cleanly on stream errors and
restarts (#861 partial, #1078) — the engine-error handler now drains
the in-flight thinking block into the transcript instead of leaving the
partial reasoning orphaned in
StreamingState. Refactor extracts the thinking lifecycle into named helpers (start_streaming_thinking_block,finalize_current_streaming_thinking,stash_reasoning_buffer_into_last_reasoning). Thanks @reidliu41. - OpenRouter and other custom-endpoint providers preserve explicit model
IDs (#1066) — when a provider has an explicit model AND a custom
base_url(different from the provider default), the model name is no longer rewritten by provider-specific normalization. Lets OpenAI-compatible gateways accept bare IDs likedeepseek/deepseek-v4-pro,accounts/fireworks/models/..., orglm-5. Thanks @THINKER-ONLY. - Auto-generated
.deepseek/instructions.mdstabilizes the KV prefix cache (#1080) — replaces the per-turn filesystem-scan fallback inprompts.rswith a real on-disk artifact when no context file exists, so the system prompt's prefix stays byte-stable across turns and prefix-cache hit-rate improves. The auto-generated file is plainly labelled and the user can edit or delete it freely. Thanks @lloydzhou. - SSE responses behind compressing gateways decode correctly (#1061) —
enables reqwest's
gzipandbrotlifeatures so streams through proxies that compress the response come through clean instead of as protocol corruption. Quiets one of the failure modes behind some "stuck working" reports. Thanks @MengZ-super. - NVIDIA NIM provider configs use their own API key even when a legacy
root DeepSeek key is present (#1081) —
[providers.nvidia_nim] api_keynow wins for NIM requests, avoiding 401s caused by accidentally sending the top-level DeepSeek credential to NVIDIA. Thanks @wlon for the focused diagnosis. - npm installs explain the release-mirror escape hatch when GitHub Releases
are blocked (#1051, #1056) — network/DNS failures now point at the
existing
DEEPSEEK_TUI_RELEASE_BASE_URLoverride and the required checksum manifest / binary layout instead of stopping at a rawENOTFOUND github.com. Thanks @axobase001.
Notes for contributors
This release shifts the project's PR-handling philosophy: every contribution
has value somewhere; the maintainer's job is to find it, use it, and credit
the contributor — never to close a PR with nothing taken. If a PR is too
large or scope-mixed to merge whole, useful commits / files / ideas are
harvested directly rather than asking the contributor to split it. Trust
boundary on credentials, sandbox, providers, publishing, telemetry,
sponsorship, branding, and global prompts still requires explicit
maintainer sign-off, but the burden of getting there is on us. See
AGENTS.md for the full text.
0.8.16 - 2026-05-07
A focused hotfix for v0.8.15 regressions in RLM, sub-agent visibility, and terminal ownership. This release keeps the v0.8.15 feature set intact while making long-running delegated work easier to inspect and safer to run.
Changed
- RLM has no fixed 180s wall-clock timeout (#955) — RLM turns can continue past the old hard limit when the long-input REPL is still making progress.
- RLM output is easier to audit (#955) — final reports now include compact execution metadata: input size, iteration count, elapsed time, sub-LLM RPC count, and termination state.
- RLM chunking guidance is stricter for exact work (#955) — prompts now
tell the sub-agent to use deterministic Python over the full
contextfor counts/aggregation and to report chunk coverage when splitting a whole input. - Tool guidance is less defensive (#955) — the system prompt now explains when to use tools instead of discouraging the model from using capabilities that are actually available.
Fixed
- Active RLM work stays visible (#955) — foreground RLM calls surface in the
active task/right-rail state instead of leaving the Tasks panel saying
No active tasks. /subagentsno longer reports false emptiness (#955) — the sub-agent overlay now includes live progress-only agents and transcript fanout workers when the manager cache has not refreshed yet.- Sub-agent cards are quieter and more useful (#955) — low-signal scheduler
lines such as
step 1/100: requesting model responseare hidden, while compact tool activity remains visible. - Sub-agent completion protocol stays internal (#955) — completion sentinels are routed as internal runtime events instead of user messages, so the parent agent does not explain raw protocol XML back to the user.
- Sub-agents cannot take over the parent terminal (#955) — background
agents reject
exec_shellwithinteractive=true; they can still use non-interactive shell, background shell,tty=true, and task-shell tools. - Terminal scrollback ownership is restored (#955) — the TUI re-enters alternate-screen mode after foreground/sub-agent work drains, preventing the host terminal scrollbar from taking over the live interface.
0.8.15 - 2026-05-06
An auth, Windows, editor-integration, and setup stabilization release. This release keeps the existing DeepSeek V4 architecture intact while landing small community fixes that make first-run setup, terminal behavior, skills, cost display, and recovery paths easier to trust.
Added
- ACP stdio adapter for Zed/custom agents (#782) —
deepseek serve --acpstarts a local Agent Client Protocol server over stdio. The first slice supports new sessions and prompt responses through the user's existing DeepSeek config/API key; tool-backed editing and checkpoint replay remain outside the ACP surface for now. - Yuan/CNY cost display (#806) —
cost_currency = "cny"(also acceptsyuan/rmb) switches footer, context panel,/cost,/tokens, and long-turn notification summaries from USD to CNY. - Slash autocomplete for skills (#808) — installed skills are visible in the slash-command autocomplete menu.
/renamesession titles (#836) — sessions can be renamed without editing save files manually.
Changed
- Current local date in turn metadata (#893, closes #865) — real user turns
now include the current local date in
<turn_meta>, without changing the stable system prompt/cache prefix. - Doctor endpoint diagnostics (#823) —
deepseek doctorshows the resolved provider/API endpoint to make proxy, China endpoint, and inherited-env debugging more concrete. - More conservative request sizing (#826) — API requests cap
max_tokensagainst the active model/context budget before dispatch. - Safer config and secret file writes (#833, #837) — generated config files use restrictive permissions and improved secret redaction.
Fixed
- Env-only API key failure recovery (#892) — runtime auth failures now say
when the rejected key came from inherited
DEEPSEEK_API_KEYand no saved config key is present, matching the clearerdeepseek doctorguidance. - Windows Unicode output (#887, closes #872) — TUI startup now best-effort switches the Windows console input/output codepages to UTF-8, improving Chinese and other non-ASCII rendering.
- Windows resume picker (#886, closes #866) — the dispatcher keeps the resume picker path on Windows instead of bypassing it.
- Windows clipboard fallback (#850) — copy operations have a fallback path when the primary clipboard backend is unavailable.
- Workspace trust persistence (#870) — approval/trust choices persist in global config instead of surprising users on the next launch.
- Ctrl+E composer behavior (#883, closes #876) — plain Ctrl+E moves to the end of the composer again; file-tree toggling moved to the shifted shortcut.
- Plain Markdown skills (#869) —
SKILL.mdfiles without frontmatter now fall back to the first# Headinginstead of being ignored. - Workspace-scoped latest resume (#830, closes #779) —
resume --last,--continue, and fork/resume helpers choose the latest session for the current workspace/repo rather than the newest saved session globally. - Npm wrapper version fallback (#885) —
deepseek --version/-vcan report the package version when the native binary has not been downloaded yet. - TUI exit resume hint (#863, closes #682) — exiting the TUI now points users toward the relevant resume command.
- Startup and terminal reliability — includes bounded stream-open waits
(#847), cursor-lag reduction for
@mentions (#849), OSC52 clipboard fallback for SSH (#845), legacy Ctrl+V paste recognition (#786), Windows mouse capture defaulting off (#785), and UTF-8-preserving ANSI stripping (#784). - Install and policy reliability — avoids unstable Rust file-locking APIs
(#821), enforces network policy in
web_run(#800), fixes repeated setup language prompts after API-key setup (#844), and explains dispatcher TUI spawn failures (#853). - Workspace safety — refuses dangerous snapshots for
$HOMEor unsafe workspaces (#798, #804), fixes path-escape false positives for double-dots in names (#824), scopes snapshot built-in excludes (#854), and replaces providerunreachable!()paths with proper errors (#835). - Skills discovery — recursively reads the skills directory (#811), ignores
symlinks outside the selected install root (#814), discovers global Agents
skills (#848), and includes
.cursor/skills(#817). - Provider/model compatibility — restores auto model routing (#772), completes vLLM provider integration (#737), accepts provider-prefixed DeepSeek model IDs (#794), preserves requested model ID casing (#733), and pins RLM child calls to Flash (#832).
Thanks
- Thanks to @reidliu41 for the resume hint and workspace trust fixes (#863, #870).
- Thanks to @Oliver-ZPLiu for the Windows clipboard fallback (#850).
- Thanks to @xieshutao for the plain Markdown skill fallback (#869).
- Thanks to @GK012 for the npm wrapper version fallback (#885).
- Thanks to everyone filing Windows, Chinese-language setup, auth, and first-run reports. Those concrete reproductions shaped the release.
0.8.13 - 2026-05-05
A stabilization release for DeepSeek V4 runtime and TUI reliability. The v0.8.13 milestone was narrowed to direct runtime/TUI fixes; prompt hygiene, trajectory logging, Anthropic-wire support, and larger UI cleanup were moved out of this release.
Added
- No-LLM tool-result prune before compaction (#710) — old verbose tool results are mechanically summarized before the paid summary pass. Duplicate reads keep the freshest full body and replace older copies with one-line summaries; if that gets the session back under the compaction threshold, the LLM summary call is skipped entirely.
- Repeated-tool anti-loop guard (#714) — the engine now tracks
(tool_name, args)pairs per user turn. On the third identical call it inserts a synthetic corrective tool result instead of running the same tool again unchanged; per-tool failures warn at three and halt at eight. - V4 cache-hit telemetry fallback (#721) — usage parsing now recognizes
usage.prompt_tokens_details.cached_tokens, so the existing footer cache-hit chip works with DeepSeek V4's automatic prefix-cache telemetry as well as the older explicit hit/miss fields.
Fixed
- Invalid tool-call JSON repair (#712) — malformed streamed tool arguments now pass through a deterministic repair ladder before dispatch.
- Hallucinated tool-name recovery (#713) — common non-canonical tool names are resolved through the registry before the engine reports a missing tool.
- Tool-schema sanitation (#715) — schemas are normalized before API emission so provider-strict JSON Schema handling does not reject valid tools.
- Case-sensitive model IDs (#717, #729) — valid configured model IDs keep caller-provided case while compact DeepSeek aliases still canonicalize.
- Stale
working...state after failed dispatch (#738) — if the UI fails to send a message to the engine before a turn starts, the composer loading state is cleared instead of trapping later input in pending state. - Prompt-free doctor key checks —
deepseek doctorno longer reads the OS keyring, avoiding macOS Keychain prompts during diagnostics. - macOS Terminal color compatibility —
xterm-256colorsessions now receive 256-color palette indexes instead of truecolor SGR, preventing Apple Terminal from misrendering whale blues as green/cyan blocks. - Chat client repair after Responses cleanup — restored the chat client body and regression coverage after removing the dead experimental Responses fallback path.
- Up/Down arrow transcript scroll when composer is empty — bare Up/Down arrows now scroll the transcript when the composer input is empty (or whitespace-only); with text present they still navigate composer history. Previously the gate was hardcoded to false, leaving users in virtual terminals (Ghostty, Codex, Kitty-protocol) unable to scroll without modifier shortcuts.
0.8.11 - 2026-05-04
Changed
- Cache-maxing prompt path for DeepSeek V4 — the engine now skips system-prompt reassignment when the assembled stable prompt is unchanged, keeps the volatile repo working-set summary out of the system prompt, and injects it as per-turn metadata on the latest user message instead.
- Tool catalog cache anchor — the model-visible tool array now marks
the final native tool with
cache_control: ephemeralso DeepSeek can anchor the stable tool prefix explicitly. - V4-scale automatic compaction defaults — automatic compaction keeps a 500K-token hard floor and the fallback compaction threshold now reflects the V4-scale late-trigger policy instead of the old 50K-era default.
- Token-only compaction trigger — the message-count compaction trigger
was a 128K-era heuristic that fired on long sessions of small messages
— exactly the case where rewriting V4's prefix cache is most wasteful.
Removed
CompactionConfig::message_thresholdand the message-count branch inshould_compact; token budget is now the sole automatic trigger (gated by the 500K floor). Manual/compactis unchanged.
Fixed
- Legacy 128K context naming — the 128K fallback is now named and documented as legacy DeepSeek-only behavior, reducing ambiguity with the 1M-token DeepSeek V4 defaults.
npm installresilience for slow / firewalled networks — the postinstall binary fetch from GitHub Releases now retries on transient errors (5 attempts, 1-16 s exponential backoff with jitter), enforces a per-attempt timeout (default 5 min, configurable viaDEEPSEEK_TUI_DOWNLOAD_TIMEOUT_MS) plus a 30 s stall detector, honorsHTTPS_PROXY/HTTP_PROXY/NO_PROXYenv vars (pure-Node CONNECT tunneling, no new dependencies), and prints a download-progress line to stderr so users know it isn't hung. Suppressible withDEEPSEEK_TUI_QUIET_INSTALL=1. Reported by a community user from China whose install through a CN npm mirror took 18 minutes — the bottleneck was the GitHub fetch, which CN npm mirrors do not proxy.- YOLO sandbox dropped to DangerFullAccess — YOLO mode was still
routing shell commands through the WorkspaceWrite sandbox, which
intercepted legitimate outside-workspace writes (package installs,
sub-agent workspaces,
~/.cache, brew,npm install -g, pipx) and forced approval round-trips — contradicting the "no guardrails" contract. YOLO already auto-approves all tools and enables trust mode; the sandbox was the last residual restriction. Now uses DangerFullAccess (no sandbox), consistent with the full YOLO posture. - Scroll position lock preserved across render resolve — user
scroll-up during live streaming was being yanked back to the live tail
on the next chunk. The
user_scrolled_during_streamlock was cleared prematurely when content briefly fit in one screen, or when the transcript shrank between renders (e.g. sub-agent card collapsed). Fixed by snapshotting the prior tail state beforeresolve_topand only clearing the lock when the user was deliberately at the bottom. - Capacity controller disabled by default — the capacity controller
was silently clearing the transcript (
messages.clear()) based on slack-basedp_failcalculations, independent of token utilization or theauto_compactsetting. This contradicted the v0.8.11 default ofauto_compact = false— the user opted into trusting the model with the full 1M-token V4 window, and the controller was auto-managing the prefix on their behalf. The controller now defaults toenabled = false; power users can opt in viacapacity.enabled = true.
Docs
- README clarity pass (#685) — title-cased section headings, an explicit
Node + npm prerequisites block before the
npm install -gsnippet, a China-friendly--registry=https://registry.npmmirror.cominstall variant, a DeepWiki badge for AI-assisted repo browsing, and a 🐳 mark on the title. Thanks to @Agent-Skill-007 for this PR.
0.8.12 - 2026-05-05
A feature release built on the v0.8.11 cache-maxing foundation: 20 community PRs merged, covering reasoning-effort automation, V4 FIM edits, bash-arity execpolicy, skill-registry sync, vim composer mode, large-tool-output routing, pluggable sandbox backends, layered permission rulesets, and cache-aware resident sub-agents. No breaking changes.
Added
- Reasoning-effort auto mode (#669) —
reasoning_effort = "auto"inspects the last user message for keywords (debug/error → Max, search/lookup → Low, default → High) and resolves the tier before each API request. Sub-agents always get Low. - FIM edit tool for V4 /beta (#668) —
fim_edittool sends fill-in-the-middle requests to DeepSeek's/betaendpoint for surgical code edits. - Bash arity dictionary (#655) —
auto_allow = ["git status"]now matchesgit status -sbut NOTgit push. The arity dictionary knows command structure for git, cargo, npm, yarn, pnpm, docker, kubectl, aws, make, and others. Legacy flat prefix matching still works for unlisted commands. - Unified slash-command namespace (#661) — user-defined commands in
~/.deepseek/commands/support$1,$2,$ARGUMENTStemplate substitution. User commands override built-in commands. - Skill registry sync (#654) —
/skills syncfetches the community skill registry and installs/updates all listed skills. Network-gated by the existing[network]policy. - Vim modal editing in composer (#659) —
vim.insert_mode/vim.normal_modesettings enable modal editing in the message composer with standard Vim keybindings. - Separate tui.toml (#657) — theme colors and keybind overrides can live in
~/.deepseek/tui.tomlalongside the mainconfig.toml. Note: file format is defined but not yet loaded at startup — wiring deferred to v0.8.13. - Large-tool-output routing (#658) — tool results exceeding a configurable token threshold are routed through a workshop with truncated previews, protecting the parent context window. Synthesis is currently truncation-only; V4-Flash sub-agent synthesis deferred to follow-up.
- Pluggable sandbox backends (#645) — a
SandboxBackendtrait and Alibaba OpenSandbox HTTP adapter letexec_shellroute commands to a remote sandbox instead of spawning locally. Config keys:sandbox_backend,sandbox_url,sandbox_api_key. - Layered permission rulesets (#653) —
ExecPolicyEnginesupports builtin, agent, and user-priority layers for allow/deny prefix rules. Deny-always-wins semantics. - Cache-aware resident sub-agents (#660) — sub-agents spawned with
resident_fileprepend the file contents to their system prefix for V4 prefix-cache locality. A global lease table prevents two agents from holding a resident lease on the same file simultaneously. Leases are released on agent completion. - Context-limit handoff (#667) — engine-level support for replacing
routine compaction with a
.deepseek/handoff.mdfile write when context pressure triggers. Note: config knob removed pending implementation. - LSP auto-attach diagnostics (#656) — edit results now include post-edit diagnostics via the engine-level LSP hooks path.
Docs
- README install section rewritten (#672) — the previous lede claimed
"no Node.js or Python runtime" but the very next paragraph told readers to
install Node before continuing. Replaced with a three-path Install block
(npm / cargo / direct download) that makes the npm wrapper's role explicit:
it downloads the prebuilt binary, but
deepseekitself does not depend on Node at runtime. zh-CN README mirrored. - Windows Scoop install instructions (#696) — README and zh-CN README now
document
scoop install deepseek-tuifor Windows users. Thanks to @woyxiang for this PR. - DeepSeek Pro discount window extended (#692) — pricing footnote updated from 5 May 2026 to 31 May 2026 to match the platform-side promotion. Thanks to @wangfeng for this PR.
deepseek resume <SESSION_ID>surfaced in Usage — the command exists since v0.7 but was undocumented. Reported via #682.- SECURITY.md (#648) — vulnerability reporting policy and supported versions.
- CODE_OF_CONDUCT.md (#686) — Contributor Covenant v2.1. Thanks to @zichen0116 for this PR.
- zh-Hans locale activation docs (#652) — README.zh-CN.md and
config.example.toml now document
locale = "zh-Hans".
Fixed
- Cross-workspace session bleed (security) — launching
deepseekfrom any directory silently auto-recovered the most recent interrupted session, even if that session originated in a completely different workspace. Tools then operated on the prior workspace's file paths while the status bar displayed the current workspace name — a confusing trust-boundary violation that could leakapi_messages,working_setentries, and any secrets the prior session had accumulated into a new terminal that was never meant to see them.try_recover_checkpoint()now compares the saved session's workspace tostd::env::current_dir()(canonicalised, with a strict-equality fallback when canonicalisation fails) and only auto-recovers on a match. On a mismatch the checkpoint is persisted as a regular session (so the user can find it viadeepseek sessions/deepseek resume <id>) and cleared, and the new launch starts fresh — no data is lost. Hotfixed tomainahead of the v0.8.12 tag. cargo installon stable Rust — the language-picker match guard atcrates/tui/src/tui/ui.rs:1603used&& let Some(...) = ...inside anif-guard, which requires the nightly-onlyif_let_guardfeature on Rust before 1.94. Reported by an external user whosecargo install deepseek-tuifailed with E0658. Rewrote as a plain match guard with a nestedif letinside the arm body. The workspace also now declaresrust-version = "1.88"(the actual minimum forlet_chainsinif/while) so users on too-old toolchains see a clear cargo error instead of a confusing rustc one. AGENTS.md gains a "stable Rust only" section so this doesn't regress.- Resident-file lease never released after spawn (#660) — the lease was
stamped as
"pending"at spawn time because the agent id is only assigned by the manager after the spawn call returns. The release-on-terminal-state path (added in the original #660 commit) matched leases by agent id, so it could never find these placeholder entries. Now the placeholder is replaced with the real agent id immediately after spawn so existing release wiring fires. Resolves the v0.8.12 caveat documented at RC time. - Color::Reset across all UI widgets (#651, #671) — replaced hardcoded
Color::BlackandColor::Rgb(18, 29, 39)backgrounds withColor::Resetso the TUI respects the terminal's actual background color on light-themed and non-standard terminals. - Windows MessageBeep (#646) —
notify_done_tonow callsMessageBeepon Windows when BEL method is selected. - truncate_id optimization (#649) — replaced manual string slicing with a
shared
truncate_idhelper across session, picker, and UI call sites.
Maintenance
- Workspace
cargo fmtsweep across community PRs that landed unformatted. - Issue-triage GitHub Actions added (#688): keyword-driven auto-labeller,
stale-bot for
needs-infoissues (14 d → stale → 7 d → close), and a spam lockdown that auto-closes promotional issues from accounts <30 d old. All pure GitHub Actions — no third-party services. - Annotated
TuiPrefs(#657) andhandoff::THRESHOLDS(#667) with#[allow(dead_code)]so the deferred APIs don't trip CI's-D warningsflag while their call sites are staged for v0.8.13. - Removed dead
prefer_handofffield fromCompactionConfig— config knob existed but zero code paths consulted it (#667). - Removed dead
use_terminal_colorsfield fromTuiConfig— no rendering code read the value (#671). - Fixed
expect()panic risk inOpenSandboxBackend::new()— now returnsResult(#645). - Fixed broken
section_bgtest assertion after Color::Reset migration (#651). - Fixed
resolve_prefixesdocstring to accurately describe deny-always-wins behavior (#653). - Wired
create_backend()intoEngine::build_tool_context— sandbox backend was defined but never activated (#645). - Wired resident lease release on agent completion/cancellation/failure (#660).
Contributors
First-time contributor to this release: @zichen0116 (#686). Welcome — and thank you.
Bulk community contributions by @merchloubna70-dot (#645–#681, 28 PRs spanning features, fixes, and VS Code extension scaffolding). Thank you for the remarkable volume and quality of work.
0.8.10 - 2026-05-04
A patch release: hotfixes, small UX polish, and four whalescale-unblocking runtime API additions. No breaking changes.
Added
- OPENCODE shell.env hook (#456) — lifecycle hooks can now inject shell environment into spawned commands without hard-coding env in prompts or wrapper scripts.
- Stacked toast overlay (#439) — status toasts can queue and render together instead of overwriting each other.
- File @-mention frecency (#441) — file mention suggestions learn
from recent selections via
~/.deepseek/file-frecency.jsonl. - Durable keybinding catalog (#559) —
docs/KEYBINDINGS.mdis now the source-of-truth audit for current shortcuts and the future configurable-keymap registry. - Runtime API quartet for whalescale-desktop integration (#561, #562, #563,
#564, #567) — addresses whalescale#255/256/260/261:
[runtime_api] cors_originsconfig /--cors-origin URLflag (repeatable) /DEEPSEEK_CORS_ORIGINSenv var, all stacking on top of the built-in dev-origin defaults (#561 / whalescale#255).PATCH /v1/threads/{id}extended fromarchived-only to the full editable field set:allow_shell,trust_mode,auto_approve,model,mode,title,system_prompt. Empty string clearstitle/system_prompt. Newtitlefield onThreadRecordis additive — no schema_version bump (#562 / whalescale#256).archived_only=truequery param onGET /v1/threadsand/v1/threads/summary, backed by a newThreadListFilterenum (#563 / whalescale#260).GET /v1/usage?since=&until=&group_by=<day|model|provider|thread>aggregates token totals + cost (viapricing.rs) across all threads/turns. Empty time ranges yield emptybuckets(never 404) (#564 / whalescale#261).
- Language picker in first-run onboarding (#566) — new step between
Welcome and ApiKey lists every shipped locale (
auto/en/ja/zh-Hans/pt-BR) with the native name (日本語, 简体中文, …) plus an English label so the target language is reachable without already speaking it. Hotkeys 1-5 select; persists immediately to~/.deepseek/settings.toml. - Windows + China install documentation (#578) — expanded
docs/INSTALL.mdwith Windows source-build setup, Visual Studio Build Tools / MSVC environment notes, rustup and Cargo mirror guidance, and antivirus troubleshooting. Thanks to @loongmiaow-pixel for this PR.
Changed
- Agent prompt now explicitly describes DeepSeek cache-aware behavior — long-session guidance explains why stable prompt prefixes, sub-agents, RLM, and late compaction matter for V4 cache economics.
- Whale sub-agent nicknames now interleave Simplified Chinese with
English (
Blue/蓝鲸/Humpback/座头鲸/ …). Pure cosmetic; doubles the labeling pool size and gives a roughly even mix on each new spawn. - User memory docs + help polish (#497, #569) —
/memoryis now listed in slash-command help, supports/memory help, and the README / configuration docs now point at the fulldocs/MEMORY.mdguide and document both[memory].enabledandDEEPSEEK_MEMORY. Thanks to @20bytes for this PR.
Fixed
- Compaction summaries are cache-aligned for DeepSeek V4 (#575, #580)
— when the summarized message prefix fits the large V4 context budget,
the summary request now reuses the original messages and appends the
summary instruction as a normal user message instead of rebuilding a
fresh
SUMMARY_PROMPT + dropped messagesinput. This lets the summary call benefit from DeepSeek prefix caching. Thanks to @lloydzhou and @jeoor for the cost reports and concrete strategy. - Windows Terminal API-key paste during onboarding (#577) — the setup wizard now handles Ctrl/Cmd+V before generic character input and filters control/meta-modified keys out of the API-key text path. Thanks to @toi500 for the report and workaround details.
- Terminal startup repaint (#581) — the TUI clears the terminal immediately after initialization so normal-screen startup no longer leaves stale default-background rows above the first frame. Thanks to @xsstomy for the screenshot.
- Markdown rendering for tables, bold/italic, and horizontal rules (#579) — transcript markdown now handles table rows, strips separator rows, renders horizontal rules, applies inline bold/italic styles, and avoids an infinite-loop edge case on unclosed markers. Thanks to @WyxBUPT-22 for the PR, screenshots, and tests.
- Slash-prefix Enter activation (#573) — typing a short prefix such
as
/moand pressing Enter now activates the first slash-command match. Thanks to @melody0709 for the report. - macOS seatbelt blocked
~/.cargo/registry(#558) —cargo publish/cargo buildfrom inside the TUI's shell tool was getting sandbox-denied. The seatbelt now allows read on(param "CARGO_HOME")and write on theregistry/andgit/subpaths whenever the policy isn't read-only. HonorsCARGO_HOMEenv with a$HOME/.cargofallback. - Stdio MCP servers now receive SIGTERM on shutdown (#420) — instead
of SIGKILL via
kill_on_drop. Newasync fn shutdownonMcpTransportoverrides onStdioTransportto send SIGTERM and wait up to 2s for graceful exit before drop fires SIGKILL as the backstop. Wired into the engine'sOp::Shutdownpath so graceful exit is the default. A Drop fallback still SIGTERMs on abnormal exit paths. - Shell-spawned children get
PR_SET_PDEATHSIG(SIGTERM)on Linux (#421) — the kernel sends SIGTERM the moment the parent (TUI) exits, even on SIGKILL of the parent. Closes the leak window the cooperative cancellation path can't cover. macOS / Windows watchdog tracked as a follow-up; the existingkill_on_drop+ process_group SIGKILL on cancellation still cover normal shutdown there. - npm install on older glibc now fails fast (#555, #560, #556, #565)
— the prebuilt Linux x64 / arm64 binaries are now built via
cargo zigbuildtargetingx86_64-unknown-linux-gnu.2.28/aarch64-unknown-linux-gnu.2.28, lowering the requirement from glibc ≥ 2.39 to ≥ 2.28. The npm postinstall also runs a Linux-only glibc preflight that fails fast with a clear "build from source" message when the host is incompatible (or musl). Thanks to @staryxchen (#556) and @Vishnu1837 (#565) for these PRs. - Shell tool
cwdparameter now validated against the workspace boundary (#524) — the model could previously passcwdpaths outside the workspace; nowexec_shellrunsToolContext::resolve_pathoncwdlike every other path-taking file tool, returningPathEscapeon violations.trust_mode = truestill bypasses, consistent with the file-tool pattern. Thanks to @shentoumengxin for this PR.
Contributors
First-time contributors to this release: @staryxchen (#556), @shentoumengxin (#524), @Vishnu1837 (#565), @20bytes (#569), @loongmiaow-pixel (#578), and @WyxBUPT-22 (#579). Welcome — and thank you.
0.8.8 - 2026-05-03
Added
- User memory MVP (#489–#493) — opt-in persistent note file
injected into the system prompt as a
<user_memory>block.# footyped in the composer appends a timestamped bullet without firing a turn (#492)./memory [show|path|clear|edit]slash command for inline inspection / editing hints (#491).remembermodel-callable tool so the agent can capture durable preferences itself; auto-approved because writes are scoped to the user's own file (#489).- Hierarchy loader pulls
~/.deepseek/memory.md(path configurable viamemory_path/DEEPSEEK_MEMORY_PATH) and injects above the volatile-content boundary in the prompt (#490). - Default off; enable with
[memory] enabled = trueorDEEPSEEK_MEMORY=on(#493). - Full feature documentation in
docs/MEMORY.md.
- Inline diff rendering for
edit_file/write_file(#505) — tool results now emit a unified diff at the head of the body, picked up by the existing diff-aware renderer with line numbers and coloured+/-gutters. Newsimilarcrate dep. - OSC 8 hyperlinks (#498) — URLs in the transcript become
Cmd+click-openable in supporting terminals (iTerm2, Terminal.app
13+, Ghostty, Kitty, WezTerm, Alacritty). Clipboard path strips
the escapes so yanked text stays clean. Off-switch:
[tui] osc8_links = false. - Retry/backoff visual countdown (#499) —
⟳ retry N in Ms — reasonbanner ticks down during HTTP backoff. On exhaustion the row turns red× failed: <reason>until the next turn starts. - MCP server health chip (#502) — colour-coded
MCP M/Nin the footer's right-cluster: success / warning / error / muted by reachability. Hidden when zero MCP servers are configured. - Per-project config overlay (#485) —
<workspace>/.deepseek/config.tomloverlays a curated set of fields on top of the user-global config:model,reasoning_effort,approval_policy,sandbox_mode,notes_path,max_subagents,allow_shell, plus theinstructions = [...]array (#454). Pass--no-project-configto bypass for one launch. - Project-scope deny-list for credentials/redirects (#417) —
api_key,base_url,provider, andmcp_config_pathare refused at project scope. A malicious<workspace>/.deepseek/config.tomlwould otherwise be able to exfiltrate prompts to an attacker-controlled endpoint by swapping the user's credentials and target host with project-controlled values, or redirect the MCP loader at a config that spawns arbitrary stdio servers under the user's identity. The denied key emits a stderr warning so a user who expected the override sees the deny instead of a silent drop. - Project-scope value-deny for the loosest postures (#417
follow-up) —
approval_policy = "auto"andsandbox_mode = "danger-full-access"are pure escalation values, denied unconditionally at project scope regardless of the user's prior value. Sub-tightening comparisons (e.g. user"never"→ project"on-request"is allowed even though it loosens) stay v0.8.9 follow-up because they need a richer ordering check. SSL_CERT_FILEhonored in the HTTPS client (#418) — corporate proxy / TLS-inspecting MITM users can now point at their custom CA bundle and have it added alongside the platform's system trust store. Tries PEM-bundle parsing first (covers single-cert files too), falls back to DER. Failures log a warning and continue — the existing system roots still apply, so a malformed env var won't bring down the launch. Documented indocs/CONFIGURATION.md.- Execpolicy heredoc handling (#419) —
normalize_commandnow strips heredoc bodies before shlex tokenization so a user'sauto_allow = ["cat > file.txt"]pattern matches the heredoc formcat <<EOF > file.txt\nbody\nEOFcleanly. Recognises the common forms (<<DELIM,<<-DELIM,<<'DELIM',<<"DELIM") while leaving the here-string operator (<<<) untouched. Without this fix, heredoc-form file writes would skip the user's auto-approve list and route through the approval modal even for explicitly-blessed commands. - Sub-agent role taxonomy expansion (#404) — adds
Implementer("land this change with the minimum surrounding edit") andVerifier("run the test suite, report pass/fail with evidence") to the existinggeneral/explore/plan/review/customset. Each role has a distinct system prompt posture. Documented indocs/SUBAGENTS.md. docs/SUBAGENTS.md— full sub-agent reference: role taxonomy, alias map, concurrency cap, lifecycle, session-boundary classification, output contract.docs/MEMORY.md— user-facing memory feature documentation.- Competitive analysis doc —
docs/COMPETITIVE_ANALYSIS.mdcatalogues capability matrix vs OpenCode and Codex CLI. - Session prune helper +
/sessions prune <days>(#406 phase-1) — drops persisted sessions older than N days from~/.deepseek/sessions/. Skips the checkpoint subdirectory and compares against metadataupdated_at(not fs mtime, which can lie after an rsync). 10 total tests cover the helper's contract and the slash-command dispatch surface. Phase 2 (boot-prune + retention policy) stays v0.8.9 work. deepseek doctor --jsonnow surfaces amemoryblock (enabled/path/file_present) so operators can verify memory configuration without booting the TUI.- Tool-output spillover (#422 + #423 + #500) — tool outputs over
100 KiB now spill to
~/.deepseek/tool_outputs/<id>.txtfrom the engine's tool-execution path. The model receives a 32 KiB head plus a footer pointing at the spillover file (Use read_file path=…), the tool cell renders an inlinefull output: <path>annotation in live mode, and a 7-day boot prune keeps the directory bounded. Spillover is skipped on error results so the model still sees the failure message verbatim. The existing tool-details pager surfaces the truncated head so the user can verify what the model saw.
Changed
- Sub-agent concurrency cap raised to 10 by default (#509) —
was 5; configurable via
[subagents].max_concurrent(hard ceiling 20). Running-count now ignores non-running, no-handle, and finished handles so completed agents stop occupying slots. SharedSubAgentManagerisArc<RwLock<...>>(#510) — read paths take read locks, eliminating the multi-agent fan-out UI freeze.- Sub-agent output summarized before parent context (#511) —
compact_tool_result_for_contextnow compressesagent_result/agent_waitpayloads instead of dumping the full snapshot back into the parent's context window. agent_listdefaults to current-session view (#405) — each manager mints asession_boot_idand stamps every spawn; agents loaded from prior sessions are filtered unlessinclude_archived=trueis passed. Each result carries afrom_prior_sessionflag.- Concise todo / checklist update rendering (#403) — repeat
todo_update/checklist_updatecalls render a one-lineTodo #N: <title> → STATUScard with full list still reachable via Alt+V instead of dumping the entire item array on every call. - Compact
agent_spawnrendering (#409) — the generic tool block foragent_spawncollapses to one header line in live mode (◐ delegate · agent-abc12 [running]) since theDelegateCardalready owns live action progress. Transcript replay keeps the full block. - Plan panel role clarified (#408) — drops the "No active
plan" placeholder when the panel is otherwise empty; documents
the panel's narrow role (
update_plantool output +/goal+ cycle counter, distinct from todos). - Sub-agent description copy —
agent_spawntool description andprompts/base.mdupdated to reflect the new default cap of 10 (was stale "Max 5 in flight"). agent_spawn/agent_assignschema descriptions (#404 follow-up) — type/agent_name property descriptions now listimplementerandverifierso the model surfaces those roles without having to discover them fromdocs/SUBAGENTS.md. Adds the long-form aliases (builder/validator/tester) onagent_assignfor parity with the alias map.- Multi-day duration formatting (#447) —
humanize_durationnow caps at two units and promotes through h/d/w boundaries. Long-running sessions render as2d 3hinstead of188415s, and the previous "192m 30s" cycle output becomes3h 12m. The/goalstatus line picks up the same formatter so multi-day goal-elapsed times stay readable. - Accessibility flag (#450) —
NO_ANIMATIONS=1env var now forceslow_motion = trueandfancy_animations = falseat startup, regardless of the savedsettings.toml. Recognises the standard truthy spellings (1,true,yes,on). Documented end-to-end in the newdocs/ACCESSIBILITY.md, including the existinglow_motion/calm_mode/show_thinking/show_tool_detailstoggles for screen-reader users. - Cumulative session-elapsed footer chip (#448) — a
low-priority
worked 3h 12mchip in the footer's right cluster shows session age once it crosses 60s. Hidden during the first minute of a launch so a fresh start doesn't flash a ticker. Drops first under narrow widths so the existing chips (coherence / agents / replay / cache / mcp) keep their slots. Sampled at props-build time (matches theretrycapture pattern) so render stays pure for tests. instructions = [...]config array (#454) — declare additional instruction files (./AGENTS.md,~/.deepseek/global.md, …) and they're concatenated into the system prompt in declared order, above the skills block. Each file is capped at 100 KiB; missing files log a warning and are skipped instead of failing the launch. Project config replaces the user-level array wholesale (the typical "merge" pattern is for users who want both — they list~/global.mdinside the project array). Documented inconfig.example.toml.- Keyboard-enhancement flags pop on suspend paths too (#443
follow-up) —
pause_terminal(Ctrl+Z / shell-suspend) andexternal_editor::spawn_editor_for_input(composer$EDITORlaunch) now pop the flags before handing the terminal to the child process, matching the existing shutdown and panic-hook paths. Defense-in-depth: if a future code path enables the flags explicitly, the suspend handlers won't leak them to a Vim / less / shell child that hasn't asked for them. load_skilltool (#434) — model-callable tool that takes a skill id and returns the SKILL.md body plus the sibling companion-file list in one call. Faster than the existingread_file+list_dirdance; surfaces the skill's description as a quote block at the head so a single tool result is self-contained. Resolves the skills directory with the same hierarchyApp::newuses (.agents/skills→skills→~/.deepseek/skills). Available in Plan and Agent/Yolo modes.- Kitty keyboard protocol opt-in (#442) — pushes
DISAMBIGUATE_ESCAPE_CODESat startup so terminals that support the protocol (Kitty, Ghostty, Alacritty 0.13+, WezTerm, recent Konsole / xterm) report unambiguous events for Option/Alt-modified keys, plain Esc, and multi-byte sequences. Legacy terminals silently discard the escape and see no change. Only the disambiguation tier is pushed — release-event reporting was deliberately skipped because the existing handlers would mis-route releases as duplicate presses. The flags are popped on shutdown / panic / suspend paths (#443). - Multi-directory skill discovery (#432) — the system
prompt's
## Skillslisting and theload_skilltool now walk every candidate directory in the workspace plus the global default:<workspace>/.agents/skills→<workspace>/skills→<workspace>/.opencode/skills→<workspace>/.claude/skills→~/.deepseek/skills. Skills installed for any AI-tool convention show up in the same catalogue. Name conflicts resolve first-match-wins per the precedence order so workspace-local skills shadow user/global ones. Newskills_directories()anddiscover_in_workspace()helpers incrates/tui/src/skills/mod.rs. tool.spilloveraudit event (#500 polish) — emit a discrete audit-log entry wheneverapply_spilloverwrites a spillover file, so operators tailing~/.deepseek/audit.logcan correlate large-output episodes with disk-usage growth in~/.deepseek/tool_outputs/. Fires in both the sequential and parallel tool paths.- Prompt stash (#440) — Ctrl+S in the composer parks the
current draft to a JSONL-backed stash at
~/.deepseek/composer_stash.jsonl(no-op on empty composer)./stash listshows parked drafts (oldest first, with one-line previews and timestamps);/stash poprestores the most recently parked draft into the composer (LIFO). Self-healing parser drops malformed lines instead of poisoning the stash. Capped at 200 entries; multiline drafts round-trip intact via JSON's newline escaping. deepseek pr <N>subcommand (#451) — fetches PR title/body/diff viaghand launches the interactive TUI with a review prompt pre-populated in the composer. The diff is capped at 200 KiB (codepoint-safe truncation) so a massive PR doesn't blow the context window before the user hits Enter. Optional--repo <owner/name>and--checkoutflags; falls back gracefully with an actionable error message ifghisn't on PATH. Adds a newTuiOptions::initial_inputplumb that any future caller can reuse to drop the model into a session with text already typed./stash clearsubcommand (#440 polish) — wipes the entire stash file and reports how many parked drafts were dropped. Pairs with/stash listand/stash popso the user can fully manage the stash from inside the TUI without reaching forrm./hooksread-only listing (#460 MVP) — slash command enumerates configured lifecycle hooks grouped by event, showing each hook's name, command preview, timeout, and condition. Notes the global[hooks].enabledflag's state. No morecat ~/.deepseek/config.tomlto debug "did my hook actually load". The picker / persisted enable-disable surface from #460 stays as v0.8.9 follow-up. Available via/hooksor/hooks list; aliased to/hook. Localized in en/ja/zh-Hans/pt-BR.deepseek doctorreports cross-tool skill dirs (#432 follow-up) — both the human-readable and JSON outputs now surface.opencode/skills/and.claude/skills/presence / count, so operators can confirm at a glance whether any cross-tool skill folder is contributing to the merged catalogue. Empty dirs are omitted from the human-readable output to keep the report scannable; JSON always emits all five slots (global,agents,local,opencode,claude) for stable machine consumption.deepseek doctorreports storage surfaces (#422 / #440 / #500 follow-up) — newStorage:section surfaces the tool-output spillover dir (~/.deepseek/tool_outputs/) with file count and the composer stash file (~/.deepseek/composer_stash.jsonl) with parked-draft count. Mirrored understorage.{spillover,stash}in the JSON output sodeepseek doctor --jsonkeeps a stable schema./hooks eventssubcommand (#460 polish) — lists every supportedHookEventvalue with a short blurb so users can discover which events to target in[[hooks.hooks]]entries without reading source. Ordered lifecycle → per-tool → situational, stable across releases.- Structured-Markdown compaction template (#429) —
prompts/compact.mdswitches from the legacy Active-task/Files-touched/Key-decisions/Open-blockers framing to the spec'd structure: Goal / Constraints / Progress (Done / In Progress / Blocked) / Key Decisions / Next step. The richer Progress sub-bullets help long resumed sessions distinguish "what's verified done" from "what's mid-flight" — useful when the model writes.deepseek/handoff.mdbefore a long break. Backwards- compat: existing handoff.md files continue to render fine because the loader injects them as plain markdown (the template only guides what NEW handoffs look like). The pinned-tool-output configurability part of #429's spec stays a v0.8.9 follow-up — that requires changes tocycle_manager.rscompaction logic itself. tool_call_before/tool_call_after/message_submit/on_errorhooks all fire now (#455 observer-only slice) — these events were defined in theHookEventenum but never fired from production code. Wired through:tool_call_beforeandtool_call_afterfire fromtool_routing.rs;message_submitfires fromdispatch_user_messagebefore engine dispatch;on_errorfires fromapply_engine_error_to_appbefore the error cell reaches the transcript. Hook contexts populate the relevant fields (tool_name+tool_args/tool_result,message,error). Hooks remain read-only in this slice; argument / result / message mutation is a v0.8.9 follow-up because it needs a synchronous-gate contract that doesn't exist today. Combined with the existingsession_start/session_end/mode_changeevents, every variant in theHookEventenum now has a live producer. Each fire is fast-path-gated byHookExecutor::has_hooks_for_event(event)so per-tool dispatch never pays forHookContextallocation when the user has no hooks configured (the common case).- RLM tool family (#512) —
rlmtool cards map toToolFamily::Rlmand renderrlm, notswarm. Stale "swarm" wording cleaned out of docs / comments / tests. - Foreground RLM visible in Agents sidebar (#513 — stopgap) — projection now shows foreground RLM work; full async lifecycle remains v0.8.9.
Fixed
-
Don't auto-approve git -C ...(#416, shipped 2026-05-03) — v0.8.8 release runtime fix; foundation for the rest of the stabilization batch. -
Self-update arch mapping (#503) —
update.rsuses release asset naming (arm64/x64) instead of raw Rust constants (aarch64/x86_64); rejects.sha256siblings as primary binaries. -
Composer Option+Backspace deletes by word (#488) — was deleting by character.
-
Offline composer queue is session-scoped (#487) — legacy unscoped queues fail closed instead of leaking content into unrelated chats.
-
display_pathtest race + Windows separator (#506) — tests no longer mutate$HOME;display_path_with_homewalks components and joins withMAIN_SEPARATOR_STRso Windows shows~\projects\foonot~\projects/foo. -
Footer reads statusline colours from
app.ui_theme(#449) — was using a bespoke palette. -
Keyboard-enhancement flags pop on panic exit too (#443/#444) — raw-mode startup probe is now bounded by a configurable timeout.
-
CI workflow cleanup (#507) — pruned three duplicated/dead workflows (
crates-publish.yml,parity.yml,publish-npm.yml);release.ymlbuildjob now allowsparityto be skipped on manualworkflow_dispatch; release-runbook reconciled. -
Slash-menu layout jitter on Windows — typing through a
/fooautocomplete used to shrink the matched-entry count, which shrank the composer height every keystroke, which forced the chat area above to repaint. On Windows 10 PowerShell + WSL the per-cell write cost made the jitter visible. Composer now reserves its panel-max envelope for the whole slash/mention session so the chat-area Rect stays stable; the menu still renders only the entries that actually match. -
Linux ARM64 prebuilt binaries — the release workflow now publishes
deepseek-linux-arm64anddeepseek-tui-linux-arm64(built natively on GitHub'subuntu-24.04-armrunner). The npm wrapper picks them up automatically onarm64Linux hosts, so HarmonyOS thin-and-light, openEuler/Kylin, Asahi Linux, Raspberry Pi, AWS Graviton, etc. now work with a plainnpm i -g deepseek-tui. -
Interactive TUI hangs on
working.at 100% CPU (#549) — the event loop's blocking terminal poll starved the tokio runtime, preventing the engine task from dispatching the API request. Fixed by yielding to the scheduler before each poll cycle and clamping the event-poll timeout to a minimum of 1ms so a zero-timeout hot-loop can't monopolize the thread. -
Backspace key inserts "h" instead of deleting (#550) — terminals that send
^H(Ctrl+H) for Backspace were not recognized. Addedis_ctrl_h_backspace()guard in both the composer and API-key input handlers so Ctrl+H is treated as a delete, matching the existingKeyCode::Backspacebehavior.
Changed
- npm
postinstallfailure messages — when no prebuilt is available for the host'sos.platform() / os.arch()combo, the wrapper now prints the fullcargo installfallback recipe and a link todocs/INSTALL.mdinstead of just the bare error. DEEPSEEK_TUI_OPTIONAL_INSTALL=1— new env knob that downgrades a postinstall failure to a warning +exit 0, so CI matrices that include unsupported platforms don't fail the wholenpm install.
Docs
- New
docs/INSTALL.md— every supported platform, prebuilt vs.cargo installvs. manual download, cross-compiling x64 → ARM64 Linux withcrossorgcc-aarch64-linux-gnu, and a troubleshooting section covering the commonUnsupported architecture,MISSING_COMPANION_BINARY, and self-update mismatch errors. - README and
README.zh-CN.mdnow have an explicit Linux ARM64 quickstart pointing ARM64 users atcargo install deepseek-tui-cli deepseek-tui --lockedfor v0.8.7 and atnpm i -g deepseek-tuifor v0.8.8+.
Releases
- npm wrapper publish remains manual (npm 2FA OTP requirement).
- GitHub release automation depends on
RELEASE_TAG_PATsecret — without itauto-tag.ymlcreates the tag butrelease.ymldoesn't fire.
0.8.7 - 2026-05-03
Fixed
- Selection across transcript cell types — the selection-tightening from v0.8.6 (#383) restricted copy/select to user and assistant message bodies only, so text in system notes, thinking blocks, and tool output could not be copied. v0.8.7 removes the body-start gate; the rendered transcript block is fully selectable again.
0.8.6 - 2026-05-03
Added
- Long-session survivability by default (#402) — capacity control and compaction defaults are enabled, transcript history is bounded, persisted sessions are capped, and oversized history folds into archived context placeholders instead of freezing the TUI.
- v0.8.6 feature batch (#373-#402) — adds Goal mode, cache-hit chips,
cycle-boundary visualization, file-tree pane,
/share,/model auto, user-defined slash commands,/profile, LSP diagnostic wiring, crash-recovery, self-update,/init,/diff, patch-aware/undo,/edit, inline diff highlighting, smart clipboard, native-copy escape, right-click context menus, clickable file:line styling, and MCP Phase A.
Fixed
- Lag and rendering regressions (#399, #400) — moves git/file-tree work off the UI thread where possible, bounds render history, and tightens redraw behavior to avoid sidebar/chat text bleed-through.
- Release-hardening follow-ups —
/sharenow writes via secure temp files, self-update uses secure same-directory temps with Windows-safe replacement, and docs/rustfmt release gates are clean.
0.8.4 - 2026-05-02
Added
- Localization expansion (Phase 1, #285) — every slash command's help
description, the full
/tokens//cost//cachedebug output, the footer state and chip text, and the help-overlay section headings are now translated for all four shipped locales (en,ja,zh-Hans,pt-BR). Set the language with/config locale zh-Hans(orLANG=zh_CN.UTF-8/LC_ALL=zh_CN.UTF-8from the shell). Non-Latin scripts render via the sameunicode_widthplumbing the existing 27 chrome strings already use; theshipped_first_pack_has_no_missing_core_messagestest enforces full coverage across all four locales for every newMessageId. Tool descriptions sent to the model and the base system prompt intentionally remain English (training-data alignment, prefix cache stability).- Phase 1a (#294): 44 new IDs covering slash commands.
- Phase 1b (#295): 13 new IDs covering
/tokens//cost//cachedebug output. Templates use{placeholder}substitution so a translator can re-order args freely. - Phase 1c (#296): 11 new IDs covering footer state, sub-agent chip, quit-confirmation toast, and help-overlay section labels.
- Stable cache prefix (#263) — five companion fixes to keep the
DeepSeek prefix cache stable across turns: drop volatile fields from
the working-set summary block (#280, #287), place handoff and
working-set after the static prompt blocks (#288 → #292), memoise the
tool catalog so descriptions stay byte-stable (#289), sort
project_treeandsummarize_projectoutput (#290), and use a unique fallback id for parallel streaming tool calls so downstream tool-result routing doesn't match the first call twice (#291). The combined effect is a meaningful jump in cache hit rate after the third turn.
Fixed
- Agent-mode shell exec could not reach the network (#272) — the seatbelt
default policy denies all outbound network including DNS, so any
exec_shellcommand needing the network (curl,yt-dlp, package managers, …) failed in Agent mode unless the user dropped to Yolo. The engine now elevates the sandbox policy toWorkspaceWrite { network_access: true, … }for both Agent and Yolo. Plan mode is unchanged (read-only investigation never registers the shell tool). The application-levelNetworkPolicy(crates/tui/src/network_policy.rs) remains the only outbound-traffic boundary. /skill install <github-repo-url>failed withinvalid gzip header(#269) —https://github.com/<owner>/<repo>parsed as a raw direct URL, so the installer downloaded the HTML repo page and tried to gzip-decode HTML. Bare GitHub repo URLs (with or without.git, with or withoutwww., with or without a trailing slash) now route to theGitHubReposource the same asgithub:<owner>/<repo>. URLs that already point at a specific archive / blob / tree path still go throughDirectUrl.- V4 Pro discount expiry extended (#267) — DeepSeek extended the V4 Pro 75% promotional discount from 2026-05-05 15:59 UTC to 2026-05-31 15:59 UTC. Without this update the TUI would have started showing 4× the actual billed cost on May 6 onwards. Verified at https://api-docs.deepseek.com/quick_start/pricing.
0.8.3 - 2026-05-01
Fixed
- Skills prompt referenced fabricated paths —
render_available_skills_contextrendered each skill's file as<skills_dir>/<frontmatter-name>/SKILL.md, which did not exist when the directory name differed from the frontmattername(community installs, manually-placed skills).Skillnow carries the real path captured at discovery and renders that. - Missing-companion error was hostile to direct GitHub Release downloaders
(#258) — replaced "Build workspace default members to install it" wall of
text with a concrete three-path checklist:
npm install -g deepseek-tui,cargo install deepseek-tui-cli deepseek-tui --locked, or downloading bothdeepseek-<platform>ANDdeepseek-tui-<platform>from the same Release page.DEEPSEEK_TUI_BINstays as a power-user fallback.
Added
- Privacy:
$HOMEcontracts to~in viewer-visible paths — the TUI,deepseek doctor,deepseek setup, and onboarding now contract the home directory to~in every path shown on screen, so screenshots, screencasts, and pasted help output do not leak the OS account name. Persisted state, audit log, session checkpoints, and LLM-bound system prompts intentionally keep absolute paths for full fidelity. crates.iobadge alongside the CI and npm badges in both English and Simplified Chinese READMEs.- Engine decomposition (#227) —
core/engine.rsis split into focused submodules (engine/{streaming,turn_loop,dispatch,tool_setup,tool_execution,tool_catalog,context,approval,capacity_flow,lsp_hooks,tests}.rs). No behavior change; preparation for the future agent-loop work.
Tests
- RLM bridge:
batch_guardextracted and tested for the empty-batch and oversize-batch invariants; depth-guard fallback covered (partial #231). - Persistence: schema-version rejection covered for
load_session,load_offline_queue_state,runtime_threads::load_turn,runtime_threads::load_item(partial #233). - Command palette:
[disabled]server description tag (closes the remaining #197 acceptance gap). - Protocol-recovery contract tests now scan the engine submodules in
addition to
engine.rsso the decomposition refactor doesn't silently hide the fake-wrapper marker assertions.
Issue triage
- 10 issues closed with verification commits cited (#247, #235, #197, #250, #234, #243, #238, #236, #239, #195).
0.8.2 - 2026-05-01
Fixed
- Windows release build (LNK1104) — drop the
deepseekshim binary incrates/tuithat 0.8.1 introduced for the bundledcargo install. It produced a secondtarget/release/deepseek.exethat collided with thedeepseek-tui-cliartifact during workspace builds; the second linker invocation hitLNK1104: cannot open file deepseek.exeon Windows. The cli crate is now the single source ofdeepseek; workspace default members still produce both binaries (one per crate). - npm wrapper offline robustness —
bin/deepseek(-tui).jsno longer re-fetches the GitHub-hosted SHA-256 checksum manifest on every invocation. When the binary is already installed and its.versionmarker matches the package version, the wrapper trusts the local file. The manifest is fetched lazily on actual download (first install orDEEPSEEK_TUI_FORCE_DOWNLOAD=1), so GitHub flakes, captive portals, corporate proxies, and offline state no longer break every command.
Added
- Model-visible skills block — installed skills (name, description, file
path) are now exposed in the agent's system prompt under a
## Skillssection, with progressive disclosure: bodies stay on disk, the model opens a specificSKILL.mdonly when it decides to use that skill. Capped at a 12k prompt budget with 512-char per-description truncation. Threaded throughEngineConfig.skills_dirso the TUI app, exec agent, and runtime thread manager all populate it fromConfig::skills_dir(). - Simplified Chinese README (
README.zh-CN.md) with cross-link from the English README.
Changed
cargo installUX — to install the canonicaldeepseekcommand,cargo install deepseek-tui-cli(the historical path). The 0.8.1 one-command flow (cargo install deepseek-tuiproviding both binaries) is reverted because it broke Windows release builds; install both packages separately if you want the TUI binary too.
0.8.1 - 2026-05-01
Fixed
- One-command Cargo install —
cargo install deepseek-tui --lockednow provides both the canonicaldeepseekdispatcher and thedeepseek-tuicompanion binary from the maindeepseek-tuipackage, so dispatcher subcommands such asdeepseek doctor --jsonwork without installingdeepseek-tui-cliseparately.
0.8.0 - 2026-05-01
Fixed
- Shell FD leak / post-send lag — completed background shell jobs now release
their process, stdin, stdout, and stderr handles as soon as completion is
observed, while keeping the job record inspectable. This prevents long-running
TUI sessions from hitting
Too many open files (os error 24), which could make checkpoint saves fail and cause shell spawning, message send, close, and Esc/cancel paths to lag or fail. - Windows REPL runtime CI startup — Windows gets a longer Python bootstrap readiness timeout for the REPL runtime tests, matching GitHub runner startup contention without weakening bootstrap failures on other platforms.
Added
- China / mirror-friendly Cargo install docs — README now documents installing through the TUNA Cargo mirror and direct release assets for users with slow GitHub/npm access.
Tests
- Added a regression test proving completed background shell jobs drop their
live process handles after
exec_shell_wait. - Re-ran the focused shell cancellation and Python REPL runtime slices.
0.7.9 - 2026-05-02
Fixed
- Post-turn freeze — the checkpoint-restart cycle boundary (
maybe_advance_cycle) now runs beforeTurnCompleteemission instead of after, so the terminal is immediately responsive when the UI receives the completion event. The status chip ("↻ context refreshing…") remains visible during the cycle wait. (#234) - Enter during streaming no longer corrupts the turn — a new
QueueFollowUpsubmit disposition parks the draft onqueued_messageswhen the model is actively streaming text. Previously, pressing Enter during streaming would forward the message as a mid-turn steer, which could interfere with the in-flight response. The message now dispatches as a normal user message afterTurnComplete. (#234) - Idempotent Esc during fanout —
finalize_active_cell_as_interruptedandfinalize_streaming_assistant_as_interruptedare now guarded byOption::take(). When Esc cancels a turn and the engine later deliversTurnComplete(Interrupted), the second call is a no-op — no double[interrupted]prefix, no corrupted cell state. Regression test locks in the contract. (#243)
Tests
- 2 new tests:
submit_disposition_queue_follow_up_when_streaming(Enter/steering fix),turn_complete_after_esc_is_idempotent(Esc fanout double-call hardening) - 1 expanded test:
submit_disposition_queue_when_offline_and_busynow covers streaming state
0.7.8 - 2026-05-01
Added
exec_shell_canceltool — cancel a running background shell task by id, or cancel all running tasks withall: true. Requires approval. (#248)- Foreground-to-background shell detach — press
Ctrl+Bwhile a foreground command is running to open shell controls and either detach the command to the background (where it can be polled viaexec_shell_wait) or cancel the current turn. (#248) exec_shell_waitturn-cancellation awareness — canceling a turn whileexec_shell_waitis blocking now stops the wait but leaves the background task running, withwait_canceled: truein metadata. (#248)ShellControlViewmodal (Ctrl+B) — two-option dialog (Background / Cancel) rendered as a popup over the transcript. (#248)
Changed
exec_shellforeground path now spawns all foreground commands through the background job table, enabling the detach-to-background flow. Metadata now includesbackgrounded: true/false. (#248)exec_shell_interactpoll loop now observes the turn cancel token so stalled interactive sessions don't block turn cancellation. (#248)- Transcript running-tool hint — executing shell cells now show "Ctrl+B opens shell controls" while running. (#248)
- Keybinding registry now includes
Ctrl+B(opens shell controls) next toCtrl+C(cancel/exits). (#248) - Deferred swarm card creation —
agent_swarmno longer pre-seeds an all-pending FanoutCard fromToolCallStarted; the card is created only when the firstSwarmProgressevent carries real worker state. Until then the sidebar uses the declared task count as a pending dispatch placeholder. (#236, #238) - Swarm wording normalized — fanout-family fallback labels now render as
swarm, matching the canonicalagent_swarm/rlmmodel and avoiding mixedfanout/swarmterminology in the transcript. (#236, #238) - OPERATIONS_RUNBOOK and TOOL_SURFACE updated with new shell control paths and
exec_shell_canceldocumentation.
Fixed
- Nonblocking swarm state drift — the sidebar no longer falls back to
0or a contradictory seeded placeholder before the first progress event arrives, which removes the visiblependingvsrunning/donemismatch during earlyagent_swarmdispatch. (#236, #238) - Unicode-safe search globbing — search wildcard matching now iterates on UTF-8 char boundaries instead of raw byte offsets, preventing panics on filenames like
dialogue_line__冰糖.mp3. (#249)
Tests
- 7 new integration tests: foreground-to-background detach, wait-cancel-leaves-process, single-task cancel, bulk cancel (kill-all), foreground-cancel-kills, ShellControlView default/select states
- Expanded swarm/sidebar regression coverage for deferred card creation and pending-count fallback before first
SwarmProgress. (#236, #238) - Added a Unicode filename regression test for wildcard search matching. (#249)
0.7.7 - 2026-04-30
Added
- Checklist card rendering —
checklist_write/todo_*results now render as a purpose-built card with completed/total + percent header, per-item status markers (✅ /●/○), and a collapsing affordance for long lists. Plumbed throughGenericToolCellso no new variant threading is needed. (#241) - Context menu for transcript operations — right-click or
Ctrl+Mopens a context-sensitive menu with Copy, Copy All, and selection-aware actions. (crates/tui/src/tui/context_menu.rs) - Windows .exe sibling lookup —
locate_sibling_tui_binaryin the CLI dispatcher findsdeepseek-tui.exeon Windows, honoursDEEPSEEK_TUI_BINoverride, and falls back to suffix-less lookup. Tests lock in platform-correct name resolution and env override. (#247)
Changed
- Swarm/sub-agent canonical data model —
SwarmTaskOutcomeandSwarmOutcomeare now the single source of truth. Every UI surface (sidebar, transcript FanoutCard, footer) reads fromswarm_jobsrather than maintaining parallel projections. (#236, #238) swarm_card_indexbinds each swarm to its own FanoutCard byswarm_id, so overlapping fanouts no longer have one swarm's late progress clobber another's card. (#236, #238)- Fanout-class tools suppressed from footer —
agent_swarm,spawn_agents_on_csv,rlm, andagent_spawnno longer appear as active tools in the status strip; sidebar and FanoutCard show the actual worker counts. (#236, #238) - Esc clears active tool entries optimistically — the active cell is finalized immediately on cancel rather than waiting for the engine's
TurnCompleteecho. Backgroundblock:falseswarms remain durable and tracked throughswarm_jobs. (#243) - Post-turn workspace snapshot detached — the snapshot still runs on
spawn_blockingbut the engine no longer awaits itsJoinHandle, so the UI accepts input immediately afterTurnComplete. (#234) - Shell output preserves Cargo/test summaries under truncation — high-signal tail lines (
test result:,failures:,error[E…],Finished,Compiling, panic markers) survive truncation so the agent doesn't re-run gates. (#242) - Monotonic spend display —
displayed_session_cost+displayed_cost_high_waterensure the visible session+sub-agent total never decreases across reconciliation events (cache discounts, provisional → final). (#244) - Clipboard module expanded with additional platform-aware copy/paste paths. (
crates/tui/src/tui/clipboard.rs) - Context inspector enriched with additional metadata columns and session-scoped agent state. (
crates/tui/src/tui/context_inspector.rs) - Configuration documentation updated for v0.7.7 settings. (
docs/CONFIGURATION.md,docs/MODES.md)
Fixed
- Windows npm install path — the npm-distributed
deepseekdispatcher now locates the platform-correctdeepseek-tuibinary (.exesuffix on Windows), fixing runtime failures for Windows users. (#247) - Sidebar/transcript/footer agreement — all three surfaces now agree on agent counts and status because they share the canonical
swarm_jobsstore. (#236, #238) - Fanout card clobbering — overlapping swarms no longer overwrite each other's progress cards. (#238)
- Cost display regression — negative reconciliation events (cache-hit discount applied after provisional count) no longer briefly drop the displayed cost. (#244)
Tests
- 65+ new/expanded tests: checklist card rendering, swarm card index binding, fanout tool suppression, Esc cancel contract, monotonic spend under reconciliation, shell summary preservation, Windows sibling binary lookup, clipboard platform paths, context menu state transitions
Added
- UI Localization registry —
localesetting insettings.toml(auto,en,ja,zh-Hans,pt-BR) withLC_ALL/LC_MESSAGES/LANGauto-detection. Core packs shipped for English, Japanese, Chinese Simplified, and Brazilian Portuguese covering composer placeholder, history search,/configchrome, and help overlay. Missing/unsupported locales fall back to English. (crates/tui/src/localization.rs,docs/CONFIGURATION.md) - Grouped, searchable
/configeditor — settings organized by section (Model, Permissions, Display, Composer, Sidebar, History, MCP) with live substring filter. Typingj/knavigates when the filter is empty; otherwise they enter the filter. (crates/tui/src/tui/views/mod.rs) - Pending input preview widget — while a turn is running, queued messages, pending steers, rejected steers, and context chips render above the composer. Three-row-per-message truncation with ellipsis overflow. (
crates/tui/src/tui/widgets/pending_input_preview.rs) - Alt+↑ edit-last-queued — pops the most recently queued message back into the composer for editing. No-op when the composer is dirty. (
crates/tui/src/tui/app.rs) - Composer history search and draft recovery —
Alt+Ropens a live substring search acrossinput_historyanddraft_history(max 50 entries).Enteraccepts,Escrestores the pre-search draft. Unicode case-insensitive matching. (crates/tui/src/tui/app.rs) - Paste-burst detection — fallback rapid-key paste detection independent of terminal bracketed-paste mode. Configurable via
paste_burst_detectionsetting (default on). CRLF normalization (\r\n→\n,\r→\n). (crates/tui/src/tui/paste_burst.rs) - Composer attachment management —
↑at the composer start selects the attachment row;Backspace/Deleteremoves it without editing placeholder text. (crates/tui/src/tui/app.rs) - Searchable help overlay — live substring filter across slash commands and keybindings, multi-term AND matching, localized chrome. (
crates/tui/src/tui/views/help.rs) - Keyboard-binding documentation catalog — single source of truth for help overlay rendering. Documents 38+ keyboard chords across Navigation, Editing, Submission, Modes, Sessions, Clipboard, and Help sections. (
crates/tui/src/tui/keybindings.rs) - Legacy Rust deprecation audit — non-destructive compatibility audit covering legacy MCP sync API, prompt constants,
/compact,todo_*aliases, sub-agent aliases, providerapi_keycompatibility, model alias canonicalization, and palette aliases. Tracked by #218–#221. (docs/LEGACY_RUST_AUDIT_0_7_6.md)
Changed
- Shift+Tab cycles reasoning-effort through Off → High → Max (three behaviorally distinct tiers). Previously Tab cycled modes; Shift+Tab is now the reasoning-effort shortcut. (
crates/tui/src/tui/app.rs:1119) - Reasoning-effort
Offnow sends"off"to the API (wasNone). Allows explicit thinking disable. (crates/tui/src/tui/app.rs) - Media
@-mentions now emit<media-file>hints directing users to/attachinstead of inlining binary bytes. Tests lock in the contract. (crates/tui/src/tui/file_mention.rs) /attachrejects non-media files with a descriptive error pointing to@pathfor text. (crates/tui/src/commands/attachment.rs)- Configuration reference updated to cover all v0.7.6 settings:
locale,paste_burst_detection,reasoning_effort,composer_density,sidebar_focus, and more. (docs/CONFIGURATION.md)
Fixed
- Unicode-safe truncation in pending-input preview and view text — no more mid-character breaks on multi-byte UTF-8. (
crates/tui/src/tui/widgets/pending_input_preview.rs,crates/tui/src/tui/views/mod.rs) - CJK/emoji display-width handling in locale tests and config view rendering. (
crates/tui/src/localization.rs) - Context preview distinguishes
@media,/attach, missing, and included files with separate kind labels and inclusion status. (crates/tui/src/tui/file_mention.rs) - Config view filter accept
j/konly when filter is empty — typingjorkinto the filter field no longer navigates away. (crates/tui/src/tui/views/mod.rs)
Tests
- 7 localization tests (tag normalization, env resolution, shipped pack completeness, missing-key fallback, Unicode width truncation)
- 11 pending-input preview tests (context buckets, truncation, URL overflow, narrow-width)
- 13 paste tests (burst detection, CRLF normalization, clipboard images, Unicode)
- 9 draft/history search tests (match filter, unicode, accept/cancel, recovery)
- 93 config tests (grouping, filter, edit, j/k, localization, escape/cancel)
- 24 workspace tests (context refresh, scroll, mention completion)
- 7 file-mention tests (context references, media/attach distinction, removability)
[0.7.1] - 2026-04-28
Added
- Grouped active tool-call cards with compact rails and a live working-status row while tools run. (#142, #149)
- Selected-card-aware Alt+V details so the visible or selected tool card opens the matching detail payload. (#143)
- Compact terminal-native session context inspector with persisted
@pathand/attachreference metadata for resumed transcripts. (#146, #150)
Changed
- Polished tool cards, diff summaries, and pending context previews for denser terminal-native scanning. (#141, #144, #145, #148)
- Ranked Ctrl+P file-picker results with working-set relevance from modified files, recent
@filementions, and recent tool paths while keeping fuzzy filtering in memory. (#147)
[0.7.0] - 2026-04-28
Added
- OS keyring-backed auth storage with
deepseek authsubcommands, migration from plaintext config, provider-aware key resolution, and doctor visibility. (#134) - Egress network policy with allow/deny/prompt decisions, deny-wins matching, audit logging, and enforcement hooks for network-capable tools. (#135)
- LSP diagnostics auto-injection after edits so compile feedback can be reinjected into the next agent turn. (#136)
- Side-git workspace snapshots,
/restore, andrevert_turnso agent edits can be rolled back without moving the user's repository HEAD. (#137) - Esc-Esc backtrack over prior user turns, desktop turn-complete notifications, Alt+V tool-details access, safer command-prefix auto-allow matching, bundled
skill-creator, and/skill installmanagement for community skills. (#131, #132, #133, #138, #139, #140)
Changed
- Split more engine/tool primitives into focused modules and workspace crates, including shared tool result primitives and extracted turn/capacity flow. (#67, #74)
Tests
- Added mock LLM and skill-install integration coverage for streaming turns, reasoning replay, tool-call loops, network policy, and skill validation. (#69, #140)
[0.6.5] - 2026-04-27
Added
rlm_processtool — recursive language model as a tool call. The previous/rlmslash command had a UI rendering gap (the answer never made it back to the model's view) and required the user to remember to invoke it manually.rlm_processexposes the full RLM loop as a structured tool the model itself can choose, the same way it reaches foragent_spawnorrlm_query. Inputs:task(small instruction, shown to the root LLM each iteration) plus exactly one offile_path(workspace-relative, preferred — keeps the long input out of the model's context entirely) orcontent(inline, capped at 200k chars). Optionalchild_model(defaultdeepseek-v4-flash) andmax_depth(default 1, paper experiments). Returns the synthesized answer with metadata (iterations, duration, tokens, termination reason). Loaded across Plan / Agent / YOLO; never deferred via ToolSearch. (crates/tui/src/tools/rlm_process.rs)- Reference-aligned REPL surface. Aligned the in-REPL Python helpers with the canonical reference RLM (alexzhang13/rlm). The sub-agent now sees
context(the full input, notPROMPT),llm_query,llm_query_batched,rlm_query(wassub_rlm),rlm_query_batched,SHOW_VARS(),FINAL(...),FINAL_VAR(...), plusrepl_get/repl_set. Same prompt patterns and decomposition strategies from the paper now apply verbatim. (crates/tui/src/repl/runtime.rs) - Concurrent fanout from inside the REPL.
llm_query_batched(prompts, model=None)runs up to 16 child completions in parallel via a newPOST /llm_batchsidecar endpoint — much faster than serial[llm_query(p) for p in prompts].rlm_query_batched(prompts)does the same for recursive RLM sub-calls viaPOST /rlm_batch. (crates/tui/src/rlm/sidecar.rs) SHOW_VARS()— returns{name: type-name}for every user variable in the REPL. Lets the model inspect what it has accumulated across rounds before deciding whether to callFINAL_VAR(name).- Auto-persistence of REPL variables across rounds. Any top-level JSON-serializable variable the sub-agent creates in a
replblock now persists to the next round automatically — norepl_setceremony needed unless you want explicit control. Matches the in-process reference REPL semantics.
Changed
- Code fence is
repl, notpython. Matches the reference RLM language identifier so the same prompts and few-shot examples work here. Backward-compat fallback topython/pyretained for older model behaviors. FINAL/FINAL_VARparseable from raw response text. The reference RLM lets the model writeFINAL(value)on its own line outside any code block to terminate the loop. Addedparse_text_final()so that path works alongside the existing in-REPL Python sentinel mechanism. Code-fenced occurrences ofFINAL(...)are correctly ignored to avoid false positives.- Strict termination loop. The sub-agent must emit a ```repl block (or text-level FINAL) to make progress. One fence-less round triggers a reminder; two consecutive trigger a
RlmTermination::DirectAnswerexit so we don't loop forever. rlm_processseparatestask(root_prompt) fromfile_path/content(context). Thetaskrides along asroot_promptand is shown to the root LLM each iteration; the big input lives only in the REPL ascontext. Mirrors the reference'scompletion(prompt, root_prompt=...)API.- System prompt rewritten with the reference's strategy patterns (PREVIEW → CHUNK + map-reduce via
llm_query_batched→ RECURSIVE decomposition viarlm_query→ programmatic computation + LLM interpretation). - The
/rlmslash command stays for manual experimentation but is no longer the recommended path; the description incommands/mod.rsnow points the model towardrlm_processfor the in-agent flow.
Reference
- Zhang, Kraska, Khattab. "Recursive Language Models." arXiv:2512.24601.
- alexzhang13/rlm — reference implementation by the paper authors. Variable names, helper surface, and code-fence convention align with that repo so prompts and patterns transfer.
Fixed
/rlmactually recurses now (Algorithm 1 substrate, paper-faithful). The v0.6.3 RLM loop had the right shape but its recursive substrate was non-functional:llm_query()was a Python stub that returned a hardcoded string, andchild_modelwas bound with an underscore prefix and silently dropped. The loop ran but the sub-LLM never fired. v0.6.4 fixes this end-to-end:- HTTP sidecar. Each RLM turn spins up a localhost-only axum server on a kernel-assigned port for the duration of the turn. Python's
llm_query()andsub_rlm()are realurllib.request.urlopenPOSTs; Rust services them via the existing DeepSeek client and returns the completion text. No long-lived python process, no FIFOs, no two-pass replay — Python blocks on HTTP, Rust answers it. (crates/tui/src/rlm/sidecar.rs) child_modelis plumbed through.Op::RlmQueryandAppAction::RlmQuerycarry the configured child model (defaultdeepseek-v4-flash) all the way to the sidecar, where everyllm_query()call uses it. Token usage is folded intoRlmTurnResult.usageso cost tracking works.sub_rlm()is exposed as a paper-faithful recursive RLM call. The Python REPL gets a realsub_rlm(prompt)function that runs another full Algorithm-1 turn at depth-1 inside the same process (different sidecar route, decremented recursion budget). Defaultmax_depth = 2from the/rlmcommand — the model can recurse twice before the budget hits zero. The recursive opaque-future cycle (run_rlm_turn_inner→start_sidecar→sub_rlm_handler→run_rlm_turn_inner) is broken by returning a concretePin<Box<dyn Future + Send>>fromrun_rlm_turn_inner.- Strict termination. The loop only ends via
FINAL(value)(or the iteration cap). The previous "no fence = direct answer, end loop" early-exit deviated from the paper and could short-circuit on iteration 1 with a chatty model that never sawPROMPT. The new behavior tolerates one fence-less round (with a reminder appended), then falls back to aRlmTermination::DirectAnswerexit.RlmTurnResultnow carries atermination: RlmTerminationenum (Final | DirectAnswer | Exhausted | Error) so callers can tell what happened. - Richer
Metadata(state). The metadata message the root LLM sees now includes paper-required access patterns (repl_get, slicing,splitlines,repl_set,llm_query,sub_rlm,FINAL) and a live list of variable keys currently in the REPL state file — so the model can see what it's accumulated across rounds without us shipping the values themselves. - Unicode-safe truncation.
truncate_textnow counts Unicode codepoints (was mixingtext.len()bytes withchars().take(n)), so multi-byte previews can no longer mis-count. Per-turn temp state files are cleaned up on completion.ROOM_TEMPERATUREtypo →ROOT_TEMPERATURE. - End-to-end smoke test.
rlm::turn::tests::sidecar_url_is_exported_to_python_envstands up a stand-in axum server that always replies{"text":"pong-from-sidecar"}, runsprint(llm_query('hello'))in the realPythonRuntime, and asserts the reply round-trips. This catches future regressions in the sidecar URL passthrough.
- HTTP sidecar. Each RLM turn spins up a localhost-only axum server on a kernel-assigned port for the duration of the turn. Python's
Reference
- Zhang, Kraska, Khattab. "Recursive Language Models." arXiv:2512.24601 (Algorithm 1).
Added
- Sub-agents surface in the footer status strip. When N > 0 sub-agents are in flight, the footer grows a "1 agent" / "N agents" chip in DeepSeek-sky color matching the model badge. Hides entirely at zero. (
footer_agents_chipinwidgets/footer.rs) @-mention popup is fully wired in the composer. Previously only the App state fields existed (mention_menu_selected,mention_menu_hidden). The popup now renders below the input mirror-style with the slash menu, with@-prefixed entries; Up/Down navigates, Enter / Tab apply the selection, Esc hides until the next input edit. Mention takes precedence over slash because the positional check is stricter. (visible_mention_menu_entries+apply_mention_menu_selectioninfile_mention.rs)
Fixed
- Tool-call cells no longer flash
<command>/<file>placeholders. The engine used to emitToolCallStartedfromContentBlockStartwithinput: {}— before anyInputJsonDeltahad streamed in — which baked the placeholder into the cell at creation time. The emission is now deferred toContentBlockStopand routed throughfinal_tool_input, so the cell is created with the parsed args already in hand. (engine.rsfinal_tool_input; engine/tests.rsfinal_tool_input_*) parse_invocation_countflake. Twomarkdown_rendertests both read the global PARSE_INVOCATIONS atomic and raced when other tests calledparse()in parallel. Switched the counter tothread_local!<Cell<u64>>, so each test thread sees only its own invocations. Tested 8 sequential full-suite runs: 8/8 green (was ~40% green).
Changed
- System prompts redesigned with decomposition-first philosophy. All four prompt tiers (base, agent, plan, yolo) now teach the model to decompose tasks before acting —
todo_writefirst for granular task tracking,update_planfor high-level strategy, and sub-agents for parallelizable work. Inspired by the "mismanaged geniuses hypothesis" (Zhang et al., 2026): frontier LMs are already capable enough; the bottleneck is how we scaffold their self-management. The prompts now make work visible through the sidebar (Plan / Todos / Tasks / Agents) instead of letting the model work invisibly. - Tool labels use progressive verbs. "Read foo.rs" → "Reading foo.rs", "List X" → "Listing X", "Search pattern" → "Searching for
pattern", "List files" → "Listing files". Past-tense labels read wrong while a tool is still in flight; the new forms match what the user actually sees. - Long-running tools grow an elapsed badge. From 3 s onward the
runningstatus segment becomesrunning (3s),running (4s), … so the user can tell a tool isn't stuck. The status-animation tick (360 ms) drives the redraw; below 3 s the badge stays hidden so quick reads/greps don't churn. (history.rsrunning_status_label_with_elapsed) - Spinner pulse is twice as fast —
TOOL_STATUS_SYMBOL_MS1800 ms → 720 ms per glyph (full 4-glyph heartbeat in ~2.88 s instead of ~7.2 s). tools/subagent.rsis now a folder module. Tests live intools/subagent/tests.rs; runtime + manager + tool implementations stay intools/subagent/mod.rs. Public API unchanged. The runtime / tool-impl split was deferred —SubAgentTask,run_subagent_task,build_allowed_tools, the agent prompt constants, andnormalize_role_aliasare referenced from both layers and need a small API design pass before they cleanly separate.
Test hygiene
- 5 regression tests pin auto-scroll churn contract.
mark_history_updateddoes not scroll; tool-cell handlers onlymark_history_updated;add_messageandflush_active_cellgate onuser_scrolled_during_stream; the per-stream lock clears at TurnComplete and when the user returns to the live tail. (P2.4)
0.6.1 - 2026-04-26
Changed
- V4 cache-hit input prices cut to 1/10th per DeepSeek's pricing update. Pro promo 0.03625→0.003625, Pro base 0.145→0.0145, Flash 0.028→0.0028 per 1M tokens. Cache-miss and output rates unchanged.
- Removed the "light" theme option. It was never tested, looked bad, and the dark/whale palettes are the supported targets. Theme validation now accepts only
default,dark, andwhale. - System prompts redesigned with decomposition-first philosophy. All five prompt tiers teach the model to
todo_writebefore acting,update_planfor strategy, and sub-agents for parallel work. Inspired by the mismanaged-geniuses hypothesis (Zhang et al., 2026).
0.6.0 - 2026-04-25
Added
rlm_querytool — recursive language models as a first-class structured tool. Inspired by Alex Zhang's RLM work and Sakana AI's published novelty-search research, but trimmed to what an agent loop actually needs. The model callsrlm_querywith one prompt or up to 16 concurrent prompts; children run ondeepseek-v4-flashby default and can be promoted to Pro per-call. Children dispatch concurrently viatokio::join_allagainst the existing DeepSeek client — no external runtime, no fenced-block DSL, no Python sandbox. Returns plain text for one prompt, indexed[0] ...\n\n---\n\n[1] ...blocks for many. Available in Plan / Agent / YOLO. Cost is folded into the session's running total automatically.
Changed
- Scroll position survives content rewrites (#56).
TranscriptScroll::resolve_topandscrolled_byno longer teleport to bottom when the anchor cell vanishes. Three-level fallback chain: same line → same cell, line 0 → nearest surviving cell at-or-before. Previously, any rewrite of the assistant message (e.g. tool-result replacement) silently dropped the user back to the live tail mid-scroll. - Looser command-safety chains (#57).
cargo build && cargo test,git fetch && git rebase, and similar chains of known-safe commands now escalate toRequiresApprovalinstead of being hard-blocked asDangerous. Chains containing unknown commands still block. GettingCrowdedno longer surfaces a footer chip. The context-percent header already covers conversation pressure; the chip now only fires for active engine interventions (refreshing context,verifying,resetting plan).
[0.5.2] - 2026-04-25
Added
/modelopens a Pro/Flash + thinking-effort picker (#39). Typing/modelwith no argument now pops a two-pane modal: model on the left (deepseek-v4-proflagship,deepseek-v4-flashfast/cheap, plus a "current (custom)" row when the active id isn't one of the listed defaults), and thinking effort on the right. Tab/←/→ swaps panes, ↑/↓ moves within the focused pane, Enter applies both selections, Esc cancels. The effort pane intentionally exposes only Off / High / Max because DeepSeek's Thinking Mode docs statelow/mediumare mapped tohighserver-side andxhighis mapped tomax— the legacy variants stay valid in~/.deepseek/settings.tomlfor back-compat, the picker just doesn't surface them. Apply path persistsdefault_modelandreasoning_effortto settings, forwardsOp::SetModel+Op::SetCompactionto the running engine so the next turn picks up the change without a restart, and resets the per-turn token gauges (cache, replay) so the footer numbers reflect the new model./model <id>keeps working unchanged for power users.
[0.5.1] - 2026-04-25
Added
fetch_urltool for direct HTTP GET on a known URL — complementsweb_searchfor cases where the link is already known. Supportsformat(markdown/text/raw),max_bytes(default 1 MB, hard cap 10 MB),timeout_ms(default 15 s, max 60 s), redirect following, and structured{url, status, content_type, content, truncated}responses. 4xx/5xx bodies are returned (withsuccess: false) so the caller can read JSON error envelopes. (#33)- PDF support in
read_file. PDFs are auto-detected by extension or%PDF-magic bytes and extracted viapdftotext -layout(poppler) when available. New optionalpagesarg ("5"or"1-10") reads page slices. Withoutpdftotext, returns a structured{type: "binary_unavailable", kind: "pdf", reason, hint}with install commands for macOS/Debian. (#34) - Reasoning-content replay telemetry, end-to-end (#30). The chat-completions sanitizer now estimates replayed
reasoning_contenttokens (~4 chars/token), threads the value through the streamingUsagepayload, stores it on the App, and renders anrsn N.Nkchip in the footer next to the cache hit-rate. The chip turns warning-coloured when replay tokens exceed 50% of the input budget, so users on long thinking-mode loops can see at a glance how much of their context window is going to V4's "Interleaved Thinking" replay (paper §5.1.1). Logged atRUST_LOG=deepseek_tui=infofor tail-friendly diagnosis. @fileTab-completion (#28). Typing@<partial>and pressing Tab now resolves the mention against the workspace using the existingignore::WalkBuilder. A unique match is spliced into the input; multiple matches with a longer common prefix extend the partial; remaining ambiguity is surfaced via the status line. The mention-expansion path that ships file contents to the model is unchanged — this is purely a discovery aid for typing the path. Inline-contents and a fuzzy popup picker are queued for v0.5.2.- Per-workspace external trust list (#29).
~/.deepseek/workspace-trust.jsonnow records, for each workspace, the absolute paths the user has opted into reading/writing from outside that workspace. The new/trustslash command supportsadd <path>,remove <path>,list,on,off, and a status read with no args; the engine consults the list when constructing everyToolContextso changes apply on the next tool call without restart./diagnosticssurfaces the list. The interactive "Allow once / Always allow / Deny" approval prompt is deferred — for now grant access ahead of the turn with/trust add <path>.
Fixed
- TUI sidebar gutter bleed regression test (#36). Snapshot tests now lock in that long single-line tool results — including a
todo_writeecho of a multi-kilobyte JSON payload — never write any cells outsidechat_areaat the widths reported in the bug (80, 120, 165, 200 cols). A second test verifies the scrollbar coexists with content along the right edge instead of overdrawing the penultimate column. - Version drift caught in CI. New
versionsjob in.github/workflows/ci.ymlrunsscripts/release/check-versions.shon every push/PR, verifying every per-crateCargo.tomlinherits the workspace version, the npm wrapper matches the workspace version, andCargo.lockis in sync. The release runbook now listscheck-versions.shas the first preflight step. (#31) - Per-mode soft context budget for V4 compaction trigger (#27).
- Phantom
web.runreferences stripped from prompts and theweb_searchtool surface (#25). - Unused import +
cargo fmtdrift that landed withfeat(#27)and broke Build / Test / npm wrapper smoke under-Dwarnings.
[0.5.0] - 2026-04-25
Fixed
- Multi-turn tool calls on thinking-mode models no longer return HTTP 400. Every assistant message in the conversation now carries
reasoning_contentwhen thinking is enabled — not just tool-call rounds — matching DeepSeek's actual API validation, which rejects any assistant message missing the field even though the docs describe non-tool-call reasoning as "ignored". - Added a final-pass wire-payload sanitizer in the chat-completions client that forces a non-empty
reasoning_contentplaceholder onto any assistant message still missing one at request time. This is the last line of defense after engine-side and build-side substitution, so sessions restored from older checkpoints, sub-agents that append messages directly, and cached prefix mismatches all produce a valid request. - On a
reasoning_content-related 400, the client now logs the offending message indices to make future regressions diagnosable. - Stripped phantom
web.runreferences from prompts and theweb_searchtool surface (#25).
Changed
- Header/UI widget refactor in the TUI (
crates/tui/src/tui/ui.rs,widgets/header.rs) — internal cleanup, no user-visible behavior change.
0.4.9 - 2026-04-27
Fixed
- DeepSeek thinking-mode tool-call rounds now always replay
reasoning_contentin all subsequent requests (including across new user turns), matching DeepSeek's documented API contract that assistant messages with tool calls must retain their reasoning content forever. - Missing
reasoning_contenton a tool-call assistant message now substitutes a safe placeholder ("(reasoning omitted)") instead of dropping the tool calls and their matching tool results, preventing orphaned conversation chains and API 400 errors. - Session checkpoint now persists a Thinking-block placeholder for tool-call turns that produced no streamed reasoning text, keeping on-disk sessions structurally correct so subsequent requests avoid HTTP 400 rejections.
- Token estimation for compaction now counts thinking tokens across all tool-call rounds (not just the current user turn), aligning with the updated reasoning_content replay rule.
0.4.8 - 2026-04-25
Fixed
- DeepSeek V4 Pro cost estimates now use DeepSeek's current limited-time 75% discount until 2026-05-05 15:59 UTC, then automatically fall back to the base Pro rates.
[0.4.5] - 2026-04-24
Fixed
- Alternate-screen TUI sessions now capture mouse input by default so wheel scrolling moves the transcript instead of exposing terminal scrollback from before the TUI started. Use
--no-mouse-captureortui.mouse_capture = falsewhen terminal-native drag selection is preferred.
[0.4.2] - 2026-04-24
Fixed
- DeepSeek V4 thinking-mode tool turns now checkpoint the engine's authoritative API transcript, including assistant
reasoning_contenton reasoning-to-tool-call turns with no visible assistant text. - Chat Completions request building now drops stale V4 tool-call rounds that are missing required
reasoning_content, preventing old corrupted checkpoints from triggering DeepSeek HTTP 400 replay errors. - Web search now falls back to Bing HTML results when DuckDuckGo returns a bot challenge or otherwise yields no parseable results.
[0.4.1] - 2026-04-24
Fixed
- DeepSeek V4 tool-result context now preserves large file reads and command outputs instead of compacting noisy tools to a 900-character snippet after 2k characters.
- Capacity guardrail refresh no longer performs destructive summary compaction unless the normal model-aware compaction thresholds are actually crossed.
- V4 compaction summaries retain larger tool-result excerpts and summary input when compaction is genuinely needed.
- The transcript now follows the bottom again when sending a new message, shows an in-app scrollbar when internally scrolled, and leaves mouse capture off in
--no-alt-screenmode so terminal-native scrolling can work.
[0.4.0] - 2026-04-23
Added
- DeepSeek V4 support:
deepseek-v4-pro(flagship) anddeepseek-v4-flash(fast/cheap) are now first-class model IDs with 1M context windows. - Reasoning-effort tier: new
reasoning_effortconfig field (off | low | medium | high | max) mapped to DeepSeek'sreasoning_effort+thinkingrequest fields. Defaults tomax. - Shift+Tab cycles reasoning-effort through the three behaviorally distinct tiers (
off → high → max). The current tier is shown as a ⚡ chip in the header. - Per-model pricing table:
deepseek-v4-propriced at $0.145/$1.74/$3.48 per 1M tokens (cache-hit/miss/output);deepseek-v4-flashand legacy aliases at $0.028/$0.14/$0.28.
Changed
- Default model flipped to
deepseek-v4-pro(fromdeepseek-reasoner). deepseek-chat/deepseek-reasonerremain as silent aliases ofdeepseek-v4-flashfor API compatibility; priced identically.- Context compaction: 1M-context V4 models now compact at 800k input tokens or 2,000 messages, so short/tool-heavy sessions do not compact as if they were 128k-context runs.
- Cycling modes is now Tab-only; Shift+Tab is repurposed for reasoning-effort (reverse-mode cycle was low-value with only three modes).
- Updated help/hint strings, validator error messages, and the model picker to reference V4 IDs.
Fixed
requires_reasoning_contentnow recognizesdeepseek-v4*so thinking streams render correctly on V4 models.- DeepSeek V4 thinking-mode tool calls now preserve prior assistant
reasoning_contentwhenever a tool call is replayed, matching DeepSeek's multi-turn contract and avoiding HTTP 400 rejections on later turns. - Raw Chat Completions requests now send DeepSeek's top-level
thinkingparameter instead of the OpenAI SDK-onlyextra_bodywrapper. - Config, env, and UI model selection now normalize legacy DeepSeek aliases to
deepseek-v4-flashinstead of preserving old model labels. - npm wrapper first-run downloads now use process-unique temp files so concurrent
deepseek/deepseek-tuiinvocations do not race on*.downloadfiles.
0.3.33 - 2026-04-11
Changed
- Footer polish: simplified footer rendering, removed footer clock label, updated status line layout
- Palette cleanup: removed
FOOTER_HINTcolor constant
Removed
FOOTER_HINTcolor constant from palette (useTEXT_MUTEDorTEXT_HINTinstead)
Fixed
- Test updates to align with simplified footer logic
- Empty state placeholder text removed for cleaner UI
0.3.32 - 2026-04-11
Added
- Finance tool: Yahoo Finance v8 quote endpoint with chart fallback, supporting stocks, ETFs, indices, forex, and crypto lookups.
- Header widget redesign: proportional truncation, context-usage bar with gradient fill, streaming indicator, and graceful narrow-terminal degradation.
- Expanded test coverage: 680+ tests including footer state, context spans, plan prompt lifecycle, workspace context refresh, header rendering, and finance tool integration tests with wiremock.
- Workspace context refresh with configurable TTL and deferred initial fetch.
- Config command additions for runtime settings management.
Changed
- Redesigned footer status strip with mode/model/status layout, context bar, and narrow-terminal fallback.
- Plan prompt now uses numeric selection (1-4) instead of keyword input; old aliases are sent as regular messages.
- Archived outdated docs (
workspace_migration_status.md->docs/archive/). - Trimmed AGENTS.md boilerplate and updated task counts.
- Clarified release-surface documentation: crates.io publication may lag the workspace/npm wrapper.
Fixed
- Header
metadata_spansnow usessaturating_subto prevent underflow on narrow terminals. - Finance tool reuses a single HTTP client instead of rebuilding per request.
- Finance tool tests no longer leak temp directories.
0.3.31 - 2026-03-08
Added
- Replaced the finance tool backend with Yahoo Finance v8 + CoinGecko fallback for reliable real-time market data (stocks, ETFs, indices, forex, crypto).
- Added compaction UX: status strip shows animated COMPACTING indicator during context summarization, footer reflects compaction state, and CompactionCompleted events now include message count statistics.
- Added send flash: brief tinted background highlight on the last user message after sending.
- Added braille typing indicator with smooth 10-frame animation cycle.
Changed
- Redesigned the footer status strip with mode/model/token/cost layout, quadrant separators, and a context-usage bar.
- Added Unicode prefix indicators (▸ You, ◆ Answer, ● System) to chat history cells for visual distinction.
- Improved thinking token delineation with labeled delimiters in transcript rendering.
- Refactored source code into workspace crates for better modularity and dependency management.
Fixed
- Fixed Plan mode ESC key dismissing the prompt without clearing
plan_prompt_pending, which prevented the prompt from reappearing on subsequent plan completions. - Fixed clippy lint (collapsible_if) in web browsing session management.
[0.3.30] - 2026-03-06
Added
- Added a release-ready local npm smoke path that builds binaries, serves release assets locally, packs the wrapper, installs the tarball, and checks both entrypoints before publish.
- Added an opt-in full-matrix local release-asset fixture so
npm run release:checkcan be exercised before GitHub release assets exist.
Changed
- Bumped the Rust workspace crates and npm wrapper to
0.3.30. - Pointed the npm wrapper's default
deepseekBinaryVersionat0.3.30for the next coordinated Rust + npm release. - Updated the crates dry-run helper to work from a dirty workspace and to preflight dependent workspace crates without requiring unpublished versions to already exist on crates.io.
[0.3.29] - 2026-03-03
Added
- Added npm publish-time release asset verification for the
deepseek-tuipackage to fail fast when expected GitHub binaries are missing. - Added checksum manifests to GitHub release assets and checksum verification in the npm installer.
- Added
npm packinstall-and-smoke CI coverage for thedeepseek-tuiwrapper package. - Added an end-to-end release runbook covering crates.io, GitHub Releases, and npm publication.
Changed
- Updated npm package documentation for clearer install modes, environment overrides, and release integrity behavior.
- Improved installer support-matrix error messaging for unsupported platform/architecture combinations.
- Decoupled npm package version from default binary artifact version via
deepseekBinaryVersion, enabling packaging-only npm releases. - Moved the
deepseek-tuibinary target insidecrates/tuisocargo publish --dry-run -p deepseek-tuiworks from the workspace package layout. - Replaced the root-level crates publish workflow with an ordered workspace publish flow.
- Reworked first-run onboarding and README copy around primary workflows instead of shortcut memorization.
- Relaxed onboarding API-key format heuristics so unusual keys warn instead of blocking setup.
0.3.28 - 2026-03-02
Added
- Converted the project to a modular Cargo workspace using a
crates/layout. - Added new crate boundaries mirroring a deepseek architecture (
agent,config,core,execpolicy,hooks,mcp,protocol,state,tools,tui-core,tui, andapp-server).
Changed
- Added parity CI coverage with protocol/state/snapshot checks.
- Updated release workflow to build both
deepseekanddeepseek-tuibinaries.
[0.3.26] - 2026-03-02
Fixed
- Resolved SSE stream corruption caused by byte/string position mismatch in streaming parse flow.
- Hardened base URL validation to reject non-HTTP/HTTPS schemes.
- Prevented multi-byte UTF-8 truncation panics in common-prefix and runtime thread summary paths.
- Corrected context usage alert thresholds by separating warning and critical trigger levels.
Changed
- Removed non-code utility tools from the runtime tool registry (
calculator,weather,sports,finance,time) and related wiring. - Consolidated duplicate URL encoding helpers by delegating to shared
crate::utils::url_encode. - Replaced broad crate-level lint suppressions with targeted
#[allow(...)]annotations where justified. - Cleaned up dead APIs, unused struct fields, unused builder helpers, and non-integrated modules.
- Addressed clippy findings across the codebase (collapsible conditionals, defaults, indexing helpers, and API signature cleanup).
[0.3.24] - 2026-02-25
Fixed
- Preserve reasoning-only assistant turns for DeepSeek reasoning models (
deepseek-reasoner, R-series markers) when rebuilding chat history. - Align SSE tool streaming indices so each tool block start/delta/stop uses the same block index.
- Prevent transcript auto-scroll-to-bottom when a non-empty transcript selection is active.
- Allow session picker search mode to accept the current selection with a single
Enterpress. - Preserve tool output whitespace/indentation while still wrapping long unbroken tokens.
- Make transcript selection copy/highlighting display-width aware (wide chars and tabs).
- Gate execpolicy behavior on the
exec_policyfeature flag across CLI/tool execution paths. - Run doctor API connectivity checks using the effective loaded config/profile (instead of reloading defaults).
- Parse DeepSeek model context-window suffix hints such as
-32kand-256k. - Update README config docs with key environment overrides and a direct link to full configuration docs.
0.3.23 - 2026-02-24
Changed
- Updated project copy to describe the app as a terminal-native TUI/CLI for DeepSeek models (not pinned to a specific model generation).
Fixed
- Model selection and config validation now accept any valid
deepseek-*model ID (including future releases), while still normalizing common aliases likedeepseek-v3.2anddeepseek-r1. - Tool-call recovery now auto-loads deferred tools when the model requests them directly, instead of failing with manual
tool_search_*instructions. - YOLO mode now preloads tools by default (including deferred MCP tools), so model tool calls can run immediately without discovery indirection.
- Unknown tool-call failures now include discovery guidance and nearest tool-name suggestions instead of generic availability errors.
- Slash-command errors now suggest the closest known command (for example
/modle->/model) instead of only returning a generic unknown-command message.
0.3.22 - 2026-02-19
Added
- Interactive
/configediting modal for runtime settings updates.
Changed
- Retired user-facing
/setcommand path (no longer reachable/discoverable). - Replaced
/deepseekcommand behavior with/links(aliases:dashboard,api).
Fixed
- Legacy
/setand/deepseekinputs now return migration guidance instead of generic unknown-command errors.
0.3.21 - 2026-02-19
Added
- Parallel tool execution in
multi_tool_use.parallelfor independent task workflows. - Session resume-thread coverage in tests.
Changed
- Desktop and web parity polish across the TUI and runtime surfaces.
- Onboarding and approval UX refinement from prior phase 3 iteration.
Fixed
- Runtime pre-release startup issues and config-path edge cases.
- Clippy lint regressions introduced by the last parity pass.
Security/Hardening
- General pre-release hardening for runtime app behavior.
0.3.17 - 2026-02-16
Fixed
- Config loading now expands
~inDEEPSEEK_CONFIG_PATHand--configpaths. - When
DEEPSEEK_CONFIG_PATHpoints to a missing file, config loading now falls back to~/.deepseek/config.tomlif it exists.
Changed
- Removed committed transient runtime artifacts (
session_*.json,.deepseek/trusted) and added ignore rules to prevent re-commit.
0.3.16 - 2026-02-15
Added
deepseek modelsCLI command to fetch and list models from the configured/v1/modelsendpoint (with--jsonoutput mode)./modelsslash command to fetch and display live model IDs in the TUI.- Slash-command autocomplete hints in the composer plus
Tabcompletion for/commands. - Command palette modal (
Ctrl+K) for quick insertion of slash commands and skills. - Persistent right sidebar in wide terminals showing live plan/todo/sub-agent state.
- Expandable tool payload views (
vin transcript,vin approval modal) for full params/output inspection. - Runtime HTTP/SSE API (
deepseek serve --http) with durable thread/turn/item lifecycle, interrupt/steer, and replayable event timeline. - Background task queue (
/task add|list|show|cancelandPOST /v1/tasks) with persistent storage, bounded worker pool, and timeline/artifact tracking.
Changed
- Centralized the default text model (
DEFAULT_TEXT_MODEL) and shared common model list to reduce drift across runtime/config paths. /modelnow clarifies that any valid DeepSeek model ID is accepted (including future releases), while still showing common model IDs.
Fixed
- Expanded reasoning-model detection for chat history reconstruction (supports R-series and reasoner-style naming without hardcoding single versions).
- Aligned docs/config examples with the then-current runtime default model.
0.3.14 - 2026-02-05
Added
web.runnow supportsimage_query(DuckDuckGo image search)multi_tool_use.parallelnow supports safe MCP meta tools (list_mcp_resources,mcp_read_resource, etc.)
Fixed
- Encode tool-call function names when rebuilding Chat Completions history (keeps dotted tool names API-safe)
Changed
- Prompts: stronger
web.runcitation placement and quote-limit guidance
0.3.13 - 2026-02-04
Fixed
- Restore an in-app scrollbar for the transcript view
0.3.12 - 2026-02-04
Fixed
- Map dotted tool names to API-safe identifiers for DeepSeek tool calls
- Encode any invalid tool names for API tool lists while preserving internal names
0.3.11 - 2026-02-04
Fixed
- Fix tool name mapping for DeepSeek API
0.3.10 - 2026-02-04
Fixed
- Always enable mouse wheel scrolling in the TUI (even without alt screen)
[0.3.9] - 2026-02-04
Removed
- RLM mode, tools, and documentation pending a faithful implementation of the MIT RLM design
- Duo mode tools and prompts pending a citable research spec
Fixed
- Footer context usage bar remains visible while status toasts are shown
Changed
- Updated prompts and docs to reflect the simplified mode/tool surface
[0.3.8] - 2026-02-03
Fixed
- Resolve clippy warnings (CI
-D warnings) in new tool implementations
[0.3.7] - 2026-02-03
Added
- Tooling parity updates:
weather,finance,sports,time,calculator,request_user_input,multi_tool_use.parallel,web.run - Shell streaming helpers:
exec_shell_waitandexec_shell_interact - Sub-agent controls:
send_inputandwait(with aliases) - MCP resource helpers:
list_mcp_resources,list_mcp_resource_templates, andread_mcp_resourcealias
Changed
- Skills directory selection now prefers workspace
.agents/skills, then./skills, then global - Docs and prompts updated to reflect new tool surface and parity notes
0.3.6 - 2026-02-02
Added
- New welcome banner on startup showing "Welcome to DeepSeek TUI!" with directory, session ID, and model info
- Visual context progress bar in footer showing usage with block characters [████░░░░░░] and percentage
Changed
- Removed custom block-character scrollbar from chat area - now uses terminal's native scroll
- Simplified header bar: removed context percentage indicator (moved to footer as progress bar)
0.3.5 - 2026-01-30
Added
- Intelligent context offloading: large tool results (>15k chars) are automatically moved to RLM memory to preserve the context window
- Persistent history context: compacted messages are offloaded to RLM
historyvariable for recall - Full MCP protocol support: SSE transport, Resources (
resources/list,resources/read), and Prompts (prompts/list,prompts/get) mcp_read_resourceandmcp_get_promptvirtual tools exposed to the model- Dialectical Duo mode with specialized TUI rendering (
Player/Coachhistory cells) - Dynamic system prompt refreshing at each turn for up-to-date RLM/Duo/working-set context
project_maptool for automatic codebase structure discoverydelegate_to_agentalias for streamlined sub-agent delegation
Changed
- Default theme changed to 'Whale' with updated color palette
with_agent_toolsnow includesproject_map,test_runner, and conditionally RLM tools for all agent modes- MCP
McpServerConfig.commandis nowOption<String>to support URL-only (SSE) servers
Fixed
- MCP test compilation errors for updated
McpServerConfigstruct shape
0.3.4 - 2026-01-29
Changed
- Updated Cargo.lock dependencies
Fixed
- Compaction tool-call pairing: enforce bidirectional tool-call/tool-result integrity with fixpoint convergence
- Safety net scanning to drop orphan tool results in the request builder
- Double-dispatch race in parallel tool execution
0.3.3 - 2026-01-28
Added
- TUI polish: Kimi-style footer with mode/model/token display
- Streaming thinking blocks with dedicated rendering
- Loading animation improvements
0.3.2 - 2026-01-28
Fixed
- Preserve tool-call + tool-result pairing during compaction to avoid invalid tool message sequences
- Drop orphan tool results in request builder as a safety net to prevent API 400s
0.3.1 - 2026-01-27
Added
deepseek setupto bootstrap MCP config and skills directoriesdeepseek mcp initto generate a templatemcp.jsonat the configured path
Changed
deepseek doctornow follows the resolved config path and config-derived MCP/skills locations
Fixed
- Doctor no longer reports missing MCP/skills when paths are overridden via config or env
0.3.0 - 2026-01-27
Added
- Repo-aware working set tracking with prompt injection for active paths
- Working set signals now pin relevant messages during auto-compaction
- Offline eval harness (
deepseek eval) with CI coverage in the test job - Shell tool now emits stdout/stderr summaries and truncation metadata
- Dependency-aware
agent_swarmtool for orchestrating multiple sub-agents - Expanded sub-agent tool access (apply_patch, web_search, file_search)
Changed
- Auto-compaction now accounts for pinned budget and preserves working-set context
- Apply patch tool validates patch shape, reports per-file summaries, and improves hunk mismatch diagnostics
- Eval harness shell step now uses a Windows-safe default command
- Increased
max_subagentsclamp to1..=20
0.2.2 - 2026-01-22
Fixed
- Session save no longer panics on serialization errors
- Web search regex patterns are now cached for better performance
- Improved panic messages for regex compilation failures
[0.2.1] - 2026-01-22
Fixed
- Resolve clippy warnings for Rust 1.92
0.2.0 - 2026-01-20
Changed
- Removed npm package distribution; now Cargo-only
- Clean up for public release
Fixed
- Disabled automatic RLM mode switching; use /rlm or /aleph to enter RLM mode
- Fixed cargo fmt formatting issues
0.0.2 - 2026-01-20
Fixed
- Disabled automatic RLM mode switching; use /rlm or /aleph to enter RLM mode.
0.0.1 - 2026-01-19
Added
- DeepSeek Responses API client with chat-completions fallback
- CLI parity commands: login/logout, exec, review, apply, mcp, sandbox
- Resume/fork session workflows with picker fallback
- DeepSeek blue branding refresh + whale indicator
- Responses API proxy subcommand for key-isolated forwarding
- Execpolicy check tooling and feature flag CLI
- Agentic exec mode (
deepseek exec --auto) with auto-approvals
Changed
- Removed multimedia tooling and aligned prompts/docs for text-only DeepSeek API
0.1.9 - 2026-01-17
Added
- API connectivity test in
deepseek doctorcommand - Helpful error diagnostics for common API failures (invalid key, timeout, network issues)
0.1.8 - 2026-01-16
Added
- Renderable widget abstraction and modal view stack for TUI composition
- Parallel tool execution with lock-aware scheduling
- Interactive shell mode with terminal pause/resume handling
Changed
- Tool approval requirements moved into tool specs
- Tool results are recorded in original request order
0.1.7 - 2026-01-15
Added
- Duo mode (player-coach autocoding workflow)
- Character-level transcript selection
Fixed
- Approval flow tool use ID routing
- Cursor position sync for transcript selection
0.1.6 - 2026-01-14
Added
- Auto-RLM for large pasted blocks with context auto-load
chunk_autoandrlm_queryauto_chunksfor quick document sweeps- RLM usage badge with budget warnings in the footer
Changed
- Auto-RLM now honors explicit RLM file requests even for smaller files
0.1.5 - 2026-01-14
Added
- RLM prompt with external-context guidance and REPL tooling
- RLM tools for context loading, execution, status, and sub-queries (rlm_load, rlm_exec, rlm_status, rlm_query)
- RLM query usage tracking and variable buffers
- Workspace-relative
@pathsupport for RLM loads - Auto-switch to RLM when users request large file analysis (or the largest file)
Changed
- Removed Edit mode; RLM chat is default with /repl toggle
0.1.0 - 2026-01-12
Added
- Initial alpha release of DeepSeek TUI
- Interactive TUI chat interface
- DeepSeek API integration (OpenAI-compatible Responses API)
- Tool execution (shell, file ops)
- MCP (Model Context Protocol) support
- Session management with history
- Skills/plugin system
- Cost tracking and estimation
- Hooks system and config profiles
- Example skills and launch assets