Ccan1357feat(read): conditioned inspect_image docs on feature flag

725539ae创建于 2 天前历史提交

Read files, directories, archives, SQLite databases, images, documents, internal resources, and web URLs through a single path string.

- One tool for filesystem, archives, SQLite, images, documents (PDF/DOCX/PPTX/XLSX/RTF/EPUB/ipynb), internal URIs, and web URLs (reader-mode by default). - You SHOULD parallelize independent reads when exploring related files. - You SHOULD reach for `read` — not a browser/puppeteer tool — for fetching web content.

Parameters

path — required. Local path, internal URI (skill://, agent://, artifact://, memory://, rule://, local://, vault://, mcp://), or URL. Append :<sel> for line ranges, raw mode, or special modes (e.g. src/foo.ts:50-200, src/foo.ts:raw, db.sqlite:users:42).

Selectors

Append :<sel> to path. The bare path falls back to the default mode.

(none) — parseable code → structural summary (signatures kept, bodies elided); other files → read from the start (up to {{DEFAULT_LIMIT}} lines).
:50 / :50- — read from line 50 onward.
:50-200 — lines 50–200 inclusive.
:50+150 — 150 lines starting at line 50.
:20+1 — exactly one line.
:5-16,960-973 — multiple ranges in one call (sorted, overlaps merged).
:raw — verbatim text; no anchors, no summary, no line prefixes.
:2-4:raw or :raw:2-4 — range AND verbatim; the two compose in either order.
:conflicts — one-line-per-block index of every unresolved git merge conflict.

Files

Reading a directory path returns a depth-limited dirent listing. {{#if IS_HL_MODE}}
Reading a file with an explicit selector emits a file snapshot tag header and numbered lines: ¶src/foo.ts#0a then 41:def alpha():. Copy the ¶PATH#TAG header for anchored edits; ops use bare line numbers. NEVER fabricate the tag. {{else}} {{#if IS_LINE_NUMBER_MODE}}
Reading a file with an explicit selector returns lines prefixed with line numbers: 41|def alpha():. {{/if}} {{/if}}
Parseable code without a selector returns a structural summary: declarations kept, large bodies collapsed to .. (merged brace pair) or … (standalone). Summarized output ends with a footer demonstrating the multi-range selector you can use to recover the elided bodies, e.g.:

[NN lines elided; re-read needed ranges, e.g. <path>:5-16,40-80]

Re-issue only the relevant range(s) using the multi-range selector (e.g. <path>:5-16,120-200). NEVER guess what's inside .. / … — those markers carry no content. NEVER re-read the whole file or use :raw when targeted ranges suffice.

Documents & Notebooks

Extracts text from PDF, Word, PowerPoint, Excel, RTF, and EPUB. Notebooks (.ipynb) are shown as editable # %% [type] cell:N text; edits round-trip back to the underlying JSON preserving notebook metadata. Add :raw to a notebook to bypass the converter and read the JSON directly.

Images

{{#if INSPECT_IMAGE_ENABLED}} Reading an image path returns metadata (mime, bytes, dimensions, channels, alpha). For actual visual analysis, call inspect_image with the path and a question describing what to inspect. {{else}} Reading an image path returns the decoded image inline (PNG, JPEG, GIF, WEBP) for direct visual analysis. {{/if}}

SQLite

For .sqlite, .sqlite3, .db, .db3:

file.db — list tables with row counts
file.db:table — schema + sample rows
file.db:table:key — single row by primary key
file.db:table?limit=50&offset=100 — paginated rows
file.db:table?where=status='active'&order=created:desc — filtered rows
file.db?q=SELECT … — read-only SELECT query

URLs

Default reader-mode: HTML pages, GitHub issues/PRs, Stack Overflow, Wikipedia, Reddit, NPM, arXiv, RSS/Atom, JSON endpoints, PDFs → clean text/markdown.
:raw returns untouched HTML; line selectors (:50, :50-100, :50+150) paginate the cached fetched output.
Bare host:port URLs collide with the selector grammar — add a trailing slash before the selector: https://example.com/:80.

Internal URIs

skill://<name>, agent://<id>, artifact://<id>, memory://root, rule://<name>, local://<name>.md, vault://<vault>/<path>, mcp://<uri> resolve transparently and accept the same line selectors as filesystem paths. Use artifact://<id> to recover full output that a previous bash/eval/tool result spilled or truncated.

- You MUST use `read` for every file, directory, archive, and URL inspection. `cat`, `head`, `tail`, `less`, `more`, `ls`, `tar`, `unzip`, `curl`, `wget` are FORBIDDEN — any such bash call is a bug, regardless of how short or convenient it looks. - You MUST prefer `read` over a browser/puppeteer tool for URL content; only reach for a browser when `read` cannot deliver reasonable content. - You MUST always include `path`. NEVER call `read` with `{}`. - For line ranges, append the selector to `path` (`path="src/foo.ts:50-200"`, `path="src/foo.ts:50+150"`). NEVER substitute `sed -n`, `awk NR`, or `head`/`tail` pipelines. - Summary footer says `read :raw …`? Re-issue the exact selector it names. NEVER guess what's inside `..` / `…` markers — they carry no content. - You MAY combine selectors with URL reads and internal URIs; both paginate the cached resolved output.