feat(cli): support XLSX text extraction in read tool by marius-kilocode · Pull Request #10740 · Kilo-Org/kilocode

marius-kilocode · 2026-05-29T14:50:09Z

Spreadsheet files are still classified as binary by the read tool, so agents cannot inspect workbook contents without an external conversion step.

This adds .xlsx extraction for explicit read calls only. Visible worksheets are surfaced as labelled tab-separated text with readable formulas, formatted values, dates, hyperlinks, and errors. Hidden sheets are omitted, workbook inputs above 50 MB are rejected before parsing, worksheet extraction is bounded, and the existing read output limits continue to apply. Native PDF and image attachment behavior, along with rejection of unsupported binary spreadsheet formats, stays unchanged.
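As a rough illustration of the output shape described above, the sketch below renders visible sheets as labelled tab-separated text and skips hidden ones. It does not use SheetJS. The `Sheet` model, the `# Sheet:` label format, the `renderWorkbook` and `cleanCell` names, and the default row bound are all assumptions made for this example, not the PR's actual code.

```typescript
// Hypothetical in-memory model. The PR parses real workbooks with SheetJS.
interface Sheet {
  name: string
  hidden: boolean
  rows: string[][] // cell text, already formatted for display
}

// Assumed bound, for illustration only.
const ROW_LIMIT = 50000

// Tabs and line breaks inside a cell would corrupt the TSV layout, so collapse them.
function cleanCell(text: string): string {
  return text.replace(/[\t\r\n]+/g, " ")
}

function renderWorkbook(sheets: Sheet[], rowLimit: number = ROW_LIMIT): string[] {
  const lines: string[] = []
  for (const sheet of sheets) {
    if (sheet.hidden) continue // hidden sheets are omitted
    lines.push(`# Sheet: ${sheet.name}`)
    // Bounded extraction: never emit more than rowLimit rows per sheet.
    for (const row of sheet.rows.slice(0, rowLimit)) {
      lines.push(row.map(cleanCell).join("\t"))
    }
    if (sheet.rows.length > rowLimit) {
      lines.push(`(${sheet.rows.length - rowLimit} more rows not shown)`)
    }
  }
  return lines
}
```

Returning plain lines keeps the result compatible with a line-oriented output limit such as the one the read tool already applies.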

With notebook and DOCX reads now present in main, format selection is consolidated behind a Kilo-owned read extraction router rather than adding another format-specific control-flow branch to the shared read tool. The existing notebook and DOCX extractors continue to supply their behavior through the same narrow hook.
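One way to picture such a router is a small registry keyed by file extension, so the shared read tool asks a single question instead of branching per format. This is a sketch only: `ReadExtractionRouter`, `Extractor`, and the stub extractors are hypothetical names, and the real hook in the PR may have a different shape.

```typescript
// Hypothetical extractor signature: raw file bytes in, readable text out.
type Extractor = (bytes: Uint8Array) => string

class ReadExtractionRouter {
  private readonly byExtension = new Map<string, Extractor>()

  register(extension: string, extractor: Extractor): this {
    this.byExtension.set(extension.toLowerCase(), extractor)
    return this
  }

  // Returns undefined when no extractor claims the file, so the caller
  // can fall back to its default text or binary handling.
  extract(path: string, bytes: Uint8Array): string | undefined {
    const dot = path.lastIndexOf(".")
    if (dot < 0) return undefined
    const extractor = this.byExtension.get(path.slice(dot).toLowerCase())
    return extractor ? extractor(bytes) : undefined
  }
}

// Stub extractors stand in for the notebook, DOCX, and XLSX implementations.
const router = new ReadExtractionRouter()
  .register(".ipynb", () => "notebook text")
  .register(".docx", () => "docx text")
  .register(".xlsx", () => "xlsx text")
```

With this shape, adding another format means one more `register` call rather than another branch in the shared tool.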

The parser uses official SheetJS CE 0.20.3 from its pinned distribution tarball because the public npm registry release is outdated and affected by known vulnerabilities. It adds no transitive runtime dependencies. Compared with current main, the current-platform compiled CLI artifact increases from 105,737,122 bytes to 106,678,306 bytes, an increase of 941,184 bytes (approximately 0.90 MiB).
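The size figures can be checked with a few lines of arithmetic. The variable names are illustrative, and MiB is taken as 1024 × 1024 bytes.

```typescript
const beforeBytes = 105737122 // compiled CLI artifact on current main
const afterBytes = 106678306 // compiled CLI artifact with XLSX support

const deltaBytes = afterBytes - beforeBytes // 941184
const deltaMiB = deltaBytes / (1024 * 1024)

console.log(`${deltaBytes} bytes = ${deltaMiB.toFixed(2)} MiB`)
```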

kilo-code-bot · 2026-05-29T14:54:27Z

Code Review Summary

Status: 3 Issues Found | Recommendation: Address before merge

Overview

Severity	Count
CRITICAL	0
WARNING	1
SUGGESTION	2

Issue Details

WARNING

File	Line	Issue
`packages/opencode/src/kilocode/tool/xlsx.ts`	12	No file-size guard before reading entire XLSX into memory — a large workbook bypasses the 50 KB cap that exists for text files
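A guard of the kind this warning asks for might look like the following sketch: check the size from metadata first, and read the bytes only if the file is within the limit. `FileSource`, `readBounded`, and `MAX_XLSX_BYTES` are hypothetical names, the interface is synchronous for brevity, and treating 50 MB as 50 × 1024 × 1024 bytes is an assumption.

```typescript
// Assumed interpretation of the 50 MB limit from the PR description.
const MAX_XLSX_BYTES = 50 * 1024 * 1024

// Hypothetical file abstraction, kept synchronous for brevity.
interface FileSource {
  size(): number // cheap metadata lookup, for example a stat call
  bytes(): Uint8Array // full read into memory
}

function readBounded(file: FileSource, limit: number = MAX_XLSX_BYTES): Uint8Array {
  const size = file.size()
  if (size > limit) {
    // Reject before any bytes are loaded, so an oversized workbook
    // never reaches the parser.
    throw new Error(`Workbook is ${size} bytes, above the ${limit}-byte limit`)
  }
  return file.bytes()
}
```

The ordering is the point of the sketch: the size check costs a metadata lookup, while the read it prevents could cost the full file size in memory.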

SUGGESTION

File	Line	Issue
`packages/opencode/src/kilocode/tool/xlsx.ts`	65	`[...rows.entries()].sort(...)` allocates all sorted row tuples before iterating; minor but scales to 50k rows
`packages/opencode/src/tool/read.ts`	308	`readLines` drains the full XLSX generator even after the line-limit is hit (to count lines), wasting work for sheets near ROW_LIMIT
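The second suggestion comes down to stopping iteration once the limit is reached. A minimal sketch, assuming the caller can live with a "truncated" flag instead of an exact total line count (`takeLines` is a hypothetical helper, not the `readLines` function in `read.ts`):

```typescript
function takeLines(
  lines: Iterable<string>,
  limit: number,
): { lines: string[]; truncated: boolean } {
  const taken: string[] = []
  for (const line of lines) {
    if (taken.length >= limit) {
      // Returning here ends the for...of loop, which closes a generator
      // source so it produces no further rows.
      return { lines: taken, truncated: true }
    }
    taken.push(line)
  }
  return { lines: taken, truncated: false }
}
```

The helper pulls one item past the limit to learn whether more output exists. The trade-off is that the total number of lines is no longer known, which matters if the tool reports that count to the caller.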

Other Observations (not in diff)

The isBinaryFile switch still has case ".xlsx" (line 129 of read.ts), which is now dead code for the XLSX path since !xlsx && isBinaryFile(...) short-circuits it. Harmless, but could be confusing.
readSample (up to 4 KB) is still fetched for XLSX files at line 278 even though the sample is only used for MIME sniffing and binary detection — both of which are bypassed for XLSX. Minor unnecessary I/O.

Files Reviewed (5 files)

packages/opencode/src/kilocode/tool/xlsx.ts - 2 issues
packages/opencode/src/tool/read.ts - 1 issue
packages/opencode/test/kilocode/read-xlsx.test.ts - clean
.changeset/read-xlsx-spreadsheets.md - clean
packages/opencode/package.json - clean

Reviewed by claude-4.6-sonnet-20260217 · 1,208,954 tokens

Review guidance: REVIEW.md from base branch main

feat(cli): support XLSX text extraction in read tool

2081af2

marius-kilocode enabled auto-merge May 29, 2026 14:50

kilo-code-bot Bot reviewed May 29, 2026

Comment thread packages/opencode/src/kilocode/tool/xlsx.ts Outdated

Comment thread packages/opencode/src/kilocode/tool/xlsx.ts

Comment thread packages/opencode/src/tool/read.ts Outdated

imanolmzd-svg approved these changes May 29, 2026

marius-kilocode disabled auto-merge May 29, 2026 14:55

fix(cli): bound XLSX read input size

f7830cb

marius-kilocode enabled auto-merge May 29, 2026 15:00

Merge remote-tracking branch 'origin/main' into feature-read-xlsx-ext…

277d09d

…raction # Conflicts: # packages/opencode/src/tool/read.ts

marius-kilocode disabled auto-merge May 29, 2026 15:35

marius-kilocode merged commit 979cb7e into main May 29, 2026
17 checks passed

marius-kilocode deleted the feature-read-xlsx-extraction branch May 29, 2026 16:02

This was referenced May 30, 2026

Cannot Read .ods (OpenDocument Spreadsheet) Files #10760

Closed

feat: add .ods (OpenDocument Spreadsheet) support to read tool #10761

Merged

teknium1 mentioned this pull request Jun 2, 2026

feat(read): extract .ipynb/.docx/.xlsx to text in read_file NousResearch/hermes-agent#37082

Merged

kilo-maintainer Bot mentioned this pull request Jun 3, 2026

release(jetbrains): v7.0.1-rc.5 #10889

Merged

marius-kilocode merged 3 commits into main from feature-read-xlsx-extraction

2 participants
