feat(ai): optimize text accumulation runtime to O(N) by aayush-kapoor · Pull Request #15897 · vercel/ai

aayush-kapoor · 2026-06-08T17:10:15Z

Background

reported in #15670

the old code does this on every delta:

textPart.text += chunk.delta;

which step by step looks like:

start: text = ''

delta 'Hel':
text = '' + 'Hel'
=> 'Hel'

delta 'lo ':
text = 'Hel' + 'lo '
=> copies 'Hel', then appends 'lo '
=> 'Hello '

delta 'wor':
text = 'Hello ' + 'wor'
=> copies 'Hello ', then appends 'wor'
=> 'Hello wor'

delta 'ld':
text = 'Hello wor' + 'ld'
=> copies 'Hello wor', then appends 'ld'
=> 'Hello world'

for tiny strings this is fine. for long streams, the copied prefix keeps growing.

It gets worse when something reads .text between writes because that forces the JS engine to materialize/flatten the string before the next append.

Summary

the new code keeps chunks internally as

chunks.push(chunk.delta);

which step by step looks like:

start:
chunks = []
cachedText = ''

delta 'Hel':
chunks = ['Hel']

delta 'lo ':
chunks = ['Hel', 'lo ']

delta 'wor':
chunks = ['Hel', 'lo ', 'wor']

delta 'ld':
chunks = ['Hel', 'lo ', 'wor', 'ld']

so when someone reads:
textPart.text

the getter does:

chunks.join('')
=> 'Hello world'

Then it caches that:

chunks = ['Hello world']
cachedText = 'Hello world'

If .text is read again before a new delta arrives, it returns the cached string.

Manual Verification

na but have asked some providers to verify the fix by applying patch manaully

Checklist

All commits are signed (PRs with unsigned commits cannot be merged)
Tests have been added / updated (for bug fixes / features)
Documentation has been added / updated (for bug fixes / features)
A patch changeset for relevant packages has been added (for bug fixes / features - run pnpm changeset in the project root)
I have reviewed this pull request (self-review)

Future Work

investigate to see if similar patterns exist in the codebase

Related Issues

fixes #15670

lgrammel · 2026-06-09T08:12:40Z

we would need some benchmarks that show that this is an actual improve (and can be used to check for regressions)

meitalbensinai · 2026-06-10T09:29:02Z

Validation results + report of a sibling site this PR does not cover

Validated this PR (specifically the v6 backport, #15906) against an in-the-wild reproduction of the bug it was filed for. Short version: the PR is correctly written and a real improvement, but the bug still fires on tool-input-heavy workloads because of a third O(N²) site in the same file that this PR does not address. Posting here so you can decide whether to expand scope before merging or land it with a tracking issue.

Stack used

opencode (anomalyco fork) at v1.15.12 source rebuild + this PR's patched ai@6.0.168
For "both fixes" group: also patched the sibling opencode/processor.ts site we filed in anomalyco/opencode#30072 (same chunked-text shape as this PR)
Model: minimax/minimax-m2.7 via OpenRouter (verbose-reasoning + heavy edit tool inputs — the workload class that triggers the bug)
Instance: protonmail/webclients SWE-bench-Pro instance 7e54526774… (heavyweight TS monorepo; historically a reliable bug-firer)
N=5 per group, sequential, same environment

Results

Config	n	resolved	med total	med mean s/step	med max-step	p100 max-step
Both fixes (this PR + opencode#30072)	5	5/5	692s	6.3s	88s	97s
This PR only	5	4/5	1075s	8.6s	83s	103s
Unpatched	5	5/5	801s	7.4s	91s	120s

Max-step distribution per group (sorted desc):

Both fixes: 97s, 89s, 88s, 86s, 48s
This PR only: 103s, 99s, 83s, 61s, 59s
Unpatched: 120s, 94s, 91s, 88s, 63s

Every single run, in every config, has at least one step taking 48–120s. In a healthy run, no individual LLM step should take >15s on this model. The signature is the bug still firing — just attenuated.

The third site

packages/ai/src/ui/process-ui-message-stream.ts:577

case 'tool-input-delta': {
  const partialToolCall = state.partialToolCalls[chunk.toolCallId];
  …
  partialToolCall.text += chunk.inputTextDelta;          // ← same pattern as text-delta / reasoning-delta

  const { value: partialArgs } = await parsePartialJson(
    partialToolCall.text,                                // ← forces flatten on every chunk
  );

Same text += shape this PR fixes for the text and reasoning branches in the same switch. MiniMax M2.7 is especially exposed because its edit tool calls carry multi-line diffs streamed in many small chunks; at step ~50+ in a long agent loop, the partialToolCall.text for an in-flight edit grows large enough that the per-chunk concat + parse hits quadratic time. That matches what we observe — runs clean for ~50 steps, then late-step spikes once tool-input streaming bytes have accumulated.

Why a naïve `prepareTextAccumulator` here is harder

The parsePartialJson(partialToolCall.text) call on every delta needs the cumulative string, so a lazy-join getter alone doesn't break the quadratic — every delta still flattens. Options:

Incremental partial-JSON parser that consumes deltas without rebuilding the full string each time (most correct, real engineering).
Buffer-and-flush: only run parsePartialJson every N deltas (or after a debounce window). Drops some UI smoothness, large perf win.
Chunk + soft-rejoin cap: store as chunks, only flatten when needed for parsePartialJson, but rejoin if _chunks.length exceeds a threshold to bound worst-case.

Happy to file the follow-up PR if you'd like — wanted to flag it before #15906/#15897 merge so the scope decision is informed. Either way, thanks for the clean lifecycle design on the existing fix; the WeakMap + explicit finalize is nicer than what we shipped on our side.

lgrammel · 2026-06-10T14:55:42Z

+  Object.defineProperty(part, '__textAccumulator', {
+    configurable: true,
+    value: accumulator,
+  });
+
+  Object.defineProperty(part, 'text', {
+    configurable: true,
+    enumerable: true,
+    get() {
+      return accumulator.getText();
+    },
+    set(value: string) {
+      accumulator.setText(value);
+    },
+  });


this is weird why are those functions not methods on the class

lgrammel · 2026-06-10T14:56:38Z

          switch (chunk.type) {
            case 'text-start': {
-              const textPart: TextUIPart = {
+              const textPart = prepareTextAccumulator<TextUIPart>({


the map of part to accumulator should be maintained in this function, not in text-accumulator (which accumulates a single text)

lgrammel · 2026-06-10T14:59:15Z

+type TextAccumulatorPart = {
+  text: string;
+  __textAccumulator?: TextAccumulator;
+};
+
+function getTextAccumulator<PART extends { text: string }>(part: PART) {
+  return (part as TextAccumulatorPart).__textAccumulator;
+}


i don't like this

failing tests

0d32fb9

github-actions Bot assigned aayush-kapoor Jun 8, 2026

fix

bf3ab90

vercel Bot deployed to Preview June 8, 2026 17:12 View deployment

aayush-kapoor marked this pull request as draft June 8, 2026 17:15

This was referenced Jun 8, 2026

fix(ui): O(N²)→O(N) text/reasoning accumulation in processUIMessageStream + DefaultStreamTextResult #15669

Closed

backport: feat(ai): optimize text accumulation runtime to O(N) #15906

Draft

lgrammel reviewed Jun 9, 2026

View reviewed changes

Comment thread packages/ai/src/util/text-accumulator.ts Outdated

Merge branch 'main' into aayush/runtime-optimization-chunk

5059292

vercel Bot deployed to Preview June 9, 2026 14:49 View deployment

changed to class

1a9a743

vercel Bot deployed to Preview June 9, 2026 16:10 View deployment

Merge branch 'main' into aayush/runtime-optimization-chunk

13e53d3

vercel Bot deployed to Preview June 9, 2026 18:14 View deployment

lgrammel reviewed Jun 10, 2026

View reviewed changes

lgrammel added 2 commits June 10, 2026 18:41

benchmark

725ec33

b2

978986e

vercel Bot deployed to Preview June 10, 2026 16:55 View deployment

re

c110d74

vercel Bot deployed to Preview June 10, 2026 17:09 View deployment

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ai): optimize text accumulation runtime to O(N)#15897

feat(ai): optimize text accumulation runtime to O(N)#15897
aayush-kapoor wants to merge 8 commits into
mainfrom
aayush/runtime-optimization-chunk

aayush-kapoor commented Jun 8, 2026 •

edited

Loading

Uh oh!

Uh oh!

lgrammel commented Jun 9, 2026

Uh oh!

meitalbensinai commented Jun 10, 2026

Uh oh!

lgrammel Jun 10, 2026

Uh oh!

lgrammel Jun 10, 2026

Uh oh!

lgrammel Jun 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

aayush-kapoor commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Background

Summary

Manual Verification

Checklist

Future Work

Related Issues

Uh oh!

Uh oh!

lgrammel commented Jun 9, 2026

Uh oh!

meitalbensinai commented Jun 10, 2026

Validation results + report of a sibling site this PR does not cover

Stack used

Results

The third site

Why a naïve prepareTextAccumulator here is harder

Uh oh!

lgrammel Jun 10, 2026

Choose a reason for hiding this comment

Uh oh!

lgrammel Jun 10, 2026

Choose a reason for hiding this comment

Uh oh!

lgrammel Jun 10, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

aayush-kapoor commented Jun 8, 2026 •

edited

Loading

Why a naïve `prepareTextAccumulator` here is harder