perf: optimise appendCompact usage #262
Conversation
Signed-off-by: Danny Kopping <dannykopping@gmail.com>
Signed-off-by: Danny Kopping <danny@coder.com>
Force-pushed 304fb9f to 88a4315
@dtmeadows for visibility

Thanks @dannykopping! Taking a look internally now to get a better understanding of this one.

Hey @dtmeadows, have y'all had a chance to look into this?

@dannykopping We profiled our application under load and can independently confirm the problem described in this PR. CPU profiles show appendCompact and the associated stateInString scanner accounting for ~55% of total CPU during JSON serialization. The recursive MarshalJSON → appendCompact chain re-scans each nested type's output at every level, making the cost grow with nesting depth × payload size. Would appreciate this getting reviewed. 🙏
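For illustration, a minimal, self-contained sketch of that pattern using the standard library (the Level type and nested helper here are hypothetical, not taken from the SDK): each MarshalJSON result gets compacted by the encoder, so the innermost bytes are re-scanned once per enclosing level.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Level is a hypothetical nested type, not from the SDK.
type Level struct {
	Child *Level          `json:"child,omitempty"`
	Data  json.RawMessage `json:"data,omitempty"`
}

// MarshalJSON delegates through an alias type to avoid infinite recursion.
// An encoder that compacts every MarshalJSON result re-scans the innermost
// Data bytes once per enclosing level.
func (l Level) MarshalJSON() ([]byte, error) {
	type plain Level
	return json.Marshal(plain(l))
}

// nested builds a value depth levels deep; with per-level compaction the
// total work grows roughly with depth × payload size.
func nested(depth int, payload json.RawMessage) *Level {
	l := &Level{Data: payload}
	for i := 1; i < depth; i++ {
		l = &Level{Child: l}
	}
	return l
}

func main() {
	out, _ := json.Marshal(nested(3, json.RawMessage(`{"k":"v"}`)))
	fmt.Println(string(out)) // {"child":{"child":{"data":{"k":"v"}}}}
}
```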
@dtmeadows for viz, could you please have a look?
Sorry for the delay everyone! I've opened up a PR internally to hopefully get this adopted. I'll update this issue once I have more to share. |
Skip appendCompact calls for MarshalJSON output in the internal JSON encoder. The appendCompact function scans every byte to validate and compact JSON, which is O(n) per nested MarshalJSON call. For deeply nested structures, this becomes a significant bottleneck as the same bytes get re-scanned at each nesting level.
There's no need to do HTML escaping again either: shims.EscapeHTMLByDefault is true, and all MarshalJSON implementations use Marshal() internally, which already applies HTML escaping.
For our use of this lib, this func was the biggest CPU hog when encoding JSON payloads at scale.
This is safe because MarshalJSON implementations (both in this SDK and standard library) already produce valid, compact JSON - running appendCompact on the output is redundant work.
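A minimal sketch of the idea (the encodeMarshaler name and signature are illustrative, not the SDK's actual internals): the encoder trusts MarshalJSON output and appends it verbatim instead of re-scanning it.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// encodeMarshaler is a hypothetical encoder hook for values implementing
// json.Marshaler; names are illustrative only.
func encodeMarshaler(dst []byte, m json.Marshaler) ([]byte, error) {
	out, err := m.MarshalJSON()
	if err != nil {
		return dst, err
	}
	// Before: dst = appendCompact(dst, out) — an extra O(len(out))
	// validate-and-compact pass repeated at every nesting level.
	// After: trust MarshalJSON to return valid, compact, HTML-escaped
	// JSON and append its output verbatim.
	return append(dst, out...), nil
}

type money struct{ Cents int64 }

func (m money) MarshalJSON() ([]byte, error) {
	// Delegating to Marshal means the output is already compact and
	// HTML-escaped, which is what makes skipping appendCompact safe.
	return json.Marshal(map[string]int64{"cents": m.Cents})
}

func main() {
	buf, err := encodeMarshaler(nil, money{Cents: 4200})
	if err != nil {
		panic(err)
	}
	fmt.Println(string(buf)) // {"cents":4200}
}
```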
```
goos: linux
goarch: amd64
pkg: github.com/anthropics/anthropic-sdk-go/internal/encoding/json
cpu: AMD Ryzen 9 7900 12-Core Processor

                                     │  /tmp/old   │             /tmp/new                │
                                     │   sec/op    │    sec/op     vs base               │
MarshalNestedMarshalJSON-24            5.577µ ± 9%    2.660µ ± 13%  -52.31% (p=0.000 n=10)
MarshalSliceOfNestedMarshalJSON-24     301.5µ ± 6%    137.1µ ±  9%  -54.54% (p=0.000 n=10)
geomean                                41.01µ         19.09µ        -53.44%

                                     │  /tmp/old    │             /tmp/new               │
                                     │     B/op     │     B/op      vs base              │
MarshalNestedMarshalJSON-24            914.0 ± 0%      913.0 ± 0%    -0.11% (p=0.000 n=10)
MarshalSliceOfNestedMarshalJSON-24     49.41Ki ± 0%    49.28Ki ± 0%  -0.26% (p=0.000 n=10)
geomean                                6.641Ki         6.629Ki       -0.19%

                                     │  /tmp/old   │            /tmp/new              │
                                     │  allocs/op  │  allocs/op   vs base             │
MarshalNestedMarshalJSON-24            12.00 ± 0%    12.00 ± 0%   ~ (p=1.000 n=10) ¹
MarshalSliceOfNestedMarshalJSON-24     552.0 ± 0%    552.0 ± 0%   ~ (p=1.000 n=10) ¹
geomean                                81.39         81.39        +0.00%

¹ all samples are equal
```

For our project, under scale testing this fix reduced CPU usage by ~36%.
I enhanced test coverage and added benchmarks.
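As a rough sketch of what such a benchmark can look like (hypothetical; the benchmarks actually added in this PR live in the SDK's internal encoder package and may differ), reusing the Level and nested helpers from the earlier sketch:

```go
import (
	"encoding/json"
	"testing"
)

// BenchmarkMarshalNestedMarshalJSON is illustrative only: it exercises the
// nested MarshalJSON path via the standard library, whereas the PR's
// benchmarks target the SDK's internal encoder.
func BenchmarkMarshalNestedMarshalJSON(b *testing.B) {
	v := nested(10, json.RawMessage(`{"k":"v"}`))
	b.ReportAllocs()
	b.ResetTimer()
	for i := 0; i < b.N; i++ {
		if _, err := json.Marshal(v); err != nil {
			b.Fatal(err)
		}
	}
}
```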
Disclaimer: produced alongside Opus 4.5.