fix(logging): redact provider_specific_fields and request-body snapshots when message logging is off by michelligabriele · Pull Request #28611 · BerriAI/litellm

michelligabriele · 2026-05-22T14:59:24Z

Relevant issues

Related to closed issues #16336 (the proxy body snapshot leaking the prompt) and #15822 (the residual provider_specific_fields gap that survived the original flat-field fix). Both describe the bug class addressed here.

Linear ticket

Pre-Submission checklist

I have Added testing in the tests/test_litellm/ directory — 8 new tests on TestPerformRedaction in tests/test_litellm/litellm_core_utils/test_redact_messages.py.
My PR passes all unit tests on make test-unit — focused suite (pytest tests/test_litellm/litellm_core_utils/test_redact_messages.py) passes 27/27; full make test-unit not yet run locally, relying on CI.
My PR's scope is as isolated as possible, it only solves 1 specific problem — redaction-boundary surface gaps in perform_redaction. The independent self-referencing snapshot bug surfaced during the same investigation is going up as a separate PR.
I have requested a Greptile review by commenting @greptileai and received a Confidence Score of at least 4/5.

CI (LiteLLM team)

Screenshots / Proof of Fix

Verified end-to-end against a minimal CustomLogger registered as a success_callback on a local proxy. Three request shapes were inspected before and after the patch: no redaction header, redaction header + non-reasoning chat completion, redaction header + Anthropic extended-thinking. The relevant kwargs paths captured by the logger:

Before (redaction header set, Anthropic extended-thinking request):

```
kwargs.litellm_params.proxy_server_request.body.messages[0].content
-> 'CANARY_INPUT_should_be_redacted'
response_obj.choices[0].message.provider_specific_fields.thinking_blocks[0].thinking
-> ''
kwargs.additional_args.complete_input_dict.messages[0].content[0].text
-> 'CANARY_INPUT_should_be_redacted'
```

After:

```
kwargs.litellm_params.proxy_server_request.body.messages
-> [{"role": "user", "content": "redacted-by-litellm"}]
response_obj.choices[0].message.provider_specific_fields.thinking_blocks
-> null
kwargs.additional_args.complete_input_dict.messages
-> [{"role": "user", "content": "redacted-by-litellm"}]
```

Ground-truth canary-search across the full success-event kwargs returns zero hits for either input or reasoning canaries on the patched build.

Type

🐛 Bug Fix

Changes

`perform_redaction` was scrubbing only the top-level `messages` / `prompt` / `input` keys on `model_call_details` and the `standard_logging_object` payload when message logging was disabled. Three additional surfaces on the same custom-logger kwargs dict still carried the user prompt and reasoning content:

`Message.provider_specific_fields` — Anthropic populates `thinking_blocks` and `reasoning_content` here in addition to the flat fields on `Message`; the flat fields were scrubbed but the duplicates inside `provider_specific_fields` were not (and `reasoning_content` inside psf also embeds the raw `signature` blob). Bedrock converse populates `reasoningContentBlocks` on the same surface.
`litellm_params.proxy_server_request.body` — a separately-materialised copy of the request payload assembled in `litellm_pre_call_utils.add_litellm_data_to_request`. Custom loggers and the spend-logs builder (`spend_tracking_utils.py`) both read from it, so the leak fanned out past the immediate callback.
`additional_args.complete_input_dict` — the provider-native wire-format request body recorded by provider handlers for pre-call logs and the OpenTelemetry exporter (`integrations/opentelemetry.py:2426–2434`). Same root cause as the proxy-body snapshot, same fix shape.

Fix is three small additive helpers in `litellm/litellm_core_utils/redact_messages.py`:

`_redact_provider_specific_fields` — wired into both `_redact_choice_content` (object path, `Choices.message` + `StreamingChoices.delta`) and `_redact_model_response_dict_choices` (dict path, message + delta branches).
`_redact_proxy_server_request_body` — called from `perform_redaction` after `_redact_standard_logging_object`.
`_redact_additional_args_complete_input_dict` — called immediately after.

All three helpers are guarded on dict/type checks and only overwrite keys when present — no effect on the non-redaction path, no signature changes, no refactors elsewhere. Backwards-compatible.

Known follow-ups (out of scope here, intentionally narrowed):

The proxy `body` snapshot also contains pipeline-internal fields (`litellm_call_id`, `litellm_trace_id`, `user_api_key_*`, `api_version`, `ttl`) that were never part of the user POST body. Broadening the exclude set risks breaking downstream consumers that read `body.litellm_call_id` etc. — separate change.
`complete_input_dict.system` / `system_prompt` / `instructions` (provider-native system-prompt keys) — not scrubbed by this PR.
Bedrock `reasoningContentBlocks` is covered by code symmetry; the unit-test fixture exercises the dict path but an end-to-end Bedrock invocation was not run for this PR.

…ots when message logging is off

codecov · 2026-05-22T15:03:30Z

Codecov Report

❌ Patch coverage is 98.03922% with 1 line in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
litellm/litellm_core_utils/redact_messages.py	98.03%	1 Missing ⚠️

📢 Thoughts on this report? Let us know!

greptile-apps · 2026-05-22T15:04:19Z

Greptile Summary

This PR closes three prompt-leakage surfaces that survived the original perform_redaction flat-field scrub: Message.provider_specific_fields (Anthropic reasoning/thinking duplicates + Bedrock reasoningContentBlocks), litellm_params.proxy_server_request.body (the proxy's separately-materialised request copy), and additional_args.complete_input_dict (the provider-native wire-format payload read by OTel and pre-call loggers).

Adds _redact_provider_specific_fields wired into both object and dict choice paths, redacting three hardcoded reasoning keys.
Adds _redact_proxy_server_request_body and _redact_additional_args_complete_input_dict, each clearing messages, prompt, and input on the respective nested body dict.
8 new focused unit tests cover all three helpers across object, dict, streaming-delta, and missing-key paths with no real network calls.

Confidence Score: 4/5

Safe to merge; the fix is purely additive and all new code is guarded by isinstance checks so it cannot affect the non-redaction path.

The implemented fixes are correct and consistent with existing in-place mutation patterns throughout perform_redaction. The two new body-snapshot helpers do not yet cover the system key (Anthropic/Bedrock native system prompt), which means a request with a top-level system prompt will still expose that content through proxy_server_request.body and complete_input_dict even after this patch. The PSF redaction uses a hardcoded key tuple that will silently miss any new reasoning-related fields future providers add to provider_specific_fields.

litellm/litellm_core_utils/redact_messages.py — specifically the two new body-snapshot helpers and the PSF key tuple.

Important Files Changed

Filename	Overview
litellm/litellm_core_utils/redact_messages.py	Adds three targeted helpers to close three prompt-leakage paths previously missed by perform_redaction; system/system_prompt/instructions keys are not yet covered in the two new body-snapshot helpers.
tests/test_litellm/litellm_core_utils/test_redact_messages.py	Adds 8 new unit tests covering all three new helpers across object and dict paths, plus safe-when-absent edge cases; no real network calls, no mock weakening, all assertions are additive.

_{Reviews (1): Last reviewed commit: "fix(logging): redact provider_specific_f..." | Re-trigger Greptile}

greptile-apps · 2026-05-22T15:04:23Z

+    if "messages" in body:
+        body["messages"] = [{"role": "user", "content": "redacted-by-litellm"}]
+    if "prompt" in body:
+        body["prompt"] = ""
+    if "input" in body:
+        body["input"] = ""


system key not redacted in proxy body snapshot

Both _redact_proxy_server_request_body and _redact_additional_args_complete_input_dict only clear messages, prompt, and input, leaving system (Anthropic/Bedrock native top-level system prompt), system_prompt, and instructions untouched. Any custom logger inspecting proxy_server_request.body.system or complete_input_dict.system will still receive the unredacted system prompt when turn_off_message_logging is enabled. The PR description calls this out as a known follow-up, but since these two helpers are the entry point for fixing the proxy-body and wire-request leaks, users who rely on this fix to prevent all prompt leakage will still be exposed through the system-prompt path.

greptile-apps · 2026-05-22T15:04:31Z

+_PROVIDER_SPECIFIC_REASONING_KEYS = (
+    "reasoning_content",
+    "thinking_blocks",
+    "reasoningContentBlocks",
+)
+
+
+def _redact_provider_specific_fields(psf, redacted_str: str = "redacted-by-litellm"):
+    """Scrub reasoning-content duplicates inside Message.provider_specific_fields."""
+    if not isinstance(psf, dict):
+        return
+    for key in _PROVIDER_SPECIFIC_REASONING_KEYS:
+        if key not in psf:
+            continue
+        if key == "reasoning_content":
+            psf[key] = redacted_str
+        else:
+            psf[key] = None


Hardcoded key list won't cover new provider PSF fields

_PROVIDER_SPECIFIC_REASONING_KEYS names exactly three keys. If Anthropic or Bedrock later populate additional provider_specific_fields entries that carry raw reasoning or input content (e.g., a citations blob or future reasoning formats), those keys will pass through unredacted with no code change to catch them. Consider documenting in the module docstring that PSF redaction is intentionally key-allowlist-based so future provider additions know to update this tuple.

mateo-berri

Are the greptile P2's worth addressing?

veria-ai · 2026-05-28T21:53:38Z

PR overview

This pull request updates LiteLLM logging redaction behavior so provider-specific fields and request-body snapshots are suppressed when message logging is disabled. The touched redaction path also affects how proxy request bodies are prepared for spend logging.

Most of the reported redaction gaps have been addressed, with 3 issues already fixed. One open issue remains where the spend-log payload path can still persist sensitive request-body fields because it does not apply the same request-body redaction logic. The remaining exposure is limited to data written into spend logs, but it can still be triggered by an authenticated caller supplying sensitive content in affected request fields.

Open issues (1)

Medium: Request body fields still leak to spend logs — litellm/litellm_core_utils/redact_messages.py:230

Fixed/addressed: 3 · PR risk: 5/10

…y snapshots

…CR keys in body snapshot

…message logging is off

…itellm_fix_redaction_psf_and_body_snapshots # Conflicts: # litellm/litellm_core_utils/redact_messages.py # tests/test_litellm/litellm_core_utils/test_redact_messages.py

veria-ai · 2026-06-09T20:15:25Z

+    if not isinstance(body, dict):
+        return
+
+    _redact_request_body_dict(body)


Medium: Request body fields still leak to spend logs

_get_proxy_server_request_for_spend_logs_payload calls perform_redaction(model_call_details=_request_body, result=None) with the request body itself, not with a nested litellm_params.proxy_server_request.body. That means an authenticated caller can put sensitive input in contents, query, documents, document, or system-prompt fields and have it written to spend logs despite message logging redaction being enabled; reuse _redact_request_body_dict in that spend-log path or make perform_redaction apply the same body-key redaction when it is called with a raw request body.

fix(logging): redact provider_specific_fields and request-body snapsh…

e2d2b60

…ots when message logging is off

greptile-apps Bot reviewed May 22, 2026

View reviewed changes

michelligabriele mentioned this pull request May 22, 2026

fix(proxy): exclude proxy_server_request from its own body snapshot #28618

Merged

7 tasks

mateo-berri requested changes May 26, 2026

View reviewed changes

fix(logging): redact native system-prompt keys in request-body snapshots

4d1c83f

veria-ai Bot reviewed May 28, 2026

View reviewed changes

Comment thread litellm/litellm_core_utils/redact_messages.py Outdated

fix(logging): redact Gemini/Vertex native input fields in request-bod…

11da9d0

…y snapshots

veria-ai Bot reviewed Jun 8, 2026

View reviewed changes

Comment thread litellm/litellm_core_utils/redact_messages.py

fix(logging): wholesale-redact complete_input_dict and scrub rerank/O…

e223a02

…CR keys in body snapshot

veria-ai Bot reviewed Jun 8, 2026

View reviewed changes

Comment thread litellm/litellm_core_utils/redact_messages.py Outdated

michelligabriele added 2 commits June 9, 2026 02:39

fix(logging): wholesale-clear response provider_specific_fields when …

1b766e1

…message logging is off

Merge remote-tracking branch 'origin/litellm_internal_staging' into l…

28d31e3

…itellm_fix_redaction_psf_and_body_snapshots # Conflicts: # litellm/litellm_core_utils/redact_messages.py # tests/test_litellm/litellm_core_utils/test_redact_messages.py

veria-ai Bot reviewed Jun 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(logging): redact provider_specific_fields and request-body snapshots when message logging is off#28611

fix(logging): redact provider_specific_fields and request-body snapshots when message logging is off#28611
michelligabriele wants to merge 6 commits into
litellm_internal_stagingfrom
litellm_fix_redaction_psf_and_body_snapshots

michelligabriele commented May 22, 2026

Uh oh!

codecov Bot commented May 22, 2026 •

edited

Loading

Uh oh!

greptile-apps Bot commented May 22, 2026

Important Files Changed

Uh oh!

greptile-apps Bot May 22, 2026

Uh oh!

greptile-apps Bot May 22, 2026

Uh oh!

mateo-berri left a comment

Uh oh!

Uh oh!

veria-ai Bot commented May 28, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

veria-ai Bot Jun 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

michelligabriele commented May 22, 2026

Relevant issues

Linear ticket

Pre-Submission checklist

CI (LiteLLM team)

Screenshots / Proof of Fix

Type

Changes

Uh oh!

codecov Bot commented May 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

greptile-apps Bot commented May 22, 2026

Greptile Summary

Confidence Score: 4/5

Important Files Changed

Uh oh!

greptile-apps Bot May 22, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps Bot May 22, 2026

Choose a reason for hiding this comment

Uh oh!

mateo-berri left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

veria-ai Bot commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR overview

Open issues (1)

Uh oh!

Uh oh!

Uh oh!

veria-ai Bot Jun 9, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov Bot commented May 22, 2026 •

edited

Loading

veria-ai Bot commented May 28, 2026 •

edited

Loading