fix(advisor): attribute advisor sub-call spend to the originating key/user by bse-ai · Pull Request #30481 · BerriAI/litellm

bse-ai · 2026-06-15T20:02:43Z

What

The advisor orchestration sub-call (AdvisorOrchestrationHandler.handle) does not forward the parent request's proxy auth/attribution context to the advisor leg, so advisor spend is never attributed to the originating key/user in SpendLogs.

Why

The executor leg dispatches with **kwargs, so litellm_metadata / user_api_key_dict / proxy_server_request reach the @client wrapper and the call is logged and cost-attributed. The advisor leg calls _call_messages_handler(...) without spreading kwargs, so none of that context is present. The proxy cost-tracking callback's gate (_should_track_cost_callback) requires a non-None user_api_key/user_id/team_id/end_user_id, so for the advisor leg it returns False and the SpendLogs write is skipped entirely. The advisor sub-call still runs on resolved provider credentials, so its spend is real and visible in raw provider invocation logs, but it is invisible in per-user litellm logs.
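To make the gating behaviour concrete, here is a minimal sketch. It is not litellm's actual code: `should_track_cost` and `identity_from_kwargs` are hypothetical stand-ins, and it assumes the gate passes when at least one of the four identity fields is set and that the identity is read from `litellm_metadata`.

```python
from typing import Any, Dict, Optional

# Identity fields named in the PR description as the gate's requirement.
IDENTITY_FIELDS = ("user_api_key", "user_id", "team_id", "end_user_id")


def should_track_cost(identity: Dict[str, Optional[str]]) -> bool:
    """Hypothetical stand-in for the gate: True if any identity field is set."""
    return any(identity.get(field) is not None for field in IDENTITY_FIELDS)


def identity_from_kwargs(kwargs: Dict[str, Any]) -> Dict[str, Optional[str]]:
    """Hypothetical extraction: read identity fields from litellm_metadata."""
    metadata = kwargs.get("litellm_metadata") or {}
    return {field: metadata.get(field) for field in IDENTITY_FIELDS}


parent_kwargs = {
    "litellm_metadata": {"user_api_key": "hashed-key", "team_id": "team-a"},
}

# Executor leg: kwargs are spread, so the identity survives the call.
executor_tracked = should_track_cost(identity_from_kwargs(parent_kwargs))

# Advisor leg before the fix: nothing is forwarded, so every field is None.
advisor_tracked = should_track_cost(identity_from_kwargs({}))
```

Under these assumptions the executor leg passes the gate and the advisor leg does not, which matches the missing SpendLogs rows described above.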

Impact

For any deployment using the advisor tool on a non-native provider, every advisor sub-call's spend is missing from SpendLogs and cannot be attributed to a key/user/team. Per-user cost reporting therefore silently undercounts by the full cost of the advisor (typically a larger, more expensive reviewer model) on every invocation. The spend is real (it bills against provider credentials); it is simply not recorded by the gateway.

Fix

Forward the parent proxy context to the advisor leg, excluding:

litellm_logging_obj — so the advisor leg mints its own logging object and its spend is not double-counted against the parent request's litellm_call_id;
api_key / api_base — passed explicitly as the advisor's own credentials.

Additive and behaviour-preserving when there is no proxy context (SDK / native use): the passthrough dict is simply empty.
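The filter described in this section can be sketched roughly as follows. `build_advisor_passthrough` is a hypothetical helper name used for illustration; the PR builds the dict inline inside `AdvisorOrchestrationHandler.handle`, and the sample values are made up.

```python
from typing import Any, Dict

# Keys the PR description says are withheld from the advisor leg.
EXCLUDED_KEYS = {"litellm_logging_obj", "api_key", "api_base"}


def build_advisor_passthrough(kwargs: Dict[str, Any]) -> Dict[str, Any]:
    """Copy the parent kwargs, dropping the excluded keys."""
    return {k: v for k, v in kwargs.items() if k not in EXCLUDED_KEYS}


# Proxy use: attribution context is present alongside the excluded keys.
parent_kwargs = {
    "litellm_metadata": {"user_api_key": "hashed-key"},
    "user_api_key_dict": {"user_id": "user-1"},
    "proxy_server_request": {"url": "/v1/messages"},
    "litellm_logging_obj": object(),
    "api_key": "sk-parent",
    "api_base": "https://parent.example.com",
}
passthrough = build_advisor_passthrough(parent_kwargs)

# SDK / native use: no proxy context, so the passthrough is simply empty.
sdk_passthrough = build_advisor_passthrough({})
```

With no proxy context the helper returns an empty dict, which is why the change is described as behaviour-preserving for SDK callers.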

Test plan

test_advisor_subcall_forwards_proxy_attribution — asserts the advisor leg receives litellm_metadata / user_api_key_dict / proxy_server_request and does not receive litellm_logging_obj
tests/test_litellm/llms/anthropic/experimental_pass_through/messages/test_advisor_integration.py
tests/test_litellm/llms/anthropic/messages/test_advisor_orchestration.py

…/user

The advisor orchestration sub-call did not forward the parent request's proxy auth/attribution context (litellm_metadata / user_api_key_dict / proxy_server_request) that the executor leg already spreads via **kwargs. With no key/user/team in scope the proxy cost-tracking callback skips the SpendLogs write entirely, so advisor spend is attributed to nobody: it runs on resolved provider credentials and is visible only in raw provider invocation logs, never in per-user litellm logs.

Forward the proxy context to the advisor leg, excluding litellm_logging_obj so the advisor sub-call mints its own logging object and its spend is not double-counted against the parent request's call id (api_key/api_base are also excluded as they are passed explicitly).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

greptile-apps · 2026-06-15T20:05:31Z

Greptile Summary

This PR fixes advisor sub-call spend attribution by forwarding the parent request's proxy auth context (litellm_metadata, user_api_key_dict, proxy_server_request, etc.) to the advisor leg of AdvisorOrchestrationHandler.handle, matching the behavior already present in the executor leg via **kwargs.

Builds advisor_passthrough by filtering kwargs to exclude litellm_logging_obj (so the advisor gets its own logging/call ID), api_key, and api_base (passed explicitly from the tool definition); keys already popped before the loop (litellm_call_id, metadata) cannot conflict.
Adds a focused mock-based test that confirms the three attribution fields are forwarded and litellm_logging_obj is not; the test does not cover the api_key/api_base exclusion path.

Confidence Score: 4/5

The production change is a small, well-scoped dict filter that adds context to one previously context-free call; it is additive and backward-compatible when no proxy context is present.

The implementation correctly excludes the three keys that would conflict with explicit arguments or cause double-counting, and keys that could cause duplicate parameter errors were already removed before the loop. The only gap is that the new test does not verify api_key and api_base are absent from the forwarded dict.
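The "duplicate parameter" concern is a general Python behaviour and can be reproduced in isolation. In this sketch `call_advisor` is a hypothetical stand-in for a handler that takes `api_key` explicitly; it is not the signature of `_call_messages_handler`.

```python
from typing import Any, Optional


def call_advisor(*, api_key: str, **kwargs: Any) -> str:
    """Hypothetical handler that receives api_key as an explicit keyword."""
    return api_key


error: Optional[str] = None
try:
    # Passing api_key explicitly AND inside the spread dict is rejected by
    # Python before the function body runs.
    call_advisor(api_key="advisor-key", **{"api_key": "parent-key"})
except TypeError as exc:
    error = str(exc)

# Filtering the key out of the spread dict first avoids the collision.
forwarded = {k: v for k, v in {"api_key": "parent-key"}.items() if k != "api_key"}
result = call_advisor(api_key="advisor-key", **forwarded)
```

This is why any key that is also passed explicitly has to be popped or filtered before the remaining kwargs are spread into the call.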

The test file would benefit from adding assertions that api_key and api_base are excluded from the advisor passthrough.

Important Files Changed

Filename	Overview
litellm/llms/anthropic/experimental_pass_through/messages/interceptors/advisor.py	Forwards proxy auth/attribution context to the advisor sub-call by building `advisor_passthrough` from `kwargs` minus `litellm_logging_obj`, `api_key`, and `api_base`. Keys that could conflict with explicit params (`metadata`, `litellm_call_id`) are already popped before the loop, so the passthrough is clean.
tests/test_litellm/llms/anthropic/experimental_pass_through/messages/test_advisor_integration.py	Adds `test_advisor_subcall_forwards_proxy_attribution` which verifies the three attribution fields are forwarded and `litellm_logging_obj` is excluded. Does not assert that `api_key`/`api_base` are excluded from the passthrough.

Reviews (1): Last reviewed commit: "fix(advisor): attribute advisor sub-call..."

greptile-apps · 2026-06-15T20:05:35Z

+    with patch(
+        "litellm.llms.anthropic.experimental_pass_through.messages.interceptors.advisor._call_messages_handler",
+        side_effect=mock_handler,
+    ):
+        await AdvisorOrchestrationHandler().handle(
+            model="openai/gpt-4o-mini",
+            messages=MESSAGES,
+            tools=[ADVISOR_TOOL],
+            stream=False,
+            max_tokens=512,
+            custom_llm_provider="openai",
+            litellm_metadata=sentinel_meta,
+            user_api_key_dict=sentinel_key,
+            proxy_server_request=sentinel_psr,
+            litellm_logging_obj=object(),
+        )
+
+    advisor_legs = [c for c in captured if c["tools"] is None]
+    assert advisor_legs, "advisor sub-call (tools=None) must have fired"
+    adv = advisor_legs[0]["kwargs"]
+    assert adv.get("litellm_metadata") == sentinel_meta
+    assert adv.get("user_api_key_dict") is sentinel_key
+    assert adv.get("proxy_server_request") == sentinel_psr
+    # Own logging object → not stamped onto the parent request.
+    assert "litellm_logging_obj" not in adv


Test doesn't cover api_key/api_base exclusion

The filter in advisor_passthrough explicitly excludes api_key and api_base to prevent a caller-supplied key from silently overriding the advisor's own credentials (advisor_api_key/advisor_api_base). If that filter were accidentally removed, the advisor would call the provider with the wrong key, but this test wouldn't catch it. Adding api_key="sk-parent" and api_base="https://parent.example.com" to the handle() invocation and asserting they don't appear in adv would close this gap.
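One way to express the suggested extra assertions is sketched below. The helper name is hypothetical, and it assumes `adv` is the captured advisor-leg kwargs dict from the test above. It compares values rather than key presence, because the advisor's own explicit `api_key` / `api_base` may legitimately show up in the captured kwargs depending on how the mock records them.

```python
from typing import Any, Dict

# Parent-side values the reviewer proposes passing to handle().
PARENT_API_KEY = "sk-parent"
PARENT_API_BASE = "https://parent.example.com"


def assert_parent_credentials_not_forwarded(adv: Dict[str, Any]) -> None:
    """Fail if the advisor leg received the parent's api_key or api_base."""
    assert adv.get("api_key") != PARENT_API_KEY, "parent api_key leaked"
    assert adv.get("api_base") != PARENT_API_BASE, "parent api_base leaked"


# An advisor leg that carries only its own credentials passes the check.
assert_parent_credentials_not_forwarded(
    {"litellm_metadata": {"team_id": "team-a"}, "api_key": "sk-advisor"}
)
```

In the real test this would be called on `adv` after adding `api_key=PARENT_API_KEY` and `api_base=PARENT_API_BASE` to the `handle()` invocation.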


codecov · 2026-06-15T20:08:12Z

Codecov Report

✅ All modified and coverable lines are covered by tests.


veria-ai · 2026-06-15T20:20:03Z

+            # is excluded so the advisor leg gets its own logging object and its
+            # spend is not double-counted against the parent request's call id;
+            # api_key/api_base are excluded because they are passed explicitly.
+            advisor_passthrough = {


Medium: Credential forwarding into advisor sub-call

advisor_tool["model"] comes from the request, but this forwards every parent kwarg except api_key/api_base. For routed deployments that use other credential fields like aws_access_key_id, aws_secret_access_key, vertex_credentials, or litellm_credential_name, a caller can invoke an arbitrary advisor model with the parent deployment's cloud credentials instead of only inheriting attribution context.

Suggested change

-            advisor_passthrough = {
+            advisor_attribution_keys = {
+                "litellm_metadata",
+                "user_api_key_dict",
+                "proxy_server_request",
+            }
+            advisor_passthrough = {
+                k: v for k, v in kwargs.items() if k in advisor_attribution_keys
+            }
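The difference between the original deny-list and the suggested allow-list can be illustrated with a small comparison. The credential values and the shape of `parent_kwargs` are made up for the example; only the key names come from the review comment.

```python
# Deny-list from the original PR vs. allow-list from the suggested change.
DENY = {"litellm_logging_obj", "api_key", "api_base"}
ALLOW = {"litellm_metadata", "user_api_key_dict", "proxy_server_request"}

# Hypothetical parent kwargs for a routed deployment with cloud credentials.
parent_kwargs = {
    "litellm_metadata": {"user_api_key": "hashed-key"},
    "user_api_key_dict": {"user_id": "user-1"},
    "proxy_server_request": {"url": "/v1/messages"},
    "api_key": "sk-parent",
    "aws_access_key_id": "EXAMPLE-ACCESS-KEY",
    "aws_secret_access_key": "EXAMPLE-SECRET",
    "vertex_credentials": "{...}",
}

deny_list_result = {k: v for k, v in parent_kwargs.items() if k not in DENY}
allow_list_result = {k: v for k, v in parent_kwargs.items() if k in ALLOW}

# Keys forwarded by the deny-list that are not attribution context.
leaked = sorted(set(deny_list_result) - ALLOW)
```

The deny-list forwards every credential field it does not explicitly name, while the allow-list forwards only the three attribution fields regardless of what else the parent request carries.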

veria-ai · 2026-06-15T20:20:25Z

PR overview

This PR updates the Anthropic experimental pass-through advisor interceptor so advisor sub-call spend can be attributed back to the originating key or user. The touched code handles how request context is passed into advisor tool model calls.

There is one open security concern around the advisor sub-call inheriting more parent request parameters than intended. Because the advisor model can come from the request, routed deployments that use nonstandard credential fields could allow a caller to run an advisor call using the parent deployment’s cloud credentials rather than only receiving attribution context. No issues have been fixed yet, so the PR still needs tightening around which fields are forwarded into advisor calls.

Open issues (1)

Medium: Credential forwarding into advisor sub-call — litellm/llms/anthropic/experimental_pass_through/messages/interceptors/advisor.py:162

Fixed/addressed: 0 · PR risk: 6/10

Sameerlite · 2026-06-16T03:44:48Z

Thanks for the contribution! A couple of things to address before this is ready for merge:

Greptile's code review left 1 unresolved comment(s) that could use your attention — could you take a look and address them?
Could you add some proof of the change working (screenshots, test output, or a sample request/response)? It really helps speed up the review.

Once those are in, we'll take another look!

bse-ai requested a review from a team June 15, 2026 20:02

bse-ai wants to merge 1 commit into BerriAI:litellm_oss_staging from arcadia:pr/advisor-subcall-attribution
