fix(codex): don't crash on startup when quota is exhausted (429) #744

Merged
nicoloboschi merged 1 commit into vectorize-io:main from Rutimka:fix/codex-verify-quota-crash
Mar 30, 2026
Conversation

@Rutimka
Contributor

@Rutimka Rutimka commented Mar 28, 2026

Problem

When the Codex usage limit is reached, verify_connection() receives a 429 response and raises a RuntimeError, which causes the entire server to refuse to start:

RuntimeError: Codex LLM connection verification failed for gpt-5.4: ...429 Too Many Requests
ERROR: Application startup failed. Exiting.

Hitting a quota limit is not a configuration error: the credentials are valid and the model is correct, but no budget remains until the quota resets. The server should start normally and serve retain/recall requests; only LLM calls are unavailable in the meantime.

Fix

Catch 429 / usage_limit_reached responses in verify_connection() and log a warning instead of raising, allowing startup to proceed.
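
The approach described above can be sketched roughly as follows. This is a minimal illustration, not the code merged in this PR: the `HTTPStatusError` class, the `is_quota_exhausted` helper, the `probe` callable, and the boolean return value are all hypothetical stand-ins, since the actual HTTP client and the real signature of verify_connection() are not shown here.

```python
import logging

logger = logging.getLogger("codex_llm_sketch")


class HTTPStatusError(Exception):
    """Hypothetical stand-in for the HTTP client's error type."""

    def __init__(self, status_code: int, body: str = "") -> None:
        super().__init__(f"{status_code} {body}".strip())
        self.status_code = status_code
        self.body = body


def is_quota_exhausted(exc: HTTPStatusError) -> bool:
    """Treat a 429 status or a usage_limit_reached marker as quota exhaustion."""
    return exc.status_code == 429 or "usage_limit_reached" in exc.body


def verify_connection(probe, model: str) -> bool:
    """Run a test call against the LLM (hypothetical signature).

    Returns True when the probe succeeds and False when verification is
    skipped because the quota is exhausted. Any other HTTP failure is
    still raised as a RuntimeError so that startup aborts.
    """
    try:
        probe()
    except HTTPStatusError as exc:
        if is_quota_exhausted(exc):
            # Credentials and model are presumably fine; only the budget is gone.
            logger.warning(
                "Codex quota exhausted for %s; skipping connection verification: %s",
                model,
                exc,
            )
            return False
        raise RuntimeError(
            f"Codex LLM connection verification failed for {model}: {exc}"
        ) from exc
    return True
```

In this sketch, only the quota case is downgraded to a warning. Other failures, such as a 401 from invalid credentials, still abort startup, which preserves the original purpose of the check.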

Tested

Verified locally: server starts and serves requests with quota exhausted. Retain/recall works; LLM-dependent operations (fact extraction, consolidation) fail gracefully per their own retry/error handling.

🤖 Generated with Claude Code

A 429 usage_limit_reached response during verify_connection() caused the
server to refuse to start entirely. Quota exhaustion is not a configuration
error — the server should start and serve retain/recall requests normally,
it just can't make LLM calls until the quota resets.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@nicoloboschi nicoloboschi merged commit 111e8c7 into vectorize-io:main Mar 30, 2026
25 of 44 checks passed