antigravity-claude-proxy

Author	SHA1	Message	Date
Badri Narayanan S	a696ed0872	Merge pull request #64 from BrunoMarc/fix/empty-response-retry fix: add retry mechanism for empty API responses	2026-01-08 17:38:18 +05:30
Badri Narayanan S	f34aa50ba4	Antigravity compatibility to fix antigravity usage	2026-01-08 10:24:54 +05:30
BrunoMarc	1c80c8ba52	fix: address second round code review feedback Issues found by Claude Opus 4.5 + Gemini 3 Pro: HIGH PRIORITY FIXES: - Mark account rate-limited when 429 occurs during retry (was losing resetMs) - Add exponential backoff between retries (500ms, 1000ms, 2000ms) - Fix 5xx handling: don't pass error response to streamer, refetch instead - Use recognizable error messages (429/401) for isRateLimitError/isAuthError MEDIUM PRIORITY FIXES: - Refactor while loop to for loop for clearer retry semantics - Simplify logic flow with early returns Code review by: Claude Opus 4.5 + Gemini 3 Pro Preview 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 18:21:25 -03:00
BrunoMarc	05cd80ebb5	fix: address code review feedback - Move MAX_EMPTY_RESPONSE_RETRIES to constants.js for consistency - Handle 429/401/5xx errors properly during retry fetch - Use proper message ID format (crypto.randomBytes) instead of Date.now() - Add crypto import for UUID generation Code review by: Gemini 3 Pro Preview + Claude Opus 4.5 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 18:16:09 -03:00
BrunoMarc	49480847b6	fix: add retry mechanism for empty API responses When Claude Code sends requests with large thinking_budget values, the model may spend all tokens on "thinking" and return empty responses, causing Claude Code to stop mid-conversation. This commit adds a retry mechanism that: - Throws EmptyResponseError instead of emitting fake message on empty response - Retries up to 2 times before giving up - Emits fallback message only after all retries are exhausted Changes: - src/errors.js: Added EmptyResponseError class and isEmptyResponseError() - src/cloudcode/sse-streamer.js: Throw error instead of yielding fake message - src/cloudcode/streaming-handler.js: Added retry loop with fallback Tested for 6+ hours with 1,884 API requests and 88% recovery rate on empty responses. Fixes #61 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-07 18:11:03 -03:00
Badri Narayanan S	ac9ec6b358	Signature handling for fallback	2026-01-03 22:01:57 +05:30
Badri Narayanan S	df6625b531	fallback changes from PR #35	2026-01-03 18:01:21 +05:30
Badri Narayanan S	9c4a712a9a	Selective fixes from PR #35 : Model-specific rate limits & robustness improvements (#37 ) * feat: apply local user changes and fixes * ;D * Implement OpenAI support, model-specific rate limiting, and robustness fixes * docs: update pr title * feat: ensure unique openai models endpoint * fix: startup banner alignment and removed duplicates * feat: add model fallback system with --fallback flag * fix: accounts cli hanging after completion * feat: add exit option to accounts cli menu * fix: remove circular dependency warning for fallback flag * feat: show active modes in banner and hide their flags * Remove OpenAI compatibility and fallback features from PR #35 Cherry-picked selective fixes from PR #35 while removing: - OpenAI-compatible API endpoints (/openai/v1/) - Model fallback system (fallback-config.js) - Thinking block skip for Gemini models - Unnecessary files (pullrequest.md, test-fix.js, test-openai.js) Retained improvements: - Network error handling with retry logic - Model-specific rate limiting - Enhanced health check with quota info - CLI fixes (exit option, process.exit) - Startup banner alignment (debug mode only) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> banner alignment fix * Refactor: Model-specific rate limits and cleanup deprecated code - Remove global rate limit fields (isRateLimited, rateLimitResetTime) in favor of model-specific limits (modelRateLimits[modelId]) - Remove deprecated wrapper functions (is429Error, isAuthInvalidError) from handlers - Filter fetchAvailableModels to only return Claude and Gemini models - Fix getCurrentStickyAccount() to pass model param after waiting - Update /account-limits endpoint to show model-specific limits - Remove multi-account OAuth flow to avoid state mismatch errors 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: show (x/y) limited status in account-limits table - Status is now "ok" only when all models are available - Shows "(x/y) limited" when x out of y models are exhausted - Provides better visibility into partial rate limiting 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * docs: update CLAUDE.md with model-specific rate limiting - Document modelRateLimits[modelId] for per-model rate tracking - Add isNetworkError() helper to utilities section 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> --------- Co-authored-by: M1noa <minoa@minoa.cat> Co-authored-by: Minoa <altgithub@minoa.cat> Co-authored-by: Claude <noreply@anthropic.com>	2026-01-03 15:33:49 +05:30
Badri Narayanan S	f02364d4ef	refactor: Reorganize src/ into modular folder structure Split large monolithic files into focused modules: - cloudcode-client.js (1,107 lines) → src/cloudcode/ (9 files) - account-manager.js (639 lines) → src/account-manager/ (5 files) - Move auth files to src/auth/ (oauth, token-extractor, database) - Move CLI to src/cli/accounts.js Update all import paths and documentation. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-01 15:13:43 +05:30

9 Commits