antigravity-claude-proxy

Author	SHA1	Message	Date
Pedro Farias	6a6c4829ca	feat: server reliability and observability improvements	2026-01-17 13:21:15 -03:00
Badri Narayanan S	ed68f4b21e	fix: enable strict tool parameter validation for Claude models Set functionCallingConfig.mode = 'VALIDATED' when using Claude models to ensure strict parameter validation, matching opencode-antigravity-auth. Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-17 14:47:48 +05:30
Marvin	480b4a0bc1	fix: preserve whitespace-only chunks in SSE stream (#139 ) * fix: preserve whitespace-only chunks in SSE stream Fixes issue #138 where Claude models would swallow spaces between words because whitespace-only chunks (e.g., " ") were being filtered out as empty. Changes: - Modified sse-streamer.js to only skip truly empty strings (""), preserving strings that contain only whitespace. - Added regression test case in tests/test-streaming-whitespace.cjs to verify whitespace preservation. * test: add streaming whitespace regression test to main suite --------- Co-authored-by: walczak <walczak@ial.ruhr>	2026-01-17 12:16:28 +05:30
Badri Narayanan S	2516e90d06	Revert "fix: check allowedTiers when paidTier is free for tier detection" This reverts commit `11a1879fc7`.	2026-01-15 21:26:08 +05:30
Badri Narayanan S	11a1879fc7	fix: check allowedTiers when paidTier is free for tier detection Some accounts have paidTier.id = "free-tier" but allowedTiers includes "standard-tier", meaning they can use Pro-level quotas even though their Google One subscription doesn't grant Antigravity benefits. New priority: 1. paidTier - if Pro/Ultra, use immediately 2. allowedTiers - if paidTier is free, check for higher tiers 3. currentTier - fallback if still unknown 4. allowedTiers default - final fallback This fixes accounts that show as "free" but actually have Pro access through programs other than Google One AI. Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-15 21:23:19 +05:30
Badri Narayanan S	9ffb83ab74	fix: improve onboarding flow for non-free tier accounts - Use raw API tier IDs (e.g., 'free-tier', 'g1-pro-tier') consistently - Pass DEFAULT_PROJECT_ID for non-free tiers during onboarding - Fix free tier detection to use .includes('free') instead of exact match - Upgrade onboarding error logs from debug to warn for visibility - Add debug logging for onboarding request parameters Fixes onboarding failures for Pro/Ultra accounts that were previously failing because the API requires a project ID for paid tier onboarding. Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-15 20:25:10 +05:30
Badri Narayanan S	9809337190	fix: improve subscription tier detection with paidTier priority and verbose logging - Use parseTierId() consistently for all tier sources (paidTier, currentTier, allowedTiers) - Add tierSource tracking to identify which field tier was detected from - Add verbose debug logging to show all tier-related fields from loadCodeAssist API - Fix onboarding to prioritize paidTier > currentTier > allowedTiers (consistent with model-api.js) This helps diagnose issues where Pro/Ultra accounts are incorrectly detected as free. Fixes #128 Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-15 20:08:30 +05:30
Badri Narayanan S	2a0c110f9b	fix: try model fallback before throwing RESOURCE_EXHAUSTED error When all accounts are rate-limited for > 2 minutes, now attempts model fallback (if enabled) before throwing the error. This allows quota-exhausted accounts to gracefully fall back to alternate models. Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-15 16:41:02 +05:30
Badri Narayanan S	77363c679e	fix: accurate quota reporting with project ID and improved rate limit handling - Pass project ID to fetchAvailableModels for accurate per-project quota - Treat missing remainingFraction with resetTime as 0% (exhausted) - Fix double-escaped regex in rate-limit-parser.js (\\d -> \d) - Use ANTIGRAVITY_HEADERS for loadCodeAssist consistency - Store actual reset time from API instead of capping at default - Add getRateLimitInfo() for detailed rate limit state - Handle disabled accounts in rate limit checks Fixes issue where free tier accounts showed 100% quota but were actually exhausted. Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-15 16:18:13 +05:30
Badri Narayanan S	a20cba90ee	Merge pull request #125 from IrvanFza/feat/webui-notif docs: update startup message to mention WebUI	2026-01-15 12:46:42 +05:30
Irvan Fauziansyah	915ee88e80	docs: update startup message to mention WebUI - Changed "Server running at" to "Server and WebUI running at" - Clarifies that the web interface is available at the same port	2026-01-15 14:10:27 +07:00
Badri Narayanan S	44632dc301	feat: add automatic user onboarding for accounts without projects When loadCodeAssist returns no project, automatically call onboardUser API to provision a managed project. This handles first-time setup for new accounts. Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-15 12:27:37 +05:30
Badri Narayanan S	896bf81a36	revert: remove count_tokens endpoint (caused regression) Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-14 23:43:16 +05:30
Badri Narayanan S	fa29de7183	fix: handle unsigned thinking blocks in tool loops (#120 ) When Claude Code strips thinking signatures it doesn't recognize, the proxy would drop unsigned thinking blocks, causing the error "Expected thinking but found text". This fix detects unsigned thinking blocks and triggers recovery to close the tool loop. Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-14 23:01:13 +05:30
Badri Narayanan S	dee7512bd8	fix: improve subscription tier detection for Pro accounts The tier detection was incorrectly showing Pro accounts as "free" due to: 1. paidTier field being flaky/missing from some API responses 2. standard-tier not being recognized as a Pro tier 3. No fallback to allowedTiers when currentTier is missing Changes: - Add parseTierId() helper to centralize tier ID parsing - Recognize "standard-tier" as Pro (Gemini Code Assist paid tier) - Add fallback chain: paidTier > currentTier > allowedTiers - Return "unknown" instead of incorrectly defaulting to "free" Fixes #121 Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-14 21:27:24 +05:30
Badri Narayanan S	772dabe7e4	fix: defer inlineData parts to end of array for parallel tool calls (#91 ) When multiple tool_results contain images, the inlineData parts were being interleaved between functionResponse parts, breaking Claude's API requirement that functionResponse parts be consecutive. Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-14 21:04:36 +05:30
behemoth-phucnm	d33de409d4	docs: fix misleading tokenizer comments	2026-01-14 19:31:43 +07:00
minhphuc429	7da7e887bf	feat: use official tokenizers for 99.99% accuracy Replace gpt-tokenizer with model-specific official tokenizers: - Claude models: @anthropic-ai/tokenizer (official Anthropic tokenizer) - Gemini models: @lenml/tokenizer-gemini (GemmaTokenizer) Changes: - Add @anthropic-ai/tokenizer and @lenml/tokenizer-gemini dependencies - Remove gpt-tokenizer dependency - Update count-tokens.js with model-aware tokenization - Use getModelFamily() to select appropriate tokenizer - Lazy-load Gemini tokenizer (138MB) on first use - Default to local estimation for all content types (no API calls) Tested with all supported models: - claude-sonnet-4-5, claude-opus-4-5-thinking, claude-sonnet-4-5-thinking - gemini-3-flash, gemini-3-pro-low, gemini-3-pro-high	2026-01-14 16:04:13 +07:00
minhphuc429	2bdecf6e96	fix: ensure account manager initialized for count_tokens - Add ensureInitialized() call before count_tokens handler - Use hybrid approach: local estimation for text, API for images/docs - This prevents "No accounts available" error on first request	2026-01-14 15:43:25 +07:00
minhphuc429	df81ba5632	feat: use API-based token counting for 100% accuracy Switch from local estimation (gpt-tokenizer) to API-based counting via Google Cloud Code API for accurate token counts. Falls back to local estimation if API call fails.	2026-01-14 15:36:47 +07:00
minhphuc429	acc228b920	feat: implement /v1/messages/count_tokens endpoint Add Anthropic-compatible token counting endpoint using hybrid approach: - Local estimation with gpt-tokenizer for text content (~95% accuracy) - API-based counting for complex content (images, documents) - Automatic fallback to local estimation on API errors This resolves warnings in LiteLLM and other clients that rely on pre-request token counting.	2026-01-14 15:32:27 +07:00
Badri Narayanan S	12d196f6a0	refactor: centralize TEST_MODELS and DEFAULT_PRESETS in constants.js - Move TEST_MODELS and DEFAULT_PRESETS to src/constants.js as single source of truth - Update test-models.cjs helper to use dynamic import from constants - Make getTestModels() and getModels() async functions - Update all test files to await async model config loading - Remove duplicate THINKING_MODELS and getThinkingModels() from test helper - Make thinking tests more lenient for Gemini (doesn't always produce thinking blocks) Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-13 19:20:57 +05:30
Badri Narayanan S	632536e2d7	fix: use configured cooldown as cap for rate limit wait times - Cooldown now caps API-provided reset times instead of being a fallback - Fixed misleading UI descriptions for cooldown settings - Removed unused cooldownDurationMs from settings object - Updated default fallback values in frontend to 10s Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-13 18:28:52 +05:30
Badri Narayanan S	49e536e9a9	fix: reduce default cooldown from 60s to 10s Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-13 18:16:30 +05:30
Badri Narayanan S	70fd1baaa8	fix: improve loadCodeAssist for Google One AI Pro accounts - Add separate LOAD_CODE_ASSIST_ENDPOINTS (prod first) and LOAD_CODE_ASSIST_HEADERS (google-api-nodejs-client User-Agent) - Add duetProject to metadata for project discovery - Silent fallback when API returns success but no project (matches opencode-antigravity-auth behavior) - Only warn when all endpoints fail with actual errors Fixes #114, addresses discussion #113 Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-13 18:11:45 +05:30
Badri Narayanan S	99a06632ea	Merge pull request #102 from liuwuyu118/fix/image-response-and-ui-performance fix: add image response support	2026-01-13 17:17:39 +05:30
董飞祥	6172f5ef10	feat: add API key authentication for /v1/* endpoints	2026-01-13 16:46:31 +08:00
liuwuyu118	860c0d6c2d	fix: add image response support Convert Google's inlineData format to Anthropic's image format: - response-converter.js: Handle inlineData in non-streaming responses - sse-parser.js: Parse inlineData for thinking models - sse-streamer.js: Stream inlineData as image content blocks Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-13 11:17:17 +08:00
simon-ami	e24dff279c	feat(webui): add configuration presets for Claude CLI - Add backend storage logic in `src/utils/claude-config.js` to save/load/delete presets - Add API endpoints (`GET`, `POST`, `DELETE`) for presets in `src/webui/index.js` - Update `public/views/settings.html` with new Presets UI card and modals - Update `public/js/components/claude-config.js` with auto-load logic and unsaved changes protection - Add translations (EN/ZH) for new UI elements in `public/js/store.js` - Add integration tests in `tests/frontend/test-frontend-settings.cjs` - Update compiled CSS Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-12 11:59:32 +01:00
copilot-swe-agent[bot]	cd594f6e82	Address code review: use constant array for proxy env vars and clean empty env Co-authored-by: simon-ami <102378134+simon-ami@users.noreply.github.com>	2026-01-11 11:56:01 +00:00
copilot-swe-agent[bot]	8eba68e47a	Add Restore Default Claude CLI button to settings page Co-authored-by: simon-ami <102378134+simon-ami@users.noreply.github.com>	2026-01-11 11:54:13 +00:00
Badri Narayanan S	325acdba8c	fix: preserve tool_use stop reason from being overwritten by finishReason When a tool call is made, stopReason is set to 'tool_use'. However, when finishReason: STOP arrives later, it was overwriting stopReason back to 'end_turn', breaking multi-turn tool conversations in clients like OpenCode. Fix: Initialize stopReason to null and only set it from finishReason if not already set. This ensures tool_use is preserved once detected. Fixes #96 Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-11 11:52:35 +05:30
Wha1eChai	14ef814c1a	merge: sync with upstream/main (5xx error fallback feature)	2026-01-11 02:57:59 +08:00
Wha1eChai	bda9623f3a	feat(webui): optimize CSS build system and enhance UI quality	2026-01-11 02:57:42 +08:00
Badri Narayanan S	6868bf217c	Merge pull request #47 from Wha1eChai/feature/webui feat: Add Web UI for account and quota management	2026-01-10 22:21:57 +05:30
Tiago Rodrigues	0b477c2552	feat: fallback to alternate model when max retries exceeded with 5xx errors When all accounts fail with HTTP 500/503 errors (e.g., Google API returning 'Unknown Error' for Claude models on large conversations), the proxy now attempts to use a fallback model if --fallback is enabled. This enables graceful degradation when: - All accounts are exhausted due to 5xx errors (not just rate limits) - Claude models fail on very large conversations - The API has temporary issues with specific models The fallback uses the existing MODEL_FALLBACK_MAP configuration: - claude-opus-4-5-thinking -> gemini-3-pro-high - claude-sonnet-4-5-thinking -> gemini-3-flash Relates to #88	2026-01-10 12:08:53 +00:00
Wha1eChai	ee6d222e4d	feat(webui): add subscription tier and quota visualization Backend:	2026-01-10 06:05:17 +08:00
Wha1eChai	369a66e8cf	Merge branch 'main' into feature/webui	2026-01-10 04:46:30 +08:00
Wha1eChai	71c7c2e423	feat(webui): enhance settings UI, persistence and documentation - Update CLAUDE.md with comprehensive WebUI architecture and API documentation - Improve settings UI with searchable model dropdowns and visual family indicators - Migrate usage statistics persistence to user config directory with auto-migration - Refactor server request handling and fix model suffix logic	2026-01-10 04:22:59 +08:00
Badri Narayanan S	f1e945a7e6	change cleanSchemaForGemini to cleanSchema	2026-01-10 00:35:50 +05:30
Tiago Rodrigues	90214c43b0	fix: convert schema types to Google uppercase format (fixes #82 ) The /compact command was failing with 'Proto field is not repeating, cannot start list' error for Claude models because tool schemas were sent with lowercase JSON Schema types (array, object, string) but Google's Cloud Code API expects uppercase protobuf types (ARRAY, OBJECT, STRING). Changes: - Add toGoogleType() function to convert JSON Schema types to Google format - Add Phase 5 to cleanSchemaForGemini() for type conversion - Apply cleanSchemaForGemini() for ALL models (not just Gemini) since all requests go through Cloud Code API which validates schema format - Add comprehensive test suite with 10 tests covering nested arrays, complex schemas, and real-world Claude Code tool scenarios Fixes #82	2026-01-09 18:36:19 +00:00
Wha1eChai	169e18402f	merge: sync with origin/feature/webui after upstream merge	2026-01-09 18:23:52 +08:00
Wha1eChai	f2f0a7452e	merge: integrate upstream/main (v1.2.15) into feature/webui - Resolved conflict in src/constants.js: kept config-driven approach - Adopted upstream 10-second cooldown default - Added MAX_EMPTY_RESPONSE_RETRIES constant from upstream - Incorporated new test files and GitHub issue templates	2026-01-09 18:08:45 +08:00
Wha1eChai	a914821d49	perf(webui): refactor dashboard modules and optimize API performance	2026-01-09 17:58:09 +08:00
Badri Narayanan S	4c5236d4b3	fix: filter Antigravity system prompt from model responses - Add [ignore] tags around system instruction to prevent model from identifying as "Antigravity" when asked "Who are you?" - Replace full system instruction with minimal version used by CLIProxyAPI/gcli2api to reduce token usage and improve response quality Fixes #76 Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-09 14:10:41 +05:30
SvDp	45755bfa18	fix: add optimistic reset for transient 429 rate limit errors Fixes issue #71 - 'No accounts available' error when API returns 429 The Google Cloud Code API can return 429 RESOURCE_EXHAUSTED errors even when accounts have quota available due to: - Temporary API load/throttling - Per-minute request limits (not per-day quota) - Transient backend issues This fix adds: 1. A 500ms buffer after waiting for rate limits to expire 2. Optimistic rate limit reset when all accounts appear stuck 3. Retry logic that clears rate limits and tries again The fix works in conjunction with the server-level optimistic reset that already exists, providing multiple layers of protection against false 'No accounts available' errors. Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-08 21:31:50 +05:30
Wha1eChai	c9c5e7d486	feat(webui): add hot-reload account management with OAuth support	2026-01-08 23:52:31 +08:00
Badri Narayanan S	5f6ce1b97d	Update daily Cloud Code endpoint to production URL Remove sandbox subdomain from daily-cloudcode-pa endpoint. Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-08 20:24:23 +05:30
Badri Narayanan S	a696ed0872	Merge pull request #64 from BrunoMarc/fix/empty-response-retry fix: add retry mechanism for empty API responses	2026-01-08 17:38:18 +05:30
Wha1eChai	217053839f	feat(webui): Enhance dashboard, global styles, and settings module ## Dashboard Enhancements - Add Request Volume trend chart with Chart.js line graph - Support Family/Model display modes for aggregation levels - Show Total/Today/1H usage statistics - Hierarchical filter dropdown with Smart select (Top 5 by 24h usage) - Persist chart preferences to localStorage - Improve account health detection logic - Core models (sonnet/opus/pro/flash) require >5% quota to be healthy - Dynamic quota ring chart supporting any model family - Unify table styles with standard-table class ## Global Style Refactoring - Add CSS variable system for theming - Space color scale (950/900/850/800/border) - Neon accent colors (purple/green/cyan/yellow/red) - Text hierarchy (main/dim/muted/bright) - Chart palette (16 colors) - Add unified component classes - .view-container for consistent page layouts - .section-header/.section-title/.section-desc - .standard-table for table styling - Update scrollbar, nav-item, progress-bar to use theme variables ## Settings Module Extensions - Add model mapping column in Models tab - Enhance model selectors with family color indicators - Support horizontal scroll for tabs on narrow screens - Add defaultCooldownMs and maxWaitBeforeErrorMs config options ## New Module - Add src/modules/usage-stats.js for request tracking - Track /v1/messages and /v1/chat/completions endpoints - Hierarchical storage: { hour: { family: { model: count } } } - Auto-save every minute, 30-day retention - GET /api/stats/history endpoint for dashboard chart ## Backend Changes - Add direct account manipulation helpers (bypass AccountManager) - Add POST /api/config/password endpoint for WebUI password change - Auto-reload AccountManager after account operations - Use CSS variables in OAuth callback pages ## Other - Update .gitignore for runtime data directory - Add i18n keys for new UI elements (EN/zh_CN) Co-Authored-By: Claude <noreply@anthropic.com>	2026-01-08 19:04:43 +08:00

1 2 3

111 Commits