13 KiB
CLAUDE.md
This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
Project Overview
Antigravity Claude Proxy is a Node.js proxy server that exposes an Anthropic-compatible API backed by Antigravity's Cloud Code service. It enables using Claude models (claude-sonnet-4-5-thinking, claude-opus-4-5-thinking) and Gemini models (gemini-3-flash, gemini-3-pro-low, gemini-3-pro-high) with Claude Code CLI.
The proxy translates requests from Anthropic Messages API format → Google Generative AI format → Antigravity Cloud Code API, then converts responses back to Anthropic format with full thinking/streaming support.
Commands
# Install dependencies
npm install
# Start server (runs on port 8080)
npm start
# Start with model fallback enabled (falls back to alternate model when quota exhausted)
npm start -- --fallback
# Start with debug logging
npm start -- --debug
# Start with file watching for development
npm run dev
# Account management
npm run accounts # Interactive account management
npm run accounts:add # Add a new Google account via OAuth
npm run accounts:add -- --no-browser # Add account on headless server (manual code input)
npm run accounts:list # List configured accounts
npm run accounts:verify # Verify account tokens are valid
# Run all tests (server must be running on port 8080)
npm test
# Run individual tests
npm run test:signatures # Thinking signatures
npm run test:multiturn # Multi-turn with tools
npm run test:streaming # Streaming SSE events
npm run test:interleaved # Interleaved thinking
npm run test:images # Image processing
npm run test:caching # Prompt caching
npm run test:crossmodel # Cross-model thinking signatures
npm run test:oauth # OAuth no-browser mode
Architecture
Request Flow:
Claude Code CLI → Express Server (server.js) → CloudCode Client → Antigravity Cloud Code API
Directory Structure:
src/
├── index.js # Entry point
├── server.js # Express server
├── constants.js # Configuration values
├── errors.js # Custom error classes
├── fallback-config.js # Model fallback mappings and helpers
│
├── cloudcode/ # Cloud Code API client
│ ├── index.js # Public API exports
│ ├── session-manager.js # Session ID derivation for caching
│ ├── rate-limit-parser.js # Parse reset times from headers/errors
│ ├── request-builder.js # Build API request payloads
│ ├── sse-parser.js # Parse SSE for non-streaming
│ ├── sse-streamer.js # Stream SSE events in real-time
│ ├── message-handler.js # Non-streaming message handling
│ ├── streaming-handler.js # Streaming message handling
│ └── model-api.js # Model listing and quota APIs
│
├── account-manager/ # Multi-account pool management
│ ├── index.js # AccountManager class facade
│ ├── storage.js # Config file I/O and persistence
│ ├── selection.js # Account picking (round-robin, sticky)
│ ├── rate-limits.js # Rate limit tracking and state
│ └── credentials.js # OAuth token and project handling
│
├── auth/ # Authentication
│ ├── oauth.js # Google OAuth with PKCE
│ ├── token-extractor.js # Legacy token extraction from DB
│ └── database.js # SQLite database access
│
├── webui/ # Web Management Interface
│ └── index.js # Express router and API endpoints
│
├── cli/ # CLI tools
│ └── accounts.js # Account management CLI
│
├── format/ # Format conversion (Anthropic ↔ Google)
│ ├── index.js # Re-exports all converters
│ ├── request-converter.js # Anthropic → Google conversion
│ ├── response-converter.js # Google → Anthropic conversion
│ ├── content-converter.js # Message content conversion
│ ├── schema-sanitizer.js # JSON Schema cleaning for Gemini
│ ├── thinking-utils.js # Thinking block validation/recovery
│ └── signature-cache.js # Signature cache (tool_use + thinking signatures)
│
└── utils/ # Utilities
├── helpers.js # formatDuration, sleep
├── logger.js # Structured logging
└── native-module-helper.js # Auto-rebuild for native modules
Frontend Structure (public/):
public/
├── index.html # Main entry point
├── js/
│ ├── app.js # Main application logic (Alpine.js)
│ ├── store.js # Global state management
│ ├── components/ # UI Components
│ │ ├── dashboard.js # Real-time stats & charts
│ │ ├── account-manager.js # Account list & OAuth handling
│ │ ├── logs-viewer.js # Live log streaming
│ │ └── claude-config.js # CLI settings editor
│ └── utils/ # Frontend utilities
└── views/ # HTML partials (loaded dynamically)
├── dashboard.html
├── accounts.html
├── settings.html
└── logs.html
Key Modules:
- src/server.js: Express server exposing Anthropic-compatible endpoints (
/v1/messages,/v1/models,/health,/account-limits) and mounting WebUI - src/webui/index.js: WebUI backend handling API routes (
/api/*) for config, accounts, and logs - src/cloudcode/: Cloud Code API client with retry/failover logic, streaming and non-streaming support
model-api.js: Model listing, quota retrieval (getModelQuotas()), and subscription tier detection (getSubscriptionTier())
- src/account-manager/: Multi-account pool with sticky selection, rate limit handling, and automatic cooldown
- src/auth/: Authentication including Google OAuth, token extraction, database access, and auto-rebuild of native modules
- src/format/: Format conversion between Anthropic and Google Generative AI formats
- src/constants.js: API endpoints, model mappings, fallback config, OAuth config, and all configuration values
- src/fallback-config.js: Model fallback mappings (
getFallbackModel(),hasFallback()) - src/errors.js: Custom error classes (
RateLimitError,AuthError,ApiError, etc.)
Multi-Account Load Balancing:
- Sticky account selection for prompt caching (stays on same account across turns)
- Model-specific rate limiting via
account.modelRateLimits[modelId] - Automatic switch only when rate-limited for > 2 minutes on the current model
- Session ID derived from first user message hash for cache continuity
- Account state persisted to
~/.config/antigravity-proxy/accounts.json
Account Data Model:
Each account object in accounts.json contains:
- Basic Info:
email,source(oauth/manual/database),enabled,lastUsed - Credentials:
refreshToken(OAuth) orapiKey(manual) - Subscription:
{ tier, projectId, detectedAt }- automatically detected vialoadCodeAssistAPItier: 'free' | 'pro' | 'ultra' (detected frompaidTierorcurrentTier)
- Quota:
{ models: {}, lastChecked }- model-specific quota cachemodels[modelId]:{ remainingFraction, resetTime }fromfetchAvailableModelsAPI
- Rate Limits:
modelRateLimits[modelId]- temporary rate limit state (in-memory during runtime) - Validity:
isInvalid,invalidReason- tracks accounts needing re-authentication
Prompt Caching:
- Cache is organization-scoped (requires same account + session ID)
- Session ID is SHA256 hash of first user message content (stable across turns)
cache_read_input_tokensreturned in usage metadata when cache hits- Token calculation:
input_tokens = promptTokenCount - cachedContentTokenCount
Model Fallback (--fallback flag):
- When all accounts are exhausted for a model, automatically falls back to an alternate model
- Fallback mappings defined in
MODEL_FALLBACK_MAPinsrc/constants.js - Thinking models fall back to thinking models (e.g.,
claude-sonnet-4-5-thinking→gemini-3-flash) - Fallback is disabled on recursive calls to prevent infinite chains
- Enable with
npm start -- --fallbackorFALLBACK=trueenvironment variable
Cross-Model Thinking Signatures:
- Claude and Gemini use incompatible thinking signatures
- When switching models mid-conversation, incompatible signatures are detected and dropped
- Signature cache tracks model family ('claude' or 'gemini') for each signature
hasGeminiHistory()detects Gemini→Claude cross-model scenarios- Thinking recovery (
closeToolLoopForThinking()) injects synthetic messages to close interrupted tool loops - For Gemini targets: strict validation - drops unknown or mismatched signatures
- For Claude targets: lenient - lets Claude validate its own signatures
Native Module Auto-Rebuild:
- When Node.js is updated, native modules like
better-sqlite3may become incompatible - The proxy automatically detects
NODE_MODULE_VERSIONmismatch errors - On detection, it attempts to rebuild the module using
npm rebuild - If rebuild succeeds, the module is reloaded; if reload fails, a server restart is required
- Implementation in
src/utils/native-module-helper.jsand lazy loading insrc/auth/database.js
Web Management UI:
- Stack: Vanilla JS + Alpine.js + Tailwind CSS (via CDN)
- Architecture: Single Page Application (SPA) with dynamic view loading
- State Management: Alpine.store for global state (accounts, settings, logs)
- Features:
- Real-time dashboard with Chart.js visualization and subscription tier distribution
- Account list with tier badges (Ultra/Pro/Free) and quota progress bars
- OAuth flow handling via popup window
- Live log streaming via Server-Sent Events (SSE)
- Config editor for both Proxy and Claude CLI (
~/.claude/settings.json)
- Security: Optional password protection via
WEBUI_PASSWORDenv var - Smart Refresh: Client-side polling with ±20% jitter and tab visibility detection (3x slower when hidden)
Testing Notes
- Tests require the server to be running (
npm startin separate terminal) - Tests are CommonJS files (
.cjs) that make HTTP requests to the local proxy - Shared test utilities are in
tests/helpers/http-client.cjs - Test runner supports filtering:
node tests/run-all.cjs <filter>to run matching tests
Code Organization
Constants: All configuration values are centralized in src/constants.js:
- API endpoints and headers
- Model mappings and model family detection (
getModelFamily(),isThinkingModel()) - Model fallback mappings (
MODEL_FALLBACK_MAP) - OAuth configuration
- Rate limit thresholds
- Thinking model settings
Model Family Handling:
getModelFamily(model)returns'claude'or'gemini'based on model name- Claude models use
signaturefield on thinking blocks - Gemini models use
thoughtSignaturefield on functionCall parts (cached or sentinel value) - When Claude Code strips
thoughtSignature, the proxy tries to restore from cache, then falls back toskip_thought_signature_validator
Error Handling: Use custom error classes from src/errors.js:
RateLimitError- 429/RESOURCE_EXHAUSTED errorsAuthError- Authentication failuresApiError- Upstream API errors- Helper functions:
isRateLimitError(),isAuthError()
Utilities: Shared helpers in src/utils/helpers.js:
formatDuration(ms)- Format milliseconds as "1h23m45s"sleep(ms)- Promise-based delayisNetworkError(error)- Check if error is a transient network error
Data Persistence:
- Subscription and quota data are automatically fetched when
/account-limitsis called - Updated data is saved to
accounts.jsonasynchronously (non-blocking) - On server restart, accounts load with last known subscription/quota state
- Quota is refreshed on each WebUI poll (default: 30s with jitter)
Logger: Structured logging via src/utils/logger.js:
logger.info(msg)- Standard info (blue)logger.success(msg)- Success messages (green)logger.warn(msg)- Warnings (yellow)logger.error(msg)- Errors (red)logger.debug(msg)- Debug output (magenta, only when enabled)logger.setDebug(true)- Enable debug modelogger.isDebugEnabled- Check if debug mode is on
WebUI APIs:
/api/accounts/*- Account management (list, add, remove, refresh)/api/config/*- Server configuration (read/write)/api/claude/config- Claude CLI settings/api/logs/stream- SSE endpoint for real-time logs/api/auth/url- Generate Google OAuth URL/account-limits- Fetch account quotas and subscription data- Returns:
{ accounts: [{ email, subscription: { tier, projectId }, limits: {...} }], models: [...] } - Query params:
?format=table(ASCII table) or?includeHistory=true(adds usage stats)
- Returns:
Maintenance
When making significant changes to the codebase (new modules, refactoring, architectural changes), update this CLAUDE.md and the README.md file to keep documentation in sync.