Getting Started with PAL MCP Server
This guide walks you through setting up the PAL MCP Server from scratch, including installation, configuration, and first usage.
Prerequisites
- Python 3.10+ (3.12 recommended)
- Git
- uv installed (for uvx method)
- Windows users: WSL2 required for Claude Code CLI
Step 1: Get API Keys
You need at least one API key. Choose based on your needs:
Option A: OpenRouter (Recommended for beginners)
One API for multiple models
- Visit OpenRouter and sign up
- Generate an API key
- Control spending limits in your dashboard
- Access GPT-4, Claude, Gemini, and more through one API
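To sanity-check a new key, you can query OpenRouter's models endpoint directly (URL per OpenRouter's public API docs; the Bearer value is a placeholder for your key):
# Lists the models reachable through the single OpenRouter API
curl -s https://openrouter.ai/api/v1/models \
  -H "Authorization: Bearer your-openrouter-key"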
Option B: Native Provider APIs
Gemini (Google):
- Visit Google AI Studio
- Generate an API key
- Note: For Gemini 3.0 / 2.5 Pro, use a paid API key (free tier has limited access)
OpenAI:
- Visit OpenAI Platform
- Generate an API key for GPT-5.2, GPT-5.1-Codex, GPT-5, O3 access
X.AI (Grok):
- Visit X.AI Console
- Generate an API key for Grok models
DIAL Platform:
- Visit DIAL Platform
- Generate API key for vendor-agnostic model access
Option C: Local Models (Free)
Ollama:
# Install Ollama
curl -fsSL https://ollama.ai/install.sh | sh
# Start Ollama service
ollama serve
# Pull a model (e.g., Llama 3.2)
ollama pull llama3.2
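To confirm Ollama's OpenAI-compatible endpoint is reachable before pointing PAL at it (this is the same URL used for CUSTOM_API_URL in Step 3):
# Should return a JSON model list that includes llama3.2 once the pull completes
curl -s http://localhost:11434/v1/models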
Other local options:
- vLLM: Self-hosted inference server
- LM Studio: Local model hosting with OpenAI-compatible API
- Text Generation WebUI: Popular local interface
👉 Complete custom model setup guide
Step 2: Installation
Choose your preferred installation method:
Method A: Instant Setup with uvx (Recommended)
Prerequisites: Install uv first
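If uv isn't installed yet, Astral's standalone installer is the quickest route on macOS/Linux (see the uv documentation for Windows and package-manager options):
# Installs uv and uvx into ~/.local/bin
curl -LsSf https://astral.sh/uv/install.sh | sh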
Choose your AI coding assistant and add the corresponding configuration:
For Claude Desktop:
- Open Claude Desktop → Settings → Developer → Edit Config
- Add this configuration:
{
  "mcpServers": {
    "pal": {
      "command": "sh",
      "args": [
        "-c",
        "for p in $(which uvx 2>/dev/null) $HOME/.local/bin/uvx /opt/homebrew/bin/uvx /usr/local/bin/uvx uvx; do [ -x \"$p\" ] && exec \"$p\" --from git+https://github.com/BeehiveInnovations/pal-mcp-server.git pal-mcp-server; done; echo 'uvx not found' >&2; exit 1"
      ],
      "env": {
        "PATH": "/usr/local/bin:/usr/bin:/bin:/opt/homebrew/bin:~/.local/bin",
        "GEMINI_API_KEY": "your_api_key_here"
      }
    }
  }
}
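Before restarting Claude Desktop, it's worth validating that the edited file still parses as JSON; on macOS the config typically lives at the path below (other platforms differ):
# Pretty-prints the config on success, errors out on a typo
python3 -m json.tool "$HOME/Library/Application Support/Claude/claude_desktop_config.json"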
For Claude Code CLI:
Create .mcp.json in your project root:
{
  "mcpServers": {
    "pal": {
      "command": "sh",
      "args": [
        "-c",
        "for p in $(which uvx 2>/dev/null) $HOME/.local/bin/uvx /opt/homebrew/bin/uvx /usr/local/bin/uvx uvx; do [ -x \"$p\" ] && exec \"$p\" --from git+https://github.com/BeehiveInnovations/pal-mcp-server.git pal-mcp-server; done; echo 'uvx not found' >&2; exit 1"
      ],
      "env": {
        "PATH": "/usr/local/bin:/usr/bin:/bin:/opt/homebrew/bin:~/.local/bin",
        "GEMINI_API_KEY": "your_api_key_here"
      }
    }
  }
}
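Recent Claude Code releases ship a claude mcp subcommand that can confirm the registration (treat this as a quick check; exact flags vary by version):
# Run from the project root containing .mcp.json
claude mcp list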
For Gemini CLI:
Edit ~/.gemini/settings.json:
{
  "mcpServers": {
    "pal": {
      "command": "sh",
      "args": [
        "-c",
        "for p in $(which uvx 2>/dev/null) $HOME/.local/bin/uvx /opt/homebrew/bin/uvx /usr/local/bin/uvx uvx; do [ -x \"$p\" ] && exec \"$p\" --from git+https://github.com/BeehiveInnovations/pal-mcp-server.git pal-mcp-server; done; echo 'uvx not found' >&2; exit 1"
      ],
      "env": {
        "PATH": "/usr/local/bin:/usr/bin:/bin:/opt/homebrew/bin:~/.local/bin",
        "GEMINI_API_KEY": "your_api_key_here"
      }
    }
  }
}
For Codex CLI:
Edit ~/.codex/config.toml:
[mcp_servers.pal]
command = "bash"
args = ["-c", "for p in $(which uvx 2>/dev/null) $HOME/.local/bin/uvx /opt/homebrew/bin/uvx /usr/local/bin/uvx uvx; do [ -x \"$p\" ] && exec \"$p\" --from git+https://github.com/BeehiveInnovations/pal-mcp-server.git pal-mcp-server; done; echo 'uvx not found' >&2; exit 1"]
tool_timeout_sec = 1200 # 20 minutes; added automatically by the setup script so upstream providers can respond
[mcp_servers.pal.env]
PATH = "/usr/local/bin:/usr/bin:/bin:/opt/homebrew/bin:$HOME/.local/bin:$HOME/.cargo/bin:$HOME/bin"
GEMINI_API_KEY = "your_api_key_here"
Enable Codex's built-in web-search tool so PAL's apilookup instructions can execute successfully:
[tools]
web_search = true
Add the block above if [tools] is missing from the file; otherwise ensure web_search = true appears in that section.
For Qwen Code CLI:
Create or edit ~/.qwen/settings.json:
{
  "mcpServers": {
    "pal": {
      "command": "bash",
      "args": [
        "-c",
        "for p in $(which uvx 2>/dev/null) $HOME/.local/bin/uvx /opt/homebrew/bin/uvx /usr/local/bin/uvx uvx; do [ -x \"$p\" ] && exec \"$p\" --from git+https://github.com/BeehiveInnovations/pal-mcp-server.git pal-mcp-server; done; echo 'uvx not found' >&2; exit 1"
      ],
      "cwd": "/path/to/pal-mcp-server",
      "env": {
        "PATH": "/usr/local/bin:/usr/bin:/bin:/opt/homebrew/bin:~/.local/bin",
        "GEMINI_API_KEY": "your_api_key_here"
      }
    }
  }
}
Replace the placeholder API key with the providers you use (Gemini, OpenAI, OpenRouter, etc.).
For OpenCode CLI:
Edit ~/.config/opencode/opencode.json:
{
  "$schema": "https://opencode.ai/config.json",
  "mcp": {
    "pal": {
      "type": "local",
      "command": [
        "/path/to/pal-mcp-server/.pal_venv/bin/python",
        "/path/to/pal-mcp-server/server.py"
      ],
      "cwd": "/path/to/pal-mcp-server",
      "enabled": true,
      "environment": {
        "GEMINI_API_KEY": "your_api_key_here"
      }
    }
  }
}
Add any other API keys you rely on (OPENAI_API_KEY, OPENROUTER_API_KEY, etc.).
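For instance, a sketch of the environment block with several providers enabled at once (key names mirror the .env reference in Step 3; values are placeholders):
"environment": {
  "GEMINI_API_KEY": "your_api_key_here",
  "OPENAI_API_KEY": "your_api_key_here",
  "OPENROUTER_API_KEY": "your_api_key_here"
}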
IDE Clients (Cursor & VS Code)
PAL works in GUI IDEs that speak MCP. The configuration mirrors the CLI examples above—point the client at the uvx launcher and set any required environment variables.
Cursor IDE
- Open Cursor → Settings (Cmd+,/Ctrl+,) → Integrations › Model Context Protocol (MCP).
- Click Add MCP Server and supply the following values:
  - Command: sh
  - Args: -c and for p in $(which uvx 2>/dev/null) $HOME/.local/bin/uvx /opt/homebrew/bin/uvx /usr/local/bin/uvx uvx; do [ -x "$p" ] && exec "$p" --from git+https://github.com/BeehiveInnovations/pal-mcp-server.git pal-mcp-server; done; echo 'uvx not found' >&2; exit 1
  - Environment (example): PATH=/usr/local/bin:/usr/bin:/bin:/opt/homebrew/bin:~/.local/bin and GEMINI_API_KEY=your_api_key_here
- Save the configuration; Cursor will launch the MCP server on demand. See the Cursor MCP guide for screenshots of the UI.
Visual Studio Code (Claude Dev extension)
- Install the Claude Dev extension v0.6.0 or later.
- Open the Command Palette (Cmd+Shift+P/Ctrl+Shift+P) → Claude: Configure MCP Servers → Add server.
- When prompted, use the same values as above:
  - Command: sh
  - Args: -c and the uvx bootstrap loop
  - Environment: add the API keys you need (e.g. GEMINI_API_KEY, OPENAI_API_KEY)
- Save the JSON snippet the extension generates. VS Code will reload the server automatically the next time you interact with Claude.
👉 Pro tip: If you prefer a one-line command, replace the long loop with uvx --from git+https://github.com/BeehiveInnovations/pal-mcp-server.git pal-mcp-server—just make sure uvx is on your PATH for every client.
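For example, the Claude Desktop entry from earlier collapses to a sketch like this (same placeholder key; uvx must resolve on the client's PATH):
{
  "mcpServers": {
    "pal": {
      "command": "uvx",
      "args": ["--from", "git+https://github.com/BeehiveInnovations/pal-mcp-server.git", "pal-mcp-server"],
      "env": {
        "GEMINI_API_KEY": "your_api_key_here"
      }
    }
  }
}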
Benefits of uvx method:
- ✅ Zero manual setup required
- ✅ Always pulls latest version
- ✅ No local dependencies to manage
- ✅ Works without Python environment setup
Method B: Clone and Setup
# Clone the repository
git clone https://github.com/BeehiveInnovations/pal-mcp-server.git
cd pal-mcp-server
# One-command setup (handles everything)
./run-server.sh
# Or for Windows PowerShell:
./run-server.ps1
# View configuration for Claude Desktop
./run-server.sh -c
# See all options
./run-server.sh --help
What the setup script does:
- ✅ Creates Python virtual environment
- ✅ Installs all dependencies
- ✅ Creates .env file for API keys
- ✅ Configures Claude integrations
- ✅ Provides copy-paste configuration
After updates: Always run ./run-server.sh again after git pull.
Windows users: See the WSL Setup Guide for detailed WSL configuration.
Step 3: Configure API Keys
For uvx installation:
Add your API keys directly to the MCP configuration shown above.
For clone installation:
Edit the .env file:
nano .env
Add your API keys (at least one required):
# Choose your providers (at least one required)
GEMINI_API_KEY=your-gemini-api-key-here # For Gemini models
OPENAI_API_KEY=your-openai-api-key-here # For GPT-5.2, GPT-5.1-Codex, O3
XAI_API_KEY=your-xai-api-key-here # For Grok models
OPENROUTER_API_KEY=your-openrouter-key # For multiple models
# DIAL Platform (optional)
DIAL_API_KEY=your-dial-api-key-here
DIAL_API_HOST=https://core.dialx.ai # Default host (optional)
DIAL_API_VERSION=2024-12-01-preview # API version (optional)
DIAL_ALLOWED_MODELS=o3,gemini-2.5-pro # Restrict models (optional)
# Custom/Local models (Ollama, vLLM, etc.)
CUSTOM_API_URL=http://localhost:11434/v1 # Ollama example
CUSTOM_API_KEY= # Empty for Ollama
CUSTOM_MODEL_NAME=llama3.2 # Default model name
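After saving, you can confirm which keys are present without echoing the secrets themselves:
# Show configured *_API_KEY entries with their values masked
grep -E '^[A-Z_]*API_KEY=' .env | sed 's/=.*/=****/'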
Prevent Client Timeouts
Some MCP clients default to short timeouts and can disconnect from PAL during long tool runs. Configure each client with a generous ceiling (we recommend at least five minutes); the PAL setup script now writes a 20-minute tool timeout for Codex so upstream providers contacted by the server have time to respond.
Claude Code & Claude Desktop
Claude reads MCP-related environment variables either from your shell or from ~/.claude/settings.json. Add (or update) the env block so both startup and tool execution use a 5-minute limit:
{
  "env": {
    "MCP_TIMEOUT": "300000",
    "MCP_TOOL_TIMEOUT": "300000"
  }
}
You can scope this block at the top level of settings.json (applies to every session) or under a specific mcpServers.<name>.env entry if you only want it for PAL (the server name may still be pal while configurations catch up). The values are in milliseconds. Note: Claude’s SSE transport still enforces an internal ceiling of roughly five minutes; long-running HTTP/SSE servers may need retries until Anthropic ships their fix.
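For the per-server variant described above, the same keys sit under the server's env entry; a sketch, with the launcher fields (command, args) omitted for brevity:
{
  "mcpServers": {
    "pal": {
      "env": {
        "MCP_TIMEOUT": "300000",
        "MCP_TOOL_TIMEOUT": "300000"
      }
    }
  }
}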
Codex CLI
Codex exposes per-server timeouts in ~/.codex/config.toml. Add (or bump) these keys under [mcp_servers.<name>]:
[mcp_servers.pal]
command = "..."
args = ["..."]
startup_timeout_sec = 300 # default is 10 seconds
tool_timeout_sec = 1200 # default is 60 seconds; setup script pre-populates 20 minutes so upstream providers can respond
startup_timeout_sec covers the initial handshake/list tools step, while tool_timeout_sec governs each tool call. Raise the latter if the providers your MCP server invokes routinely need more than 20 minutes.
Gemini CLI
Gemini uses a single timeout field per server inside ~/.gemini/settings.json. Set it to at least five minutes (values are milliseconds):
{
  "mcpServers": {
    "pal": {
      "command": "uvx",
      "args": ["pal-mcp-server"],
      "timeout": 300000
    }
  }
}
Versions 0.2.1 and newer currently ignore values above ~60 seconds for some transports due to a known regression; if you still see premature disconnects we recommend breaking work into smaller calls or watching the Gemini CLI release notes for the fix.
Important notes:
- ⭐ No restart needed - Changes take effect immediately
- ⭐ If multiple APIs are configured, native APIs take priority over OpenRouter
- ⭐ Configure model aliases in conf/custom_models.json
Step 4: Test the Installation
For Claude Desktop:
- Restart Claude Desktop
- Open a new conversation
- Try: "Use pal to list available models"
For Claude Code CLI:
- Exit any existing Claude session
- Run claude from your project directory
- Try: "Use pal to chat about Python best practices"
For Gemini CLI:
Note: While PAL MCP connects to Gemini CLI, tool invocation isn't working correctly yet. See Gemini CLI Setup for updates.
For Qwen Code CLI:
- Restart the Qwen Code CLI if it's running (qwen exit).
- Run qwen mcp list --scope user and confirm pal shows CONNECTED.
- Try: "/mcp" to inspect available tools, or "Use pal to analyze this repo".
For OpenCode CLI:
- Restart OpenCode (or run OpenCode: Reload Config).
- Open Settings › Tools › MCP and confirm pal is enabled.
- Start a new chat and try: "Use pal to list available models".
For Codex CLI:
- Restart Codex CLI if running
- Open a new conversation
- Try: "Use pal to list available models"
Test Commands:
"Use pal to list available models"
"Chat with pal about the best approach for API design"
"Use pal thinkdeep with gemini pro about scaling strategies"
"Debug this error with o3: [paste error]"
Note: Codex CLI provides excellent MCP integration with automatic environment variable configuration when using the setup script.
Step 5: Start Using PAL
Basic Usage Patterns:
Let Claude pick the model:
"Use pal to analyze this code for security issues"
"Debug this race condition with pal"
"Plan the database migration with pal"
Specify the model:
"Use pal with gemini pro to review this complex algorithm"
"Debug with o3 using pal for logical analysis"
"Get flash to quickly format this code via pal"
Multi-model workflows:
"Use pal to get consensus from pro and o3 on this architecture"
"Code review with gemini, then precommit validation with o3"
"Analyze with flash, then deep dive with pro if issues found"
Quick Tool Reference:
🤝 Collaboration: chat, thinkdeep, planner, consensus
🔍 Code Analysis: analyze, codereview, debug, precommit
⚒️ Development: refactor, testgen, secaudit, docgen
🔧 Utilities: challenge, tracer, listmodels, version
👉 Complete Tools Reference with detailed examples and parameters
Common Issues and Solutions
"pal not found" or "command not found"
For uvx installations:
- Ensure uv is installed and in PATH
- Try: which uvx to verify uvx is available
- Check PATH includes /usr/local/bin and ~/.local/bin
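To see startup errors directly, launch the server by hand with the same command the config uses; a healthy server simply waits for a client on stdio (Ctrl+C to exit):
# Confirm the binary resolves, then run the bootstrap command manually
which uvx
uvx --from git+https://github.com/BeehiveInnovations/pal-mcp-server.git pal-mcp-server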
For clone installations:
- Run ./run-server.sh again to verify setup
- Check virtual environment: which python should show .pal_venv/bin/python
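You can also exercise the venv interpreter directly; both paths follow the layout the setup script creates, and launching server.py by hand should surface any import or configuration errors (Ctrl+C to exit):
# From the repo root
./.pal_venv/bin/python --version
./.pal_venv/bin/python server.py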
API Key Issues
"Invalid API key" errors:
- Verify API keys in .env file or MCP configuration
- Test API keys directly with provider's API
- Check for extra spaces or quotes around keys
"Model not available":
- Run "Use pal to list available models" to see what's configured
- Check model restrictions in environment variables
- Verify API key has access to requested models
Performance Issues
Slow responses:
- Use faster models: flash instead of pro
- Lower thinking modes: minimal or low instead of high
- Restrict model access to prevent expensive model selection
Token limit errors:
- Use models with larger context windows
- Break large requests into smaller chunks
- See Working with Large Prompts
More Help
👉 Complete Troubleshooting Guide with detailed solutions
👉 Advanced Usage Guide for power-user features
👉 Configuration Reference for all options
What's Next?
🎯 Try the example workflows in the main README
📚 Explore the Tools Reference to understand what each tool can do
⚡ Read the Advanced Usage Guide for complex workflows
🔧 Check out Configuration Options to customize behavior
💡 Join discussions and get help in the project issues or discussions
Quick Configuration Templates
Development Setup (Balanced)
DEFAULT_MODEL=auto
GEMINI_API_KEY=your-key
OPENAI_API_KEY=your-key
GOOGLE_ALLOWED_MODELS=flash,pro
OPENAI_ALLOWED_MODELS=gpt-5.1-codex-mini,gpt-5-mini,o4-mini
Cost-Optimized Setup
DEFAULT_MODEL=flash
GEMINI_API_KEY=your-key
GOOGLE_ALLOWED_MODELS=flash
High-Performance Setup
DEFAULT_MODEL=auto
GEMINI_API_KEY=your-key
OPENAI_API_KEY=your-key
GOOGLE_ALLOWED_MODELS=pro
OPENAI_ALLOWED_MODELS=gpt-5.1-codex,gpt-5.2
Local-First Setup
DEFAULT_MODEL=auto
CUSTOM_API_URL=http://localhost:11434/v1
CUSTOM_MODEL_NAME=llama3.2
# Add cloud APIs as backup
GEMINI_API_KEY=your-key
Happy coding with your AI development team! 🤖✨