my-pal-mcp-server

Author	SHA1	Message	Date
Beehive Innovations	95556ba9ea	Add Consensus Tool for Multi-Model Perspective Gathering (#67) * WIP Refactor resolving mode_names, should be done once at MCP call boundary Pass around model context instead Consensus tool allows one to get a consensus from multiple models, optionally assigning one a 'for' or 'against' stance to find nuanced responses. * Deduplication of model resolution, model_context should be available before reaching deeper parts of the code Improved abstraction when building conversations Throw programmer errors early * Guardrails Support for `model:option` format at MCP boundary so future tools can use additional options if needed instead of handling this only for consensus Model name now supports an optional ":option" for future use * Simplified async flow * Improved model for request to support natural language Simplified async flow * Improved model for request to support natural language Simplified async flow * Fix consensus tool async/sync patterns to match codebase standards CRITICAL FIXES: - Converted _get_consensus_responses from async to sync (matches other tools) - Converted store_conversation_turn from async to sync (add_turn is synchronous) - Removed unnecessary asyncio imports and sleep calls - Fixed ClosedResourceError in MCP protocol during long consensus operations PATTERN ALIGNMENT: - Consensus tool now follows same sync patterns as all other tools - Only execute() and prepare_prompt() are async (base class requirement) - All internal operations are synchronous like analyze, chat, debug, etc. TESTING: - MCP simulation test now passes: consensus_stance ✅ - Two-model consensus works correctly in ~35 seconds - Unknown stance handling defaults to neutral with warnings - All 9 unit tests pass (100% success rate) The consensus tool async patterns were anomalous in the codebase. This fix aligns it with the established synchronous patterns used by all other tools while maintaining full functionality. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com> * Fixed call order and added new test * Cleanup dead comments Docs for the new tool Improved tests --------- Co-authored-by: Claude <noreply@anthropic.com>	2025-06-17 10:53:17 +04:00
Fahad	35f37fb92e	Fixed integration test for auto mode	2025-06-16 07:00:27 +04:00
Fahad	903aabd311	Fixed imports and lint	2025-06-16 06:24:33 +04:00
Fahad	2cfe0b163a	Fix all failing tests and pytest collection warnings Fixed MagicMock comparison errors across multiple test suites by: - Adding proper ModelCapabilities mocks with real values instead of MagicMock objects - Updating test_auto_mode.py with correct provider mocking for model availability tests - Updating test_thinking_modes.py with proper capabilities mocking in all thinking mode tests - Updating test_tools.py with proper capabilities mocking for CodeReview and Analyze tools - Fixing test_large_prompt_handling.py by adding proper provider mocking to prevent errors before large prompt detection Fixed pytest collection warnings by: - Renaming TestGenRequest to TestGenerationRequest to avoid pytest collecting it as a test class - Renaming TestGenTool to TestGenerationTool to avoid pytest collecting it as a test class - Updated all imports and references across server.py, tools/__init__.py, and test files All 459 tests now pass without warnings or MagicMock comparison errors. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-06-16 06:02:12 +04:00
Fahad	07a078b4f2	Updated tests and additional tests for folder expansion during conversation tracking	2025-06-15 16:03:43 +04:00
Fahad	6304b7af6b	Native support for xAI Grok3; Model shorthand mapping related fixes; Comprehensive auto-mode related tests	2025-06-15 12:21:44 +04:00
Fahad	4becd70a82	Perform prompt size checks only at the MCP boundary; New test to confirm history build-up and system prompt do not affect prompt size checks; Also check for large prompts in focus_on; Fixed .env.example, which did not comment out CUSTOM_API, causing the run-server script to think at least one key exists	2025-06-15 10:37:08 +04:00
Fahad	8ac5bbb5af	Fixed workspace path mapping; Refactoring; Improved system prompts, more generalized; Home folder protection and detection; Retry logic for gemini	2025-06-14 00:26:59 +04:00
Fahad	3aedb16101	Use the new Gemini 2.5 Flash; Updated to support Thinking Tokens as a ratio of the max allowed; Updated tests; Updated README	2025-06-12 20:46:54 +04:00
Fahad	79af2654b9	Use the new flash model; Updated tests	2025-06-12 13:44:09 +04:00
Fahad	fb66825bf6	Rebranding, refactoring, renaming, cleanup, updated docs	2025-06-12 10:40:43 +04:00
Fahad	2a067a7f4e	WIP major refactor and features	2025-06-12 07:14:59 +04:00
Fahad	98eab46abf	WIP - improvements to token usage tracking, simulator added for live testing, improvements to file loading	2025-06-11 13:24:59 +04:00
Fahad	ba8f7192c3	refactor: rename think_deeper to thinkdeep for brevity - Renamed `think_deeper` tool to `thinkdeep` for shorter, cleaner naming - Updated all imports from ThinkDeeperTool to ThinkDeepTool - Updated all references from THINK_DEEPER_PROMPT to THINKDEEP_PROMPT - Updated tool registration in server.py - Updated all test files to use new naming convention - Updated README documentation to reflect new tool names - All functionality remains the same, only naming has changed This completes the tool renaming refactor for improved clarity and consistency. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-06-10 12:38:38 +04:00
Fahad	5f8ed3aae8	refactor: rename review tools for clarity and consistency - Renamed `review_code` tool to `codereview` for better naming convention - Renamed `review_changes` tool to `precommit` to better reflect its purpose - Updated all tool descriptions to remove "Triggers:" sections and improve clarity - Updated all imports and references throughout the codebase - Renamed test files to match new tool names - Updated server.py tool registrations - All existing functionality preserved with improved naming This refactoring improves code organization and makes tool purposes clearer. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-06-10 12:30:06 +04:00
Fahad	67f18ef3c9	refactor: rename debug_issue tool to debug for brevity - Rename debug_issue.py to debug.py - Update tool name from 'debug_issue' to 'debug' throughout codebase - Update all references in server.py, tests, and README - Keep DebugIssueTool class name for backward compatibility - All tests pass with the renamed tool This makes the tool name shorter and more consistent with other tool names like 'chat' and 'analyze'. The functionality remains exactly the same. 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-06-10 11:43:47 +04:00
Fahad	27add4d05d	feat: Major refactoring and improvements v2.11.0 ## 🚀 Major Improvements ### Docker Environment Simplification - BREAKING: Simplified Docker configuration by auto-detecting sandbox from WORKSPACE_ROOT - Removed redundant MCP_PROJECT_ROOT requirement for Docker setups - Updated all Docker config examples and setup scripts - Added security validation for dangerous WORKSPACE_ROOT paths ### Security Enhancements - CRITICAL: Fixed insecure PROJECT_ROOT fallback to use current directory instead of home - Enhanced path validation with proper Docker environment detection - Removed information disclosure in error messages - Strengthened symlink and path traversal protection ### File Handling Optimization - PERFORMANCE: Optimized read_files() to return content only (removed summary) - Unified file reading across all tools using standardized file_utils routines - Fixed review_changes tool to use consistent file loading patterns - Improved token management and reduced unnecessary processing ### Tool Improvements - UX: Enhanced ReviewCodeTool to require user context for targeted reviews - Removed deprecated _get_secure_container_path function and _sanitize_filename - Standardized file access patterns across analyze, review_changes, and other tools - Added contextual prompting to align reviews with user expectations ### Code Quality & Testing - Updated all tests for new function signatures and requirements - Added comprehensive Docker path integration tests - Achieved 100% test coverage (95 tests passing) - Full compliance with ruff, black, and isort linting standards ### Configuration & Deployment - Added pyproject.toml for modern Python packaging - Streamlined Docker setup removing redundant environment variables - Updated setup scripts across all platforms (Windows, macOS, Linux) - Improved error handling and validation throughout ## 🔧 Technical Changes - Removed: `_get_secure_container_path()`, `_sanitize_filename()`, unused SANDBOX_MODE - Enhanced: Path translation, security validation, token management - Standardized: File reading patterns, error handling, Docker detection - Updated: All tool prompts for better context alignment ## 🛡️ Security Notes This release significantly improves the security posture by: - Eliminating broad filesystem access defaults - Adding validation for Docker environment variables - Removing information disclosure in error paths - Strengthening path traversal and symlink protections 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-06-10 09:50:05 +04:00
Fahad	7ea790ef88	fix: Docker path translation for review_changes and code deduplication - Fixed review_changes tool to properly translate host paths to container paths in Docker - Prevents "No such file or directory" errors when running in Docker containers - Added proper error handling with clear messages when paths are inaccessible refactor: Centralized token limit validation across all tools - Added _validate_token_limit method to BaseTool to eliminate code duplication - Reduced ~25 lines of duplicated code across 5 tools (analyze, chat, debug_issue, review_code, think_deeper) - Maintains exact same error messages and behavior feat: Enhanced large prompt handling - Added support for prompts >50K chars by requesting file-based input - Preserves MCP's ~25K token capacity for responses - All tools now check prompt size before processing test: Added comprehensive Docker path integration tests - Tests for path translation, security validation, and error handling - Tests for review_changes tool specifically with Docker paths - Fixed failing think_deeper test (updated default from "max" to "high") chore: Code quality improvements - Applied black formatting across all files - Fixed import sorting with isort - All tests passing (96 tests) - Standardized error handling follows MCP TextContent format The changes ensure consistent behavior across all environments while reducing code duplication and improving maintainability. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-06-10 07:20:24 +04:00

18 Commits
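
Note on commit 95556ba9ea: the message describes accepting a `model:option` form at the MCP boundary, with the ":option" part optional. The repository's actual parsing code is not shown in this log, so the following is only a minimal sketch of how such a split might look. The function name `parse_model_option` and the example model names are hypothetical, not taken from the codebase.

```python
from typing import Optional, Tuple


def parse_model_option(spec: str) -> Tuple[str, Optional[str]]:
    """Split a 'model:option' string into (model, option).

    Hypothetical helper: the option is None when no ':option' suffix
    is present or when the suffix is empty.
    """
    # partition() splits on the first ':' only, so any later colons
    # stay inside the option part.
    name, _sep, option = spec.strip().partition(":")
    name = name.strip()
    option = option.strip()
    if not name:
        # Fail early on malformed input instead of passing it deeper.
        raise ValueError(f"model name is empty in {spec!r}")
    return name, (option or None)


if __name__ == "__main__":
    # 'model-a' and 'model-b' are placeholder names for illustration.
    print(parse_model_option("model-a:for"))
    print(parse_model_option("model-b"))
```

Doing the split once at the call boundary, as the commit message suggests, means deeper code receives an already-resolved model name and never needs to re-parse the raw string.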