my-pal-mcp-server

Author	SHA1	Message	Date
Fahad	bc93b5343b	fix: CI test	2025-10-04 14:32:47 +04:00
Fahad	2c534ac06e	feat: centralized environment handling, ensures ZEN_MCP_FORCE_ENV_OVERRIDE is honored correctly fix: updated tests to override env variables they need instead of relying on the current values from .env	2025-10-04 14:28:56 +04:00
Fahad	4015e917ed	fix: listmodels to always honor restricted models fix: restrictions should resolve canonical names for openrouter fix: tools now correctly return restricted list by presenting model names in schema fix: tests updated to ensure these manage their expected env vars properly perf: cache model alias resolution to avoid repeated checks	2025-10-04 13:46:22 +04:00
Fahad	06d7701cc3	refactor: removed subclass override when the base class should be resolving the model name refactor: always disable "stream"	2025-10-04 10:35:32 +04:00
Fahad	f955100f3a	refactor: improved retry logic and moved core logic to base class	2025-10-03 23:48:55 +04:00
Fahad	4968e1f7bc	refactor: cleanup, remove unused method	2025-10-03 23:26:22 +04:00
Fahad	a8fbafd5a7	feat: minor tweaks to chat system prompt to help with alignment	2025-10-03 23:09:12 +04:00
Fahad	8759edc817	fix: consensus now advertises a short list of models to avoid the CLI getting the names wrong	2025-10-03 22:41:28 +04:00
Beehive Innovations	775ac8ff58	Merge pull request #268 from Coquinate/feat/gpt5-codex-responses-api feat: add GPT-5-Codex support with Responses API integration	2025-10-03 21:16:13 +04:00
aberemia24	f2653427ca	feat: add GPT-5-Codex support with Responses API integration Adds support for OpenAI's GPT-5-Codex model which uses the new Responses API endpoint (/v1/responses) instead of the standard Chat Completions API. Changes: - Add GPT-5-Codex to MODEL_CAPABILITIES with 400K context, 128K output - Prioritize GPT-5-Codex for EXTENDED_REASONING tasks - Add aliases: codex, gpt5-codex, gpt-5-code - Update tests to expect GPT-5-Codex for extended reasoning Benefits: - 40-80% cost savings through Responses API caching - 3% better performance on coding tasks (SWE-bench) - Leverages existing dual-API infrastructure	2025-10-03 13:59:44 +03:00
Fahad	88493bd357	test: cross tool memory recall, testing continuation via cassette recording	2025-10-03 13:51:32 +04:00
Fahad	3c4b1368c6	test: http cassettes added for improved testing of consensus	2025-10-03 13:07:42 +04:00
Fahad	d55130a430	fix: external model name now recorded properly in responses test: http cassettes added for improved integration tests refactor: generic name for the CLI agent	2025-10-03 11:29:06 +04:00
Fahad	87ccb6b25b	test: fixed integration tests, removed magicmock	2025-10-02 23:47:44 +04:00
Fahad	8b3a2867fb	fix: https://github.com/BeehiveInnovations/zen-mcp-server/issues/194	2025-10-02 23:12:52 +04:00
Fahad	a199e4a955	refactor: cleanup old use_websearch param	2025-10-02 22:46:58 +04:00
Fahad	6cab9e56fc	feat: added `intelligence_score` to the model capabilities schema; a 1-20 number that can be specified to influence the sort order of models presented to the CLI in `auto selection` mode fix: model definition re-introduced into the schema but intelligently and only a summary is generated per tool. Required to ensure CLI calls and uses the correct model fix: removed `model` param from some tools where this wasn't needed fix: fixed adherence to `*_ALLOWED_MODELS` by advertising only the allowed models to the CLI fix: removed duplicates across providers when passing canonical names back to the CLI; the first enabled provider wins	2025-10-02 21:43:44 +04:00
Fahad	4c288b2385	fix: improved conversation retrieval	2025-10-02 14:47:24 +04:00
Fahad	d285fadf4c	fix: custom provider must only accept a model if it's declared explicitly. Upon model rejection (in auto mode) the list of available models is returned up-front to help with selection.	2025-10-02 13:49:23 +04:00
Fahad	693b84db2b	refactor: cleanup provider base class; cleanup shared responsibilities; cleanup public contract docs: document provider base class refactor: cleanup custom provider, it should only deal with `is_custom` model configurations fix: make sure openrouter provider does not load `is_custom` models fix: listmodels tool cleanup	2025-10-02 12:59:45 +04:00
Fahad	14a35afa1d	refactor: moved image related code out of base provider into a separate utility	2025-10-02 11:23:15 +04:00
Fahad	a254ff2220	refactor: removed method from provider, should use model capabilities instead refactor: cleanup temperature factory method	2025-10-02 11:08:56 +04:00
Fahad	6d237d0970	refactor: moved temperature method from base provider to model capabilities refactor: model listing cleanup, moved logic to model_capabilities.py docs: added AGENTS.md for onboarding Codex	2025-10-02 10:25:41 +04:00
Fahad	1dc25f6c3d	refactor: renaming to reflect underlying type docs: updated to reflect new modules	2025-10-02 09:07:40 +04:00
Fahad	250545e34f	refactor: removed hard coded checks, use model capabilities instead	2025-10-02 08:32:51 +04:00
Fahad	182aa627df	refactor: code cleanup	2025-10-02 08:09:44 +04:00
Fahad	cc8a4dfd21	Overall savings should now be 50%+ tokens used perf: tweaks to schema descriptions, aiming to reduce token usage without performance degradation	2025-10-01 22:39:12 +04:00
Fahad	f69ff03c4d	refactor: trimmed some prompts style: version tool output now mentions the default model in use	2025-10-01 21:55:05 +04:00
Fahad	d9449c7bb6	feat: depending on the number of tools in use, this change should save ~50% of overall tokens used. fixes https://github.com/BeehiveInnovations/zen-mcp-server/issues/255 but also refactored individual tools to instead encourage the agent to use the listmodels tool if needed.	2025-10-01 21:40:31 +04:00
Fahad	696b45f25e	fix: https://github.com/BeehiveInnovations/zen-mcp-server/issues/258	2025-10-01 20:23:38 +04:00
Fahad	7efb4094d4	test: update tests to match new Claude Sonnet 4.5 alias configuration - Updated sonnet alias to point to claude-sonnet-4.5 instead of 4.1 - Removed references to deprecated 'claude' alias - Added sonnet4.1 alias for claude-sonnet-4.1 backwards compatibility - All 809 tests passing	2025-10-01 19:57:43 +04:00
Fahad	bf9344963f	Merge branch 'pr-247-modified'	2025-10-01 19:51:29 +04:00
Fahad	104d09502a	test: fixed annotation	2025-10-01 19:28:54 +04:00
Beehive Innovations	77caef6f54	Merge pull request #260 from DragonFSKY/fix/consensus-model-context-issue fix: resolve consensus tool model_context parameter missing issue	2025-10-01 19:27:47 +04:00
Fahad	70fa088c32	feat: implement semantic cassette matching for o3 models Adds flexible cassette matching that ignores system prompt changes for o3 models, preventing CI failures when prompts are updated. Changes: - Semantic matching: Only compares model name, user question, and core params - Ignores: System prompts, conversation memory instructions, metadata - Prevents cassette breaks when prompts change between code versions - Added comprehensive tests for semantic matching behavior - Created maintenance documentation (tests/CASSETTE_MAINTENANCE.md) This solves the CI failure where o3-pro test cassettes would break whenever system prompts or conversation memory format changed. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-01 18:53:30 +04:00
Fahad	cff6d8998f	fix: removed use_websearch; this parameter was confusing Codex. It started using this to prompt the external model to perform searches! web-search is enabled by Claude / Codex etc by default and the external agent can ask claude to search on its behalf.	2025-10-01 18:44:11 +04:00
Beehive Innovations	f51da6e5f8	Merge branch 'main' into main	2025-10-01 18:19:02 +04:00
Devon Hillard	d13700c14c	test: Update OpenAI provider alias tests to match new format Updated test_supported_models_aliases.py to reflect the removal of self-referencing aliases: - Removed assertion for "o4-mini" in its own aliases (no longer self-referencing) - Updated "o3-pro" alias test to use "o3pro" (normalized alias format) - Fixed alias resolution test for o3pro -> o3-pro These changes align with the fix for duplicate model listings in listmodels output. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-09-09 19:08:10 -06:00
Sven Lito	525f4598ce	refactor: address code review feedback from Gemini - Extract restriction checking logic into reusable helper method - Refactor validate_model_name to reduce code duplication - Fix logging import by using existing module-level logger - Clean up test file by removing print statement and main block - All tests continue to pass after refactoring	2025-09-05 11:04:45 +07:00
Sven Lito	2db1323813	fix: respect custom OpenAI model temperature settings (#245 ) - OpenAI provider now checks custom models registry for user configurations - Custom models with supports_temperature=false no longer send temperature to API - Fixes 400 errors for custom o3/gpt-5 models configured without temperature support - Added comprehensive tests to verify the fix works correctly - Maintains backward compatibility with built-in models Fixes #245	2025-09-05 10:53:28 +07:00
谢栋梁	9044b63809	fix: resolve consensus tool model_context parameter missing issue Fixed runtime bug where _prepare_file_content_for_prompt was called without required model_context parameter, causing RuntimeError when processing requests with relevant_files. - Create ModelContext instance with model_name in _consult_model method - Pass model_context parameter to _prepare_file_content_for_prompt call - Add comprehensive regression test to prevent future occurrences - Maintain consensus tool's blinded design with independent model contexts	2025-09-03 10:55:22 +08:00
Fahad	4b202f5d1d	feat: refactored and tweaked model descriptions / schema to use fewer tokens at launch (average reduction per field description: 60-80%) without sacrificing tool effectiveness Disabled secondary tools by default (for new installations), updated README.md with instructions on how to enable these in .env run-server.sh now displays disabled / enabled tools (when DISABLED_TOOLS is set)	2025-08-22 09:23:59 +04:00
David Knedlik	4930824052	feat: Add comprehensive GPT-5 series model support - Add GPT-5, GPT-5-mini, and GPT-5-nano models to unified configuration - Implement proper thinking mode support via dynamic capability checking - Add OpenAI provider model enumeration methods for registry integration - Update tests to cover all GPT-5 models and their aliases - Fix critical bug where thinking mode was hardcoded instead of using model capabilities Breaking Changes: - None (backward compatible) New Models Available: - gpt-5 (400K context, 128K output, reasoning support) - gpt-5-mini (400K context, 128K output, efficient variant) - gpt-5-nano (400K context, fastest/cheapest variant) Aliases: - gpt5, gpt5-mini, gpt5mini, gpt5-nano, gpt5nano, nano All models support: - Extended thinking mode (reasoning tokens) - Vision capabilities - JSON mode - Function calling 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-08-21 14:27:00 -05:00
Fahad	80d21e57c0	feat: refactored and improved codereview in line with precommit. Reviews are now either external (default) or internal. Takes away anxiety and loss of tokens when Claude incorrectly decides to be 'confident' about its own changes and bungle things up. fix: Minor tweaks to prompts fix: Improved support for smaller models that struggle with strict structured JSON output Rearranged reasons to use the MCP above quick start (collapsed)	2025-08-21 14:04:32 +04:00
Fahad	0af9202012	Precommit updated to take always prefer external analysis (via _other_ model) unless specified not to. This prevents Claude from being overconfident and inadequately performing subpar precommit checks.	2025-08-20 11:55:40 +04:00
google-labs-jules[bot]	0959d6f0fa	feat: Update Claude models to Opus 4.1 and Sonnet 4.1 This commit updates all references to Claude Opus 4 and Sonnet 4 to their newer 4.1 versions throughout the codebase. The changes include: - Updating model names in `conf/custom_models.json` and `providers/dial.py`. - Updating aliases and descriptions to match the new model versions. - Updating `.env.example` to reflect the new model names. - Updating all relevant test suites to use the new model names and ensure all tests pass.	2025-08-17 16:08:52 +00:00
Beehive Innovations	e6213d4ca1	Merge pull request #227 from svnlto/fix/uvx-resource-packaging fix: uvx resource packaging issues for OpenRouter functionality	2025-08-11 12:31:49 -07:00
Sven Lito	673d78be6d	Fix failing tests and exclude .zen_venv from linting - Fix test_resource_loading_success by removing outdated mock targeting non-existent 'files' import - Simplify resource loading test to validate registry functionality directly - Add .zen_venv exclusion to ruff and black in code_quality_checks.sh - All tests now passing (793/793) with clean linting	2025-08-10 22:18:08 +07:00
Sven Lito	5e599b9e7d	Complete PR review feedback implementation with clean importlib.resources approach - Remove redundant path checks between Path("conf/custom_models.json") and Path.cwd() variants - Implement proper importlib.resources.files('conf') approach for robust packaging - Create conf/__init__.py to make conf a proper Python package - Update pyproject.toml to include conf* in package discovery - Clean up verbose comments and simplify resource loading logic - Fix test mocking to use correct importlib.resources.files target - All tests passing (8/8) with proper resource and fallback functionality Addresses all gemini-code-assist bot feedback from PR #227	2025-08-10 22:13:25 +07:00
Sven Lito	84de9b026f	Address PR review feedback: Implement proper importlib.resources approach Improvements based on gemini-code-assist bot feedback: 1. Proper importlib.resources implementation: - Use files("providers") / "../conf/custom_models.json" for resource loading - Prioritize resource loading over file system paths for packaged environments - Maintain backward compatibility with explicit config paths and env variables 2. Remove redundant path checks: - Eliminated duplicate Path("conf/custom_models.json") and Path.cwd() / "conf/custom_models.json" - Streamlined fallback logic to development path + working directory only 3. Enhanced test coverage: - Mock-based testing of actual fallback scenarios with Path.exists - Proper resource loading simulation and failure testing - Comprehensive coverage of both resource and file system modes 4. Robust error handling: - Graceful fallback from resources to file system when resource loading fails - Clear logging of which loading method is being used - Better error messages indicating resource vs file system loading The implementation now follows Python packaging best practices using importlib.resources while maintaining full backward compatibility and robust fallback behavior. Tested: All 8 test cases pass, resource loading works in development, file system fallback works when resources fail.	2025-08-10 21:36:40 +07:00

1 2 3 4 5 ...

280 Commits