my-pal-mcp-server

Author	SHA1	Message	Date
Fahad	d2773f488a	fix: configure codex with a longer timeout refactor: param names	2025-10-21 10:35:44 +04:00
Fahad	ece8a5ebed	feat!: Full code can now be generated by an external model and shared with the AI tool (Claude Code / Codex etc)! model definitions now support a new `allow_code_generation` flag, only to be used with higher reasoning models such as GPT-5-Pro and-Gemini 2.5-Pro When `true`, the `chat` tool can now request the external model to generate a full implementation / update / instructions etc and then share the implementation with the calling agent. This effectively allows us to utilize more powerful models such as GPT-5-Pro to generate code for us or entire implementations (which are either API-only or part of the $200 Pro plan from within the ChatGPT app)	2025-10-07 18:49:13 +04:00
Fahad	fcb0fe3ef2	Fix o3-pro model resolution to use o3-pro consistently - Use o3-pro throughout the codebase instead of o3-pro-2025-06-10 - Update test expectations to match o3-pro model name - Update cassette to use o3-pro for consistency - Ensure responses endpoint routing works correctly with o3-pro 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-08-08 10:52:23 +05:00
Josh Vera	7003ae60e0	lint	2025-07-13 12:13:43 -06:00
Josh Vera	780d4ef207	fix: Clear restriction service in o3-pro test setup for proper isolation The o3-pro test now clears the restriction service singleton in its setup_method to ensure it re-reads environment variables set by the @patch.dict decorator. This prevents cached restrictions from previous tests (like test_fallback_with_shorthand_restrictions) from blocking the o3-pro model. This is a minimal, targeted fix that only affects the specific test that needs it, without breaking other tests that may depend on the restriction service state. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-13 11:59:54 -06:00
Josh Vera	ac7d489cb4	refactor: Simplify logging and conform to pytest conventions - Removed excessive debug logging in http_transport_recorder.py - Consolidated redundant log statements - Fixed exception logging to use logger.exception() - Removed emojis from log messages for cleaner output - Removed __main__ block from test_o3_pro_output_text_fix.py per pytest conventions - Applied black formatting to comply with CI checks 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-13 10:49:37 -06:00
Josh Vera	9248947e39	fix: Resolve o3-pro test isolation issues and convert print to logging - Fix test isolation by clearing LOCALE env var in o3-pro test - Add restriction service cleanup in test_model_restrictions.py - Fix PII sanitizer phone regex to not match timestamps - Convert all print statements to logging in test files per PR review - Re-record o3-pro cassette with correct environment 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-13 10:41:43 -06:00
Josh Vera	3b1c80865b	fix: Resolve test isolation issues for o3-pro test - Fixed test_fallback_with_shorthand_restrictions to clear restriction service singleton in finally block, preventing state leakage - Updated o3-pro test to use @patch.dict for OPENAI_ALLOWED_MODELS, following standard pattern and allowing both o3-pro and o3-pro-2025-06-10 - Removed invalid cassette file that had wrong request content The test now passes in both isolated and full suite runs. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-13 10:22:03 -06:00
Josh Vera	538ac55880	fix: Clear restriction service singleton in o3-pro test setup The test was failing when run in the full test suite because the ModelRestrictionService singleton persisted restrictions from previous tests. Specifically, test_fallback_with_shorthand_restrictions sets OPENAI_ALLOWED_MODELS="mini" which blocked o3-pro. Added utils.model_restrictions._restriction_service = None to ensure the test starts with clean restriction state. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-13 10:08:05 -06:00
Josh Vera	91605bbd98	feat: Implement code review improvements from gemini-2.5-pro analysis ✨ Key improvements: • Added public reset_for_testing() method to registry for clean test state management • Updated test setup/teardown to use new public API instead of private attributes • Enhanced inject_transport helper to ensure OpenAI provider registration • Migrated additional test files to use inject_transport pattern • Reduced code duplication by ~30 lines across test files 🔧 Technical details: • transport_helpers.py: Always register OpenAI provider for transport tests • test_o3_pro_output_text_fix.py: Use reset_for_testing() API, remove redundant registration • test_o3_pro_fixture_bisect.py: Migrate all 4 test methods to inject_transport • test_o3_pro_simplified.py: Migrate both test methods to inject_transport • providers/registry.py: Add reset_for_testing() public method ✅ Quality assurance: • All 7 o3-pro tests pass with new helper pattern • No regression in test isolation or provider state management • Improved maintainability through centralized transport injection • Follows single responsibility principle with focused helper function 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-13 09:53:49 -06:00
Josh Vera	17b97751ab	refactor: Simplify o3-pro test by removing fixture and monkey patching boilerplate - Remove over-engineered allow_all_models fixture (6 operations → 1 line API key setting) - Replace 10 lines of monkey patching boilerplate with 1-line inject_transport helper - Remove cargo-cult error handling that allowed test to pass with API failures - Create reusable transport_helpers.py for HTTP transport injection patterns - Fix provider registration state pollution between batch test runs - Test now works reliably in both individual and batch execution modes The test is significantly cleaner and addresses root cause (provider registration timing) rather than symptoms (cache clearing). 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-13 08:12:20 -06:00
Josh Vera	83e8b67234	test: Enhance o3-pro test to verify model metadata and response parsing - Add verification that o3-pro model was actually used (not just requested) - Verify model_used and provider_used metadata fields are populated - Add graceful handling for error responses in test - Improve test documentation explaining what's being verified - Confirm response parsing uses output_text field correctly This ensures the test properly validates both that: 1. The o3-pro model was selected and used via the /v1/responses endpoint 2. The response metadata correctly identifies the model and provider 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-13 06:09:31 -06:00
Josh Vera	3db49413ff	fix: Resolve o3-pro response parsing and test execution issues - Fix lint errors: trailing whitespace and deprecated typing imports - Update test mock for o3-pro response format (output.content[] → output_text) - Implement robust test isolation with monkeypatch fixture - Clear provider registry cache to prevent test interference - Ensure o3-pro tests pass in both individual and full suite execution 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-12 20:24:34 -06:00
Josh Vera	a1451befd2	refactor: Clean up test files and simplify documentation - Remove unused cassette files with incomplete recordings - Delete broken respx test files (test_o3_pro_respx_simple.py, test_o3_pro_http_recording.py) - Fix respx references in docstrings to mention HTTP transport recorder - Simplify vcr-testing.md documentation (60% reduction, more task-oriented) - Add simplified PR template with better test instructions - Fix cassette path consistency in examples - Add security note about reviewing cassettes before committing 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-12 19:24:51 -06:00
Josh Vera	7f92085c70	feat: Fix o3-pro response parsing and implement HTTP transport recorder - Fix o3-pro response parsing to use output_text convenience field - Replace respx with custom httpx transport solution for better reliability - Implement comprehensive PII sanitization to prevent secret exposure - Add HTTP request/response recording with cassette format for testing - Sanitize all existing cassettes to remove exposed API keys - Update documentation to reflect new HTTP transport recorder - Add test suite for PII sanitization and HTTP recording This change: 1. Fixes timeout issues with o3-pro API calls (was 2+ minutes, now ~15-22 seconds) 2. Properly captures response content without httpx.ResponseNotRead exceptions 3. Preserves original HTTP response format including gzip compression 4. Prevents future secret exposure with automatic PII sanitization 5. Enables reliable replay testing for o3-pro interactions Co-Authored-By: Claude <noreply@anthropic.com>	2025-07-12 18:47:17 -06:00

15 Commits