Commit Graph

  • ed386375be Complete Redis mocking fixes for image support integration tests Fahad 2025-06-16 16:26:23 +04:00
  • a65c63c8da Fix Redis mocking in image support integration tests Fahad 2025-06-16 16:20:14 +04:00
  • 3049c85e3c Update base.py Beehive Innovations 2025-06-16 14:51:34 +04:00
  • d7982b55f8 Update advanced-usage.md Beehive Innovations 2025-06-16 14:50:13 +04:00
  • ff063cf247 Update CLAUDE.md Beehive Innovations 2025-06-16 14:50:00 +04:00
  • 0143140c34 Fix line length violations and code quality improvements Fahad 2025-06-16 13:21:17 +04:00
  • 061fb8691d Merge main into feature/images - resolve conflicts favoring our approach Fahad 2025-06-16 13:19:08 +04:00
  • 97fa6781cf Vision support via images / pdfs etc that can be passed on to other models as part of analysis, additional context etc. Image processing pipeline added OpenAI GPT-4.1 support Chat tool prompt enhancement Lint and code quality improvements Fahad 2025-06-16 13:14:53 +04:00
  • d6d7bf8cac Fixed internal file path translation into docker Fahad 2025-06-16 11:30:02 +04:00
  • d498e9854b Updated readme with an amazing new discovery Fahad 2025-06-16 10:35:39 +04:00
  • 8307e32541 Updated readme with an amazing new discovery Lint Fahad 2025-06-16 10:28:39 +04:00
  • 8bbadc6505 Updated readme with an amazing new discovery Fahad 2025-06-16 10:25:17 +04:00
  • 8e2b53b90d Updated readme with an amazing new discovery Improved prompt Fahad 2025-06-16 09:55:40 +04:00
  • eebe67170d Merge pull request #51 from BeehiveInnovations/improve/file-loading Beehive Innovations 2025-06-16 07:19:07 +04:00
  • 0b94dd8cdd Lint Fahad 2025-06-16 07:18:45 +04:00
  • 4c0bd3b86d Improved documentation for conversation / file collection strategy, context budget allocation etc Fahad 2025-06-16 07:17:35 +04:00
  • 5a49d196c8 More integration tests Fahad 2025-06-16 07:07:38 +04:00
  • 35f37fb92e Fixed integration test for auto mode Fahad 2025-06-16 07:00:27 +04:00
  • c643970ffb Fixed integration test for auto mode Fahad 2025-06-16 06:57:06 +04:00
  • 903aabd311 Fixed imports and lint Fahad 2025-06-16 06:24:33 +04:00
  • b43b30b49d Fixed regex Fahad 2025-06-16 06:22:10 +04:00
  • e183e1bfff Refactor log monitor to eliminate code duplication Fahad 2025-06-16 06:19:53 +04:00
  • be157ab771 Remove unused create_line_handler function Fahad 2025-06-16 06:16:21 +04:00
  • c9798325c7 Extra logging Fahad 2025-06-16 06:09:58 +04:00
  • 805e8d6d01 Fix remaining TestGenRequest reference in format_response method Fahad 2025-06-16 06:08:58 +04:00
  • 2cfe0b163a Fix all failing tests and pytest collection warnings Fahad 2025-06-16 06:02:12 +04:00
  • 8c3efd5676 Bump Fahad 2025-06-16 05:52:17 +04:00
  • 91077e3810 Performance improvements when embedding files: - Exit early at MCP boundary if files won't fit within given context of chosen model - Encourage claude to re-run with better context - Check file sizes before embedding - Drop files from older conversations when building continuations and give priority to newer files - List and mention excluded files to Claude on return - Improved tests - Improved precommit prompt - Added a new Low severity to precommit - Improved documentation of file embedding strategy - Refactor Fahad 2025-06-16 05:51:52 +04:00
  • 56333cbd86 Fixed numbering Fahad 2025-06-15 19:23:45 +04:00
  • 1d070e43fd Improved Fahad 2025-06-15 19:23:08 +04:00
  • d8f4eb99f5 Improved Fahad 2025-06-15 19:21:57 +04:00
  • 978b6ef155 Improved Fahad 2025-06-15 19:20:15 +04:00
  • 0bb54d721a Moved API editing instructions above run-server Fahad 2025-06-15 19:16:01 +04:00
  • 7ef99fb9ce Improved schema description for precommit Fahad 2025-06-15 19:11:37 +04:00
  • 6dcf095c3d Improved schema description to allow Claude to pre-think harder before invoking thinkdeep Fahad 2025-06-15 19:05:23 +04:00
  • ad6cff4498 Lint, bump Fahad 2025-06-15 18:43:37 +04:00
  • dfed6f0cbd New tool: "tracer" helps with static analysis / call-flow generation. Does NOT use external models. Used as a quick prompt generator to aid in call-flow / dependency-chart generation. Can be used as an input into another tool / model for extended analysis and deeper thought. Faster docker restarts Fahad 2025-06-15 18:42:10 +04:00
  • 6f8d3059a1 Merge branch 'main' into feature/tracer Fahad 2025-06-15 16:09:54 +04:00
  • f3720ad8e9 Use mock-reddis Fahad 2025-06-15 16:09:07 +04:00
  • 07a078b4f2 Updated tests and additional tests for folder expansion during conversation tracking Fahad 2025-06-15 16:03:43 +04:00
  • d36a85a3f3 Fix directory expansion tracking in conversation memory Fahad 2025-06-15 15:36:12 +04:00
  • 99ed09be8d Update tracer tool for new file tracking system Fahad 2025-06-15 15:36:33 +04:00
  • 64a1d8664e Fix directory expansion tracking in conversation memory Fahad 2025-06-15 15:36:12 +04:00
  • 86728a1442 WIP Fahad 2025-06-15 15:32:41 +04:00
  • 9b8ea72280 Fixed for git actions Fahad 2025-06-15 14:14:15 +04:00
  • 3bc7956239 Implement TracePath tool for static call path analysis Fahad 2025-06-15 14:07:23 +04:00
  • 6304b7af6b Native support for xAI Grok3 Model shorthand mapping related fixes Comprehensive auto-mode related tests Fahad 2025-06-15 12:21:44 +04:00
  • 4becd70a82 Perform prompt size checks only at the MCP boundary New test to confirm history build-up and system prompt does not affect prompt size checks Also check for large prompts in focus_on Fixed .env.example incorrectly did not comment out CUSTOM_API causing the run-server script to think at least one key exists Fahad 2025-06-15 10:37:08 +04:00
  • 3b03783ea7 Instructions on how to add a new tool Fahad 2025-06-15 09:42:28 +04:00
  • 44db8c9e63 Updated to reflect actual prompt Fahad 2025-06-15 09:37:56 +04:00
  • 3c56694d3f Formatting Fahad 2025-06-15 09:34:37 +04:00
  • f4395a796c Typo Fahad 2025-06-15 09:33:48 +04:00
  • 81ab4da60c Typo Fahad 2025-06-15 09:33:18 +04:00
  • 77faea24e1 Typo Fahad 2025-06-15 09:30:55 +04:00
  • 8364170881 Merge remote-tracking branch 'origin/main' Fahad 2025-06-15 09:25:58 +04:00
  • c7835e7eef Easier access to logs at startup with -f on the run script Improved prompt for immediate action Additional logging of tool names Updated documentation Context aware decomposition system prompt New script to run code quality checks Fahad 2025-06-15 09:25:52 +04:00
  • 318b5c7ae7 Easier access to logs at startup Updated documentation Fahad 2025-06-15 07:09:44 +04:00
  • 99fab3e83d Docs added to show how a new provider is added Docs added to show how a new tool is created All tools should add numbers to code for models to be able to reference if needed Enabled line numbering for code for all tools to use Additional tests to validate line numbering is not added to git diffs Fahad 2025-06-15 07:02:27 +04:00
  • b5004b91fc Major new addition: refactor tool Supports decomposing large components and files, finding codesmells, finding modernizing opportunities as well as code organization opportunities. Fix this mega-classes today! Line numbers added to embedded code for better references from model -> claude Fahad 2025-06-15 06:00:01 +04:00
  • be907a7b29 Merge pull request #39 from NikolaiUgelvik/fix-openrouter-auto-mode Beehive Innovations 2025-06-14 21:37:17 +04:00
  • 0eeea3dd67 Apply black formatting to test_openrouter_provider.py Nikolai Ugelvik 2025-06-14 19:33:20 +02:00
  • be2612752a Fix auto mode when only OpenRouter is configured Nikolai Ugelvik 2025-06-14 18:51:45 +02:00
  • 70f1356e3e Improved trigger words to enforce large prompts are passed in as a file reference Fahad 2025-06-14 21:03:17 +04:00
  • 4cacd2dad9 Fixed trigger word Fahad 2025-06-14 20:14:40 +04:00
  • 6b05096ba0 testgen grounding Fahad 2025-06-14 20:10:59 +04:00
  • b2489409eb Move o3-pro test into its own Fahad 2025-06-14 19:53:33 +04:00
  • 9f973b90e5 Merge pull request #36 from lox/add-o3-pro-support Beehive Innovations 2025-06-14 19:44:14 +04:00
  • 68a75a7791 Updated lint instructions for PRs Fahad 2025-06-14 19:37:53 +04:00
  • a8fd7f3d24 Bump Fahad 2025-06-14 19:32:11 +04:00
  • f1ad06c529 Fixed lint, tests after recent fix Updated readme Fahad 2025-06-14 19:31:31 +04:00
  • b41b874e31 Fixed model name mapping for openrouter Fahad 2025-06-14 19:19:59 +04:00
  • b405aaf8bd some justifications Fahad 2025-06-14 19:01:53 +04:00
  • a4f9e22256 Renamed version tool Fahad 2025-06-14 18:54:53 +04:00
  • 442decba70 Improved model response handling to handle additional response statuses in future Improved testgen; encourages follow-ups with less work in between and less token generation to avoid surpassing the 25K barrier Improved coderevew tool to request a focused code review instead where a single-pass code review is too large or complex Fahad 2025-06-14 18:43:56 +04:00
  • ec5fee4409 Bump Fahad 2025-06-14 16:48:21 +04:00
  • 698c1611d2 Lint fix Fahad 2025-06-14 16:47:59 +04:00
  • d0d0a171dc Ensure duplicate file references are gracefully handled Improved prompt to encourage immediate action Fahad 2025-06-14 16:37:02 +04:00
  • ec707e021a Fix for path translation within docker Fahad 2025-06-14 16:00:54 +04:00
  • acbfa1c94e Improved prompt for next steps Fahad 2025-06-14 15:51:04 +04:00
  • 4086306c58 New tool: testgen Generates unit tests and encourages model to auto-detect framework and testing style from existing sample (if available) Fahad 2025-06-14 15:41:47 +04:00
  • 40aa1eaeb6 Format test_auto_mode.py with black Lachlan Donald 2025-06-14 21:09:47 +10:00
  • 7d33aafcab Configurable conversation limit now set to 10 exchanges. This helps when you want to manually continue a thread of thought across different models manually. Fahad 2025-06-14 14:00:13 +04:00
  • bc3f98a291 Make conversation timeout configuration (so that you're able to resume a discussion manually with another model with a gap of several hours in case you stepped away) Fahad 2025-06-14 13:27:19 +04:00
  • a569e316af Update README.md Beehive Innovations 2025-06-14 12:41:15 +04:00
  • 710a2ab0eb Surface important public notice Fahad 2025-06-14 11:48:57 +04:00
  • 7481af5c8f Surface important public notice Fahad 2025-06-14 11:47:29 +04:00
  • 4a3767921a Surface important public notice Fahad 2025-06-14 11:46:32 +04:00
  • 002203f0da Surface important public notice Fahad 2025-06-14 11:45:45 +04:00
  • b17fe06d27 bump Fahad 2025-06-14 11:34:28 +04:00
  • e0a05b86f1 Add encouraging message about powerful models to schema in case it's not on Opus 4 or above OPENROUTER_ALLOWED_MODELS environment variable support to further limit the models to allow from within Claude. This will put a limit on top of even the ones listed in custom_models.json Fahad 2025-06-14 11:34:17 +04:00
  • 21037c2d81 Refactored prompts for better maintainability Fahad 2025-06-14 11:09:13 +04:00
  • 94b2a4d407 Bump Fahad 2025-06-14 10:57:08 +04:00
  • 23353734cd Support for allowed model restrictions per provider Tool escalation added to analyze to a graceful switch over to codereview is made when absolutely necessary Fahad 2025-06-14 10:56:53 +04:00
  • c12dc1d765 Fix syntax error from incomplete merge conflict resolution Lachlan Donald 2025-06-14 15:50:40 +10:00
  • a3aaf6f79b Enhance o3-pro test coverage with comprehensive codereview testing Lachlan Donald 2025-06-14 13:51:37 +10:00
  • 69ec38d1af Add o3-pro model support and extend test coverage Lachlan Donald 2025-06-14 13:02:44 +10:00
  • ac9c58ce61 Use flash for the comprehensive simulation test to run quicker Fahad 2025-06-14 09:42:10 +04:00
  • 2c805d6637 Fixed mock comparison error Fahad 2025-06-14 09:34:56 +04:00
  • 746380eb7f Renamed setup script to avoid confusion (https://github.com/BeehiveInnovations/zen-mcp-server/issues/35) Further fixes to tests Pass O3 simulation test when keys are not set, along with a notice Updated docs on testing, simulation tests / contributing Support for OpenAI o4-mini and o4-mini-high Fahad 2025-06-14 09:28:20 +04:00
  • c5f682c7b0 Fix tests to work with effective auto mode changes Fahad 2025-06-14 02:43:29 +04:00