chore: add empty response retry test and fix flaky tests

- Add test:emptyretry script and include in test suite
- Fix test-interleaved-thinking: use complex prompt to force thinking
- Fix test-multiturn-thinking-tools: make Turn 2 lenient (thinking optional)
- Fix test-multiturn-thinking-tools-streaming: same lenient approach
- Use TEST_MODELS helper instead of hardcoded model ID

Models may skip thinking on obvious next steps - this is valid behavior.
Tests now only require thinking on first turn to verify signatures work.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
Badri Narayanan S
2026-01-08 17:54:48 +05:30
parent a696ed0872
commit 7375a2ef6d
6 changed files with 19 additions and 13 deletions

View File

@@ -106,7 +106,7 @@ Please do this step by step, reading each file before modifying.`
messages: [
{
role: 'user',
content: `Read src/config.js and tell me if debug mode is enabled.`
content: `Analyze the src/config.js file structure and explain the security implications of each setting. What are the potential risks if this config were exposed in production?`
},
{ role: 'assistant', content: result.content },
{