• v1.0.8 5c4fae1db9

    v1.0.8 Stable

    kerem released this 2026-01-04 06:51:32 +03:00 | 141 commits to main since this release

    📅 Originally published on GitHub: Sun, 04 Jan 2026 03:55:59 GMT
    🏷️ Git tag created: Sun, 04 Jan 2026 03:51:32 GMT

    This release adds full support for AWS SSO OIDC authentication (Builder ID) and introduces fake reasoning with extended thinking capabilities.

    New Features

    • Extended Thinking (Fake Reasoning): Implement fake reasoning with extended thinking support. The gateway can now inject thinking mode tags into prompts and parse <thinking> blocks from responses, enabling reasoning-like behavior with compatible models. Enabled by default. Includes system prompt legitimization to prevent models from treating thinking tags as prompt injection (#11)
    • AWS SSO OIDC Support: Full support for kiro-cli credentials using AWS IAM Identity Center (SSO) / Builder ID authentication. The gateway automatically reads credentials from kiro-cli's SQLite database, supporting both kirocli:* and codewhisperer:* key formats for compatibility with different kiro-cli versions. Builder ID users don't need to specify PROFILE_ARN (#12)

    🐛 Bug Fixes

    • User-Agent Format: Use original KiroIDE User-Agent format (KiroIDE-0.7.45-{fingerprint}) for better API compatibility

    ⚙️ Configuration

    AWS SSO OIDC (kiro-cli) Support:

    Variable Description Default
    KIRO_CLI_DB_FILE Path to kiro-cli SQLite database (e.g., ~/.local/share/kiro-cli/data.sqlite3) -
    KIRO_CREDS_FILE Path to AWS SSO cache JSON file (e.g., ~/.aws/sso/cache/*.json) -
    PROFILE_ARN AWS CodeWhisperer profile ARN. Not needed for AWS SSO OIDC (Builder ID) -

    Extended Thinking (Fake Reasoning):

    Variable Description Default
    FAKE_REASONING_ENABLED Enable fake reasoning mode with thinking tags injection true
    FAKE_REASONING_MAX_TOKENS Maximum tokens for thinking content 4000
    FAKE_REASONING_HANDLING How to handle thinking blocks: as_reasoning_content, remove, pass, strip_tags as_reasoning_content
    FAKE_REASONING_INITIAL_BUFFER_SIZE Buffer size for tag detection (characters). Lower = faster first token 20

    📝 Documentation

    • Code comments: Translating comments from Russian to English (only in modified files) for better international collaboration

    🙏 Contributors

    • Thanks to @uratmangun for testing, debugging, and providing the fix for AWS SSO OIDC support (#12)
    • Thanks to @JoeGrimes123 for suggesting the fake reasoning approach (#11)

    Full Changelog: https://github.com/jwadow/kiro-openai-gateway/compare/v1.0.7...v1.0.8

    Downloads