starred/DeepClaude

Fork 0

mirror of https://github.com/ErlichLiu/DeepClaude.git synced 2026-04-25 05:05:57 +03:00

[GH-ISSUE #9] 高效利用R1的剩余回答 #3

New issue

Open

opened 2026-02-27 01:55:48 +03:00 by kerem · 4 comments

kerem commented

2026-02-27 01:55:48 +03:00

Owner

Originally created by @GowayLee on GitHub (Feb 4, 2025).
Original GitHub issue: https://github.com/ErlichLiu/DeepClaude/issues/9

项目基础功能好像实现得差不多啦🎉

但是... 感觉每次都截断只使用R1的推理过程有点浪费token.

🤯希望大家可以讨论一下如何高效利用R1的剩余回答

例如

可以把R1的回答也发给Sonnet, 让它审查并改进? 但是这样需要精妙设置Prompt, 而且可能会随机扰乱Sonnet的发挥.
...

Originally created by @GowayLee on GitHub (Feb 4, 2025). Original GitHub issue: https://github.com/ErlichLiu/DeepClaude/issues/9 项目基础功能好像实现得差不多啦🎉 **但是**... 感觉每次都截断只使用R1的推理过程有点浪费token. 🤯希望大家可以讨论一下如何高效利用R1的剩余回答 ### 例如 - 可以把R1的回答也发给Sonnet, 让它审查并改进? 但是这样需要精妙设置Prompt, 而且可能会随机扰乱Sonnet的发挥. - ...

kerem added the

question

label

2026-02-27 01:55:48 +03:00

kerem commented

2026-02-27 01:55:48 +03:00

Author

Owner

@jiacheo commented on GitHub (Feb 5, 2025):

把 max_tokens 设置为 1 试试

@jiacheo commented on GitHub (Feb 5, 2025): 把 max_tokens 设置为 1 试试

kerem commented

2026-02-27 01:55:48 +03:00

Author

Owner

@GowayLee commented on GitHub (Feb 5, 2025):

把 max_tokens 设置为 1 试试

推理模型的思维链也算token, 刚刚试了一下, 发现这样会导致推理过程输出1token后直接停了

@GowayLee commented on GitHub (Feb 5, 2025): > 把 max_tokens 设置为 1 试试推理模型的思维链也算token, 刚刚试了一下, 发现这样会导致推理过程输出1token后直接停了

kerem commented

2026-02-27 01:55:48 +03:00

Author

Owner

@ErlichLiu commented on GitHub (Feb 5, 2025):

是的，我也是尝试了一下，这个方案确实不太行，我们再想想其他的方案吧

@ErlichLiu commented on GitHub (Feb 5, 2025): 是的，我也是尝试了一下，这个方案确实不太行，我们再想想其他的方案吧

kerem commented

2026-02-27 01:55:48 +03:00

Author

Owner

@GowayLee commented on GitHub (Feb 5, 2025):

DeepSeek和SiliconFlow都支持在补全请求中设置stop参数

可以尝试用Prompt强迫模型在开口回答前先输出某一个特殊字符组合, 然后配合stop参数来中止推理

(之前可以很固定输出</think>, 结果现在没有了😅)

@GowayLee commented on GitHub (Feb 5, 2025): DeepSeek和SiliconFlow都支持在补全请求中设置`stop`参数 ![Image](https://github.com/user-attachments/assets/20e3ad56-e1bb-4890-9368-2e188d60b67b) 可以尝试用Prompt强迫模型在开口回答前先输出某一个特殊字符组合, 然后配合`stop`参数来中止推理 (之前可以很固定输出`</think>`, 结果现在没有了😅)

kerem referenced this issue

2026-02-27 01:56:12 +03:00

[PR #4] [MERGED] 适配基于OneApi的中转Claude #93

kerem referenced this issue

2026-02-27 01:56:12 +03:00

[PR #3] [MERGED] add support to openrouter claude api #94