-
Notifications
You must be signed in to change notification settings - Fork 2k
feat: add and update openai reasoning_effort tests #4815
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: add and update openai reasoning_effort tests #4815
Conversation
Signed-off-by: Alexandros Pappas <apappascs@gmail.com>
8a29f9d to
aec4bf8
Compare
Signed-off-by: Alexandros Pappas <apappascs@gmail.com>
… in OpenAiChatModelIT Signed-off-by: Alexandros Pappas <apappascs@gmail.com>
| } | ||
|
|
||
| @Test | ||
| @Disabled("The reasoning_effort option is only available in o1 models.") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This test result isn't consistent with the current settings as the reasoning tokens end up being "0" in some cases when the model doesn't seem to use reasoning tokens for simple problems unless being more specific with the prompt message.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I will update the prompt when merging.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @ilayaperumalg for pointing this out.
just updated with the example from claude https://docs.claude.com/en/docs/build-with-claude/extended-thinking
and set it to high. already run it 10 times without failure, but I am not sure if there is a case to have reasoning tokens end up being "0"
| } | ||
|
|
||
| @Test | ||
| void testReasoningEffortWithDifferentLevels() { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This test can be an overhead as this doesn't exactly verify the reasoning tokens from the output. What do you think?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree just wanted to verify all reasoning levels work. lemme update the PR
Signed-off-by: Alexandros Pappas <apappascs@gmail.com>
|
@apappascs Thank you for the quick follow up and updates. The changes look good. Rebased, squashed and merged as e157775 |
#4804
Thank you for taking time to contribute this pull request!
You might have already read the contributor guide, but as a reminder, please make sure to:
git commit -s) per the DCOmainbranch and squash your commitsFor more details, please check the contributor guide.
Thank you upfront!