High output tokens in a structured output query #2693
Replies: 1 comment
High output token usage with structured outputs can happen for a few reasons:

Since you're using GPT-5, it might be generating internal reasoning tokens that count toward the output token total.

Also, you're passing a PDF file with input_file. The model might be processing/analyzing that …

Try adding some debugging to see what's actually happening.

You can also try capping the output with max_tokens.

If you're still seeing 4000 tokens for such a small output, that does seem excessive. Could you …
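As a rough illustration of the debugging step, here is a minimal sketch that splits the reported output tokens into reasoning tokens and visible tokens. It assumes the usage data has the shape the Responses API reports (`input_tokens`, `output_tokens`, and `output_tokens_details.reasoning_tokens`) and that you have already turned it into a plain dict, for example with `response.usage.model_dump()` in the Python SDK. The helper name `summarize_usage` and the sample numbers are made up for illustration, they are not from the original thread.

```python
from typing import Any, Mapping


def summarize_usage(usage: Mapping[str, Any]) -> dict:
    """Split reported output tokens into reasoning and visible parts.

    `usage` is assumed to be a plain dict shaped like the Responses API
    usage object; missing fields are treated as zero.
    """
    output_total = int(usage.get("output_tokens", 0))
    details = usage.get("output_tokens_details") or {}
    reasoning = int(details.get("reasoning_tokens", 0))
    return {
        "input_tokens": int(usage.get("input_tokens", 0)),
        "output_tokens": output_total,
        "reasoning_tokens": reasoning,
        # Tokens that actually ended up in the structured output.
        "visible_output_tokens": output_total - reasoning,
        # Fraction of billed output tokens spent on hidden reasoning.
        "reasoning_share": reasoning / output_total if output_total else 0.0,
    }


# Hypothetical numbers, chosen to resemble the question (1000 in, 4000 out).
sample_usage = {
    "input_tokens": 1000,
    "output_tokens": 4000,
    "output_tokens_details": {"reasoning_tokens": 3840},
}

summary = summarize_usage(sample_usage)
for key, value in summary.items():
    print(f"{key}: {value}")
```

If most of the output tokens turn out to be reasoning tokens, the structured output itself is probably not the cause. Note that the name of the cap parameter depends on the API you call (as far as I know the Responses API calls it `max_output_tokens`), and the cap also covers reasoning tokens, so setting it too low can cut the response off before any visible output is produced.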
Hello.
I've implemented a query to the GPT-5 API using a structured output (no more than 20 fields of fewer than 3 words each). When inspecting the token usage of my queries, I see about 1000 input tokens and 4000 output tokens per query. Why is the output token count so high?