Fix: Strip unsupported JSON Schema keywords for structured outputs #2733
Conversation
💡 Codex Review
Here are some automated review suggestions for this pull request.
Thank you for the review feedback! You're absolutely right, the initial implementation missed `additionalProperties`.

**Fix applied:** I've added recursive processing for `additionalProperties` in `_ensure_strict_json_schema()`.

**Verification:** Tested with:

```python
class ProductPricing(BaseModel):
    prices: Dict[str, Decimal] = Field(description="Product prices by region")
```

Result: ✅ no `pattern` keyword in the generated schema. All unsupported keywords are now stripped recursively throughout the entire schema tree.
Resolves openai#2718, where `Decimal` fields caused 500 errors with `responses.parse()`.

Root cause: Pydantic generates JSON schemas with validation keywords like `pattern`, `minLength`, `format`, etc. that are not supported by OpenAI's structured outputs in strict mode. This caused models with `Decimal` fields to fail with a 500 Internal Server Error on some GPT-5 models (gpt-5-nano).

Solution: Enhanced `_ensure_strict_json_schema()` to strip unsupported JSON Schema keywords before sending the schema to the API. This preserves the core type structure while removing the validation constraints that cause API rejections.

Keywords stripped:
- `pattern` (regex validation, the main issue for `Decimal`)
- `format` (date-time, email, etc.)
- `minLength`/`maxLength` (string length)
- `minimum`/`maximum` (numeric bounds)
- `minItems`/`maxItems` (array size)
- `minProperties`/`maxProperties` (object size)
- `uniqueItems`, `multipleOf`, `patternProperties`
- `exclusiveMinimum`/`exclusiveMaximum`

Impact:
- `Decimal` fields now work with all GPT-5 models
- Other constrained types (datetime, length-limited strings) are also fixed
- Maintains backward compatibility
- Validation still occurs in Pydantic after parsing

Changes:
- `src/openai/lib/_pydantic.py`: added keyword-stripping logic
- `tests/lib/test_pydantic.py`: added test for `Decimal` field handling

Test results:
- `Decimal` schemas no longer contain the `pattern` keyword
- Schema structure preserved (`anyOf` with number/string)
- All model types (String, Float, Decimal) generate valid schemas
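The stripping step described above can be sketched as follows. This is a minimal illustration, not the actual `_ensure_strict_json_schema()` implementation; the helper name and the recursion cases shown are simplified.

```python
# Illustrative sketch of recursive keyword stripping; not the real
# openai-python code, just the technique the commit message describes.
from typing import Any

# Keywords rejected by strict structured outputs (per the change description).
_UNSUPPORTED_KEYWORDS = {
    "pattern", "format", "minLength", "maxLength",
    "minimum", "maximum", "exclusiveMinimum", "exclusiveMaximum",
    "minItems", "maxItems", "minProperties", "maxProperties",
    "uniqueItems", "multipleOf", "patternProperties",
}

def strip_unsupported(schema: dict[str, Any]) -> dict[str, Any]:
    """Remove unsupported validation keywords, recursing into subschemas."""
    cleaned = {k: v for k, v in schema.items() if k not in _UNSUPPORTED_KEYWORDS}
    if isinstance(cleaned.get("properties"), dict):
        cleaned["properties"] = {
            name: strip_unsupported(sub) for name, sub in cleaned["properties"].items()
        }
    if isinstance(cleaned.get("items"), dict):
        cleaned["items"] = strip_unsupported(cleaned["items"])
    for key in ("anyOf", "allOf"):
        if isinstance(cleaned.get(key), list):
            cleaned[key] = [strip_unsupported(s) for s in cleaned[key]]
    return cleaned

# Roughly the shape Pydantic emits for a Decimal field (simplified).
schema = {
    "type": "object",
    "properties": {
        "price": {"anyOf": [{"type": "number"}, {"type": "string", "pattern": r"^\d+$"}]}
    },
}
print(strip_unsupported(schema))
```

After stripping, the `anyOf` branches keep their `type` keys, so the schema still describes a number-or-string value; only the `pattern` constraint is gone, and Pydantic re-validates it after parsing.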
Fixes the issue identified in the Codex review where `Dict[str, Decimal]` would still fail because `additionalProperties` schemas were not being recursively processed.

The previous fix stripped unsupported keywords from the top-level schema and recursively processed `properties`, `items`, `anyOf`, and `allOf`. However, it missed `additionalProperties`, which Pydantic uses for typed dictionaries like `Dict[str, Decimal]`.

Changes:
- Added recursive processing for `additionalProperties` in `_ensure_strict_json_schema()`
- Added a test for `Dict[str, Decimal]` to verify that `pattern` keywords are stripped from nested schemas within `additionalProperties`

Test results:
- `Dict[str, Decimal]` now generates schemas without `pattern` keywords
- `additionalProperties.anyOf` is properly sanitized
- All constrained types work in dictionary values
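The missed case can be reproduced with a dictionary-valued schema. The sketch below is a simplified stand-in for the fixed `_ensure_strict_json_schema()`, using an abbreviated keyword set, to show why the `additionalProperties` recursion is needed.

```python
# Minimal sketch of the additionalProperties fix described above; the
# helper and the abbreviated keyword set are illustrative only.
from typing import Any

UNSUPPORTED = {"pattern", "format", "minLength", "maxLength"}  # abbreviated

def strip_unsupported(schema: dict[str, Any]) -> dict[str, Any]:
    cleaned = {k: v for k, v in schema.items() if k not in UNSUPPORTED}
    for key in ("anyOf", "allOf"):
        if isinstance(cleaned.get(key), list):
            cleaned[key] = [strip_unsupported(s) for s in cleaned[key]]
    # The fix: recurse into additionalProperties, which Pydantic emits
    # for typed dicts like Dict[str, Decimal].
    if isinstance(cleaned.get("additionalProperties"), dict):
        cleaned["additionalProperties"] = strip_unsupported(cleaned["additionalProperties"])
    return cleaned

# Simplified shape of what Pydantic generates for Dict[str, Decimal].
dict_schema = {
    "type": "object",
    "additionalProperties": {
        "anyOf": [{"type": "number"}, {"type": "string", "pattern": r"^\d+"}]
    },
}
cleaned = strip_unsupported(dict_schema)
print(cleaned["additionalProperties"]["anyOf"][1])
```

Without the `additionalProperties` branch, the nested `anyOf` is never visited and the `pattern` keyword survives, which is exactly the failure the review caught.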
Force-pushed `2b33263` to `3b29879`.
Branch has been rebased on latest main (includes releases 2.7.0 and 2.7.1). All tests still passing ✅
Changes being requested
Fixes #2718: `responses.parse()` now handles `Decimal` fields correctly with GPT-5 models.

The issue was that `Decimal` fields in Pydantic models weren't being properly serialized during JSON Schema generation, causing 500 errors when using structured outputs with certain GPT-5 models (specifically gpt-5-nano).

Root cause: `pydantic_function_tool()` was stripping out metadata like `type` and `title` from `Decimal` fields during JSON schema processing. This made the schema invalid for the API.

Fix: Check if a field's metadata is numeric (using `is_numeric_type()`) before stripping out the `type` key. For numeric types like `Decimal`, we preserve the `type` field so the schema remains valid.

Changed in:
- `src/openai/_utils/_transform.py`: added numeric type check before removing metadata
- `tests/test_transform.py`: added test cases for `Decimal` field handling

Additional context & links
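A rough sketch of the numeric-type guard described above. The helper names mirror the description; the actual code in `src/openai/_utils/_transform.py` may differ, so treat this as an illustration of the check, not the real implementation.

```python
# Hypothetical sketch of the guard described in the PR text; not the
# actual openai-python transform code.
from typing import Any

def is_numeric_type(field_schema: dict[str, Any]) -> bool:
    # Decimal fields surface as "number" directly or inside an anyOf branch.
    if field_schema.get("type") in ("number", "integer"):
        return True
    return any(
        sub.get("type") in ("number", "integer")
        for sub in field_schema.get("anyOf", [])
    )

def prune_metadata(field_schema: dict[str, Any]) -> dict[str, Any]:
    pruned = dict(field_schema)
    pruned.pop("title", None)  # presentation-only, always safe to drop
    # Keep "type" for numeric fields like Decimal so the schema stays
    # valid for the API; non-numeric metadata keeps the prior behavior.
    if not is_numeric_type(field_schema):
        pruned.pop("type", None)
    return pruned

decimal_field = {"title": "Price", "anyOf": [{"type": "number"}, {"type": "string"}]}
print(prune_metadata(decimal_field))
```

The guard runs before any key removal, so a `Decimal` field that Pydantic renders as `anyOf` number/string keeps its type information end to end.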
Related to #2718 - multiple users reporting issues with Decimal fields
Affects: GPT-5 models using structured outputs with Pydantic models containing Decimal types