Extract Pydantic schema utilities to dedicated module #4309

daveey · 2025-12-10T07:15:01Z

TL;DR

Refactored Pydantic schema extraction into a dedicated module with enhanced functionality for JSON schema export.

What changed?

- Moved get_pydantic_field_info() from run_tool.py to a new dedicated module schema.py
- Added new utility functions in schema.py:
  - get_type_str() - Converts type annotations to readable strings
  - serialize_default() - Serializes default values to JSON-compatible format
  - extract_schema() - Enhanced schema extraction with richer metadata
- Created a new tool pydantic_config_schema.py that can extract schema from any Pydantic model and output as JSON
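To give a rough idea of what converting a type annotation to a readable string involves, here is a minimal sketch built only on the standard `typing` helpers. The name `type_to_str` and its output format are hypothetical and chosen for illustration; the actual `get_type_str()` in schema.py may behave differently.

```python
import types
from typing import Any, Optional, Union, get_args, get_origin

# X | Y unions have their own origin type on Python 3.10+; fall back to Union otherwise.
_UNION_ORIGINS = (Union, getattr(types, "UnionType", Union))


def type_to_str(annotation: Any) -> str:
    """Render a type annotation as a short readable string (hypothetical helper)."""
    if annotation is type(None):
        return "None"
    origin = get_origin(annotation)
    if origin is None:
        # Plain classes such as int or a model class; use str() if there is no __name__.
        return getattr(annotation, "__name__", str(annotation))
    args = get_args(annotation)
    if origin in _UNION_ORIGINS:
        # Covers Optional[X], Union[X, Y], and the X | Y syntax.
        return " | ".join(type_to_str(a) for a in args)
    name = getattr(origin, "__name__", str(origin))
    if not args:
        return name
    return f"{name}[{', '.join(type_to_str(a) for a in args)}]"


print(type_to_str(Optional[dict[str, list[int]]]))
```

Recursing through `get_origin()` and `get_args()` keeps nested generics readable without special-casing each container type.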

How to test?

Run the new schema extraction tool with a Pydantic model:

uv run ./tools/pydantic_config_schema.py metta.rl.trainer_config.TrainerConfig

Or with a tool name:

uv run ./tools/pydantic_config_schema.py arena.train

Why make this change?

This refactoring improves code organization by moving schema extraction logic to a dedicated module. The enhanced schema extraction capabilities enable better documentation generation and configuration validation. The new JSON schema export tool allows for programmatic access to configuration schemas, which can be used for generating documentation, UI forms, or validation rules.

daveey · 2025-12-10T07:15:28Z

Add per-agent cooldown for assemblers and protocol inflation #4171: 3 dependent PRs (#4222, #4247, #4321)
Extract Pydantic schema utilities to dedicated module #4309 👈
main

This stack of pull requests is managed by Graphite.

graphite-app · 2025-12-10T07:18:46Z

common/src/metta/common/tool/schema.py

+        if not has_default and field.default_factory is not None:
+            try:
+                default_val = field.default_factory()
+                has_default = True
+            except Exception:
+                default_val = "<factory>"
+                has_default = True


Calling field.default_factory() without arguments can fail for factories that require parameters. Additionally, executing factory functions during schema extraction can trigger unintended side effects (file I/O, network calls, resource allocation, etc.).

# Current code executes the factory:
if not has_default and field.default_factory is not None:
    try:
        default_val = field.default_factory()  # ⚠️ Executes factory
        has_default = True
    except Exception:
        default_val = "<factory>"
        has_default = True

# Should avoid execution:
if not has_default and field.default_factory is not None:
    default_val = "<factory>"
    has_default = True

The serialize_default() function already handles callable values by returning "<factory>" (line 78-79), so the factory should not be executed here. This prevents side effects and makes the behavior consistent with how get_pydantic_field_info() handles factories (line 130, 140).
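The side-effect concern can be illustrated with a small self-contained sketch. To stay runnable without Pydantic installed, it uses stdlib `dataclasses` as a stand-in, since dataclass fields expose a similar `default_factory` attribute (with `MISSING` instead of `None` when absent). `ExampleConfig`, `expensive_factory`, and `describe_defaults` are hypothetical names and not part of this PR.

```python
from dataclasses import MISSING, dataclass, field, fields

calls = []  # records every time the factory actually runs


def expensive_factory() -> list:
    calls.append("ran")  # stand-in for file I/O, network calls, etc.
    return [1, 2, 3]


@dataclass
class ExampleConfig:  # hypothetical config, not from the PR
    steps: int = 10
    layers: list = field(default_factory=expensive_factory)


def describe_defaults(cls) -> dict:
    """Collect defaults without ever invoking a default factory."""
    out = {}
    for f in fields(cls):
        if f.default is not MISSING:
            out[f.name] = f.default
        elif f.default_factory is not MISSING:
            out[f.name] = "<factory>"  # placeholder instead of calling the factory
        else:
            out[f.name] = None  # required field, no default
    return out


info = describe_defaults(ExampleConfig)
print(info, calls)
```

Because the factory is never called, `calls` stays empty after inspection, which is the property the suggested change is after.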

Suggested change

-        if not has_default and field.default_factory is not None:
-            try:
-                default_val = field.default_factory()
-                has_default = True
-            except Exception:
-                default_val = "<factory>"
-                has_default = True
+        if not has_default and field.default_factory is not None:
+            default_val = "<factory>"
+            has_default = True

Spotted by Graphite Agent


nishu-builder

this is very exciting! one thing that's coming to mind is that pydantic has some built-in support for schema serialization. e.g. try this out:

"""Extract JSON schema from a Pydantic model using built-in Pydantic schema export."""

import argparse
import importlib
import json
import sys

from pydantic import BaseModel


def load_model_class(spec: str) -> type[BaseModel]:
    if "." not in spec:
        raise ValueError(f"Invalid spec: {spec}. Use module.path.ClassName format")

    module_path, class_name = spec.rsplit(".", 1)
    module = importlib.import_module(module_path)
    cls = getattr(module, class_name)
    if not isinstance(cls, type) or not issubclass(cls, BaseModel):
        raise ValueError(f"{spec} is not a Pydantic BaseModel")
    return cls


def main():
    parser = argparse.ArgumentParser(description="Extract JSON schema from Pydantic model (built-in)")
    parser.add_argument("model", help="Model spec: module.path.ClassName")
    parser.add_argument("--compact", action="store_true", help="Compact JSON output")
    args = parser.parse_args()

    try:
        model_class = load_model_class(args.model)
    except Exception as e:
        print(f"Error loading model: {e}", file=sys.stderr)
        sys.exit(1)

    schema = model_class.model_json_schema()  # could replace this with `extract_schema` to compare to yours
    indent = None if args.compact else 2
    print(json.dumps(schema, indent=indent))


if __name__ == "__main__":
    main()

it has a few advantages:

- handles a few types yours may not, like sets, Path, UUID (i think we use all of these)
- field constraints: keeps ge, etc
- it gets model-level docstrings (i see yours extracts field descriptions, which could be cool to show up when forming the command in skydeck, and maybe you want some analogous thing for objects themselves?)

yours has the advantage of:

- flat dot-path keys. maybe this makes handling in skydeck easier, or you've already written it with dot-path keys in mind
- when there's a nested pydantic object, you print <PydanticClassName>, and don't have to do any $ref jsonschema-type resolution, which the builtin one requires
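To make the $ref point concrete, here is a minimal sketch of how a consumer could flatten built-in output into dot-path keys. The schema dict is hand-written in the shape that `model_json_schema()` produces in Pydantic v2 (nested models under `$defs`, referenced via `$ref`); the field names and the `flatten_schema` helper are hypothetical. It assumes plain `#/$defs/...` references and no self-referential models, which would recurse forever here.

```python
from typing import Optional

# Hand-written example in the shape of Pydantic v2 JSON schema output (hypothetical fields).
schema = {
    "$defs": {
        "OptimizerConfig": {
            "type": "object",
            "properties": {
                "learning_rate": {"type": "number", "default": 0.001},
            },
        }
    },
    "type": "object",
    "properties": {
        "batch_size": {"type": "integer", "default": 32},
        "optimizer": {"$ref": "#/$defs/OptimizerConfig"},
    },
}

_REF_PREFIX = "#/$defs/"


def flatten_schema(node: dict, root: Optional[dict] = None, prefix: str = "") -> dict:
    """Resolve $ref entries against root['$defs'] and return {dot.path: property_schema}."""
    root = node if root is None else root
    flat = {}
    for name, prop in node.get("properties", {}).items():
        path = f"{prefix}{name}"
        ref = prop.get("$ref")
        if ref is not None and ref.startswith(_REF_PREFIX):
            # Follow the reference and descend into the nested model's properties.
            target = root["$defs"][ref[len(_REF_PREFIX):]]
            flat.update(flatten_schema(target, root, prefix=f"{path}."))
        else:
            flat[path] = prop
    return flat


flat = flatten_schema(schema)
print(sorted(flat))
```

This is the extra resolution step the built-in export pushes onto the consumer; a custom extractor that emits dot-path keys directly avoids it.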

I'm happy with either, so approving this. If we go with your impl, may want to add special type handling for sets, Path, UUID
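If the custom implementation is kept, the special handling for sets, Path, and UUID could look roughly like the sketch below. `to_jsonable` is a hypothetical helper, not the PR's `serialize_default()`; the only behavior borrowed from the review thread is returning "<factory>" for callables.

```python
import json
import uuid
from pathlib import PurePath, PurePosixPath
from typing import Any


def to_jsonable(value: Any) -> Any:
    """Convert a default value into something json.dumps accepts (hypothetical helper)."""
    if isinstance(value, (set, frozenset)):
        # Sets have no JSON equivalent; emit a list sorted by repr for stable output.
        return [to_jsonable(v) for v in sorted(value, key=repr)]
    if isinstance(value, (PurePath, uuid.UUID)):
        return str(value)
    if isinstance(value, dict):
        return {str(k): to_jsonable(v) for k, v in value.items()}
    if isinstance(value, (list, tuple)):
        return [to_jsonable(v) for v in value]
    if callable(value):
        return "<factory>"  # never invoke the callable
    return value


example = to_jsonable(
    {"tags": {"b", "a"}, "out": PurePosixPath("runs/latest"), "id": uuid.UUID(int=0)}
)
print(json.dumps(example))
```

Sorting set members by `repr` is one way to keep the exported JSON deterministic across runs; any stable ordering would do.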

tools/skydeck/SKYDECK_API.md

openhands-ai · 2025-12-14T22:28:51Z

Looks like there are a few issues preventing this PR from being merged!

GitHub Actions are failing:
- Test and Benchmark

If you'd like me to help, just leave a comment, like

@OpenHands please fix the failing actions on PR #4309 at branch `daveey-pydantic-schema`

Feel free to include any additional details that might help me get this PR into a better state.


This was referenced Dec 10, 2025

Add per-agent cooldown for assemblers and protocol inflation #4171

Draft

Add diversity injection for automatic exploration when gradients vanish #4222

Closed

skydeck #4237

Closed

daveey changed the title from "cp" to "Extract Pydantic schema utilities to dedicated module" Dec 10, 2025

This was referenced Dec 10, 2025

fix: observation inventory reads only from agent's own position #4238

Draft

refactor: loss and diversity injection changes #4240

Closed

daveey mentioned this pull request Dec 10, 2025

Move skydeck from projects/ to packages/ directory #4247

Closed

daveey marked this pull request as ready for review December 10, 2025 07:15

github-actions bot assigned daveey Dec 10, 2025

graphite-app bot reviewed Dec 10, 2025


daveey force-pushed the daveey-pydantic-schema branch 2 times, most recently from 6450469 to b501e73 December 10, 2025 20:58

This was referenced Dec 10, 2025

Add build action to mettagrid #4321

Draft

Add building demolition functionality to MettaGrid #4323

Draft

Add AOE effects to grid objects #4328

Closed

daveey changed the base branch from main to graphite-base/4309 December 14, 2025 06:38

daveey force-pushed the daveey-pydantic-schema branch from b501e73 to 3f5a2da December 14, 2025 06:38

daveey changed the base branch from graphite-base/4309 to daveey-resource-limit December 14, 2025 06:38

daveey mentioned this pull request Dec 14, 2025

Increase inventory capacity with base-100 encoding for values up to 25599 #4355

Closed

daveey force-pushed the daveey-resource-limit branch from 4aea879 to d0a68f9 December 14, 2025 19:03

daveey force-pushed the daveey-pydantic-schema branch from 3f5a2da to 91dfa7f December 14, 2025 19:03

daveey mentioned this pull request Dec 14, 2025

Refactor inventory system into dedicated InventoryConfig class #4356

Merged

daveey marked this pull request as draft December 14, 2025 19:33

daveey marked this pull request as ready for review December 14, 2025 19:35

daveey removed their assignment Dec 14, 2025

github-actions bot assigned daveey Dec 14, 2025

daveey assigned nishu-builder and unassigned daveey Dec 14, 2025

daveey requested a review from nishu-builder December 14, 2025 19:35

nishu-builder approved these changes Dec 14, 2025


daveey changed the base branch from daveey-resource-limit to graphite-base/4309 December 14, 2025 22:22

daveey force-pushed the graphite-base/4309 branch from d0a68f9 to 82c1007 December 14, 2025 22:22

daveey force-pushed the daveey-pydantic-schema branch from 91dfa7f to 012fa28 December 14, 2025 22:22

daveey changed the base branch from graphite-base/4309 to main December 14, 2025 22:22

graphite-app bot reviewed Dec 14, 2025

tools/skydeck/SKYDECK_API.md (outdated, resolved)

cp

2afd2b3

daveey force-pushed the daveey-pydantic-schema branch from 012fa28 to 2afd2b3 December 14, 2025 22:37

daveey added this pull request to the merge queue Dec 14, 2025

Merged via the queue into main with commit b935bc4 Dec 14, 2025
12 checks passed

daveey deleted the daveey-pydantic-schema branch December 14, 2025 22:52
