Add training API for vector data rasterization #1
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Implements a Python library for rasterizing vector data (points, lines, polygons) with a direct training API for machine learning workflows.
Core Components
Rasterizer- Converts vector geometries to raster grids withrasterize_points(),rasterize_lines(),rasterize_polygon(), andbatch_rasterize()Trainer- High-level training API with three usage modes:prepare_training_data()- Rasterize and normalize for any ML frameworktrain()- Train models directly with prepared datatrain_from_raw_data()- One-step pipeline from raw vectors to trained modelUsage
Implementation Details
fit()methodEPSILON = 1e-8(batch, height, width, 1)for CNN compatibilityTesting
20 unit tests covering core functionality and edge cases, bilingual documentation with examples.
Original prompt
✨ Let Copilot coding agent set things up for you — coding agent works faster and does higher quality work when set up for your repo.