[WIP] [New Model] PrepLGBM #245

atschalz · 2025-11-18T18:19:43Z

Issue #, if available:

Description of changes:
Adds a new model that extends the current LightGBM implemented in AutoGluon by adding several preprocessing techniques.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Remove debugging code update pytabkit version tmp commit Update update CVSplitter import switch to AG SquashingScaler Update prep_mixin to be faster and fix crashes Update prep_mixin to work with lists instead of tuples Fix yaml serialization fix pyproject.toml Add passthrough_types Update passthrough logic Use `remove_unused_features="false_recursive"` and `post_drop_duplicates=True` Update memory estimation to use n_numeric, and n_categorical estimates from preprocessors. Minor improvements to linear residual Use BulkFeatureGenerator and parallel stage preprocessing, >2x inference speedup. Minor fix Reduce ModelAgnosticPrepMixin._estimate_memory_usage runtime by 4x cleanup Add missing __init__.py stop saving training job logs to s3 add `memory_usage_estimate` tracking cleanup torch memory and gc minor update Update RealTabPFN-2.5 to new prep logic refactor prep logic streamline prep logic update Small bug fix Extend preprocessing logic to TabM, TabPFN2.5, CatBoost and XGBoost; Move preprocessing models to a separate directory minor updates Add skrub to dependencies bug fix in search space Adjust search space Fix bug in memory estimation for OOF-TE Add memory estimation, switch to using AG preprocessors Add functions for linear residual boosting Undo example changes Remove unnecessary old code Add explorative search space Make preprocessors and residuals use the correct random seeds Adjust to new tabarena structure Initialize with AG LightGBM Update PrepGBM: Add residual logic and preprocessors, switch to use LGBModel as superclass; Adapt search space Add new model and search space

Innixma force-pushed the prep_lgb branch 3 times, most recently from 827655b to 73ce839 Compare November 21, 2025 01:40

atschalz force-pushed the prep_lgb branch 2 times, most recently from 6b62286 to ea510d6 Compare December 1, 2025 22:58

Innixma force-pushed the prep_lgb branch 2 times, most recently from 83e94fe to 4e26a4b Compare December 9, 2025 00:15

Innixma force-pushed the prep_lgb branch 2 times, most recently from bf832b6 to c0fdcd0 Compare December 17, 2025 00:02

Innixma force-pushed the prep_lgb branch 2 times, most recently from a5256e4 to e203149 Compare January 4, 2026 22:17

atschalz force-pushed the prep_lgb branch from 1df2d06 to 635a1a3 Compare January 9, 2026 11:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] [New Model] PrepLGBM #245

[WIP] [New Model] PrepLGBM #245

Uh oh!

atschalz commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[WIP] [New Model] PrepLGBM #245

Are you sure you want to change the base?

[WIP] [New Model] PrepLGBM #245

Uh oh!

Conversation

atschalz commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant