Skip to content

Recommendation for adding models present in TabZilla to AutoGluon #101

@Innixma

Description

@Innixma

Hello, first of all great work in getting so many model families implemented in one place!

I am wondering if there is a recommendation from the authors on how I would best add certain model families such as ResNet to work in AutoGluon (example of custom model implementation in AutoGluon: https://auto.gluon.ai/stable/tutorials/tabular/advanced/tabular-custom-model.html)

While I could potentially do it from scratch such as adapting rtdl (https://github.com/yandex-research/rtdl/tree/main), this would end up being a lot of duplicated work that TabZilla already did. I notice in your code base that you have a lot of code logic that is specialized such as RTDL_ResNet_Model, but trying to call these classes requires many args being specified that are benchmark specific.

At the end of the day, I'm looking for something akin to a sklearn interface, and any required data preprocessing would happen inside the model implementation:

train_data = pd.read_csv("some_messy_data_with_categoricals_and_missing.csv")
X_test = pd.read_csv("some_messy_data_with_categoricals_and_missing_test.csv")
X = train_data.drop(target_column, axis=1)
y = train_data[target_column] 
for model_class in all_models_implemented_in_tabzilla:
     model = model_class(**params)
     model.fit(X, y)
     y_pred = model.predict(X_test)

The final goal would be to support all TabZilla models in TabRepo to improve the strength of the learned portfolios and find model families that synergize when ensembled.

Any guidance would be appreciated!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions