Are class percentages maintained in the train, val and test datasets? #10

@DivyenduDutta

Description

Currently, the train, val and test split is monolithic. See make_train_val_test_split() in data_preprocess.py.

If the split does not maintain class percentages, then that's not a good way to split the dataset.

First, we need to check whether class percentages are maintained in the splits.
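One way to run this check is to compare per-class percentages of each split against the full dataset. The following is only a rough sketch: `class_percentages` and `max_percentage_gap` are hypothetical helper names, not functions from data_preprocess.py, and it assumes the labels of each split are available as a plain list.

```python
from collections import Counter


def class_percentages(labels):
    """Return {class: percentage of samples} for one list of labels."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {cls: 100.0 * n / total for cls, n in counts.items()}


def max_percentage_gap(full_labels, split_labels):
    """Largest absolute difference (in percentage points) between
    a split's class percentages and those of the full dataset."""
    full = class_percentages(full_labels)
    split = class_percentages(split_labels)
    # A class missing from the split counts as 0%.
    return max(abs(full[cls] - split.get(cls, 0.0)) for cls in full)


# Toy labels, purely illustrative (not data from this project).
full = ["cat"] * 80 + ["dog"] * 20
train = ["cat"] * 70 + ["dog"] * 10
print(class_percentages(train))
print(max_percentage_gap(full, train))
```

A gap close to zero for train, val and test would suggest the existing split already preserves class percentages; what counts as "close enough" is a judgment call for this dataset.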

If not, then split the dataset into train, val and test so that the class percentages in each split match those of the original full dataset.
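A stratified split could look roughly like the sketch below. It is not the current make_train_val_test_split() implementation: `stratified_split`, its parameters and the default fractions are assumptions made for illustration. It groups sample indices by class, shuffles each group, and cuts every group with the same val/test fractions.

```python
import random
from collections import defaultdict


def stratified_split(labels, val_frac=0.1, test_frac=0.1, seed=0):
    """Return (train_idx, val_idx, test_idx), splitting each class separately
    so every split keeps roughly the class percentages of the full dataset."""
    rng = random.Random(seed)

    # Group sample indices by their class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    train_idx, val_idx, test_idx = [], [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_val = round(len(indices) * val_frac)
        n_test = round(len(indices) * test_frac)
        val_idx.extend(indices[:n_val])
        test_idx.extend(indices[n_val:n_val + n_test])
        train_idx.extend(indices[n_val + n_test:])
    return train_idx, val_idx, test_idx


# Toy labels, purely illustrative (not data from this project).
labels = ["cat"] * 80 + ["dog"] * 20
train_idx, val_idx, test_idx = stratified_split(labels)
print(len(train_idx), len(val_idx), len(test_idx))
```

Because the counts are rounded per class, very small classes may end up with no samples in val or test. If scikit-learn is already a dependency, calling `train_test_split` twice with its `stratify` argument is a common alternative to hand-rolling this.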

Labels

enhancement (New feature or request), question (Further information is requested)
