Are class percentages maintained in the train, val and test datasets? #10

Currently the train, val and test split is monolithic. See `make_train_val_test_split()` in `data_preprocess.py`.

If the split does not maintain class percentages, that is not a good way to split the dataset.

First, we need to check whether class percentages are maintained in the splits.
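
One possible way to run this check is sketched below. It assumes the labels of the full dataset and of each split are available as plain Python sequences; the helper names `class_percentages` and `max_percentage_drift` are hypothetical and do not exist in `data_preprocess.py`.

```python
from collections import Counter


def class_percentages(labels):
    """Return {class: percentage of samples} for a sequence of labels."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {cls: 100.0 * n / total for cls, n in counts.items()}


def max_percentage_drift(full_labels, split_labels):
    """Largest absolute gap, in percentage points, between a split and the full dataset.

    A class that is missing from the split counts as 0% in that split.
    """
    full = class_percentages(full_labels)
    split = class_percentages(split_labels)
    return max(abs(full[cls] - split.get(cls, 0.0)) for cls in full)
```

Calling `max_percentage_drift(all_labels, train_labels)`, and likewise for val and test, gives one number per split. What counts as an acceptable drift is a judgment call that depends on the dataset size.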

If not, split the dataset into train, val and test so that the class percentages in each split match those of the original full dataset.
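
A minimal sketch of such a stratified split follows, using only the standard library. The function name `stratified_split`, its parameters and the default 80/10/10 fractions are assumptions for illustration, not the current behavior of `make_train_val_test_split()`.

```python
import random
from collections import defaultdict


def stratified_split(labels, val_frac=0.1, test_frac=0.1, seed=0):
    """Return (train_idx, val_idx, test_idx), splitting each class separately."""
    # Group sample indices by class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    rng = random.Random(seed)  # seeded so the split is reproducible
    train_idx, val_idx, test_idx = [], [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        # Per-class sizes; rounding can shift very small classes by one sample.
        n_val = round(len(indices) * val_frac)
        n_test = round(len(indices) * test_frac)
        val_idx.extend(indices[:n_val])
        test_idx.extend(indices[n_val:n_val + n_test])
        train_idx.extend(indices[n_val + n_test:])
    return train_idx, val_idx, test_idx
```

If scikit-learn is already a dependency, calling `sklearn.model_selection.train_test_split` twice with the `stratify` argument would achieve the same effect with less code.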
