Skip to content

Conversation

@runalih
Copy link

@runalih runalih commented Dec 8, 2025

In this PR we are including 3 new files that aim to add a new dataset and a task to go along with that dataset.

This dataset is Heart_Disease_UCI.csv. It contains many columns that all pertains to a patients current health status (e.g. age, sex, cholesterol, blood pressure) and their current heart problems. Using this csv file, we’ve created a Dataset that is imported into pyhealth to interpret this data.

This data is then pulled into a task that aims to use the columns in this dataset to try to predict where or not patients have any heart issues. This task also provides a train/testing split in order to test if the model is accurate.

New Files added/modified:
pyhealth/datasets/heart_dataset.py
pyhealth/datasets/rawdata/heart_disease_uci.csv
pyhealth/tasks/heart_task.py

Examples of usages:
# pull in datasets from local

dataset = HeartDiseaseDataset(root="pyhealth/datasets/rawdata/heart_disease_uci.csv")

# run tasks on dataset to predict information

task = HeartDiseasePrediction(dataset)



# output information on splits

(train_X, train_y), (test_X, test_y) = task.split()

…ks for predicting heart problems from dataset.
@LogicFan LogicFan added dataset Contribute a new dataset to PyHealth task Contribute a new task to PyHealth labels Dec 18, 2025
@LogicFan LogicFan changed the title Init Commit, adding files for importing heart disease dataset and tas… HeartDisease dataset & task Dec 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dataset Contribute a new dataset to PyHealth task Contribute a new task to PyHealth

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants