Skip to content

Conversation

@anumala2
Copy link

@anumala2 anumala2 commented Dec 8, 2025

Purpose

This pull request aims to add the drug information dataset from the Genomics of Drug Sensitivity in Cancer to the pyhealth library. The GDSC drug_info table is a drug-centric metadata table that describes compounds screened across the Genomics of Drug Sensitivity in Cancer (GDSC) cell-line drug-sensitivity project. Typical columns include unique drug identifiers, canonical names, alternate names/synonyms, molecular or protein targets, higher-level pathways targeted, external chemical identifiers (e.g., PubChem CID), and bookkeeping counts such as sample sizes or number of experiments. The broader GDSC resource pairs these drug metadata with measured drug response (e.g., IC50) across hundreds to thousands of cancer cell lines, enabling pharmacogenomic analyses. This data is sourced from the extending-cadre repository.

@LogicFan LogicFan added the dataset Contribute a new dataset to PyHealth label Dec 18, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dataset Contribute a new dataset to PyHealth

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants