GitHub - AfsharJoyceInfoLab/AlcoholNLP_Classification

@author Brihat Sharma

Introduction

Alcohol classifier to identify alcohol misuse from the electronic health records (EHR) of emergency department and hospitalized patients. The first 24 hours of clinical notes are required as input. The notes must first be processed with Apache cTAKES, which maps the raw text to UMLS Concept Unique Identifiers (CUIs).

The original research article describing development and internal validation is available at https://www.ncbi.nlm.nih.gov/pubmed/30602031. The classifier was trained on patients who completed the Alcohol Use Disorders Identification Test (AUDIT).

Dependencies: pandas, os, pickle (os and pickle are part of the Python standard library)

Steps:

cTAKES:

Download cTAKES from https://ctakes.apache.org/downloads.cgi
cTAKES comes with a default dictionary, which can also be customized to create your own version. Our dictionary consists of RxNorm and SNOMED CT, but the default dictionary also works well.
Process the input data using cTAKES. This creates .txt files containing CUIs, which serve as the input data for the model.
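
The cTAKES output described above can be loaded with a short helper before it is passed to the model. This is a minimal sketch and not the repository's code: the function name load_cui_files is hypothetical, and it assumes each .txt file stores CUIs as whitespace-separated tokens of the form C followed by seven digits. Your cTAKES output configuration may differ.

```python
import os
import re

# A UMLS CUI is the letter C followed by seven digits, e.g. C0001962.
CUI_PATTERN = re.compile(r"^C\d{7}$")


def load_cui_files(input_dir):
    """Return a dict mapping each .txt file name to its list of CUIs.

    Assumption: CUIs are whitespace-separated tokens inside each file.
    Tokens that do not look like CUIs are ignored.
    """
    cuis_by_file = {}
    for name in sorted(os.listdir(input_dir)):
        if not name.endswith(".txt"):
            continue  # skip anything that is not a cTAKES .txt output
        path = os.path.join(input_dir, name)
        with open(path, encoding="utf-8") as handle:
            tokens = handle.read().split()
        cuis_by_file[name] = [t for t in tokens if CUI_PATTERN.match(t)]
    return cuis_by_file
```

Calling load_cui_files on the cTAKES output directory gives a quick way to check that every note produced at least one CUI before running the classifier.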

Model:

Open the Alcohol_Predict.py script and change the input and output directories.
Run the script with python3 Alcohol_Predict.py
The results are written to a CSV file inside the output directory. The first column contains the file names, the second the predicted labels (1 for current alcohol misuse, 0 for no alcohol misuse), and the third the predicted probabilities.
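
The output CSV can be inspected with a few lines of Python. The sketch below is illustrative only: the function names read_predictions and flagged_files are hypothetical, and it assumes the three columns appear in the order described above (file name, predicted label, predicted probability). It uses the standard-library csv module and skips any row, such as a header, whose label or probability is not numeric.

```python
import csv


def read_predictions(csv_path):
    """Return a list of (file_name, label, probability) tuples.

    Assumption: column order is file name, predicted label, probability.
    Rows that cannot be parsed (e.g. a header row) are skipped.
    """
    rows = []
    with open(csv_path, newline="", encoding="utf-8") as handle:
        for record in csv.reader(handle):
            if len(record) < 3:
                continue
            name, label, probability = record[0], record[1], record[2]
            try:
                rows.append((name, int(float(label)), float(probability)))
            except ValueError:
                continue  # header or malformed row
    return rows


def flagged_files(rows):
    """Return the file names predicted as current alcohol misuse (label 1)."""
    return [name for name, label, _ in rows if label == 1]
```

For example, flagged_files(read_predictions(path)) lists the notes the classifier labeled as alcohol misuse, where path points to the CSV file in your output directory.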
