This repo contains clones of MiniGPT-4 and MiniGPT-Med (see the respective folders).
Some of the training files may be outdated: they were uploaded to a cluster and may have been modified there.
I would suggest looking at more recent multimodal frameworks and LLMs, or building your own Vision Transformer -> LLM pipeline. If you do use these cloned repos, make sure to read their READMEs.
The dataset used is VinDr-CXR. I can't redistribute it because it's a restricted-access resource; if you have access, feel free to contact me for the post-processed dataset, annotations, and results. However, you should be able to recreate the post-processed dataset with these files.
Although not explicitly stated in the MiniGPT-4 and MiniGPT-Med repos, training requires a Linux environment and at least 16GB of VRAM (12GB is not enough).
Llama 2 needs to be downloaded somewhere on your computer.
Values that need to be changed in the files (e.g. paths) are marked with "CHANGE ME" wherever a value must be edited.
Where the post-processed train and test annotations live.
Where the post-processed train and test images live.
Where the unprocessed VinDr-CXR dataset lives.
These Python files process the dataset into correctly sized images, annotations, and viewable PNGs/JPEGs. The files below are listed alphabetically.
Given a test DICOM ID, takes the post-processed image and draws rectangles on it from either model output or ground-truth labels.
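The core of that drawing step can be sketched in a few lines of numpy. This is a minimal illustration only (the function name and single-channel canvas are assumptions, not the script's actual code), drawing a 1-pixel rectangle outline directly into an image array:

```python
import numpy as np

def draw_box(img, x_min, y_min, x_max, y_max, value=255):
    """Draw a 1-pixel rectangle outline on a 2-D image array, in place."""
    img[y_min, x_min:x_max + 1] = value   # top edge
    img[y_max, x_min:x_max + 1] = value   # bottom edge
    img[y_min:y_max + 1, x_min] = value   # left edge
    img[y_min:y_max + 1, x_max] = value   # right edge
    return img

canvas = np.zeros((448, 448), dtype=np.uint8)  # 448x448 matches the post-processed image size
draw_box(canvas, 100, 120, 200, 260)
```

A drawing library such as Pillow's `ImageDraw.rectangle` does the same job with less code; the array version just makes the pixel arithmetic explicit.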
Crops the original DICOM images (train or test, selected via data_type) to 448x448 and saves them to the data directory. Also saves the scaling factor used to crop each image, for later use by another Python file.
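The resize-plus-scale-factor idea can be sketched with numpy alone. This is a hedged illustration, not the script's actual code: the real script would read DICOM pixel data (e.g. via pydicom), and the nearest-neighbour resize and JSON record below are assumptions:

```python
import json
import numpy as np

TARGET = 448

def resize_with_scale(pixels: np.ndarray):
    """Nearest-neighbour resize to TARGET x TARGET; returns image + (sx, sy)."""
    h, w = pixels.shape
    sy, sx = TARGET / h, TARGET / w              # per-axis scale factors
    rows = (np.arange(TARGET) / sy).astype(int).clip(0, h - 1)
    cols = (np.arange(TARGET) / sx).astype(int).clip(0, w - 1)
    return pixels[rows][:, cols], (sx, sy)

# e.g. a 1000x800 chest X-ray array stands in for real DICOM pixel data
img = np.arange(1000 * 800).reshape(1000, 800).astype(np.uint16)
resized, (sx, sy) = resize_with_scale(img)

# the scale factors would be saved per DICOM ID for the annotation-scaling step
record = json.dumps({"example_dicom_id": [sx, sy]})
```

Saving `(sx, sy)` alongside the image is what lets the later scripts map the original pixel-space bounding boxes into the 448x448 frame.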
Generates labels from the scaled annotations produced by scale_annotations.py and the annotations from the VinDr dataset. Saves to annotations.
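The label-generation step amounts to combining one VinDr-style annotation row with the saved scale factors. A minimal sketch, assuming field names (`class_name`, `x_min`, …) like those in the VinDr-CXR CSVs and a label layout invented for illustration:

```python
# one raw VinDr-style annotation row: class name + pixel-space box
raw = {"image_id": "abc123", "class_name": "Cardiomegaly",
       "x_min": 400.0, "y_min": 500.0, "x_max": 700.0, "y_max": 900.0}

# scale factors saved by the cropping step for a 800x1000 original
sx, sy = 448 / 800, 448 / 1000

label = {
    "image_id": raw["image_id"],
    "disease": raw["class_name"],
    "bbox": [round(raw["x_min"] * sx), round(raw["y_min"] * sy),
             round(raw["x_max"] * sx), round(raw["y_max"] * sy)],
}
```

The resulting boxes are in the 448x448 frame of the post-processed images, so they line up with what the model actually sees.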
Translates the labels generated by label_gen.py into prompts to feed the LLM. Prompts are saved to annotations.
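The label-to-prompt translation can be sketched as simple string formatting. The bracketed grounding tags below are an assumption for illustration; the real prompt format should match whatever the MiniGPT repos expect:

```python
def label_to_prompt(label):
    # Bracket-style coordinate tags are a hypothetical format,
    # not necessarily what the training code consumes.
    x1, y1, x2, y2 = label["bbox"]
    return f"{label['disease']} <{x1}><{y1}><{x2}><{y2}>"

prompt = label_to_prompt({"disease": "Cardiomegaly", "bbox": [224, 224, 392, 403]})
```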
Converts the prompt output (a list of dictionaries) into a single dictionary and saves it to a file in annotations.
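That conversion is a one-liner if each entry carries a unique key. A sketch, assuming (hypothetically) that each dictionary holds an `image_id` and a `prompt`:

```python
prompts = [
    {"image_id": "a", "prompt": "Cardiomegaly <10><20><30><40>"},
    {"image_id": "b", "prompt": "No finding"},
]

# collapse the list of dicts into one dict keyed by image id
merged = {p["image_id"]: p["prompt"] for p in prompts}
```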
Evaluates the model in terms of IoU (bounding boxes) and ROUGE and BLEU (text). Requires parsed results from parse_vindr_results.py.
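The bounding-box half of that evaluation rests on the standard intersection-over-union formula, which can be written in a few lines (boxes as `[x1, y1, x2, y2]`; ROUGE/BLEU would come from a text-metrics library and are not shown):

```python
def iou(a, b):
    """Intersection-over-union of two [x1, y1, x2, y2] boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)

    def area(r):
        return (r[2] - r[0]) * (r[3] - r[1])

    union = area(a) + area(b) - inter
    return inter / union if union else 0.0
```

Identical boxes score 1.0, disjoint boxes 0.0, and partial overlaps fall in between.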
Evaluates the model's ability to predict localized diseases using various metrics. Needs parsed results from parse_vindr_results.py.
Evaluates the model's ability to predict global diseases using various metrics. Needs parsed results from parse_vindr_results.py.
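The "various metrics" in both disease-prediction evaluations typically boil down to per-class precision, recall, and F1 over binary labels. A self-contained sketch of those formulas (the exact metric set used by the scripts may differ):

```python
def prf1(y_true, y_pred):
    """Precision, recall and F1 for one class from binary label lists."""
    tp = sum(t and p for t, p in zip(y_true, y_pred))
    fp = sum((not t) and p for t, p in zip(y_true, y_pred))
    fn = sum(t and (not p) for t, p in zip(y_true, y_pred))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

p, r, f = prf1([1, 1, 0, 0], [1, 0, 1, 0])
```

In practice a library such as scikit-learn computes these (and averages them across disease classes) in one call.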
Parses the results file produced by running the eval script in the respective repo. To be used for metric evaluation.
Adjusts the bounding boxes in the VinDr-CXR dataset so they are scaled correctly after the image post-processing. Used by label_gen.py.