- Install dependencies: INSTALLING.md
- Process dataset: DataFuncs.py
- Extract Features:
- Extract parse trees add_parse_tree.py
- Extract parse tree features add_path_features.py with utility functions defined in instance_parser.py.
- Run example experiments. run_all.py
https://github.com/rvente/NLP-Final-Project/blob/release/Code/orpheus
βββ analysis Use these notebooks for analysis, reading and writing to /results .
βΒ Β βββ analysis.ipynb
βΒ Β βββ chart_nb_x_alpha.ipynb
βΒ Β βββ chart_prev_seen.ipynb
βΒ Β βββ chart_svc_prev_seen.ipynb
βΒ Β βββ generate_charts.ipynb
βΒ Β βββ Presentation.ipynb
βΒ Β βββ Presentation-NB.ipynb
βΒ Β βββ Presentation-SVC.ipynb
βββ data Store data and feature extraction output here.
βΒ Β βββ 1000A30D__doc+pos.pkl
βΒ Β βββ 1000A30D_with_doc.pkl
βΒ Β βββ 100A50D.csv
βΒ Β βββ 100A50D__doc+pos.pkl
βΒ Β βββ 100A50D_POS.pkl
βΒ Β βββ DataFuncs.py
βΒ Β βββ Run_All.py
βΒ Β βββ ...
βΒ Β βββ small_with_doc.pkl
βΒ Β βββ small.xlsx
βββ experimentation Configure and run the machine learning models
βΒ Β βββ l0_100a_50d.py
βΒ Β βββ __pycache__
βΒ Β βββ run_all.py Outlines the most general combinations of hyper-parameters.
βΒ Β βββ run_prev_seen.py
βΒ Β βββ sandbox.py
βΒ Β βββ svc.py
βββ feature_extraction
βΒ Β βββ add_parse_tree.py
βΒ Β βββ add_path_features.py
βΒ Β βββ instance_parser.py
βΒ Β βββ __pycache__
βββ figures Figures generated by the analysis scripts.
βΒ Β βββ nb_x_alpha.pdf
βΒ Β βββ nb_x_alpha.svg
βΒ Β βββ nb_x_prev_seen.pdf
βΒ Β βββ svm_x_prev_seen.pdf
βββ INSTALLING.md How to install and configure
βββ logs gitignored: The filesystem database of experiments
βΒ Β βββ 1
βΒ Β βββ 10
βΒ Β βββ 100
βΒ Β βββ 101
βΒ Β βββ ...
βΒ Β βββ _sources
βββ prev_seen_logs not gitignored: view sample logs here on another branch
βΒ Β βββ 1
βΒ Β βββ 10
βΒ Β βββ 11
βΒ Β βββ ...
βΒ Β βββ _sources
βββ INSTALLING.md
βββ requirements_2.txt
βββ requirements.txt
βββ results
βΒ Β βββ nb_df_acc.pkl
βΒ Β βββ nb_df_f1.pkl
βΒ Β βββ nb_x_alpha_df_acc.pkl
βΒ Β βββ svc_df_acc.pkl
βΒ Β βββ svc_df_f1.pkl
βΒ Β βββ svm_x_prev_seen.pkl
βββ software_citations.bib
βββ virtualenv We recommend a virtual environment for installing packages.