rvente/author-attrib

Overview

  1. Install dependencies: INSTALLING.md
  2. Process dataset: DataFuncs.py
  3. Extract features:
  4. Run example experiments: run_all.py

Directory Tree

https://github.com/rvente/NLP-Final-Project/blob/release/Code/orpheus
├── analysis    Use these notebooks for analysis, reading from and writing to /results.
│   ├── analysis.ipynb
│   ├── chart_nb_x_alpha.ipynb
│   ├── chart_prev_seen.ipynb
│   ├── chart_svc_prev_seen.ipynb
│   ├── generate_charts.ipynb
│   ├── Presentation.ipynb
│   ├── Presentation-NB.ipynb
│   └── Presentation-SVC.ipynb
├── data    Store data and feature extraction output here.
│   ├── 1000A30D__doc+pos.pkl
│   ├── 1000A30D_with_doc.pkl
│   ├── 100A50D.csv
│   ├── 100A50D__doc+pos.pkl
│   ├── 100A50D_POS.pkl
│   ├── DataFuncs.py
│   ├── Run_All.py
│   ├── ...
│   ├── small_with_doc.pkl
│   └── small.xlsx
├── experimentation    Configure and run the machine learning models.
│   ├── l0_100a_50d.py
│   ├── __pycache__
│   ├── run_all.py    Outlines the most general combinations of hyper-parameters.
│   ├── run_prev_seen.py
│   ├── sandbox.py
│   └── svc.py
├── feature_extraction
│   ├── add_parse_tree.py
│   ├── add_path_features.py
│   ├── instance_parser.py
│   └── __pycache__
├── figures    Figures generated by the analysis scripts.
│   ├── nb_x_alpha.pdf
│   ├── nb_x_alpha.svg
│   ├── nb_x_prev_seen.pdf
│   └── svm_x_prev_seen.pdf
├── INSTALLING.md    How to install and configure.
├── logs    gitignored: the filesystem database of experiments.
│   ├── 1
│   ├── 10
│   ├── 100
│   ├── 101
│   ├── ...
│   └── _sources
├── prev_seen_logs    not gitignored: view sample logs here on another branch.
│   ├── 1
│   ├── 10
│   ├── 11
│   ├── ...
│   └── _sources
├── INSTALLING.md
├── requirements_2.txt
├── requirements.txt
├── results
│   ├── nb_df_acc.pkl
│   ├── nb_df_f1.pkl
│   ├── nb_x_alpha_df_acc.pkl
│   ├── svc_df_acc.pkl
│   ├── svc_df_f1.pkl
│   └── svm_x_prev_seen.pkl
├── software_citations.bib
└── virtualenv    We recommend a virtual environment for installing packages.
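
Constituency Parse Tree Path Example

The project attributes authorship using paths through constituency parse trees, and the scripts under feature_extraction (add_parse_tree.py, add_path_features.py) produce those features. This README does not spell out how a path is defined, so the following is only a minimal sketch under one plausible assumption: a path is the chain of labels from the root down to each part-of-speech tag. The helper names parse_bracketed and root_to_leaf_paths are hypothetical and are not taken from the repository.

```python
from collections import Counter


def parse_bracketed(text):
    """Parse a Penn-Treebank-style bracketed string into (label, children) tuples.

    Hypothetical helper for illustration; the repository may use a parser library.
    """
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read_node():
        nonlocal pos
        assert tokens[pos] == "(", "expected an opening bracket"
        pos += 1
        label = tokens[pos]
        pos += 1
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(read_node())   # nested constituent
            else:
                children.append(tokens[pos])   # leaf word
                pos += 1
        pos += 1  # consume the closing bracket
        return (label, children)

    return read_node()


def root_to_leaf_paths(node, prefix=()):
    """Yield one label path per word, e.g. 'S/NP/DT' (assumed path definition)."""
    label, children = node
    path = prefix + (label,)
    for child in children:
        if isinstance(child, str):
            yield "/".join(path)
        else:
            yield from root_to_leaf_paths(child, path)


tree = parse_bracketed("(S (NP (DT The) (NN cat)) (VP (VBD sat)))")
# Path counts could serve as a bag-of-paths feature vector for a classifier.
path_counts = Counter(root_to_leaf_paths(tree))
```

Counting paths rather than words keeps the features syntactic, which is one reason such features are used for authorship attribution; whether the repository counts, binarizes, or otherwise weights paths is not stated here.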

About

Authorship attribution using Constituency Parse Tree Paths
