- A locally run application that demonstrates different matching algorithms
- Current release compares files as a one to one connection
- Outputs given percentage of similarity and highlighted visualization of the matching sections of input documents
- Supports, raw text files and python files currently, with C++ and java planned in the future releases
it is recommended to create and navigate to virtual environemnt using python to run script
python3.8 -m venv <dir>
source <dir>/bin/activate
install source_analyzer python package from Python Package Index. Note: must be running python version 3.8 or greater
pip install source_analyzer
Download the latest built compressed file release from source_analyzer-X.X.X.tar.gz
then install downloaded file
pip3 install /<path_to_file>/source_analyzer-0.1.18.tar.gz
(make sure xlaunch is running) run script
source_analyzer
Errors- Multiple of the same substring found in file B will return only the first instance of that substring.
Issues- Python files featuring a heavy amount of print statements may cause skewed data.
Test files can be found in:
/lib/python3.8/site-packages/source/test_file there are python files and .txt files, make sure to change the file filter down below to be able to see either of them.- Djoni Austin | @dcaust1n
- Jared Dawson | @lukinator1
- Shane Eising | @seising99
- Julian Marott | @jmmoratta
https://theory.stanford.edu/~aiken/publications/papers/sigmod03.pdf