Skip to content
This repository was archived by the owner on Nov 24, 2020. It is now read-only.
/ SourceAnalyzer Public archive

open source code honesty analyzer

License

Notifications You must be signed in to change notification settings

jmmarotta/SourceAnalyzer

Repository files navigation

SourceAnalyzer

GitHub Repo Link

  • A locally run application that demonstrates different matching algorithms
  • Current release compares files as a one to one connection
  • Outputs given percentage of similarity and highlighted visualization of the matching sections of input documents
  • Supports, raw text files and python files currently, with C++ and java planned in the future releases

Getting Started

STEP 1

it is recommended to create and navigate to virtual environemnt using python to run script

python3.8 -m venv <dir> 
source <dir>/bin/activate

STEP 2

install source_analyzer python package from Python Package Index. Note: must be running python version 3.8 or greater

pip install source_analyzer
OR

Download the latest built compressed file release from source_analyzer-X.X.X.tar.gz

then install downloaded file

pip3 install /<path_to_file>/source_analyzer-0.1.18.tar.gz

STEP 3

(make sure xlaunch is running) run script

source_analyzer

Known Errors/Issues

Errors- Multiple of the same substring found in file B will return only the first instance of that substring.

Issues- Python files featuring a heavy amount of print statements may cause skewed data.

Test Files

Test files can be found in:

/lib/python3.8/site-packages/source/test_file there are python files and .txt files, make sure to change the file filter down below to be able to see either of them.

Project Group: Codalyzers

  • Djoni Austin | @dcaust1n
  • Jared Dawson | @lukinator1
  • Shane Eising | @seising99
  • Julian Marott | @jmmoratta

References:

https://theory.stanford.edu/~aiken/publications/papers/sigmod03.pdf

About

open source code honesty analyzer

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •