Crawler_and_sentiment_analysis
In settings.py change variable
CLOSESPIDER_ITEMCOUNT = 10
cd in spiders folder. Then type in cmd line
scrapy crawl quotes
scrapy crawl quotes -o items.json
return to one earlier folder with cd .. then
python inv_index.py
python query_processing.py Concordia
the result of the query is saved in the text file named Result_01.txt
dataset.txt -- [general informations about all the files analysed]
infoForQueries.txt -- [url, length in words of this url, sentiment value for this url]
sentiment_for_each_url.txt: -- [url, sentiment value for this url]