Crawling Tweets as a Network.
Given a list of keywords, TwiSpider will crawl for real time tweets with this keyword(s). It saves the original tweet of it if that's a retweet. It then monitors the original tweets in a regualar basis.
postgresql@11
postgis@3.0
python@3.6+
pip install -r requirements.txt
-
Fill in configs. Make a copy from
.templatefiles. For example, copy a file fromtwitter.ini.templateand name it astwitter.ini, fill your api information inside. -
Edit
src/keywords.txtfor monitored keywords, one at a line. Space means boolean AND.hpv vaccinewill monitor tweets that both containhpvandvaccine -
Create tables with
src/sqls/create_tables.sql -
Run with
python3 src/main.py