python run_trade.py - scrape the home page (the default: top report)
python run_trade.py --report banks - scrape a specific report (for a full list of available reports, use the --print-reports arg)
python run_trade.py --print-reports - print out all available reports
Output:
[INFO] report: mylist
[INFO] report: top
[INFO] report: toppct
[INFO] report: automobilescomponents
[INFO] report: banks
[INFO] report: capitalgoods
and so on ...
python run_trade.py --report all - scrape all reports.
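The options above could be wired up roughly as follows. This is only a sketch, assuming the script uses argparse; `build_parser` is a hypothetical helper and not taken from run_trade.py. Only the flag names and the default report (top) come from the usage lines above.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the documented command-line options."""
    parser = argparse.ArgumentParser(prog="run_trade.py")
    # --report defaults to "top", matching the documented default report.
    parser.add_argument("--report", default="top",
                        help="report to scrape, e.g. banks, or all")
    # --print-reports is a plain on/off switch.
    parser.add_argument("--print-reports", action="store_true",
                        help="print all available reports")
    return parser


# Parse an explicit argument list instead of sys.argv, for illustration only.
args = build_parser().parse_args(["--report", "banks"])
print("report=%s print_reports=%s" % (args.report, args.print_reports))
```

With no arguments the sketch falls back to the top report, which matches the behaviour described for a plain python run_trade.py.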
Clean (from scratch):
- install Python 2.7.12 (or newer, e.g. 2.7.13) - https://www.python.org/downloads/release/python-2712/
- WINDOWS: run the command line as Administrator and type in:
pip
to make sure it is installed. If you get an error while running pip, go to C:\Python27\Scripts in the command line to install the following packages; otherwise you should be able to run pip from any directory. Make sure the command line is started as Administrator.
- win:
pip install lxml==3.6.0 (other systems, e.g. Mac: pip install lxml)
pip install requests
pip install
pip install beautifulsoup4
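After installing, a quick way to confirm that the packages are importable is a small check like the one below. It is a sketch, not part of the project: `missing_modules` and `REQUIRED` are hypothetical names, and the list assumes the usual import names (beautifulsoup4 is imported as bs4).

```python
import importlib

# Import names of the packages installed above (assumed list).
REQUIRED = ["lxml", "requests", "bs4"]


def missing_modules(names):
    """Return the names from `names` that cannot be imported."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing


missing = missing_modules(REQUIRED)
print("missing: %s" % (", ".join(missing) if missing else "none"))
```

Anything reported as missing can be installed again with the matching pip install command.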
Selenium:
pip install -U selenium
If you are on a Windows machine, chromedriver.exe is included in the project. For OSX, please download chromedriver from https://sites.google.com/a/chromium.org/chromedriver/ and drop it in /usr/bin/ or any directory that is on the PATH.
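To check whether chromedriver is actually reachable through the PATH, something like the following can help. It is a minimal sketch: `find_on_path` is a hypothetical helper written here for illustration, and it only searches the directories listed in PATH without starting the driver.

```python
import os


def find_on_path(name, path=None):
    """Return the full path of the first executable `name` on PATH, or None."""
    if path is None:
        path = os.environ.get("PATH", "")
    for directory in path.split(os.pathsep):
        if not directory:
            continue
        candidate = os.path.join(directory, name)
        # Must be a regular file that the current user may execute.
        if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            return candidate
    return None


# Try the OSX/Linux binary name first, then the Windows one.
driver = find_on_path("chromedriver") or find_on_path("chromedriver.exe")
print(driver if driver else "chromedriver not found on PATH")
```

If nothing is found, move the downloaded binary into one of the PATH directories, or add its directory to the PATH.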