A basic Python program, built on Selenium, that crawls job-posting information from recruitment sites and stores the results in CSV files and SQLite databases.
- 51job
- Boss zhipin
Tip
Because of a limitation in 51job's API, each search keyword returns at most 200 pages of results.
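A minimal sketch of how a caller might respect that cap when choosing which pages to request. The names `MAX_PAGES` and `pages_to_crawl` are hypothetical and are not taken from this project's source; the only fact used is the 200-page limit stated in the tip.

```python
# Hypothetical helper: clamp a requested page count to the 200-page cap.
MAX_PAGES = 200  # limit per search keyword, as stated in the tip


def pages_to_crawl(requested: int, max_pages: int = MAX_PAGES) -> range:
    """Return the 1-based page numbers to request, never exceeding the cap."""
    if requested < 1:
        return range(0)  # nothing to crawl
    return range(1, min(requested, max_pages) + 1)


if __name__ == "__main__":
    pages = pages_to_crawl(350)
    print(len(pages), pages[0], pages[-1])  # prints: 200 1 200
```

Clamping up front keeps the crawl loop simple, since it never has to detect empty result pages past the limit.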
Python 3.11.2
Edge
beautifulsoup4==4.12.2
colorlog==6.8.0
fake-useragent==1.4.0
pandas==2.1.3
selenium==4.15.2
requests==2.31.0
pip install -r requirements.txt
python test/spider_test.py
├─README.md
├─LICENSE
├─.gitignore
├─requirements.txt
├─log
│ ├─handler_logger.py
│ └─__init__.py
├─output
│ ├─area
│ │ ├─51area.csv
│ │ └─51area.db
│ └─job
│   ├─51job.csv
│   └─51job.db
├─spider
│ ├─jobspider51.py
│ ├─__init__.py
│ └─area
│   ├─areaspider51.py
│   └─__init__.py
└─test
  └─spider_test.py
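The tree lists both a .csv and a .db file under output/job. As a rough illustration of writing the same records to both formats, here is a sketch that uses only the standard library (`csv` and `sqlite3`) rather than the project's pandas dependency. The column names, the table name `job`, and the functions `save_csv` and `save_sqlite` are assumptions made for this example, not the project's actual schema or API.

```python
import csv
import sqlite3
from pathlib import Path

# Hypothetical columns; the real spider's fields may differ.
FIELDS = ["title", "company", "salary"]


def save_csv(rows, path):
    """Write a list of dicts to a CSV file with a header row."""
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(rows)


def save_sqlite(rows, path):
    """Append a list of dicts to a 'job' table in a SQLite database."""
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    conn = sqlite3.connect(path)
    try:
        with conn:  # commits on success, rolls back on error
            conn.execute(
                "CREATE TABLE IF NOT EXISTS job "
                "(title TEXT, company TEXT, salary TEXT)"
            )
            conn.executemany(
                "INSERT INTO job VALUES (:title, :company, :salary)", rows
            )
    finally:
        conn.close()
```

Calling `save_csv(rows, "output/job/51job.csv")` and `save_sqlite(rows, "output/job/51job.db")` with the same list of dicts would produce files matching the layout shown in the tree.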
Warning
This program is intended for learning and research only. Do not use it for commercial or illegal purposes. You are solely responsible for any consequences of violating this notice.