This educational project uses web scraping to extract and analyze sneaker listings from Amazon.sg, focusing on brand, price, rating, and sponsorship data. It provides hands-on experience with dynamic content extraction and reveals patterns like top brands, common price ranges, and high customer ratings.
- Create a new conda environment called "coupang-test" and activate it
- Perform a "pip install -r requirements.txt" to install all of the project's dependencies
- In the project directory, open the terminal and run this command "python3 webscrape_amazon.py"
- In the project directory, open the terminal and run this command "python3 -m notebook"
- Open the "ETL & EDA.ipynb" file
- In part 1 of the script, update the name of the file for read_csv to the .csv file generated from Section 3
- Go to the script's top menu, select 'Kernel' and click on 'Restart Kernel and Run All Cells..."