From 1ff442384e26054fc5dbbb121e7611e229954eb6 Mon Sep 17 00:00:00 2001 From: Anshul Gupta Date: Tue, 1 Oct 2019 12:41:06 +0530 Subject: [PATCH] changes --- README.md | 20 +++++++++----------- 1 file changed, 9 insertions(+), 11 deletions(-) diff --git a/README.md b/README.md index c5582dd..d96ff2d 100644 --- a/README.md +++ b/README.md @@ -1,13 +1,13 @@ # SeQueL to Python -This repository constitutes the first beginner project. It includes sample +This repository is constituting the first beginner project. It includes sample code as well as a vagrant box development environment for you to use. -## High Level Instructions +## High Level Instructions are 1. Fork this repository on github to track the changes you make 2. Set up vagrant ([for Mac](https://github.com/ADSA-UIUC/Resources/blob/master/dev-environment/vagrant/mac-setup.md), [for Windows](https://github.com/ADSA-UIUC/Resources/blob/master/dev-environment/vagrant/windows-setup.md)) 3. Gather data from any source on the web (Be careful of rate limits!) @@ -16,12 +16,12 @@ code as well as a vagrant box development environment for you to use. -## Detailed Instructions +## Detailed Instructions are as follows: The two samples in this repository do the same thing in two different ways. -Both print a list of the titles and links on the front page of reddit. The +Both print a list of the titles and links on the front page of reddit. The first, ```html_scraping_sample.py``` uses the BeautifulSoup4 library to read -the html that reddit gives to web browsers. If you take a look at the +the html that reddit gives to web browsers. If you take a look at the code, the important part is the usage of BeautifulSoup to extract all the links in the page that have a class of "title". This is the html scraping method. The other file, ```json_scraping_sample.py``` is how to do it @@ -35,11 +35,9 @@ html, json or another format. Get it into python first, then work on getting int MySQL. -In order to use MySQL, the first thing you need to do is create a table that +In order to use MySQL, the first thing you need to do is create a table that will store your data. For a tutorial on creating a table, see [here] -(http://www.tutorialspoint.com/mysql/mysql-create-tables.htm). Make sure that -the table you design has columns for every piece of information that you want. -Once you have that done, you can use the MySQLdb library to store the data you +(http://www.tutorialspoint.com/mysql/mysql-create-tables.htm). Make sure that +the table you design has columns for every piece of information that you want. +Once you have that done, you can use the MySQLdb library to store the data you gathered in the first part. The code samples include how to do this. - -