This project looks at the 2019 Stack Overflow Developer Survey to understand what technologies developers were using in 2019 and what they wanted to learn next. I also bring in job‑market signals (GitHub Jobs API) and a small web‑scraped dataset to compare survey trends with real hiring demand.
The end result is a simple BI‑style report and dashboard (see the slides / PDF in this repo).
- Collect job‑market data using APIs and basic web scraping.
- Clean and wrangle the survey dataset into a usable format.
- Explore the data with EDA to find patterns and gaps.
- Build visualisations that answer “what’s popular now vs next.”
- Summarise findings in a BI report/dashboard.
-
Stack Overflow Developer Survey 2019 (subset)
A cleaned subset of the public 2019 survey. It includes:- Technologies worked with and desired next year
(languages, databases, platforms, web frameworks) - Demographics
(age, gender, education, country, etc.)
Files used here:
- Technologies worked with and desired next year
-
GitHub Jobs API (job demand snapshot)
Used to count how many jobs mention certain technologies. -
Web‑scraped list of popular programming languages
A small scrape to practice extraction + saving results to CSV.
This repo follows a full mini data‑science / BI pipeline:
-
Collecting job data through APIs
Notebook:1. Collecting_job_data_using_APIs.ipynb- Pulls job listings from the GitHub Jobs API
- Counts demand for specific technologies
- Saves results into a spreadsheet
-
Collecting extra data through web scraping
Notebook:2a. Web_Scraping.ipynb- Scrapes a public webpage listing popular languages
- Stores results in a CSV for comparison
-
Survey dataset exploration
Notebook:2b. Explore_Dataset.ipynb- Loads the survey data
- Checks shape, data types, missing values, duplicates
-
Data wrangling
Notebook:3. Data_Wrangling.ipynb- Removes duplicates and irrelevant columns
- Handles missing data
- Normalises multi‑select columns (e.g., multiple languages per respondent)
-
Exploratory data analysis
Notebook:4. Exploratory_Data_Analysis.ipynb- Looks for distributions and relationships
- Aggregates counts for “worked with” vs “desired next year”
-
Data visualisation + BI reporting
Notebook:5. Data_Visualization.ipynb- Builds charts for technology usage, trends, and demographics
- Output is summarised into a BI report / dashboard:
Top languages developers worked with
- JavaScript
- HTML/CSS
- SQL
- Bash/Shell/PowerShell
- Python
- Java
- C#
- TypeScript
- PHP
- C++
Top databases developers worked with
- MySQL
- Microsoft SQL Server
- PostgreSQL
- SQLite
- MongoDB
- Redis
- Elasticsearch
- Oracle
- MariaDB
- Firebase
Top platforms developers worked with
- Linux
- Windows
- Docker
- AWS
- Slack
- MacOS
- Android
- Microsoft Azure
- Raspberry Pi
- WordPress
Top web frameworks developers worked with
- jQuery
- Angular / Angular.js
- React.js
- ASP.NET
- Express
- Spring
- Vue.js
- Flask
- Django
- Laravel
Top languages developers wanted to learn next
- JavaScript
- HTML/CSS
- Python
- SQL
- TypeScript
- C#
- Bash/Shell/PowerShell
- Java
- Go
- Kotlin
Big takeaway: Python, TypeScript, Go, and Kotlin were climbing fast even if they weren’t top‑3 in current use.
Top databases developers wanted next
- PostgreSQL
- MongoDB
- Redis
- MySQL
- Elasticsearch
- Microsoft SQL Server
- SQLite
- Firebase
- MariaDB
- DynamoDB
Big takeaway: PostgreSQL and modern NoSQL/fast‑cache tools were clearly gaining momentum.
Top platforms developers wanted next
- Linux
- Docker
- AWS
- Windows
- Android
- Kubernetes
- MacOS
- Raspberry Pi
- Google Cloud Platform
- Slack
Big takeaway: Docker + Kubernetes interest shows how strongly cloud‑native dev was taking over.
Top web frameworks developers wanted next
- React.js
- Vue.js
- Angular / Angular.js
- ASP.NET
- jQuery
- Express
- Spring
- Django
- Flask
- Ruby on Rails
Big takeaway: React and Vue were the main “next‑step” frameworks people were aiming for.
- Gender split is heavily uneven in this subset:
- “Man” respondents dominate by a lot, with “Woman” and non‑binary groups much smaller.
- Average respondent age is about 31, and the median is 29.
- Top countries represented: United States, India, United Kingdom, Germany, Canada.
- Education level: most respondents report a Bachelor’s degree, followed by Master’s degrees.
This project is notebook‑based. Open the .ipynb files in Jupyter Lab / VS Code.
Main libraries used: pandas, numpy, matplotlib, seaborn, requests, beautifulsoup4.
Example install:
pip install pandas numpy matplotlib seaborn requests beautifulsoup4- Cleaned and normalised survey datasets (CSV + SQLite)
- EDA + visualisation notebooks
- BI dashboard summary in slides/PDF:
- Add 2020–2024 survey years to show trend direction over time.
- Compare survey popularity with job‑market popularity more formally.
- Turn the dashboard into a live Power BI / Tableau / Dash app.
- Segment trends by role (student vs professional, backend vs frontend, etc.).
Yasir Savanur
LinkedIn: https://www.linkedin.com/in/yasir-savanur/