stepan5dol/practicum_projects

Study projects from the "Data Science" course at Yandex.Practicum

Taken in numerical order, the projects reflect successive stages of learning the Data Scientist profession.

Most of the projects are based on real data.

Each project was aimed at training a specific skill. The united projects were meant to consolidate all previously studied material.

Table of contents

| # | Name | Skills | Description | Libraries |
|---|------|--------|-------------|-----------|
| 1 | Yandex.Music | Pandas | A study of the behavior of Yandex.Music users in two big cities. | pandas |
| 2 | Investigation of the reliability of borrowers | Data Preprocessing | Analysis of statistics on the solvency of customers of a bank's credit department. | seaborn, pandas, matplotlib |
| 3 | Research of ads for the sale of apartments | EDA | Based on Yandex service data. Determine the market value of real estate and set parameters that will help track anomalies and fraudulent activity. | pandas, seaborn, matplotlib |
| 4 | Determination of a promising tariff for a telecom company | SDA | A study of customer behavior, based on a preliminary analysis of tariffs on a small sample of customers, and identification of the best tariff for a certain category of customers. | numpy, seaborn, pandas, matplotlib |
| 5 | United Project 1: Computer games store | DP + EDA + SDA | A study of game sales around the world in order to identify patterns that determine the success of a game. | numpy, seaborn, pandas, matplotlib, scipy |
| 6 | Recommendation of tariffs | ML: RandomForestClassifier, LogisticRegression | Basic classification models trained to analyze customer behavior and offer users a new tariff. | scikit-learn, numpy, seaborn, pandas, matplotlib |
| 7 | Customer outflow | RandomForestClassifier, AUC-ROC, OrdinalEncoder | Building a model that forecasts the outflow of customers from a bank. | pandas, sklearn, pandas_profiling |
| 8 | Choosing the location for the well | Bootstrap, StandardScaler & MinMaxScaler | Building a machine learning model that helps determine the region where mining will bring the greatest profit. Analysis of possible profits and risks with the bootstrap technique. | sklearn, numpy, scipy, pandas |
| 9 | Recovery of gold from ore | LinearRegression, Ridge, ElasticNet, Bootstrap, CrossValidation, KNNImputer, StandardScaler & MinMaxScaler | Construction of a model that predicts the recovery rate of gold from gold-bearing ore. The model helps optimize production so as not to launch an enterprise with unprofitable characteristics. | sklearn, numpy, scipy, pandas, seaborn |
| 10 | Linear Algebra | Understanding matrix structure and the benefits of vector calculations, matrix multiplication, LaTeX use | Working with matrices to encrypt private data and use it in ML. | sklearn, numpy, scipy, pandas, seaborn |
| 11 | Determining car prices | RandomizedSearchCV, CatBoostRegressor, LGBMRegressor, XGBoost | Building a model for quickly determining the cost of cars. | catboost, lgbm, xgboost |
| 12 | Forecasting taxi orders | Time Series (statsmodels.tsa.seasonal) | Building a model that predicts the number of taxi orders for the next hour in order to attract more drivers during peak load. | statsmodels, pandas, catboost, scikit-learn, numpy |
| 13 | Wikishop | NLP | Building an NLP model that finds toxic comments and submits them for moderation. | pandas, matplotlib, numpy, scikit-learn, seaborn, NLTK, re, catboost, xgboost, lightgbm |
| 14 | Determining the age of buyers | Computer Vision (keras, using ResNet50) | Building a model that determines the age of a person from a photo. | tensorflow, keras, numpy, pandas, matplotlib |
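
Project 10 describes working with matrices to encrypt private data while still using it in ML. The snippet below is a minimal sketch of one way this can work, not the project's actual code. It assumes that the features are masked by multiplying them by a random invertible matrix and that the model is ordinary least squares. The data is synthetic, and the names `fit_predict`, `P`, and `X_masked` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for private features and a target (not the project's data).
X = rng.normal(size=(200, 4))
true_w = np.array([1.5, -2.0, 0.5, 3.0])
y = X @ true_w + rng.normal(scale=0.1, size=200)


def fit_predict(features, target):
    """Fit ordinary least squares with an intercept; return in-sample predictions."""
    A = np.column_stack([np.ones(len(features)), features])
    w, *_ = np.linalg.lstsq(A, target, rcond=None)
    return A @ w


# Random square matrix used as the "key"; redraw until it is well conditioned,
# so that it is safely invertible.
P = rng.normal(size=(4, 4))
while np.linalg.cond(P) > 1e6:
    P = rng.normal(size=(4, 4))

# Masked features: each column is now a mix of all original columns.
X_masked = X @ P

pred_plain = fit_predict(X, y)
pred_masked = fit_predict(X_masked, y)

# Largest absolute gap between the two sets of predictions.
max_diff = float(np.max(np.abs(pred_plain - pred_masked)))
print(max_diff)
```

Under these assumptions the predictions should agree up to floating-point error, because an invertible `P` leaves the column space of the design matrix unchanged. Whoever holds `P` can recover the original features with `X_masked @ np.linalg.inv(P)`.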
