Skip to content

FedericoCanepuzzi/Statistics_for_Data_Science

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Statistics_for_Data_Science

Statistics for Data Science project 2021/2022 - University Of Pisa

G. Segurini, F. Canepuzzi

Introduction

This project shows and explains the analysis of the AIDA dataset, which aims at a business failure prediction, risk factors exploration, and distributions investigation.

Question A

  • Compare the distributions of size and age between failed and active companies at a specific year
  • Do they change for a specific company form?
  • Do they change for a specific sector?

Question B

  • Compare the distributions of size and age of failed companies over different years
  • Do they change for a specific company form?
  • Do they change for a specific location?

Question C

  • What is the probability of failures conditional to size/age of firms at a specific year?
  • Does it change for a specific company form?
  • Does it change for a specific sector?
  • Does it change for a specific location?

Question D

  • Fit a parametric model

Question E

  • Extend the model with a selective classification

About

Statistics for Data Science project 2021/2022

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages