Ellasx7/data550_midt_project

Project description

The project aims to analyze Mexico's COVID-19 dataset to identify patterns, relationships, and impacts of the pandemic. The team will clean and preprocess the data, conduct statistical tests to explore associations between variables, and evaluate the performance of statistical models used in the analysis.

The process involves:

  • Data Pre-processing: Cleaning and preparing the data for analysis.
  • Chi-square Analysis: Testing the relationship between categorical variables.
  • Association Analysis: Using logistic regression to explore deeper associations in the data.
  • Model Evaluation: Assessing the accuracy and effectiveness of our models.
  • Final Report: Compiling all findings into a comprehensive report, tailored to suit different audience needs.
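
The chi-square, regression, and evaluation steps above can be sketched in R roughly as follows. This is a minimal illustration on simulated data, not the project's actual code: the variable names (`sex`, `age`, `outcome`), the outcome categories, and the use of `nnet::multinom` as the model-fitting function are all assumptions made for the example.

```r
# Minimal sketch on simulated data; every variable name here is hypothetical.
# Assumption: the nnet package (a recommended package shipped with R) is available.

set.seed(550)
n <- 300
toy <- data.frame(
  sex = factor(sample(c("female", "male"), n, replace = TRUE)),
  age = round(runif(n, 18, 90))
)

# Simulate a three-level outcome whose distribution shifts with age
severity <- toy$age / 90 + rnorm(n, sd = 0.3)
toy$outcome <- cut(severity, breaks = c(-Inf, 0.4, 0.8, Inf),
                   labels = c("outpatient", "hospitalized", "icu"))

# Chi-square test of independence between two categorical variables
tab <- table(toy$sex, toy$outcome)
chi <- chisq.test(tab)
print(chi)

# Multinomial logistic regression of the outcome on age and sex
fit <- nnet::multinom(outcome ~ age + sex, data = toy, trace = FALSE)

# Simple evaluation: confusion matrix and in-sample accuracy
pred <- predict(fit, newdata = toy, type = "class")
confusion <- table(predicted = pred, observed = toy$outcome)
accuracy <- mean(as.character(pred) == as.character(toy$outcome))
print(confusion)
print(accuracy)
```

In-sample accuracy is used here only to keep the sketch short; a held-out test set or cross-validation would give a less optimistic estimate.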

The goal is to provide insights into the COVID-19 impact in Mexico through detailed statistical analysis and modeling.

The COVID-19 report is split into two subprojects, which share a common data/ directory.

  • data/ contains the cleaning code, the original data file, and the cleaned output
    • data/clean.R produces data_clean.rds
      • teammates can use either the original file or data_clean.rds
  • subproject1/ contains all code and output related to the descriptive analysis
    • subproject1/code/descriptive_analysis.R produces tables and plots
      • output should be saved to subproject1/output/descriptive_output/
    • subproject1/code/chi-square.R produces the chi-square results containing tables and plots
      • output should be saved to subproject1/output/chi-square/
    • subproject1/report.Rmd reads in output from subproject1/output/ and creates the report for the descriptive analysis
  • subproject2/ contains all code and output related to the regression analysis and model evaluation
    • subproject2/code/models.R fits multinomial logistic regression models
      • summary tables and plots should be saved to subproject2/output/model/
    • subproject2/code/model_evaluation.R evaluates the effectiveness of the models
      • summary tables and plots should be saved to subproject2/output/model_evaluation/
    • subproject2/report.Rmd reads in output from subproject2/output/ and creates the report for the regression analysis
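
The clean-once, reuse-everywhere pattern in the layout above (data/clean.R writes data_clean.rds, and the subproject scripts read it) might look like the sketch below. The column names and the recoding are hypothetical, and a temporary directory stands in for data/ so that the example runs anywhere.

```r
# Sketch of the clean-then-reuse pattern; columns and recoding are hypothetical.
raw <- data.frame(
  AGE = c(34, 61, NA, 47),
  SEX = c(1, 2, 2, 1)
)

# Cleaning step: drop incomplete rows and label the coded variable
clean <- raw[complete.cases(raw), ]
clean$SEX <- factor(clean$SEX, levels = c(1, 2), labels = c("female", "male"))

# A temporary directory stands in for data/ in this sketch
out_path <- file.path(tempdir(), "data_clean.rds")
saveRDS(clean, out_path)

# Downstream scripts read the cleaned object back with column types preserved
data_clean <- readRDS(out_path)
str(data_clean)
```

Saving as .rds rather than .csv keeps factor levels and column types intact, so each analysis script does not have to repeat the recoding.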

The subprojects are combined in combined_report.Rmd. config.yml will be used to format tables and plots and to parameterize the reports.

  • child documents are used to stitch together a final report
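
One way a config.yml could drive table and plot formatting is through the `config` package, sketched below. This is an assumption: the README does not say which package reads the file, and the keys (`table_digits`, `plot_width`) and the `production` configuration are hypothetical.

```r
# Sketch only: the keys below are hypothetical, not the project's actual config.yml.
# Assumption: the config package is installed.

cfg_path <- file.path(tempdir(), "config.yml")
writeLines(c(
  "default:",
  "  table_digits: 2",
  "  plot_width: 6",
  "production:",
  "  table_digits: 3"
), cfg_path)

# Read the default configuration, then one that overrides a single value
cfg_default <- config::get(file = cfg_path)
cfg_prod <- config::get(config = "production", file = cfg_path)

print(cfg_default$table_digits)
print(cfg_prod$table_digits)
print(cfg_prod$plot_width)  # not set under production, so inherited from default
```

Calling `config::get()` with the namespace prefix, rather than attaching the package, avoids masking `base::get()`.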

Languages

  • R 86.5%
  • Makefile 13.5%