VerneShi/DATA550_Final_Project

Data 550 Final Project

Overview

This project explores obesity trends in the United States using data from the Behavioral Risk Factor Surveillance System. Through detailed data analysis and visualization, it examines the relationship between obesity rates and various behavioral factors across different demographic and socioeconomic stratifications from 2011 to 2022. The final report includes a comprehensive analysis of obesity trends, with summary tables and distribution plots highlighting key findings.

Project Setup

To replicate the computing environment necessary to run this project, follow these steps:

  1. Clone this repository to your local machine.
  2. Ensure that you have R installed on your system.
  3. Navigate to the project directory and run make install from the terminal. This will set up the R environment using the renv package.
  4. Ensure Docker is installed on your system; it is used to containerize the project.
  5. Build the Docker image with the provided Makefile target below. The image contains the R environment with all necessary dependencies.
make project_image
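Before running the steps above, it can help to confirm that the required tools are on your PATH. The following is a minimal sketch, not part of the repository: the function name missing_tools is hypothetical, and the tool list (git, R, make, docker) is inferred from the setup steps.

```shell
#!/bin/sh
# Hypothetical helper (not part of this repository):
# print the name of every tool passed as an argument that is not on PATH.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || echo "$tool"
  done
}

# Example call (tool list assumed from the setup steps):
#   missing_tools git R make docker
```

If the call prints nothing, all listed tools were found; otherwise install the printed tools before running make install or make project_image.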

DockerHub Repository

  1. Accessing the Docker Image: To avoid building locally, view and pull the prebuilt image directly from DockerHub: DockerHub Repository for Data 550 Final Project
  2. Pulling the Docker Image: To pull the latest version of the Docker image:
docker pull yiweishi/data550_final_project:latest
  3. Generating the Report Using Docker: After pulling the image, generate the final report by running the Docker container:
docker run --rm -v "${PWD}/final_report:/project/final_report" yiweishi/data550_final_project
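The docker run command above bind-mounts a final_report directory from the host. If that directory does not exist yet, Docker on Linux typically creates it owned by root, which can make the generated report awkward to open or delete. The sketch below creates the directory first. The image name and container path are taken from the command above; the function name run_report and the DRY_RUN variable are assumptions made for this example.

```shell
#!/bin/sh
# Sketch: run the report container with a pre-created output directory.
# run_report and DRY_RUN are hypothetical names, not part of the repository.
IMAGE="yiweishi/data550_final_project"

run_report() {
  out_dir="${1:-$PWD/final_report}"
  # Create the host directory ourselves so it is owned by the current user.
  mkdir -p "$out_dir" || return 1
  if [ -n "${DRY_RUN:-}" ]; then
    # Print the command instead of running it.
    echo "docker run --rm -v $out_dir:/project/final_report $IMAGE"
  else
    docker run --rm -v "$out_dir:/project/final_report" "$IMAGE"
  fi
}
```

Calling run_report with no arguments mirrors the original command; setting DRY_RUN=1 only prints the command that would be executed.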

Generating the Final Report

The final report is generated from the final_report.Rmd document, which compiles the analysis results, including tables and figures, into a cohesive narrative. To generate the report:

  1. Ensure all prerequisites are installed.
  2. Open a terminal and navigate to the project's root directory.
  3. Execute the command make all. This will run the data processing, analysis, and plotting scripts, followed by knitting the final_report.Rmd into an HTML or PDF document.
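Because the report may be rendered as either HTML or PDF, a quick way to see what make all produced is to search for both. This is a sketch under one assumption: the output file keeps the base name final_report, which is the usual R Markdown default but is not confirmed by this README. The function name find_report is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: list rendered reports under a directory
# (defaults to the current directory). Assumes the output keeps the
# base name of final_report.Rmd.
find_report() {
  find "${1:-.}" -type f \( -name 'final_report.html' -o -name 'final_report.pdf' \)
}
```

Run find_report from the project's root directory after make all finishes; no output means no rendered report was found under that name.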

Contents of the Final Report

The report includes:

  • An introduction to the dataset and research objectives.
  • A data processing section detailing the cleaning and preparation steps.
  • Descriptive analysis findings, with summary tables highlighting key statistics.
  • Visualizations of obesity trends over time and across various stratifications.
  • Conclusions and potential areas for further research.

Key Components

  • Summary Tables: The analysis.R script generates summary tables of key statistics for each stratification and saves them in the output/tables directory.

  • Figures: The main figure illustrating the trend of obesity over the years is created in the plotting.R script. This and other figures are saved in the output/figures directory.
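To confirm that both output directories were populated after a build, a small check like the following may be useful. It is a sketch only: the directory names come from the list above, while the function names has_files and check_outputs are hypothetical and not part of the repository.

```shell
#!/bin/sh
# Hypothetical helper: succeed only if the directory exists and
# contains at least one regular file.
has_files() {
  [ -d "$1" ] && [ -n "$(find "$1" -type f | head -n 1)" ]
}

# Report the state of output/tables and output/figures under a project
# root (defaults to the current directory). Returns non-zero if either
# directory is missing or empty.
check_outputs() {
  root="${1:-.}"
  status=0
  for dir in output/tables output/figures; do
    if has_files "$root/$dir"; then
      echo "ok: $dir"
    else
      echo "missing or empty: $dir"
      status=1
    fi
  done
  return "$status"
}
```

Running check_outputs from the project's root directory prints one line per directory, and its exit status can be used in scripts to stop early when an output is missing.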
