Project objective and overview:
The present project aims to use the Kaggle dataset, "US Cost of Living Dataset (1877 Counties)" by asaniczka to practice skills in data engineering, analysis, visualization, predictive modeling, diagnostic analysis, and presentation of insights.
According to the Kaggle dataset page,
The US Family Budget Dataset provides insights into the cost of living in different US counties based on the Family Budget Calculator by the Economic Policy Institute (EPI). This dataset offers community-specific estimates for ten family types, including one or two adults with zero to four children, in all 1877 counties and metro areas across the United States. |
I aim to maintain this page to document my progress on this project. Below, I post my workflow that contributed productively to the goals of the project, and I include ideas to work on and their respective statuses.
The average time I spend weekly on this project is approximately 5 hours. I anticipate that every hour I spend on the project
Feature matrix:
Feature | Scope | Milestones and Dates | Status | Roadblocks | Value to Add | Notes |
---|---|---|---|---|---|---|
Various Exploratory Analyses of Data | Broad look into dataset to bring out questions about what the data can show. |
|
EDA complete; delving deeper! |
|
Gain a better idea of the data being investigated; formulate specific and well-formed questions to ask and examine. | Notes |
Feature | Scope | Milestones and Dates | Status | Roadblocks | Value to Add | Notes |
Ideas | Backlog | Priority | Work in Progress | Completed |
---|---|---|---|---|
a | Backlog | Priority | Work in Progress | Completed |