!-- start of navigation -->
covid header

Another Internet-Epidemiologist,
GREAT! 🤦‍♂️

Do not fret, there will be no talk about Rt nor R0; instead, we will import CDC data from across the United States and display the data to gain insights. Use this colab link to get a detailed view into the process, comments welcomed. This blog is for reference, and a rough gauge of risk relative to the population of the United States.

About the Data

States and territories of the United States submit patient-level data to the CDC and HealthData.gov aggregates and standardizes the data. At the time of use, the most recent update to the database was made first day of September. Data represented within this database is subject to change and has a 14-day delay in reporting. Quality assurance measures are put in place to ensure consistency (onset date in the future, those values are set to null until an update is received). In total, this dataset contains in excess of 3 million rows representing individuals.

Processing Techniques

After import, reported mortality was prioritized by removing any observations whose definitiveness was not included; any 'Missing' or 'Unknown' values in mortality were removed and not used for analysis. Age groups not reported are labeled as 'Unknown'. This data also measures the presence of comorbidity--other significant health issues present at time of reporting--and cleaning was conducted by renaming 'Missing' values as 'Unknown'. In total roughly 48% of the data was removed solely on the presence of a definitive mortality outcome. Before calculating sums, both mortality and comorbidity were converted to binary data.

Who's at Greatest Risk from COVID-19?

Besides, anything that isn't human; we all have a risk of contracting and spreading COVID-19 to others. With the holidays fast approaching and as we collectively poke our heads out of our cave-of-isolation, desperate for closer human contact, we all should tread carefully armed with actionable information to protect those most at risk. Who should we focus on protecting most? In our first look--comparing mortality grouped by age--the majority of survival occurs in the younger population. As age increases, the amount of mortality due to COVID-19 also increases. A jump in mortality becomes evident when the age approaches 50 years old. Below is an interactive graph with the capability of toggling mortality:


As the age increases, the change in mortality is drastic. The first biggest jump occurs around the 50-59 age group. At the other extreme, those below 39 years seem to be experiencing the least mortality due to COVID-19. Transitioning to observing comorbidity of the patients, we see a flip at roughly 40 years old where the amount of people reporting comorbidity starts to overtake those who report none. This correlates with the jump in mortality for this age range.


Conclusion

Based on the information processed, it is safe to conclude that older and sicker populations of people bear the brunt of mortality from COVID-19. Those who have comorbidity also share the majority of mortality. Protecting those populations is of most importance and should be prioritized to limit their exposure to COVID-19.

Related posts