This website shows the results of a project I've created for a Health Data Science Course at the University of Southern California-- Keck School of Medicine. The dataset Iām using for this project is a year of data collected on my fitness and sleep activity through my FitBit. Some of the important variables in this dataset are: date, day of the week, steps, floors, distance, calories burned, calories burned from activity, weight, minutes of sleep, Los Angeles daily average temperature, and binary variables of pre-covid 19 pandemic/post-pandemic, and whether or not I was traveling outside of Los Angeles on any given day. This dataset spans October 2019-September 2020.
Activity is an important part of a healthy lifestyle, and the CDC recommends 150 minutes a week of moderate-intensity exercise, or 75 minutes a week or vigorous-intensity, or a an equivalent combination (1). One study found that adults average 4,000-18,000 steps a day. The study claims that a goal of 10,000 steps per day is reasonable for healthy adults (2). Recommended sleep for adults is generally between 7-9 hours per night (3), with the goal to not get too little, or too much sleep. Physical activity is beneficial not only for general health, but also for mental health. As such, it has been recommended to stay physically active during the Covid-19 pandemic, although types of exercise may have to shift from going to the gym to doing outdoor or home-based workouts. However, it has recently been found that step counts decreased worldwide in the period after COVID-19 was declared a global pandemic (4).
Ultimately, I wanted to explore my health and fitness, and how it interacted with external factors such as weather, the Covid-19 pandemic, and days of the week.
After downloading the FitBit and Weather data, I cleaned and wrangled the dataset using what I learned in the data science course. I created extra categorical variables, and checked for missing data and outliers. Presented below are some of the findings from my analysis.
Post-Covid (N=201) |
Pre-Covid (N=165) |
Overall (N=366) |
|
---|---|---|---|
Steps | |||
Mean (SD) | 9910 (4170) | 10800 (4470) | 10300 (4330) |
Median [Min, Max] | 10400 [1980, 26100] | 10900 [2230, 23200] | 10500 [1980, 26100] |
Missing | 24 (11.9%) | 3 (1.8%) | 27 (7.4%) |
Average Daily Temperature (degrees farenheit) | |||
Mean (SD) | 72.2 (9.64) | 62.1 (7.38) | 68.0 (10.1) |
Median [Min, Max] | 73.5 [48.0, 98.5] | 61.0 [46.0, 79.0] | 68.0 [46.0, 98.5] |
Missing | 10 (5.0%) | 30 (18.2%) | 40 (10.9%) |
Minutes Asleep | |||
Mean (SD) | 476 (75.4) | 470 (86.7) | 473 (80.6) |
Median [Min, Max] | 469 [289, 754] | 469 [289, 738] | 469 [289, 754] |
Missing | 41 (20.4%) | 33 (20.0%) | 74 (20.2%) |
Weight (lb) | |||
Mean (SD) | 137 (4.52) | 142 (1.28) | 139 (4.30) |
Median [Min, Max] | 136 [130, 145] | 141 [140, 145] | 141 [130, 145] |
Daily Calories Burned | |||
Mean (SD) | 2130 (465) | 2330 (359) | 2220 (431) |
Median [Min, Max] | 2100 [1360, 4400] | 2340 [1440, 3160] | 2180 [1360, 4400] |
This scatterplot shows how my step count changed over time, as well as how my step count interacted with the daily average temperature, which is shown by the size of each point on the plot. There isn't too much range in temperature, since Los Angeles generally doesn't get cold very often and doesn't fluctuate all too much, besides getting very got in the summer. You can see from the sizes of the points on the right that the weather got a lot hotter in July-September, which is to be expected. It's also worth noting that I tended to have higher step counts in the beginning of the dataset than towards the end.
This plot shows my calories burned (y-axis) and step count (size of point) over time, and is also colored by weekday. Some of the highest points of calories burned are on Saturdays, and higher calories burned tend to generally be associated with higher step counts.
In this scatterplot, you can see how the calories I burned per day are related to my minutes of sleep time. The size of the points corresponds to calories.
This boxplot demonstrates how the Covid-19 pandemic affected my activity level in terms of calories burned daily. The median amount of calories burned was higher before Covid-19 than after.
This final plot illustrates some of the key variables in this dataset. The size of the points is calories burned, which is closely related to step count. It is evident that there is an inverse relationship with step count and average daily temperature-- that is, when it is hotter outside, I walk less steps than when it is colder.
If you would like to read more details about my project including the methodology, and more detailed results (with extra plots, as well as significance tests), please feel free to download my report at: https://github.com/mbolshakova/PM-566-Final-Project/blob/master/Final-Report.pdf
1- CDC Physical Activity Guidelines (2020) https://www.cdc.gov/physicalactivity/basics/adults/index.htm
2- Tudor-Locke, C., Craig, C. L., Brown, W. J., Clemes, S. A., De Cocker, K., Giles-Corti, B., ... & Oppert, J. M. (2011). How many steps/day are enough? For adults. International Journal of Behavioral Nutrition and Physical Activity, 8(1), 1-17
3- Sleep Guidelines (2020) https://www.sleepfoundation.org/articles/how-much-sleep-do-we-really-need#:~:text=National%20Sleep%20Foundation%20guidelines1,to%208%20hours%20per%20night.
4- Tison, G. H., Avram, R., Kuhar, P., Abreau, S., Marcus, G. M., Pletcher, M. J., & Olgin, J. E. (2020). Worldwide effect of COVID-19 on physical activity: a descriptive study. Annals of internal medicine.