Classification of Data Science Tasks
Description
Prediction
Causal Inference
An Example: Physical Activity and Cardiovascular Health
What are you trying to learn?
What are the ideal data to answer this question?
A large national health survey (e.g., NHANES) that includes self-reported or accelerometer-measured physical activity data.
Demographic information (age, sex, education, etc.) to describe the population.
What’s the purpose of this?
What are you trying to learn?
What are the ideal data to answer this question?
Longitudinal data from wearable fitness trackers that capture continuous measures like heart rate variability, step count, sleep patterns, and stress levels.
Follow-up clinical data on cardiovascular health outcomes (e.g., diagnosis of hypertension, heart disease).
What’s the purpose of this?
What are you trying to learn?
What are the ideal data to answer this question?
What’s the purpose of this?