Quick Answer
Correlation describes the statistical relationship between two variables on the Digital SAT. Found primarily in the Math section's Data Analysis questions, it measures how closely data points follow a linear trend. Students encounter this concept approximately 2-4 times per test, often requiring them to distinguish between positive, negative, or no association.
Correlation is a statistical measure that describes the strength and direction of a linear relationship between two quantitative variables. It is often represented by the correlation coefficient $r$, which ranges from -1 to 1, indicating whether variables increase together or move in opposite directions.
Question: A researcher finds that for a group of students, the number of books read per year ($x$) and final exam scores ($y$) have a correlation of $r = 0.85$. What does this suggest? Solution: Since $0.85$ is positive and close to $1$, there is a strong positive correlation. This means as the number of books read increases, exam scores typically increase as well.
Mistake 1: Confusing correlation with causation. Students often assume that because two variables are related, one must cause the other, which the SAT frequently tests as a logic trap.
Mistake 2: Misinterpreting negative correlation strength. Students sometimes believe a negative $r$ value (like -0.9) indicates a 'weak' relationship, when it actually indicates a very strong inverse relationship.
Mistake 3: Overlooking outliers. Students may fail to realize that a single data point far from the trend line can significantly weaken the perceived correlation of the entire set.
Students targeting 750+ should know that while the SAT focuses on linear correlation, you must be prepared to identify when a linear model is inappropriate. If a scatterplot shows a clear 'U' shape or curve, the linear correlation may be near zero even if a strong non-linear relationship exists.
Causation
Causation describes a relationship where one variable directly influences another. On the Digital SAT, this concept typically appears in the Math section under Data Analysis. It is frequently tested through questions involving study design, specifically distinguishing between mere correlation and cause-and-effect relationships established by randomized controlled experiments.
Line of Best Fit
A line of best fit is a linear model that represents the trend of data points in a scatter plot. On the Digital SAT, this concept appears frequently in the Math section, typically within the Problem Solving and Data Analysis domain, requiring students to interpret or select equations that best represent visual data.
Scatter Plot
A scatter plot is a graphical representation of data points on a coordinate plane showing the relationship between two variables. On the Digital SAT, these typically appear in the Math section under Problem Solving and Data Analysis, occurring in approximately 2 to 4 questions per exam.
Slope
Slope measures the constant rate of change in a linear relationship. On the Digital SAT, slope is a high-frequency algebra concept appearing in both Math modules. It typically features in approximately 15-20% of algebra-based questions, requiring students to interpret steepness, calculate rates, or analyze coordinate geometry.
Standard Deviation
Standard deviation is a statistical measure of how spread out data values are from the mean. On the Digital SAT, this concept typically appears in Math Modules 1 or 2 within Data Analysis questions. Students are usually asked to compare the spread of two data sets rather than calculating the exact value.
Correlation on the SAT refers to the association between two variables, typically visualized on a scatterplot in the Math section. It indicates whether the variables tend to increase together (positive) or if one decreases as the other increases (negative). Understanding this concept is vital for the Data Analysis category, as it appears in approximately 2 to 4 questions per exam.
You identify correlation by looking at the pattern of data points on a scatterplot. If the points generally move from the bottom-left to the top-right, the correlation is positive. If they move from the top-left to the bottom-right, it is negative. The closer the points are to forming a straight line, the stronger the correlation is considered to be.
The difference between correlation and causation is that correlation only shows a relationship exists, while causation proves one event triggers the other. On the SAT, you must remember that a strong correlation in observational data does not imply causation. Causal conclusions can typically only be drawn from randomized controlled experiments, not from simple observed associations.
Typically, you will see approximately 2 to 4 questions that directly or indirectly test your understanding of correlation. These are usually found in the Problem Solving and Data Analysis portion of the Math modules. While the exact number varies by test form, mastering the interpretation of scatterplots and association is essential for a high score in the Math section.