7.2 Pearson’s Correlation 99Figure 7.1. Scatter diagram of systolic and diastolic blood pressure (data from Table 7.1 ).
6065707580859095100105100 110 120 130 140 150 160 170 180 190
Systolic BPDiastolic BPdiastolic pressure to be higher than its average when the systolic
pressure is higher than its average and lower than its average when the
systolic pressure is below its average. Next in Section 7.2 , we will see
how the Pearson correlation describes aspects of the scatter plot, and
in Section 7.3 , how a regression line can be plotted through the data
using the least squares criterion.
7.2 PEARSON ’ S CORRELATION
The Pearson correlation coeffi cient is a parameter that has a natural
place in the bivariate distribution. When we have a sample such as in
the scatter plot for systolic and diastolic blood pressure, we can obtain
a sample estimate. Generally, ρ is used to denote the parameter, and r
to denote the sample estimate of ρ.
rXXYY XXYYiiin
in
iiin
in
=−∑∑== 1 ()ˆˆ∑= 1 ()− ∑=()−ˆ ˆ()−.22
1 1