Data Analysis with Microsoft Excel: Updated for Office 2007

(Tuis.) #1
Chapter 8 Regression and Correlation 349

during 1991. INC90 is the percentage
net income for 1990, and similarly,
INC91 is the percentage net income for



  1. CAPRET90 is the percentage capi-
    tal gain for 1990, and CAPRET91 is the
    percentage capital gain for 1991.


a. Open the Fidelity workbook from
the Chapter08 folder and save it as
Fidelity Financial Analysis.
b. What is the correlation between the
percentage capital gains for 1990 and
1991? Do your analysis using both the
Pearson and Spearman correlations,
calculating the p value for both. Is
there evidence to support the sup-
position that the percentage capital
gains from 1990 are highly correlated
with the percent capital gains for
1991?
c. What is the correlation between the
percentage net income for 1990 and
1991? Use both the Pearson and
Spearman correlation coeffi cents and
include the p values. Is net income
from 1990 highly correlated with net
income from 1991?
d. Create a scatter plot for the two cor-
relations in parts a and b. Label each
point on the scatter plot with labels
from the Sector column.
e. You should get a stronger correlation
for income than for capital gains. How
do you explain this?
f. Calculate the correlation between the
percentage increase in net asset value
in 1990 to 1991 using the NAV90 and
NAV91 variables and then generate
the scatter plot, labeling the points
with the sector names. Note that the
Biotechnology Fund stands out in the
plot. It was the only fund that per-
formed well in both years.
g. Compare the Pearson and Spearman
correlation values for NAV90 and
NAV91. Are they the same sign? What
could account for the different corre-
lation values? Which do you think is


more representative of the scatter plot
you created?
h. If the correlation is this weak, what
does it suggest about using fund per-
formance in one year as a guide to
fund performance in the following
year?
i. Save your changes to the workbook
and write a report summarizing your
observations.


  1. The Draft workbook contains information
    on the 1970 military draft lottery. Draft
    numbers were determined by placing
    all 366 possible birth dates in a rotating
    drum and selecting them one by one. The
    fi rst birth date drawn received a draft
    number of 1 and men born on that date
    were drafted fi rst, the second birth date
    entered received a draft number of 2, and
    so forth. Is there any relationship between
    the draft number and the birth date?
    a. Open the Draft workbook from the
    Chapter08 folder and save it as Draft
    Correlation Analysis.
    b. Using the values in the Draft Numbers
    worksheet, calculate the Pearson and
    Spearman correlation coeffi cients and
    p value between the Day_of_the_Year
    and the Draft number. Is there a sig-
    nifi cant correlation between the two?
    Using the value of the correlation,
    would you expect higher draft num-
    bers to be assigned to people born ear-
    lier in the year or later?
    c. Create a scatter plot of Number versus
    Day_of_the_Year. Is there an obvious
    relationship between the two in the
    scatter plot?
    d. Add a trend line to your scatter plot
    and include both the regression equa-
    tion and the R^2 value. How much
    of the variation in draft number is
    explained by the Day_of_the_Year
    variable?
    e. Calculate the average draft number
    for each month and then calculate

Free download pdf