Data Analysis with Microsoft Excel: Updated for Office 2007

(Tuis.) #1

346 Statistical Methods


c. Create a scatter plot of mortality
versus temperature and add a linear
trend line to the plot, showing both the
R^2 value and the regression equation.
d. Calculate the regression statistics for
the new data set.
e. Create scatter plots of the residuals of
the regression equation versus tem-
perature and the predicted values.
Also create a normal probability plot
of the residuals.
f. Calculate the Pearson and Spearman
correlation coeffi cients, including the
p values.
g. Save your changes to the workbook
and write a report summarizing your
results, including a description of the
diagnostic tests you performed. How
does this regression compare with the
regression you performed earlier that
included the possible outlier? How do
the diagnostic plots compare?


  1. You’ve been given an Excel workbook
    containing nutritional information on 10
    wheat products. Perform the following
    analysis:
    a. Open the Wheat workbook from the
    Chapter08 folder and save it as Wheat
    Regression Analysis.
    b. Plot Calories versus ServingGrams,
    adding a regression line, equation,
    and R^2 value to the plot. How does
    the serving size (in grams) predict
    the calories of the different wheat
    products?
    c. Compute Pearson’s correlation and
    the corresponding p value between
    Calories and ServingGrams.
    d. Use the Data Analysis ToolPak to cal-
    culate the statistics for the regression
    equation.
    e. Create diagnostic plots of residuals
    versus ServingGrams, and the normal
    probability plot of the residuals. Do
    the regression assumptions seem to be
    satisfi ed?


f. In the plot of residuals versus pre-
dicted values, label each point with
the food type (pretzel, bagel, bread,
etc). Where do the residuals for the
breads appear?
g. Breads are often low in calories be-
cause of high moisture content. One
way of removing the moisture content
from the equation is to create a new
variable that sums up the total of the
nutrient weights. With this in mind,
create a new variable, total, which
is the sum of the weights of carbo-
hydrates, proteins, and fats. From this
total subtract the value of the Fiber
variable since fi ber does not contrib-
ute to the calorie total. Plot Calories
versus Total on a new chart sheet.
h. Redo your regression equation, re-
gressing the Calories variable on the
new variable Total. How does the di-
agnostic plot of residuals versus pre-
dicted values compare to the earlier
plot? How do the R^2 values compare?
Where are the residuals for the bread
values located?
i. Save your changes to the workbook
and write a report summarizing your
observations.


  1. Continue to investigate the nutritional
    data in the Wheat workbook by perform-
    ing the following analysis:
    a. Open the Wheat workbook from the
    Chapter08 folder and save it as Wheat
    Correlation Matrix.
    b. Create scatter plot and correlation ma-
    trices (Pearson correlation only) for
    the variables serving grams, calories,
    protein, carbohydrate, and fat.
    c. Why is fat so weakly related to the
    other variables? Given that fat is
    supposed to be very important in
    calories, why is the correlation so
    weak here?
    d. Would the relationship between the
    fat and calories variables be stronger

Free download pdf