Data Analysis with Microsoft Excel: Updated for Office 2007

(Tuis.) #1

176 Fundamentals of Statistics


e. Create a boxplot of the size of the
homes in square feet. Place the boxplot
in a chart sheet named Sq Ft. Boxplot.
f. What value appears to be an extreme
outlier in the boxplot?
g. Create a second boxplot of house size,
this time breaking the boxplot down
by whether the home is a corner lot.
Place the plot in a chart sheet named
Sq Ft. Boxplot by Corner Lot.
h. Interpret your boxplot in terms of the
relationship between size of a house
and whether it lies on a corner lot.
i. What happened to the extreme outlier
you identifi ed earlier? Discuss this in
terms of the defi nition of outlier given in
the text. Is this value an outlier or not?
j. Recreate the table of univariate statis-
tics for house size, and this time break
the table down by the Corner Lot vari-
able. Place the table in a worksheet
named Sq Ft. Stats by Corner Lot.
k. Create a new column containing the
price per square foot of each house.
Assign the values in the new column
a range name. What type of variable
is this?
l. Create a histogram with 20 evenly
spaced bins of the price per square
foot on a chart sheet named PPSqFt
Histogram. Count the bins totals to
the right of the cutoff points.
m. What is the shape of the distribution
of price per square foot?
n. Create a boxplot of the price per
square foot saved to the chart sheet
PPSqFt. Boxplot. Are there any severe
outliers? How does the median value
compare to the mean value?
o. Save your workbook and then
write a report summarizing your
observations.


  1. Data have been recorded on 50 of the
    largest woman-owned businesses in
    Wisconsin. Analyze and report the
    descriptive statistics on this data set.


a. Open the Woman-Owned Businesses
from the Chapter04 folder and save it
as Woman-Owned Business Statistics.
b. Create a table of the distribution, vari-
ability, and summary statistics except
the mode for the Employees variable.
Store the table in a worksheet named
Employee Stats.
c. What is the average number of em-
ployees for the 50 businesses? What
is the median amount? Which statistic
do you think more adequately
describes the size of these businesses?
How does the average number of em-
ployees compare to the third quartile?
d. Create a boxplot of employees stored
in a chart sheet named Employee
Boxplot. How would you describe
this distribution?
e. Create a new variable containing the
base 10 log of the Employees vari-
able. Assign a range name to this new
column and then create a boxplot of
these values in a chart sheet named
Log Employee Boxplot. How does the
shape of this distribution compare to
the untransformed values?
f. Create a table of descriptive sta-
tistics except the mode for the
log(Employees) and store the table
in a worksheet named Log Employee
Stats. Compare the skewness and
kurtosis values between the Employ-
ees and log(Employees) variables.
Explain how the difference in the dis-
tribution shapes is refl ected in these
two statistics.
g. Calculate the mean log(Employees)
value in terms of the number of peo-
ple employed (in other words, trans-
form this value back to the original
scale). How does this value compare
to the geometric mean of the number
of employees in each company?
h. The geometric mean is used for val-
ues that either are ratios or are best
compared as ratios. Which pair of
Free download pdf