178 Fundamentals of Statistics
- You’ve been given an Excel workbook
containing annual salary fi gures for ma-
jor league baseball players (in terms of
hundreds of thousands of dollars) for the
2007 season. Use the workbook to calcu-
late statistics on the players’ salaries.
a. Open the Baseball workbook from the
Chapter04 folder and save it as
Baseball Salary Statistics.
b. Create a histogram of the players’ sala-
ries with bin intervals of $1,000,000
ranging from $0 up to $25,000,000.
Have the counts within each bin be $
the bin value and < the next bin value.
c. Create a frequency table of the play-
ers’ salaries, using the same bin inter-
vals and options you used to create
the histogram.
d. Calculate the 10th and 90th percen-
tiles of the salaries.
e. Using the value for the 90th percen-
tile, fi lter the player data to show only
those players who were paid in the
upper 10% of the salary range.
f. What is the average player’s salary?
What is the median player’s salary?
If a player made the average salary, at
what percentile would he be ranked
in the data?
g. Save your changes to the workbook
and write a report summarizing your
calculations. - You’ve been asked to compare the
changing nature of baseball salaries. An
Excel workbook has been prepared for
you that contains salaries from the years
1985, 2002, and 2007. Examine these
salaries and prepare a statistical report.
a. Open the Salary Comparison work-
book from the Chapter04 folder
and save it as Salary Comparison
Statistics.
b. Create a histogram of the salary data
broken down by year. Have the his-
togram display salaries in $1,000,000
intervals up to $25,000,000 and have
the bars of the histogram display the
percentages, not the frequencies, for
each bar. Display the histogram bars
side by side rather than stacked. Does
the distribution of the salaries appear
the same in the three years?
c. Calculate the count, mean, median,
percentiles, skewness, and kurtosis
for the three years. How does the typi-
cal salary compare within the three
years? What do players in the upper
10% of each year make?
d. Has the distribution of the salaries
become more skewed or less skewed
or remained the same over the years?
Answer this question by examining
the skewness and kurtosis statistics.
You’ve learned that the 2007 data are
just for starters. How could this affect
your conclusion?
e. Save your changes to the workbook
and then write a report summarizing
your calculations.
- The Cancer workbook contains data
comparing the cigarette per capita for
each of the 50 states and the District of
Columbia to those state’s rates of bladder
cancer, kidney cancer, lung cancer, and
leukemia per 100,000. Each state was
ranked on the basis of cigarette use with
0 for low rates of cigarette use, 1 for me-
dium, and 2 for high. Analyze the data
from this workbook.
a. Open the Cancer workbook from the
Chapter04 folder and save it as
Cancer Statistics.
b. Create boxplots of the rates of bladder
cancer, kidney cancer, lung cancer,
and leukemia broken down by ciga-
rette use. Label the charts and chart
sheets appropriately.
c. Create a table of univariate statistics
for the rate of each illness broken
down by the cigarette use category.