Data Analysis with Microsoft Excel: Updated for Office 2007

(Tuis.) #1

178 Fundamentals of Statistics



  1. You’ve been given an Excel workbook
    containing annual salary fi gures for ma-
    jor league baseball players (in terms of
    hundreds of thousands of dollars) for the
    2007 season. Use the workbook to calcu-
    late statistics on the players’ salaries.
    a. Open the Baseball workbook from the
    Chapter04 folder and save it as
    Baseball Salary Statistics.
    b. Create a histogram of the players’ sala-
    ries with bin intervals of $1,000,000
    ranging from $0 up to $25,000,000.
    Have the counts within each bin be $
    the bin value and < the next bin value.
    c. Create a frequency table of the play-
    ers’ salaries, using the same bin inter-
    vals and options you used to create
    the histogram.
    d. Calculate the 10th and 90th percen-
    tiles of the salaries.
    e. Using the value for the 90th percen-
    tile, fi lter the player data to show only
    those players who were paid in the
    upper 10% of the salary range.
    f. What is the average player’s salary?
    What is the median player’s salary?
    If a player made the average salary, at
    what percentile would he be ranked
    in the data?
    g. Save your changes to the workbook
    and write a report summarizing your
    calculations.

  2. You’ve been asked to compare the
    changing nature of baseball salaries. An
    Excel workbook has been prepared for
    you that contains salaries from the years
    1985, 2002, and 2007. Examine these
    salaries and prepare a statistical report.
    a. Open the Salary Comparison work-
    book from the Chapter04 folder
    and save it as Salary Comparison
    Statistics.
    b. Create a histogram of the salary data
    broken down by year. Have the his-
    togram display salaries in $1,000,000


intervals up to $25,000,000 and have
the bars of the histogram display the
percentages, not the frequencies, for
each bar. Display the histogram bars
side by side rather than stacked. Does
the distribution of the salaries appear
the same in the three years?
c. Calculate the count, mean, median,
percentiles, skewness, and kurtosis
for the three years. How does the typi-
cal salary compare within the three
years? What do players in the upper
10% of each year make?
d. Has the distribution of the salaries
become more skewed or less skewed
or remained the same over the years?
Answer this question by examining
the skewness and kurtosis statistics.
You’ve learned that the 2007 data are
just for starters. How could this affect
your conclusion?
e. Save your changes to the workbook
and then write a report summarizing
your calculations.


  1. The Cancer workbook contains data
    comparing the cigarette per capita for
    each of the 50 states and the District of
    Columbia to those state’s rates of bladder
    cancer, kidney cancer, lung cancer, and
    leukemia per 100,000. Each state was
    ranked on the basis of cigarette use with
    0 for low rates of cigarette use, 1 for me-
    dium, and 2 for high. Analyze the data
    from this workbook.
    a. Open the Cancer workbook from the
    Chapter04 folder and save it as
    Cancer Statistics.
    b. Create boxplots of the rates of bladder
    cancer, kidney cancer, lung cancer,
    and leukemia broken down by ciga-
    rette use. Label the charts and chart
    sheets appropriately.
    c. Create a table of univariate statistics
    for the rate of each illness broken
    down by the cigarette use category.

Free download pdf