- #1

Beth N

- TL;DR Summary
- I am calculating some statistical measures for a homework assignment but do not seem to be getting the correct answer. I would love to clarify my understanding.

**THE PROBLEM**

6) Table 1 below contains data on offensive statistics for each game in the 2019 UW Husky Baseball season. Answer the following questions and/or complete the specified tasks using these data. Do everything by hand and show your work (good practice for the tests).

a. Construct a relative frequency histogram of the runs per game. You will first need to construct a frequency table. Use individual bins for 0, 1, 2, 3…9 runs per game and then a final bin of 10+ runs.

b. Calculate and plot the mean and median value using the raw data. Also calculate the mean using the frequency table from part a. and compare (don’t plot).

c. Calculate the variance, standard deviation, and coefficient of variation using the raw data. Repeat using the frequency table from part a. and compare (don’t plot). Why do the two methods differ so much when the means were fairly close?

**MY ATTEMPT:**

- Using the raw data, I get mean = **5.71**, variance = 14.07, stdev = 3.7, and CV = 65.68%.

- Using the frequency table, where the last few data points are grouped into one bin (10+), I get mean = **6.740**, variance = 4.778, stdev = 2.186, and CV = 32.4%.
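
For reference, the frequency-table numbers above rely on the standard population formulas for grouped data, written out below. The symbols are notation introduced here, not taken from the assignment: $x_i$ is the value representing bin $i$ (the run count itself for bins 0 to 9, and a chosen representative value for the 10+ bin), $f_i$ is the frequency of that bin, and $N = \sum_i f_i$ is the number of games.

$$
\mu = \frac{1}{N}\sum_i f_i\,x_i, \qquad
\sigma^2 = \frac{1}{N}\sum_i f_i\,(x_i - \mu)^2, \qquad
\mathrm{CV} = \frac{\sigma}{\mu}\times 100\%
$$

With one bin per integer value the grouped formulas reproduce the raw-data results exactly; any difference comes only from the value chosen to represent the 10+ bin.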

I am concerned because the prompt says that **the means obtained using both methods are fairly close. I don't think mine are very close, so I don't know whether my method is wrong at some point.**

I am using the population formulas, not the sample formulas. For the frequency table, I set my last bin to be 10-25, with a midpoint of 17.5. Is this correct?
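
To make the two calculations concrete, here is a minimal Python sketch of the approach described above: population statistics from raw values, and the same statistics from a frequency table whose 10+ bin is represented by a single value. The run totals in `runs` are made-up placeholders, not the Table 1 data, and the names `population_stats` and `TOP_BIN_VALUE` are hypothetical helpers introduced only for this illustration.

```python
from collections import Counter
from math import sqrt


def population_stats(values, weights=None):
    """Return (mean, variance, std dev, CV in %) using population formulas.

    `weights` are the frequencies of each value; if omitted, every value
    counts once (i.e. raw data).
    """
    if weights is None:
        weights = [1] * len(values)
    n = sum(weights)
    mean = sum(w * v for v, w in zip(values, weights)) / n
    var = sum(w * (v - mean) ** 2 for v, w in zip(values, weights)) / n
    sd = sqrt(var)
    return mean, var, sd, 100 * sd / mean


# Hypothetical runs-per-game values, NOT the real Table 1 data.
runs = [0, 2, 3, 3, 4, 5, 5, 6, 7, 8, 10, 12, 15]

# Assumed representative value for the 10+ bin (midpoint of a 10-25 bin).
TOP_BIN_VALUE = 17.5

# Frequency table: individual bins 0..9, everything >= 10 goes in one bin.
counts = Counter(min(r, 10) for r in runs)
bins = sorted(counts)
bin_values = [TOP_BIN_VALUE if b == 10 else b for b in bins]
bin_freqs = [counts[b] for b in bins]

raw = population_stats(runs)
grouped = population_stats(bin_values, bin_freqs)

for label, (mean, var, sd, cv) in (("raw", raw), ("grouped", grouped)):
    print(f"{label:8s} mean={mean:.3f} var={var:.3f} sd={sd:.3f} CV={cv:.1f}%")
```

Changing `TOP_BIN_VALUE` and rerunning shows how strongly the grouped mean and variance depend on that one choice, since every game with 10 or more runs is replaced by the same number.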

Attached below are the prompts and the data table.

Thank you very much for your consideration.