1 / 65

Summary Measures of Ungrouped Numerical Data

Summary Measures of Ungrouped Numerical Data. Comparing Data Sets with respect to:. Location. Variability. (Dispersion). Shape. Calculation and Properties of Some Measures of Location of Ungrouped Numerical Data. Location Measures Measures of Central Tendency

montana
Download Presentation

Summary Measures of Ungrouped Numerical Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Summary Measures of Ungrouped Numerical Data

  2. Comparing Data Sets with respect to: Location Variability (Dispersion) Shape

  3. Calculation and Properties of Some Measures of Location of Ungrouped Numerical Data

  4. Location Measures Measures of Central Tendency Measures of Noncentral Tendency

  5. Find the following measures of LOCATION. Central Tendency Measures: _ 1) mean (X) 2) median (M) 3) mode Noncentral Tendency Measures: (Quantiles or Fractiles) 1) First Quartile (Q1) 2) Third Quartile (Q3) 3) Sixth Decile (D6) 4) Eighty-third percentile (P83) First, the mean.

  6. Calculation of the Sample Mean The following sample (n = 7) of data contains the number of years that a company was in business. For the small sample (n = 7), the mean will be calculated. The symbol used for the sample mean is called X-bar. For simplicity, the formula will be written as follows:

  7. Properties of the Sample Mean The following sample (n = 7) of data contains the number of years that a company was in business. For the small sample (n = 7), X-bar = 9.

  8. Properties of the Sample Mean The following sample (n = 7) of data contains the number of years that a company was in business. For the small sample (n = 7), X-bar = 9. The mean is a measure of central tendency (the middle). Q: In what “sense” is the mean in the middle of a set of data?

  9. Properties of the Sample Mean The following sample (n = 7) of data contains the number of years that a company was in business. For the small sample (n = 7), X-bar = 9. The mean is a measure of central tendency (the middle). Q: In what “sense” is the mean in the middle of a set of data? A: It has to do with the deviation from the mean. The mean is the place where the deviations “balance.” It is easier to see by examining the dot plot.

  10. Properties of the Sample Mean The following sample (n = 7) of data contains the number of years that a company was in business. Deviations from the mean. Sum = -15 Sum = +15 Deviations below the mean “balance” the deviations above the mean. The sum of the deviations from the mean EQUALS zero!

  11. Properties of the Sample Mean The following sample (n = 7) of data contains the number of years that a company was in business. Note that the order does not matter. The sum of the deviations from the sample mean is ALWAYS zero!

  12. Properties of the Sample Mean The sample mean is in the “middle” of a sample because the sum of the deviations from the sample mean equals zero. In order for the mean to represent the “middle,” the difference, must be well defined for each value of x. In other words, for the mean to be a useful calculation, the data must be of at least interval level. Next, a LARGE sample is considered.

  13. A Large Sample Example (presented as Raw Data) The following sample (n = 120) of data contains the number of years that a company was in business. Q: What does one need to know to find the mean of the sample above? A: The sum of the values and the sample size.

  14. A Large Sample Example (presented as Raw Data) The following sample (n = 120) of data contains the number of years that a company was in business. If , then

  15. A Large Sample Example (presented as Raw Data) The following sample (n = 120) of data contains the number of years that a company was in business. If , then

  16. A Large Sample Example (presented as Raw Data) The following sample (n = 120) of data contains the number of years that a company was in business. If , then what is the “typical” number of years that a value in the sample differs from the mean?

  17. Find the following measures of LOCATION. Central Tendency Measures: _ 1) mean (X) 2) median (M) 3) mode Noncentral Tendency Measures: (Quantiles or Fractiles) 1) First Quartile (Q1) 2) Third Quartile (Q3) 3) Sixth Decile (D6) 4) Eighty-third percentile (P83) Next, the median.

  18. Determination of the Sample Median The following sample (n = 7) of data contains the number of years that a company was in business. For the small sample (n = 7), the median will be found. The symbol used for the sample median is M. The median of the sample is a value for which the same number of sample values are less than (or equal) as sample values that are greater than (or equal). This is the “sense” in which the median is in the “middle” of the sample. To determine the value of the median, the values in the sample must be sorted. Use the ordered array. In this example, it is easy to see that the median (M) is 10.

  19. Determination of the Sample Median If there is an odd number of values in the sample, there will be a value in the middle (when sorted). That value is the median. If there is an even number of values in the sample, then any value that is between the two values “in the middle” (when sorted) could serve as the median. However, the convention is to go “half way” between those two middle values. For example, remove a 3 from the sample.

  20. Determination of the Sample Median If there is an odd number of values in the sample, there will be a value in the middle (when sorted). That value is the median. If there is an even number of values in the sample, then any value that is between the two values “in the middle” (when sorted) could serve as the median. However, the convention is to go “half way” between those two middle values. For example, remove a 3 from the sample. The value 10.25 could be used as the median. The value 11.75 could be used as the median. However, the convention is to use (10 + 12)/2 = 11 as the value of the median.

  21. Determination of the Sample Median • General Process for determining the median of a sample: • Form the ordered array. • Calculate the position of the median. • Find the sample value at that position (if n is odd) • or average the two values on each side of that position • (if n is even). Note: Later on when finding quantiles (fractiles), a similar process will be used. Next, a LARGE sample is considered.

  22. A Large Sample Example (presented as Raw Data) The following sample (n = 120) of data contains the number of years that a company was in business. Q: What does one need to know to find the median of the sample above? A: The ordered array and the sample size (n).

  23. A Large Sample Example (presented as an Ordered Array) The following sample (n = 120) of data contains the number of years that a company was in business.

  24. A Large Sample Example (presented as an Ordered Array) The following sample (n = 120) of data contains the number of years that a company was in business. Position 60.

  25. A Large Sample Example (presented as an Ordered Array) The following sample (n = 120) of data contains the number of years that a company was in business. Position 60. Position 61.

  26. A Large Sample Example (presented as an Ordered Array) The following sample (n = 120) of data contains the number of years that a company was in business. Position 60. Position 61.

  27. Find the following measures of LOCATION. Central Tendency Measures: _ 1) mean (X) 2) median (M) 3) mode Noncentral Tendency Measures: (Quantiles or Fractiles) 1) First Quartile (Q1) 2) Third Quartile (Q3) 3) Sixth Decile (D6) 4) Eighty-third percentile (P83) Next, the mode.

  28. Determination of the Sample Mode The following sample (n = 7) of data contains the number of years that a company was in business. For the small sample (n = 7), the mode will be found. The mode of the sample is the value that occurs most frequently. It is in the “middle” in a probabilistic sense. If one value is selected randomly from the sample, which value is most likely to be selected? (In a normal distribution, mean = median = mode.) It is easier to determine the value of the mode from the ordered array (below) than from the raw data (above). Next, a LARGE sample is considered. In this example, it is easy to see that the mode is 3.

  29. A Large Sample Example (presented as Raw Data) The following sample (n = 120) of data contains the number of years that a company was in business. Q: What does one need to know to find the mode of the sample above? A: It is a lot easier to find from the ordered array.

  30. A Large Sample Example (presented as an Ordered Array) The following sample (n = 120) of data contains the number of years that a company was in business. Mode = 4. The value 4 occurs most frequently (17 times). The mode is even easier to find from the dot plot.

  31. Dot Plot of Years Datafor Large Sample (n=120)

  32. Quasi-Normal Dot Plot mean = median = mode = 15

  33. Find the following measures of LOCATION. Central Tendency Measures: _ 1) mean (X) 2) median (M) 3) mode Noncentral Tendency Measures: (Quantiles or Fractiles) 1) First Quartile (Q1) 2) Third Quartile (Q3) 3) Sixth Decile (D6) 4) Eighty-third percentile (P83) Next, the noncentral tendency measures. Note: The process is essentially the same as that for finding the median.

  34. Concept of the Quantile (Fractile) M = Median Position 1 n Smallest M = median Largest

  35. Concept of the Quantile (Fractile) M = Median Position 1 n Smallest M = median Largest 1/2 of the values ≤ M 1/2 of the values ≥ M

  36. Concept of the Quantile (Fractile) M = Median Position 1 n Smallest M = median Largest 1/2 of the values ≤ M 1/2 of the values ≥ M Q1 = First Quartile Position 1 n Smallest Q1 = first quartile Largest

  37. Concept of the Quantile (Fractile) M = Median Position 1 n Smallest M = median Largest 1/2 of the values ≤ M 1/2 of the values ≥ M Q1 = First Quartile Position 1 n Smallest Q1 = first quartile Largest 1/4 of the values ≤ Q1 3/4 of the values ≥ Q1

  38. Concept of the Quantile (Fractile) M = Median Position 1 n Smallest M = median Largest 1/2 of the values ≤ M 1/2 of the values ≥ M Q3 = Third Quartile Position 1 n Smallest third quartile = Q3 Largest

  39. Concept of the Quantile (Fractile) M = Median Position 1 n Smallest M = median Largest 1/2 of the values ≤ M 1/2 of the values ≥ M Q3 = Third Quartile Position 1 n Smallest third quartile = Q3 Largest 3/4 of the values ≤ Q3 1/4 of the values ≥ Q3

  40. Concept of the Quantile (Fractile) M = Median Position 1 n Smallest M = median Largest 1/2 of the values ≤ M 1/2 of the values ≥ M D6 = Sixth Decile Position 1 n sixth decile = D6 Smallest Largest

  41. Concept of the Quantile (Fractile) M = Median Position 1 n Smallest M = median Largest 1/2 of the values ≤ M 1/2 of the values ≥ M D6 = Sixth Decile Position 1 n sixth decile = D6 Smallest Largest 6/10 of the values ≤ D6 4/10 of the values ≥ D6

  42. Concept of the Quantile (Fractile) M = Median Position 1 n Smallest M = median Largest 1/2 of the values ≤ M 1/2 of the values ≥ M P83 = 83rd Percentile Position 1 n 83rd percentile = P83 Smallest Largest

  43. Concept of the Quantile (Fractile) M = Median Position 1 n Smallest M = median Largest 1/2 of the values ≤ M 1/2 of the values ≥ M P83 = 83rd Percentile Position 1 n 83rd percentile = P83 Smallest Largest 83% of the values ≤ P83 17% of the values ≥ P83

  44. Determination of Quantiles (Fractiles) • General Process for determining a Quantile (Fractile)of a sample: • Form the ordered array. • Calculate the position of the quantile (fractile). • Find the sample value closer to that position • (if fraction is NOT .5) or average the two values • on each side of that position (if fraction is .5). Next, the process is shown for a LARGE sample.

  45. A Large Sample Example (presented as Raw Data) The following sample (n = 120) of data contains the number of years that a company was in business. Q: What does one need to know to find the 1st quartile of the sample? A: The ordered array and the sample size (n).

  46. A Large Sample Example (presented as an Ordered Array) The following sample (n = 120) of data contains the number of years that a company was in business.

  47. A Large Sample Example (presented as an Ordered Array) The following sample (n = 120) of data contains the number of years that a company was in business. Position 30.

  48. A Large Sample Example (presented as an Ordered Array) The following sample (n = 120) of data contains the number of years that a company was in business. Position 30. Position 31.

  49. A Large Sample Example (presented as an Ordered Array) The following sample (n = 120) of data contains the number of years that a company was in business. Position 30. Position 31. The position is closer to 30 than 31, so the value of Q1 is the value at position 30.

  50. A Large Sample Example (presented as Raw Data) The following sample (n = 120) of data contains the number of years that a company was in business. Q: What does one need to know to find the 3rd quartile of the sample? A: The ordered array and the sample size (n).

More Related