270 likes | 434 Views
Descriptive Statistics. Measures of Center. File Information: 24 Slides To Print : You may need to save this to your p: drive or jump drive before printing. Set PRINT WHAT to Handouts.
E N D
Descriptive Statistics Measures of Center • File Information: 24 Slides • To Print : • You may need to save this to your p: drive or jump drive before printing. Set PRINT WHAT to Handouts. • Under HANDOUTS select the number of slides per page. A sample of the layout on a page appears to the right. • To change the orientation of the printing, select the PREVIEW button (lower left) and then the Orientation option on the Print Preview menu.
Essentials: Measures of Center(The great mean vs. median conundrum.) • Be able to identify the characteristics of the median, mean and mode, and to which types of data each applies. • Be able to calculate the median, mean and mode, as appropriate, for a set of data. • Affected by vs. resistant to extreme values. What are the implications for the mean and median?.
Essentials:Sigma - S(Yeah, I got this – so everyone thinks, but it isn’t as easy as it looks.) Understand what Sigma (S) means and how it is used. Be able to interpret what S is telling you to do in a given formula. When you think you’ve got it, practice some more.
Some Notation • denotes the addition of a set of values • x is the variable usually used to represent the individual data values • n represents the number of data values in a sample • N represents the number of data values in a population
Measures of Center • Measures of Central Tendency • Indicate where the center or most typical value of a data set lies • Are often thought of as averages • Include the Mean, Median, Mode, and Midrange
The Mean (Arithmetic) • The “average” of a set of data. • Is the sum of the observations divided by the number of observations. • Is used only with quantitative data.
The Formula: MEAN = n
Population Mean vs. Sample Mean • A Population Mean is represented by the lower case Greek letter m (mu) • A Sample Mean is represented by the lower case Arabic letter x with a bar above it (called x-bar)
Weighted Means • Weighted Mean – a mean computed with different scores assigned different weights. To find the weighted mean
Weighted Example: Finding a Weighted Mean You are taking a class in which your grade is determined from five sources: 50% from your test mean, 15% from your midterm, 20% from your final exam, 10% from your computer lab work, and 5% from your homework. Your scores are 86 (test mean), 96 (midterm), 82 (final exam), 98 (computer lab), and 100 (homework). What is the weighted mean of your scores? If the minimum average for an A is 90, did you get an A? Source: Larson/Farber 4th ed.
Solution: Finding a Weighted Mean Your weighted mean for the course is 88.6. You did not get an A. Source: Larson/Farber 4th ed.
Weighted Means Example • Calculating a GPA. • Given the following four grades, calculate the semester GPA. • Statistics A (of course; 3 CrHrs; numeric value for an A = 4) • History B (3 CrHr; B = 3) • Physics C (3 CrHr; C = 2) • Physical Education C (1 CrHr) • The grade numeric equivalents are the x values. The credit hour values are the weights. • Calculate the student’s GPA.
Finding a Mean From a Frequency Table (Grouped Data) When we view data in a frequency table, it is impossible to know the exact values falling in a particular class. To find this value, obtain the product of each frequency and class midpoint (here “x”), add the products, and then divide by the sum of the frequencies.
Finding the Mean of a Frequency Distribution In Words In Symbols • Find the midpoint of each class. • Find the sum of the products of the midpoints and the frequencies. • Find the sum of the frequencies. • Find the mean of the frequency distribution. Source: Larson/Farber 4th ed.
Example: Find the Mean of a Frequency Distribution Use the frequency distribution to approximate the mean number of minutes that a sample of Internet subscribers spent online during their most recent session. Source: Larson/Farber 4th ed.
Median • The middle observation in a set of data. • Divides the data such that 50% of the observations lie below the median and 50% lie above it. • Is used only with quantitative data. • To obtain the median, the data must be placed in increasing order.
The Formula: MEDIAN First: Arrange the scores in increasing order. Second: Apply the formula (n+1)/2. (Where n is the number of data values. • If there is an ODD number of scores, the middle score is the value of the Median. • e.g: 1, 3, 6 => Median is (n+1)/2 = (3+1)/2 = 2. So, the Median is value in the second second position of the list of values. Here the second value is the number 3. • If there is an EVEN number of scores, the Median lies between the two middle scores. • e.g: 1, 2, 8, 15 => Median is (n+1)/2 = (4+1)/2 = 2.5. So, the Median is the data value that lies 1/2 way between the second and third data values. Here that value would be 5. • Remember, the formula computes a position, not a data value.
Calculating a Median: • Determine the median for the following backpack weights: • Backpack weights (lb): 10, 14, 12, 18, 32, 15, 22, 19, 23, 61.
Mode • The most frequently occurring score in a data set. • Is used with both qualitative and quantitative data. • There may be more than one Mode • If there are two modes, the data set is bimodal. • If there are more than two modes, the data set is multimodal.
The Formula: MODE • Obtain the frequency of each value. • A Frequency Table based upon Single-Value Grouping or a Dot Plot would display this information. • If no value has more than one occurrence,there is no Mode. • Otherwise, the most frequently occurring value is the Mode.
Example: Comparing the Mean, Median, and Mode Find the mean, median, and mode of the sample ages of a class shown. Which measure of central tendency best describes a typical entry of this data set? Are there any outliers? Source: Larson/Farber 4th ed.
Solution: Comparing the Mean, Median, and Mode Mean: Median: Mode: Source: Larson/Farber 4th ed.
Solution: Comparing the Mean, Median, and Mode Mean ≈ 23.8 years Median = 21.5 years Mode = 20 years • The mean takes every entry into account, but is influenced by the outlier of 65. • The median here was determined by taking the middle two entries into account, and it is not affected by the outlier. • In this case the mode exists, but it doesn't appear to represent a typical entry. Source: Larson/Farber 4th ed.
Solution: Comparing the Mean, Median, and Mode Sometimes a graphical comparison can help you decide which measure of central tendency best represents a data set. In this case, it appears that the median best describes the data set. Source: Larson/Farber 4th ed.
Mean vs. Median vs. Mode • Which is the best Measure of Center???? • MEAN: • Is sensitive to the influence of extreme scores (outliers), which will “pull” the mean away from the center. • Involves ALL data values in the calculation • MODE: • May not be anywhere near the center of the data. • Not really aimed at finding the middle of the data. • Is the ONLY “Measure of Center” for Qualitative Data. • MEDIAN: • Is resistant to the influence of extreme values. • Only uses One or Two points in its calculation.
Midrange • Another measure of center is the Midrange - the value midway between the highest and lowest values in a data set. To find the midrange. Highest Value + Lowest Value 2