Warm-Up: Believe It or Not? A student claims to have flipped a fair coin 200 times and gotten heads only 84 times. Do you believe this student or not? Discuss with your neighbor why or why not.
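One way to put a number on the warm-up claim, going a little beyond what the discussion asks for: the sketch below assumes each flip is independent with a 0.5 chance of heads, and computes how far 84 heads falls from the expected count and how likely a result that low would be. The variable names are illustrative only.

```python
from math import comb, sqrt

n, p = 200, 0.5          # number of flips, chance of heads for a fair coin
observed = 84            # heads the student reports

expected = n * p                     # average number of heads we would expect
sd = sqrt(n * p * (1 - p))           # standard deviation of the head count
z = (observed - expected) / sd       # distance from expected, in standard deviations

# Exact chance of 84 or fewer heads: count the sequences with k heads
# for k = 0..84 and divide by the 2**n equally likely sequences.
p_at_most = sum(comb(n, k) for k in range(observed + 1)) / 2**n

print(f"expected heads: {expected:.0f}, SD: {sd:.2f}, z: {z:.2f}")
print(f"P(at most {observed} heads) = {p_at_most:.4f}")
```

The reported count sits about 2.3 standard deviations below the expected 100 heads, so the result is unusual but not impossible. Whether that is enough to doubt the student is the judgment call the warm-up is asking for.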
Chapters 2 - 4: The Role of Statistics & Graphical Methods for Describing Data
In order to learn statistics, we need to learn the language of statistics first. We’ll be learning a lot of new vocabulary today – through examples and activities.
Statistics: the science of collecting, analyzing, and drawing conclusions from data
Suppose we wanted to know something about the GPAs of high school graduates in the nation this year. We could collect data from all high schools in the nation.
What term would be used to describe “all high school graduates”?
Population: the entire group of individuals or objects we want information about. What do you call it when you collect data about the entire population? A census: an attempt to contact every individual in the entire population.
Why might we not want to use a census here? If we didn’t perform a census, what would we do?
Sample: a part of the population that we actually examine in order to gather information. What would a sample of all high school graduates across the nation look like? A list created by randomly selecting the GPAs of some high school graduates from each state.
Suppose we wanted to know something about the GPAs of high school graduates in the nation this year. We could collect data from a sample of high schools in the nation.
Once we have collected the data, what would we do with it?
Descriptive statistics: the methods of organizing & summarizing data. If the sample of high school GPAs contained 10,000 numbers, how could the data be described or summarized? • Create a graph • State the range of GPAs • Calculate the average GPA
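The summaries listed above can be sketched in a few lines. This example uses 10,000 simulated GPAs on a 0.0 to 4.0 scale as a stand-in for the sample; the numbers are made up for illustration and are not real data.

```python
import random
import statistics

# Hypothetical sample: 10,000 simulated GPAs on a 0.0-4.0 scale (not real data).
rng = random.Random(0)
gpas = [round(min(4.0, max(0.0, rng.gauss(3.0, 0.5))), 2) for _ in range(10_000)]

lowest, highest = min(gpas), max(gpas)
gpa_range = highest - lowest            # range = largest value minus smallest value
average = statistics.mean(gpas)         # the average GPA

print(f"n = {len(gpas)}")
print(f"range: {lowest:.2f} to {highest:.2f} (width {gpa_range:.2f})")
print(f"average GPA: {average:.2f}")
```

Three numbers now stand in for 10,000 values, which is the point of descriptive statistics.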
Could we use the data from this sample to answer our question?
Inferential statistics involves making generalizations from a sample to a population. Be sure to sample from the population of interest!
Based on the sample, if the average GPA for high school graduates was 3.0, what generalization could be made? The average national GPA for this year’s high school graduates is approximately 3.0. Could someone claim that the average GPA for FISD graduates is 3.0? No. Generalizations based on the results of a sample can only be made back to the population from which the sample came.
Variable: any characteristic whose value may change from one individual or object to another
Is this a variable . . . the number of wrecks per week at the intersection outside?
Data: observations on a single variable or simultaneously on two or more variables
For this variable . . . the number of wrecks per week at the intersection outside . . . what could the observations be?
Variability: the range of possible data values. The goal of statistics is to understand the nature of variability in a population.
Populations with no variability are rare and boring (of little statistical interest). Can you think of a population that has no variability?
Variability: The two histograms below display the distribution of heights of gymnasts and the distribution of heights of female basketball players. Which is which? Why? [Histograms: Heights – Figure A; Heights – Figure B]
Suppose you found a pair of size 6 shoes left outside the locker room. Which team would you go to first to find the owner of the shoes? Why? Suppose a tall woman (5 ft 11 in) you see is looking for her sister who is practicing in the gym. To which team would you send her? Why?
What aspects of the graphs helped you answer these questions?
Categorical variables • (qualitative) • Variables where the possible values are a set of categories
Numerical variables • (quantitative) • Variables where the values are numbers • It makes sense to average these values • Two types: discrete & continuous
Numerical: Discrete • Values are isolated points on a number line • usually counts of items
Numerical: Continuous • Set of possible values forms an entire interval on the number line • usually measurements of something
Classifying variables by the number of variables in a data set: Suppose that the PE coach records the height of each student in his class. Univariate - data that describes a single characteristic of the population. This is an example of univariate data.
Classifying variables by the number of variables in a data set: Suppose that the PE coach records the height and weight of each student in his class. Bivariate - data that describes two characteristics of the population. This is an example of bivariate data.
Classifying variables by the number of variables in a data set: Suppose that the PE coach records the height, weight, number of sit-ups, and number of push-ups for each student in his class. Multivariate - data that describes more than two characteristics (beyond the scope of this course). This is an example of multivariate data.
Identify the following variables: • the appraised value of homes in Faraway: continuous numerical • the color of cars in the teacher’s lot: categorical • the number of calculators owned by students at your school: discrete numerical • the zip code of an individual: categorical • the amount of time it takes students to drive to school: continuous numerical
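The same five variables can be laid out as data to show why the zip code is the tricky one. This is a rough sketch with made-up observations; the `classify` helper and its rule of thumb (text is categorical, whole numbers are discrete, decimals are continuous) are assumptions for illustration, not a general-purpose test.

```python
# Hypothetical observations for the five variables in the exercise above.
# Zip codes are stored as text on purpose: they look numeric, but averaging them is meaningless.
observations = {
    "appraised home value ($)": [182500.0, 240199.5, 315000.0],
    "car color": ["red", "white", "black"],
    "calculators owned": [0, 1, 2],
    "zip code": ["12345", "12346", "54321"],
    "drive time (minutes)": [7.5, 12.25, 20.0],
}

def classify(values):
    """Crude rule of thumb: text -> categorical, whole numbers -> discrete, decimals -> continuous."""
    if all(isinstance(v, str) for v in values):
        return "categorical"
    if all(isinstance(v, int) for v in values):
        return "discrete numerical"
    return "continuous numerical"

labels = {name: classify(values) for name, values in observations.items()}
for name, label in labels.items():
    print(f"{name}: {label}")
```

The rule only works because the zip codes were entered as text. Had they been entered as numbers, the helper would have called them discrete, which is why the question to ask is whether averaging the values makes sense.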
Warm-Up: Classifying variables Write an example of a variable on the index card provided (try to come up with something we have not discussed in class already). Please include your name. When done, fold your index card in half and place in the bowl in the back of the room. We will classify these before completing notes on display types.
Bar Graph • Used for categorical data • Bars do not touch • Categorical variable is typically on the horizontal axis • Best used to describe or comment on which occurred the most often or least often • May make a double bar graph or segmented bar graph for bivariate categorical data sets
Comparative Bar Charts • Use relative frequency • If the number of observations is the same for all groups (50 boys and 50 girls), you could use the frequency • Keep the vertical scale the same • Always label both axes • Compare!
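A short sketch of why relative frequency matters when group sizes differ. The survey answers below are hypothetical, and `relative_frequencies` is an illustrative helper name.

```python
from collections import Counter

# Hypothetical survey answers (dominant hand) for two groups of different sizes.
boys = ["right"] * 40 + ["left"] * 10           # 50 boys
girls = ["right"] * 27 + ["left"] * 3           # 30 girls

def relative_frequencies(responses):
    """Return each category's share of the group (count / group size)."""
    counts = Counter(responses)
    total = len(responses)
    return {category: count / total for category, count in counts.items()}

boys_rel = relative_frequencies(boys)
girls_rel = relative_frequencies(girls)
print("boys: ", boys_rel)
print("girls:", girls_rel)
```

In raw counts, 10 left-handed boys against 3 left-handed girls looks like more than a threefold gap. As shares of each group it is 20% against 10%, which is the fair comparison when the groups are not the same size.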
Pie Chart (circle graph) • Used for categorical data • To make: multiply each category’s proportion by 360° to get its angle, then use a protractor to mark off each part • Best used to describe or comment on which occurred the most often or least often
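The proportion-times-360° step can be worked through as follows. The category counts are hypothetical (an imagined class of 30).

```python
# Hypothetical category counts (e.g. eye color in a class of 30).
counts = {"brown": 15, "blue": 9, "green": 6}
total = sum(counts.values())

# Central angle for each slice = proportion x 360 degrees.
angles = {category: count / total * 360 for category, count in counts.items()}
for category, angle in angles.items():
    print(f"{category}: {counts[category] / total:.0%} of the data -> {angle:.0f} degrees")
```

Here the slices come to 180°, 108°, and 72°. Checking that the angles add up to 360° is a quick way to catch an arithmetic slip before picking up the protractor.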
Using class survey data, make bar graphs for: • birth month • gender & handedness
Dotplot • Used with numerical data (either discrete or continuous) • Made by putting dots (or X’s) on a number line • Can make comparative dotplots by using the same axis for multiple groups
Dotplot • To compare the weights of the males and females we put the dotplots on top of each other, using the same scales.
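As a rough illustration of stacking dots over a shared number line, the sketch below draws a text dotplot for whole-number data. The function name `text_dotplot` and the sibling counts are made up for the example; a real dotplot of weights would need a continuous scale.

```python
from collections import Counter

def text_dotplot(values, lo, hi):
    """Stack one '*' per observation above each whole number from lo to hi."""
    counts = Counter(values)
    height = max(counts.values(), default=0)
    rows = []
    for level in range(height, 0, -1):      # build from the top row down
        row = "".join(" * " if counts.get(v, 0) >= level else "   "
                      for v in range(lo, hi + 1))
        rows.append(row.rstrip())
    axis = "".join(f"{v:^3}" for v in range(lo, hi + 1))
    rows.append(axis.rstrip())              # the number line goes last
    return "\n".join(rows)

# Hypothetical number of siblings for two groups of students.
group_a = [0, 1, 1, 2, 2, 2, 3]
group_b = [1, 2, 3, 3, 4, 4, 5]

# Same lo and hi for both plots, so the two can be compared directly.
print(text_dotplot(group_a, 0, 5))
print()
print(text_dotplot(group_b, 0, 5))
```

Passing the same `lo` and `hi` to both calls is the text version of "using the same scales": the second group's dots visibly sit further right.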
Using class survey data, make dot plots of: • # AP classes • # siblings
1) Symmetrical • refers to data in which both sides are (more or less) the same when the graph is folded vertically down the middle • bell-shaped is a special type • has a center mound with two sloping tails
2) Uniform • refers to data in which every class has equal or approximately equal frequency
3) Skewed (left or right) • refers to data in which one side (tail) is longer than the other side • the direction of skewness is on the side of the longer tail
4) Bimodal (multi-modal) • refers to data in which two (or more) classes have the largest frequency & are separated by at least one other class
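The "longer tail" idea in item 3 can be sketched numerically by comparing how far the data stretch on each side of the middle. This is a crude heuristic under stated assumptions: it uses the median as the middle, the `ratio` cutoff of 1.5 is arbitrary, and a single extreme value can mislead it. The data sets are hypothetical.

```python
import statistics

def skew_direction(values, ratio=1.5):
    """Compare tail lengths on each side of the median; the longer tail names the skew."""
    center = statistics.median(values)
    left_tail = center - min(values)        # how far the data stretch to the left
    right_tail = max(values) - center       # how far the data stretch to the right
    if right_tail > ratio * left_tail:
        return "skewed right"
    if left_tail > ratio * right_tail:
        return "skewed left"
    return "roughly symmetrical"

# Hypothetical data sets.
long_right = [1, 1, 1, 2, 2, 2, 3, 3, 4, 9]
long_left = [10 - v for v in long_right]    # mirror image of the first set
balanced = [1, 2, 2, 3, 3, 3, 4, 4, 5]

for name, data in [("long_right", long_right), ("long_left", long_left), ("balanced", balanced)]:
    print(f"{name}: {skew_direction(data)}")
```

Looking at the graph remains the better check; the code only makes the rule "name the skew after the longer tail" concrete.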
Warm-Up: Example 1 (From Your Notes) • Looking at Example 1 (about sports-related injuries), complete the columns titled “Tally” and “Frequency”.