1 / 21

Probability Distributions

Probability Distributions. How many kittens lie in any selected part of a pile of kittens?. Probability Distributions. Sometimes we want to know the chances that something will occur? For example: What are the odds that I will win the lottery? What are my chances of getting an A?

Download Presentation

Probability Distributions

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Probability Distributions How many kittens lie in any selected part of a pile of kittens?

  2. Probability Distributions Sometimes we want to know the chances that something will occur? For example: • What are the odds that I will win the lottery? • What are my chances of getting an A? • If a person is young, what are the chances that he or she will be in poverty? • What chances do poor people have of graduating from college? To answer questions such as these, we turn to probability.

  3. Probability Distributions Probability: Out of all possible outcomes, the proportionate expectation of a given outcome. Values for statistical probability range from 0 (never) to 1 (always) or from 0% chance to 100% chance. For example: 12 of 25 students in an engineering class are women. The probability that a randomly selected student in that engineering class will be a woman is 12/25 = .48 or 48%. 13 12 F M

  4. Probability Distributions What is the probability that a student will get a C in Research Methods? What about a C or Higher? 10 5 0 3 5 12 7 5 F D C B A

  5. Probability Distributions What is the probability that a student will get a C in Research Methods? 12/32 = .375 What about a C or Higher? 24/32 = .75 What is the probability that a person in the class got a grade? 32/32 = 1 10 5 0 3 5 12 7 5 F D C B A

  6. Probability Distributions Empirical probability distribution: All the outcomes in a distribution of research results and each of their probabilities The probability distribution of a variable lists the possible outcomes together with their probabilities

  7. Probability Distributions What is the probability that a student will get a C in Research Methods? 12/32 = .375 What about a C or Higher? 24/32 = .75 .375 .219 0 F D C B A P=1 100% of cases P=.25 or 25% F D C B A

  8. Probability Distributions Theoretical probability distribution: The number of times we would expect to get a particular outcome in a large number of trials. Q: Why are these important? A: Sociologists usually get only one chance to draw a sample from a population. Therefore, if know how repeated sampling would behave theoretically, we can judge the chance that our sample’s estimate of the mean is near the true population mean.

  9. Probability Distributions Theoretical probability distribution: The number of times we would expect to get a particular outcome in a large number of trials. For Example: Randomly take 100 USA students’ GPAs. Record it. Now, take 100 more USA students’ GPAs. Record that. Now, repeat the above. Record again. Now, lather, rinse, repeat. Again. Again. And on and on. What might you see?

  10. Probability Distributions Theoretical probability distribution: The number of times we would expect to get a particular outcome in a large number of trials. 50% of samples would have a mean GPA greater than 2.5 1.3 1.5 1.7 1.9 2.1 2.3 2.52.7 2.9 3.1 3.3 3.5 3.7 3.9 = a sample’s mean 2.5 = Overall Mean

  11. Probability Distributions The Normal Probability Distribution A continuous probability distribution in which the horizontal axis represents all possible values of a variable and the vertical axis represents the probability of those values occurring. Values are clustered around the mean in a symmetrical, unimodal pattern known as the bell-shaped curve or normal curve.

  12. Probability Distributions The Normal Probability Distribution No matter what the actual s.d. () value is, the proportion of cases under the curve that corresponds with the mean ()+/- 1s.d. is the same (68%). The same is true of mean+/- 2s.d. (95%) And mean +/- 3s.d. (almost all cases) Because of the equivalence of all Normal Distributions, these are often described in terms of the Standard Normal Curve where mean = 0 and s.d. = 1 and is called “z”

  13. Probability Distributions The Normal Probability Distribution No matter what the actual s.d. () value is, the proportion of cases under the curve that corresponds with the mean ()+/- 1s.d. is the same (68%). The same is true of mean+/- 2s.d. (95%) And mean +/- 3s.d. (almost all cases) Because of the equivalence of all Normal Distributions, these are often described in terms of the Standard Normal Curve where mean = 0 and s.d. = 1 and is called “z” Z = # of standard deviations away from the mean 68% 68% Z = -3 -2 -1 0 1 2 3 Z=-3 -2 -1 0 1 2 3

  14. Probability Distributions Converting to z-scores To compare different normal curves, it is helpful to know how to convert data values into z-scores. It is like have two rulers beneath each normal curve. One for data values, the second for z-scores. IQ  = 100  = 15 Values 55 70 85 100 115 130 145 Z-scores -3 -2 -1 0 1 2 3

  15. Probability Distributions Converting to z-scores Z = Y –   Z = 100 – 100 / 15 = 0 Z = 145 – 100 / 15 = 45/15 = 3 Z = 70 – 100 / 15 = -30/15 = -2 Z = 105 – 100 / 15 = 5/15 = .33 IQ  = 100  = 15 Values 55 70 85 100 115 130 145 Z-scores -3 -2 -1 0 1 2 3

  16. Probability Distributions Engagement Ring Example: Mean cost of an engagement ring is $500, and the standard deviation is $100. Z = Y –   Z = 500 – 500 / 100 = 0 Z = 600 – 500 / 100 = 100/100 = 1 Z = 200 – 500 / 100 = -300/100 = -3 Z = 550 – 500 / 100 = 50/100 = .5 IQ  = 100  = 15 Values 200 300 400 500 600 700 800 Z-scores -3 -2 -1 0 1 2 3

  17. Probability Distributions Engagement Ring Example: Mean cost of an engagement ring is $500, and the standard deviation is $100. Now, use the empirical rule… What percentage of people will be above or below my preferred ring price of $300? IQ  = 100  = 15 2.5% 2.5% 68% Values 200 300 400 500 600 700 800 Z-scores -3 -2 -1 0 1 2 3

  18. Probability Distributions Comparing two distributions by Z-score Imagine that your partner didn’t get you a ring, but took you on a trip to express their love for you. You could convert the trip’s price into a ring price using z-scores. Your trip cost $2,000. The average “love trip” costs $1,500 with a s.d. of $250. What is the equivalent ring price? Trips Rings 200 300 400 500 600 700 800 750 1000 1250 1500 1750 2000 2250 -3 -2 -1 0 1 2 3 -3 -2 -1 0 1 2 3

  19. Probability Distributions Comparing two distributions by Z-score Your trip cost $2,000. The average “love trip” costs $1,500 with a s.d. of $250. What is the equivalent ring price? What percentage of persons got a trip that cost less than yours? Trips Rings 200 300 400 500 600 700 800 750 1000 1250 1500 1750 2000 2250 -3 -2 -1 0 1 2 3 -3 -2 -1 0 1 2 3

  20. Probability Distributions Comparing two distributions by Z-score What about ACT versus SAT scores? NOTE: This is a helpful process, but can be illogical at times. Remember that you are comparing scores on a “population base” or percent of people above or below each score. Is it logical to compare SAT score to self-esteem this way? No. SAT ACT 15 18 21 24 27 30 33 400 600 800 1000 1200 1400 1600 -3 -2 -1 0 1 2 3 -3 -2 -1 0 1 2 3

  21. Probability Distributions How to use a z-score table. (I could use some z z z z’s). A&F Table A has reports from the literal measurements of area under normal curves. The table gives you the percent of values above, below, or between particular z-scores (# of s.d.s away from the mean). Left column = first two digits of z Top row, last digit of z Where these meet in crosshairs is the proportion of cases above that z-score. Can work in reverse to find z-scores too. Other tables will use different layouts, online you can get automatic answers without using a table.

More Related