270 likes | 403 Views
Random Variable. Probability Distribution. Summary Characteristics. X= Amt in next bottle. N( μ =10.2, σ =0.16). Mean Median Mode Std dev Variance skew. X=#G in 4. B(n=4,p=.5). X=total of 2 tossed dice. We must admit that we cannot know exactly what value X will take….
E N D
Random Variable Probability Distribution Summary Characteristics X=Amt in next bottle N(μ=10.2,σ=0.16) Mean Median Mode Stddev Variance skew X=#G in 4 B(n=4,p=.5) X=total of 2 tossed dice We must admit that we cannot know exactly what value X will take…. 2 3 4 5 6 7 8 9 10 11 12 There are summary characteristics of any probability distribution… But knowing these summary measures do not replace our need to know the probability distribution. …so that we can do the intelligent thing and talk about something we CAN know, the probability distribution of X.
Class 06: Descriptive Statistics EMBS: 3.1, 3.2, first part of 3.3
Characteristics of probability distributions Descriptive Statistics (for numerical data) • Measures of Location • Mean • Median • Mode • Measures of Variability • Standard Deviation • Variance • Measure of Shape • skewness • Measures of Location • Sample Mean • Sample Median • Sample Mode • Measures of Variability • Sample StDev • Sample Variance • Measure of Shape • Sample skewness
A positively-skewed pdf Mode is the most likely value P(X<median) = P(X>median) = 0.5 Mean is the probability-weighted average Skewness > 0
http://dept.econ.yorku.ca/~jbsmith/ec2500_1998/lecture9/Lecture9.htmlhttp://dept.econ.yorku.ca/~jbsmith/ec2500_1998/lecture9/Lecture9.html
A negatively-skewed pdf Skewness < 0
An exhibit at MOMA invites visitors to mark their heights on a wall. A normal distribution results: Well, not quite. The distribution is actually slightly negatively skewed by the confounding presence of children, who are obviously shorter than adults - you can see this in the great number of names well below the central band which are not mirrored by names higher up. Rest assured, however, that the ex-children distribution is itself Gaussian. http://www.thisisthegreenroom.com/2009/bell-curves-in-action/
The Normal pdf Mean = μ median = μ mode = μ Skewness= 0 http://www.comfsm.fm/~dleeling/statistics/notes06.html
Measures of Variability σ = 0.7 σ = 1.0 σ = 1.5 http://www.google.com/imgres?q=standard+deviation+curve&hl=en&gbv=2&biw=1226&bih=866&tbm=isch&tbnid=pppxDi8aC37y8M:&imgrefurl=http://www.comfsm.fm/~dleeling/statistics/notes06.html&docid=Hu1RM-siu0MevM&imgurl=http://www.comfsm.fm/~dleeling/statistics/normal_curve_diff_sx.gif&w=401&h=322&ei=9qAqT8KXAcPptgfC3uX0Dw&zoom=1&iact=hc&vpx=748&vpy=508&dur=1013&hovh=201&hovw=251&tx=142&ty=111&sig=106136691078404837864&page=1&tbnh=149&tbnw=186&start=0&ndsp=20&ved=1t:429,r:13,s:0
Skewed pdfs can also have different standard deviations Which pdf has the largest σ?
Pdfs Can have different means, but identical standard deviations Which pdf has the largest σ? Which pdf has the largest μ?
Characteristics of probability distributions Descriptive Statistics (for numerical data) • Measures of Location • Mean • Median • Mode • Measures of Variability • Standard Deviation • Variance • Measure of Shape • skewness • Measures of Location • Sample Mean • Sample Median • Sample Mode • Measures of Variability • Sample StDev • Sample Variance • Measure of Shape • Sample skewness Probability weighted average 50% point Most likely Expected squared distance from mean Neg if skewed left, 0 if symmetric, pos if skewed right.
Characteristics of probability distributions Descriptive Statistics (for numerical data) • Measures of Location • Mean • Median • Mode • Measures of Variability • Standard Deviation • Variance • Measure of Shape • skewness • Measures of Location • Sample Mean • Sample Median • Sample Mode • Measures of Variability • Sample StDev • Sample Variance • Measure of Shape • Sample skewness Probability weighted average =average() 50% point =median() Most likely =mode() =stdev() Expected squared distance from mean =var() Neg if skewed left, 0 if symmetric, pos if skewed right. =skew()
Characteristics of probability distributions Descriptive Statistics (for numerical data) • Measures of Location • Mean • Median • Mode • Measures of Variability • Standard Deviation • Variance • Measure of Shape • skewness • Measures of Location • Sample Mean • Sample Median • Sample Mode • Measures of Variability • Sample StDev • Sample Variance • Measure of Shape • Sample skewness Probability weighted average GET THEM ALL USING DATA ANALYSIS, DESCRIPTIVE STATISTICS, SUMMARY STATISTCS 50% point Most likely Expected squared distance from mean Neg if skewed left, 0 if symmetric, pos if skewed right.
Characteristics of probability distributions Descriptive Statistics (for numerical data) • Measures of Location • Mean • Median • Mode • Measures of Variability • Standard Deviation • Variance • Measure of Shape • skewness • Measures of Location • Sample Mean • Sample Median • Sample Mode • Measures of Variability • Sample StDev • Sample Variance • Measure of Shape • Sample skewness Probability weighted average 50% point Most likely RANGE Expected squared distance from mean Neg if skewed left, 0 if symmetric, pos if skewed right. COUNT
Understanding sample standard deviation All the data contribute to the measure. It measures variability …. In either direction. It measures variability about the mean.
Fill Test DataHistogram Data Data Analysis Histogram Check chart output 1 data point was < 9.758 2 data points were between 9.758 and 9.841 1 was above 10.673
Preview of Coming Attractions • Class 07 • Find out how to use these counts to test H0: these data came from N(10.2,.16) • Find out how to use the Denmark family counts to test H0: those data came from Binomial(4,.5)