Design of Statistical Investigations

Design of Statistical Investigations. Random Sampling 2. Stephen Senn. Stratified Random Sampling. A stratified random sample is one obtained by separating the population elements into nonoverlapping groups, called strata, and then selecting a simple random sample from each stratum.

Presentation Transcript


  1. Design of Statistical Investigations: Random Sampling 2. Stephen Senn. SJS SDI_17

  2. Stratified Random Sampling
     "A stratified random sample is one obtained by separating the population elements into nonoverlapping groups, called strata, and then selecting a simple random sample from each stratum."
     (Scheaffer, Mendenhall and Ott, Elementary Survey Sampling, Fourth Edition)

  3. Why?
     • Stratification can be efficient as regards estimation
     • Lower variances
     • Consequently it may be cost-effective
     • It may be desired to make statements about subgroups

  4. General Model
     L = number of strata
     N_i = number of sampling units in stratum i
     N = number of sampling units in the population = N_1 + N_2 + … + N_L
     n_i = number of units in the sample from stratum i, etc.
     Basic idea of estimation: for any stratum, we can estimate the stratum total by multiplying the sample mean by the number of units in the population in that stratum. We then estimate the population total by summing over all strata, and so forth.
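
The basic idea of estimation described on this slide can be sketched in a few lines of Python. This is a minimal illustration, not code from the course: the function name `stratified_total_and_mean` and the example data are assumptions made up for the sketch.

```python
from statistics import mean


def stratified_total_and_mean(strata):
    """Estimate the population total and mean from a stratified sample.

    `strata` is a list of (N_i, sample_values) pairs, one per stratum,
    where N_i is the number of sampling units in stratum i.
    """
    N = sum(N_i for N_i, _ in strata)
    # Stratum total estimate: N_i times the stratum sample mean.
    # Population total estimate: sum of the stratum totals.
    total_hat = sum(N_i * mean(values) for N_i, values in strata)
    # Population mean estimate: estimated total divided by N.
    return total_hat, total_hat / N


# Made-up data for illustration only (not taken from the slides).
example = [(100, [2.0, 4.0]), (300, [10.0, 12.0, 14.0])]
total_hat, mean_hat = stratified_total_and_mean(example)
print(total_hat, mean_hat)
```

Each stratum contributes in proportion to its population size N_i, not its sample size n_i, so a heavily sampled small stratum does not dominate the estimate.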

  5. Estimation
     NB: ignoring the finite population correction factor (FPCF)
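
The slide's own formulas did not survive in this transcript. As a hedged reminder, the standard textbook estimators that match the description on slide 4, with the finite population correction ignored, are written out below. The symbols \bar{y}_i (sample mean in stratum i) and s_i^2 (sample variance in stratum i) are notation assumed here, not taken from the slides.

```latex
\begin{align*}
\hat{\tau} &= \sum_{i=1}^{L} N_i \bar{y}_i
  && \text{(estimated population total)} \\
\bar{y}_{st} &= \frac{\hat{\tau}}{N} = \sum_{i=1}^{L} \frac{N_i}{N}\,\bar{y}_i
  && \text{(estimated population mean)} \\
\widehat{\operatorname{Var}}(\bar{y}_{st}) &\approx \sum_{i=1}^{L} \left(\frac{N_i}{N}\right)^{2} \frac{s_i^{2}}{n_i}
  && \text{(ignoring the FPCF)}
\end{align*}
```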

  6. Example Surv_3
     • Advertising firm surveying three areas for mean weekly hours of television viewing
     • Town A, 1550 households
     • Town B, 620 households
     • Rural area, 930 households
     • Samples are taken at random within these three strata.
     • Results on next slide

  7. [Results table for Surv_3; not captured in this transcript]

  8. Sample Size, Case 1: Equal Allocation

  9. Suppose that in planning Surv_3 we had suspected the following

  11. Sample Size, Case 2: Equal Proportions
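
The two simplest allocation rules, equal allocation (Case 1) and allocation in equal proportions (Case 2), can be compared with a short sketch. The stratum sizes are those of the Surv_3 example; the overall sample size of 40 and the function names are assumptions chosen for illustration, not values from the slides.

```python
def equal_allocation(n, stratum_sizes):
    """Case 1: the same sample size in every stratum."""
    L = len(stratum_sizes)
    return [n / L] * L


def proportional_allocation(n, stratum_sizes):
    """Case 2: sample size proportional to stratum size, n_i = n * N_i / N."""
    N = sum(stratum_sizes)
    return [n * N_i / N for N_i in stratum_sizes]


# Stratum sizes from the Surv_3 example: Town A, Town B, rural area.
surv3_sizes = [1550, 620, 930]
n_total = 40  # hypothetical overall sample size, assumed for this sketch

print(equal_allocation(n_total, surv3_sizes))
print(proportional_allocation(n_total, surv3_sizes))
```

In practice the allocations are rounded to whole households, and the rounded values may need a small adjustment so that they still sum to the intended total.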

  13. Sample Size, Case 3: Optimal Allocation
      An approximate allocation that minimises cost for a given variance, or minimises variance for a given cost (c_i is the cost per observation sampled in stratum i). Proving this is set as an exercise in the coursework.
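
The slide's formula is missing from this transcript. The sketch below uses the standard textbook form of optimal allocation, n_i proportional to N_i σ_i / √c_i, which may differ in notation from the slide. The function name, the stratum standard deviations and the unit costs are all hypothetical values assumed for illustration.

```python
from math import sqrt


def optimal_allocation(n, stratum_sizes, stratum_sds, unit_costs):
    """Allocate n observations with n_i proportional to N_i * sd_i / sqrt(c_i).

    Standard textbook form, assumed here; the slide's own formula is not
    available in the transcript.
    """
    weights = [
        N_i * sd_i / sqrt(c_i)
        for N_i, sd_i, c_i in zip(stratum_sizes, stratum_sds, unit_costs)
    ]
    total = sum(weights)
    return [n * w / total for w in weights]


# Stratum sizes from Surv_3; standard deviations and costs are hypothetical.
sizes = [1550, 620, 930]
sds = [5.0, 15.0, 10.0]   # assumed planning values, not from the slides
costs = [9.0, 9.0, 16.0]  # assumed cost per observation in each stratum

print(optimal_allocation(40, sizes, sds, costs))
```

When the standard deviations and the costs are the same in every stratum, this rule reduces to allocation in equal proportions.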

  15. (Again, this ignores the FPCF.)

  17. Cluster Sampling
      "A cluster sample is a probability sample in which each sampling unit is a collection, or cluster, of elements." (Scheaffer, Mendenhall and Ott)
      Example: we wish to obtain an impression of reading skills amongst year 8 children in the UK. We select a simple random sample of schools and test the reading skills of every year 8 child in the schools chosen.

  18. Cluster Sampling: Why and Why Not?
      • Why: Less costly than simple or stratified sampling per sampled unit
      • It may be costly to establish a sampling frame of individuals
      • It may be cheaper to sample units close together
      • Why not: For a given number of sampled units, the variance will be higher

  19. A Model for Cluster Sampling
      N = number of clusters in the population
      n = number of clusters selected in a simple random sample of clusters
      m_i = number of elements in cluster i, i = 1, …, N
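
With this notation, one common way to estimate a mean per element from a cluster sample is the ratio of the sampled cluster totals to the sampled cluster sizes. The sketch below shows that estimator together with the simple expansion estimate of the population total. It is a textbook approach assumed here, not necessarily the estimator used later in the slides, and the function name and data are hypothetical.

```python
def cluster_estimates(N_clusters, sampled):
    """Estimate the mean per element and the population total.

    `sampled` is a list of (m_i, y_i) pairs for the n sampled clusters:
    m_i is the number of elements in cluster i and y_i is the total of
    the measurements over all elements in that cluster.
    """
    n = len(sampled)
    sum_m = sum(m_i for m_i, _ in sampled)
    sum_y = sum(y_i for _, y_i in sampled)
    # Ratio estimator of the mean per element.
    mean_per_element = sum_y / sum_m
    # Expansion estimator of the population total: N times the mean cluster total.
    total_hat = N_clusters * sum_y / n
    return mean_per_element, total_hat


# Hypothetical data: 3 schools sampled out of 50, with (pupils, summed scores).
sample = [(30, 1500.0), (20, 1100.0), (50, 2400.0)]
print(cluster_estimates(50, sample))
```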

  20. Minimum Variance Estimation: General Theory
      Suppose that we have a series of unbiased estimators of a given parameter with known but different variances. What is the linear combination of the estimators with the minimum variance?
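
One standard route to the answer uses a Lagrange multiplier. The derivation below is a sketch under two assumptions that the slide text does not state explicitly: the estimators are independent, and the weights are constrained to sum to one so that the combination stays unbiased. The symbols w_i, σ_i² and λ are notation chosen here.

```latex
\begin{align*}
\hat{\theta} &= \sum_{i=1}^{k} w_i \hat{\theta}_i, \qquad \sum_{i=1}^{k} w_i = 1, \qquad
\operatorname{Var}(\hat{\theta}) = \sum_{i=1}^{k} w_i^{2}\sigma_i^{2} \\
\mathcal{L} &= \sum_{i=1}^{k} w_i^{2}\sigma_i^{2} - \lambda\left(\sum_{i=1}^{k} w_i - 1\right) \\
\frac{\partial \mathcal{L}}{\partial w_i} &= 2 w_i \sigma_i^{2} - \lambda = 0
  \;\Rightarrow\; w_i = \frac{\lambda}{2\sigma_i^{2}} \\
\frac{\partial \mathcal{L}}{\partial \lambda} &= -\left(\sum_{i=1}^{k} w_i - 1\right) = 0
  \;\Rightarrow\; \frac{\lambda}{2} = \frac{1}{\sum_{j=1}^{k} 1/\sigma_j^{2}} \\
w_i &= \frac{1/\sigma_i^{2}}{\sum_{j=1}^{k} 1/\sigma_j^{2}}, \qquad
\operatorname{Var}(\hat{\theta}) = \frac{1}{\sum_{j=1}^{k} 1/\sigma_j^{2}}
\end{align*}
```

Each estimator is therefore weighted in inverse proportion to its variance.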

  21. Setting [expression not captured] = 0 yields [equation not captured]. Setting [expression not captured] = 0 yields [equation not captured].

  23. Now suppose that the true cluster means have a variance, but that the variance within clusters is constant. [Formula not captured; its two terms were labelled "between-cluster variance" and "within-cluster variance".]
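
Under this kind of model, the observed mean of a cluster with m_i elements has variance equal to the between-cluster variance plus the within-cluster variance divided by m_i. Combining that with inverse-variance weighting gives the sketch below. The function name and the numerical values are assumptions for illustration, and the slide's own formula may be arranged differently.

```python
def cluster_mean_weights(cluster_sizes, var_between, var_within):
    """Inverse-variance weights for combining observed cluster means.

    Assumes Var(observed mean of cluster i) = var_between + var_within / m_i.
    """
    variances = [var_between + var_within / m_i for m_i in cluster_sizes]
    inverse = [1.0 / v for v in variances]
    total = sum(inverse)
    return [w / total for w in inverse]


# Hypothetical cluster sizes and variance components.
sizes = [10, 30, 60]
print(cluster_mean_weights(sizes, var_between=4.0, var_within=25.0))
```

Two limiting cases show how the weights behave: with no between-cluster variance the weights are proportional to cluster size, and with no within-cluster variance every cluster mean gets the same weight.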

  24. Questions
      • In the design and analysis of experiments, variance estimates are often based on pooled variances. In sampling theory they generally are not. Why the difference in practice?
      • For a given total number of observations, how do simple, stratified and cluster sampling compare in terms of variance?
