RESEARCH COURSE: Statistical Data Analysis and Scientific Research, Dr. Francois O. Tuamokumo

Presentation Transcript


  1. National Naval Medical Center, Directorate for Professional Education, Clinical Investigation Department. RESEARCH COURSE: Statistical Data Analysis and Scientific Research. Dr. Francois O. Tuamokumo, Mathematical Statistician, Department of Research Programs, WRNMMC, Bethesda, MD. “The National Naval Medical Center is an approved provider of continuing nursing education by the Navy Medicine Manpower, Personnel, Training and Education Command, an accredited approver by the American Nurses Credentialing Center’s Commission on Accreditation.”

  2. Disclosure Statement: This CE/CME activity does not have commercial support and has no conflicts of interest. Research Course

  3. DATA MANAGEMENT: INTRODUCTION. By the design of a study we mean planning the study in such a way that appropriate data can be collected and analyzed.

  4. DATA are measurements collected on some characteristics called variables. • VARIABLES are the characteristics on which measurements are made.

  5. TYPES OF DATA • Qualitative data • Quantitative data • Qualitative data are categories. Examples: gender (male, female); stage of cancer (I, II, III, IV). • Quantitative data are numbers. Examples: age, height, weight, number of first-trimester visits.

  6. Conclusion: Methods of analysis depend on the data. • Data Management & Quality Assurance software: Access, Excel, Minitab, BMDP, SPSS, STATA, SAS

  7. Identifying and resolving outliers • Identifying and resolving missing data • Identifying duplicate records • Data Dictionary (slides 7 to 10):

     Variable | Abbreviation
     Identification code | ID
     Low birth weight (0 = birth weight ≥ 2500 g, 1 = birth weight < 2500 g) | LoBrtWt
     Age of mother in years | Age
     Weight in pounds at last menstrual period | wtLMst
     Race (1 = White, 2 = Black, 3 = Hispanic, 4 = other) | Race
     Smoking status during pregnancy (1 = yes, 0 = no) | Smoke
     History of premature labor (0 = none, 1 = one, 2 = two, etc.) | Prmtrlbo
     History of hypertension (1 = yes, 0 = no) | Hptnsion
     Presence of uterine irritability (1 = yes, 0 = no) | Utrnirrt
     Number of physician visits during first trimester (0 = none, 1 = one, 2 = two, etc.) | visits
     Birth weight in grams | brtwt
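
The quality checks listed on slide 7 (outliers, missing data, duplicate records) can be sketched against the data dictionary. This is only a minimal illustration using the Python standard library. The sample records, the `ALLOWED` code table, the `check_records` helper, and the plausible age range of 12 to 55 years are all hypothetical assumptions, not part of the course material.

```python
# Allowed codes for a few coded variables from the data dictionary.
ALLOWED = {
    "LoBrtWt": {0, 1},
    "Race": {1, 2, 3, 4},
    "Smoke": {0, 1},
    "Hptnsion": {0, 1},
    "Utrnirrt": {0, 1},
}

def check_records(records, age_range=(12, 55)):
    """Return a list of (ID, problem) pairs found in the records."""
    problems = []
    seen_ids = set()
    for rec in records:
        rec_id = rec.get("ID")
        # Duplicate records: the same ID appears more than once.
        if rec_id in seen_ids:
            problems.append((rec_id, "duplicate ID"))
        seen_ids.add(rec_id)
        # Missing data: a checked variable is absent or None.
        for name in list(ALLOWED) + ["Age"]:
            if rec.get(name) is None:
                problems.append((rec_id, f"missing {name}"))
        # Invalid codes: a value outside the dictionary's code list.
        for name, codes in ALLOWED.items():
            value = rec.get(name)
            if value is not None and value not in codes:
                problems.append((rec_id, f"invalid code for {name}"))
        # Possible outlier: age outside the assumed plausible range.
        age = rec.get("Age")
        if age is not None and not (age_range[0] <= age <= age_range[1]):
            problems.append((rec_id, "Age out of range"))
    return problems

# Hypothetical records, for illustration only.
records = [
    {"ID": 1, "LoBrtWt": 0, "Age": 24, "Race": 1, "Smoke": 0, "Hptnsion": 0, "Utrnirrt": 0},
    {"ID": 2, "LoBrtWt": 1, "Age": 91, "Race": 5, "Smoke": 1, "Hptnsion": 0, "Utrnirrt": None},
    {"ID": 2, "LoBrtWt": 1, "Age": 30, "Race": 2, "Smoke": 1, "Hptnsion": 0, "Utrnirrt": 1},
]
problems = check_records(records)
for rec_id, problem in problems:
    print(rec_id, problem)
```

Flagged records still need a decision by the investigator (correct, verify against the source document, or exclude); the sketch only locates them.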

  11. Some Considerations in Research • What are the variables of interest on which data will be collected? • What are the testable research questions of interest? • Are these questions clearly, concisely, and completely stated?

  12. Objectives and Methods of Analysis • Objective: (1) the effect of dose (two levels: 250, 500 mg) on pain status (relief/no relief); (2) controlling for previous exposure (yes/no) and gender (male/female); (3) also controlling for age and BMI • Analysis methods (vary with the objective) • Pearson’s chi-square (1) • Mantel-Haenszel chi-square (1, 2) • Logistic regression (1, 2, 3)

  13. Objective: The effect of dose (250, 500, 750 mg) on pain status (quantified). • Analysis method • independent variable - categorical • dependent variable - continuous • DO ANOVA • Another example: the effect of the type of medication (A, B, C, D) on cholesterol level
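
For objective (1), Pearson's chi-square compares observed and expected counts in the 2x2 table of dose by pain status. The sketch below is one way to compute it with the Python standard library; the function name `pearson_chi_square_2x2` and the counts are hypothetical, and no continuity correction is applied.

```python
from statistics import NormalDist

def pearson_chi_square_2x2(a, b, c, d):
    """Pearson chi-square statistic and p-value for the 2x2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    observed = ((a, b), (c, d))
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed[i][j] - expected) ** 2 / expected
    # With 1 degree of freedom, chi2 is the square of a standard normal,
    # so the p-value is the two-sided normal tail beyond sqrt(chi2).
    p_value = 2 * (1 - NormalDist().cdf(chi2 ** 0.5))
    return chi2, p_value

# Hypothetical counts: rows are dose (250 mg, 500 mg),
# columns are pain status (relief, no relief).
chi2, p_value = pearson_chi_square_2x2(20, 30, 35, 15)
print(round(chi2, 3), round(p_value, 4))
```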

  14. Objective: The effect of the number of miles run per day on weight loss. • Analysis method • independent variable - numerical • dependent variable - continuous • DO REGRESSION ANALYSIS
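
The pairing of variable types with analysis methods on slides 12 to 14 can be summarized as a small lookup. This is only a restatement of those three slides; the function name `suggest_analysis` and the type labels are hypothetical, and real studies often need methods beyond this table.

```python
def suggest_analysis(independent, dependent):
    """Map variable types to the analysis named on slides 12 to 14."""
    table = {
        # Slide 12: dose (two levels) vs. pain relief (yes/no).
        ("categorical", "binary"): "Pearson's chi-square",
        # Slide 13: dose or medication type vs. a continuous outcome.
        ("categorical", "continuous"): "ANOVA",
        # Slide 14: miles run per day vs. weight loss.
        ("numerical", "continuous"): "regression analysis",
    }
    return table.get((independent, dependent), "not covered by these slides")

print(suggest_analysis("categorical", "continuous"))
```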

  15. What is the purpose of the study? a. Descriptive, b. Hypothesis testing, or c. Modeling • Descriptive study: To estimate a population parameter. • Ex: mean arterial blood pressure; proportion (percent) with improved respiratory outcome.

  16. Provide a 95% confidence interval for the estimates. • Confidence Interval: An interval within which the true value is expected to lie, with the stated level of confidence. Confidence Interval for: 1. population mean 2. population proportion
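
The formulas for these two intervals did not survive in the transcript. As a hedged sketch, the usual large-sample (normal approximation) intervals are estimate ± z × standard error; the function names and the sample values below are hypothetical, and for small samples a t-based interval for the mean would normally be preferred.

```python
from math import sqrt
from statistics import NormalDist

def mean_ci(mean, sd, n, confidence=0.95):
    """Large-sample confidence interval for a population mean."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    half_width = z * sd / sqrt(n)
    return mean - half_width, mean + half_width

def proportion_ci(p_hat, n, confidence=0.95):
    """Large-sample (Wald) confidence interval for a population proportion."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    half_width = z * sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - half_width, p_hat + half_width

# Hypothetical sample: mean arterial pressure 93 mmHg, SD 10 mmHg, n = 100.
low, high = mean_ci(93, 10, 100)
# Hypothetical sample: 60% of 200 patients had an improved outcome.
p_low, p_high = proportion_ci(0.60, 200)
print(round(low, 2), round(high, 2), round(p_low, 3), round(p_high, 3))
```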

  18. How large a sample do I need? Answer: It depends on the type of study: A. Estimation, B. Testing Hypothesis. A. Estimation of the population mean, μ, and the population proportion, p

  20. Example • A hospital administrator wishes to estimate the mean weight of babies born in her hospital. How large a sample of birth records should be taken if she wants to be 95% confident that the sample mean weight will be within 0.50 pound of the true mean weight of all babies born in her hospital?

  21. Assume that a reasonable estimate of σ is 1 pound. Using the formula,
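
The formula itself is missing from the transcript. A common choice for this problem, assumed here, is n = (z × σ / E)², where E is the desired margin of error and z is the standard normal quantile for the confidence level. The function name `sample_size_for_mean` is hypothetical.

```python
from math import ceil
from statistics import NormalDist

def sample_size_for_mean(sigma, margin, confidence=0.95):
    """Sample size so the sample mean is within `margin` of the true mean."""
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)  # about 1.96 for 95%
    # Round up, since a fractional record cannot be sampled.
    return ceil((z * sigma / margin) ** 2)

# Slide 20 and 21 values: sigma = 1 pound, margin = 0.50 pound, 95% confidence.
n = sample_size_for_mean(sigma=1.0, margin=0.50)
print(n)  # 16
```

Under this assumed formula, (1.96 × 1 / 0.5)² is about 15.4, so 16 birth records would be needed.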

  26. HYPOTHESIS TESTING AND SAMPLE SIZE: A HYPOTHESIS IS AN ASSERTION ABOUT A POPULATION PARAMETER, SUCH AS THE POPULATION MEAN OR THE POPULATION PROPORTION.

  27. TWO TYPES OF HYPOTHESES: NULL AND ALTERNATIVE. THE RESEARCHER WISHES TO DISCREDIT THE NULL STATEMENT.

  28. ERRORS IN HYPOTHESIS TESTING: TYPE I ERROR AND TYPE II ERROR • TYPE I ERROR: REJECTING THE NULL HYPOTHESIS WHEN IT IS TRUE • TYPE II ERROR: ACCEPTING (FAILING TO REJECT) THE NULL HYPOTHESIS WHEN IT IS FALSE

  29. PROBABILITY OF COMMITTING THESE ERRORS: ALPHA (TYPE I) AND BETA (TYPE II)

  30. P-value • It is the smallest significance level at which the null hypothesis would be rejected. • Compare it to the level of significance, α (normally .05): reject the null hypothesis when the p-value is at most α.
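
As a small illustration of comparing a p-value with α, the sketch below computes a two-sided p-value from a z test statistic. The function name `two_sided_p_value` and the observed statistic of 2.1 are hypothetical.

```python
from statistics import NormalDist

def two_sided_p_value(z):
    """Two-sided p-value for a standard normal test statistic."""
    return 2 * (1 - NormalDist().cdf(abs(z)))

alpha = 0.05                  # level of significance
p = two_sided_p_value(2.1)    # hypothetical observed z statistic
reject_null = p <= alpha
print(round(p, 4), reject_null)
```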

  31. II. SAMPLE SIZE FOR COMPARISON OF TWO GROUPS: • DATA TYPE: A. NUMERICAL DEPENDENT VARIABLE

  32. PROBLEM: THE RESEARCH QUESTION IS WHETHER THERE IS A DIFFERENCE IN THE EFFICACY OF SALBUTAMOL AND IPRATROPIUM BROMIDE FOR THE TREATMENT OF ASTHMA. • DESIGN: RANDOMIZED TRIAL TO DETERMINE THE EFFECT OF THESE DRUGS ON FEV1 (FORCED EXPIRATORY VOLUME IN 1 SECOND) AFTER 1 WEEK OF TREATMENT.

  33. ANALYSIS: DIFFERENCES IN MEANS • TEST: t-TEST • SPECIFICATIONS: 1. NULL AND ALTERNATIVE HYPOTHESES: NULL: MEAN FEV1 AFTER ONE WEEK OF TREATMENT IS THE SAME IN ASTHMATIC PATIENTS TREATED WITH SALBUTAMOL AS IN THOSE TREATED WITH IPRATROPIUM BROMIDE. ALTERNATIVE (2-SIDED):

  34. 2. MEAN FEV1 = 2 LITERS, STD. DEVIATION = 1 LITER FOR IPRATROPIUM (FROM LITERATURE). 3. EFFECT SIZE = 0.2 LITERS (10% × 2 LITERS). STANDARDIZED EFFECT SIZE = (EFFECT SIZE / STD. DEV.) = 0.2 / 1 = 0.2 (UNITLESS)

  35. 4. LEVEL OF SIGNIFICANCE = .05, POWER = .80. THUS SAMPLE SIZE PER GROUP = 393

  36. WHEN THERE IS HIGH BETWEEN-SUBJECT VARIABILITY AMONG OBSERVATIONS • DESIGN: RANDOMIZED TRIAL • ANALYSIS: PRE-POST CHANGES • TEST: t-TEST

  37. 1. HYPOTHESES: H0: THE MEAN CHANGE IN FEV1 IS THE SAME IN BOTH GROUPS. HA: THE MEAN CHANGE IN FEV1 IS DIFFERENT. 2. STANDARD DEVIATION OF THE CHANGE = 0.25 LITERS (FROM PILOT). 3. EFFECT SIZE = 0.2 LITERS; STANDARDIZED EFFECT SIZE = 0.2 / 0.25 = .80. 4. LEVEL OF SIGNIFICANCE = .05, POWER = .80. FROM FORMULA, n = 25 PER GROUP
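
The transcript does not show which formula produced these sample sizes. A common normal-approximation formula, assumed here, is n per group = 2 × (z for α/2 + z for power)² / d², where d is the standardized effect size. The function name `n_per_group_two_means` is hypothetical. Under this assumption the sketch reproduces both figures from slides 35 and 37.

```python
from math import ceil
from statistics import NormalDist

def n_per_group_two_means(effect, sd, alpha=0.05, power=0.80):
    """Approximate per-group sample size for a two-sided comparison of two means."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96
    z_beta = NormalDist().inv_cdf(power)           # about 0.84
    d = effect / sd                                # standardized effect size
    return ceil(2 * (z_alpha + z_beta) ** 2 / d ** 2)

# Slides 34 and 35: effect 0.2 liters, SD 1 liter (d = 0.2).
print(n_per_group_two_means(effect=0.2, sd=1.0))   # 393
# Slide 37: effect 0.2 liters, SD of the change 0.25 liters (d = 0.8).
print(n_per_group_two_means(effect=0.2, sd=0.25))  # 25
```

Analyzing pre-post changes shrinks the standard deviation from 1 to 0.25 liters, which quadruples d and cuts the required sample size by roughly a factor of 16.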

  38. DATA TYPE: B. CATEGORICAL (BINARY) DEPENDENT VARIABLE • EXAMPLE: PROPORTION OF MEN WHO DEVELOP CORONARY HEART DISEASE (CHD) WHILE TREATED WITH ASPIRIN COMPARED WITH THE PROPORTION WHO DEVELOP CHD WHILE TAKING A PLACEBO

  39. SPECIFICATION REQUIREMENTS: 1. EFFECT SIZE IS SPECIFIED BY SPECIFYING P1 AND P2. TYPE OF STUDIES: A. COHORT STUDIES: P1 AND P2 ARE PROPORTIONS OF SUBJECTS EXPECTED TO HAVE THE OUTCOME IN THE TWO GROUPS. 2. STATE THE NULL AND ALTERNATIVE HYPOTHESES. 3. SET ALPHA AND BETA.

  40. PROBLEM: THE RESEARCH QUESTION IS WHETHER ELDERLY SMOKERS HAVE GREATER INCIDENCE OF SKIN CANCER THAN ELDERLY NONSMOKERS - COHORT

  41. EXAMPLE: HOW MANY SMOKERS AND NONSMOKERS WILL NEED TO BE STUDIED TO DETERMINE WHETHER THE 5-YEAR SKIN CANCER INCIDENCE IS AT LEAST 30% IN SMOKERS? 1. H0: THE INCIDENCE IS THE SAME. HA: THE INCIDENCE IS DIFFERENT. 2. 5-YEAR INCIDENCE OF SKIN CANCER IS ABOUT 20% IN NONSMOKERS (FROM LITERATURE REVIEW). 3. ALPHA = 0.05 AND POWER = 0.80. n = 313 FOR A TWO-SIDED HA; n = 250 FOR A ONE-SIDED HA. ** THE ABOVE PROBLEM MAY BE STATED IN THE FORM OF RELATIVE RISK.
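
The slides do not show the formula behind these numbers. The sketch below assumes the normal-approximation formula for comparing two proportions with a continuity correction, which is one common choice, and treats n as a per-group size. The function name `n_per_group_two_proportions` is hypothetical. Under these assumptions the results are approximately 312.8 (two-sided) and 250.4 (one-sided), which round to the slide's 313 and 250.

```python
from math import sqrt
from statistics import NormalDist

def n_per_group_two_proportions(p1, p2, alpha=0.05, power=0.80, two_sided=True):
    """Approximate per-group sample size for comparing two proportions."""
    tail = alpha / 2 if two_sided else alpha
    z_alpha = NormalDist().inv_cdf(1 - tail)
    z_beta = NormalDist().inv_cdf(power)
    p_bar = (p1 + p2) / 2
    diff = abs(p1 - p2)
    # Uncorrected normal-approximation sample size.
    n = (z_alpha * sqrt(2 * p_bar * (1 - p_bar))
         + z_beta * sqrt(p1 * (1 - p1) + p2 * (1 - p2))) ** 2 / diff ** 2
    # Continuity correction (assumed; the slides do not state it).
    return n / 4 * (1 + sqrt(1 + 4 / (n * diff))) ** 2

# Slide 41 values: 30% incidence in smokers, 20% in nonsmokers.
n_two_sided = n_per_group_two_proportions(0.30, 0.20)
n_one_sided = n_per_group_two_proportions(0.30, 0.20, two_sided=False)
print(round(n_two_sided), round(n_one_sided))
```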

  42. EXAMPLE: AN INVESTIGATOR IS INTERESTED IN WHETHER WOMEN WHO USE ORAL CONTRACEPTIVES ARE AT A MUCH HIGHER RISK OF HAVING A MYOCARDIAL INFARCTION WHEN COMPARED TO NON-USERS (PROSPECTIVE)

  43. B. CASE-CONTROL STUDY: • SPECIFICATION REQUIREMENTS: • 1. THE ODDS RATIO TO BE DETECTED IN THE CASE GROUP • 2. P2: THE PROPORTION OF CONTROLS EXPOSED TO THE PREDICTOR VARIABLE

  44. WHERE P1 IS THE PROPORTION OF CASES EXPOSED TO THE PREDICTOR VARIABLE

  45. EXAMPLE: THE INVESTIGATOR 1. EXPECTS THAT 10% OF CONTROLS WILL BE EXPOSED TO ORAL CONTRACEPTIVES (P2) AND 2. WISHES TO DETECT AN ODDS RATIO OF 3 ASSOCIATED WITH THE EXPOSURE. FROM FORMULA, P1 = 0.25. HENCE, FOR A TWO-SIDED HYPOTHESIS, n = 112 PER GROUP
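
The formula relating P1 to the odds ratio is missing from the transcript. The standard relationship, assumed here, is P1 = OR × P2 / ((1 − P2) + OR × P2), and it reproduces the P1 = 0.25 quoted on slide 45. The function name `case_exposure_proportion` is hypothetical.

```python
def case_exposure_proportion(odds_ratio, p2):
    """Proportion of cases exposed, given the odds ratio and control exposure p2."""
    return odds_ratio * p2 / ((1 - p2) + odds_ratio * p2)

# Slide 45 values: 10% of controls exposed, odds ratio of 3 to be detected.
p1 = case_exposure_proportion(odds_ratio=3, p2=0.10)
print(round(p1, 2))  # 0.25
```

Once P1 and P2 are both known, the sample size follows from a two-proportion calculation at the chosen alpha and power.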

  46. Questions?

  47. Thank You! My contact information: Dr. Francois O. Tuamokumo, Phone: (301) 319 8788, Email: francois.tuamokumo@med.navy.mil
