120 likes | 282 Views
GINI/LISA Birth Cohort Studies Internal Record Linking For Analysis. Wolfgang Lohr Helmholtz Zentrum München Institute of Epidemiology. Wesel. Leipzig. Bad Honnef. Munich. study population. GINI 2 study centers recruitment period 1995-1998 5991 newborns recruited LISA
E N D
GINI/LISA Birth Cohort StudiesInternal Record Linking For Analysis Wolfgang Lohr Helmholtz Zentrum München Institute of Epidemiology
Wesel Leipzig Bad Honnef Munich study population GINI • 2 study centers • recruitment period 1995-1998 • 5991 newborns recruited LISA • 4 study centers • recruitment period 1997-1999 • 3097 newborns recruited
study objectives • development of atopic diseases • development of the immune system • growth and obesity • metabolic and inflammatory markers • pre- and postnatal environmental exposure • genetic predisposition • lifestyle factors • nutrition
SAS .csv/.txt Access Excel source data – intial situation • non-uniform data sources and formats • different file locations • different file versions • considerable effort to create a subset of data for analysis • modifications/corrections can hardly be achieved
SAS .csv/.txt Access Excel coding labels standardised data 1
standardised data 2 • conversion to a uniform data format (Access) • plausibility checks of master data (ID, date of birth, sex) • harmonization of variable names/coding
export of linked datasets (user form) • list of all variables (sort/filter option) • fast access to original datasets • save/load user-defined variable sets • view information on single variables (frequency table/summary statistics)
export of linked datasets (result) • csv-file with actual data • script to import csv-file into SAS • script to apply SAS-formats • variable description/code book
record linkage with google maps 1 features: • check addresses for correct spelling • complete missing address information • uniform address format • geocoding
record linkage with google maps 2 concerns: • googles’ “collection mania”: do they use the sent addresses? • limit of 2500 addresses/day • free use of google maps geocoding data only for publication in google maps(http://code.google.com/intl/en/apis/maps/terms.html) technical documentation: • http://code.google.com/intl/en/apis/maps/documentation/geocoding implementation with ms access: • www.access-im-unternehmen.de/746