Explore the journey from data access to integration at the IAOS conference in Shanghai 2008, presented by Annegrete Wulff from Statistics Denmark. Accessible products include publications since 1850, municipality statistics databank, and StatBank Denmark. Learn about dissemination principles, simultaneous releases, and the Statistical Information System. Discover how StatBank Denmark evolved from restricted access to a comprehensive online resource with over 2,000 tables. Satisfy diverse user needs, improve response time, and enhance data visualization with self-service options. The integration process involves defining concepts, dimensions, and members in the StatBank. Experience seamless data integration, metadata access, and statistical abstracts at Statistics Denmark.
From Data Access to Data Integration IAOS, Shanghai 14-16 October 2008 Annegrete Wulff, Statistics Denmark awu@dst.dk
Accessible products • Publications (ca. 1850 - ) • Municipality statistics databank 1986-98 • StatBank Denmark (www.statbank.dk) 1998 – • Homepage www.dst.dk 1996 -
Dissemination principles • Electronic first • StatBank is the place for all official statistics • StatBank is the source for all publications • StatBank is available online and free of charge for everyone • Simultaneous releases in all media at 9:30:00 am • Dissemination should address well-defined target groups and types of usage • …yet still use the same source (data and metadata)
The Statistical Information System • Identifiers: Person id (Person Number), Dwelling id (Address), Enterprise id (CBR-No) • Registers and sources: CPR, BDR, CBR, Cadastre, Education, Employment, Tax, Social, Health etc., Interview, Questionnaire
Access • www.statbank.dk • Databank: Population, Labour market, Income, Finance, Agriculture etc.
StatBank Denmark • Subject matter division • Dissemination, IT-Centre • Cleaned micro data • Statistical registers • Anonymised micro data for researchers • Aggregation to macro data • Charged statistics and analysis • SumDatabase • pdf • Publication • www.dst.dk • International organisations • Print • Binding • dst.dk • The Public
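The "aggregation to macro data" step in this flow can be illustrated as a count over micro data records. This is only a minimal sketch, not Statistics Denmark's actual pipeline: the record fields, the sample values and the function name `aggregate` are all hypothetical.

```python
from collections import Counter

# Hypothetical micro data: one record per person (values are invented).
micro_data = [
    {"region": "Copenhagen", "sex": "Women"},
    {"region": "Copenhagen", "sex": "Men"},
    {"region": "Aarhus", "sex": "Women"},
    {"region": "Copenhagen", "sex": "Women"},
]

def aggregate(records, dimensions):
    """Count records per combination of values of the given dimensions."""
    return Counter(tuple(r[d] for d in dimensions) for r in records)

# Macro table: number of persons by region and sex.
macro_table = aggregate(micro_data, ["region", "sex"])
for cell, count in sorted(macro_table.items()):
    print(cell, count)
```

Each key of `macro_table` is one cell of the published table; the individual records no longer appear in the output.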
User needs • More users • Variety of users • Different – and increasing – user needs • User satisfaction surveys
Shift of focus …..1980’s • Electronic on-line access • Content: more details than on paper • Design of tables on the fly • Calculations • Download possibilities • Output formats
Statistics Denmark’s first databank, 1986 • 200 users – and we knew them all
Shift of focus …..1990’s • Internet access • - but also off-line products: CD-ROM • Functionality (calculations) • Interactive aggregations • Contact possibilities – who to ask for more
Internet databank, ver. 1.0 1998 Access “restricted” by a fee
Shift of focus …..2000’s • Presentation, layout • Documentation • Linking, coherence • Long time span • Search
Homepage 2004 Documentation, quality declaration, graphics
Shift of focus …..2008’s • Response time • Definitions • Search and browse • Visualisation, maps and graphics • Self service • Integration with own systems
StatBank Denmark • More than 2,000 tables, several billions of data • Links to documentation (declaration of contents) • Links to publications • Saved queries • Data shooting • Excel web queries • Output formats: Excel, PC-AXIS, xml, SAS, comma separated, time series,… • Maps • Graphs
• 2,500 matrices in Danish and English • 2 million retrievals • 77 % only on screen (HTML table) • 6 % in maps, 17 % in graphs • 23 % downloads of a file. Of these: • 86 % in Excel • 9 % in PC-AXIS • 5 % in other formats
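As a rough worked example, the percentages above can be turned into approximate counts. This sketch assumes that the 23 % download share applies to the full 2 million retrievals and that the rounded slide figures can be treated as exact; neither assumption is stated on the slide.

```python
# Figures taken from the slide; treating them as exact is an assumption,
# since the published percentages are rounded.
retrievals = 2_000_000
download_share_pct = 23
format_share_pct = {"Excel": 86, "PC-AXIS": 9, "other formats": 5}

# Integer arithmetic avoids floating-point rounding noise.
downloads = retrievals * download_share_pct // 100
by_format = {name: downloads * pct // 100 for name, pct in format_share_pct.items()}

print("downloads:", downloads)
for name, count in by_format.items():
    print(name, count)
```

Under these assumptions, about 460,000 retrievals ended in a file download, of which roughly 395,600 were in Excel.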
Many sources of metadata Subject matter division Statisticians Technical sources Statistics Denmark publications External documents Legal documents etc TIMES, microdata documentation Quality declarations Statistical Abstracts Annual publications, Nomenclatures Statistical Yearbook & Ten year review Metadata project
Metadata project Statistical Yearbook & Ten year review Annual publications, Nomenclatures Statistical Abstracts Quality declarations TIMES, microdata documentation Legal documents etc External documents Statistics Denmark publications Subject matter division Technical sources Statisticians One source for all metadata
StatBank Denmark: Metadata • code explanations • source • quality • accessibility • concepts, definitions • methodologies • contacts • release info Anonymised micro data for researchers Charged statistics and analysis www.dst.dk International organisations dst.dk
The process • Select the concepts • Define the concepts • Make the definitions accessible
Concepts in the StatBank • 850 dimensions • 173,000 dimension members • 8,000 will be defined
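One way to picture dimensions, their members and the definitions attached to them is the sketch below. It is not the StatBank's actual data model: the class names, field names and example codes are hypothetical. The last lines use the slide's own figures to show what share of members the 8,000 planned definitions would cover.

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class Member:
    code: str
    label: str
    definition: Optional[str] = None  # None until a definition is written

@dataclass
class Dimension:
    name: str
    members: List[Member] = field(default_factory=list)

def defined_share(dimensions):
    """Fraction of all dimension members that carry a definition."""
    members = [m for d in dimensions for m in d.members]
    if not members:
        return 0.0
    return sum(m.definition is not None for m in members) / len(members)

# Hypothetical example dimension (codes, labels and text are invented).
sex = Dimension("sex", [Member("M", "Men"), Member("W", "Women", "definition text")])
print(defined_share([sex]))

# With the slide's figures: 8,000 of 173,000 members will be defined.
planned_share = 8_000 / 173_000
print(f"{planned_share:.1%}")
```

With the slide's figures, definitions are planned for just under 5 % of all dimension members.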
Examples: table titles • Unemployed by region, ancestry and sex (monthly) • Immigrated by region, country of origin, age and sex (continuous years)
Born in Denmark, and neither parent is both born in Denmark and a Danish citizen
At least one parent is born in Denmark and has Danish citizenship
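The two definitions above can be written as a small classification rule. This is a sketch under stated assumptions: the labels `"danish_origin"` and `"descendant"` do not appear on the slides, the parent-based rule is applied regardless of the person's own place of birth, and the function and field names are hypothetical. Persons matching neither definition are left unclassified.

```python
def has_danish_parent(parents):
    """True if at least one parent is born in Denmark AND has Danish citizenship."""
    return any(p["born_in_denmark"] and p["danish_citizen"] for p in parents)

def classify_ancestry(born_in_denmark, parents):
    """Apply the two slide definitions; category labels are assumptions."""
    if has_danish_parent(parents):
        return "danish_origin"  # assumed label for the parent-based definition
    if born_in_denmark:
        return "descendant"     # assumed label: born in Denmark, no such parent
    return None                 # not covered by the two definitions shown

# Hypothetical parent records.
danish_parent = {"born_in_denmark": True, "danish_citizen": True}
foreign_parent = {"born_in_denmark": False, "danish_citizen": False}

print(classify_ancestry(True, [danish_parent, foreign_parent]))
print(classify_ancestry(True, [foreign_parent, foreign_parent]))
print(classify_ancestry(False, [foreign_parent, foreign_parent]))
```

Note that a parent who is born in Denmark but is not a Danish citizen does not satisfy the parent condition, because both properties must hold for the same parent.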
Integration and linking • Integration across • Media • Level of detail • Topic