1 / 31

From Data Access to Data Integration

Explore the journey from data access to integration at the IAOS conference in Shanghai 2008, presented by Annegrete Wulff from Statistics Denmark. Accessible products include publications since 1850, municipality statistics databank, and StatBank Denmark. Learn about dissemination principles, simultaneous releases, and the Statistical Information System. Discover how StatBank Denmark evolved from restricted access to a comprehensive online resource with over 2,000 tables. Satisfy diverse user needs, improve response time, and enhance data visualization with self-service options. The integration process involves defining concepts, dimensions, and members in the StatBank. Experience seamless data integration, metadata access, and statistical abstracts at Statistics Denmark.

jquintero
Download Presentation

From Data Access to Data Integration

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. From Data Access to Data Integration IAOS, Shanghai 14-16 October 2008 Annegrete Wulff, Statistics Denmark awu@dst.dk

  2. Accessible products • Publications (ca. 1850 - ) • Municipality statistics databank 1986-98 • StatBank Denmark (www.statbank.dk) 1998 – • Homepage www.dst.dk 1996 -

  3. Dissemination principles • Electronic first • StatBank is the place for all official statistics • StatBank is the source for all publications • StatBank is online available & free-of-charge for everyone • Simultaneous releases in all media 9:30:00 am • Dissemination should address well-defined • target groups • types of usage • …jet still use the same source (data and metadata)

  4. Educa- tion Employ- ment CPR Person id: Person Number Inter- view Tax Question- naire Social Health etc Dwelling id: Address Enterprise id: CBR-No Cadastre BDR CBR The Statistical Information System

  5. Population Labour market www.statbank.dk Income Databank Finance Agriculture etc. Access

  6. StatBank Denmark - - Subject matter division Dissemination, IT-Centre Cleaned micro data Statistical registers Annonymos micro data for Researchers Aggregation to macro data Charged statistics and analysis SumDatabase pdf Publication www.dst.dk International organisations Print Binding dst.dk The Public

  7. Users needs • More users • Variety of users • Different – and increasing user needs • User satisfaction surveys

  8. Shift of focus …..1980’s • Electronic on-line access • Content: more details than on paper • Design of tables on the fly • Calculations • Download possibilities • Output formats

  9. Statistics Denmark’s first databank 1986 200 users – and we new them all

  10. Shift of focus …..1990’s • Internet access • - but also off-line products: CD-ROM • Functionality (calculations) • Interactive aggregations • Contact possibilities – who to ask for more

  11. Internet databank, ver. 1.0 1998 Access “restricted” by a fee

  12. Shift of focus …..2000’s • Presentation, layout • Documentation • Linking, coherence • Long time span • Search

  13. Homepage 2004 Documentation, quality declaration, graphics

  14. Shift of focus …..2008’s • Response time • Definitions • Search and browse • Visualisation, maps and graphics • Self service • Integration with own systems

  15. StatBank Denmark • More than 2,000 tables, several billions of data • Links to documentation (declaration of contents) • Links to publications • Saved queries • Data shooting • Excel web queries • Output formats: Excel, PC-AXIS, xml, SAS, comma separated, time series,… • Maps • Graphs

  16. Satisfied users

  17. Unsatisfied users

  18. 2,500 matrices in Danish and English 2 million retrievals 77 % only on screen 6 % in maps, 17 % graphs HTML table on screen Downloads of a file • 23% downloads. Of these: • 86 % in Excel • 9 % in PC-AXIS • 5% in other formats

  19. Many sources of metadata Subject matter division Statisticians Technical sources Statistics Denmark publications External documents Legal documents etc TIMES, microdata documentation Quality declarations Statistical Abstracts Annual publications, Nomenclatures Statistical Yearbook & Ten year review Metadataproject

  20. Metadata project Statistical Yearbook & Ten year review Annual publications, Nomenclatures Statistical Abstracts Quality declarations TIMES, microdata documentation Legal documents etc External documents Statistics Denmark publications Subject matter division Technical sources Statisticians One source for all metadata

  21. StatBank Denmark - - Metadata • code explanations • source • quality • accessability • concepts, definitions • methodologies • contacts • release info Annonymos micro data for researchers Charged statistics and analysis www.dst.dk International organisations dst.dk

  22. The process • Select the concepts • Define the concept • Making the definition accessible

  23. Concepts in the StatBank • 850 dimensions • 173,000 dimension members • 8,000 will be defined

  24. Examples: table titles • Unemployed by region, ancestry and sex (monthly) • Immigrated by region, country of origin, age and sex (continous years)

  25. Example of concepts

  26. Born in Denmark and neither of the parents is born in Denmark and has Danish citizenship as well

  27. At least one parent is born in Denmark and has Danish citizenship

  28. Glossary

  29. Integration and linking • Integration across • Media • Level of detail • Topic

  30. Thank you

More Related