1 / 17

BLS Metadata Repository – Issues and Progress

BLS Metadata Repository – Issues and Progress. Daniel Gillman US Bureau of Labor Statistics. Outline. BLS Programs Time Series Data Dissemination Metadata Model BLS Repository. Wolfram Data Summit. BLS Programs. 8 Major Program Areas Inflation & Prices Employment Unemployment

selia
Download Presentation

BLS Metadata Repository – Issues and Progress

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. BLS Metadata Repository – Issues and Progress Daniel Gillman US Bureau of Labor Statistics

  2. Outline • BLS Programs • Time Series • Data Dissemination • Metadata • Model • BLS Repository Wolfram Data Summit Wolfram Data Summit

  3. BLS Programs • 8 Major Program Areas • Inflation & Prices • Employment • Unemployment • Pay & Benefits • Spending & Time Use • Productivity • Workplace Injuries • International Wolfram Data Summit Wolfram Data Summit

  4. Time Series • Measure or index over time • Index: number relative to fixed point • 30 series types • Subset by • Industry • Occupation • Geography (state, county, MSA, etc) • Tables • Generated from time series data Wolfram Data Summit

  5. Data Dissemination • Web site: http://www.bls.gov • 8 major numbers • Unemployment rate (m) • Consumer price index (m) • Producer price index (m) • Employment cost index (q) • Average hourly earnings (m) • Payroll employment (m) • Productivity (q) • Import price index (m) • All time series • Tables Wolfram Data Summit

  6. Data dissemination Wolfram Data Summit

  7. Data Dissemination • Organized by programs • Time series in ASCII files by FTP • Some tables • Crude database search • Little metadata • Web site itself • Hidden in FTP directories • Handbook of Methods • Seasonal adjustment Wolfram Data Summit

  8. Data Dissemination • Requires knowing • Organization of BLS • Specific surveys or programs • Specific series • Terms & technical meaning • E.g., earnings • Relies on “Series ID” • Brittle scheme for identifying series • Known by power users Wolfram Data Summit

  9. Metadata • Supports • Dissemination • Support Data.Gov • Time series and tables • Does not support • Internal processing • Describing survey life-cycle • Microdata (respondent level) Wolfram Data Summit

  10. Metadata • Hard to collect • Need “simple” model • Maybe not so easy • Basic metadata already on FTP sites • Support finding data by • Traditional means • Series ID, BLS structure • New means • Subject matter Wolfram Data Summit

  11. Metadata • Previous BLS focus group study • Users find data by • Time • Place • Subject (title or keywords) • Structure of agencies not known • Technical terms not known • Metadata must support this Wolfram Data Summit

  12. Model • Model – • Time Series • Data Element • Classification • Concept • Naming Convention Wolfram Data Summit

  13. Model Wolfram Data Summit

  14. BLS Repository • Under development • Requirement – fast response • Testing – • Flat single table design • Using Apache Lucene Solr • Open source enterprise search • Various interface approaches • Visual Basic • Java Wolfram Data Summit

  15. BLS Repository • Need term map • Common terms to technical terms • Definitions for technical terms • Concept based management • Link terms to relevant data • Manage multi-faceted search • Development schedule • Still research project Wolfram Data Summit

  16. Daniel Gillman gillman.daniel@bls.gov

More Related