1 / 24

Executive Briefing: Data catalogs – Concepts , capabilities, and key platforms

Executive Briefing: Data catalogs – Concepts , capabilities, and key platforms. Andrew J. Brust Founder & CEO Blue Badge Insights, Inc. Meet Andrew. Founder and CEO Big Data blogger for ZDNet Data/analytics analyst for Gigaom Microsoft Regional Director, MVP

tannar
Download Presentation

Executive Briefing: Data catalogs – Concepts , capabilities, and key platforms

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Executive Briefing: Data catalogs – Concepts, capabilities, and key platforms Andrew J. Brust Founder & CEO Blue Badge Insights, Inc.

  2. Meet Andrew Founder and CEO Big Data blogger for ZDNet Data/analytics analyst for Gigaom Microsoft Regional Director, MVP Co-chair Visual Studio Live! Twitter: @andrewbrust

  3. Shameless Plugs bit.ly/abrustzdnet

  4. Agenda The nucleus, value-adds Motivation, sources, taxonomy, MLDC Overlaps and embedded Cloud, OSS Acquisitions, map of the market Assessment and forecast

  5. The Nucleus

  6. Value-Adds

  7. Agenda The nucleus, value-adds Motivation, taxonomy, sources, MLDC Overlaps and embedded Cloud, OSS Acquisitions, map of the market Assessment and forecast

  8. Motivation, Importance

  9. Major Data Sources

  10. Taxonomy

  11. The Many Faces of Machine Learning Data Catalogs

  12. Agenda The nucleus, value-adds Motivation, sources, taxonomy, MLDC Overlaps and embedded Cloud, OSS Acquisitions, map of the market Assessment and forecast

  13. TheData Catalog “Orbit”

  14. Embedded Catalog Examples

  15. Agenda The nucleus, value-adds Motivation, sources, taxonomy, MLDC Overlaps and embedded Cloud, OSS Acquisitions, map of the market Assessment and forecast

  16. Cloud Data Catalogs Purpose #1: serve as metadata store for data prep/transformation, data lake, ML and other services Purpose #2: be a full-featured standalone data catalog AWS Glue and Google Data Catalog fall under 1 Microsoft Azure falls under 2, but underperforms • Rule of thumb: keep an eye out for announcements during Ignite conference in early November • Consider Microsoft’s history with SharePoint’s Business Data Catalog and its current Common Data Service initiative

  17. Open Source

  18. Agenda The nucleus, value-adds Motivation, sources, taxonomy, MLDC Overlaps and embedded Cloud, OSS Acquisitions, map of the market Assessment and forecast

  19. Acquisitions Cloudera/Hortonworks • Cloudera Navigator being phased out in favor of Cloudera Data Catalog, based on the former Hortonworks Data Steward Studio (and Atlas) Qlik/Podium Data • Podium Data becomes Qlik Data Catalyst • Qlik also bought Attunity, giving it BI, catalog and data integration under one roof

  20. Map of the Market Embedded Cloud Enterprise Data Management Pure Plays

  21. Agenda The nucleus, value-adds Motivation, sources, taxonomy, MLDC Overlaps and embedded Cloud, OSS Acquisitions, map of the market Assessment and forecast

  22. Assessment and Forecast

  23. Rate today’s session Session page on conference website O’Reilly Events App

  24. Thank You! andrew.brust@bluebadgeinsights.com @andrewbrust http://bit.ly/abrustzdnet

More Related