Querying The Web Database

Michael J. Cafarella, University of Michigan
CS4HS, August 18, 2010

Two kinds of databases
• Structured databases (your bank)
  • Expensive, hard to use
  • Few sources of data
  • Powerful queries
    • “Who lives in Ypsilanti and has a balance between $800 and $1400?”
• Unstructured databases (the Web)
  • Cheap, easy to use
  • Many sources of data
  • Very boring “topic” queries
    • britney spears, etc.
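To make the "powerful queries" point concrete, here is a minimal sketch of the Ypsilanti question asked of a structured database, using Python's built-in `sqlite3` module. The `accounts` table, its column names, and every row in it are hypothetical and invented purely for illustration; the talk does not specify a schema.

```python
import sqlite3

# Hypothetical schema and rows, invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE accounts (name TEXT, city TEXT, balance REAL)")
conn.executemany(
    "INSERT INTO accounts VALUES (?, ?, ?)",
    [
        ("Alice", "Ypsilanti", 950.0),
        ("Bob", "Ann Arbor", 1200.0),
        ("Carol", "Ypsilanti", 2500.0),
        ("Dave", "Ypsilanti", 1400.0),
    ],
)


def residents_in_balance_range(conn, city, low, high):
    """Return names of account holders in `city` with low <= balance <= high."""
    rows = conn.execute(
        "SELECT name FROM accounts "
        "WHERE city = ? AND balance BETWEEN ? AND ? "
        "ORDER BY name",
        (city, low, high),
    )
    return [name for (name,) in rows]


matches = residents_in_balance_range(conn, "Ypsilanti", 800, 1400)
print(matches)
```

A keyword search over unstructured pages has no equivalent of the `WHERE` clause: it can match the word "Ypsilanti" but cannot filter on a numeric range, which is the gap the rest of the talk addresses.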

The Structured Web?
• What if we had a structured-data version of everything on the Web?
• “A Database of Everything”
  • “List all scientists from Belgium who were left-handed”
  • “Which heart surgeon in Michigan has the highest success rate?”
  • “List Miami hotels with hot tubs near a beach”

This page contains 16 distinct HTML tables, but only one structured database

WebTables, Schema Statistics, Applications
• The WebTables system automatically extracts databases from a web crawl
• An extracted database is one table plus labeled columns
• We estimate that our crawl of 14.1B raw HTML tables contains ~154M good structured databases
[Figure labels: Raw crawled pages, Raw HTML tables, Recovered databases]
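The idea of "one table plus labeled columns" can be sketched in a few lines of Python with the standard `html.parser` module. This is a toy illustration, not the WebTables system: the `looks_relational` rule (a header row of `<th>` cells, at least two columns, at least two data rows of consistent width) is an assumed stand-in for the real quality filter, nested tables are not handled, and the sample page and its values are made up.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collects every <table> as a list of rows; a row is a list of (tag, text) cells."""

    def __init__(self):
        super().__init__()
        self.tables = []
        self._row = None
        self._cell_tag = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("th", "td") and self._row is not None:
            self._cell_tag = tag
            self._text = []

    def handle_data(self, data):
        if self._cell_tag:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag in ("th", "td") and self._cell_tag and self._row is not None:
            self._row.append((self._cell_tag, "".join(self._text).strip()))
            self._cell_tag = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None
            self._cell_tag = None


def looks_relational(rows):
    # Assumed toy heuristic, NOT the WebTables classifier.
    if len(rows) < 3:
        return False
    header, body = rows[0], rows[1:]
    if len(header) < 2 or any(tag != "th" for tag, _ in header):
        return False
    return all(len(row) == len(header) for row in body)


def recover_databases(html):
    """Return one list of {column: value} records per table that passes the heuristic."""
    parser = TableExtractor()
    parser.feed(html)
    databases = []
    for rows in parser.tables:
        if looks_relational(rows):
            columns = [text for _, text in rows[0]]
            databases.append(
                [dict(zip(columns, (text for _, text in row))) for row in rows[1:]]
            )
    return databases


# Made-up page: one layout table (navigation) and one data table.
PAGE = """
<table><tr><td>Home</td><td>About</td></tr></table>
<table>
  <tr><th>city</th><th>population</th></tr>
  <tr><td>Springfield</td><td>1000</td></tr>
  <tr><td>Shelbyville</td><td>2000</td></tr>
</table>
"""
databases = recover_databases(PAGE)
print(databases)
```

On this page the navigation table is discarded and only the city table survives, mirroring on a small scale the gap between the number of raw HTML tables and the number of good structured databases.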

Easy Data Analysis
• Knowledge worker queries for “city population” [VLDB08, “WebTables: Exploring…”, Cafarella et al.]

Auto Synonym Discovery

Structure Autocomplete

Conclusions
• The Structured Web exists in raw form today, but tools largely ignore it
• Information Extraction helps gather structural information from existing Web info
• These techniques bring the promise of the Structured Web much closer
