270 likes | 403 Views
Linking Open Data with Location: Gazetteers and the Semantic Web. Xavier Lopez, Director, Product Management. Overview. Linked Open Data (LOD) Concepts Role of Gazetteers in LOD Interconnected Web of Content Towards Geospatial Knowledge Management. Linked Data.
E N D
Linking Open Data with Location: Gazetteers and the Semantic Web Xavier Lopez, Director, Product Management
Overview • Linked Open Data (LOD) Concepts • Role of Gazetteers in LOD • Interconnected Web of Content • Towards Geospatial Knowledge Management
Linked Data “A method of publishing structured data, so that it can be interlinked and become more useful” Wikipedia
Linked Open Data – The Opportunity • Manage relationships for massive collections of structured and unstructured data • Flexible and extensible data model supports powerful search and end-user discovery of related content • Enable users to define their social networks or communities based on common interests, subjects, image scenes, locations, etc. • Rich platform for data integration, data repurposing, and better quality control and classification • Tactical, non-invasive, iterative solution for strategic modernization Semantic Aggregation & Navigation of Data
Simple Linked Data Architecture Domain & Task Ontologies User Query & results Data Ontologies (Reasoning/Inferencing) Engine Data Sources
Resource Description Framework (RDF) RDF is a general framework for describing a Web site's metadata, or the information about the information on the site. It provides interoperability between applications that exchange machine-understandable information on the Web. W3C
Gazetteers and Linked Open Data Services • Provide common terms (place names) to link across existing spatial data resources • Enable consolidated view across the map layers • Reconcile differences in data semantics so that they can all “talk”and interoperate • Resolving semantic discrepancies across databases gazetteers and applications • Integrate full breath of enterprise content continuum (structured, spatial, email, documents, web services)
Modeling: Enterprise Integration • Ordnance Survey maintains definitive mapping data of Great Britain, the world’s largest and most detailed Geo DB • Semantic Web is used to integrate different, semantically diverse sources of data • General ontologies already developed to bridge differences in terminology • The data is queried efficiently via the ontology or RDF • Advantages include efficient data integration, data repurposing, and better quality control and classification Source: http://www.w3.org/2001/sw/sweo/public/UseCases/
Knowledge Management Conditions • Filtering search queries with “context” • Discovery of data relationships across… • Structured data (database, apps, web services, RSS schemas) • Unstructured data (email, office documents) • Queries are not defined in advance • Schemas are continuously evolving • Support Machine2Machine interaction • Location can be common link, along with names, concepts, synonyms
What Can Linked Data Enable? Mapping & Geotagging Social Network Relations Analysis of Complex Relations Rule-based Reasoning
Simple Features GeoRaster Topology Networks Gazateers … Geographic Names Spatial Data Raster Data Ontology-driven Map Apps Application Ontologies Situational Awareness National Map Core Datasets RDF & OWL Data Theater • Data Integration • National Map schemas • Geographic names • Temporal • Naïve Geography • … Targeting
A “Simple” Knowledge Query Which hospitals within 30 mins of Alpine, CA provide burn treatment?” • We need to associate a number of factors, including hospital type and facilities – its accessibility after a disaster – and the staff available • The query needs to be structured based on Concepts & Relationships that can be retrieved and then customized for the specific query. • Using this approach, a listing of the hospitals capable of dealing with large number of burn cases is returned to the user and information associated with the query retrieved.
“Typical” Analyst Query: “Which hospitals within 30 mins of Alpine, CA provide burn treatment?” Feature Reference Type? What does this mean? Buffer or proximity? Definition? Driving or Flying? Road Closures? Where is this? Centroid or outline?
Ontologies for Problem Solving Specialists Treatment Weather Burns Unit Duty Rota Emergency Team Burns Illness Skin Graft Flood Hazards Location Hospital Vehicles Route Type Helipad A&E Roads Beds Size Type Obstructions
Structured DBMS, Unstructured, Spatial, RSS, email, Documents Oracle 11g RDF/OWL Graph Data Management • Storage & Loading • Native W3C RDF graph data store • Fast Bulk, batch & Incremental load • Query • SQL: SEM_MATCH graph pattern query • SPARQL: supported via Jena plug-in • Reasoning • RDF, OWL Prime, RDF++ semantic rules • Forward chaining inference model • User defined rule base • Scalability • Scales to billions of triples • Partitioning, RAC, Adv. Compression • Standards & Interoperability • Aligned with W3C specifications • Supported by leading semantic tools
Conclusions • Key semantic technologies are mature • Semantic technologies are key enablers for enterprise and Web • Reuse existing of authoritative gazetteers are needed • Model the real world rather than data artifacts
oracle.com/database/spatial.html Q Find out more... & A oracle.com/technology/products/spatial oracle.com/technology/products/spatial/htdocs/pro_oracle_spatial.html
Information Explosion • Structured data stores are growing in size • Amount of semi-structured data is expanding (XML, RDF, Semantics, Spatial) • Metric data, beacons, sensors supplying mega volumes • Unstructured data is gathered at a staggering pace (email, documents, messages, streams, feeds)
Resource Description Framework (RDF) RDF is a general framework for describing a Web site's metadata, or the information about the information on the site. It provides interoperability between applications that exchange machine-understandable information on the Web. W3C
Simple Transitive Reasoning :partOf :partOf :partOf :NorthAmerica :USA :California rdf:type owl:TransitiveProperty :partOf Asserted Facts :partOf rdf:type owl:TransitiveProperty:California :partOf :USA:USA :partOf :NorthAmerica Derived Facts :California :partOf :NorthAmerica Query: SELECT ?x ?y FROM … WHERE { ?x :partOf ?y } Result: ?x______?y__________ :California :USA :California :NorthAmerica :USA :NorthAmerica
Integrated Bioinformatics Networks Source: Siderean Software
Text/Spatial Mining Workflow Ontology Engineering Modeling Process Information Extraction Categorization, Feature/term Extraction RDF/OWL OWL Ontologies Processed Document Collection Web Resources Domain Specific Knowledge Base News, Email, RSS SQL/SPARQL Query Content Mgmt. Systems Explore Analyst Browsing, Presentation, Reporting, Visualization, Query Spatial Data