Entity Recognition via Querying DBpedia ElShaimaa Ali
Introduction • Wikipedia articles consist mostly of free text, but they also include structured information embedded in the articles, such as "infobox" tables, categorization information, images, geo-coordinates, and links to external web pages. This structured information is extracted and put into a uniform dataset that can be queried.
DBpedia • DBpedia (from "DB" for "database") is a project that aims to extract structured content from the information created as part of the Wikipedia project. This structured information is then made available in the form of RDF triples. • DBpedia is one of the best-known parts of the Linked Data project.
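To make "RDF triples" concrete, here is a minimal sketch that models triples as (subject, predicate, object) tuples and filters them by pattern. The three triples are hand-written for illustration and are not pulled from the live dataset, and the `match` helper is a hypothetical name, not part of any DBpedia tooling.

```python
from typing import Iterable, Iterator, Optional, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

DBR = "http://dbpedia.org/resource/"
DBO = "http://dbpedia.org/ontology/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Illustrative, hand-written triples in the style of DBpedia data.
TRIPLES = [
    (DBR + "Berlin", RDF_TYPE, DBO + "City"),
    (DBR + "Berlin", DBO + "country", DBR + "Germany"),
    (DBR + "Germany", RDF_TYPE, DBO + "Country"),
]


def match(
    triples: Iterable[Triple],
    s: Optional[str] = None,
    p: Optional[str] = None,
    o: Optional[str] = None,
) -> Iterator[Triple]:
    """Yield triples matching the pattern; None acts as a wildcard."""
    for subj, pred, obj in triples:
        if s is not None and subj != s:
            continue
        if p is not None and pred != p:
            continue
        if o is not None and obj != o:
            continue
        yield (subj, pred, obj)
```

Calling `match(TRIPLES, s=DBR + "Berlin")` yields the two triples whose subject is Berlin. A SPARQL triple pattern works on the same idea, with variables in place of `None`.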
Entity Recognition • Entity recognition (also known as entity identification and entity extraction) is a subtask of information extraction that aims to classify concepts or entities into predefined categories such as person, organization, and location.
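One way the classification step could be sketched with DBpedia is to ask for the `rdf:type` values of a resource with a given label and map known ontology classes to categories. This is an assumption about the approach, not the author's stated method: the helper names, the class-to-category table, and the choice to look entities up by `rdfs:label` are all illustrative.

```python
from typing import Iterable, Optional

DBO = "http://dbpedia.org/ontology/"

# Assumed mapping from DBpedia ontology classes to target categories.
CATEGORY_BY_CLASS = {
    DBO + "Person": "person",
    DBO + "Organisation": "organization",
    DBO + "Place": "location",
}


def build_type_query(label: str, lang: str = "en") -> str:
    """Build a SPARQL query returning the types of entities with this label."""
    # Escape backslashes and double quotes so the label is a valid literal.
    escaped = label.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
        "SELECT DISTINCT ?type WHERE {\n"
        f'  ?entity rdfs:label "{escaped}"@{lang} ;\n'
        "          a ?type .\n"
        "}"
    )


def classify(type_uris: Iterable[str]) -> Optional[str]:
    """Return the category of the first recognized type URI, or None."""
    for uri in type_uris:
        if uri in CATEGORY_BY_CLASS:
            return CATEGORY_BY_CLASS[uri]
    return None
```

Given the type URIs returned for a candidate mention, `classify` picks the first one found in the table, so a resource typed as both `dbo:City` and `dbo:Place` would be labeled a location. A real system would also need to handle ambiguous labels and entities with several matching categories.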
Jena • Apache Jena is a free, open-source Java framework for building Semantic Web and Linked Data applications. The framework is composed of several APIs that interact to process RDF data. The most important are: • RDF API • SPARQL API
SPARQL • SPARQL is an SQL-like query language for RDF. For example, the following query returns the value of foaf:name for every resource that has one: “PREFIX foaf: <http://xmlns.com/foaf/0.1/> SELECT ?name WHERE { ?person foaf:name ?name . }”
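As a rough sketch of how the foaf:name query above might be sent to a SPARQL endpoint and its results read back, the code below uses only the Python standard library. The endpoint URL is DBpedia's public one; the `LIMIT 5` clause and all function names are additions for illustration. The parser assumes the standard SPARQL JSON results layout (`results` → `bindings`).

```python
import json
from typing import List
from urllib.parse import urlencode
from urllib.request import Request, urlopen

ENDPOINT = "https://dbpedia.org/sparql"

# The slide's query, with LIMIT 5 added here to keep the response small.
QUERY = """PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?name WHERE { ?person foaf:name ?name . } LIMIT 5"""


def build_request(query: str, endpoint: str = ENDPOINT) -> Request:
    """Build a GET request that asks the endpoint for JSON results."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


def extract_values(results_json: str, var: str) -> List[str]:
    """Pull the values bound to one variable out of a SPARQL JSON result."""
    doc = json.loads(results_json)
    return [b[var]["value"] for b in doc["results"]["bindings"] if var in b]


def run(query: str, var: str = "name") -> List[str]:
    """Send the query over the network and return the bound values."""
    with urlopen(build_request(query), timeout=30) as resp:
        return extract_values(resp.read().decode("utf-8"), var)
```

Calling `run(QUERY)` needs network access and depends on the endpoint being available, so it is kept separate from the parsing logic, which can be exercised offline with a saved response.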
Contacts • Email: • eea7236@ull.edu • Elshaimaa.ali@hotmail.com