Advanced

Entity extraction: From unstructured text to DBpedia RDF triples

Exner, Peter LU and Nugues, Pierre LU (2012) The Web of Linked Entities Workshop (WoLE 2012) In Proceedings of the Web of Linked Entities Workshop in conjuction with the 11th International Semantic Web Conference (ISWC 2012) p.58-69
Abstract
In this paper, we describe an end-to-end system that automatically extracts RDF triples describing entity relations and properties from unstructured text. This system is based on a pipeline of text processing modules that includes a semantic parser and a coreference solver. By using coreference chains, we group entity actions and properties described in different sentences and convert them

into entity triples. We applied our system to over 114,000 Wikipedia articles and we could extract more than 1,000,000 triples. Using an ontology-mapping system that we bootstrapped using existing DBpedia triples, we mapped 189,000 extracted triples onto the DBpedia namespace. These extracted entities are availableonline in the N-Triple format.... (More)
In this paper, we describe an end-to-end system that automatically extracts RDF triples describing entity relations and properties from unstructured text. This system is based on a pipeline of text processing modules that includes a semantic parser and a coreference solver. By using coreference chains, we group entity actions and properties described in different sentences and convert them

into entity triples. We applied our system to over 114,000 Wikipedia articles and we could extract more than 1,000,000 triples. Using an ontology-mapping system that we bootstrapped using existing DBpedia triples, we mapped 189,000 extracted triples onto the DBpedia namespace. These extracted entities are availableonline in the N-Triple format. 1



1 http://semantica.cs.lth.se/ (Less)
Please use this url to cite or link to this publication:
author
organization
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
in
Proceedings of the Web of Linked Entities Workshop in conjuction with the 11th International Semantic Web Conference (ISWC 2012)
pages
58 - 69
publisher
CEUR
conference name
The Web of Linked Entities Workshop (WoLE 2012)
external identifiers
  • scopus:84889575071
ISSN
1613-0073
language
English
LU publication?
yes
id
636681f3-82d6-46f2-bd70-76114b130c1a (old id 3191701)
alternative location
http://ceur-ws.org/Vol-906/paper7.pdf
date added to LUP
2012-11-22 10:38:42
date last changed
2017-09-10 03:53:26
@inproceedings{636681f3-82d6-46f2-bd70-76114b130c1a,
  abstract     = {In this paper, we describe an end-to-end system that automatically extracts RDF triples describing entity relations and properties from unstructured text. This system is based on a pipeline of text processing modules that includes a semantic parser and a coreference solver. By using coreference chains, we group entity actions and properties described in different sentences and convert them<br/><br>
into entity triples. We applied our system to over 114,000 Wikipedia articles and we could extract more than 1,000,000 triples. Using an ontology-mapping system that we bootstrapped using existing DBpedia triples, we mapped 189,000 extracted triples onto the DBpedia namespace. These extracted entities are availableonline in the N-Triple format. 1<br/><br>
<br/><br>
1 http://semantica.cs.lth.se/},
  author       = {Exner, Peter and Nugues, Pierre},
  booktitle    = {Proceedings of the Web of Linked Entities Workshop in conjuction with the 11th International Semantic Web Conference (ISWC 2012)},
  issn         = {1613-0073},
  language     = {eng},
  pages        = {58--69},
  publisher    = {CEUR},
  title        = {Entity extraction: From unstructured text to DBpedia RDF triples},
  year         = {2012},
}