Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

REFRACTIVE: An Open Source Tool to Extract Knowledge from Syntactic and Semantic Relations

Exner, Peter LU and Nugues, Pierre LU orcid (2014) LREC, The 9th edition of the Language Resources and Evaluation Conference p.2584-2589
Abstract
The extraction of semantic propositions has proven instrumental in applications like IBM Watson (Ferrucci, 2012) and in Google’s knowledge graph (Singhal, 2012). One of the core components of IBM Watson is the PRISMATIC knowledge base consisting of one billion propositions extracted from the English version of Wikipedia and the New York Times (Fan et al., 2010). However, extracting the propositions from the English version of Wikipedia is a time-consuming process. In practice, this task requires multiple machines and a computation distribution involving a good deal of system technicalities. In this paper, we describe REFRACTIVE, an open-source tool to extract propositions from a parsed corpus based on the Hadoop variant of MapReduce. While... (More)
The extraction of semantic propositions has proven instrumental in applications like IBM Watson (Ferrucci, 2012) and in Google’s knowledge graph (Singhal, 2012). One of the core components of IBM Watson is the PRISMATIC knowledge base consisting of one billion propositions extracted from the English version of Wikipedia and the New York Times (Fan et al., 2010). However, extracting the propositions from the English version of Wikipedia is a time-consuming process. In practice, this task requires multiple machines and a computation distribution involving a good deal of system technicalities. In this paper, we describe REFRACTIVE, an open-source tool to extract propositions from a parsed corpus based on the Hadoop variant of MapReduce. While the complete process consists of a parsing part and an extraction part, we focus here on the extraction from the parsed corpus and we hope this tool will help computational linguists speed up the development of applications. (Less)
Please use this url to cite or link to this publication:
author
and
organization
publishing date
type
Chapter in Book/Report/Conference proceeding
publication status
published
subject
host publication
Proceedings of LREC 2014, the 9th edition of the Language Resource and Evaluation Conference
pages
2584 - 2589
publisher
European Language Resources Association
conference name
LREC, The 9th edition of the Language Resources and Evaluation Conference
conference dates
2014-05-28 - 2014-05-30
external identifiers
  • wos:000355611004033
  • scopus:85037130345
ISBN
978-2-9517408-8-4
language
English
LU publication?
yes
id
ee799a1c-0ec0-4ba8-9080-463f9f32ed84 (old id 4450627)
alternative location
http://www.lrec-conf.org/proceedings/lrec2014/pdf/12_Paper.pdf
date added to LUP
2016-04-04 10:03:50
date last changed
2022-03-23 07:29:44
@inproceedings{ee799a1c-0ec0-4ba8-9080-463f9f32ed84,
  abstract     = {{The extraction of semantic propositions has proven instrumental in applications like IBM Watson (Ferrucci, 2012) and in Google’s knowledge graph (Singhal, 2012). One of the core components of IBM Watson is the PRISMATIC knowledge base consisting of one billion propositions extracted from the English version of Wikipedia and the New York Times (Fan et al., 2010). However, extracting the propositions from the English version of Wikipedia is a time-consuming process. In practice, this task requires multiple machines and a computation distribution involving a good deal of system technicalities. In this paper, we describe REFRACTIVE, an open-source tool to extract propositions from a parsed corpus based on the Hadoop variant of MapReduce. While the complete process consists of a parsing part and an extraction part, we focus here on the extraction from the parsed corpus and we hope this tool will help computational linguists speed up the development of applications.}},
  author       = {{Exner, Peter and Nugues, Pierre}},
  booktitle    = {{Proceedings of LREC 2014, the 9th edition of the Language Resource and Evaluation Conference}},
  isbn         = {{978-2-9517408-8-4}},
  language     = {{eng}},
  pages        = {{2584--2589}},
  publisher    = {{European Language Resources Association}},
  title        = {{REFRACTIVE: An Open Source Tool to Extract Knowledge from Syntactic and Semantic Relations}},
  url          = {{https://lup.lub.lu.se/search/files/5451686/4450628.pdf}},
  year         = {{2014}},
}