Header menu link for other important links
X
Co-clustering triples from open information extraction
, Vinh Ho Thinh, Gerhard Weikum
Published in Association for Computing Machinery
2020
Pages: 190 - 194
Abstract
Similar facts are often expressed in different ways in natural language text, which introduces the redundancy and ambiguity of Subject-Predicate-Object (SPO) triples in Open Information Extraction (Open IE). This work focuses on canonicalizing such SPO triples. We propose a clustering framework using non-negative matrix tri-factorization that jointly clusters predicate phrases and subject-object pairs and aligns them in a meaningful manner. The evaluation shows that our co-clustering method outperforms significantly over rule mining and Knowledge-Base-embedding approaches for two existing datasets.
About the journal
JournalData powered by TypesetPervasiveHealth: Pervasive Computing Technologies for Healthcare
PublisherData powered by TypesetAssociation for Computing Machinery
ISSN21531633
Open AccessNo