Learning non-taxonomical semantic relations from domain texts
Document Type
Article
Publication Date
2-1-2012
Abstract
Ontology of a domain mainly consists of concepts, taxonomical (hierarchical) relations and non-taxonomical relations. Automatic ontology construction requires methods for extracting both taxonomical and non-taxonomical relations. Compared to extensive works on concept extraction and taxonomical relation learning, little attention has been given on identification and labeling of non-taxonomical relations in text mining. In this paper, we propose an unsupervised technique for extracting non-taxonomical relations from domain texts. We propose the VFICF metric for measuring the importance of a verb as a representative relation label, in much the same spirit as the TFIDF measure in information retrieval. Domain-relevant concepts (nouns) are extracted using techniques developed earlier. Candidate non-taxonomical relations are generated as (SVO) triples of the form (subject, verb, object) from domain texts. A statistical method with log-likelihood ratios is used to estimate the significance of relationships between concepts and to select suitable relation labels. Texts from two domains, the Electronic Voting (EV) domain texts and the Tenders and Mergers (TNM) domain texts are used to compare our method with one of the existing approaches. Experiments showed that our method achieved better performance in both domains. © 2011 Springer Science+Business Media, LLC.
Publication Source (Journal or Book title)
Journal of Intelligent Information Systems
First Page
191
Last Page
207
Recommended Citation
Punuru, J., & Chen, J. (2012). Learning non-taxonomical semantic relations from domain texts. Journal of Intelligent Information Systems, 38 (1), 191-207. https://doi.org/10.1007/s10844-011-0149-4