Extracting information from business documents using linguistic and rule-based system
Document Type
Conference Proceeding
Publication Date
1-1-2012
Abstract
Large amounts of business and engineering knowledge is located in financial and project management reports, business case analysis reports, standard operating procedures, employee hand book, and other types of technical documents which are located outside traditional databases and therefore not easily accessible to database query and mining techniques. There is a growing need for information technologies to extract knowledge from these unstructured data sources. In this study, a corpus of technical documents has been compiled. New algorithms have been developed for automatically extracting domain knowledge from the corpus of technical reports. New methods have been developed for text processing, business rule and taxonomic data extraction from corpus of such reports. For process extraction, rhetorical structure analysis has been used and for concept validation, Wordnet based word sense disambiguation has been used.
Publication Source (Journal or Book title)
62nd IIE Annual Conference and Expo 2012
First Page
2626
Last Page
2631
Recommended Citation
Halder, A., Javadpour, L., Khazaeli, M., & Knapp, G. (2012). Extracting information from business documents using linguistic and rule-based system. 62nd IIE Annual Conference and Expo 2012, 2626-2631. Retrieved from https://repository.lsu.edu/mechanical_engineering_pubs/1485