An intelligent extraction and analysis approach of professional technical words for hydraulic engineering by improved Word2vec technology with Attention mechanism
Document Type
Article
Publication Date
7-1-2020
Abstract
The traditional text information processing and analysis of hydraulic engineering mainly rely on manual interaction, which exists some problems such as complicated processes, low efficiency, error-prone and so on. In this study, an intelligent and high-efficiency method of professional technical word recognition extraction and analysis is proposed for hydraulic engineering based on the Natural Language Processing (NLP) technology, integrating the Word2vec technique with the attention mechanism. The word vector computing model by the improved Word2vec technology is established. The word vector is used to calculate the similarity between words. The similarity between words serves as a combination standard to extract professional technical words of hydraulic engineering. An intelligent recognition and analysis framework for professional technical words of hydraulic project management is established by professional texts to verify the credibility and realize the automatic extraction accuracy of professional technical words. This approach is applied to analyze the weekly supervision report text of a practical concrete dam construction for 229 weeks. There are 9034 extracted professional technical words after three iterations, and the accuracy is 87.58%. It effectively improves the efficiency, accuracy and intelligence level of text information extraction and analysis of hydraulic engineering.
Publication Source (Journal or Book title)
Shuili Xuebao/Journal of Hydraulic Engineering
First Page
816
Last Page
826
Recommended Citation
Li, M., Tian, D., Shen, Y., Shi, J., & Han, S. (2020). An intelligent extraction and analysis approach of professional technical words for hydraulic engineering by improved Word2vec technology with Attention mechanism. Shuili Xuebao/Journal of Hydraulic Engineering, 51 (7), 816-826. https://doi.org/10.13243/j.cnki.slxb.20190920