A Lexical Resource-Constrained Topic Model for Word Relatedness

2019 
Word relatedness computation is an important supporting technology for many tasks in natural language processing. Traditionally, there have been two distinct strategies for word relatedness measurement: one utilizes corpus-based models, whereas the other leverages external lexical resources. However, each solution has its own strengths and weaknesses. In this paper, we propose a lexical resource-constrained topic model that integrates the two complementary strategies effectively. Our model is an extension of probabilistic latent semantic analysis (PLSA), which automatically learns word-level distributed representations for relatedness measurement. Furthermore, we introduce the generalized expectation maximization (GEM) algorithm for statistical estimation. The proposed model not only inherits the advantage of conventional topic models in dimensionality reduction, but also refines parameter estimation by using word pairs that are known to be related. Experimental results in different languages demonstrate the effectiveness of our model in topic extraction and word relatedness measurement.
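The abstract does not give the model's equations, so the sketch below shows only the standard PLSA EM updates, i.e. the base model the paper extends; the function name, the toy data, and all parameter choices are illustrative assumptions, and the paper's lexical-resource constraints and GEM refinements are not reproduced here.

```python
import numpy as np

def plsa(counts, n_topics, n_iter=50, seed=0):
    """Fit plain PLSA by EM on a document-word count matrix.

    counts : (n_docs, n_words) array of word counts n(d, w).
    Returns p(w|z) of shape (n_topics, n_words) and
    p(z|d) of shape (n_docs, n_topics).
    """
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    # Random initialization of the two parameter matrices.
    p_w_z = rng.random((n_topics, n_words))
    p_w_z /= p_w_z.sum(axis=1, keepdims=True)
    p_z_d = rng.random((n_docs, n_topics))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)

    for _ in range(n_iter):
        # E-step: posterior p(z | d, w) proportional to p(z|d) * p(w|z).
        joint = p_z_d[:, :, None] * p_w_z[None, :, :]   # (d, z, w)
        joint /= joint.sum(axis=1, keepdims=True) + 1e-12
        # M-step: reweight posteriors by observed counts, renormalize.
        weighted = counts[:, None, :] * joint           # n(d,w) p(z|d,w)
        p_w_z = weighted.sum(axis=0)
        p_w_z /= p_w_z.sum(axis=1, keepdims=True) + 1e-12
        p_z_d = weighted.sum(axis=2)
        p_z_d /= p_z_d.sum(axis=1, keepdims=True) + 1e-12
    return p_w_z, p_z_d

# Toy corpus: 3 documents over a 4-word vocabulary.
counts = np.array([[3, 2, 0, 0],
                   [0, 0, 4, 1],
                   [2, 1, 1, 0]], dtype=float)
p_w_z, p_z_d = plsa(counts, n_topics=2)
```

In the paper's constrained variant, the M-step would additionally be steered (via GEM) toward assigning known-related word pairs to similar topic distributions; that extra term is what distinguishes the proposed model from the vanilla updates above.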