Semantic representation of scientific documents for the e‐science Knowledge Grid

2008 
Semantic representation of a topic is the basic element representation of a scientific document, which can be reflected by keywords' relations and their weights, as well as the state values of co‐occurring keywords in a documental fragment in which a topic is discussed. Concept lattice and probability statistical methods are proposed to generate fuzzy cognitive maps (FCMs) as the original semantic representation of documental fragments (SRDFs) for the e‐science Knowledge Grid. The additive capability of FCMs is used to merge SRDFs for reducing the noise and redundant information hidden in the original SRDFs. The loss of documental information in the merging process of SRDFs is measured by the proposed semantic merging entropy. Based on the distributional property of keywords on SRDFs, semantic distribution entropy is introduced to evaluate the change of keywords' distribution on SRDFs with the merging process. The semantic representation of a scientific document is automatically generated by the interconnection of the merged SRDFs belonging to one domain. The generating, merging and evaluating methods of the semantic representation of a scientific document are validated by experiments. The proposed semantic representation highlights the relations among keywords and topics, which reduces the complexity of document analysis in an e‐science Knowledge Grid effectively. Copyright © 2007 John Wiley & Sons, Ltd.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []