An Abnormal Data Analysis and Processing Method for Genealogy Graph Databases

2020 
Large amounts of data are generated continually in the real world. The objects in the data and the relationships between them have become increasingly complex. Graph is a powerful tool for representing these data and the complex relationships between them. To effectively describe the entities and connections in these data, the concept of a knowledge graph was proposed, and knowledge graph has become one of the bases for storing graph data, and the importance of it is self-evident. Genealogy data is a kind of graph data where nodes are used to represent the person in the genealogy data, and edges are used to represent the relationship between the person. Moreover, in a genealogy graph database, the "critical" relationship that is unique for an entity is defined. In contrast, the abnormal data derive from the existence of multiple "critical" relationships between entities. In a genealogy graph, abnormal data will cause the wrong relationships and redundant entities. To avoid such abnormal data in genealogy graph database, the abnormal data are categorized into four different types, and the corresponding processing methods are proposed for each type of abnormal data, respectively. The experiment results demonstrate that the processing method can effectively solve these abnormal data.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    15
    References
    0
    Citations
    NaN
    KQI
    []