基于领域模式的Web数据抽取与集成系统研究与实现 The Research and Implementation of Extraction and Integration of Web Data Based on Domain Pattern

2016 
提供面向领域的信息增值服务是Web数据挖掘的目标之一,面向领域的Web数据抽取与集成是提供领域信息增值服务的基础,也是Web数据挖掘领域的一个主要研究方向,结合领域需求,本文提出一种面向领域的Web数据抽取与集成架构,在给出Web数据模型与Web数据模式、领域数据模型和领域数据模式等相关概念基础上,提出Web数据模式与领域数据模式的映射方法和数据层次上的集成方法,用于解决集成过程中的模式层次和数据层次的冲突问题,并讨论了web数据抽取和领域增值服务的实现方法。结合实际需求开发了房地产信息平台及综合应用系统,验证了模型和算法的有效性。 One of the objectives of the Web data mining is to provide the domain-oriented information value added service. Domain-oriented web data extraction and integration is the basis of providing value added services, and is also a major research direction in the field of web data mining. In com-bination with the requirement of the field, we proposed the domain-oriented web data extraction and integration architecture. Based on the concepts of web data model and web data pattern, do-main data model and domain data pattern, the mapping method of web data pattern and domain data pattern and integration method on data level are proposed to solve the conflict problem of pattern layer and data layer in the integration process. We also discussed the implementation method of web data extraction and domain value added services. Real estate information platform and integrated application system are developed with the actual requirements, and the effective-ness of the model and algorithm is verified.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    8
    References
    0
    Citations
    NaN
    KQI
    []