Design of Data Mining System Based on Cloud Computing

2020 
In order to improve the performance and speed of data storage, this paper designs a data mining system based on cloud computing. The basic structure, operation mode and programming principle of the cloud computing platform Spark are described in detail. The cloud computing platform Spark realizes the parallel design of decision tree C4.5 algorithm and K-medoids clustering algorithm, which greatly improves the operation speed, convergence speed and result stability of the algorithm. The experimental results show that the data mining system designed in this paper has faster operation speed and better classification efficiency when analyzing and processing massive data.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    0
    Citations
    NaN
    KQI
    []