Time-Series Data Analytics Using Spark and Machine Learning

2017 
This work presents a scalable architecture capable to provide real-time analysis over large-scale time-series data. Spark streaming, Spark MLlib and machine learning methods are combined to process and analyse the data streams. A high performance training model is automatically built and applied for the time-series forecasting. In order to validate the proposed architecture, authors developed a prototype system to predict the average energy consumption at real-time (estimated from 6 K Irish home- and business consumers) from 30 to 90 min ahead. The results show the best prediction was done with a convolutional neural network model, where the Mean Absolute Error and Root Mean Square Error were 7.5% and 10.5% correspondingly.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    12
    References
    1
    Citations
    NaN
    KQI
    []