Distributed NoSQL Data Stores: Performance Analysis and a Case Study

2018 
NoSQL data-stores are commonly used to provide flexibility and availability for big data handling. However, there is a lack of comprehensive studies about which NoSQL data-store performs the best from the two scalability aspects, (scale-up, and scale-out), in a distributed and parallel processing environment. This paper compares the popular NoSQL data-stores (Cassandra, HBase, and MongoDB) and analyzes the resulting performance. Our experiments measure throughput, latency, and run-time of the evaluated data-stores on a big data set that consist of standard benchmarking workloads. Our results provide that the performance of each NoSQL data-store varies according to two main factors, (a) the type of executed operation, (read, scan, update, write, and insert), and (b) the level of distribution.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    19
    References
    5
    Citations
    NaN
    KQI
    []