Offloading Online MapReduce tasks with Stateful Programmable Data Planes

2020 
In-network computation based on programmable data plane hardware provides a tremendous opportunity to improve throughput, latency and reduce congestion in data center scenarios. However, a judicious use of these network devices must be done based on their limited resources and the specific features of the application to be offloaded. This paper promotes FlowBlaze, a stateful hardware programmable data plane, as a candidate for offloading online MapReduce tasks. Above all, tasks with strict time requirements can benefit from in-network computing since it can significantly lower their latency. Given that MapReduce is a generic programming paradigm, in this paper we first try to identify which subset of MapReduce operations can be transparently offloaded to a specific hardware architecture and which are the limitations of this offloading in terms of memory and computational resources. After, we show how the FlowBlaze architecture can match the partition/aggregation paradigm and we discuss a set of primitives exposed by the FlowBlaze abstraction to perform mapping and aggregation. Finally, we prove the feasibility of this approach applying it to a click-stream analysis use case.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    22
    References
    4
    Citations
    NaN
    KQI
    []