...
首页> 外文期刊>Parallel and Distributed Systems, IEEE Transactions on >Exploiting Efficient and Scalable Shuffle Transfers in Future Data Center Networks
【24h】

Exploiting Efficient and Scalable Shuffle Transfers in Future Data Center Networks

机译:在未来的数据中心网络中利用高效且可扩展的随机传输

获取原文
获取原文并翻译 | 示例
           

摘要

Distributed computing systems like MapReduce in data centers transfer massive amount of data across successive processing stages. Such shuffle transfers contribute most of the network traffic and make the network bandwidth become a bottleneck. In many commonly used workloads, data flows in such a transfer are highly correlated and aggregated at the receiver side. To lower down the network traffic and efficiently use the available network bandwidth, we propose to push the aggregation computation into the network and parallelize the shuffle and reduce phases. In this paper, we first examine the gain and feasibility of the in-network aggregation with BCube, a novel server-centric networking structure for future data centers. To exploit such a gain, we model the in-network aggregation problem that is NP-hard in BCube. We propose two approximate methods for building the efficient IRS-based incast aggregation tree and SRS-based shuffle aggregation subgraph, solely based on the labels of their members and the data center topology. We further design scalable forwarding schemes based on Bloom filters to implement in-network aggregation over massive concurrent shuffle transfers. Based on a prototype and large-scale simulations, we demonstrate that our approaches can significantly decrease the amount of network traffic and save the data center resources. Our approaches for BCube can be adapted to other server-centric network structures for future data centers after minimal modifications.
机译:数据中心中的分布式计算系统(如MapReduce)在连续的处理阶段中传输大量数据。这种随机传输贡献了大多数网络流量,并使网络带宽成为瓶颈。在许多常用的工作负载中,此类传输中的数据流在接收器端高度相关并聚合。为了降低网络流量并有效利用可用的网络带宽,我们建议将聚合计算推入网络并并行化混洗和减少阶段。在本文中,我们首先研究使用BCube(一种用于未来数据中心的新颖的以服务器为中心的网络结构)进行网络内聚合的收益和可行性。为了利用这种收益,我们对BCube中的NP困难的网络内聚合问题进行建模。我们仅基于成员的标签和数据中心拓扑结构,提出了两种近似的方法来构建有效的基于IRS的播种聚合树和基于SRS的混洗聚合子图。我们进一步设计基于Bloom过滤器的可伸缩转发方案,以在大量并发洗牌传输上实现网络内聚合。基于原型和大规模仿真,我们证明了我们的方法可以显着减少网络流量,并节省数据中心资源。我们对BCube的方法经过最小的修改,便可以适应其他以服务器为中心的网络结构,以用于将来的数据中心。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号