Abstract
In this work we examine the implications of building a single logical link out of multiple physical links. We use MultiEdge to examine the throughput-CPU utilization tradeoffs and examine how overheads and performance scale with the number and speed of links. We use low- level instrumentation to understand associated overheads, we experiment with setups between 1 and 8 1-GBit/s links, and we contrast our results with a single 10-GBit/s link. We find that: (a) Our base protocol achieves up-to 65% of the nominal aggregate throughput, (b) Replacing the interrupts with polling significantly impacts only the multiple link configurations, reaching 80% of nominal throughput, (c) The impact of copying on CPU overhead is significant, and removing copying results in up-to 66% improvement in maximum throughput, reaching almost 100% of the nominal throughput, (d) Scheduling packets over heterogeneous links requires simple but dynamic scheduling to account for different link speeds and varying load.
Original language | English |
---|---|
Title of host publication | Proceedings of Workshop on Communication Architecture for Clusters, CAC 2008 |
Publisher | IEEE Computer Society Press |
Publication date | 2008 |
ISBN (Print) | 978-1-4244-1693-6 |
DOIs | |
Publication status | Published - 2008 |
Event | 2008 IEEE International Symposium on Parallel and Distributed Processing - Miami, United States Duration: 14 Apr 2008 → 18 Apr 2008 Conference number: 22 https://ieeexplore.ieee.org/xpl/conhome/4519061/proceeding |
Conference
Conference | 2008 IEEE International Symposium on Parallel and Distributed Processing |
---|---|
Number | 22 |
Country/Territory | United States |
City | Miami |
Period | 14/04/2008 → 18/04/2008 |
Internet address |