Abstract
Typical communication networks for parallel processing are based on sending data from one processor to one, or all, of the other processors. On such a network, even simple operations that need information from every processor require many point-to-point or broadcast communications. These aggregate operations can be as simple as a barrier synchronization or as complex as an arithmetic reduction. In this paper, we discuss a class of networks that directly implement a wide range of aggregate operations. These networks can perform an aggregate operation in a single communication operation using only simple bitwise combining logic in a trivially scalable tree configuration.
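As a rough software illustration of the idea, and not the hardware design from the paper, the sketch below combines one word per processor pairwise, level by level, the way a binary tree of bitwise combining nodes might. The names `tree_combine` and `barrier_reached` are hypothetical, and the binary fan-in and the use of a bitwise AND of "arrived" flags to detect a barrier are assumptions made for the example.

```python
from operator import and_, or_


def tree_combine(words, op):
    """Combine one integer word per processor with a bitwise operator.

    Values are merged pairwise, level by level, mimicking a binary tree of
    combining nodes (assumed fan-in of 2). Returns (result, tree_depth).
    """
    level = list(words)
    if not level:
        raise ValueError("at least one processor word is required")
    depth = 0
    while len(level) > 1:
        # Each combining node merges two neighbouring inputs.
        merged = [op(level[i], level[i + 1]) for i in range(0, len(level) - 1, 2)]
        if len(level) % 2:
            # An unpaired input passes up to the next level unchanged.
            merged.append(level[-1])
        level = merged
        depth += 1
    return level[0], depth


def barrier_reached(arrived_flags):
    """Assumed barrier test: true only when every processor has arrived."""
    result, _ = tree_combine([int(bool(f)) for f in arrived_flags], and_)
    return bool(result)


if __name__ == "__main__":
    print(tree_combine([0b1010, 0b0110, 0b0011, 0b1000], or_))
    print(barrier_reached([True, True, False, True]))
```

In this sketch every processor contributes its word once and the combined result appears at the root after a number of levels that grows logarithmically with the processor count, which is the software analogue of completing the aggregate in one communication operation.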
Original language | English |
---|---|
Pages (from-to) | 306-313 |
Number of pages | 8 |
Journal | IEEE Symposium on Parallel and Distributed Processing - Proceedings |
State | Published - 1996 |
Event | Proceedings of the 1996 8th IEEE Symposium on Parallel and Distributed Processing - New Orleans, LA, USA Duration: Oct 23 1996 → Oct 26 1996 |
ASJC Scopus subject areas
- General Engineering