The Communication Framework optimizes communication operations in
Charm++. The communication cost of a parallel application can greatly
affect its scalability. Although communication bandwidth increases
have kept pace with increases in processor speed over the past decade,
the communication latencies (including the software overhead) for each
message have not decreased proportionately.
The framework currently has three major motivations
- Optimize collective communication operations like AlltoAll
personalized communication, AlltoAll multicast, AllReduce
etc. Collective communication operations often involve most processors
in a system. They are also time consuming and can involve massive data movement.
These operations can be optimized by using message combining for small
messages and smart message sequencing for large messages.
Message combining is achieved by imposing a virtual topology on the
processors and routing messages along that topology. Messages destined
to a group of processors into one message. The combined message is
then sent to a representative processor which forwards the message to
the correct destination. For example if the virtual topology is
Hypercube, dimensional exchange can be used to combine messages. There
will be log(p) stages and in stage i each processor will exchange
messages with its ith dimension neighbor. We have also implemented two
other virtual topologies 2D Mesh and 3D Grid.
For large messages smart message sequencing like prefix send can be
used to reduce network contention.
- Optimize implementations of the Charm++ machine layers to exploit
the special features provided by the lower lever API's.
- Develop a learning framework which will learn the communication
patterns of an application and use known strategies to optimize those
patterns.
|
- 03-15
Sameer Kumar and L. V. Kale, Opportunities and Challenges of Modern Communication Architectures: Case Study with QsNet, To be presented at the CAC Workshop at IPDPS 2004
- 03-11
Sameer Kumar and L. V. Kale, Scaling Collective Multicast on Fat-tree Networks, Submitted to ICPADS 2004
- 03-04
L. V. Kale and Sameer Kumar, Scaling Collective Multicast on High Performance Clusters,
- 02-10
L. V. Kale and Sameer Kumar and Krishnan Vardarajan, A Framework for Collective Personalized Communication, Accpeted for Presentation at IPDPS 2003.
- 99-08
Krishnan Varadarajan, Thesis: Communication Library for Parallel Architectures, Master's Thesis, Dept. of Computer Science, University of Illinois 2001
|