Communication Optimization Framework

The Communication Framework optimizes communication operations in Charm++. The communication cost of a parallel application can greatly affect its scalability. Although communication bandwidth increases have kept pace with increases in processor speed over the past decade, the communication latencies (including the software overhead) for each message have not decreased proportionately.

The framework currently has three major motivations

  • Optimize collective communication operations like AlltoAll personalized communication, AlltoAll multicast, AllReduce etc. Collective communication operations often involve most processors in a system. They are also time consuming and can involve massive data movement. These operations can be optimized by using message combining for small messages and smart message sequencing for large messages. Message combining is achieved by imposing a virtual topology on the processors and routing messages along that topology. Messages destined to a group of processors into one message. The combined message is then sent to a representative processor which forwards the message to the correct destination. For example if the virtual topology is Hypercube, dimensional exchange can be used to combine messages. There will be log(p) stages and in stage i each processor will exchange messages with its ith dimension neighbor. We have also implemented two other virtual topologies 2D Mesh and 3D Grid. For large messages smart message sequencing like prefix send can be used to reduce network contention.
  • Optimize implementations of the Charm++ machine layers to exploit the special features provided by the lower lever API's.
  • Develop a learning framework which will learn the communication patterns of an application and use known strategies to optimize those patterns.

 
People
Papers
  • 03-15    Sameer Kumar and L. V. Kale,  Opportunities and Challenges of Modern Communication Architectures: Case Study with QsNet,  To be presented at the CAC Workshop at IPDPS 2004
  • 03-11    Sameer Kumar and L. V. Kale,  Scaling Collective Multicast on Fat-tree Networks,  Submitted to ICPADS 2004
  • 03-04    L. V. Kale and Sameer Kumar,  Scaling Collective Multicast on High Performance Clusters,  
  • 02-10    L. V. Kale and Sameer Kumar and Krishnan Vardarajan,  A Framework for Collective Personalized Communication,  Accpeted for Presentation at IPDPS 2003.
  • 99-08    Krishnan Varadarajan,  Thesis: Communication Library for Parallel Architectures,  Master's Thesis, Dept. of Computer Science, University of Illinois 2001

This page maintained by Filippo Gioachin. Back to the PPL Research Page