Parallel Languages/Paradigms: Charm ++ - Parallel Objects
Charm++ is a machine independent parallel programming system. Programs written using this system will run unchanged on MIMD machines with or without a shared memory. It provides high-level mechanisms and strategies to facilitate the task of developing even highly complex parallel applications.

Charm++ programs are written in C++ with a few library calls and an interface description language for publishing Charm++ objects. Charm++ supports multiple inheritance, late bindings, and polymorphism.

Platforms: The system currently runs on IBM's Blue Gene/P, Blue Gene/L, Cray XT3, XT4, XT5, Infiniband clusters such as Ranger, LoneStar and Abe, clusters of UNIX workstations and even single-processor UNIX, Solaris and Windows machines. It also runs on accelerators such as the Cell BE and GPGPUs.

The design of the system is based on the following tenets:
  1. Efficient Portability: Portability is an essential catalyst for the development of reusable parallel software. Charm++ programs run unchanged on MIMD machines with or without a shared memory. The programming model induces better data locality, allowing it to support machine independence without losing efficiency.
  2. Latency Tolerance: Latency of communication - the idea that remote data will take longer to access - is a significant issue common across most MIMD platforms. Message-driven execution, supported in Charm++ , is a very useful mechanism for tolerating or hiding this latency. In message driven execution (which is distinct from just message-passing), a processor is allocated to a process only when a message for the process is received. This means when a process blocks, waiting for a message, another process may execute on the processor. It also means that a single process may block for any number of distinct messages, and will be awakened when any of these messages arrive. Thus, it forms an effective way of scheduling a processor in the presence of potentially large latencies.
  3. Dynamic Load Balancing: Dynamic creation and migration of work is necessary in many applications. Charm++ supports this by providing dynamic (as well as static) load balancing strategies.
  4. Reuse and Modularity: It should be possible to develop parallel software by reusing existing parallel software. Charm++ supports this with a well-developed ``module'' construct and associated mechanisms. These mechanisms allow for compositionality of modules without sacrificing the latency-tolerance. With them, two modules, each spread over hundreds of processors, may exchange data in a distributed fashion.

The Programming Model: Programs consist of potentially medium-grained processes (called chares), a special type of replicated process, and collections of chares. These processes interact with each other via messages. There may be thousands of medium-grained processes on each processor, or just a few, depending on the application. The ``replicated processes'' can also be used for implementing novel information sharing abstractions, distributed data structures, and intermodule interfaces. The system can be considered a concurrent object-oriented system with a clear separation between sequential and parallel objects. As shown in this figure, the objects are mapped by the runtime system to appropriate processors to balance the load.

Reusable Libraries: The modularity-related features make the system very attractive for building library modules that are highly reusable because they can be used in a variety of data-distributions. We have just begun the process of building such libraries, and have a small collection of library modules. However, we expect such libraries, contributed by us and other users, to be one of the most significant aspects of the system.

Regular and Irregular Computations: For regular computations, the system is useful because it provides portability, static load balancing, and latency tolerance via message driven execution, and facilitates construction and flexible reuse of libraries. The system is unique for the extensive support it provides for highly irregular computations. This includes management of many medium-grained processes, support for prioritization, dynamic load balancing strategies, handling of dynamic data-structures such as lists and graphs, etc.
People
Papers/Talks
11-56
2011
[Talk]
Charm++ Tutorial [ICS 2011]
| Phil Miller | Abhishek Gupta
11-48
2011
[Talk]
Charm++ for Productivity and Performance: A Submission to the 2011 HPC Class II Challenge [Supercomputing 2011]
11-41
2011
[Paper]
Using Shared Arrays in Message-Driven Parallel Programs [ParCo 2011]
| Phil Miller | Aaron Becker | Laxmikant Kale
11-27
2011
[Paper]
Optimizations for Message Driven Applications on Multicore Architectures [HiPC 2011]
| Pritish Jetley | Laxmikant Kale
10-13
2010
[Paper]
Optimizing a Parallel Runtime System for Multicore Clusters: A Case Study [TeraGrid 2010]
| Chao Mei | Gengbin Zheng | Filippo Gioachin | Laxmikant Kale
10-06
2009
[Paper]
Charm++ and AMPI: Adaptive Runtime Strategies via Migratable Objects [Advanced Computational Infrastructures for Parallel and Distributed Applications 2009]
| Laxmikant Kale | Gengbin Zheng
08-09
2008
[Paper]
Some Essential Techniques for Developing Efficient Petascale Applications [SciDAC 2008]
| Laxmikant Kale
07-04
2007
[Paper]
Programming Petascale Applications with Charm++ and AMPI [Petascale Computing: Algorithms and Applications 2007]
| Laxmikant Kale | Eric Bohm | Celso Mendes | Terry Wilmarth | Gengbin Zheng
05-06
2005
[PhD Thesis]
Achieving High Performance on Extremely Large Parallel Machines: Performance Prediction and Load Balancing [Thesis 2005]
| Gengbin Zheng
04-16
2004
[Paper]
Performance and Modularity Benefits of Message-Driven Execution [Journal of Parallel and Distributed Computing 2004]
| Attila Gursoy | Laxmikant Kale
03-17
2003
[MS Thesis]
An Efficient Implementation of Charm++ on Virtual Machine Interface [Thesis 2003]
| Greg Koenig
96-11
1996
[Paper]
Charm++: Parallel Programming with Message-Driven Objects [Book Chapter 1996]
| Laxmikant Kale | Sanjeev Krishnan
96-09
1995
[Paper]
Threads for Interoperable Parallel Programming [LCPC 1995]
| Laxmikant Kale | Joshua Yelon | Timothy Knauff
95-03
1995
[Paper]
The Charm Parallel Programming Language and System:Part II - The Runtime System [PPL Technical Report 1995]
| Laxmikant Kale | B. Ramkumar | Amitabh Sinha | Attila Gursoy
95-02
1994
[Paper]
The Charm Parallel Programming Language and System:Part I --- Description of Language Features [PPL Technical Report 1994]
| Laxmikant Kale | B. Ramkumar | Amitabh Sinha | Attila Gursoy
93-02
1993
[Paper]
CHARM++ : A Portable Concurrent Object Oriented System Based On C++ [OOPSLA 1993]
| Laxmikant Kale | Sanjeev Krishnan
90-10
1990
[PhD Thesis]
Machine Independent Parallel Execution of Speculative Computations [Thesis 1990]
| Vikram Saletore
90-08
1990
[Paper]
Chare Kernel - A Runtime Support System for Parallel Computations [Journal of Parallel and Distributed Computing 1990]
| Wennie Shu | Laxmikant Kale
90-03
1990
[Paper]
The Chare Kernel Parallel Programming Language and System [ICPP 1990]
| Laxmikant Kale