A Checkpoint and Restart Mechanism for Parallel Programming Systems
Thesis 2000
Publication Type: MS Thesis
Repository URL:
Abstract
This MS Thesis describes an application-independent checkpointing system for Charm++. It uses the pup framework to save in-flight messages, all running objects, and all Charm++/Converse state. The restart can occur on a different number of processors, or a different machine architecture.
TextRef
Sameer Paranjpye "A Checkpoint and Restart Mechanism for Parallel Programming Systems", University of Illinois at Urbana-Champaign, 2000.
People
Research Areas