Grant DOE W-7405-ENG-48 (HPC-Colony)

Project Title:

Principal Investigator(s):

Project Period:

Documentation:

Project Summary:

We will research and develop system software that enables general purpose operating and runtime systems for tens of thousands of processors. To make an operating system with the desired performance and functionality scale to such levels, new technology is required for each of the following four areas: memory management, fault management, parallel resource management and global system management. Our focus will include a consolidated approach for all of these interrelated issues in a large parallel context. Woven together into a more capable and efficient operating and runtime system, these technologies will be demonstrated on multiple platforms including IBM's BlueGene class machines.

Publications:

Related Links: