Computer aided engineering of cluster computers

William R. Dieter, Henry G. Dietz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

There are many scientific and engineering applications that require the resources of a dedicated supercomputer: drug design, weather prediction, simulating vehicle crashes, fluid dynamics simulations of aircraft or even consumer products. Cluster supercomputers can leverage commodity parts with standard interfaces that allow them to be used interchangeably to build supercomputers customized for these and other applications. However, the best design for one application is not necessarily the best design for other applications. Supercomputer design is challenging, but this problem is harder due to the huge range of possible configurations, volatile component availability and pricing, and constraints on available power, cooling, and floor space. Cluster Design Rules (CDR) is a computer-aided engineering tool that uses resource constraints and application performance models to identify the few best designs among the trillions of designs that could be constructed using parts from a given database. It uses a branch-and-bound strategy based on cluster design principles that can eliminate many inferior designs from the search without evaluating them. For the millions of designs that remain, CDR measures fitness by one of several user-specified application performance models. New application performance models can be added by means of a programming interface. This paper details the concepts and mechanisms inside CDR and shows how it facilitates model-based engineering of custom clusters.

Original languageEnglish
Title of host publicationISPASS 2008 - IEEE International Symposium on Performance Analysis of Systems and Software
Pages44-53
Number of pages10
DOIs
StatePublished - 2008
EventIEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2008 - Austin, TX, United States
Duration: Apr 20 2008Apr 22 2008

Publication series

NameISPASS 2008 - IEEE International Symposium on Performance Analysis of Systems and Software

Conference

ConferenceIEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2008
Country/TerritoryUnited States
CityAustin, TX
Period4/20/084/22/08

ASJC Scopus subject areas

  • Computer Science Applications
  • Software
  • Theoretical Computer Science

Fingerprint

Dive into the research topics of 'Computer aided engineering of cluster computers'. Together they form a unique fingerprint.

Cite this