Parallel Computing on the Berkeley NOW

Abstract:

The UC Berkeley Network of Workstations (NOW) project demonstrates a new approach to large-scale system design enabled by technology advances that provide inexpensive, low-latency, high-bandwidth, scalable interconnection networks. This paper provides an overview of the hardware and software architecture of NOW and reports on the performance obtained at each layer of the system: Active Messages, MPI message passing, and benchmark parallel applications.

1 Introduction

In the early 1990s it was often said that the “Killer Micro” had attacked the supercomputer market, much as it had the minicomputer and mainframe markets earlier. This attack came in the form of massively parallel processors (MPPs), which repackaged the single-chip microprocessor, cache, DRAM, and system chip-set of workstations and PCs in a dense configuration to construct very large parallel computing systems. However, another technological revolution was brewing in these MPP systems: the single-chip switch, which enabled building inexpensive, low-latency, high-bandwidth, scalable interconnection networks. As with other important technologies, this “killer switch” has taken on a role far beyond its initial conception. Emerging from the esoteric confines of MPP backplanes, it has become available in a form that is readily deployed with commodity workstations and PCs. This switch is the basis for system area networks, which have the performance and scalability of MPP interconnects and the flexibility of a local area network, but operate on a somewhat restricted physical scale.

The Berkeley NOW project seeks to demonstrate that it is viable to build large parallel computing systems that are fast, inexpensive, and highly available, by simply snapping these switches together with the latest commodity components. Such cost-effective, incrementally scalable systems provide a basis not only for traditional parallel computing, but also for novel applications, such as Internet services [Brew96].

This paper provides an overview of the Berkeley NOW as a parallel computing system. Section 2 describes the NOW hardware configuration and its layered software architecture. The following sections describe the layers from the bottom up. Section 3 describes the Active Message layer and compares its performance to what has been achieved on MPPs. Section 4 shows the performance achieved through MPI, built on top of Active Messages. Section 5 illustrates the application performance of NOW using the NAS Parallel Benchmarks in MPI. Section 6 provides a more detailed discussion of the world’s leading disk-to-disk sort, which brings out a very important property of this class of system: the ability to concurrently perform I/O to disks on every node.

2 Berkeley NOW System

The hardware configuration of the Berkeley NOW system consists of one hundred and five Sun Ultra 170 workstations, connected by a large Myricom network [Bode95], and packaged into 19-inch racks. Each workstation contains a 167 MHz Ultra1 microprocessor with 512 KB level-2 cache, 128 MB of memory, two 2.3 GB disks, Ethernet, and a Myricom “Lanai” network interface card (NIC) on the SBus. The NIC has a 37.5 MHz embedded processor and three DMA engines, which compete for bandwidth to 256 KB of embedded SRAM. The node architecture is shown in Figure 1.

The network uses multiple stages of Myricom switches, each with eight 160 MB/s bidirectional ports, in a variant of a fat-tree topology.
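To give a sense of scale, the per-node figures quoted above can be multiplied out into cluster-wide totals. The following is only an illustrative sketch: the constants are taken from the text, while the function name, the dictionary keys, and the convention of 1 GB = 1024 MB are assumptions made for this example, not part of the NOW software.

```python
# Per-node figures quoted in the text for the Berkeley NOW configuration.
NODES = 105
MEMORY_MB_PER_NODE = 128
DISKS_PER_NODE = 2
DISK_GB = 2.3
L2_CACHE_KB_PER_NODE = 512


def aggregate_resources(nodes=NODES):
    """Return cluster-wide totals derived by simple multiplication.

    Hypothetical helper for illustration; assumes 1 GB = 1024 MB
    and 1 MB = 1024 KB.
    """
    return {
        "memory_gb": nodes * MEMORY_MB_PER_NODE / 1024,
        "disks": nodes * DISKS_PER_NODE,
        "disk_gb": nodes * DISKS_PER_NODE * DISK_GB,
        "l2_cache_mb": nodes * L2_CACHE_KB_PER_NODE / 1024,
    }


if __name__ == "__main__":
    for name, value in aggregate_resources().items():
        print(f"{name}: {value:.1f}")
```

Under these assumptions the full system holds roughly 13 GB of memory spread over 105 nodes and 210 disks totalling about 483 GB, which is the aggregate that the disk-to-disk sort discussed later can draw on.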

2.1 Packaging

We encountered a number of interesting engineering issues in assembling a cluster of this size that are not so apparent in smaller clusters, such as our earlier 32-node prototype. This rack-and-stack style of packaging is extremely scalable, both in the number of nodes and the ability to upgrade nodes over time. However, structured cable management is critical. In tightly packaged systems the interconnect is hidden in the center of the


David E. Culler, Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau, Brent Chun,

Steven Lumetta, Alan Mainwaring, Richard Martin, Chad Yoshikawa, Frederick Wong

Computer Science Division

University of California, Berkeley

