Computational Grids: Understanding the Concept, Applications, and Architecture - Prof. Ala, Study notes of Computer Science

A set of class notes from a university course, cmsc 818s, on computational grids taught by alan sussman. The notes cover the basics of grids, their applications, and the architecture of grid systems. Grids are defined as dependable, consistent, pervasive, and inexpensive computing resources that enable coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations. Applications of grids include distributed supercomputing, high-throughput computing, on-demand computing, data-intensive computing, and collaborative computing. The document also discusses the analogy of grids to the electric power grid and the challenges of using grids. Useful for university students taking a course on distributed computing or grid computing.

Typology: Study notes

Pre 2010

Uploaded on 02/13/2009

koofers-user-85e
koofers-user-85e 🇺🇸

5

(1)

10 documents

1 / 4

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
September 5, 2002
CMCS 818S - Alan Sussman 1
CMSC 818S
Computational Grids
Alan Sussman
September 5, 2002
CMSC 818S - Alan Sussman 2
Administrivia
Class introductions
•Project
MPI project coming soon
it will count, but only for a small part of your grade
sign up for a Linux cluster account if you don’t have
one already (redleader.umiacs.umd.edu)
Next class will be led by Henrique Andrade, on
Globus toolkit
chapter/paper available by tomorrow – see Readings
web page
We’ll talk about choosing topics to present next
Thursday
CMSC 818S - Alan Sussman 3
What is a Grid?
The hardware and software infrastructure that
provides computing resources that are:
dependable – performance guarantees
consistent – standard services/interfaces
pervasive – available everywhere supported
inexpensive – economic argument
“Coordinated resource sharing and problem
solving in dynamic, multi-institutional virtual
organizations” – from The Anatomy of the Grid
(Foster and Tuecke, Argonne/UI-Chicago)
CMSC 818S - Alan Sussman 4
Why Grids?
To provide more computing power
from new technology – take advantage of new
hardware, wherever it is located
to share available resources on demand (like time-
sharing)
to increase utilization of underused resources (cycle
stealing)
to share computational results – collaboratories
take advantage of new tools and techniques
e.g., network enabled solvers (Netsolve @UTK, Ninf in Japan)
teleimmersion – collaborative data exploration/ analysis
(Discover @ Rutgers)
CMSC 818S - Alan Sussman 5
Analogy to the Electric Power Grid
Salient characteristics
heterogeneity
generators/outlets vs. machines/networks
consumers have different requirements
power consumption/service guarantees/$$$ vs.
computational requirements/QOS/$$$
want to take advantage of economies of scale
politics
local control, but interfaced to uncontrolled
environment – need standards
CMSC 818S - Alan Sussman 6
Applications
Distributed supercomputing
to maximize available resources for large
problems
within an organization, or across organizations
examples include distributed interactive
simulation, simulating complex physical
processes
challenges include co-scheduling, scalability of
services to large numbers of nodes, tolerating
latency, and obtaining high performance across
heterogeneous systems
pf3
pf4

Partial preview of the text

Download Computational Grids: Understanding the Concept, Applications, and Architecture - Prof. Ala and more Study notes Computer Science in PDF only on Docsity!

CMSC 818S

Computational Grids

Alan Sussman

September 5, 2002

CMSC 818S - Alan Sussman 2

Administrivia

  • Class introductions
  • Project
    • MPI project coming soon
      • it will count, but only for a small part of your grade
    • sign up for a Linux cluster account if you don’t have one already ( redleader.umiacs.umd.edu )
  • Next class will be led by Henrique Andrade, on Globus toolkit - chapter/paper available by tomorrow – see Readings web page
  • We’ll talk about choosing topics to present next Thursday

CMSC 818S - Alan Sussman 3

What is a Grid?

  • The hardware and software infrastructure that provides computing resources that are: - dependable – performance guarantees - consistent – standard services/interfaces - pervasive – available everywhere supported - inexpensive – economic argument
  • “Coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations” – from The Anatomy of the Grid (Foster and Tuecke, Argonne/UI-Chicago)

CMSC 818S - Alan Sussman 4

Why Grids?

  • To provide more computing power
    • from new technology – take advantage of new hardware, wherever it is located
    • to share available resources on demand (like time- sharing)
    • to increase utilization of underused resources (cycle stealing)
    • to share computational results – collaboratories
    • take advantage of new tools and techniques
      • e.g., network enabled solvers (Netsolve @UTK, Ninf in Japan)
      • teleimmersion – collaborative data exploration/analysis (Discover @ Rutgers)

Analogy to the Electric Power Grid

  • Salient characteristics
    • heterogeneity
      • generators/outlets vs. machines/networks
    • consumers have different requirements
      • power consumption/service guarantees/$$$ vs. computational requirements/QOS/$$$
    • want to take advantage of economies of scale
    • politics
      • local control, but interfaced to uncontrolled environment – need standards

Applications

  • Distributed supercomputing
    • to maximize available resources for large problems - within an organization, or across organizations
    • examples include distributed interactive simulation, simulating complex physical processes
    • challenges include co-scheduling, scalability of services to large numbers of nodes, tolerating latency, and obtaining high performance across heterogeneous systems

CMSC 818S - Alan Sussman 7

Applications (cont.)

  • High-throughput computing
    • to schedule large numbers of loosely coupled or independent tasks onto available resources
    • examples include Condor (Wisconsin), LSF (Platform Computing)

CMSC 818S - Alan Sussman 8

Applications (cont.)

  • On-demand computing
    • to provide access to non-local resources – computation, software, data, sensors, etc.
    • often driven by cost-performance rather than absolute performance
    • examples
      • software/computation - NEOS (Argonne), Netsolve (UTK), Ninf (Japan)
      • sensors – telemicroscopy
      • data – on-demand meteorological satellite data processing

CMSC 818S - Alan Sussman 9

Applications (cont.)

  • Data intensive computing
    • Data analysis applications
    • Examples include:
      • GriPhyN – Grid Physics Network (griphyn.org)
      • Sloan Digital Sky Survey
      • Weather forecasting – using remote sensing data from satellites
      • Petroleum reservoir simulation data analysis
    • Challenges include scheduling/configuring storage and network resources

CMSC 818S - Alan Sussman 10

Applications (cont.)

  • Collaborative computing
    • to enhance human-human interaction
    • to provide a virtual shared space
    • to share resources such as data archives or ongoing simulation results
    • examples include BoilerMaker (Argonne), CAVE5D (virtual reality hardware/software), NICE (Illinois)
    • challenges include realtime requirements for human interaction, and UI issues

Grid Community Examples

  • Government – e.g., NSF Supercomputing Centers, DOE labs, etc. – national grid
  • HMO – hospitals in a metro area – private grid
  • Materials science collaboratory – university researchers at many sites – virtual grid
  • Computational market economy – consumer/producer market – public grid - companies are doing this – e.g., Entropia, Parabon - free efforts too – e.g., SETI@home, Mersennes primes

Using Grids

  • Enabling Grid programming
    • to allow apps to adapt to changes in resource availability, and deal with resource heterogeneity in general
    • need standards for apps, programming models, tools, services – the Global Grid Forum (GGF, gridform.org)
    • basic Grid services talked about last

CMSC 818S - Alan Sussman 19

Intranets

  • Grid belonging to a single organization
    • so has centralized admin control
    • but end systems and networks heterogeneous, individual systems separately administered, and no accurate global knowledge of system structure or current state
  • End up with simpler, less tightly integrated set of services than for a cluster - data sharing (e.g., distributed FSs, DBs, Web services) - examples include DCE, DCOM, CORBA - interactions via RPC/RMI suing standard protocols (layered on TCP/IP) - hard to get good performance CMSC 818S - Alan Sussman 20

Intranets (cont.)

  • Additional services over clusters include
    • resource discovery – what is available out there?
    • more security concerns due to reduced trust – e.g., use Kerberos
  • Main issue is getting high performance
    • much research has been done here

CMSC 818S - Alan Sussman 21

Internets

  • Span multiple organizations
    • large and heterogeneous like intranets
    • with no centralized control, geographic distribution of resources (network issues), and international issues (e.g., export controls)
  • New approaches required for many basic services
    • security – cross domain authentication via public key cryptography, as in Globus GSI
    • coscheduling across multiple scheduling policies
  • Hot topic now – the TeraGrid, an NSF funded project across several sites (NPACI, NCSA, Caltech, Argonne)
  • Software projects include Globus, Legion