

Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The requirements for homework 3 in the fall 2005 cs 441/632/732 parallel computing course. Students are tasked with implementing matrix-matrix multiplication using cannon's algorithm and fox's algorithm with 2-d data distribution. They must analyze the performance of both algorithms and plot speedup plots for different process grid layouts and matrix sizes. Students are encouraged to allocate and initialize only the required parts of the matrices in each process and use blas routine dgemm for local matrix-matrix multiplication.
Typology: Assignments
1 / 2
This page cannot be seen from the preview
Don't miss anything!


Fall 2005 CS 441/632/732 Parallel Computing Homework-
Individual work only. 150 points. Due Oct 25, 2005.
Process (^) 1x1 = 1 2x2 = 4 4x4 = 16 1x1 = 1 2x2 = 4 4x4 = 16
Cannon Fox
Process
1x1 = 1 1000x
2x2 = 4 2000x
3x3 = 9 3000x
4x4 = 16 4000x Cannon
Fox
General Comments:
You must implement and test these programs on the CIS cluster (Everest) and use MPI for communication. Instructions for using the CIS cluster and submitting jobs to SGE can be found at: http://www.cis.uab.edu/ccl/resources/everest/EverestGridNodeUserGuide.php. While submitting to the queue you must request # of processors = # of processes, for example, for the process grid layout 3x3, total # of processors requested = 9.
Fall 2005 CS 441/632/732 Parallel Computing Homework-
Submission: Email the source code along with any Makefile and scripts as a single tar file attachment to [email protected] with the subject “CS 441/632/732 Homework-3.” Turn-in a printed report in class using the format provided at http://www.cis.uab.edu/cs441/report.html. After submission, do not make any changes to your source code on Everest, you will be asked to demonstrate your program on Everest and the timestamp of the files will be used to determine late submissions.
Grading: Correct implementation and testing of the programs (including collecting timing information for the tables above)
115 points
Performance Analysis 25 points Lab report format/presentation 10 points