[Scalapack] Getting CPU performance for pdgemm on a cluster
From: julie langou
Date: Thu, 8 May 2008 14:01:03 -0600
Here is one of my pdgemm timer. (you have 2 version on with BLACS C
interface, one with Fortran interface).
As you may have noticed the testing and timer are very complicated
code, I usually do not play with those.
Do not forget to use an optimized BLAS for your performance check.
Fell free to modify to suit your needs.
On May 8, 2008, at 11:33 AM, Michael Bader wrote:
Dear ScaLAPACK group,
I would like to compare the performance of parallel matrix
(pdgemm) in ScaLAPACK's PBLAS library with other approaches on our
Opteron-Infiniband cluster. Is there any example program that could
me decent results on that issue?
For a first try, I've checked the PDBLAS3TIM program in the TIMINGS
subdirectory of PBLAS, but got stuck with the syntax of the input file
(PDBLAS3TIM.dat) - is there a more detailed description of the
than that given in the source file pdblas3tim.f ?
I tried to simply change the values of M,N,K (the matrix sizes), but
leads immediately to incompatible parameter errors.
Is the pdblas3tim code suitable for a solid performance check for
multiplication? I'd like to compare the speed for a single matrix
multiplication (matrix sizes ~15000*15000) of up to 128 processes
nodes) with the implementaion in the Global Arrays package, and with
my own code.
Any help would be appreciated :-)
With best regards
Dr. Michael Bader Email: bader@Domain.Removed
Scientific Computing in Computer Science Tel.: 089/289-18634
Technische Universit?t M?nchen WWW: http://www5.in.tum.de/~bader
Scalapack mailing list