problem testing blacs on mac os x 10.6.5

Post here if you have some questions about the ScaLAPACK installer

problem testing blacs on mac os x 10.6.5

Postby apologia » Tue Jan 25, 2011 2:04 pm

I'm having some problems with installing and testing the blacs library on my 8-core mac pro running 10.6.5
(as part of installing scalapack). I've used the python installer to do the installation, i.e.

sudo python setup.py --prefix=/usr/scalapack --mpibindir=/usr/local/bin/ --mpiincdir=/usr/local/include/ --f90=/sw64/bin/gfortran --downblas --downblacs --downlapack

I've used the installer to install the necessary blacs, blas and lapack libraries (though to get past our firewall, I've downloaded these packages
manually -- perhaps I've missed something in doing so as I see the blacs library has some sort of patch to it at netlib.org).
I'm doing a 64bit installation with openmpi 1.4.3 (which I've put in /usr/local/bin to keep it separate from the openmpi
that comes installed with the mac).

When I go and test the blacs library, I get the following results. For xFbtest_MPI--0, the tests run fine until

RUNNING REPEATABLE SUM TEST

which then hangs and must be killed. For xCbtest_MPI--0, I get the following (truncated) error

sudo /usr/local/bin/mpiexec -np 4 xCbtest_MPI--0
Password:
BLACS WARNING 'No need to set message ID range due to MPI communicator.'
from {-1,-1}, pnum=0, Contxt=-1, on line 18 of file 'blacs_set_.c'.

[dh28:10794] *** Process received signal ***
[dh28:10794] Signal: Segmentation fault (11)
[dh28:10794] Signal code: Address not mapped (1)
[dh28:10794] Failing at address: 0x3d7810
BLACS WARNING 'No need to set message ID range due to MPI communicator.'
from {-1,-1}, pnum=1, Contxt=-1, on line 18 of file 'blacs_set_.c'.

[dh28:10795] *** Process received signal ***
[dh28:10795] Signal: Segmentation fault (11)
[dh28:10795] Signal code: Address not mapped (1)
[dh28:10795] Failing at address: 0x3d7810
BLACS WARNING 'No need to set message ID range due to MPI communicator.'
from {-1,-1}, pnum=2, Contxt=-1, on line 18 of file 'blacs_set_.c'.

[dh28:10796] *** Process received signal ***
[dh28:10796] Signal: Segmentation fault (11)
[dh28:10796] Signal code: Address not mapped (1)
[dh28:10796] Failing at address: 0x3d7810
[dh28:10794] [ 0] 2 libSystem.B.dylib 0x00007fff82b8167a _sigtramp + 26
[dh28:10794] [ 1] 3 ??? 0x0000000032aaaba7 0x0 + 850045863
[dh28:10794] [ 2] 4 xCbtest_MPI--0 0x000000010007a0f3 BI_TransUserComm + 35
[dh28:10794] [ 3] 5 xCbtest_MPI--0 0x0000000100088d7b Cblacs_gridmap + 251
[dh28:10794] [ 4] 6 xCbtest_MPI--0 0x0000000100088c0b Cblacs_gridinit + 123
[dh28:10794] [ 5] 7 xCbtest_MPI--0 0x00000001000017b1 MAIN__ + 219
[dh28:10794] [ 6] 8 xCbtest_MPI--0 0x000000010008971c main + 44
[dh28:10794] [ 7] 9 xCbtest_MPI--0 0x00000001000016b4 start + 52
[dh28:10794] [ 8] 10 ??? 0x0000000000000001 0x0 + 1
[dh28:10794] *** End of error message ***


I see the identifier "TransUserComm" seems to be involved somehow. I'm not quite sure if this is related to the
Transcomm flag in the Bmake file for BLACS. I ran the tests in BLACS/INSTALL to see if I should set this flag
in a certain way but these tests seems to suggest that I should leave it blank.

I found this

viewtopic.php?f=2&t=1821&p=5227&hilit=TRANSCOMM#p5227

but I'm not sure if it's on point.

Any ideas would be appreciated. Do I need a different (older) version of openmpi? Should I step away from the 64bit installation?

Cheers,

Robert
apologia
 
Posts: 4
Joined: Tue Jan 25, 2011 1:41 pm

Re: problem testing blacs on mac os x 10.6.5

Postby admin » Tue Jan 25, 2011 2:38 pm

Yes indeed, check that the following is set in the Bmake.inc
Code: Select all
TRANSCOMM =  -DUseMpi2
admin
Site Admin
 
Posts: 490
Joined: Wed Dec 08, 2004 7:07 pm

Re: problem testing blacs on mac os x 10.6.5

Postby apologia » Tue Jan 25, 2011 7:46 pm

Thanks, that did the trick. Thought I tried that last night -- clearly though
it was too late.
apologia
 
Posts: 4
Joined: Tue Jan 25, 2011 1:41 pm


Return to ScaLAPACK installer

Who is online

Users browsing this forum: Bing [Bot] and 1 guest