I have a 9 node Intel quad-core cluster and on the master node I have a pci-express 16 slot a BFG-GTX-260 NVIDIA GPU card and the latest 2.3 CUDA SDK, toolkit, and drivers. My GPU card has 216 processing elements and I managed to get your new MAGMA library working with GotoBLAS v1.26 (academic) and gfortran with the attached Make.inc. One has to add -llapack and remove -lguide and
yum install lapack
in fedora core 11 Linux because that's what I am running on my cluster. Also I have hard coded the path to the GotoBLAS library so you will have to change that from -L/bummer/GotoBLAS to something like -L$(HOME)/GotoBLAS. All that is required on fc11 is the change to Make.inc and it compiles beautifully in the make all in ../testing directory of the magma-0.1 install!
I tried it and it works for testing_spotrf -N 1024 3072 with good results even during a run where all host cpus were running another program (long run program related to number theory).
Best wishes,
Allan MeneZes MMATH