Hi
I have downloaded and built MAGMA, and would just like to say how EXCITED I am about it.
However, I run into some problems when I run the sgetrf tests on both a 8600 GT and a 9500 GT.
When running the tests for matrix sizes greater than the block size (which I have changed set in the magma_get_sgetrf_nb function in get_np.c).
For example if the block size returns 128 if m is less than 2048, then the following output is obtained when running for -N 129:
./testing_sgetrf_gpu -N 129
device 0: GeForce 9500 GT, 1350.0 MHz clock, 1023.3 MB memory
N CPU GFlop/s GPU GFlop/s ||PA-LU|| / (||A||*N)
==========================================================
129 0.62 0.49 1.718544e-01
as soon as the matrix size is 128 or less, it seems to work well:
./testing_sgetrf_gpu -N 128
device 0: GeForce 9500 GT, 1350.0 MHz clock, 1023.3 MB memory
N CPU GFlop/s GPU GFlop/s ||PA-LU|| / (||A||*N)
==========================================================
128 0.73 0.70 1.657047e-09
45 0.42 0.40 2.046684e-09
Does anyone have any ideas as to how I can go about attempting to debug and fix this?
I have also tested the code on another machine running a GTX 280, and it appears to be running well.
Thanks
Evan
