when i run testing_zgetrf_gpu, when m*n small than about 9000*9000 it can run correct, but when it get bigger,
it just can caculate correct for m or n should be 960*x. but my card is tesla c2050 ,i think it can solve about 13000*13000.
such as 10800*10800 its wrong ,but 11520*11520 its right. such as 7881*7881 not 960 *x,its also right,i dont know the reason. can u help me ?
thank u very much.
lv
- Code: Select all
./testing_zgetrf_gpu -M 9500 -N 9500
device 0: Tesla C2050 / C2070, 1147.0 MHz clock, 2687.4 MB memory, capability 2.0
device 1: Quadro 4000, 950.0 MHz clock, 2047.2 MB memory, capability 2.0
testing_zgetrf -M 9500 -N 9500
M N CPU GFlop/s GPU GFlop/s ||PA-LU||/(||A||*N)
============================================================
Argument 103 of zgetrf had an illegal value.
magma_zgetrf_gpu returned with error code -103
9500 9500 117.20 3600382.87 2.500074e-01
[zhanghw@localhost testing]$ ./testing_zgetrf_gpu -M 10800 -N 10800
device 0: Tesla C2050 / C2070, 1147.0 MHz clock, 2687.4 MB memory, capability 2.0
device 1: Quadro 4000, 950.0 MHz clock, 2047.2 MB memory, capability 2.0
testing_zgetrf -M 10800 -N 10800
M N CPU GFlop/s GPU GFlop/s ||PA-LU||/(||A||*N)
============================================================
Argument 103 of zgetrf had an illegal value.
magma_zgetrf_gpu returned with error code -103
