by pdgetrf » Fri Jan 28, 2011 12:19 pm
I planted a printf in dtrsm so that when magma_dtrsm is called, we know it for sure.
This is what I'm seeing so far:
du:disco ~/disco-home/sandbox/bugfix> ./nan <- 11:08AM
[ [ 0.391137 , 0.37845 , 0.676826 , 0.828647 ];
[ 0.572013 , 1.29715 , 1.72296 , 3.94056 ];
[ 0.382288 , 0.33823 , 1.39019 , 2.54674 ];
[ 0.719967 , 1.69088 , 1.85096 , 1.95143 ] ]
[ [ 0.729483 , 0 , 0 , 0 ];
[ 0.551107 , 1.98647 , 0 , 0 ];
[ 0.946236 , 0.61067 , 2.91446 , 0 ];
[ 0.493657 , 0.600847 , 0.152826 , 3.33746 ] ]
in magmablas_dtrsm_tesla
[ [ 0.536184 , 0.0417602 , 0.0493978 , 0.159198 ];
[ 0.784135 , 0.43545 , 0.245352 , 0.975092 ];
[ 0.524053 , 0.0248786 , 0.301641 , 0.667271 ];
[ 0.986955 , 0.577387 , 0.193681 , 0.325904 ] ]
And this is the result after removing the #define:
du:disco ~/disco-home/sandbox/bugfix> !. <- 11:09AM
./nan
[ [ 0.391137 , 0.37845 , 0.676826 , 0.828647 ];
[ 0.572013 , 1.29715 , 1.72296 , 3.94056 ];
[ 0.382288 , 0.33823 , 1.39019 , 2.54674 ];
[ 0.719967 , 1.69088 , 1.85096 , 1.95143 ] ]
[ [ 0.729483 , 0 , 0 , 0 ];
[ 0.551107 , 1.98647 , 0 , 0 ];
[ 0.946236 , 0.61067 , 2.91446 , 0 ];
[ 0.493657 , 0.600847 , 0.152826 , 3.33746 ] ]
[ [ 0.536184 , 0.0417602 , 0.0493978 , 0.159198 ];
[ 0.784135 , 0.43545 , 0.245352 , 0.975092 ];
[ 0.524053 , 0.0248786 , 0.301641 , 0.667271 ];
[ 0.986955 , 0.577387 , 0.193681 , 0.325904 ] ]
One difference is that ourh C2050 machine is using CUDA driver 3.1. We are still looking into it.