by Stan Tomov » Fri Aug 26, 2011 12:44 am
The difference is in the interface (and the implementation). The _gpu files test the _gpu routines. These routines take input matrices on the GPU memory and produce the result on the GPU memory. The routines without _gpu are CPU LAPACK compliant interface - input matrices and output is on the CPU memory. Both use the GPU for calculations. The GPU interface routines are in general slightly faster.