I am looking at sgetrf_gpu.cpp file to understand how clmagma uses LU factorization.
However, I am very new to clmagma and programming, I have hard time understanding the code.
Here is the part of sgetrf_gpu.cpp:
/* Use hybrid blocked code. */
maxm = ((m + 31)/32)*32;
maxn = ((n + 31)/32)*32;
lddat = maxn;
lddwork = maxm;
Could anybody explain what the variables lddat and lddwork stand for? and also nb, and dAT?
Also, could anybody explain why (m+31) is divided by 32 and then multiplied by 32?
Thank you in advance!