Hi,
I am taking great interest in the MAGMA project, as it looks very promising. I was just wondering there are plans to support multi-GPU configurations (if available)? This is the next logical step, and could unlock another level of performance.
FYI, my problem is that of a multi-GPU multigrid solver. On the coarsest level, the density of the resulting linear system means that using an LU factorization is much more efficient than an iterative indirect approach. My problem is spread across 4 GPUs (probably more in the future), so the vector to which I wish to apply the factorization is spread over these GPUs. Before I stumbled across MAGMA, I assumed the optimum approach would be just to reassemble the vector in full on the CPU and do the LU factorization there. My problem has O(5000) degrees of freedom.
Cheers,
Mike.
