According to the magma-v02 documentation, MAGMA should be able to balance work across the CPU cores and the GPU. Is this functional in MAGMA 0.2?
I am trying MAGMA 0.2 on multicore + GPU machines. The configuration is:
2 x Intel Xeon E5520 => 8 cores
2 x Tesla C1060 (I know MAGMA only uses one GPU)
I ran some of the test programs provided with MAGMA, with 8 and 4 OpenMP threads.
For example, testing_cgeqrf, which times cgeqrf for both the MKL and MAGMA libraries, gives these results:
- with 8 OpenMP threads: MKL: 119.38 GFlops, MAGMA: 69.66 GFlops
- with 4 OpenMP threads: MKL: 62.33 GFlops, MAGMA: 69.29 GFlops
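To make the scaling explicit, here is a small Python sketch that computes the 4-to-8-thread speedup from the figures above. It is not part of MAGMA or MKL; the `gflops` table and the `speedup` helper are names made up for this illustration, and the numbers are copied from my runs.

```python
# GFlops figures copied from the testing_cgeqrf runs above.
# Keys are library names, inner keys are OpenMP thread counts.
gflops = {
    "MKL": {4: 62.33, 8: 119.38},
    "MAGMA": {4: 69.29, 8: 69.66},
}


def speedup(lib, low=4, high=8):
    """Ratio of throughput at `high` threads to throughput at `low` threads."""
    return gflops[lib][high] / gflops[lib][low]


for lib in gflops:
    print(f"{lib}: {speedup(lib):.2f}x from 4 to 8 threads")
```

So MKL scales by roughly 1.92x when the thread count doubles, while MAGMA stays at about 1.01x, i.e. essentially flat.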
I did not run these tests with all the MAGMA functions. So my questions are:
- are all the MAGMA functions hybrid?
- if the tested function is hybrid, why does the number of threads not affect its performance?