I am quite new to CUDA and very interested in MAGMA. I am trying to use magma_dgeev.
I would like my program to:
1 - Copy data from host to device
2 - Execute magma_dgeev
3 - Do other stuff with the results on the device.
4 - Copy results from device to host
In the testing example testing_dgeev.cpp, the matrix memory is allocated with cudaMallocHost, that is, on the host. My first attempt works the same way, but to achieve step 3 I need to (re)send the data to the device... Is it possible to keep it there? I mean, I would rather call magma_dgeev with inputs that are in device memory and have the results written to device memory.
How can I do that?
(And sorry for any mistakes in my English.)