Search found 8 matches

by keitat
Mon Jun 04, 2012 5:15 pm
Forum: User discussion
Topic: dgesv MAGMA Fortran Device Interface w/ CRAY+OpenACC
Replies: 7
Views: 4431

Re: dgesv MAGMA Fortran Device Interface w/ CRAY+OpenACC

Have you ever tried libsci_acc bundled with Cray OpenACC compiler?
by keitat
Tue May 29, 2012 11:47 am
Forum: User discussion
Topic: Bug in ZINPLACE_TRANSPOSE?
Replies: 6
Views: 2747

Re: Bug in ZINPLACE_TRANSPOSE?

It appears transpose routine looks OK. I am also looking into magmablas_zpermute_long2 to find any out-of-bound memory access. Just for clarification. The error occurs when: M=3584 LDA=3616 Input matrix starts at A(32,32). I am still seeing wrong answer if A and X are allocated in contiguous memory ...
by keitat
Mon May 28, 2012 12:19 am
Forum: User discussion
Topic: Bug in ZINPLACE_TRANSPOSE?
Replies: 6
Views: 2747

Re: Bug in ZINPLACE_TRANSPOSE?

I am still seeing wrong answer if A and X are allocated in contiguous memory as described below. I suspect that zinplace_tranpose is making out-of-bound memory access when M!=LDA. In my application, I finally get the correct answer after changing the source to apply zinplace_transpose for LDA=M only...
by keitat
Fri May 11, 2012 3:09 pm
Forum: User discussion
Topic: MAGMA 1.2 Released
Replies: 7
Views: 14280

Re: MAGMA 1.2 Released

It appears zhetrd_mgpu.cpp and zlatrd_mgpu.cpp are missing. You could recover them from chetrd_mgpu.cpp and clatrd_mgpu.cpp, but I'd like to Magma developers team to fix this problem. Also, there are several undefined reference for some data transfer and BLAS kernel calls such as magma_dgetmatrix an...
by keitat
Fri May 04, 2012 5:19 pm
Forum: User discussion
Topic: Bug in DGETRF2_OOC?
Replies: 0
Views: 912

Bug in DGETRF2_OOC?

Hi, I am testing dgetrf_ooc routine using testing_dgetrf.cpp. Then, I got inaccurate answer. See the output below. In the tester source, I call dgetrf_ooc and dgetrf_piv in sequence. Is there a bug in dgetrf_ooc? #if 0 magma_dgetrf( M, N, h_R, lda, ipiv, &info); #else magma_dgetrf_ooc( M, N, h_R, ld...
by keitat
Tue May 01, 2012 5:45 pm
Forum: User discussion
Topic: magma_zgetri_gpu segfaults in OpenMP parallel for
Replies: 14
Views: 9767

Re: magma_zgetri_gpu segfaults in OpenMP parallel for

CUBLAS's old API does not guarantee thread safety, and the CUBLAS4.0 or higher document recommends using Version 2 API with stream created by each individual CPU thread. Current version of Magma relies on CUBLAS's old API and thay may break the execution order of kernels in a few calls using multipl...
by keitat
Tue Apr 24, 2012 9:55 pm
Forum: User discussion
Topic: Bug in ZINPLACE_TRANSPOSE?
Replies: 6
Views: 2747

Re: Bug in ZINPLACE_TRANSPOSE?

Stan,

I can see the same problem with sgetrf_gpu, dgetrf_gpu and cgetrf_gpu.

Thanks,

Keita
by keitat
Tue Apr 24, 2012 12:47 am
Forum: User discussion
Topic: Bug in ZINPLACE_TRANSPOSE?
Replies: 6
Views: 2747

Bug in ZINPLACE_TRANSPOSE?

Hi, I have seen strange error with zgetrf_gpu (wrong answer or segfault with memory calls after the routine) on Cray XK (Fermi+). The error occurs when (1) M=N, (2) M is multiple of 32, and (3) LDA is mulitple of 32, but not equal to M. I thought this could be a bug in zinplace_tranpsose. So I chang...