Search found 268 matches

by Stan Tomov
Tue Apr 22, 2014 6:26 pm
Forum: User discussion
Topic: zgemm for matrices that don't fit
Replies: 2
Views: 1714

Re: zgemm for matrices that don't fit

You can try the new NVIDIA cuBLAS-XT library. See:
https://developer.nvidia.com/cublasxt
Stan
by Stan Tomov
Fri Sep 13, 2013 1:14 am
Forum: User discussion
Topic: the error when I compile magma1.4.0 in vs2010
Replies: 3
Views: 2533

Re: the error when I compile magma1.4.0 in vs2010

You should be able to specify compiler options in vs2010, but I haven't used it recently and I am not sure where exactly. The other way is to revise the magma_types.h, e.g., by adding after the include statements a

Code: Select all

#define HAVE_CUBLAS
by Stan Tomov
Mon Sep 09, 2013 9:20 am
Forum: User discussion
Topic: the error when I compile magma1.4.0 in vs2010
Replies: 3
Views: 2533

Re: the error when I compile magma1.4.0 in vs2010

HAVE_CUBLAS has to be defined, e.g., by adding -DHAVE_CUBLAS to the compiler options. In Linux, the compiler options are set in Makefile.internal based on user input from make.inc.
by Stan Tomov
Thu Aug 22, 2013 5:55 pm
Forum: User discussion
Topic: Error: BLAS/LAPACK routine 'magma_' gave error code -7
Replies: 2
Views: 3953

Re: Error: BLAS/LAPACK routine 'magma_' gave error code -7

I see that the work space is indeed not large enough. You have

Code: Select all

lwork = max( lwork, max( nb, 2*nb*nb ));
but it should be

Code: Select all

lwork = max( lwork, max( n_col*nb, 2*nb*nb ));
Alternatively, you could have called directly magma_dgeqrf with work space size query (instead of lapackf77_dgeqrf).
by Stan Tomov
Sat Jun 22, 2013 10:33 am
Forum: User discussion
Topic: magma-1.4.0-beta1 does not compile...
Replies: 3
Views: 1947

Re: magma-1.4.0-beta1 does not compile...

Sorry about this - we use release generation scripts and we had a bug there. To fix it, please add

Code: Select all

.DEFAULT_GOAL :=
at the end of file Makefile.internal. Thanks.
by Stan Tomov
Fri Jun 14, 2013 1:46 pm
Forum: User discussion
Topic: GPU interface to dgetrf with streams
Replies: 4
Views: 3875

Re: GPU interface to dgetrf with streams

Austin, I see. This sounds good. We have been asked by users to provide this type of stream interface, so any experimental results on performance would be very useful for us to know. I can check with NVIDIA developers if routine arguments are always sent asynchronously or if there are cases that the...
by Stan Tomov
Thu Jun 13, 2013 11:18 pm
Forum: User discussion
Topic: GPU interface to dgetrf with streams
Replies: 4
Views: 3875

Re: GPU interface to dgetrf with streams

The current code uses stream 0 for the GPU BLAS. This would not allow concurrent BLAS execution on the GPU from the different threads. Related to the communications, I think magmablas_dpermute_long2s does not use synchronous communications. The routine does not have any explicit communication, only ...
by Stan Tomov
Fri May 24, 2013 5:33 pm
Forum: User discussion
Topic: Segmentation fault in magma_dsetmatrix
Replies: 11
Views: 7460

Re: Segmentation fault in magma_dsetmatrix

We have
Ubuntu 12.04.2 LTS
OpenCL 1.2
AMD-APP 1124.2
Driver version 12.10.5 (module loaded - fglrx 12.10.5 [Mar 20 2013] with 1 minors)
clAmdBlas 1.11.314 (also 1.8.286 and 1.8.291)
by Stan Tomov
Tue May 21, 2013 10:19 am
Forum: User discussion
Topic: problem MAGMA-1.2.0 ATLAS-3.10.0 Lapack-3.4.1..help !!!
Replies: 12
Views: 9612

Re: problem MAGMA-1.2.0 ATLAS-3.10.0 Lapack-3.4.1..help !!!

Hi, These undefined routines are supposed to be in lapack. Maybe check to see if they are indeed there, e.g., nm liblapack.a | grep ssyevd If it is not there, probably you have some incomplete version of LAPACK, e.g., something that comes with ATLAS. If it is there, probably just adding -llapack onc...
by Stan Tomov
Tue May 21, 2013 10:02 am
Forum: User discussion
Topic: Segmentation fault in magma_dsetmatrix
Replies: 11
Views: 7460

Re: Segmentation fault in magma_dsetmatrix

Actually we have similar problem on a new system that we just setup. We rely on the clAmdBlas so this is one of the first things that we check when there are problems. On dgemm we also get somehow similar results ./testing_dgemm Initializing clMAGMA runtime ... Usage: testing_dgemm [-NN|NT|TN|TT] [-...