Search found 897 matches

by mgates3
Fri Jan 06, 2012 4:52 pm
Forum: User discussion
Topic: where is the realization of magmablas_sgemm?
Replies: 4
Views: 3677

Re: where is the realization of magmablas_sgemm?

I think you mean you found the declaration in include/magmablas_s.h
The actual code is in several files matching magmablas/sgemm*.cu

-mark
by mgates3
Fri Jan 06, 2012 4:47 pm
Forum: User discussion
Topic: problem with magma_dgetrf_gpu
Replies: 1
Views: 958

Re: problem with magma_dgetrf_gpu

According to http://developer.nvidia.com/cuda-gpus, the GeForce GT 220 is CUDA 1.2, which doesn't support double precision. Double precision was added in CUDA 1.3. Does the single precision magma_sgetrf routine work?

-mark
by mgates3
Fri Jan 06, 2012 4:39 pm
Forum: User discussion
Topic: dgemv returns wrong results
Replies: 1
Views: 1229

Re: dgemv returns wrong results

I'm not sure that I understand your question. Doing gemv( A, x ) yields A*x. Doing potrf( A ) then potrs( A, x ) yields A^{-1}*x, or in Matlab notation, A\x. These two, A*x and A\x, should not be the same. Can you clarify what are your inputs to each function, and what is the expected output? -mark
by mgates3
Fri Jan 06, 2012 4:24 pm
Forum: User discussion
Topic: Workspace
Replies: 1
Views: 1025

Re: Workspace

To achieve any reasonable performance, MAGMA requires using a blocked algorithm, which requires using a workspace based on the block size. In some cases, MAGMA uses a larger block size or otherwise needs more workspace than LAPACK. Incidentally, the LAPACK performance will also greatly increase if t...
by mgates3
Fri Jan 06, 2012 3:21 pm
Forum: User discussion
Topic: Build error Magma 1.1
Replies: 1
Views: 1321

Re: Build error Magma 1.1

Try adding -x f95-cpp-input to the FOPTS in make.inc. This forces gfortran to run the C pre-processor before compiling. See for instance the sample make.inc.atlas.

-mark
by mgates3
Fri Jan 06, 2012 3:14 pm
Forum: User discussion
Topic: BLAS and LAPACK needed for install
Replies: 2
Views: 2469

Re: BLAS and LAPACK needed for install

MAGMA needs regular BLAS (the Fortran interface); it does not use CBLAS (the C interface). To get good performance you must install an optimized version of BLAS, such as ATLAS, which is available in Ubuntu, or Intel's MKL. As for LAPACK, both LAPACK and CLAPACK provide the same (Fortran) interface, ...
by mgates3
Fri Jan 06, 2012 3:01 pm
Forum: User discussion
Topic: Does MAGMA support out-of-core computations ?
Replies: 4
Views: 2024

Re: Does MAGMA support out-of-core computations ?

Yes, for LU, Cholesky, and QR the MAGMA code automatically handles matrices larger than the GPU's memory ("non-GPU resident"). The input matrix must fit in the host's memory.

-mark