Search found 897 matches

by mgates3
Wed May 02, 2012 4:27 pm
Forum: User discussion
Topic: Magma compiling with MPF failed
Replies: 2
Views: 2261

Re: Magma compiling with MPF failed

MAGMA does not replace the existing BLAS and LAPACK libraries on the CPU. It supplements them with functions to run on the GPU. In fact, MAGMA routines use both the existing BLAS on the CPU (e.g., ATLAS, MKL) and BLAS on the GPU (e.g., CUBLAS, MAGMABLAS). Certainly you could write a wrapper to call ...
by mgates3
Wed May 02, 2012 4:07 pm
Forum: User discussion
Topic: Problem building Magma 1.1.0
Replies: 3
Views: 1847

Re: Problem building Magma 1.1.0

What version of CUDA are you using? Version 4.0 and 4.1 both define cublasStatus_t in the cublas.h. If you can't upgrade, a work around is to define it as John suggests.
-mark
by mgates3
Wed May 02, 2012 3:53 pm
Forum: User discussion
Topic: Dynamic library (.so) compilation on Linux
Replies: 3
Views: 3674

Re: Dynamic library (.so) compilation on Linux

You can safely use the cublas functions. The magmablas functions are in some cases faster but otherwise provide the same functionality. The magmablas_dtrsm should be defined in magmablas/dtrsm_tesla.cu.
-mark
by mgates3
Wed May 02, 2012 3:31 pm
Forum: User discussion
Topic: Problem with "magma_zhegvx" function
Replies: 3
Views: 2274

Re: Problem with "magma_zhegvx" function

No, that's the correct function to use. I didn't realize that function had only the complex version.
-mark
by mgates3
Wed May 02, 2012 3:14 pm
Forum: User discussion
Topic: clMAGMA 0.1 Beta Released
Replies: 8
Views: 19748

Re: clMAGMA 0.1 Beta Released

All four precisions (single, double, single complex, double complex) are available in clMAGMA for QR and Cholesky. However, LU factorization is currently only in single precision. We intend to add support for other precisions but don't have a definite time table yet.
-mark
by mgates3
Tue Apr 03, 2012 1:28 pm
Forum: User discussion
Topic: Tridiagonal Solver
Replies: 3
Views: 3032

Re: Tridiagonal Solver

Magma does not have tridiagonal (or banded) solvers for the GPU. My guess is the tridiagonal solver in LAPACK (dgtsv or dptsv) on the CPU is faster than transferring a tridiagonal matrix to the GPU, solving, and transferring the results back. This is because a tridiagonal solve has O(n) operations o...
by mgates3
Mon Mar 19, 2012 4:53 pm
Forum: User discussion
Topic: Error while installing MAGMA installation
Replies: 1
Views: 1758

Re: Error while installing MAGMA installation

Can you check that the CUDADIR is correct? I.e., if you list that directory you should see something like: > ls /usr/local/cuda/lib64/ libcublas.so@ libcufft.so@ libcusparse.so@ libcublas.so.4@ libcufft.so.4@ libcusparse.so.4@ libcublas.so.4.0.17* libcufft.so.4.0.17* libcusparse.so.4.0.17* libcudart...
by mgates3
Mon Mar 19, 2012 4:46 pm
Forum: User discussion
Topic: MAGMA on windows or linux
Replies: 6
Views: 3148

Re: MAGMA on windows or linux

Can you give some details about what you tried, what it did, what you expected, what is your environment (version of Linux, compiler, etc.)?

On Linux, as a first step you need to edit the make.inc file. Several example make.inc files are given.

-mark
by mgates3
Mon Mar 19, 2012 4:41 pm
Forum: User discussion
Topic: How to install MAGMA on a login node
Replies: 1
Views: 2197

Re: How to install MAGMA on a login node

Yes, that header is in the CUDA install, for instance, /usr/local/cuda/include/cuda_runtime_api.h Based on your compile line, I would expect that in /usr/share/Modules/modulefiles/cuda/include/cuda_runtime_api.h If you can't read that directory, you need to consult with your system administrator. Ch...
by mgates3
Mon Mar 19, 2012 4:33 pm
Forum: User discussion
Topic: dtrsm bug in magma 1.1
Replies: 7
Views: 5035

Re: dtrsm bug in magma 1.1

I couldn't exactly replicate the issue, but it seems there is a problem. Running memcheck finds an invalid memory reference, which indicates a bug in MAGMA. We'll look into it. Otherwise, use the cublas dtrsm. -mark > cuda-memcheck ./dtrsm ========= CUDA-MEMCHECK 1.000000 0.000000 0.000000 -0.400000...