Search found 279 matches

by Stan Tomov
Tue May 25, 2010 12:57 am
Forum: User discussion
Topic: MAGMA Shared libraries
Replies: 4
Views: 17989

Re: MAGMA Shared libraries

Hi,
Thanks for your interest in MAGMA. The 64-bit shared library was posted in topic
Matlab/nvmex - failure to compile
Stan
by Stan Tomov
Mon May 17, 2010 4:22 pm
Forum: User discussion
Topic: multiGPU support
Replies: 2
Views: 9335

Re: multiGPU support

Hi,
We haven't set up a date yet when to include the multiGPU support. We have to decide issues on interface, provide supporting programs, etc.
Stan
by Stan Tomov
Mon May 17, 2010 1:37 pm
Forum: User discussion
Topic: MAGMA versus CUBLAS performance
Replies: 1
Views: 10258

Re: MAGMA versus CUBLAS performance

Hi, MAGMA BLAS implements only part of the BLAS specification and is meant as a complement to CUBLAS. It improves on certain CUBLAS routines in specific situations (arguments) that are needed in the MAGMA routines. In general the improvements are only up to 2 times only in some cases. If you see imp...
by Stan Tomov
Fri Apr 23, 2010 10:58 pm
Forum: User discussion
Topic: Magma 0.2 with Nvidia Fermi GTX470 and GotoBLAS2 results
Replies: 5
Views: 16987

Re: Magma 0.2 with Nvidia Fermi GTX470 and GotoBLAS2 results

Allan, Yes, 300 GFlop/s is very interesting, actually (even more interesting) it is 360 GFlop/s, and is in double (the kernel is not available yet; I think the GTX470 should get that speed). We hoped to get a new magma release by the end of the month but are running late on this deadline. Regards, S...
by Stan Tomov
Fri Apr 23, 2010 10:11 pm
Forum: User discussion
Topic: Difficulty Compiling Magma in Fortran
Replies: 16
Views: 23741

Re: Difficulty Compiling Magma in Fortran

I don't have access to a pgf90 but I tried with gfortran to produce an object file and to link with it without problem. You should not let your compiler add underscores after the routine names (as magma_dsgesv_gpu_ is not defined, only magma_dsgesv_gpu is defined in libmagma.a). Can you add option -...
by Stan Tomov
Fri Apr 23, 2010 8:53 pm
Forum: User discussion
Topic: Help with sgesv
Replies: 1
Views: 9655

Re: Help with sgesv

Hi, The memory allocation had to be changed slightly. I have included below the modified code. I marked the places with changes with // TTT before the modification. I also added an error check at the end to get || B - A X || / ||A|| = 6.506779e-07 for hard coded N = 1024, random matrix, and NRHS = 2...
by Stan Tomov
Fri Apr 23, 2010 4:41 pm
Forum: User discussion
Topic: what is the size of d_A (A matrix on device) for sgesv_gpu?
Replies: 1
Views: 9452

Re: what is the size of d_A (A matrix on device) for sgesv_gpu?

A strip around the matrix is required for padding (to make the new size divisible by 32). This would result in potentially increasing the leading dimension of the matrix, e.g., see in testing_sgesv_gpu.cpp int dlda = (N/32)*32; if (dlda<N) dlda+=32; Thus, if you have allocated enough memory, your pr...
by Stan Tomov
Tue Apr 20, 2010 5:28 pm
Forum: User discussion
Topic: Magma 0.2 with Nvidia Fermi GTX470 and GotoBLAS2 results
Replies: 5
Views: 16987

Re: Magma 0.2 with Nvidia Fermi GTX470 and GotoBLAS2 results

Hi Allan, This is very interesting. Thanks for sharing with everyone these first experiences with the new Fermi. We also got one on Friday but I went to IPDPS in Atlanta (this week) and didn't have enough time to play with it. I still can offer two comments though. First, MAGMA 0.2 is a release befo...
by Stan Tomov
Wed Apr 14, 2010 1:06 pm
Forum: User discussion
Topic: when's wrong with this?
Replies: 2
Views: 11220

Re: when's wrong with this?

Now I see we have shipped 64-bit liblapacktest.a with the distribution. Hopefully, all you have to do is go to testing/lin/ and do
make clean all
to regenerate it in 32-bit for your system and everything else should be fine. Please let me know if this didn't work.
Thanks,
Stan
by Stan Tomov
Wed Apr 14, 2010 1:01 pm
Forum: User discussion
Topic: Difference between sgetrf, sgetrf_gpu and sgetrf_gpu2
Replies: 2
Views: 10538

Re: Difference between sgetrf, sgetrf_gpu and sgetrf_gpu2

Hi, Yes, all functions use the GPU. The difference is just the interface. Routine magma_sgetrf and magma_sgetrf_gpu takes input matrix and produces result on the CPU memory (as shown in testing_sgetrf.cpp), while magma_sgetrf_gpu assumes the input matrix and the output factorization are on the GPU m...