Search found 23 matches

by bravegag
Tue Jan 21, 2014 10:42 am
Forum: User discussion
Topic: dgetrf_gpu crashes with M non multiple of 32?
Replies: 3
Views: 1736

Re: dgetrf_gpu crashes with M non multiple of 32?

Thank you, Mark. Spot on. I will change my code to avoid such mistakes in the future.

Best!
by bravegag
Mon Jan 13, 2014 10:54 am
Forum: User discussion
Topic: dgetrf_gpu crashes with M non multiple of 32?
Replies: 3
Views: 1736

Re: dgetrf_gpu crashes with M non multiple of 32?

SOLVED.

I'm sorry, my bad. The line:

magma_malloc_cpu((void**) &ipiv, min_mn);
should instead be:

magma_malloc_cpu((void**) &ipiv, min_mn*sizeof(magma_int_t));
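
For anyone hitting the same crash: as the fix above implies, the size argument is a byte count, not an element count. Below is a minimal sketch of the same pattern using plain malloc, so it builds without MAGMA. The names pivot_int_t, pivot_bytes and alloc_pivots are made up for illustration and are not part of the MAGMA API; pivot_int_t only stands in for magma_int_t.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Stand-in for magma_int_t (assumption: the real type comes from the MAGMA headers). */
typedef int pivot_int_t;

/* Hypothetical helper: number of BYTES needed for min(m, n) pivot indices. */
static size_t pivot_bytes(size_t m, size_t n)
{
    size_t min_mn = (m < n) ? m : n;
    return min_mn * sizeof(pivot_int_t);   /* element count times element size */
}

/* Hypothetical helper: allocate the pivot array; the caller must free() it. */
static pivot_int_t *alloc_pivots(size_t m, size_t n)
{
    assert(m > 0 && n > 0);                /* malloc(0) is implementation-defined */
    return (pivot_int_t *)malloc(pivot_bytes(m, n));
}
```

With only min_mn bytes allocated, writing min_mn integers runs past the end of the buffer, which would explain a crash without any error message.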
by bravegag
Mon Jan 13, 2014 10:42 am
Forum: User discussion
Topic: dgetrf_gpu crashes with M non multiple of 32?
Replies: 3
Views: 1736

dgetrf_gpu crashes with M non multiple of 32?

Hello, I'm using the latest MAGMA 1.4.1 and I noticed that in all examples in testing_dgetrf_gpu M is a multiple of 32, i.e. the matrix size is set to LDDAxN where LDDA is ((M + 31) / 32)*32. When I try using the function from my application with M sizes that are not a multiple of 32 it crashes (no erro...
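
The padding expression quoted above, ((M + 31) / 32)*32, rounds M up to the next multiple of 32 using integer division. Here is a small sketch of that arithmetic; round_up is a hypothetical helper name, not a MAGMA function.

```c
#include <assert.h>

/* Hypothetical helper: round m up to the next multiple of align (e.g. 32).
   Relies on integer division truncating toward zero for non-negative values. */
static int round_up(int m, int align)
{
    assert(m >= 0 && align > 0);
    return ((m + align - 1) / align) * align;
}
```

For example, round_up(1000, 32) gives 1024, while a value that is already a multiple of 32 is returned unchanged.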
by bravegag
Mon Jan 13, 2014 10:30 am
Forum: User discussion
Topic: Book on CUBLAS and MAGMA
Replies: 2
Views: 1830

Re: Book on CUBLAS and MAGMA

I found it very useful, thank you for sharing!
by bravegag
Wed Aug 21, 2013 8:35 am
Forum: User discussion
Topic: will ?gees (Schur decomposition) be supported?
Replies: 1
Views: 1563

will ?gees (Schur decomposition) be supported?

Hello,

Is ?gees by any chance in the development pipeline for MAGMA?

TIA,
Best regards,
Giovanni
by bravegag
Fri Aug 16, 2013 11:50 am
Forum: User discussion
Topic: Eigen MAGMA backend implementation project
Replies: 10
Views: 9510

Re: Eigen MAGMA backend implementation project

Doing the following (increasing the DP rate of the Titan card) improved the benchmark results greatly in some cases, e.g. the matrixMulCUBLAS example modified to use DGEMM now tops out at 1.3 TFlop/s:

Image
by bravegag
Wed Aug 14, 2013 8:46 am
Forum: User discussion
Topic: Eigen MAGMA backend implementation project
Replies: 10
Views: 9510

Re: Eigen MAGMA backend implementation project

Hi Mark, thank you very much for your feedback and help. I have already tried all versions, and the GPU versions performed best in my benchmarks, even while having to copy unpinned memory between Host <-> Device. It would be great to have the same benchmarks executed using a Tesla K20 card and before askin...
by bravegag
Tue Aug 13, 2013 10:55 am
Forum: User discussion
Topic: Eigen MAGMA backend implementation project
Replies: 10
Views: 9510

Re: Eigen MAGMA backend implementation project

After integrating the magma_?geqrf3_gpu implementation, I get very disappointing benchmark results. This was very surprising to me given the excellent results of magma_?geqp3_gpu. Eigen MAGMA integration: https://github.com/bravegag/eigen-magma/blob/master/Eigen/src/QR/HouseholderQR_MAGMA.h Benchmark ...
by bravegag
Mon Aug 12, 2013 8:39 am
Forum: User discussion
Topic: Eigen MAGMA backend implementation project
Replies: 10
Views: 9510

Re: Eigen MAGMA backend implementation project

Bug fix related to the Cholesky decomposition: the benchmark input matrix A was not SPD. This has been fixed and the results are now correct. MAGMA now shines, reaching over 120 Gflop/s:
https://github.com/bravegag/eigen-magma-benchmark
by bravegag
Mon Aug 12, 2013 4:40 am
Forum: User discussion
Topic: Eigen MAGMA backend implementation project
Replies: 10
Views: 9510

Re: Eigen MAGMA backend implementation project

Hi Mark, Thank you very much for your response! Please find my comments below: First, on this page https://github.com/bravegag/eigen-magma-benchmark the images appear broken for me. I have to click on each one to see the image in a separate window. Thank you. I have corrected that. Now all the image...