Search found 12 matches

by katayama
Thu Feb 17, 2011 10:01 am
Forum: User discussion
Topic: Error from magmablas_dtrsm
Replies: 15
Views: 9099

Re: Error from magmablas_dtrsm

Yes. It does!
with the fix, both testing_dpotrf and testing_dpotrf_gpu work.
These are the ones I should test, right?
Thanks,
Nobu
by katayama
Mon Jan 31, 2011 8:35 am
Forum: User discussion
Topic: Nan problems with dgetrf_gpu on RC3
Replies: 20
Views: 8455

Re: Nan problems with dgetrf_gpu on RC3

Dear Stan/John In my previous post dated on Jan. 29, I made a mistake (See line with <<<<<<<<<<<<<<<< below) I commented out the #define statement in dgetrf_gpu and ran testing_dgetrf not testing_dgetrf_gpu. (I only checked the usage line which says it is testing_dgetrf_gpu...) I checked it again an...
by katayama
Sat Jan 29, 2011 7:19 pm
Forum: User discussion
Topic: Error from magmablas_dtrsm
Replies: 15
Views: 9099

Re: Error from magmablas_dtrsm

Hi, I also did the same thing. I copied other info. I am using 64bit OS and thus cula/lib64. Best [katayama@lb01 ~]$ uname -a Linux lb01.kek.jp 2.6.18-194.26.1.el5 #1 SMP Tue Nov 9 12:54:20 EST 2010 x86_64 x86_64 x86_64 GNU/Linux [katayama@lb01 ~]$ g++ -v Using built-in specs. Target: x86_64-unknown...
by katayama
Sat Jan 29, 2011 7:38 am
Forum: User discussion
Topic: Nan problems with dgetrf_gpu on RC3
Replies: 20
Views: 8455

Re: Nan problems with dgetrf_gpu on RC3

Dear stan, Here is the result. Sometimes the first 1024X1024 also gets nan. Othertimes it is OK. I also tried on GTX 580 and show the result at the end [katayama@lb01 magma_1.0.0-rc3]$ nm src/dgetrf_gpu.o 0000000000000000 r .LC0 0000000000000008 r .LC1 U __gxx_personality_v0 U cuCtxSynchronize U cub...
by katayama
Fri Jan 28, 2011 8:23 am
Forum: User discussion
Topic: Nan problems with dgetrf_gpu on RC3
Replies: 20
Views: 8455

Re: Nan problems with dgetrf_gpu on RC3

Dear Stan, I just want to say that I get the nans as well. Nobu [katayama@lb01 testing]$ cat nohup.out ### (./testing_dgetrf) device 0: Tesla C2050, 1147.0 MHz clock, 2687.2 MB memory Usage: testing_dgetrf_gpu -M 1024 -N 1024 M N CPU GFlop/s GPU GFlop/s ||PA-LU||/(||A||*N) ==========================...
by katayama
Thu Jan 27, 2011 7:09 pm
Forum: User discussion
Topic: Error from magmablas_dtrsm
Replies: 15
Views: 9099

Re: Error from magmablas_dtrsm

Hi,
Thanks.
Please let me know if you could reproduce it.
Best
by katayama
Mon Jan 24, 2011 2:02 am
Forum: User discussion
Topic: Error from magmablas_dtrsm
Replies: 15
Views: 9099

cublas result

When I comment out #define line, I get [katayama@lb01 magma]$ ./nan.cublas [ [ 0.391137 , 0.37845 , 0.676826 , 0.828647 ]; [ 0.572013 , 1.29715 , 1.72296 , 3.94056 ]; [ 0.382288 , 0.33823 , 1.39019 , 2.54674 ]; [ 0.719967 , 1.69088 , 1.85096 , 1.95143 ] ] [ [ 0.729483 , 0 , 0 , 0 ]; [ 0.551107 , 1.9...
by katayama
Sat Jan 22, 2011 4:17 am
Forum: User discussion
Topic: Error from magmablas_dtrsm
Replies: 15
Views: 9099

Re: Error from magmablas_dtrsm

Uh mm. I see no attachment. Here is the code, output and Makefile. I use cuda 3.2. Thanks to you! katayama@lb01 magma]$ more out.nan [ [ 0.391137 , 0.37845 , 0.676826 , 0.828647 ]; [ 0.572013 , 1.29715 , 1.72296 , 3.94056 ]; [ 0.382288 , 0.33823 , 1.39019 , 2.54674 ]; [ 0.719967 , 1.69088 , 1.85096 ...
by katayama
Sat Jan 22, 2011 4:12 am
Forum: User discussion
Topic: Error from magmablas_dtrsm
Replies: 15
Views: 9099

Re: Error from magmablas_dtrsm

Hi, Sorry for a late reply. Here is what I have. I've attached a test program, makefile and result. (Hope attachment works.) [katayama@lb01 magma]$ ~/NVIDIA_GPU_Computing_SDK/C/bin/linux/release/deviceQueryDrv CUDA Device Query (Driver API) statically linked version There is 1 device supporting CUDA...
by katayama
Wed Jan 19, 2011 12:08 pm
Forum: User discussion
Topic: Error from magmablas_dtrsm
Replies: 15
Views: 9099

Error from magmablas_dtrsm

Dear experts, I am trying to use rc2. I have the following line cublasDtrsm('R', 'L','T','N', g, g, 1.0, dev_2, g, dev_1, g); Where g is something like 4 - 6144. With #define cublasDtrsm magmablas_dtrsm I get nans in dev_2 on return. I get the right answer with the define statement commented out. I ...