Search found 20 matches

by roalmar2
Thu Nov 20, 2014 7:45 am
Forum: User discussion
Topic: Segmentation faul error testing
Replies: 5
Views: 3588

Re: Segmentation faul error testing

I solved it !!!! make.inc file: #GPU_TARGET ?= Tesla Fermi Kepler GPU_TARGET = Kepler CC = gcc CXX = g++ NVCC = nvcc FORT = gfortran ARCH = ar ARCHFLAGS = cr RANLIB = ranlib # Defining MAGMA_ILP64 or MKL_ILP64 changes magma_int_t to int64_t in include/magma_types.h # Compiling with -std=c++98 -pedan...
by roalmar2
Tue Nov 18, 2014 7:42 am
Forum: User discussion
Topic: Segmentation faul error testing
Replies: 5
Views: 3588

Re: Segmentation faul error testing

Hello, I reinstall with MAGMA 1.6, but it not works. GPU=Kepler Some examples: ./testing_zgetrf -M 32 -N 32 MAGMA 1.6.0 compiled for CUDA capability >= 3.0 CUDA runtime 6050, driver 6050. OpenMP threads 24. MKL 11.1.0, MKL threads 12. device 0: Tesla K20m, 705.5 MHz clock, 4799.6 MB memory, capabili...
by roalmar2
Mon Nov 17, 2014 11:03 am
Forum: User discussion
Topic: Segmentation faul error testing
Replies: 5
Views: 3588

Segmentation faul error testing

Hello, I installed MAGMA 1.5.0_beta3. My make.inc is: # GPU_TARGET contains one or more of Tesla, Fermi, or Kepler, # to specify for which GPUs you want to compile MAGMA: # Tesla - NVIDIA compute capability 1.x cards # Fermi - NVIDIA compute capability 2.x cards # Kepler - NVIDIA compute capability ...
by roalmar2
Fri Sep 26, 2014 7:20 am
Forum: User discussion
Topic: Pdf results function dsytrd
Replies: 2
Views: 1099

Re: Pdf results function dsytrd

Ok, thank you very much.

Some error in my compilation occurs and I haven't got the executable file.

I will recompile it.
by roalmar2
Fri Sep 26, 2014 7:10 am
Forum: User discussion
Topic: testing_dorgqr_m api-trace
Replies: 1
Views: 1191

testing_dorgqr_m api-trace

Hello, I am looking for the time differences between local computation and remote one and I am searching inside the functions. (Other ideas will be welcomed) Taking testing_dorgqr_m example, Time Measurement code is: gpu_time = magma_wtime(); magma_dorgqr_m( m, n, k, hR, lda, tau, hT, nb, &info ); g...
by roalmar2
Tue Sep 23, 2014 12:29 pm
Forum: User discussion
Topic: Pdf results function dsytrd
Replies: 2
Views: 1099

Pdf results function dsytrd

Hello, I am working with multigpus system. In http://icl.cs.utk.edu/magma/news/news.html?id=345 at pdf: http://icl.cs.utk.edu/projectsfiles/magma/pubs/MAGMA_1.4.pdf at the page 24, you expose dsytrd function using it with several gpus. Why you use it? I put different commands and the results are all...
by roalmar2
Wed Sep 10, 2014 10:33 am
Forum: User discussion
Topic: Different checksum with >1 gpu (testing_dgetrf_mgpu)
Replies: 6
Views: 1634

Re: Different checksum with >1 gpu (testing_dgetrf_mgpu)

Hello, mark, when you said: The Gflop/s is the total performance of all CPUs + GPUs that MAGMA uses. It is simply (2/3) n^3 / time. What did you exactly refer with, time ? Options: - time initialize cuda driver + data transfer host-device + kernel + data transfer device-host - time initialize cuda d...
by roalmar2
Mon Sep 01, 2014 6:11 am
Forum: User discussion
Topic: testing_dsyr2k_mgpu and use of nb
Replies: 1
Views: 1280

testing_dsyr2k_mgpu and use of nb

Hello,

I am taking times of ejecution and Gflops for this function (dsyr2k).

Which is the main purpose of the function and how works the option nb (size of block)? Is the way of computacion, or packet sending ...?

All the information is wellcoming ;)

Thanks for all.
by roalmar2
Tue Aug 05, 2014 9:03 am
Forum: User discussion
Topic: Different checksum with >1 gpu (testing_dgetrf_mgpu)
Replies: 6
Views: 1634

Re: Different checksum with >1 gpu (testing_dgetrf_mgpu)

One question, the output value GFlops/s, when is used with mgpu apps, shows the GFlops/s of 1 gpu or the total number of operations (the gpu total GFlops )?

Thanks
by roalmar2
Tue Aug 05, 2014 3:46 am
Forum: User discussion
Topic: Different checksum with >1 gpu (testing_dgetrf_mgpu)
Replies: 6
Views: 1634

Re: Different checksum with >1 gpu (testing_dgetrf_mgpu)

Ok, the output is this one: ./testing_dgetrf_mgpu -c -N 5000 --ngpu 8 MAGMA 1.4.1 , compiled for CUDA capability >= 1.0 device 0: Tesla K20m, 705.5 MHz clock, 4799.6 MB memory, capability 3.5 device 1: Tesla K20m, 705.5 MHz clock, 4799.6 MB memory, capability 3.5 device 2: Tesla K20m, 705.5 MHz cloc...