Search found 20 matches
- Thu Nov 20, 2014 7:45 am
- Forum: User discussion
- Topic: Segmentation faul error testing
- Replies: 5
- Views: 4249
Re: Segmentation faul error testing
I solved it !!!! make.inc file: #GPU_TARGET ?= Tesla Fermi Kepler GPU_TARGET = Kepler CC = gcc CXX = g++ NVCC = nvcc FORT = gfortran ARCH = ar ARCHFLAGS = cr RANLIB = ranlib # Defining MAGMA_ILP64 or MKL_ILP64 changes magma_int_t to int64_t in include/magma_types.h # Compiling with -std=c++98 -pedan...
- Tue Nov 18, 2014 7:42 am
- Forum: User discussion
- Topic: Segmentation faul error testing
- Replies: 5
- Views: 4249
Re: Segmentation faul error testing
Hello, I reinstall with MAGMA 1.6, but it not works. GPU=Kepler Some examples: ./testing_zgetrf -M 32 -N 32 MAGMA 1.6.0 compiled for CUDA capability >= 3.0 CUDA runtime 6050, driver 6050. OpenMP threads 24. MKL 11.1.0, MKL threads 12. device 0: Tesla K20m, 705.5 MHz clock, 4799.6 MB memory, capabili...
- Mon Nov 17, 2014 11:03 am
- Forum: User discussion
- Topic: Segmentation faul error testing
- Replies: 5
- Views: 4249
Segmentation faul error testing
Hello, I installed MAGMA 1.5.0_beta3. My make.inc is: # GPU_TARGET contains one or more of Tesla, Fermi, or Kepler, # to specify for which GPUs you want to compile MAGMA: # Tesla - NVIDIA compute capability 1.x cards # Fermi - NVIDIA compute capability 2.x cards # Kepler - NVIDIA compute capability ...
- Fri Sep 26, 2014 7:20 am
- Forum: User discussion
- Topic: Pdf results function dsytrd
- Replies: 2
- Views: 1506
Re: Pdf results function dsytrd
Ok, thank you very much.
Some error in my compilation occurs and I haven't got the executable file.
I will recompile it.
Some error in my compilation occurs and I haven't got the executable file.
I will recompile it.
- Fri Sep 26, 2014 7:10 am
- Forum: User discussion
- Topic: testing_dorgqr_m api-trace
- Replies: 1
- Views: 1463
testing_dorgqr_m api-trace
Hello, I am looking for the time differences between local computation and remote one and I am searching inside the functions. (Other ideas will be welcomed) Taking testing_dorgqr_m example, Time Measurement code is: gpu_time = magma_wtime(); magma_dorgqr_m( m, n, k, hR, lda, tau, hT, nb, &info ); g...
- Tue Sep 23, 2014 12:29 pm
- Forum: User discussion
- Topic: Pdf results function dsytrd
- Replies: 2
- Views: 1506
Pdf results function dsytrd
Hello, I am working with multigpus system. In http://icl.cs.utk.edu/magma/news/news.html?id=345 at pdf: http://icl.cs.utk.edu/projectsfiles/magma/pubs/MAGMA_1.4.pdf at the page 24, you expose dsytrd function using it with several gpus. Why you use it? I put different commands and the results are all...
- Wed Sep 10, 2014 10:33 am
- Forum: User discussion
- Topic: Different checksum with >1 gpu (testing_dgetrf_mgpu)
- Replies: 6
- Views: 2343
Re: Different checksum with >1 gpu (testing_dgetrf_mgpu)
Hello, mark, when you said: The Gflop/s is the total performance of all CPUs + GPUs that MAGMA uses. It is simply (2/3) n^3 / time. What did you exactly refer with, time ? Options: - time initialize cuda driver + data transfer host-device + kernel + data transfer device-host - time initialize cuda d...
- Mon Sep 01, 2014 6:11 am
- Forum: User discussion
- Topic: testing_dsyr2k_mgpu and use of nb
- Replies: 1
- Views: 1621
testing_dsyr2k_mgpu and use of nb
Hello,
I am taking times of ejecution and Gflops for this function (dsyr2k).
Which is the main purpose of the function and how works the option nb (size of block)? Is the way of computacion, or packet sending ...?
All the information is wellcoming ;)
Thanks for all.
I am taking times of ejecution and Gflops for this function (dsyr2k).
Which is the main purpose of the function and how works the option nb (size of block)? Is the way of computacion, or packet sending ...?
All the information is wellcoming ;)
Thanks for all.
- Tue Aug 05, 2014 9:03 am
- Forum: User discussion
- Topic: Different checksum with >1 gpu (testing_dgetrf_mgpu)
- Replies: 6
- Views: 2343
Re: Different checksum with >1 gpu (testing_dgetrf_mgpu)
One question, the output value GFlops/s, when is used with mgpu apps, shows the GFlops/s of 1 gpu or the total number of operations (the gpu total GFlops )?
Thanks
Thanks
- Tue Aug 05, 2014 3:46 am
- Forum: User discussion
- Topic: Different checksum with >1 gpu (testing_dgetrf_mgpu)
- Replies: 6
- Views: 2343
Re: Different checksum with >1 gpu (testing_dgetrf_mgpu)
Ok, the output is this one: ./testing_dgetrf_mgpu -c -N 5000 --ngpu 8 MAGMA 1.4.1 , compiled for CUDA capability >= 1.0 device 0: Tesla K20m, 705.5 MHz clock, 4799.6 MB memory, capability 3.5 device 1: Tesla K20m, 705.5 MHz clock, 4799.6 MB memory, capability 3.5 device 2: Tesla K20m, 705.5 MHz cloc...