Code:
cores = 8;
N = 20000;
LDA = 20016;
NRHS = 1;
LDB = LDA;
However, with these settings, the example reports that the computed solution is wrong:
Code:
% make example_dgetrs && ./example_dgetrs
icc -O2 -diag-disable vec -DADD_ -I../include -c example_dgetrs.c -o example_dgetrs.o
ifort -L/opt/intel/Compiler/11.1/064/mkl/lib/em64t -nofor_main example_dgetrs.o -o example_dgetrs -L../../../lib -lplasma -lcoreblas -lcorelapack -lcblas -Wl,--start-group /opt/intel/Compiler/11.1/064/mkl/lib/em64t/libmkl_intel_lp64.a /opt/intel/Compiler/11.1/064/mkl/lib/em64t/libmkl_intel_thread.a /opt/intel/Compiler/11.1/064/mkl/lib/em64t/libmkl_core.a -Wl,--end-group -liomp5 -lpthread -lm
-- PLASMA is initialized to run on 8 cores.
============
Checking the Residual of the solution
-- ||Ax-B||_oo/((||A||_oo||x||_oo+||B||_oo).N.eps) = 2.250816e+11
-- The solution is suspicious !
-- Error in DGETRS example !
If I run a smaller size (e.g., N=10000, LDA=10000), the program reports that the solution is correct.
FWIW, I checked that OpenMP and MKL are each limited to a single thread:
Code:
% env | grep -i threads
OMP_NUM_THREADS=1
MKL_NUM_THREADS=1
Am I doing something wrong? Are the N/LDA values that I specified incorrect in some way?
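For reference, here is a minimal sketch of the constraints I understand to apply to these values, assuming the usual LAPACK-style column-major layout (leading dimension at least max(1, N), element (i, j) stored at offset i + j*LDA). The helper names below are hypothetical and are not part of PLASMA or the example code. The sketch also computes the storage size in size_t, since at N=20000, LDA=20016 the matrix needs 400,320,000 doubles (3,202,560,000 bytes assuming 8-byte doubles), and that byte count no longer fits in a 32-bit int, whereas at N=10000 it does. I have not confirmed whether that difference has anything to do with the failure.

```c
#include <assert.h>
#include <limits.h>
#include <stddef.h>

/* Hypothetical helper: LAPACK-style check that a leading dimension
   is at least max(1, n). */
static int ld_is_valid(int n, int ld) {
    return ld >= (n > 1 ? n : 1);
}

/* Hypothetical helper: number of doubles needed to store a column-major
   matrix with leading dimension ld and ncols columns. The product is
   formed in size_t so it cannot overflow a 32-bit int. */
static size_t matrix_elems(int ld, int ncols) {
    return (size_t)ld * (size_t)ncols;
}

/* Hypothetical helper: the same storage requirement in bytes. */
static size_t matrix_bytes(int ld, int ncols) {
    return matrix_elems(ld, ncols) * sizeof(double);
}

/* Hypothetical helper: 0-based column-major offset of element (i, j). */
static size_t elem_offset(int i, int j, int ld) {
    assert(i >= 0 && j >= 0 && i < ld);
    return (size_t)i + (size_t)j * (size_t)ld;
}

/* Hypothetical helper: does a size fit in a signed 32-bit-style int? */
static int fits_in_int(size_t v) {
    return v <= (size_t)INT_MAX;
}
```

With the values above, ld_is_valid(20000, 20016) holds for both LDA and LDB, matrix_elems(20016, 20000) fits in an int, and matrix_bytes(20016, 20000) does not, so any size or offset that is computed in bytes with plain int arithmetic would be worth double-checking.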
Thanks!
