PLASMA
2.4.5
PLASMA - Parallel Linear Algebra for Scalable Multi-core Architectures
Macros
#define A(m, n) BLKADDR(A, double, m, n)
#define AMAX1BUF_SIZE (48 << 1)
Functions
void CORE_dgetrf_rectil_init (void)
int CORE_dgetrf_rectil (const PLASMA_desc A, int *IPIV, int *info)
void QUARK_CORE_dgetrf_rectil (Quark *quark, Quark_Task_Flags *task_flags, PLASMA_desc A, double *Amn, int size, int *IPIV, PLASMA_sequence *sequence, PLASMA_request *request, PLASMA_bool check_info, int iinfo, int nbthread)
void CORE_dgetrf_rectil_quark (Quark *quark)
PLASMA core_blas kernel. PLASMA is a software package provided by Univ. of Tennessee, Univ. of California Berkeley and Univ. of Colorado Denver.
Generated (double precision, d): Tue Nov 22 14:35:21 2011
Definition in file core_dgetrf_rectil.c.
#define A(m, n) BLKADDR(A, double, m, n)
Definition at line 24 of file core_dgetrf_rectil.c.
#define AMAX1BUF_SIZE (48 << 1)
CORE_dgetf2 computes an LU factorization of a general M-by-N matrix A using partial pivoting with row interchanges.
WARNING: This kernel cannot be called on different matrices at the same time.
The factorization has the form
A = P * L * U
where P is a permutation matrix, L is lower triangular with unit diagonal elements (lower trapezoidal if m > n), and U is upper triangular (upper trapezoidal if m < n).
This is the right-looking LAPACK Level 2 BLAS version of the algorithm.
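The description above can be illustrated with a minimal sketch of an unblocked, right-looking LU factorization with partial pivoting. This is not the PLASMA kernel: the name `lu_getf2_sketch`, the helper `abs_double`, the column-major layout, and the 1-based pivot convention are assumptions chosen to mirror the parameter descriptions on this page.

```c
#include <stddef.h>

/* Absolute value without pulling in <math.h>. */
static double abs_double(double x) { return x < 0.0 ? -x : x; }

/*
 * Hypothetical helper (not a PLASMA routine): unblocked, right-looking LU
 * with partial pivoting on a column-major m-by-n array with leading
 * dimension lda. On exit, a holds L (unit diagonal not stored) and U, and
 * ipiv holds 1-based pivot rows. Returns 0 on success, -k if the k-th
 * argument is illegal, or k > 0 if U(k,k) is exactly zero.
 */
int lu_getf2_sketch(int m, int n, double *a, int lda, int *ipiv)
{
    if (m < 0) return -1;
    if (n < 0) return -2;
    if (lda < (m > 1 ? m : 1)) return -4;

    int info = 0;
    int kmax = m < n ? m : n;

    for (int j = 0; j < kmax; j++) {
        double *col = a + (size_t)j * lda;

        /* Pivot search: largest magnitude entry on or below the diagonal. */
        int p = j;
        for (int i = j + 1; i < m; i++)
            if (abs_double(col[i]) > abs_double(col[p])) p = i;
        ipiv[j] = p + 1;

        if (col[p] != 0.0) {
            /* Interchange rows j and p across all n columns. */
            if (p != j) {
                for (int c = 0; c < n; c++) {
                    double *cc = a + (size_t)c * lda;
                    double t = cc[j]; cc[j] = cc[p]; cc[p] = t;
                }
            }
            /* Scale the subdiagonal part of column j to form L. */
            for (int i = j + 1; i < m; i++) col[i] /= col[j];
        } else if (info == 0) {
            info = j + 1; /* record the first exactly-zero pivot */
        }

        /* Right-looking step: rank-1 update of the trailing submatrix. */
        for (int c = j + 1; c < n; c++) {
            double *cc = a + (size_t)c * lda;
            for (int i = j + 1; i < m; i++) cc[i] -= col[i] * cc[j];
        }
    }
    return info;
}
```

Each iteration touches the whole trailing submatrix, which is what makes this variant right-looking; the work per step is a matrix-vector style (Level 2) update rather than a blocked matrix-matrix product.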
[in] | M | The number of rows of the matrix A. M >= 0. |
[in] | N | The number of columns of the matrix A. N >= 0. |
[in,out] | A | On entry, the m by n matrix to be factored. On exit, the factors L and U from the factorization A = P*L*U; the unit diagonal elements of L are not stored. |
[in] | LDA | The leading dimension of the array A. LDA >= max(1,M). |
[out] | IPIV | The pivot indices; for 1 <= i <= min(M,N), row i of the matrix was interchanged with row IPIV(i). |
[out] | INFO | If INFO = k > 0, U(k,k) is exactly zero. The factorization has been completed, but the factor U is exactly singular, and division by zero will occur if it is used to solve a system of equations. |
PLASMA_SUCCESS | successful exit |
-k | the k-th argument had an illegal value |
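As a hedged illustration of how the IPIV and INFO values described above are typically consumed, the sketch below applies the recorded row interchanges to a right-hand-side vector and classifies a status code. The names `apply_row_interchanges`, `classify_info`, and the `lu_status` enum are hypothetical, and the code assumes 1-based pivot indices and a success value of 0.

```c
/* Hypothetical status categories for illustration; not PLASMA identifiers. */
enum lu_status { LU_OK, LU_BAD_ARGUMENT, LU_SINGULAR };

/*
 * Apply the interchanges recorded in ipiv to the vector b, in order:
 * for i = 1..k, row i is swapped with row ipiv[i] (1-based, as in the
 * IPIV description). Assumes b has at least max(ipiv) entries.
 */
void apply_row_interchanges(int k, const int *ipiv, double *b)
{
    for (int i = 0; i < k; i++) {
        int p = ipiv[i] - 1; /* convert to a 0-based row index */
        if (p != i) {
            double t = b[i];
            b[i] = b[p];
            b[p] = t;
        }
    }
}

/*
 * Interpret a status code following the conventions listed above:
 * 0 means success (assumed to match PLASMA_SUCCESS), a negative value -k
 * flags the k-th argument, and a positive k means U(k,k) is exactly zero.
 */
enum lu_status classify_info(int info)
{
    if (info == 0) return LU_OK;
    return info < 0 ? LU_BAD_ARGUMENT : LU_SINGULAR;
}
```

The interchanges must be applied in increasing order of i, because each swap acts on the vector as already permuted by the previous ones.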
Definition at line 85 of file core_dgetrf_rectil.c.
int CORE_dgetrf_rectil (const PLASMA_desc A, int *IPIV, int *info)
Definition at line 653 of file core_dgetrf_rectil.c.
References coreblas_error, plasma_desc_t::m, min, plasma_desc_t::mt, plasma_desc_t::n, and plasma_desc_t::nt.
void CORE_dgetrf_rectil_init (void)
Definition at line 92 of file core_dgetrf_rectil.c.
References AMAX1BUF_SIZE.
void CORE_dgetrf_rectil_quark (Quark *quark)
Definition at line 726 of file core_dgetrf_rectil.c.
References A, CORE_dgetrf_rectil(), IPIV, plasma_sequence_flush(), PLASMA_SUCCESS, QUARK_Get_RankInTask(), and quark_unpack_args_8.
void QUARK_CORE_dgetrf_rectil (Quark *quark, Quark_Task_Flags *task_flags, PLASMA_desc A, double *Amn, int size, int *IPIV, PLASMA_sequence *sequence, PLASMA_request *request, PLASMA_bool check_info, int iinfo, int nbthread)
Definition at line 699 of file core_dgetrf_rectil.c.
References CORE_dgetrf_rectil_quark(), DAG_CORE_GETRF, INOUT, plasma_desc_t::n, OUTPUT, QUARK_Insert_Task(), and VALUE.