News
Displaying 1-5 of 53 Entries
MAGMA 2.0 released
2016-02-09

MAGMA 2.0 is now available from the Downloads section. This includes a major interface change, adding a queue to all MAGMA BLAS functions. Most higher level functions such as magma_zgetrf have not changed their interface. MAGMA sparse has added new solvers (IDR, QMR, CGS) as well as performance improvements.


MAGMA 2.0 beta released
2016-01-06

MAGMA 2.0 beta has been released. This includes a major interface change, adding a queue to all MAGMA BLAS functions. Most higher level functions such as magma_zgetrf have not changed their interface. MAGMA sparse has added new solvers (IDR, QMR, CGS) as well as performance improvements.

See the Downloads section for a download link.


MAGMA 1.7.0 released
2015-09-15
MAGMA 1.7.0 is now available, featuring:
  • Added results archive to compare historical performance.
  • Added Fortran code to example directory.
  • Added magmaf_wtime for consistency with other Fortran interfaces; deprecated magma_wtime_f.
  • Added and template batched MAGMA BLAS routine gemm, gemv, herk, trsv, and trsm.
  • Tuned batched MAGMA BLAS routines, in particular gemm, gemv, herk, and trsm.
  • Tuned batched MAGMA LAPACK routines, in particular Cholesky factorizations.
  • Tuned two stage symmetric eigenvalue code, {sy|he}heevdx_2stage, to improve performance.
  • Tuned symmetric eigenvalue code, {sy|he}evd, to improve performance for N < 2000.
  • Fixed NaN result with {sy|he}mv and {sy|he}mv_mgpu if GPU shared memory had NaN.
  • Fixed Fortran constants (MagmaTrans, MagmaUpper, etc.).
  • Fixed workspace requirements for the two stage symmetric eigenvalue problem
    {sy|he}heevdx_2stage and multi-GPU {sy|he}heevdx_2stage_m.
  • Fixed workspace requirements for Hessenberg (gehrd and gehrd_m) and multi-GPU geev_m.
  • Fixed trtri for unit diagonal, and added tester.
  • Fixed testing check for inverse (getri).
  • Fixed multi-GPU {or|un}gqr_m for some k < n. (Currently only used in geev_m with m = n = k.)
  • Fixed bug for batched routines.
  • Rename lapack_const to lapack_const_str, to avoid name conflict with PLASMA.
  • Allow CMake build without Fortran (already existed for make).
MAGMA sparse
  • Added Induced Dimension Reduction Iterative solver (IDR).
  • Added iterative sparse triangular solves for incomplete factorization preconditioners.

See the Downloads section for a download link.


MAGMA MIC 1.4.0 for Intel Xeon Phi Coprocessors Released
2015-07-12

MAGMA MIC 1.4.0 is now available. This release provides implementations for MAGMA's one-sided (LU, QR, and Cholesky) and two-sided (Hessenberg, bi- and tridiagonal reductions) dense matrix factorizations, as well as linear and eigenproblem solver for Intel Xeon Phi Coprocessors. More information on the approach is given in this presentation.

The MAGMA MIC 1.4.0 release adds
  • Added port of MAGMA Sparse, including:
    CG, GMRES, BiCGSTAB (support for both hybrid and native versions);
    Auxiliary routines;
    Preconditioned versions.
  • Added mixed-precision iterative refinement auxiliary routines and a solver for symmetric and positive definite matrices
    {zc|ds}posv_mic
  • Improved dsymv and dgemv in expert interface
  • Added auxiliary bulge chasing routines used in two-stage eigensolvers
  • Accelerated reductions to tridiagonal (dsytrd) and upper Hessenberg form (dgehrd) using the expert dsymv and dgemv, respectively
  • Added test drivers and benchmarking routines.

See the Software section for a download link.


MAGMA 1.6.2 released
2015-05-05
MAGMA 1.6.2 is now available. This is primarily a bug fix for MAGMA sparse.
  • Added magma_{s,d,c,z}sqrt for real and complex scalar square root.
  • Added magma_ceildiv and magma_roundup.
  • Fixed magmablas_zlaset and magmablas_zlacpy for large M or N > 4M.
  • Fixed testers for geqrf_batched and trsm_batched to compile with CUDA 5.x.
MAGMA sparse
  • All allocation failures and other errors now return error codes.
  • cuSPARSE error codes mapped to MAGMA error codes.
  • LOBPCG sparse eigensolver enabled for preconditioning using Jacobi and incomplete LU factorizations.
  • Some name changes in MAGMA sparse for consistency with dense MAGMA.
  • Added a tester for the sparse level 1 BLAS.
  • Bug fix in complex FGMRES.
  • Added iterative incomplete factorization routines (iterative ILU/iterative IC).
  • Enhance the ILU/IC with fill-in (level-ILU).

See the Downloads section for a download link.


Displaying 1-5 of 53 Entries
Feb 11 2016 Admin Login