News
Displaying 11-15 of 41 Entries
clMAGMA 1.1 Beta Released
2013-11-17
clMAGMA 1.1 Beta is now available. clMAGMA is an OpenCL port of the MAGMA library. This release adds the following new functionality:
  • Multi-GPU implementations of the LU, QR, and Cholesky factorizations;
  • LU, QR, and Cholesky factorizations and solvers with CPU interfaces;
  • Multi-buffer LU, QR, and Cholesky factorizations that overcome the size limit of a single memory allocation, enabling the solution of large problems;
  • Performance improvements.
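
The multi-buffer item above addresses the fact that OpenCL devices cap the size of a single allocation (reported as CL_DEVICE_MAX_MEM_ALLOC_SIZE). One way to picture the idea is to split a large matrix into column panels that each fit in one buffer. The following is a minimal sketch of that partitioning arithmetic only; the helper name `split_columns` and the column-panel layout are assumptions made for illustration, not a description of clMAGMA's actual scheme.

```python
def split_columns(m, n, elem_size, max_alloc_bytes):
    """Partition an m-by-n column-major matrix into column panels.

    Returns a list of (first_column, num_columns) pairs such that each
    panel occupies at most max_alloc_bytes. Hypothetical helper.
    """
    col_bytes = m * elem_size
    if col_bytes > max_alloc_bytes:
        raise ValueError("a single column does not fit in one buffer")
    cols_per_buffer = max_alloc_bytes // col_bytes
    panels = []
    start = 0
    while start < n:
        width = min(cols_per_buffer, n - start)
        panels.append((start, width))
        start += width
    return panels


# A 40000 x 40000 double-precision matrix needs 12.8 GB in total;
# with an assumed 1 GiB per-allocation limit it is spread over several buffers.
panels = split_columns(40000, 40000, 8, 1 << 30)
print(len(panels), panels[0], panels[-1])
```

Each pair could then be mapped to its own device buffer, with the factorization addressing columns through the panel index rather than through one contiguous allocation.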

 

See the MAGMA software homepage for a download link.

MAGMA 1.4 Released
2013-08-14
MAGMA 1.4 is now available. This release provides performance improvements and support for the new NVIDIA Kepler GPUs. More information is given in the presentation "MAGMA: a New Generation of Linear Algebra Libraries for GPU and Multicore Architectures". The MAGMA 1.4 release includes the following changes:
  • Merge libmagmablas into libmagma to eliminate circular dependencies.
    Link with just -lmagma now;
  • Add multi-GPU Hessenberg and non-symmetric eigenvalue routines:
    geev_m, gehrd_m, unghr_m, ungqr_m;
  • Fix required workspace size in gels_gpu, gels3_gpu, geqrs_gpu, geqrs3_gpu;
  • Fix required workspace size in [zcsd]geqrf;
  • Add macro USE_INT64 to compile with int being 64-bit. See make.inc.int64;
  • Add panel factorizations for LU, QR, and Cholesky that run entirely on the GPU, in [zcsd]getf2_gpu, [zcsd]geqr2_gpu, and [zcsd]potf2_gpu, respectively;
  • Add QR with pivoting in the GPU interface (functions [zcsd]geqp3_gpu), and improve the performance of QR with pivoting in both the CPU and GPU interfaces;
  • Add multi-GPU symmetric eigenvalue routines (one-stage):
    [zhe|che|ssy|dsy]trd_mgpu,
    [zhe|che|ssy|dsy]evd_m, [zhe|che|ssy|dsy]evdx_m,
    [zhe|che|ssy|dsy]gvd_m, [zhe|che|ssy|dsy]gvdx_m;
  • Add single and multi-GPU symmetric eigenvalue routines (two-stage):
    [zhe|che|ssy|dsy]evdx_2stage, [zhe|che|ssy|dsy]gvdx_2stage,
    [zhe|che|ssy|dsy]evdx_2stage_m, [zhe|che|ssy|dsy]gvdx_2stage_m.
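
The bracketed prefixes in the list above follow the LAPACK naming convention: z, c, s, and d stand for double-complex, single-complex, single-real, and double-real precision, while he and sy mark Hermitian and symmetric matrices. As a reading aid, here is a small sketch that expands one such pattern into concrete routine names. The helper `expand` is hypothetical and only handles a single leading group; it also accepts the brace form used elsewhere on this page.

```python
def expand(pattern):
    """Expand a leading [..] or {..} precision group into concrete names."""
    if pattern[0] not in "[{":
        return [pattern]
    close = "]" if pattern[0] == "[" else "}"
    end = pattern.index(close)
    group, rest = pattern[1:end], pattern[end + 1:]
    # "zhe|che|ssy|dsy" lists alternatives; "zcsd" lists one-letter prefixes.
    prefixes = group.split("|") if "|" in group else list(group)
    return [p + rest for p in prefixes]


print(expand("[zcsd]geqp3_gpu"))
print(expand("[zhe|che|ssy|dsy]evd_m"))
```

In the C API these names carry a `magma_` prefix, so the double-precision entry of the first pattern corresponds to `magma_dgeqp3_gpu`.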

See the Downloads section for a download link.


MAGMA 1.4 Beta Released
2013-06-20
MAGMA 1.4 Beta is now available. This release provides performance improvements and support for the new NVIDIA Kepler GPUs. More information is given in the SC12 presentation "MAGMA: a New Generation of Linear Algebra Libraries for GPU and Multicore Architectures". The MAGMA 1.4 Beta release includes the following changes:
  • Merge libmagmablas into libmagma to eliminate circular dependencies.
    Link with just -lmagma now;
  • Add multi-GPU Hessenberg and non-symmetric eigenvalue routines:
    geev_m, gehrd_m, unghr_m, ungqr_m;
  • Fix required workspace size in gels_gpu, gels3_gpu, geqrs_gpu, geqrs3_gpu;
  • Fix required workspace size in [zcsd]geqrf;
  • Add macro USE_INT64 to compile with int being 64-bit. See make.inc.int64;
  • Add panel factorizations for LU, QR, and Cholesky that run entirely on the GPU, in [zcsd]getf2_gpu, [zcsd]geqr2_gpu, and [zcsd]potf2_gpu, respectively;
  • Add QR with pivoting in the GPU interface (functions [zcsd]geqp3_gpu), and improve the performance of QR with pivoting in both the CPU and GPU interfaces;
  • Add multi-GPU symmetric eigenvalue routines (one-stage):
    [zhe|che|ssy|dsy]trd_mgpu,
    [zhe|che|ssy|dsy]evd_m, [zhe|che|ssy|dsy]evdx_m,
    [zhe|che|ssy|dsy]gvd_m, [zhe|che|ssy|dsy]gvdx_m;
  • Add single and multi-GPU symmetric eigenvalue routines (two-stage):
    [zhe|che|ssy|dsy]evdx_2stage, [zhe|che|ssy|dsy]gvdx_2stage,
    [zhe|che|ssy|dsy]evdx_2stage_m, [zhe|che|ssy|dsy]gvdx_2stage_m.
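
The USE_INT64 item above becomes relevant once a matrix has more entries than a 32-bit signed integer can index. The sketch below shows only that arithmetic for a column-major matrix; the helper name `needs_int64` is hypothetical, and the check is an illustration rather than a description of how MAGMA computes offsets internally.

```python
INT32_MAX = 2**31 - 1


def needs_int64(lda, n):
    """True if the largest 0-based column-major offset exceeds int32."""
    return lda * n - 1 > INT32_MAX


# Square matrices around the point where 32-bit offsets run out.
for size in (40000, 46340, 46341, 50000):
    print(size, needs_int64(size, size))
```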

See the Software section for a download link.


MAGMA MIC 1.0 for Intel Xeon Phi Coprocessors Released
2013-05-03
MAGMA MIC 1.0 is now available. This release provides implementations for MAGMA's one-sided (LU, QR, and Cholesky) and two-sided (Hessenberg, bi- and tridiagonal reductions) dense matrix factorizations for Intel Xeon Phi Coprocessors. More information on the approach is given in this presentation.
The MAGMA MIC 1.0 release adds the following new functionality:
  • Added multiple MIC LU factorization (routines {z|c|d|s}getrf_mmic)
  • Added multiple MIC QR factorization (routines {z|c|d|s}geqrf_mmic)
  • Added multiple MIC Cholesky factorization (routines {z|c|d|s}potrf_mmic)
  • Performance improvements for the single MIC LU, QR, and Cholesky factorizations
  • Added LU factorization in CPU interface
  • Added mixed-precision iterative refinement LU solver (with CPU and MIC interfaces)
  • Added reduction to band diagonal for Hermitian/symmetric matrices (routines {z|c|d|s}hetrd_he2hb)
  • Added Hessenberg reduction algorithm ({z|c|d|s}gehrd)
  • Added reduction to tridiagonal for Hermitian/symmetric matrices (routines {zhe|che|dsy|ssy}trd)
  • Added reduction to bidiagonal (routines {z|c|d|s}gebrd)
  • Added {zun|cun|dor|sor}gqr
  • Added {zun|cun|dor|sor}ghr
  • Added {zun|cun|dor|sor}mqr_mic
  • Added GEMV benchmark to test MIC's bandwidth.
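
The mixed-precision iterative refinement LU solver listed above follows a well-known pattern: factor the matrix in single precision, then recover double-precision accuracy by repeatedly solving for a correction from residuals computed in double. What follows is a minimal pure-Python sketch of that pattern under stated assumptions, not MAGMA MIC code. Single precision is emulated by rounding through `struct`, and all function names are hypothetical.

```python
import struct


def f32(x):
    """Round a Python float (double) to the nearest single-precision value."""
    return struct.unpack("f", struct.pack("f", x))[0]


def lu_factor_f32(a):
    """LU with partial pivoting, every result rounded to single precision."""
    n = len(a)
    lu = [[f32(v) for v in row] for row in a]
    piv = list(range(n))
    for k in range(n):
        p = max(range(k, n), key=lambda i: abs(lu[i][k]))
        lu[k], lu[p] = lu[p], lu[k]
        piv[k], piv[p] = piv[p], piv[k]
        for i in range(k + 1, n):
            lu[i][k] = f32(lu[i][k] / lu[k][k])
            for j in range(k + 1, n):
                lu[i][j] = f32(lu[i][j] - f32(lu[i][k] * lu[k][j]))
    return lu, piv


def lu_solve_f32(lu, piv, b):
    """Solve with the single-precision factors (forward, then back substitution)."""
    n = len(lu)
    y = [f32(b[piv[i]]) for i in range(n)]
    for i in range(n):
        for j in range(i):
            y[i] = f32(y[i] - f32(lu[i][j] * y[j]))
    for i in reversed(range(n)):
        for j in range(i + 1, n):
            y[i] = f32(y[i] - f32(lu[i][j] * y[j]))
        y[i] = f32(y[i] / lu[i][i])
    return y


def solve_mixed(a, b, iters=5):
    """Single-precision LU plus double-precision residual refinement."""
    lu, piv = lu_factor_f32(a)
    x = lu_solve_f32(lu, piv, b)  # initial single-precision solution
    for _ in range(iters):
        # Residual in double precision (plain Python floats).
        r = [bi - sum(aij * xj for aij, xj in zip(row, x))
             for row, bi in zip(a, b)]
        d = lu_solve_f32(lu, piv, r)  # correction from the cheap factors
        x = [xi + di for xi, di in zip(x, d)]
    return x


A = [[4.1, 1.3, 2.7], [1.3, 5.2, 0.9], [2.7, 0.9, 6.3]]
x_true = [1.0, -2.0, 3.0]
b = [sum(aij * xj for aij, xj in zip(row, x_true)) for row in A]


def max_err(x):
    return max(abs(xi - ti) for xi, ti in zip(x, x_true))


x_single = lu_solve_f32(*lu_factor_f32(A), b)
x_refined = solve_mixed(A, b)
print("single only:", max_err(x_single))
print("refined:    ", max_err(x_refined))
```

The appeal of the approach is that the expensive O(n^3) factorization runs in the faster precision, while each refinement step costs only O(n^2). A production solver would also stop on a residual tolerance and fall back to a full double-precision solve if refinement stalls; the fixed iteration count here is a simplification.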

 

See the Software section for a download link.


MAGMA MIC 1.0 Beta for Intel Xeon Phi Coprocessors Released
2013-03-12
MAGMA MIC 1.0 Beta is now available. This release provides implementations for MAGMA's one-sided (LU, QR, and Cholesky) and two-sided (Hessenberg, bi- and tridiagonal reductions) dense matrix factorizations for Intel Xeon Phi Coprocessors. More information on the approach is given in this presentation.
The MAGMA MIC 1.0 Beta release adds the following new functionality:
  • Added multiple MIC LU factorization (routines {z|c|d|s}getrf_mmic)
  • Added multiple MIC QR factorization (routines {z|c|d|s}geqrf_mmic)
  • Added multiple MIC Cholesky factorization (routines {z|c|d|s}potrf_mmic)
  • Performance improvements for the single MIC LU, QR, and Cholesky factorizations
  • Added LU factorization in CPU interface
  • Added mixed-precision iterative refinement LU solver (with CPU and MIC interfaces)
  • Added reduction to band diagonal for Hermitian/symmetric matrices (routines {z|c|d|s}hetrd_he2hb)
  • Added Hessenberg reduction algorithm ({z|c|d|s}gehrd)
  • Added reduction to tridiagonal for Hermitian/symmetric matrices (routines {zhe|che|dsy|ssy}trd)
  • Added reduction to bidiagonal (routines {z|c|d|s}gebrd)
  • Added {zun|cun|dor|sor}gqr
  • Added {zun|cun|dor|sor}ghr
  • Added {zun|cun|dor|sor}mqr_mic
  • Added GEMV benchmark to test MIC's bandwidth.
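
GEMV suits a bandwidth test because it is memory-bound: y = A*x reads each matrix entry once and performs only two floating-point operations per entry. Below is a rough sketch of how a measured time might be converted into effective bandwidth, assuming a simple traffic model (A read once, x read once, y written once). The function names and the sample timing are made up for illustration and are not MAGMA MIC benchmark results.

```python
def gemv_bandwidth_gbs(m, n, elem_size, seconds):
    """Effective bandwidth in GB/s for y = A*x with an m-by-n matrix A."""
    bytes_moved = (m * n + n + m) * elem_size  # A, x, and y
    return bytes_moved / seconds / 1e9


def gemv_gflops(m, n, seconds):
    """GFLOP/s for the same operation: one multiply and one add per entry."""
    return 2.0 * m * n / seconds / 1e9


# Hypothetical timing: a 10000 x 10000 double-precision GEMV in 8 ms.
bw = gemv_bandwidth_gbs(10000, 10000, 8, 0.008)
gf = gemv_gflops(10000, 10000, 0.008)
print(f"{bw:.1f} GB/s, {gf:.1f} GFLOP/s")
```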

 

See the Software section for a download link.


Sep 02 2014