Software
MAGMA Downloads

MAGMA provides implementations for CUDA, Intel Xeon Phi, and OpenCL. The latest releases are MAGMA 2.5.0, MAGMA MIC 1.4.0, and clMAGMA 1.3, respectively. The libraries available for download are listed below by release date, most recent first.

Please use any of the following publications to reference MAGMA.

 


 

MAGMA 2.5.0
2019-01-02

MAGMA 2.5.0 is now released. Updates include:

  • New routines: MAGMA is releasing the NVIDIA Tensor Cores version of its mixed-precision linear solver, which provides an FP64 solution with up to 4x speedup by using fast FP16 Tensor Cores arithmetic. The release includes:
    magma_dhgesv_iteref_gpu (FP64-FP16 solver with FP64 input and solution);
    magma_dsgesv_iteref_gpu (FP64-FP32 solver with FP64 input and solution);
    magma_hgetrf_gpu        (mixed precision FP32-FP16 LU factorization);
    magma_htgetrf_gpu       (mixed precision FP32-FP16 LU factorization using Tensor Cores).
    Further details for the function names and the testing routines are given in file:
    README_FP16_Iterative_Refinement.txt
  • New routine: magmablas_Xgemm_batched_strided (X = {s, d, c, z}) is the stride-based variant of magmablas_Xgemm_batched;
  • New routine: magma_Xgetrf_native (X = {s, d, c, z}) performs the LU factorization with partial pivoting using the GPU only. It has the same interface as the hybrid (CPU+GPU) implementation provided by magma_Xgetrf_gpu. Testing the performance of this routine is possible through running testing_Xgetrf_gpu with the option (--version 3);
  • New routine: magma_Xpotrf_native (X = {s, d, c, z}) performs the Cholesky factorization using the GPU only. It has the same interface as the hybrid (CPU+GPU) implementation provided by magma_Xpotrf_gpu.
    Testing the performance of this routine is possible through running testing_Xpotrf_gpu with the option (--version 2);
  • Added benchmark for GEMM in FP16 arithmetic (HGEMM) as well as auxiliary functions to cast matrices from FP32 to FP16 storage (magmablas_slag2h) and from FP16 to FP32 (magmablas_hlag2s).
magma-2.5.0.tar.gz
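
The idea behind the mixed-precision solvers listed above can be illustrated with a small sketch. The code below is not MAGMA code and does not call any MAGMA routine; it is a plain Python illustration, under simplifying assumptions, of iterative refinement: the matrix is factorized once in low precision (FP16, emulated here by rounding every stored value through the `struct` module's half-precision format), and the solution is then corrected using residuals computed in FP64. All function names (`to_fp16`, `lu_fp16`, `lu_solve`, `refine_solve`) are hypothetical, and the triangular solves are done in FP64 for simplicity.

```python
import struct


def to_fp16(x):
    """Round an FP64 value to the nearest IEEE half-precision (FP16) value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def lu_fp16(a):
    """LU factorization with partial pivoting; stored values are rounded to FP16."""
    n = len(a)
    lu = [[to_fp16(v) for v in row] for row in a]
    piv = list(range(n))  # piv[k] = original index of the row now at position k
    for k in range(n):
        p = max(range(k, n), key=lambda i: abs(lu[i][k]))
        lu[k], lu[p] = lu[p], lu[k]
        piv[k], piv[p] = piv[p], piv[k]
        for i in range(k + 1, n):
            lu[i][k] = to_fp16(lu[i][k] / lu[k][k])
            for j in range(k + 1, n):
                lu[i][j] = to_fp16(lu[i][j] - lu[i][k] * lu[k][j])
    return lu, piv


def lu_solve(lu, piv, b):
    """Solve with the (low-accuracy) factors: forward, then backward substitution."""
    n = len(lu)
    y = [b[piv[i]] for i in range(n)]  # apply the row permutation to b
    for i in range(n):
        for j in range(i):
            y[i] -= lu[i][j] * y[j]  # unit lower-triangular solve
    for i in reversed(range(n)):
        for j in range(i + 1, n):
            y[i] -= lu[i][j] * y[j]
        y[i] /= lu[i][i]  # upper-triangular solve
    return y


def refine_solve(a, b, tol=1e-12, max_iter=50):
    """Solve a x = b to FP64 accuracy using an FP16-rounded LU factorization."""
    lu, piv = lu_fp16(a)
    x = lu_solve(lu, piv, b)  # initial low-accuracy solution
    for it in range(max_iter):
        # The residual is computed in full FP64 precision with the original matrix.
        r = [bi - sum(aij * xj for aij, xj in zip(row, x)) for row, bi in zip(a, b)]
        if max(abs(v) for v in r) <= tol * max(abs(v) for v in b):
            return x, it
        d = lu_solve(lu, piv, r)  # correction obtained from the cheap factors
        x = [xi + di for xi, di in zip(x, d)]
    return x, max_iter


if __name__ == "__main__":
    A = [[4.1, 1.3, 2.2], [1.7, 5.3, 0.9], [2.4, 1.1, 6.7]]
    x_true = [1.0, -2.0, 3.0]
    b = [sum(aij * xj for aij, xj in zip(row, x_true)) for row in A]
    x, iters = refine_solve(A, b)
    print("solution:", x, "refinement steps:", iters)
```

The factorization is the expensive part and is done only once in low precision; each refinement step costs only a residual and two triangular solves. The sketch converges for a well-conditioned matrix such as the one above; it makes no claim about how the MAGMA routines handle convergence failure or fall back to a full FP64 solve.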

MagmaDNN 0.2
2019-01-01

MagmaDNN 0.2 is now available. MagmaDNN provides high-performance data analytics and machine learning tools using MAGMA as its computational backend. Updates in this release include:

  • Bug fixes and performance improvements;
  • Winograd convolutions to accelerate CNNs;
  • Hyperparameter optimization framework;
  • MNIST and CIFAR-10 benchmarks using MagmaDNN;
  • Performance comparisons, accuracy validations, etc. (with TensorFlow, Theano, and PyTorch).

More information on MagmaDNN 0.2 is given in this presentation.

MagmaDNN's repository is on Bitbucket: https://bitbucket.org/icl/magmadnn.

magmadnn-0.2.0.tar.gz

MAGMA MIC 1.4.0
2015-07-12

MAGMA MIC 1.4.0 is now available. This release provides implementations for MAGMA's one-sided (LU, QR, and Cholesky) and two-sided (Hessenberg, bi- and tridiagonal reductions) dense matrix factorizations, as well as linear and eigenproblem solvers for Intel Xeon Phi coprocessors. More information on the approach is given in this presentation.

magmamic-1.4.0.tar.gz

clMAGMA 1.3
2014-11-14

clMAGMA is an OpenCL port of MAGMA. It supports AMD GPUs. The clMAGMA library dependencies, in particular the optimized GPU OpenCL BLAS and the CPU-optimized BLAS and LAPACK for AMD hardware, can be found in the AMD clMath Libraries (formerly APPML).

Included in the clMAGMA 1.3 release are routines for the following algorithms:

  • LU, QR, and Cholesky factorizations in both real and complex arithmetic (single and double);
  • Linear and least squares solvers based on the LU/Cholesky and QR factorizations, respectively, in both real and complex arithmetic (single and double);
  • Reductions to Hessenberg, bidiagonal, and tridiagonal forms using orthogonal similarity transformations in both real and complex arithmetic (single and double);
  • Eigen and singular value problem solvers in both real and complex arithmetic (single and double);
  • Orthogonal transformation routines.
clmagma-1.3.0.tar.gz
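
As an illustration of what a factorization-based linear solver from the list above does, here is a minimal Python sketch of solving a symmetric positive definite system through a Cholesky factorization followed by two triangular solves. It is not clMAGMA code and uses no clMAGMA or OpenCL calls; the function names `cholesky` and `cholesky_solve` are hypothetical, and the code handles only the real case.

```python
import math


def cholesky(a):
    """Return the lower-triangular factor L with a = L * L^T (a must be SPD)."""
    n = len(a)
    l = [[0.0] * n for _ in range(n)]
    for j in range(n):
        s = a[j][j] - sum(l[j][k] ** 2 for k in range(j))
        if s <= 0.0:
            raise ValueError("matrix is not positive definite")
        l[j][j] = math.sqrt(s)
        for i in range(j + 1, n):
            l[i][j] = (a[i][j] - sum(l[i][k] * l[j][k] for k in range(j))) / l[j][j]
    return l


def cholesky_solve(l, b):
    """Solve (L * L^T) x = b by forward and then backward substitution."""
    n = len(l)
    y = [0.0] * n
    for i in range(n):  # forward substitution: L y = b
        y[i] = (b[i] - sum(l[i][k] * y[k] for k in range(i))) / l[i][i]
    x = [0.0] * n
    for i in reversed(range(n)):  # backward substitution: L^T x = y
        x[i] = (y[i] - sum(l[k][i] * x[k] for k in range(i + 1, n))) / l[i][i]
    return x


if __name__ == "__main__":
    A = [[4.0, 2.0, 2.0], [2.0, 5.0, 3.0], [2.0, 3.0, 6.0]]
    b = [14.0, 21.0, 26.0]
    L = cholesky(A)
    print("L =", L)
    print("x =", cholesky_solve(L, b))
```

The same two-stage pattern (factor once, then solve with triangular factors) applies to the LU-based solvers, with a row permutation added for pivoting.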

License

Copyright © 2019 The University of Tennessee. All rights reserved.

Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:
· Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.
· Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer listed in this license in the documentation and/or other materials provided with the distribution.
· Neither the name of the copyright holders nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.

This software is provided by the copyright holders and contributors "as is" and any express or implied warranties, including, but not limited to, the implied warranties of merchantability and fitness for a particular purpose are disclaimed. In no event shall the copyright owner or contributors be liable for any direct, indirect, incidental, special, exemplary, or consequential damages (including, but not limited to, procurement of substitute goods or services; loss of use, data, or profits; or business interruption) however caused and on any theory of liability, whether in contract, strict liability, or tort (including negligence or otherwise) arising in any way out of the use of this software, even if advised of the possibility of such damage.
