Publications
ICL PublicationsOther Publications

   

Showing records 1 - 10 of 78

Haidar, A., Luszczek, P., Tomov, S., Dongarra, J. "Batched Matrix Computations on Hardware Accelerators," EuroMPI/Asia 2015 Workshop, Bordeaux, France, September, 2015.

Kabir, K., Haidar, A., Tomov, S., and Dongarra, J. "On the Design, Development, and Analysis of Optimized Matrix-Vector Multiplication Routines for Coprocessors," ISC High Performance 2015, Frankfurt, Germany, July 12-16, 2015.

PDF
Haidar, A., Dong, T., Tomov, S., Luszczek, P., Dongarra, J. "Framework for Batched and GPU-resident Factorization Algorithms Applied to Block Householder Transformations," ISC HPC, Springer LNCS, Frankfurt, Germany, July 12-16, 2015.

PDF
Kabir, K., Haidar, A., Tomov, S., and Dongarra, J. "Performance Analysis and Optimisation of Two-Sided Factorization Algorithms for Heterogeneous Platform," The International Conference on Computational Science (ICCS 2015), Reykjavík, Iceland, June 1-3, 2015.

PDF
Kabir, K., Haidar, A., Tomov, S., and Dongarra, J. "Performance Analysis and Design of a Hessenberg Reduction using Stabilized Blocked Elementary Transformations for New Architectures," The Spring Simulation Multi-Conference 2015 (SpringSim'15), Alexandria, VA, April 12-15, 2015.

PDF
Haidar, A., Dong, T., Luszczek, P., Tomov, S., and Dongarra, J. "Batched matrix computations on hardware accelerators based on GPUs," International Journal of High Performance Computing Applications, Sage Publications, Inc., February 9, 2015.

Haidar, A., Dong, T., Luszczek, P., Tomov, S., and Dongarra, J. "Optimization for performance and energy for batched matrix computations on GPUs," GPGPU 2015 Proceedings of the 8th Workshop on General Purpose Processing using GPUs, ACM, San Francisco, CA, pp. 59-69, February 7, 2015.

Anzt, H., Tomov, S., Dongarra, J. "Energy efficiency and performance frontiers for sparse computations on GPU supercomputers," Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores (PMAM '15), ACM, San Francisco, CA, February, 2015.

PDF
Yamazaki, I., Tomov, S., and Dongarra, J. "Mixed-Precision Cholesky QR Factorization and its Case Studies on Multicore CPU with Multiple GPUs," to appear in SIAM Journal on Scientific Computing, 2015, 2015.

Anzt, H., Sawyer, W., Tomov, S., Luszczek, P., Dongarra, J. "Acceleration of GPU-based Krylov solvers via Data Transfer Reduction," IJHPCA special issue for ASHES workshop, 2015.


Showing records 1 - 10 of 78

May 22 2015 Admin Login