Publications
PublicationsOther Publications

   

Showing records 1 - 10 of 101

Abdelfatah, A., Haidar, A., Tomov, S., Dongarra, J. "Fast Cholesky Factorization on GPUs for Batch and Native Modes in MAGMA," University of Tennessee Computer Science Technical Report, UT-EECS-16-748, December 28, 2016.

PDF
Haidar, A., Abdelfatah, A., Tomov, S., Dongarra, J. "High-performance Cholesky factorization for GPU-only execution," University of Tennessee Computer Science Technical Report, UT-EECS-16-747, December 26, 2016.

PDF
Lopez, M., Larrea, V., Joubert, W., Hernandez, O., Haidar, A., Tomov, S., Dongarra, J. "Towards Achieving Performance Portability Using Directives for Accelerators," The International Conference for High Performance Computing, Networking, Storage and Analysis (SC'16), Third Workshop on Accelerator Programming Using Directives (WACCPD), Salt Lake City, Utah, November 13-18, 2016.

PDF
Haidar, A., Tomov, S., Arturov, K., Guney, M., Story, S., Dongarra, J. "LU, QR, and Cholesky Factorizations: Programming Model, Performance Analysis and Optimization Techniques for the Intel Knights Landing Xeon Phi," IEEE High Performance Extreme Computing Conference (HPEC'16), Waltham, MA, September 13-15, 2016.

Haidar, A., Brock, B., Tomov, S., Guidry, M., Billings, J., Shyles, D., Dongarra, J. "Performance Analysis and Acceleration of Explicit Integration for Large Kinetic Networks using Batched GPU Computations," 2016 IEEE High Performance Extreme Computing Conference (HPEC ‘16), September 13-15, 2016.

PDF
Masliah, I., Abdelfattah, A., Haidar, A., Tomov, S., Baboulin, M., Falcou, J., Dongarra, J. "High-performance matrix-matrix multiplications of very small matrices," 22nd International European Conference on Parallel and Distributed Computing (Euro-Par'16), Grenoble, France, August 22-26, 2016.

PDF
Abdelfattah, A., Haidar, A., Tomov, S., Dongarra, J. "Performance, Design, and Autotuning of Batched GEMM for GPUs," The International Supercomputing Conference (ISC High Performance 2016), Frankfurt, Germany, June 19-23, 2016.

PDF
Abdelfattah, A., Haidar, A., Tomov, S., Dongarra, J. "Performance Tuning and Optimization Techniques of Fixed and Variable Size Batched Cholesky Factorization on GPUs," International Conference on Computational Science (ICCS'16), San Diego, California, U.S.A., June 6-8, 2016.

PDF
Abdelfattah, A., Haidar, A., Tomov, S., Dongarra, J. "On the Development of Variable Size Batched Computation for Heterogeneous Parallel Architectures," The 17th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC 2016), IPDPS 2016, IEEE, Chicago, IL, USA, May 27, 2016.

PDF
Newburn, CJ., Bansal, G., Wood, M., Crivelli, L., Planas, J., Duran, A., Souza, P., Borges, L., Luszczek, P., Tomov, S., Dongarra, J., Anzt, H., Gates, M., Haidar, A., Jia, Y., Kabir, K., Yamazaki, I., Labarta, J. "Heterogeneous Streaming," The Sixth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES), IPDPS 2016, IEEE, Chicago, IL, USA, May 23, 2016.

PDF

Showing records 1 - 10 of 101

Jan 22 2017 Admin Login