Publications

Showing records 1 - 10 of 99

Lopez, M., Larrea, V., Joubert, W., Hernandez, O., Haidar, A., Tomov, S., Dongarra, J. "Towards Achieving Performance Portability Using Directives for Accelerators," The International Conference for High Performance Computing, Networking, Storage and Analysis (SC'16), Third Workshop on Accelerator Programming Using Directives (WACCPD), Salt Lake City, Utah, November 13-18, 2016.

Haidar, A., Tomov, S., Arturov, K., Guney, M., Story, S., Dongarra, J. "LU, QR, and Cholesky Factorizations: Programming Model, Performance Analysis and Optimization Techniques for the Intel Knights Landing Xeon Phi," IEEE High Performance Extreme Computing Conference (HPEC'16), Waltham, MA, September 13-15, 2016.

Haidar, A., Brock, B., Tomov, S., Guidry, M., Billings, J., Shyles, D., Dongarra, J. "Performance Analysis and Acceleration of Explicit Integration for Large Kinetic Networks using Batched GPU Computations," 2016 IEEE High Performance Extreme Computing Conference (HPEC'16), September 13-15, 2016.

Masliah, I., Abdelfattah, A., Haidar, A., Tomov, S., Baboulin, M., Falcou, J., Dongarra, J. "High-performance matrix-matrix multiplications of very small matrices," 22nd International European Conference on Parallel and Distributed Computing (Euro-Par'16), Grenoble, France, August 22-26, 2016.

Abdelfattah, A., Haidar, A., Tomov, S., Dongarra, J. "Performance, Design, and Autotuning of Batched GEMM for GPUs," The International Supercomputing Conference (ISC High Performance 2016), Frankfurt, Germany, June 19-23, 2016.

Abdelfattah, A., Haidar, A., Tomov, S., Dongarra, J. "Performance Tuning and Optimization Techniques of Fixed and Variable Size Batched Cholesky Factorization on GPUs," International Conference on Computational Science (ICCS'16), San Diego, California, U.S.A., June 6-8, 2016.

Abdelfattah, A., Haidar, A., Tomov, S., Dongarra, J. "On the Development of Variable Size Batched Computation for Heterogeneous Parallel Architectures," The 17th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC 2016), IPDPS 2016, IEEE, Chicago, IL, USA, May 27, 2016.

Newburn, CJ., Bansal, G., Wood, M., Crivelli, L., Planas, J., Duran, A., Souza, P., Borges, L., Luszczek, P., Tomov, S., Dongarra, J., Anzt, H., Gates, M., Haidar, A., Jia, Y., Kabir, K., Yamazaki, I., Labarta, J. "Heterogeneous Streaming," The Sixth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES), IPDPS 2016, IEEE, Chicago, IL, USA, May 23, 2016.

Abdelfattah, A., Haidar, A., Tomov, S., Dongarra, J. "Performance, Design, and Autotuning of Batched GEMM for GPUs," University of Tennessee Computer Science Technical Report, UT-EECS-16-739, February 1, 2016.

Abdelfattah, A., Baboulin, M., Dobrev, V., Dongarra, J., Earl, C., Falcou, J., Haidar, A., Karlin, I., Kolev, Tz., Masliah, I., Tomov, S. "High-Performance Tensor Contractions for GPUs," University of Tennessee Computer Science Technical Report, UT-EECS-16-738, January 21, 2016.

