Publications

Showing records 1 - 10 of 97

Haidar, A., Tomov, S., Arturov, K., Guney, M., Story, S., Dongarra, J. "LU, QR, and Cholesky Factorizations: Programming Model, Performance Analysis and Optimization Techniques for the Intel Knights Landing Xeon Phi," IEEE High Performance Extreme Computing Conference (HPEC'16), Waltham, MA, September 13-15, 2016.

Haidar, A., Brock, B., Tomov, S., Guidry, M., Billings, J., Shyles, D., Dongarra, J. "Performance Analysis and Acceleration of Explicit Integration for Large Kinetic Networks using Batched GPU Computations," 2016 IEEE High Performance Extreme Computing Conference (HPEC'16), September 13-15, 2016.

Masliah, I., Abdelfattah, A., Haidar, A., Tomov, S., Baboulin, M., Falcou, J., Dongarra, J. "High-performance matrix-matrix multiplications of very small matrices," 22nd International European Conference on Parallel and Distributed Computing (Euro-Par'16), Grenoble, France, August 22-26, 2016.

Abdelfattah, A., Haidar, A., Tomov, S., Dongarra, J. "Performance, Design, and Autotuning of Batched GEMM for GPUs," The International Supercomputing Conference (ISC High Performance 2016), Frankfurt, Germany, June 19-23, 2016.

Abdelfattah, A., Haidar, A., Tomov, S., Dongarra, J. "Performance Tuning and Optimization Techniques of Fixed and Variable Size Batched Cholesky Factorization on GPUs," International Conference on Computational Science (ICCS'16), San Diego, California, U.S.A., June 6-8, 2016.

Abdelfattah, A., Haidar, A., Tomov, S., Dongarra, J. "On the Development of Variable Size Batched Computation for Heterogeneous Parallel Architectures," The 17th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC 2016), IPDPS 2016, IEEE, Chicago, IL, USA, May 27, 2016.

Newburn, CJ., Bansal, G., Wood, M., Crivelli, L., Planas, J., Duran, A., Souza, P., Borges, L., Luszczek, P., Tomov, S., Dongarra, J., Anzt, H., Gates, M., Haidar, A., Jia, Y., Kabir, K., Yamazaki, I., Labarta, J. "Heterogeneous Streaming," The Sixth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES), IPDPS 2016, IEEE, Chicago, IL, USA, May 23, 2016.

Abdelfattah, A., Haidar, A., Tomov, S., Dongarra, J. "Performance, Design, and Autotuning of Batched GEMM for GPUs," University of Tennessee Computer Science Technical Report, UT-EECS-16-739, February 1, 2016.

Abdelfattah, A., Baboulin, M., Dobrev, V., Dongarra, J., Earl, C., Falcou, J., Haidar, A., Karlin, I., Kolev, Tz., Masliah, I., Tomov, S. "High-Performance Tensor Contractions for GPUs," University of Tennessee Computer Science Technical Report, UT-EECS-16-738, January 21, 2016.

Anzt, H., Dongarra, J., Kreutzer, M., Wellein, G., Koehler, M. "Efficiency of general Krylov methods on GPUs – An experimental study," The Sixth International Workshop on Accelerators and Hybrid Exascale Systems (AsHES), Chicago, 2016.

