Publications
ICL Publications
   

Showing records 1 - 10 of 15

Yulu Jia, George Bosilca, Piotr Luszczek, and Jack Dongarra "Parallel Reduction to Hessenberg Form with Algorithm-Based Fault Tolerance," International Conference for High Performance Computing, Networking, Storage and Analysis, IEEE-SC 2013, Denver, CO, November, 2013.

PDF
Yulu Jia, Piotr Luszczek, George Bosilca, Jack Dongarra "CPU-GPU Hybrid Bidiagonal Reduction With Soft Error Resilience," ScalA '13 Proceedings of the Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, November, 2013.

PDF
Bosilca, G., Bouteiller, A., Herault, T., Robert, Y., and Jack Dongarra "Assessing the impact of {ABFT} and Checkpoint composite strategies," University of Tennessee Computer Science Technical Report, ICL-UT-13-03, September, 2013.

PDF
Jia, Y., Luszczek, P., Dongarra, J. "Transient Error Resilient Hessenberg Reduction on GPU-based Hybrid Architectures," University of Tennessee Computer Science Technical Report, UT-CS-13-712 (lawn279), June, 2013.

PDF
Bland, W., Bouteiller, A., Herault, T., Hursey, J., Bosilca, G., Dongarra, J.J. "An evaluation of User-Level Failure Mitigation support in MPI," Computing, Springer, Vienna, DOI 10.1007/s00607-013-0331-3, 1-14, May, 2013.

PDF
Jack Dongarra, Thomas Herault and Yves Robert "Revisiting the Double Checkpointing Algorithm," 15th Workshop on Advances in Parallel and Distributed Computational Models, at the IEEE International Parallel & Distributed Processing Symposium, Boston, MA, January, 2013.

PDF
Bland, W., Du, P., Bouteiller, A., Herault, T., Bosilca, G., Dongarra, J. "A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI," 18th International European Conference on Parallel and Distributed Computing (Euro-Par 2012) (Best Paper Award), Christos Kaklamanis, Theodore Papatheodorou and Paul Spirakis eds. Springer-Verlag, Rhodes, Greece, August 27-31, 2012.

PDF
Du, P., Luszczek, P., Dongarra, J. "High Performance Dense Linear System Solver with Resilience to Multiple Soft Errors," ICCS 2012, Omaha, NE, June, 2012.

PDF
Du, P., Bouteiller, A., Bosilca, G., Herault, T., Dongarra, J. "Algorithm-Based Fault Tolerance for Dense Matrix Factorization," Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2012, J. Ramanujam, P. Sadayappan eds. ACM, New Orleans, LA, USA, 225-234, February 25-29, 2012.

PDF
Du, P., Luszczek, P., Tomov S., Dongarra, J. "Soft Error Resilient QR Factorization for Hybrid System with GPGPU," Journal of Computational Science, Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems at SC11, Seattle, WA, November 14, 2011.

PDF

Showing records 1 - 10 of 15

Apr 16 2014 Admin Login