ICL Publications

Showing records 1 - 10 of 49

Herault, T., Bouteiller, A., Bosilca, G., Gamell, M., Teranishi, K., Parashar, M., Dongarra, J. "Practical Scalable Consensus for Pseudo-Synchronous Distributed Systems: Formal Proof," University of Tennessee Computer Science Technical Report, ICL-UT-15-01, April, 2015.

Bosilca, G., Bouteiller, A., Herault, T., Robert, Y., Dongarra, J. "Composing resilience techniques: ABFT, periodic and incremental checkpointing," International Journal of Networking and Computing (IJNC), Computer Science Journals, 501-525, January, 2015.

Benoit, A., Robert, Y., Raina, S.K. "Efficient checkpoint/verification patterns for silent error detection," University of Tennessee Computer Science Technical Report, ICL-UT-14-03, May, 2014.

Bland, W., Bouteiller, A., Herault, T., Hursey, J., Bosilca, G., Dongarra, J.J. "An evaluation of User-Level Failure Mitigation support in MPI," Computing, Springer, Vienna, DOI 10.1007/s00607-013-0331-3, 1-14, May, 2013.

Bland, W., Bouteiller, A., Herault, T., Hursey, J., Bosilca, G., Dongarra, J.J. "An evaluation of User-Level Failure Mitigation support in MPI," Computing, Vol. 95, No. 12, 1171-1184, 2013.

Bland, W. "User Level Failure Mitigation in MPI," Euro-Par 2012: Parallel Processing Workshops, Caragiannis, I., Alexander, M., Badia, R., Cannataro, M., Costan, A., Danelutto, M., Desprez, F., Krammer, B., Sahuquillo, J., Scott, S., and Weidendorfer, J. eds. Springer Berlin Heidelberg, Rhodes Island, Greece, LNCS 7640, 499-504, August, 2012.

Du, P., Bouteiller, A., Bosilca, G., Herault, T., Dongarra, J. "Algorithm-Based Fault Tolerance for Dense Matrix Factorization," Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2012, J. Ramanujam, P. Sadayappan eds. ACM, New Orleans, LA, USA, 225-234, February 25-29, 2012.

Bland, W., Bosilca, G., Bouteiller, A., Herault, T., Dongarra, J. "A Proposal for User-Level Failure Mitigation in the MPI-3 Standard," University of Tennessee Electrical Engineering and Computer Science Technical Report, ut-cs-12-693, February 24, 2012.

Bland, W., Du, P., Bouteiller, A., Herault, T., Bosilca, G., Dongarra, J. "Extending the Scope of the Checkpoint-on-Failure Protocol for Forward Recovery in Standard MPI," University of Tennessee Computer Science Technical Report, ut-cs-12-702, 2012.

Bosilca, G., Herault, T., Lemarinier, P., Rezmerita, A., Dongarra, J. "Scalable Runtime for MPI: Efficiently Building the Communication Infrastructure," Proceedings of Recent Advances in the Message Passing Interface - 18th European MPI Users' Group Meeting, EuroMPI 2011, Yiannis Cotronis, Anthony Danalis, Dimitrios S. Nikolopoulos, Jack Dongarra eds. Springer, Santorini, Greece, LNCS 6960, 342-344, September 18-21, 2011.

