View All Search View Year
Showing Records 21 to 40 out of 664 total records
Next 20 >

Agullo, E., Augonnet, C., Dongarra, J., Ltaief, H., Namyst, R., Thibault, S., and Tomov, S. "Faster, Cheaper, Better - a Hybridization Methodology to Develop Linear Algebra Software for GPUs," LAPACK Working Note 230, 2010. [ pdf ]

Agullo, E., Augonnet, C., Dongarra, J., Ltaief, H., Namyst, R., Thibault, S., Tomov, S. "A Hybridization Methodology for High-Performance Linear Algebra Software for GPUs," in GPU Computing Gems, Jade Edition, Hwu, W. eds. Elsevier, 2, 473-484, 2011.

Agullo, E., Bosilca, G., Castagnède, C., Dongarra, J., Ltaief, H., Tomov, S. "Matrices Over Runtime Systems at Exascale," Supercomputing '12 (poster), Salt Lake City, Utah, November, 2012.

Agullo, E., Coti, C., Dongarra, J., Herault, T., Langou, J. "QR Factorization of Tall and Skinny Matrices in a Grid Computing Environment," 24th IEEE International Parallel and Distributed Processing Symposium (also LAWN 224), Atlanta, GA, April 19-23, 2010. [ pdf ]

Agullo, E., Coti, C., Herault, T., Langou, J., Peyronnet, S., Rezmerita, A., Cappello, F., Dongarra, J. "QCG-OMPI: MPI Applications on Grids," Future Generation Computer Systems, Vol. 27, No. 4, pp. 357-369, April, 2011. [ pdf ]

Agullo, E., Coti, C., Herault, T., Langou, J., Peyronnet, S., Rezmerita, A., Cappello, F., Dongarra, J. "QCG-OMPI: MPI Applications on Grids.," Future Generation Computer Systems, Vol. 27, No. 4, 435-369, January, 2011. [ pdf ]

Agullo, E., Demmel, J., Dongarra, J., Hadri, B., Kurzak, J., Langou, J., Ltaief, H., Luszczek, P., Tomov, S. "Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects," Journal of Physics: Conference Series, Vol. 180, 2009. [ pdf ]

Agullo, E., Giraud, L., Guermouche, A., Haidar, A., Lanteri, S., Roman, J. "Algebraic Schwarz Preconditioning for the Schur Complement: Application to the Time-Harmonic Maxwell Equations Discretized by a Discontinuous Galerkin Method.," 20th International Conference on Domain Decomposition Methods, UC San Diego, in La Jolla, California, February 7-11, 2011.

Agullo, E., Giraud, L., Guermouche, A., Haidar, A., Roman, J. "Towards a Complexity Analysis of Sparse Hybrid Linear Solvers," PARA 2010, Reykjavik, Iceland, June 6-9, 2010.

Agullo, E., Giraud, L., Guermouche, A., Haidar, A., Roman, J. "Parallel algebraic domain decomposition solver for the solution of augmented systems.," Parallel, Distributed, Grid and Cloud Computing for Engineering, Ajaccio, Corsica, France, 12-15 April, 2011.

Agullo, E., Giraud, L., Guermouche, A., Haidar, A., Roman, J., Lee-Tin-Yen, Y. "MaPHyS or the Development of a Parallel Algebraic Domain Decomposition Solver in the Course of the Solstice Project," Sparse Days 2010 Meeting at CERFACS, Toulouse, France, June 15-17, 2010.

Agullo, E., Hadri, B., Ltaief, H., Dongarra, J. "Comparative Study of One-Sided Factorizations with Multiple Software Packages on Multi-Core Hardware," 2009 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC '09) (to appear), 2009. [ pdf ]

Alam, S. R., Barrett, R. F., Jagode, H., Kuehn, J. A., Poole, S. W. and Sankaran, R. "Impact of Quad-core Cray XT4 System and Software Stack on Scientific Computation," Euro-Par 2009, Lecture Notes in Computer Science, Delft, The Netherlands, Springer Berlin / Heidelberg, Volume 5704/2009, pp. 334-344, August 25-28, 2009. [ pdf ]

Alvaro, W., Kurzak, J., Dongarra, J. "Fast and Small Short Vector SIMD Matrix Multiplication Kernels for the CELL Processor," University of Tennessee Computer Science Technical Report, UT-CS-08-609, (also LAPACK Working Note 189), January, 2008. [ pdf ]

Alvaro, W., Kurzak, J., Dongarra, J. "Optimizing Matrix Multiplication for a Short-Vector SIMD Architecture - CELL Processor," Parallel Computing, Volume 35, pp. 138-150, 2009. [ pdf ]

Anderson, E., Bai, Z., Bischof, C., Blackford, S., Demmel, J., Dongarra, J., Du Croz, J., Greenbaum, A., Hammarling, S., McKenney, A., Sorensen, D. "LAPACK Users' Guide, 3rd ed.," Philadelphia: Society for Industrial and Applied Mathematics, 1999.

Andersson, U., Mucci, P. "Analysis and Optimization of Yee_Bench using Hardware Performance Counters," Proceedings of Parallel Computing 2005 (ParCo) (to appear), Malaga, Spain, September, 2005. [ pdf ]

Angskun, T., Bosilca, G., Dongarra, J. "Self-Healing in Binomial Graph Networks," 2nd International Workshop On Reliability in Decentralized Distributed Systems (RDDS 2007), Vilamoura, Algarve, Portugal, November, 2007. [ pdf ]

Angskun, T., Bosilca, G., Dongarra, J. "Binomial Graph: A Scalable and Fault- Tolerant Logical Network Topology," Proceedings of The Fifth International Symposium on Parallel and Distributed Processing and Applications (ISPA07), Niagara Falls, Canada, Springer, August 29-30, 2007. [ pdf ]

Angskun, T., Bosilca, G., Fagg, G., Pjesivac-Grbovic, J., Dongarra, J. "Reliability Analysis of Self-Healing Network using Discrete-Event Simulation," Proceedings of Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07), IEEE Computer Society, 437-444, May, 2007.


Showing Records 21 to 40 out of 664 total records
Next 20 >