Mathieu Faverge

 
I'm actually Alumni at the University of Tennessee in the Innovative Computing Laboratory where I work on different project related to dense linear algebra for hybrid architectures. Before that, I did my PhD jointly in Runtime and ScAlAplix teams from INRIA Bordeaux Sud-Ouest on dynamic scheduling for sparse direct solvers. I defended my thesis under the direction of Raymond Namyst and Jean Roman in December 2009. In relation to these works, I also work on different projects to analyze complex applications on distributed architectures.

Dense Linear Algebra

  • PLASMA
    The Parallel Linear Algebra for Scalable Multi-core Architectures (PLASMA) project aims to address the critical and highly disruptive situation that is facing the Linear Algebra and High Performance Computing community due to the introduction of multi-core architectures.
  • MAGMA
    The MAGMA project aims to develop a dense linear algebra library similar to LAPACK but for heterogeneous/hybrid architectures, starting with current "Multicore+GPU" systems.
  • DPLASMA
    DAGuE aims at enabling scientific computing on large scale distributed environments featuring many cores, accelerators and high speed networks. The framework includes libraries, a runtime system, and development tools to help application developers tackle the difficult task of porting their applications to highly heterogeneous and diverse environment. DPLASMA is dense linear algebra library based on this runtime system for distributed and hybrid architectures.
  • Tile-MAGMA
    Tile-MAGMA is an extension to the MAGMA project. It aims to provide a dense linear algebra library for nodes of multi-cores enhanced with multiples GPUs. This project relies on the StarPU runtime system, the GPU kernels from MAGMA and the CPU kernels and the algorithms from PLASMA.

Sparse Linear Algebra

  • PaStiX
    The Parallel Sparse matriX package is a scientific library that provides a high performance MPI/thread solver for very large sparse linear systems based on direct and block ILU(k) iterative methods.
  • Murge
    Murge is an interface definition for sparse solvers created by developpers from the HIPS and PaStiX projects to provide a common interface to both solvers. It aims to be as simple as PETSc without the overcost introduced by the internal structures.

Others

  • ViTE
    ViTE is a tool to visualize execution traces in Paje or OTF format to help user to debug and/or profile parallel applications. It is an open source software licenced under CeCILL-A.
  • EZTrace
    EZTrace is a tool that aims at generating automatically execution trace from HPC (High Performance Computing) programs. It generates execution trace files that can be interpreted by visualization tools such as ViTE.


Thesis
  1. M. Faverge. "Ordonnancement hybride statique-dynamique en algèbre linéaire creuse pour de grands clusters de machines NUMA et multi-coeurs". PhD thesis, LaBRI, Université Bordeaux I, Talence, Talence, France, 2009. [pdf] Keyword(s): Sparse. [bibtex]

Articles in journal or book chapters
  1. J. Dongarra, M. Faverge, T. Herault, M. Jacquelin, J. Langou, and Y. Robert. "Hierarchical QR factorization algorithms for multi-core clusters". Parallel Computing, (0):-, 2013. [pdf] [doi:10.1016/j.parco.2013.01.003] Keyword(s): Multi-core, QR factorization, Numerical linear algebra, Hierarchical architecture, Distributed memory, Cluster. [bibtex]

  2. J. Kurzak, P. Luszczek, M. Faverge, and J. Dongarra. "LU Factorization with Partial Pivoting for a Multicore System with Accelerators". IEEE Transactions on Parallel and Distributed Systems, 99(PrePrints):1, 2012. [doi:http://doi.ieeecomputersociety.org/10.1109/TPDS.2012.242] [bibtex]

  3. J. Dongarra, M. Faverge, H. Ltaief, and P. Luszczek. "Achieving Numerical Accuracy and High Performance using Recursive Tile LU Factorization". Submitted to Concurrency and Computation: Practice & Experience, LAWN 259, September 2011. [bibtex]

  4. J. Kurzak, P. Luszczek, A. YarKhan, M. Faverge, J. Langou, H. Bouwmeester, and J. Dongarra. "Handbook of Multi and Many-Core Processing: Architecture, Algorithms, Programming, and Applications", chapter Multithreading in the PLASMA Library. Chapman and Hall/CRC, To Be Published 26th March 2014. [bibtex]

Conference articles
  1. J. Dongarra, M. Faverge, T. Herault, J. Langou, and Y. Robert. "Hierarchical QR Factorization Algorithms for Multi-core Cluster Systems". In Parallel Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International, pages 607-618, 2012. [doi:10.1109/IPDPS.2012.62] [bibtex]

  2. J. Kurzak, P. Luszczek, M. Faverge, and J. Dongarra. "Programming the LU Factorization for a Multicore System with Accelerators.". In , April 2012. [pdf] [bibtex]

  3. E. Agullo, C. Augonnet, J. Dongarra, M. Faverge, H. Ltaief, S. Thibault, and S. Tomov. "QR Factorization on a Multicore Node Enhanced with Multiple GPU Accelerators". In Proceedings of the 25th IEEE International Parallel & Distributed Processing Symposium (IPDPS'11), Anchorage, United Sttes, pages 932-943, mai 2011. [pdf] [bibtex]

  4. G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, H. Haidar, T. Herault, J. Kurzak, J. Langou, P. Lemarinier, H. Ltaief, P. Luszczek, A. YarKhan, and J. Dongarra. "Flexible Development of Dense Linear Algebra Algorithms on Massively Parallel Architectures with DPLASMA". In Proceedings of the 25th IEEE International Symposium on Parallel & Distributed Processing Workshops and Phd Forum (IPDPSW'11), PDSEC 2011, Anchorage, United States, pages 1432-1441, mai 2011. [pdf] [bibtex]

  5. K. Coulomb, A. Degomme, M. Faverge, and F. Trahay. "An open-source tool-chain for performance analysis". In Parallel Tools Workshop, 2011. [bibtex]

  6. J. Dongarra, M. Faverge, H. Ltaief, and P. Luszczek. "Exploiting Fine-Grain Parallelism in Recursive LU Factorization". In Proceedings of ParCo 2011, July 2011. [bibtex]

  7. J. Dongarra, M. Faverge, H. Ltaief, and P. Luszczek. "High Performance Matrix Inversion Based on LU Factorization for Multicore Architectures". In Proceedings of MTAGS11, 2011. [bibtex]

  8. E. Agullo, C. Augonnet, J. Dongarra, M. Faverge, J. Langou, H. Ltaief, and S. Tomov. "LU Factorization for Accelerator-based Systems". In Proceedings of the 9th IEEE/ACS International Conference on Computer Systems and Applications (AICCSA'11), pages 217-224, December 2010. Note: Best Paper award. [pdf] [bibtex]

  9. M. Faverge. "A NUMA Aware Scheduler for a Parallel Sparse Direct Solver". In Journées Informatique Massivement Multiprocesseur et Multicoeur, Rocquencourt, France, 2009. [pdf] Keyword(s): Sparse. [bibtex]

  10. M. Faverge. "Dynamic Scheduling for Sparse Direct Solver on NUMA and Multicore Architectures". In Sparse Days, Toulouse, France, 2009. Keyword(s): Sparse. [bibtex]

  11. M. Faverge. "Vers un solveur de systèmes linéaires creux adapté aux machines NUMA". In ACTES RenPar'2009, Toulouse, France, 2009. [pdf] Keyword(s): Sparse. [bibtex]

  12. M. Faverge, X. Lacoste, and P. Ramet. "A NUMA Aware Scheduler for a Parallel Sparse Direct Solver". In Proceedings of PMAA'2008, Neuchatel, Swiss, 2008. Keyword(s): Sparse. [bibtex]

  13. M. Faverge and P. Ramet. "Dynamic Scheduling for sparse direct Solver on NUMA architectures". In Proceedings of PARA'2008, Trondheim, Norway, 2008. [pdf] Keyword(s): Sparse. [bibtex]

Internal reports
  1. X. Lacoste, P. Ramet, M. Faverge, I. Yamazaki, and J. Dongarra. "Sparse direct solvers with accelerators over DAG runtimes". Rapport de recherche RR-7972, INRIA, 2012. [pdf] [PDF] [bibtex]

  2. D. Becker, M. Faverge, and J. Dongarra. "Towards a Parallel Tile LDL Factorization for Multicore Architectures". Technical report, Innovative Computing Laboratory, University of Tennessee, 2011. [bibtex]

  3. G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, H. Haidar, T. Herault, J. Kurzak, J. Langou, P. Lemarinier, H. Ltaief, P. Luszczek, A. YarKhan, and J. Dongarra. "Distibuted Dense Numerical Linear Algebra Algorithms on Massively Parallel Architectures: DPLASMA". Technical report, Innovative Computing Laboratory, University of Tennessee, April 2010. [pdf] [bibtex]

  4. J. Dongarra, M. Faverge, Y. Ishikawa, R. Namyst, F. Rue, and F. Trahay. "EZTrace: a generic framework for performance analysis". Technical report, Innovative Computing Laboratory, University of Tennessee, December 2010. Note: Poster at CCGrid 2011. [bibtex]



Email
Phone 865-974-9408
Office Claxton 352

University of Tennessee
Computer Science Department
Innovative Computing Laboratory
1122 Volunteer Blvd, Claxton Building
Knoxville, Tennessee 37996-3450
Fax 865-974-8296