- 11
- 5
- 1
Olivier Aumage
17
Documents
Présentation
Position
--------
Olivier Aumage holds a permanent Researcher position as part of the RUNTIME Team at INRIA in Bordeaux and is member from the LaBRI laboratory since 2003.
Research Topics
---------------
His research topics include runtime systems and multithread scheduling in the context of multicores, manycores and accelerators programming, and communication optimization on high performance networks, as well as application analysis and profiling.
Collaborations and Projects
---------------------------
He has been the scientific coordinator for the ANR ProHMPT project dedicated to accelerating nanoscale material simulations through the use of GPU. He is now involved in the ongoing FP7 IRSES Project HPC-GA High Performance Computing for Geophysics Applications, FP7 ICT Project Mont-Blanc 2 aiming at designing programming tools for energy efficient high-performance computing, MORSE associated team with INRIA HiePacs and UTK, and ANR Project Solhar on porting matrix solvers over runtimes.
Position
--------
Olivier Aumage holds a permanent Researcher position as part of the RUNTIME Team at INRIA in Bordeaux and is member from the LaBRI laboratory since 2003.
Research Topics
---------------
His research topics include runtime systems and multithread scheduling in the context of multicores, manycores and accelerators programming, and communication optimization on high performance networks, as well as application analysis and profiling.
Collaborations and Projects
---------------------------
He has been the scientific coordinator for the ANR ProHMPT project dedicated to accelerating nanoscale material simulations through the use of GPU. He is now involved in the ongoing FP7 IRSES Project HPC-GA High Performance Computing for Geophysics Applications, FP7 ICT Project Mont-Blanc 2 aiming at designing programming tools for energy efficient high-performance computing, MORSE associated team with INRIA HiePacs and UTK, and ANR Project Solhar on porting matrix solvers over runtimes.
Publications
- 4
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 17
- 7
- 5
- 5
- 4
- 4
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
|
Achieving High Performance on Supercomputers with a Sequential Task-based Programming ModelIEEE Transactions on Parallel and Distributed Systems, inPress, ⟨10.1109/TPDS.2017.2766064⟩
Article dans une revue
hal-01618526v1
|
|
sOMP: Simulating OpenMP Task-Based Applications with NUMA EffectsIWOMP 2020 - 16th International Workshop on OpenMP, Sep 2020, Austin / Virtual, United States. ⟨10.1007/978-3-030-58144-2_13⟩
Communication dans un congrès
hal-02933803v1
|
|
Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System21st International Workshop on High-Level Parallel Programming Models and Supportive Environments, May 2016, Chicago, United States. ⟨10.1109/IPDPSW.2016.105⟩
Communication dans un congrès
hal-01284004v1
|
|
Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime SystemSIAM Conference on Parallel Processing for Scientific Computing (SIAM PP 2016), Apr 2016, Paris, France. pp.318 - 327
Communication dans un congrès
hal-01380126v1
|
|
Towards seismic wave modeling on heterogeneous many-core architectures using task-based runtime system27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Oct 2015, Florianopolis, Brazil. ⟨10.1109/SBAC-PAD.2015.33⟩
Communication dans un congrès
hal-01182746v1
|
|
Overview of Distributed Linear Algebra on Hybrid Nodes over the StarPU RuntimeSIAM Conference on Parallel Processing for Scientific Computing (SIAM PP 2014), Feb 2014, Portland, Oregon, United States
Communication dans un congrès
hal-00978602v1
|
|
Harnessing clusters of hybrid nodes with a sequential task-based programming modelInternational Workshop on Parallel Matrix Algorithms and Applications (PMAA 2014), Jul 2014, Lugano, Switzerland
Communication dans un congrès
hal-01283949v1
|
|
Evaluation of OpenMP Dependent Tasks with the KASTORS Benchmark Suite10th International Workshop on OpenMP, IWOMP2014, Sep 2014, Salvador, Brazil. pp.16 - 29, ⟨10.1007/978-3-319-11454-5_2⟩
Communication dans un congrès
hal-01081974v1
|
|
Adaptive Task Size Control on High Level Programming for GPU/CPU Work SharingThe 2013 International Symposium on Advances of Distributed and Parallel Computing (ADPC 2013), Dec 2013, Vietri sul Mare, Italy. ⟨10.1007/978-3-319-03889-6_7⟩
Communication dans un congrès
hal-00920915v1
|
|
A NUMA-aware fine grain parallelization framework for multi-core architecturePDSEC - 14th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing - 2013, May 2013, Boston, United States. ⟨10.1109/IPDPSW.2013.204⟩
Communication dans un congrès
hal-00858350v1
|
|
StarPU-MPI: Task Programming over Clusters of Machines Enhanced with AcceleratorsEuroMPI 2012 - The 19th European MPI Users' Group Meeting, Sep 2012, Vienna, Austria
Communication dans un congrès
hal-00725477v1
|
|
Structuring the execution of OpenMP applications for multicore architecturesInternational Parallel and Distributed Symposium (IPDPS 2010), Apr 2010, Atltanta, United States. ⟨10.1109/IPDPS.2010.5470442⟩
Communication dans un congrès
inria-00441472v1
|
|
Scheduling Dynamic OpenMP Applications over Multicore ArchitecturesInternational Workshop on OpenMP, May 2008, West Lafayette, IN, United States. ⟨10.1007/978-3-540-79561-2_15⟩
Communication dans un congrès
inria-00329934v1
|
|
Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model[Research Report] RR-8927, Inria Bordeaux Sud-Ouest; Bordeaux INP; CNRS; Université de Bordeaux; CEA. 2016, pp.27
Rapport
hal-01332774v1
|
|
StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators[Research Report] RR-8538, INRIA. 2014
Rapport
hal-00992208v2
|
Association de modèles de programmation pour l'exploitation de clusters de GPUs dans le calcul intensifCalcul parallèle, distribué et partagé [cs.DC]. 2011
Mémoire d'étudiant
hal-00803304v1
|
|
Etude de la parallélisation du produit Matrice/Vecteur creux sur processeurs hétérogènes.Calcul parallèle, distribué et partagé [cs.DC]. 2011
Mémoire d'étudiant
hal-00793702v1
|