Raymond Namyst
99 documents
Publications
Peachy Parallel Assignments (EduPar 2022)
IPDPSW 2022 - IEEE International Parallel and Distributed Processing Symposium Workshops, May 2022, Lyon, France. pp.361-368, ⟨10.1109/IPDPSW55747.2022.00068⟩
Conference paper
hal-03938420v1
Exploring scheduling algorithms for parallel task graphs: a modern game engine case study
International European Conference on Parallel and Distributed Computing (Euro-Par), Aug 2022, Glasgow, United Kingdom. pp.103-118, ⟨10.1007/978-3-031-12597-3_7⟩
Conference paper
hal-03580775v1
Programming Heterogeneous Architectures Using Hierarchical Tasks
HeteroPar 2022 - twentieth international workshop, Aug 2022, Glasgow, United Kingdom. pp.12
Conference paper
hal-03789625v1
TEXTAROSSA: Towards EXtreme scale Technologies and Accelerators for euROhpc hw/Sw Supercomputing Applications for exascale
DSD 2021 - 24th Euromicro Conference on Digital System Design, Sep 2021, Palermo / Virtual, Italy
Conference paper
hal-03329640v1
Combining Task-based Parallelism and Adaptive Mesh Refinement Techniques in Molecular Dynamics Simulations
ICPP18, International Conference on Parallel Processing, Aug 2018, Eugene, United States. ⟨10.1145/3225058.3225085⟩
Conference paper
hal-01833266v1
Resource-Management Study in HPC Runtime-Stacking Context
SBAC-PAD 2017 - 29th International Symposium on Computer Architecture and High Performance Computing, Oct 2017, Campinas, Brazil. pp.177-184, ⟨10.1109/SBAC-PAD.2017.30⟩
Conference paper
hal-01682286v1
Resource aggregation in task-based applications over accelerator-based multicore machines
HeteroPar'2016 workshop of Euro-Par, Aug 2016, Grenoble, France
Conference paper
hal-01355385v1
Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines
HeteroPar'2016 workshop of Euro-Par, Aug 2016, Grenoble, France
Conference paper
hal-01181135v3
SPAWN: An Iterative, Potentials-Based, Dynamic Scheduling and Partitioning Tool
SuperComputing'15 - RESPA Workshop, Nov 2015, Austin, United States
Conference paper
hal-01223897v1
Automatic OpenCL code generation for multi-device heterogeneous architectures
ICPP 2015: 44th International Conference on Parallel Processing, Sep 2015, Beijing, China. pp.959-968, ⟨10.1109/ICPP.2015.105⟩
Conference paper
hal-01275482v1
Dynamic Load Balancing with Pair Potentials
Euro-Par 2014 International Workshops, Luis Lopez, Aug 2014, Porto, Portugal. pp.462-473, ⟨10.1007/978-3-319-14313-2_39⟩
Conference paper
hal-01223876v1
Toward OpenCL Automatic Multi-Device Support
Euro-Par 2014, Aug 2014, Porto, Portugal
Conference paper
hal-01005765v1
High Performance Code Generation for Stencil Computation on Heterogeneous Multi-device Architectures
HPCC 2013 - 15th IEEE International Conference on High Performance Computing and Communications, Nov 2013, Zhangjiajie, China
Conference paper
hal-00925481v1
Implementation of FEM Application on GPU with StarPU
SIAM CSE13 - SIAM Conference on Computational Science and Engineering 2013, SIAM, Feb 2013, Boston, United States
Conference paper
hal-00926144v1
High-performance code generation for stencil computations on heterogeneous multi-device architectures
HPCC 2013 - 15th IEEE International Conference on High Performance Computing and Communications, Nov 2013, Zhangjiajie, China
Conference paper
hal-00952258v1
Towards exascale with the ANR-JST Japanese-French project FP3C (Framework and Programming for Post-Petascale Computing)
9th International Conference on Computer Science and Information Technologies, Sep 2013, Yerevan, Armenia
Conference paper
hal-00922754v1
Adaptive Task Size Control on High Level Programming for GPU/CPU Work Sharing
The 2013 International Symposium on Advances of Distributed and Parallel Computing (ADPC 2013), Dec 2013, Vietri sul Mare, Italy. ⟨10.1007/978-3-319-03889-6_7⟩
Conference paper
hal-00920915v1
Composing multiple StarPU applications over heterogeneous machines: a supervised approach
Third International Workshop on Accelerators and Hybrid Exascale Systems, May 2013, Boston, United States
Conference paper
hal-00824514v1
StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators
EuroMPI 2012 - The 19th European MPI Users' Group Meeting, Sep 2012, Vienna, Austria
Conference paper
hal-00725477v1
High-Level Support for Pipeline Parallelism on Many-Core Architectures
Europar - International European Conference on Parallel and Distributed Computing - 2012, Aug 2012, Rhodes Island, Greece. ⟨10.1007/978-3-642-32820-6_61⟩
Conference paper
hal-00697020v1
Programmability and Performance Portability Aspects of Heterogeneous Multi-/Manycore Systems
Design, Automation and Test in Europe (DATE), Mar 2012, Dresden, Germany. ⟨10.1109/DATE.2012.6176582⟩
Conference paper
hal-00776610v1
A sampling-based approach for communication libraries auto-tuning
IEEE International Conference on Cluster Computing, Sep 2011, Austin, United States
Conference paper
inria-00605735v1
EZTrace: a generic framework for performance analysis
IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), May 2011, Newport Beach, CA, United States
Conference paper
inria-00587216v1
The PEPPHER Approach to Programmability and Performance Portability for Heterogeneous many-core Architectures
ParCo, Aug 2011, Ghent, Belgium
Conference paper
hal-00661320v1
Structuring the execution of OpenMP applications for multicore architectures
International Parallel and Distributed Symposium (IPDPS 2010), Apr 2010, Atlanta, United States. ⟨10.1109/IPDPS.2010.5470442⟩
Conference paper
inria-00441472v1
Adaptive MPI Multirail Tuning for Non-Uniform Input/Output Access
The 17th European MPI Users Group conference, Sep 2010, Stuttgart, Germany. pp.239-248, ⟨10.1007/978-3-642-15646-5_25⟩
Conference paper
inria-00486178v1
Data-Aware Task Scheduling on Multi-Accelerator based Platforms
16th International Conference on Parallel and Distributed Systems, Dec 2010, Shanghai, China
Conference paper
inria-00523937v1
hwloc: a Generic Framework for Managing Hardware Affinities in HPC Applications
PDP 2010 - The 18th Euromicro International Conference on Parallel, Distributed and Network-Based Computing, Feb 2010, Pisa, Italy. ⟨10.1109/PDP.2010.67⟩
Conference paper
inria-00429889v1
Dynamically scheduled Cholesky factorization on multicore architectures with GPU accelerators
Symposium on Application Accelerators in High Performance Computing (SAAHPC), Jul 2010, Knoxville, United States
Conference paper
inria-00547616v1
Optimizing MPI Communication within large Multicore nodes with Kernel assistance
Workshop on Communication Architecture for Clusters, held in conjunction with IPDPS 2010, Apr 2010, Atlanta, United States. 7 p., ⟨10.1109/IPDPSW.2010.5470849⟩
Conference paper
inria-00451471v1
Automatic Calibration of Performance Models on Heterogeneous Multicore Architectures
3rd Workshop on Highly Parallel Processing on a Chip (HPPC 2009), Aug 2009, Delft, Netherlands
Conference paper
inria-00421333v1
Exploiting the Cell/BE architecture with the StarPU unified runtime system
SAMOS Workshop, Jul 2009, SAMOS, Greece
Conference paper
inria-00378705v1
StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures
Euro-Par 2009, Aug 2009, Delft, Netherlands
Conference paper
inria-00384363v1
Dynamic Task and Data Placement over NUMA Architectures: an OpenMP Runtime Perspective
International Workshop on OpenMP (IWOMP), Jun 2009, Dresden, Germany. ⟨10.1007/978-3-642-02303-3_7⟩
Conference paper
inria-00367570v1
A unified runtime system for heterogeneous multicore architectures
2nd Workshop on Highly Parallel Processing on a Chip (HPPC 2008), Aug 2008, Las Palmas de Gran Canaria, Spain
Conference paper
inria-00326917v1
MPC: A Unified Parallel Runtime for Clusters of NUMA Machines
The 14th International Euro-Par Conference, Aug 2008, Las Palmas de Gran Canaria, Spain. pp.78-88, ⟨10.1007/978-3-540-85451-7_9⟩
Conference paper
inria-00422229v1
A multithreaded communication engine for multicore architectures
Communication Architecture for Clusters, Apr 2008, Miami, United States. ⟨10.1109/IPDPS.2008.4536139⟩
Conference paper
inria-00224999v1
Scheduling Dynamic OpenMP Applications over Multicore Architectures
International Workshop on OpenMP, May 2008, West Lafayette, IN, United States. ⟨10.1007/978-3-540-79561-2_15⟩
Conference paper
inria-00329934v1
High-Performance Multi-Rail Support with the NewMadeleine Communication Library
The Sixteenth International Heterogeneity in Computing Workshop (HCW 2007), workshop held in conjunction with IPDPS 2007, Mar 2007, Long Beach, California, United States
Conference paper
inria-00126254v1
NewMadeleine: a Fast Communication Scheduling Engine for High Performance Networks
Workshop on Communication Architecture for Clusters (CAC 2007), workshop held in conjunction with IPDPS 2007, Mar 2007, Long Beach, California, United States
Conference paper
inria-00127356v1
Building Portable Thread Schedulers for Hierarchical Multiprocessors: the BubbleSched Framework
EuroPar, Aug 2007, Rennes, France. ⟨10.1007/978-3-540-74466-5_6⟩
Conference paper
inria-00154506v1
Improving Reactivity and Communication Overlap in MPI using a Generic I/O Manager
EuroPVM/MPI 2007, Oct 2007, Paris, France. pp.170-177, ⟨10.1007/978-3-540-75416-9_27⟩
Conference paper
inria-00177167v1
An Efficient OpenMP Runtime System for Hierarchical Architectures
International Workshop on OpenMP (IWOMP), Jun 2007, Beijing, China. pp.148-159, ⟨10.1007/978-3-540-69303-1_19⟩
Conference paper
inria-00154502v1
Efficient runtime systems for grids
EXPGRID, Experimental Grid testbeds for the assessment of large-scale distributed applications and tools, Workshop held in conjunction with the 15th International Symposium on High Performance Distributed Computing (HPDC-15), Jun 2006, Paris, France
Conference paper
inria-00404196v1
Short Paper: Dynamic Optimization of Communications over High Speed Networks
The 15th IEEE International Symposium on High Performance Distributed Computing (HPDC-15), Jun 2006, Paris, France
Conference paper
inria-00110773v1
ACI Grid'5000, Site de Bordeaux
PaRISTIC: Panorama des Recherches Incitatives en STIC, Nov 2006, Nancy, France
Conference paper
inria-00404192v1
An Efficient Multi-level Trace Toolkit for Multi-threaded Applications
Euro-Par 2005 Parallel Processing, Aug 2005, Lisbon, Portugal. pp.166-175, ⟨10.1007/11549468_21⟩
Conference paper
hal-00360309v1
Grid'5000: a large scale, reconfigurable, controlable and monitorable Grid platform
6th IEEE/ACM International Workshop on Grid Computing - GRID 2005, Nov 2005, Seattle, United States
Conference paper
inria-00000284v1
ALTA: Asynchronous Loss Tolerant Algorithms for Grid Computing
3rd International workshop on Parallel Matrix Algorithms and Applications (PMAA'04), Oct 2004, Marseille, France
Conference paper
hal-01101475v1
Implementing Java consistency using a generic multithreaded DSM runtime system
Proc. Intl. Parallel and Distributed Processing Symposium (IPDPS'00), Workshop on Java for Parallel and Distributed Computing, 2000, Cancun, Mexico. pp.560-567, ⟨10.1007/3-540-45591-4_76⟩
Conference paper
inria-00563587v1
Compiling Data-parallel Programs to A Distributed Runtime Environment with Thread Isomigration
The 1999 Intl Conf. on Parallel and Distributed Processing Techniques and Applications (PDPTA '99), Technical Session on parallel and distributed languages: mechanisms, implementations, and tools, 2000, Las Vegas, NV, United States. pp.1756-1762
Conference paper
inria-00563794v1
DSM-PM2: a multi-protocol DSM layer for the PM2 multithreaded runtime system
Proc. 2nd Workshop on Parallel Computing for Irregular Applications (WPCIA2), 2000, Toulouse, France
Conference paper
inria-00563590v1
Compiling multithreaded Java bytecode for distributed execution
Euro-Par 2000: Parallel Processing, Aug 2000, Munchen, Germany. pp.1039-1052, ⟨10.1007/3-540-44520-X_148⟩
Conference paper
inria-00563684v1
An Efficient and Transparent Thread Migration Scheme in the PM2 Runtime System
Proceedings of the 11 IPPS/SPDP'99 Workshops Held in Conjunction with the 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing, Apr 1999, Po Rico, Puerto Rico. pp.496-510
Conference paper
inria-00565361v1
Proceedings of Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29 - September 2, 2011, Revised Selected Papers, Part I
Michael Alexander and Pasqua D'Ambra and Adam Belloum and George Bosilca and Mario Cannataro and Marco Danelutto and Beniamino Di Martino and Michael Gerndt and Emmanuel Jeannot and Raymond Namyst and Jean Roman and Stephen L. Scott and Jesper Larsson Trä. Springer, 7155, pp.524, 2012, LNCS, 978-3-642-29736-6
Book
hal-00788213v1
Proceedings of Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29 - September 2, 2011, Revised Selected Papers, Part II
Michael Alexander and Pasqua D'Ambra and Adam Belloum and George Bosilca and Mario Cannataro and Marco Danelutto and Beniamino Di Martino and Michael Gerndt and Emmanuel Jeannot and Raymond Namyst and Jean Roman and Stephen L. Scott and Jesper Larsson Trä. Springer, 7156, pp.480, 2012, 978-3-642-29739-7
Book
hal-00788214v1
Proceedings of Euro-Par 2011 Parallel Processing - 17th International Conference, Part II
Emmanuel Jeannot and Raymond Namyst and Jean Roman. Springer, 6853, pp.488, 2011, LNCS, 978-3-642-23396-8. ⟨10.1007/978-3-642-23397-5⟩
Book
hal-00788208v1
Proceedings of Euro-Par 2011 Parallel Processing - 17th International Conference, Part I
Emmanuel Jeannot and Raymond Namyst and Jean Roman. Springer, 6852, pp.598, 2011, LNCS, 978-3-642-23399-9. ⟨10.1007/978-3-642-29737-3⟩
Book
hal-00788206v1
Faster, Cheaper, Better – a Hybridization Methodology to Develop Linear Algebra Software for GPUs
Wen-mei W. Hwu. GPU Computing Gems, 2, Morgan Kaufmann, 2010
Book chapter
inria-00547847v1
Des réseaux de calculateurs aux grilles de calcul
Akoka, Jacky; Comyn-Wattiau, Isabelle. Encyclopédie de l'informatique et des systèmes d'information, Section 2 - Architectures et systèmes distribués, Vuibert, pp.211-239, 2006, Collection informatique
Book chapter
hal-01271123v1
EASYPAP: a Framework for Learning Parallel Programming
2020
Preprint, working paper
hal-02469919v1
Resource aggregation for task-based Cholesky Factorization on top of modern architectures
2016
Preprint, working paper
hal-01409965v1
Efficient shared memory message passing for inter-VM communications
2008
Preprint, working paper
hal-00368622v1
Programming heterogeneous, accelerator-based multicore machines: current situation and main challenges
International Conference On Preconditioning Techniques For Scientific And Industrial Applications, Preconditioning 2011, May 2011, Bordeaux, France
Document associated with scientific events
inria-00590670v1