Search - HAL open archive


51 results

Proceedings of the 24th High Performance Computing Symposium (HPC 2016)

Marc Baboulin , Josef Weinbub , William Thacker , Lukas Polok
ACM. Spring Simulation Multiconference, Apr 2016, Pasadena, CA, United States. 2016, 978-1-5108-2318-1
Proceedings hal-01486178v1

An efficient distributed randomized solver with application to large dense linear systems

Marc Baboulin , Dulceneia Becker , George Bosilca , Anthony Danalis , Jack Dongarra
[Research Report] RR-8043, INRIA. 2012
Report hal-00724059v1

Dense Symmetric Indefinite Factorization on GPU Accelerated Architectures

Marc Baboulin , Jack Dongarra , Adrien Rémy , Stanimire Tomov , Ichitaro Yamazaki
International Conference on Parallel Processing and Applied Mathematics, Sep 2015, Krakow, Poland. pp.86-95, ⟨10.1007/978-3-319-32149-3_9⟩
Conference paper hal-01223022v1

Accelerating linear system solutions using randomization techniques

Marc Baboulin , Jack Dongarra , Julien Herrmann , Stanimire Tomov
ACM Transactions on Mathematical Software, 2013, 39 (2), ⟨10.1145/2427023.2427025⟩
Journal article hal-00908496v1

Synthesizing Quantum Circuits via Numerical Optimization

Timothée Goubault de Brugière , Marc Baboulin , Benoît Valiron , Cyril Allouche
19th International Conference in Computational Science - ICCS 2019, Jun 2019, Faro, Portugal. pp.3-16, ⟨10.1007/978-3-030-22741-8_1⟩
Conference paper hal-02174967v1

Statistical estimates for the conditioning of linear least squares problems

Marc Baboulin , Serge Gratton , Rémi Lacroix , Alan J Laub
International Conference on Parallel Processing and Applied Mathematics, Sep 2013, Warsaw, Poland. pp.124-133, ⟨10.1007/978-3-642-55224-3_13⟩
Conference paper hal-01766923v1

Scalable Algorithms Using Sparse Storage for Parallel Spectral Clustering on GPU

Guanlin He , Stephane Vialle , Nicolas Sylvestre , Marc Baboulin
IFIP International Conference on Network and Parallel Computing (NPC 2021), Christophe Cérin; Depei Qian; Jean-Luc Gaudiot; Guangming Tan; Stéphane Zuckerman, Nov 2021, Paris, France. pp.40-52, ⟨10.1007/978-3-030-93571-9_4⟩
Conference paper hal-04138695v1

Metaprogramming dense linear algebra solvers. Applications to multi and many-core architectures.

Ian Masliah , Marc Baboulin , Joel Falcou
13th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA 2015), Aug 2015, Helsinki, Finland
Conference paper hal-01221358v1

Towards an automatic generation of dense linear algebra solvers on parallel architectures

Marc Baboulin , Joel Falcou , Ian Masliah
[Research Report] RR-8615, Université Paris-Sud; INRIA. 2014, pp.20
Report hal-01075663v1

Computing least squares condition numbers on hybrid multicore/GPU systems

Marc Baboulin , Jack Dongarra , Rémi Lacroix
Interdisciplinary Topics in Applied Mathematics, Modeling and Computational Science, 117, Springer International Publishing, 2015, 978-3-319-12306-6. ⟨10.1007/978-3-319-12307-3_6⟩
Book chapter hal-01204804v1

Using Random Butterfly Transformations to Avoid Pivoting in Sparse Direct Methods

Marc Baboulin , Xiaoye S. Li , François-Henry Rouet
High Performance Computing for Computational Science (VECPAR 2014), Jun 2014, Eugene, OR, United States. ⟨10.1007/978-3-319-17353-5_12⟩
Conference paper hal-01205703v1

A parallel solver for incompressible fluid flows

Yushan Wang , Marc Baboulin , Jack Dongarra , Joel Falcou , Yann Fraigneau , et al.
International Conference on Computational Science, Jun 2013, Barcelona, Spain. ⟨10.1016/j.procs.2013.05.207⟩
Conference paper hal-00915356v1

Solving dense symmetric indefinite systems using GPUs

Marc Baboulin , Jack Dongarra , Adrien Rémy , Stanimire Tomov , Ichitaro Yamazaki
Concurrency and Computation: Practice and Experience, 2017, 29 (9), pp.1 - 17. ⟨10.1002/cpe.4055⟩
Journal article hal-01662358v1

Solving 3D incompressible Navier-Stokes equations on hybrid CPU/GPU systems

Yushan Wang , Marc Baboulin , Karl Rupp , Olivier Le Maitre , Yann Fraigneau
High Performance Computing Symposium (HPC'14), Spring Simulation Multiconference, Apr 2014, Tampa, Florida, United States
Conference paper hal-01205305v1

Accelerating linear system solutions using randomization techniques

Marc Baboulin , Jack Dongarra , Julien Herrmann , Stanimire Tomov
[Research Report] RR-7616, INRIA. 2011
Report inria-00593306v1

A contribution to the conditioning of the total least squares problem

Marc Baboulin , Serge Gratton
[Research Report] RR-7488, INRIA. 2010
Report inria-00546886v1

Decoding techniques applied to the compilation of CNOT circuits for NISQ architectures

Timothée Goubault de Brugière , Marc Baboulin , Benoît Valiron , Simon Martiel , Cyril Allouche
Science of Computer Programming, 2022, 214, pp.102726. ⟨10.1016/j.scico.2021.102726⟩
Journal article hal-03547113v1

A Comparison of Soft-Fault Error Models in the Parallel Preconditioned Flexible GMRES

Evan Coleman , Aygul Jamal , Marc Baboulin , Amal Khabou , Masha Sosonkina
International Conference on Parallel Processing and Applied Mathematics, Sep 2017, Lublin, Poland. pp.36-46, ⟨10.1007/978-3-319-78024-5_4⟩
Conference paper hal-01768773v1

Reducing the depth of linear reversible quantum circuits

Timothée Goubault de Brugière , Marc Baboulin , Benoît Valiron , Simon Martiel , Cyril Allouche
IEEE Transactions on Quantum Engineering, In press, 2, pp.3102422. ⟨10.1109/TQE.2021.3091648⟩
Journal article hal-03553916v1

A class of communication-avoiding algorithms for solving general dense linear systems on CPU/GPU parallel machines

Marc Baboulin , Simplice Donfack , Jack Dongarra , Laura Grigori , Adrien Rémy , et al.
[Research Report] RR-7854, INRIA. 2012
Report hal-00656457v3

Locality Optimization on a NUMA Architecture for Hybrid LU Factorization

Adrien Rémy , Marc Baboulin , Masha Sosonkina , Brigitte Rozoy
Parallel Computing: Accelerating Computational Science and Engineering, 25, pp.153-162, 2014, Advances in Parallel Computing, ⟨10.3233/978-1-61499-381-0-153⟩
Book chapter hal-00987284v1

Meta-programming and Multi-stage Programming for GPGPUs

Marc Baboulin , Joel Falcou , Ian Masliah
[Research Report] RR-8780, Inria Saclay Ile de France; Paris-Sud XI. 2015
Report hal-01204661v1

Parallel Tools for Solving Incremental Dense Least Squares Problems: Application to Space Geodesy

Marc Baboulin , Luc Giraud , Serge Gratton , Julien Langou
Journal of Algorithms & Computational Technology, 2009, 3 (1), pp.117-133. ⟨10.1260/174830109787186541⟩
Journal article hal-03973310v1

Parallelization of the k-means Algorithm in a Spectral Clustering Chain on CPU-GPU Platforms

Guanlin He , Stéphane Vialle , Marc Baboulin
HeteroPar Workshop of 2020 Euro-Par International Conference, Aug 2020, Warsaw, Poland
Conference paper hal-02985021v1

Mixed precision iterative refinement for low-rank matrix and tensor approximations

Marc Baboulin , Oguz Kaya , Théo Mary , Matthieu Robeyns
2023
Preprint, working paper hal-04115337v1

Collective mind: Towards practical and collaborative auto-tuning

Grigori Fursin , Renato Miceli , Anton Lokhmotov , Michael Gerndt , Marc Baboulin , et al.
Scientific Programming, 2014, Automatic Application Tuning for HPC Architectures, 22 (4), pp.309-329. ⟨10.3233/SPR-140396⟩
Journal article hal-01054763v1

Gaussian elimination versus Greedy methods for the synthesis of linear reversible circuits

Timothée Goubault de Brugière , Marc Baboulin , Benoît Valiron , Simon Martiel , Cyril Allouche
ACM Transactions on Quantum Computing, 2021, 2 (3), pp.11. ⟨10.1145/3474226⟩
Journal article hal-03547117v1

Fast and reliable solutions for numerical linear algebra solvers in high-performance computing.

Marc Baboulin
Distributed, Parallel, and Cluster Computing [cs.DC]. Université Paris Sud - Paris XI, 2012
HDR tel-00967523v1

Using Random Butterfly Transformations in Parallel Schur Complement-Based Preconditioning

Marc Baboulin , Aygul Jamal , Masha Sosonkina
8th Workshop on Computer Aspects of Numerical Algorithms (CANA'15), Sep 2015, Lodz, Poland
Conference paper hal-01223090v1

Towards a High-Performance Tensor Algebra Package for Accelerators

Marc Baboulin , Veselin Dobrev , Jack Dongarra , Christopher Earl , Joel Falcou , et al.
Smoky Mountains Computational Sciences and Engineering Conference (SMC 2015), Aug 2015, Gatlinburg, United States. 2015
Conference poster hal-01231234v1