Recherche - Archive ouverte HAL Accéder directement au contenu

Filtrer vos résultats

95 résultats
Image document

A Proposal-based Paradigm for Self-supervised Sound Source Localization in Videos

Hanyu Xuan , Zhiliang Wu , Jian Yang , Yan Yan , Xavier Alameda-Pineda
CVPR 2022 - IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States. pp.1-10, ⟨10.1109/CVPR52688.2022.00110⟩
Communication dans un congrès hal-03626420v1

Successor Feature Neural Episodic Control

David Emukpere , Xavier Alameda-Pineda , Chris Reinke
NeurIPS 2021 - 35th International Conference on Neural Information Processing Systems, Dec 2021, Virtual, Canada. pp.1-12
Communication dans un congrès hal-03426874v1
Image document

Learning Deep Structured Multi-Scale Features using Attention-Gated CRFs for Contour Prediction

Dan Xu , Wanli Ouyang , Xavier Alameda-Pineda , Elisa Ricci , Xiaogang Wang , et al.
Advances in Neural Information Processing Systems, Dec 2017, Long Beach, United States. pp.3961-3970
Communication dans un congrès hal-01646112v1

Back to MLP: A Simple Baseline for Human Motion Prediction

Wen Guo , Yuming Du , Xi Shen , Vincent Lepetit , Xavier Alameda-Pineda , et al.
WACV 2023 - IEEE Winter Conference on Applications of Computer Vision, Jan 2023, Waikoloa, United States. pp.1-11
Communication dans un congrès hal-03906936v1

PI-Net: Pose Interacting Network for Multi-Person Monocular 3D Pose Estimation

Wen Guo , Enric Corona , Francesc Moreno-Noguer , Xavier Alameda-Pineda
WACV 2021 - IEEE Winter Conference on Applications of Computer vision, Jan 2021, Waikoloa, United States. pp.1-11, ⟨10.1109/WACV48630.2021.00284⟩
Communication dans un congrès hal-02971754v1
Image document

CANU-ReID: A Conditional Adversarial Network for Unsupervised person Re-IDentification

Guillaume Delorme , Yihong Xu , Stéphane Lathuilière , Radu Horaud , Xavier Alameda-Pineda
ICPR 2020 - 25th International Conference on Pattern Recognition, Jan 2021, Milano, Italy. pp.4428-4435, ⟨10.1109/ICPR48806.2021.9412431⟩
Communication dans un congrès hal-02882285v1
Image document

Univariate Radial Basis Function Layers: Brain-inspired Deep Neural Layers for Low-Dimensional Inputs

Basavasagar Patil , Xavier Alameda-Pineda , Chris Reinke
2023
Pré-publication, Document de travail hal-04342724v1

Multi-Person Extreme Motion Prediction

Wen Guo , Xiaoyu Bie , Xavier Alameda-Pineda , Francesc Moreno-Noguer
IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States. ⟨10.1109/CVPR52688.2022.01271⟩
Communication dans un congrès hal-03295672v1

How to Make an Image More Memorable? A Deep Style Transfer Approach

Aliaksandr Siarohin , Gloria Zen , Cveta Majtanovic , Xavier Alameda-Pineda , Elisa Ricci , et al.
ICMR 2017 - ACM International Conference on Multimedia Retrieval, Jun 2017, Bucharest, Romania. pp.322-329, ⟨10.1145/3078971.3078986⟩
Communication dans un congrès hal-01858385v1
Image document

Semi-supervised learning made simple with self-supervised clustering

Enrico Fini , Pietro Astolfi , Karteek Alahari , Xavier Alameda-Pineda , Julien Mairal , et al.
CVPR 2023 – IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2023, Vancouver, Canada. pp.1-11
Communication dans un congrès hal-04073630v1
Image document

A Recurrent Variational Autoencoder for Speech Enhancement

Simon Leglaive , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud
ICASSP 2020 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, May 2020, Barcelone (virtual), Spain. pp.371-375, ⟨10.1109/ICASSP40776.2020.9053164⟩
Communication dans un congrès hal-02329000v2

Deep Variational Generative Models for Audio-visual Speech Separation

Viet-Nhat Nguyen , Mostafa Sadeghi , Elisa Ricci , Xavier Alameda-Pineda
MLSP 2021 - IEEE International Workshop on Machine Learning for Signal Processing, Oct 2021, Gold Coast, Australia. ⟨10.1109/MLSP52302.2021.9596406⟩
Communication dans un congrès hal-02930662v1
Image document

Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation

Xiaoyu Lin , Laurent Girin , Xavier Alameda-Pineda
Transactions on Machine Learning Research Journal, 2024, pp.1-19
Article dans une revue hal-03584014v1
Image document

Dynamical Variational Autoencoders: A Comprehensive Review

Laurent Girin , Simon Leglaive , Xiaoyu Bie , Julien Diard , Thomas Hueber , et al.
Foundations and Trends in Machine Learning, 2021, 15 (1-2), pp.1-175. ⟨10.1561/2200000089⟩
Article dans une revue hal-02926215v2
Image document

EM Algorithms for Weighted-Data Clustering with Application to Audio-Visual Scene Analysis

Israel Dejene Gebru , Xavier Alameda-Pineda , Florence Forbes , Radu Horaud
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38 (12), pp.2402 - 2415. ⟨10.1109/TPAMI.2016.2522425⟩
Article dans une revue hal-01261374v1
Image document

Geometrically-constrained time delay estimation-based sound source localisation (gTDESSL)

Xavier Alameda-Pineda , Radu Horaud
[Research Report] RR-7988, INRIA. 2012, pp.28
Rapport hal-00704986v2
Image document

Towards Probabilistic Generative Models for Socially Intelligent Robots

Xavier Alameda-Pineda
Computer Vision and Pattern Recognition [cs.CV]. Université Grenoble - Alpes, 2020
HDR tel-03192456v1
Image document

Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual Speech Enhancement

Mostafa Sadeghi , Xavier Alameda-Pineda
ICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto / Virtual, Canada. pp.1-5, ⟨10.1109/ICASSP39728.2021.9414097⟩
Communication dans un congrès hal-03155445v1

Speech Modeling with a Hierarchical Transformer Dynamical VAE

Xiaoyu Lin , Xiaoyu Bie , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda
ICASSP 2023 - IEEE International Conference on Acoustics, Speech and Signal Processing, Jun 2023, Rhodes, Greece. pp.1-5, ⟨10.1109/ICASSP49357.2023.10096751⟩
Communication dans un congrès hal-04132313v1
Image document

SocialInteractionGAN: Multi-person Interaction Sequence Generation

Louis Airale , Dominique Vaufreydaz , Xavier Alameda-Pineda
IEEE Transactions on Affective Computing, 2022, ⟨10.1109/TAFFC.2022.3171719⟩
Article dans une revue hal-03163467v2

Probabilistic Graph Attention Network with Conditional Kernels for Pixel-Wise Prediction

Dan Xu , Xavier Alameda-Pineda , Wanli Ouyang , Elisa Ricci , Xiaogang Wang , et al.
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44 (5), pp.2673-2688. ⟨10.1109/TPAMI.2020.3043781⟩
Article dans une revue hal-03328687v1
Image document

Finding Audio-Visual Events in Informal Social Gatherings

Xavier Alameda-Pineda , Vasil Khalidov , Radu Horaud , Florence Forbes
ACM/IEEE International Conference on Multimodal Interaction, Nov 2011, Alicante, Spain. pp.247-254, ⟨10.1145/2070481.2070527⟩
Communication dans un congrès inria-00623489v2
Image document

Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping

Laurent Girin , Thomas Hueber , Xavier Alameda-Pineda
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, 25 (3), pp.662-673. ⟨10.1109/TASLP.2017.2651398⟩
Article dans une revue hal-01485540v1
Image document

Exploiting the Intermittency of Speech for Joint Separation and Diarization

Dionyssos Kounades-Bastian , Laurent Girin , Xavier Alameda-Pineda , Radu Horaud , Sharon Gannot
WASPAA 2017 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, NY, United States. pp.41-45, ⟨10.1109/WASPAA.2017.8169991⟩
Communication dans un congrès hal-01568813v1
Image document

Multi-Paced Dictionary Learning for Cross-Domain Retrieval and Recognition

Dan Xu , Jingkuan Song , Xavier Alameda-Pineda , Elisa Ricci , Nicu Sebe
IEEE International Conference on Pattern Recognition, Dec 2016, Cancun, Mexico. pp.3228-3233, ⟨10.1109/ICPR.2016.7900132⟩
Communication dans un congrès hal-01416419v1
Image document

Unsupervised Speech Enhancement using Dynamical Variational Autoencoders

Xiaoyu Bie , Simon Leglaive , Xavier Alameda-Pineda , Laurent Girin
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022, 30, pp.2993 - 3007. ⟨10.1109/TASLP.2022.3207349⟩
Article dans une revue hal-03295630v1
Image document

Les auto-encodeurs variationnels dynamiques et leur application à la modélisation de spectrogrammes de parole

Laurent Girin , Xiaoyu Bie , Simon Leglaive , Thomas Hueber , Xavier Alameda-Pineda
JEP 2022 - 34e Journées d’Études sur la Parole, Université de Nantes, Jun 2022, Noirmoutier, France. pp.655-663, ⟨10.21437/JEP.2022-69⟩
Communication dans un congrès hal-03978396v1

Variational Structured Attention Networks for Deep Visual Representation Learning

Guanglei Yang , Paolo Rota , Xavier Alameda-Pineda , Dan Xu , Mingli Ding , et al.
2021
Pré-publication, Document de travail hal-03296152v1
Image document

Audio-Visual Variational Fusion for Multi-Person Tracking with Robots

Xavier Alameda-Pineda , Soraya Arias , Yutong Ban , Guillaume Delorme , Laurent Girin , et al.
ACMMM 2019 - 27th ACM International Conference on Multimedia, Oct 2019, Nice, France. pp.1059-1061, ⟨10.1145/3343031.3350590⟩
Communication dans un congrès hal-02354514v1

Learning and controlling the source-filter representation of speech with a variational autoencoder

Samir Sadok , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda , Renaud Seguier
CFA 2022 - 16ème Congrès Français d'Acoustique, Société Française d'Acoustique (SFA), Apr 2022, Marseille, France
Communication dans un congrès hal-03603791v1