Filtrer vos résultats
- 143
- 27
- 128
- 24
- 5
- 3
- 3
- 3
- 2
- 2
- 4
- 1
- 167
- 23
- 9
- 2
- 1
- 1
- 1
- 4
- 3
- 10
- 9
- 6
- 2
- 11
- 13
- 12
- 12
- 17
- 23
- 13
- 9
- 6
- 9
- 1
- 6
- 1
- 2
- 143
- 27
- 168
- 167
- 14
- 8
- 7
- 6
- 5
- 4
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 170
- 51
- 17
- 16
- 15
- 14
- 12
- 10
- 9
- 8
- 7
- 7
- 6
- 6
- 6
- 5
- 5
- 5
- 5
- 5
- 5
- 5
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
170 résultats
|
SSIG and IRISA at Multimodal Person DiscoveryWorking Notes Proceedings of the MediaEval Workshop, 2015, Wurzen, Germany
Communication dans un congrès
hal-01196171v1
|
||
|
Is it time to switch to Word Embedding and Recurrent Neural Networks for Spoken Language Understanding?InterSpeech, Sep 2015, Dresde, Germany
Communication dans un congrès
hal-01196915v1
|
||
|
Multimodal and Crossmodal Representation Learning from Textual and Visual Features with Bidirectional Deep Neural Networks for Video HyperlinkingACM Multimedia 2016 Workshop: Vision and Language Integration Meets Multimedia Fusion (iV&L-MM'16), ACM Multimedia, Oct 2016, Amsterdam, Netherlands
Communication dans un congrès
hal-01374727v1
|
||
|
Sequential pattern mining on multimedia dataEuropean Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Database Workshop on Advanced Analytics and Learning on Temporal Data, 2015, Porto, Portugal
Communication dans un congrès
hal-01186446v1
|
||
Stochastic Models for Multimodal Video AnalysisMaragos, Petros and Potamianos, Alexandros and Gros, Patrick. Multimodal Processing and Interaction, 33, Springer, pp.89-107, 2008, 978-0-387-76315-6. ⟨10.1007/978-0-387-76316-3_3⟩
Chapitre d'ouvrage
hal-00770993v1
|
|||
|
Babaz: a large scale audio search system for video copy detectionICASSP - 37th International Conference on Acoustics, Speech, and Signal Processing, Mar 2012, Kyoto, Japan
Communication dans un congrès
hal-00661581v1
|
||
|
Retrieving Geo-Location of Videos with a Divide & Conquer Hierarchical Multimodal ApproachICMR - International Conference of Multimedia Retrieval, Apr 2013, Dallas, United States
Communication dans un congrès
hal-00801698v1
|
||
An efficient method for the unsupervised discovery of signalling motifs in large audio streamsInternational Workshop on Content-Based Multimedia Indexing, Jun 2011, Madrid, Spain
Communication dans un congrès
inria-00572817v1
|
|||
Unsupervised Motif Acquisition in Speech via Seeded Discovery and Template Matching CombinationIEEE Transactions on Audio, Speech and Language Processing, 2012, 20 (7), pp.2031 - 2044. ⟨10.1109/TASL.2012.2194283⟩
Article dans une revue
hal-00740978v1
|
|||
|
The Spoken Web Search TaskWorking Notes Proceedings of the MediaEval 2012 Workshop, 2012, Italy
Communication dans un congrès
hal-00757594v1
|
||
|
A framework for integrating heterogeneous sporadic knowledge sources into automatic speech recognitionWorkshop on Speech, Language and Audio in Multimedia, 2013, France. pp.37-42
Communication dans un congrès
hal-00906348v1
|
||
|
IRISA at TRECVid 2017: Beyond Crossmodal and Multimodal Models for Video HyperlinkingWorking Notes of the TRECVid 2017 Workshop, 2017, Gettysburg, United States
Communication dans un congrès
hal-01643232v1
|
||
|
NexGenTV: Providing Real-Time Insight during Political Debates in a Second Screen ApplicationMM 2017 - 25th ACM International Conference on Multimedia, Oct 2017, Moutain View, United States
Communication dans un congrès
hal-01635966v1
|
||
|
Filtrage et régularisation pour améliorer la plausibilité des poids d'attention dans la tâche d'inférence en langue naturelleTALN 2022 - Traitement Automatique des Langues Naturelles, Jun 2022, Avignon, France. pp.95-103
Communication dans un congrès
hal-03701492v1
|
||
Is Syllable Stress Information Robust for ASR in Adverse Conditions?International Conference on Speech Prosody, May 2014, Dublin, Ireland. pp.939-943
Communication dans un congrès
hal-01026423v1
|
|||
|
Technicolor/INRIA/Imperial College London at the MediaEval 2012 Violent Scene Detection TaskWorking Notes Proceedings of the MediaEval 2012 Workshop, 2012, Italy
Communication dans un congrès
hal-00757584v1
|
||
|
Morphosyntactic resources for automatic speech recognition6th International Conference on Language Resources and Evaluation (LREC), 2008, Marrakech, Morocco
Communication dans un congrès
hal-02021879v1
|
||
|
Constraint selection for topic-based MDI adaptation of language models10th Annual Conference of the International Speech Communication Association, Interspeech'09, Sep 2009, Brighton, United Kingdom. pp.368--371
Communication dans un congrès
hal-00760610v1
|
||
|
Séparation de sources à partir d'un seul capteur pour la reconnaissance robuste de la paroleJournées d'Etude sur la Parole: JEP 2004, Apr 2004, Fès, Maroc
Communication dans un congrès
inria-00567339v1
|
||
|
Audio Event Detection in Movies using Multiple Audio Words and Contextual Bayesian NetworksCBMI - 11th International Workshop on Content Based Multimedia Indexing - 2013, Jun 2013, Veszprém, Hungary
Communication dans un congrès
hal-00822022v1
|
||
|
Texmix: an automatically generated news navigation portalICMR - ACM International Conference on Multimedia Retrieval, ACM, Jun 2012, Hong-Kong, China. ⟨10.1145/2324796.2324868⟩
Communication dans un congrès
hal-00767253v1
|
||
|
Efficient Mining of Repetitions in Large-Scale TV Streams with Product Quantization HashingWorkshop on Web-scale Vision and Social Media, in conjunction with ECCV, Oct 2012, Firenze, Italy
Communication dans un congrès
hal-00731090v1
|
||
|
Zero-resource audio-only spoken term detection based on a combination of template matching techniquesINTERSPEECH 2011: 12th Annual Conference of the International Speech Communication Association, Aug 2011, Florence, Italy
Communication dans un congrès
inria-00597907v1
|
||
|
Investigating domain-independent NLP techniques for precise target selection in video hyperlinkingISCA/IEEE Workshop on Speech, Language and Audio in Multimedia, Sep 2014, Penang, Malaysia
Communication dans un congrès
hal-01053698v1
|
||
|
Audio word similarity for clustering with zero resources based on iterative HMM classificationInternational Conference on Acoustics, Speech and Signal Processing, ICASSP, Mar 2016, Shanghai, China. pp.5340 - 5344, ⟨10.1109/ICASSP.2016.7472697⟩
Communication dans un congrès
hal-01394757v1
|
||
|
IRISA at TrecVid2015: Leveraging Multimodal LDA for Video HyperlinkingTRECVid 2015 Workshop, Nov 2015, Gaithersburg, United States
Communication dans un congrès
hal-01403726v1
|
||
|
Generative Adversarial Networks for Multimodal Representation Learning in Video HyperlinkingACM International Conference on Multimedia Retrieval (ICMR) 2017, ACM, Jun 2017, Bucharest, Romania. ⟨10.1145/3078971.3079038⟩
Communication dans un congrès
hal-01522419v1
|
||
|
Towards large scale multimedia indexing: A case study on person discovery in broadcast newsContent-Based Multimedia Indexing CBMI, Jun 2017, Firenze, Italy. ⟨10.1145/3095713.3095732⟩
Communication dans un congrès
hal-01551690v1
|
||
|
A Study of the Plausibility of Attention between RNN Encoders in Natural Language InferenceICMLA 2021 - 20th IEEE International Conference on Machine Learning and Applications, Dec 2021, Pasadena, United States. pp.1-7
Communication dans un congrès
hal-03372669v1
|
||
|
Rethinking deep active learning: Using unlabeled data at model trainingICPR 2020 - 25th International Conference on Pattern Recognition, Jan 2021, Milan, Italy. pp.1-12
Communication dans un congrès
hal-02372102v1
|