Search - HAL open archive


7 results

TubeDETR: Spatio-Temporal Video Grounding with Transformers

Antoine Yang , Antoine Miech , Josef Sivic , Ivan Laptev , Cordelia Schmid
CVPR 2022 - IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States
Conference paper hal-03625586v2

Learning Visual Language Models for Video Understanding

Antoine Yang
Computer Vision and Pattern Recognition [cs.CV]. Ecole Normale Superieure de Paris - ENS Paris, 2023. English. ⟨NNT : ⟩
Thesis tel-04307117v2

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning

Antoine Yang , Arsha Nagrani , Paul Hongsuck Seo , Antoine Miech , Jordi Pont-Tuset , et al.
CVPR 2023 - IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2023, Vancouver, Canada
Conference paper hal-04039246v1

VidChapters-7M: Video Chapters at Scale

Antoine Yang , Arsha Nagrani , Ivan Laptev , Josef Sivic , Cordelia Schmid
NeurIPS 2023 - Conference on Neural Information Processing Systems - Track on Datasets and Benchmarks, Dec 2023, New Orleans (LA), United States
Conference paper hal-04217697v1

Just Ask: Learning to Answer Questions from Millions of Narrated Videos

Antoine Yang , Antoine Miech , Josef Sivic , Ivan Laptev , Cordelia Schmid
ICCV 2021 - IEEE International Conference on Computer Vision, Oct 2021, Montréal, Canada
Conference paper hal-03328749v1

Zero-Shot Video Question Answering via Frozen Bidirectional Language Models

Antoine Yang , Antoine Miech , Josef Sivic , Ivan Laptev , Cordelia Schmid
NeurIPS 2022 - 36th Conference on Neural Information Processing Systems, Nov 2022, New Orleans, United States
Conference paper hal-03807016v2

Learning to Answer Visual Questions from Web Videos

Antoine Yang , Antoine Miech , Josef Sivic , Ivan Laptev , Cordelia Schmid
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, ⟨10.1109/tpami.2022.3173208⟩
Journal article hal-03664182v1