Abstract
Understanding the semantic shifts of multimodal information is only possible with models that capture cross-modal interactions over time. Under this paradigm, a new embedding is needed that structures visual-textual interactions according to the temporal dimension, thus, preserving data's original temporal organisation. This paper introduces a novel diachronic cross-modal embedding (DCM), where cross-modal correlations are represented in embedding space, throughout the temporal dimension, preserving semantic similarity at each instant t. To achieve this, we trained a neural cross-modal architecture, under a novel ranking loss strategy, that for each multimodal instance, enforces neighbour instances' temporal alignment, through subspace structuring constraints based on a temporal alignment window. Experimental results show that our DCM embedding successfully organises instances over time. Quantitative experiments, confirm that DCM is able to preserve semantic cross-modal correlations at each instant t while also providing better alignment capabilities. Qualitative experiments unveil new ways to browse multimodal content and hint that multimodal understanding tasks can benefit from this new embedding.
| Original language | English |
|---|---|
| Title of host publication | MM 2019 - Proceedings of the 27th ACM International Conference on Multimedia |
| Place of Publication | New York, NY, USA |
| Publisher | ACM - Association for Computing Machinery |
| Pages | 2061-2069 |
| ISBN (Print) | 978-1-4503-6889-6 |
| DOIs | |
| Publication status | Published - 15 Oct 2019 |
| Event | MM '19: The 27th ACM International Conference on Multimedia - Nice, France Duration: 21 Oct 2019 → 25 Oct 2021 |
Conference
| Conference | MM '19: The 27th ACM International Conference on Multimedia |
|---|---|
| Country/Territory | France |
| City | Nice |
| Period | 21/10/19 → 25/10/21 |
Fingerprint
Dive into the research topics of 'Diachronic cross-modal embeddings'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver