Abstract
Original language | English |
---|---|
Title of host publication | Pattern Recognition and Image Analysis |
Subtitle of host publication | 11th Iberian Conference, IbPRIA 2023, Alicante, Spain, June 27–30, 2023, Proceedings |
Editors | Antonio Pertusa, Antonio Javier Gallego, Joan Andreu Sánchez, Inês Domingues |
Place of Publication | Cham |
Publisher | Springer |
Pages | 67-81 |
Number of pages | 15 |
ISBN (Electronic) | 978-3-031-36616-1 |
ISBN (Print) | 978-3-031-36615-4 |
DOIs | |
Publication status | Published - 2023 |
Event | 11th Iberian Conference on Pattern Recognition and Image Analysis, IbPRIA 2023 - Alicante, Spain Duration: 27 Jun 2023 → 30 Jun 2023 |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Volume | 14062 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 11th Iberian Conference on Pattern Recognition and Image Analysis, IbPRIA 2023 |
---|---|
Country/Territory | Spain |
City | Alicante |
Period | 27/06/23 → 30/06/23 |
Keywords
- batch size
- Embeddings learning
- priority batch priority queue
Fingerprint
Dive into the research topics of 'Learning Semantic-Visual Embeddings with a Priority Queue'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver
}
Pattern Recognition and Image Analysis: 11th Iberian Conference, IbPRIA 2023, Alicante, Spain, June 27–30, 2023, Proceedings. ed. / Antonio Pertusa; Antonio Javier Gallego; Joan Andreu Sánchez; Inês Domingues. Cham: Springer, 2023. p. 67-81 (Lecture Notes in Computer Science; Vol. 14062 LNCS).
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
TY - GEN
T1 - Learning Semantic-Visual Embeddings with a Priority Queue
AU - Valério, Rodrigo
AU - Magalhães, João
N1 - Funding Information: Acknowledgements. This work was partially funded by the Horizon EU project MUSAE (No. 01070421), 2021-SGR-01094 (AGAUR), Icrea Academia’2022 (General-itat de Catalunya), Robo STEAM (2022-1-BG01-KA220-VET-000089434, Erasmus+ EU), DeepSense (ACE053/22/000029, ACCIÓ), DeepFoodVol (AEI-MICINN, PDC-2022-133642-I00) and CERCA Programme/Generalitat de Catalunya. B. Nagarajan acknowledges the support of FPI Becas, MICINN, Spain. We acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPUs. As Serra Húnter Fellow, Ricardo Marques acknowledges the support of the Serra Húnter Programme. Funding Information: This work was partially supported by Grant PID2020-115225RB-I00 funded by MCIN/AEI/ 10.13039/501100011033. Funding Information: Supported by Pattern Recognition and Human Language Technology Center (PRHLT). Funding Information: Acknowledgments. This paper is part of the project I+D+i PID2020-118447RAI00 (MultiScore), funded by MCIN/AEI/10.13039/501100011033. The second author is supported by grant FPU19/04957 from the Spanish Ministerio de Universidades. Funding Information: Acknowledgements. This work is Co-financed by Component 5 - Capitalization and Business Innovation, integrated in the Resilience Dimension of the Recovery and Resilience Plan within the scope of the Recovery and Resilience Mechanism (MRR) of the European Union (EU), framed in the Next Generation EU, for the period 2021– 2026, and by National Funds through the Portuguese funding agency, FCT-Foundation for Science and Technology Portugal, a PhD Grant Number 2021.06275. Funding Information: Thanks to VLIRUOS for financial support in the framework of the Institutional University Cooperation project with Universidad de Oriente, Cuba. Funding Information: Acknowledgment. This research was partially funded by the Spanish Ministerio de Ciencia e Innovación (grant number PID2020-112623GB-I00), and the Galician Consellería de Cultura, Educación e Universidade (grant numbers ED431C 2018/29, ED431C 2021/048, ED431G 2019/04). These grants are co-funded by the European Regional Development Fund (ERDF). Funding Information: Acknowledgements. This work was partially funded by the FCT project NOVA LINCS (UIDP/04516/2020), and the CMU Portugal project iFetch (LISBOA-01-0247-FEDER-045920). Funding Information: Supported by Generalitat Valenciana under AICO 2023 program. Funding Information: Acknowledgments. This work was supported by project PEGADA 4.0 (PRR-C05-i03-000099), financed by the PPR - Plano de Recupera¸cão e Resiliência and by national funds through FCT, within the scope of the project CISUC (UID/CEC/00326/2020). Funding Information: Acknowledgements. This research was in part sponsored by the NATO Science for Peace and Security Programme under grant id. G6032. Funding Information: Acknowledgements. Work partially supported by the research grants: the SimancasSearch project as Grant PID2020-116813RB-I00a funded by MCIN/AEI/ 10.13039/501100011033 and ValgrAI - Valencian Graduate School and Research Network of Artificial Intelligence and the Generalitat Valenciana, co-funded by the European Union. The second author’s work was partially supported by the Universitat Politècnica de València under grant FPI-I/SP20190010. The third author’s work is supported by a María Zambrano grant from the Spanish Ministerio de Universidades and the European Union NextGenerationEU/PRTR. Funding Information: This project is supported by CONACYT (PhD scholarship) and DGAPA PAPIIT IN105222. We thank the anonymous reviewers for their helpful feedback. We gratefully acknowledge the support of Paul Hernandez-Herrera in image segmentation. Funding Information: Supported by ZF Friedrichshafen AG. Funding Information: This research/project is supported by the National Research Foundation, Singapore under its Industry Alignment Fund - Pre-positioning (IAF-PP) Funding Initiative. Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not reflect the views of the National Research Foundation, Singapore. Funding Information: Acknowledgement. We would like to thank Andre Kåsen from the National Library of Norway for providing the Norwegian Handwritten word images used in this study, We are grateful to Simula HPC cluster, The research presented in this paper has benefited from the Experimental Infrastructure for Exploration of Exascale Computing (eX3), which the Research Council of Norway financially supports under contract 270053, We are grateful to Professor Marius Pedersen from Department of Computer Science at Norwegian University of Science and Technology, Gjøvik, for his informative comments, invaluable support, and encouragement throughout this work. This work is supported under the aegis of the Hugin-Munin project funded by the Norwegian Research Council (Project number 328598). Funding Information: Acknowledgment. Work supported by the Horizon 2020 - European Commission (H2020) under the SELENE project (grant agreement no 871467) and the project Deep learning for adaptive and multimodal interaction in pattern recognition (DeepPattern) (grant agreement PROMETEO/2019/121). We gratefully acknowledge the support of NVIDIA Corporation with the donation of a GPU used for part of this research. Funding Information: Supported by the MCIN Project TED2021-129151B-I00/AEI/10.13039/ 501100011 033/ European Union NextGenerationEU/PRTR, project PID2019-103871GB-I00 of the Spanish Ministry of Economy, Industry and Competitiveness, and project PAIDI P20 00430 of the Junta de Andalućıa, FEDER. We thank H. Sarmadi his contribution to the data preparation. Funding Information: Acknowledgment. This work was funded by the New Zealand Ministry of Business, Innovation and Employment under contract C09X1923 (Catalyst: Strategic Fund). Funding Information: Supported by the Portuguese Foundation for Science and Technology — FCT within PhD grant 2020.06434.BD. Funding Information: This paper is part of the I+D+i project PID2020-118447RA-I00 (MultiScore), funded by MCIN/AEI/10.13039/501100011033. Funding Information: This work is supported by FEDER, through POR LISBOA 2020 and COMPETE 2020 of the Portugal 2020 Project CityCatalyst POCI-01-0247-FEDER-046119. Ana Almeida acknowledges the Doctoral Grant from Funda¸cão para a Ciência e Tecnologia (2021.06222.BD). Susana Brás is funded by national funds, European Regional Development Fund, FSE, through COMPETE2020 and FCT, in the scope of the framework contract foreseen in the numbers 4, 5 and 6 of the article 23, of the Decree-Law 57/2016, of August 29, changed by Law 57/2017, of July 19. We thank OpenWeather for providing the datasets. Funding Information: Acknowledgments. This work was partially supported by Analog Devices, Inc. and by the Agencia Valenciana de la Innovacion of the Generalitat Valenciana under program “Plan GEnT. Doctorados Industriales. Innodocto” Funding Information: Supported by VINNOVA (projects 2017-02447, 2020-03611, 2021-01420) and the Centre for Interdisciplinary Mathematics (CIM), Uppsala University. Funding Information: Acknowledgments. This work has received financial support from the Spanish government (project PID2020-119367RB-I00); from the Xunta de Galicia, Consellaría de Cultura, Educación e Ordenación Universitaria (accreditations 2019-2022 ED431G-2019/04 and ED431G 2019/01, and reference competitive groups 2021-2024 ED431C 2021/48 and ED431C 2021/30), and from the European Regional Development Fund (ERDF/FEDER). Funding Information: This work was partially funded by FCT - Funda¸cäo para a Ciência e a Tecnologia (FCT), I.P., through national funds, within the scope of the UIDB/00127/2020 project (IEETA/UA, http://www.ieeta.pt/). S. Brás acknowledges the support by national funds, European Regional Development Fund, FSE through COMPETE2020, through FCT, in the scope of the framework contract foreseen in the numbers 4, 5, and 6 of the article 23, of the Decree-Law 57/2016, of August 29, changed by Law 57/2017, of July 19. Funding Information: Supported by University of Canterbury, New Zealand. Funding Information: Acknowledgements. Work partially supported by the LARSyS - FCT Project [UIDB/50009/2020], the H2020 FET-Open project Reconstructing the Past: Artificial Intelligence and Robotics Meet Cultural Heritage (RePAIR) under EU grant agreement 964854, the Lisbon Ellis Unit (LUMLIS). Funding Information: This work was supported by the I+D+i project TED2021-132103A-I00 (DOREMI), funded by MCIN/AEI/10.13039/501100011033. Funding Information: Acknowledgements. The research was supported by the Joan Oró grant (FI 2022) from the DRU of the Generalitat de Catalunya and the European Social Fund (2023 FI-2 00160). The authors would also like to thank the Agència de Gestió d’Ajuts Uni-versitaris i de Recerca (AGAUR) of the Generalitat de Catalunya (2021 SGR01396, 2021 SGR00706, 2021 SGR1475), the Spanish Ministry of Science, Innovation, and Universities under grant PID2020-113609RB-C21, and Fondation Jerome Lejeune under grant 2020b cycle-Project No.2001. Funding Information: Acknowledgments. This work was supported by projects: TED2021-129410B-I00 (MCIN/AEI/10.13039/501100011033 and NextGenerationEU/PRTR) and JIUZ-2021-TEC-01 and the government of Panama under the IFARHU-SENACYT scholarship program for PhD studies. Funding Information: 501100011033 under the grant PID2020-116813RB-I00 (SimancasSearch); the General-itat Valenciana under the FPI grant CIACIF/2021/313; and by the support of valgrAI -Valencian Graduate School and Research Network of Artificial Intelligence and the Generalitat Valenciana, and co-funded by the European Union. Funding Information: Supported by the MCIN Project TED2021-129151B-I00/AEI/10.13039/ 501100011033/ European Union NextGenerationEU/PRTR, project PID2019-103871GB-I00 of the Spanish Ministry of Economy, Industry and Competitiveness, and project PAIDI P20 00430 of the Junta de Andalućıa, FEDER. Funding Information: Acknowledgements. This research is supported by Flanders Make, the strategic research Centre for the Manufacturing Industry and the Flemish Innovation and Entrepreneurship Agency through the research project ‘DAP2CHEM’ (project number: HBC.2020.2455). The authors would like to thank all project’s partners for their inputs and support to make this publication. Funding Information: Acknowledgments. This paper is part of the project I+D+i PID2020-118447RAI00 (MultiScore), funded by MCIN/AEI/10.13039/501100011033. The first author is supported by grant CIACIF/2021/465 from “Programa I+D+i de la Generalitat Valen-ciana“. The second author is supported by grant FPU19/04957 from the Spanish Min-isterio de Universidades. Funding Information: Acknowledgments. We thank Tragsatec’s Management of Agricultural and Fisheries Information Systems and the General Secretariat of Fisheries of the Spanish Ministry of Agriculture, Fisheries and Food for the data and expertise provided to carry out the study. This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sk lodowska-Curie grant agreement No 860813 - TReSPAsS-ETN. This study is also supported by the project INTER-ACTION (PID2021-126521OB-I00 MICINN/FEDER). Funding Information: Acknowledgments. This work was partially funded by the project “POCI-01-0247-FEDER-046964”, supported by Operational Program for Competitiveness and Internationalization (COMPETE 2020), under the PORTUGAL 2020 Partnership Agreement, through the European Regional Development Fund (ERDF 1). This work was also partially funded by national funds (PIDDAC), through the FCT - Funda¸cão para a Ciência e Tecnologia and FCT/MCTES under the scope of the projects UIDB/05549/2020 and UIDP/05549/2020. This paper was also partially funded by national funds, through the FCT - Funda¸cão para a Ciência e a Tecnologia and FCT/MCTES under the scope of the project LASI-LA/P/0104/2020. Funding Information: Acknowledgment. This work has been partially supported by MCIN/AEI/10.13039/ Publisher Copyright: © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - The Stochastic Gradient Descent (SGD) algorithm and margin-based loss functions have been the learning workhorse of choice to train deep metric learning networks. Often, the random nature of SGD will lead to the selection of sub-optimal mini-batches, several orders of magnitude smaller than the larger dataset. In this paper, we propose to augment SGD mini-batch with a priority learning queue, i.e., SGD+PQ. While the mini-batch SGD replaces all learning samples in the mini-batch at each iteration, the proposed priority queue replaces samples by removing the less informative ones. This novel idea introduces a sample update strategy that balances two sample removal criterion: (i) removal of stale samples from the PQ that are likely outdated, and (ii) removal of samples that are not contributing to the error, i.e. their sample error is not changing during training. Experimental results demonstrate the success of the proposed approach across three datasets.
AB - The Stochastic Gradient Descent (SGD) algorithm and margin-based loss functions have been the learning workhorse of choice to train deep metric learning networks. Often, the random nature of SGD will lead to the selection of sub-optimal mini-batches, several orders of magnitude smaller than the larger dataset. In this paper, we propose to augment SGD mini-batch with a priority learning queue, i.e., SGD+PQ. While the mini-batch SGD replaces all learning samples in the mini-batch at each iteration, the proposed priority queue replaces samples by removing the less informative ones. This novel idea introduces a sample update strategy that balances two sample removal criterion: (i) removal of stale samples from the PQ that are likely outdated, and (ii) removal of samples that are not contributing to the error, i.e. their sample error is not changing during training. Experimental results demonstrate the success of the proposed approach across three datasets.
KW - batch size
KW - Embeddings learning
KW - priority batch priority queue
UR - http://www.scopus.com/inward/record.url?scp=85164915590&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-36616-1_6
DO - 10.1007/978-3-031-36616-1_6
M3 - Conference contribution
AN - SCOPUS:85164915590
SN - 978-3-031-36615-4
T3 - Lecture Notes in Computer Science
SP - 67
EP - 81
BT - Pattern Recognition and Image Analysis
A2 - Pertusa, Antonio
A2 - Gallego, Antonio Javier
A2 - Sánchez, Joan Andreu
A2 - Domingues, Inês
PB - Springer
CY - Cham
T2 - 11th Iberian Conference on Pattern Recognition and Image Analysis, IbPRIA 2023
Y2 - 27 June 2023 through 30 June 2023
ER -