Challenges on the Automatic Translation of Collocations

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Downloads (Pure)


As a linguistic phenomenon, collocations have been the subject of numerous re- searches both in the fields of theoreti- cal and descriptive linguistics, and, more recently, in automatic Natural Language Processing. In the area of Machine there is still improvements to be done, as ma- jor translation engines do not handle col- locations in the appropriate way and end up producing literal unsatisfactory transla- tions. Having as a starting point our pre- vious work on machine translation error analysis (Costa et al., 2015), in this ar- ticle we present a corpus annotated with collocation errors and their classification. To our believe, to have a clear understand- ing of the difficulties that the collocations represent to the Machine Translations en- gines, it is necessary a detailed linguistic analysis of their errors.
Original languageEnglish
Title of host publicationWorkshop Proceedings on Multi-word Units in Machine Translation and Translation Technology
EditorsGloria Pastor, Johanna Monti, Violeta Seretan, Ruslan Mitkov
Place of PublicationGeneva
PublisherEditions Tradulex
Number of pages4
ISBN (Print)978-2-9700736-9-7
Publication statusPublished - 2016
EventEUROPHRAS 2015 - Malaga, Spain
Duration: 29 Jun 20152 Jul 2015


ConferenceEUROPHRAS 2015
Internet address


  • Collocations
  • Machine Translation

Fingerprint Dive into the research topics of 'Challenges on the Automatic Translation of Collocations'. Together they form a unique fingerprint.

  • Cite this

    Costa, Â., Lino, M. T. R. F., & Coheur, L. (2016). Challenges on the Automatic Translation of Collocations. In G. Pastor, J. Monti, V. Seretan, & R. Mitkov (Eds.), Workshop Proceedings on Multi-word Units in Machine Translation and Translation Technology (pp. 74-77). Editions Tradulex.