
Project: Unified Transcription and Translation for Extended Reality

Acronym: UTTER
Main Objective:
The research in UTTER is motivated by the following two use-cases:
- Virtual Assistant for Online Multilingual Meetings: The assistant should be able to translate from the language of the speaker into the language of the listener, produce a summary of the meeting, and note action points (minuting).
- Multilingual Customer Service Dialogue Tool: The tool should enable a customer service agent to provide support to global users (by text or voice) in cases where the customer and agent speak different languages. It should use the context of the conversation to assist the agent, offering guidance for helpful, personalized answers that take into account the formality level and the cultural context of the customer and the brand, and it should monitor the satisfaction and engagement of the customer.

We tackle these use-cases by extending the state-of-the-art in translation and summarisation in the following ways:
- Translation should be multimodal, i.e. equally strong for speech input as it is for text input.
- All language technologies should be multilingual; in this project we will cover 6 languages: English, French, German, Portuguese, Dutch, and Korean.
- Dialogue generation and translation should take into account the context of the conversation and its history, as well as other forms of context, such as the meeting notes, desired politeness level, and the speakers’ emotional status (e.g. the sentiment of the customer).
- Translation of speech should be able to take into account paralinguistic aspects such as intonation, as this can often change the meaning of the utterance, and should track the identity of the speaker.
- A summary of a meeting should include the action points generated by the meeting (i.e. the minutes).
- Summarisation and minuting should be explainable; in other words, everything included in the meeting summary should be traceable to the content of the meeting.
- Translation and summarisation should be efficient; in particular, speech translation should be real-time.
- Systems should be robust and confidence-aware: they should be resilient to typos, acoustic noise, and recognition errors, and they should be able to report their uncertainty.

We will achieve these through the use of pre-trained XR models, but in a fully open pipeline, where questions of bias, fairness, risk, etc. can be examined.
Reference: Horizon Europe, contract 101070631
Funding: EU
Start Date: 01-10-2022
End Date: 30-09-2025
Team: André Filipe Torres Martins, Gabriel Falcao Paiva Fernandes, Chrysoula Zerva, António José Evaristo Farinhas, Duarte Miguel Rodrigues dos Santos Marques Alves, Benjamin Paul Oscar Peters, Nuno Miguel Serrano Guerreiro, Marcos Vinicius Treviso, Xiaocheng Li, Kshitij Vitthal Ambilduke, Sonal Sannigrahi, Miguel Moura Ramos, Patrick Santos Fernandes
Groups: Pattern and Image Analysis – Lx, Multimedia Signal Processing – Co
Partners: University of Amsterdam, University of Edinburgh, Naver Labs, Unbabel, Instituto de Telecomunicações
Local Coordinator: André Filipe Torres Martins
Links: https://cordis.europa.eu/project/id/101070631

Associated Publications
  • Papers in Journals (1)
  • C. Zerva, A. Martins, Conformalizing Machine Translation Evaluation, Transactions of the Association for Computational Linguistics, Vol. 12, pp. 1460-1478, November, 2024
  • Papers in Conferences (18)
  • G. Gomes, B. Martins, C. Zerva, Evaluation of Multilingual Image Captioning: How far can we get with CLIP models?, Annual Conference of the North American Chapter of the Association for Computational Linguistics NAACL, Albuquerque, New Mexico, United States, April, 2025
  • J. Mire, Z. Aysola, D. Chechelnitsky, N. Deas, C. Zerva, M. Sap, Rejected Dialects: Biases Against African American Language in Reward Models, Annual Conference of the North American Chapter of the Association for Computational Linguistics NAACL, Albuquerque, New Mexico, United States, April, 2025
  • G. Faria, S. A. Agrawal, A. F. Farinhas, R. Rei, J. G. de Souza, A. Martins, QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation, Advances in Neural Information Processing Systems - NIPS, Vancouver, Canada, December, 2024
  • A. Himmi, G. Staerman, M. Picot, P. Colombo, N. Guerreiro, Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation, Empirical Methods in Language Processing - EMNLP, Miami, United States, November, 2024
  • C. Zerva, F. Blain, J. G. Sousa, D. Kanojia, S. Deoghar, N. Guerreiro, G. A. Attanasio, R. Rei, C. Orasan, M. Negri, M. Turchi, R. Chatterjee, P. Bhattacharyya, M. Freitag, A. Martins, Findings of the Quality Estimation Shared Task at WMT 2024: Are LLMs Closing the Gap in QE?, Conference on Machine Translation WMT, Miami, United States, November, 2024
  • M. Freitag, N. Mathur, D. Deutsch, C. Lo, E. Avramidis, R. Rei, B. Thompson, T. Kocmi, F. Blain, J. Wang, D. Adelani, M. Buchicchio, C. Zerva, A. Lavie, Are LLMs Breaking MT Metrics? Results of the WMT24 Metrics Shared Task, Conference on Machine Translation WMT, Miami, United States, November, 2024
  • K. Thomas, G. F. Filandrianos, M. Lymperaiou, C. Zerva, G. Stamou, "I Never Said That": A dataset, taxonomy and baselines on response clarity classification, Empirical Methods in Language Processing - EMNLP, Miami, United States, November, 2024
  • P. Martins, N. Guerreiro, P. Fernandes, J. Alves, R. Rei, D. A. Alves, J. Pombal, M. Farajian, M. Faysse, P. Colombo, B. Haddow, J. G. de Souza, A. Birch, A. Martins, EuroLLM: Multilingual Language Models for Europe, EuroHPC User Day, Amsterdam, Netherlands, October, 2024
  • N. Guerreiro, R. Rei, D. Stigt, L. C. Coheur, P. Colombo, A. Martins, xCOMET: Transparent Machine Translation Evaluation through Fine-grained Error Detection, Empirical Methods in Language Processing - EMNLP, Bangkok, Thailand, August, 2024
  • A. F. Farinhas, D. Ulmer, C. Zerva, A. Martins, Non-Exchangeable Conformal Risk Control, International Conference on Learning Representations ICLR, Vienna, Austria, May, 2024
  • N. Guerreiro, D. A. Alves, J. Waldendorf, B. Haddow, A. Birch, P. Colombo, A. Martins, Hallucinations in Large Multilingual Translation Models, Empirical Methods in Language Processing - EMNLP, Singapore, Singapore, December, 2023
  • D. A. Alves, N. Guerreiro, J. Alves, J. Pombal, R. Rei, J. G. de Souza, P. Colombo, A. Martins, Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning, Empirical Methods in Language Processing - EMNLP, Singapore, Singapore, December, 2023
  • P. Fernandes, A. Madaan, E. L. Liu, A. F. Farinhas, P. H. Martins, A. Bertsch, J. G. de Souza, S. Z. Zhou, S. W. Wu, G. Neubig, A. Martins, Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation, Empirical Methods in Language Processing - EMNLP, Singapore, Singapore, December, 2023
  • F. Blain, C. Zerva, R. Rei, N. Guerreiro, D. Kanojia, J. G. Sousa, B. Silva, T. Vaz, Y. Jinxuan, F. Azadi, C. Orasan, A. Martins, Findings of the WMT 2023 Shared Task on Quality Estimation, Conference on Machine Translation WMT, Singapore, Singapore, December, 2023
  • M. Freitag, N. Mathur, C. Lo, E. Avramidis, R. Rei, B. Thompson, T. Kocmi, F. Blain, D. Deutsch, C. Stuart, C. Zerva, S. Castilho, A. Lavie, G. Foster, Results of WMT23 Metrics Shared Task: Metrics might be Guilty but References are not Innocent, Conference on Machine Translation WMT, Singapore, Singapore, December, 2023
  • S. Honda, P. Fernandes, C. Zerva, Context-aware Neural Machine Translation for English-Japanese Business Scene Dialogues, Machine Translation Summit MT Summit, Macau, China, September, 2023
  • R. Rei, J. G. Sousa, D. M. A. Alves, C. Zerva, A. C. Farinha, T. Glushkova, A. Lavie, L. C. Coheur, A. Martins, COMET-22: Unbabel-IST 2022 submission for the metrics shared task, Conference on Machine Translation WMT, Abu Dhabi, United Arab Emirates, December, 2022
  • C. Zerva, F. Blain, R. Rei, P. Lertvittayakumjorn, J. G. Sousa, S. Eger, D. Kanojia, D. A. Alves, C. Orasan, M. Fomicheva, A. Martins, L. Specia, Findings of the WMT 2022 Shared Task on Quality Estimation, Conference on Machine Translation WMT, Abu Dhabi, United Arab Emirates, December, 2022