Publications
Recent publications and arXiv papers
L. Goncalves, P. Mathur, C. Lavania, M. Cekic, M. Federico, K. J. Han, "PEAVS: Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers’ Opinion Scores", arXiv, 2024. (Accepted at EECV)
B. Thompson, M. P. Dhaliwal, P. Frisch, T. Domhan, M. Federico, "A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism", Findings of the ACL, 2024.
J. Zuluaga-Gomez, Z. Huang, X. Niu, R. Paturi, S. Srinavasan, P. Mathur, B. Thompson, M. Federico, "End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation", Proceedings of EMNLP, 2023.
Y. Virkar, B. Thompson, R. Paturi, S. Srinivasan, M. Federico, "Speaker Diarization of Scripted Audiovisual Content", arXiv, 2023.
P. Pal, B. Thompson, Y. Virkar, P. Mathur, A. Chronopoulou, M. Federico, "Improving Isochronous Machine Translation with Target Factors and Auxiliary Counters", Proceedings of Interspeech, 2023.
Cuong Hoang, Devendra Sachan, Prashant Mathur, Brian Thompson, Marcello Federico, “Improving Retrieval Augmented Neural Machine Translation by Controlling Source and Fuzzy-Match Interactions”, Proceedings of EACL (Findings), 2023.
G. Vernikos, B. Thompson, P. Mathur, M. Federico, “Embarrassingly Easy Document-Level MT Metrics: How to Convert Any Pretrained Metric Into a Document-Level Metric”, EMNLP Conference on Machine Translation (WMT), 2022.
S. Majumder, S. Lauly, M. Nadeje, M. Federico, G. Dinu, "A baseline revisited: pushing the limits of multi-segment models for context-aware translation", arXiv, 2022
Cuong Hoang, Devendra Sachan, Prashant Mathur, Brian Thompson, Marcello Federico, “Improving Robustness of Retrieval Augmented Translation via Shuffling of Suggestions”, arXiv, 2022.
Felix Hieber, Michael Denkowski, Tobias Domhan, Barbara Darques Barros, Celina Dong Ye, Xing Niu, Cuong Hoang, Ke Tran, Benjamin Hsu, Maria Nadejde, Surafel Lakew, Prashant Mathur, Anna Currey, Marcello Federico, "Sockeye 3: Fast Neural Machine Translation with PyTorch", arXiv, 2022.
D. Tam, S. M. Lakew, Y. Virkar, P. Mathur, M. Federico, "Isochrony-aware Neural Machine Translation for Dubbing", Proceedings of Interspeech, 2022.
Y. Virkar, M. Federico, R. Enyedi, R. Barra-Chicote, "Prosodic alignment for off-screen automatic dubbing", Proceedings of Interspeech, 2022.
M. Nadejde, A. Currey, B. Hsu, X. Niu, M. Federico, G. Dinu, "CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality", Proceedings of NAACL, 2022.
S. M. Lakew, Y. Virkar, P. Mathur, M. Federico, "Isometric MT: Neural Machine Translation for Automatic Dubbing", Proceedings of IEEE-ICASSP, 2022.
J. Effendi, Y. Virkar, R. Barra-Chicote, M. Federico, "Duration Modeling for Neural TTS for Automatic Dubbing", Proceedings of IEEE-ICASSP, 2022.
Complete list of publications
Journals
S. M. Lakew, M. Federico, M. Negri, M. Turchi, "Multilingual Neural Machine Translation for Low-Resource Languages", Italian Journal of Computational Linguistics, Vol. 1, n. 4, pp. 11-26 , June 2018.
L. Bentivogli, A. Bisazza, M. Federico, M. Cettolo, "Neural versus Phrase-Based MT Quality: an In-Depth Analysis on English-German and English-French", Computer Speech and Language, Vol. 49, pp. 52-70, May 2018.
M. Negri, D. Ataman, M.J. Sabet, M. Turchi, M. Federico, "Automatic Translation Memory Cleaning", Machine Translation, 31:3, pp. 93-115, doi: 10.1007/s10590-017-9191-5, Sept 2017. (accepted version)
E. Barbu, C. P. Escartín, L. Bentivogli, M. Negri, M. Turchi, C. Orasan, M. Federico, "The first Automatic Translation Memory Cleaning Shared Task", Machine Translation, 30:3-4, pp. 145-166, doi: 10.1007/s10590-016-9183-x, Dec, 2016.
A. Bisazza, M. Federico, "A Survey of Word Reordering in Statistical Machine Translation: Computational Models and Language Phenomena", Computational Linguistics, 42:2, pp. 163-205, 2016.
L. Bentivogli, N. Bertoldi, M. Cettolo, M. Federico, M. Negri, M. Turchi, "On the Evaluation of Adaptive Machine Translation for Human Post-Editing", IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 2, pp. 388-399, 2016.
N. Bertoldi, P. Simianer, M. Cettolo, K. Waeschle, M. Federico, S. Riezler, "Online Adaptation to Post-Edits for Phrase-Based Statistical Machine Translation", Machine Translation, vol. 28, issue 3-4, pp. 309-339, 2014.
M. Cettolo, N. Bertoldi, M. Federico, H. Schwenk, L. Barrault, C. Servan, "Translation Project Adaptation for MT-Enhanced Computer Assisted Translation", Machine Translation, vol. 28, issue 2, pp.127-150, 2014.
M. Turchi, M. Negri, M. Federico, "Data-driven Annotation of Binary MT Quality Estimation Corpora Based on Human Post-editions", Machine Translation, vol. 28, issue 3-4, pp.281-308, 2014.
B. Magnin, M. Baroni, M. Federico, R. Navigli, "Recent Advancements in Human Language Technology in Italy", Intelligenza Artificiale, 194, pp. 2-27, Elsevier, 2013.
A Bisazza, M. Federico, “Dynamically Shaping the Reordering Search Space of Phrase-Based Statistical Machine Translation”, Transactions of the Association for Computational Linguistics, 1(July): pp. 327-340, 2013.
A. Bisazza, D. Pighin, M. Federico, “Chunk-lattices for Verb Reordering in Arabic–English Statistical MachineTranslation”, Machine Translation, vol. 26, no. 1-2, 2012.
N. Bertoldi, R. Zens, M. Federico, W. Shen, “Efficient Speech Translation Trough Confusion Network Decoding”, IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, no. 8, pp. 1696-1705, 2008.
Federico, Marcello, Diego Giuliani and Gianni Lazzari, “Recent Advances at FBK-irst in UnrestrictedSpoken Language Translation”, Magazine of the Information Processing Society of Japan, vol. 49, no. 6, pp. 624-628, 2008.
F. Casacuberta, M. Federico, H. Ney, E. Vidal, “Recent Efforts in Spoken Language Translation”, IEEE Signal Processing, vol. 25, no. 3, pp. 80-88, May 2008.
E. Matusov, G. Leusch, R. E. Banchs , N. Bertoldi, D. Dechelotte, M. Federico, M. Kolss, Y-S. Lee , Josè B. Marino , M. Paulik, S. Roukos, H. Schwenk, H. Ney “System Combination for Machine Translation of Spoken and Written Language”, IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, no. 7, pp. 1222-1237, September 2008.
M. Federico and N. Bertoldi, “A Word-to-Phrase Statistical Translation Model”, ACM Transactions on Speech and Language Processing, vol. 2., n. 2, pp. 1-24, 2005.
J. Goldman, S. Renals, S. Bird, F. de Jong, M. Federico, C. Fleischhauer, M. Kornbluh, L. Lamel, D. W. Oard, C. Stewart, and R. Wright, “Accessing the Spoken Word”, International Journal on Digital Libraries, vol. 5, n. 4, pp. 287-298. August, 2005.
M. Federico and N. Bertoldi, "BroadcastNews LM Adaptation over Time", Computer Speech and Language, vol. 18, n. 4, pp. 417-435, 2004.
N. Bertoldi and M. Federico, "Statistical Models for Monolingual and Bilingual InformationRetrieval", Information Retrieval, n. 7, 51-70, 2004.
N. Bertoldi, F. Brugnara, M. Cettolo, M. Federico and D. Giuliani, "Cross-Task Portability of a Broadcast NewsSpeech Recognition System", Speech Communication, vol. 38, n. 3-4, pp. 335-347, 2002.
M. Federico, "A System for the Retrieval of Italian Broadcast News", Speech Communication, n. 32, vol. 1-2, pp. 37-47, 2000.
M. Federico, M. Cettolo, F. Brugnara and G. Antoniol, "Language Modelling for Efficient Beam-Search". Computer Speech and Language, n. 9, pp. 353-379, 1995.
Conferences and workshops
L. Goncalves, P. Mathur, C. Lavania, M. Cekic, M. Federico, K. J. Han, "PEAVS: Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers’ Opinion Scores", arXiv, 2024. (Accepted at EECV)
B. Thompson, M. P. Dhaliwal, P. Frisch, T. Domhan, M. Federico, "A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism", Findings of the ACL, 2024.
J. Zuluaga-Gomez, Z. Huang, X. Niu, R. Paturi, S. Srinavasan, P. Mathur, B. Thompson, M. Federico, "End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation", Proceedings of EMNLP, 2023.
Sweta Agrawal, Antonios Anastasopoulos, Luisa Bentivogli, Ondřej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, Mingda Chen, William Chen, Khalid Choukri, Alexandra Chronopoulou, Anna Currey, Thierry Declerck, Qianqian Dong, Kevin Duh, Yannick Estève, Marcello Federico, Souhir Gahbiche, Barry Haddow, Benjamin Hsu, Phu Mon Htut, Hirofumi Inaguma, Dávid Javorský, John Judge, Yasumasa Kano, Tom Ko, Rishu Kumar, Pengwei Li, Xutai Ma, Prashant Mathur, Evgeny Matusov, Paul McNamee, John P. McCrae, Kenton Murray, Maria Nadejde, Satoshi Nakamura, Matteo Negri, Ha Nguyen, Jan Niehues, Xing Niu, Atul Kr. Ojha, John E. Ortega, Proyag Pal, Juan Pino, Lonneke van der Plas, Peter Polák, Elijah Rippeth, Elizabeth Salesky, Jiatong Shi, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Yun Tang, Brian Thompson, Kevin Tran, Marco Turchi, Alex Waibel, Mingxuan Wang, Shinji Watanabe, Rodolfo Zevallos, "Findings of the IWSLT 2023 Evaluation Campaign", In Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), pages 1–61, 2023.
P. Pal, B. Thompson, Y. Virkar, P. Mathur, A. Chronopoulou, M. Federico, "Improving Isochronous Machine Translation with Target Factors and Auxiliary Counters", Proceedings of Interspeech, 2023.
Cuong Hoang, Devendra Sachan, Prashant Mathur, Brian Thompson, Marcello Federico, “Improving Retrieval Augmented Neural Machine Translation by Controlling Source and Fuzzy-Match Interactions”, Proceedings of EACL (Findings), 2023.
G. Vernikos, B. Thompson, P. Mathur, M. Federico, “Embarrassingly Easy Document-Level MT Metrics: How to Convert Any Pretrained Metric Into a Document-Level Metric”, EMNLP Conference on Machine Translation (WMT), 2022.
D. Tam, S. M. Lakew, Y. Virkar, P. Mathur, M. Federico, "Isochrony-aware Neural Machine Translation for Dubbing", Proceedings of Interspeech, 2022.
Y. Virkar, M. Federico, R. Enyedi, R. Barra-Chicote, "Prosodic alignment for off-screen automatic dubbing", Proceedings of Interspeech, 2022.
M. Nadejde, A. Currey, B. Hsu, X. Niu, M. Federico, G. Dinu, "CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality", Proceedings of NAACL, 2022.
Antonios Anastasopoulos, Loïc Barrault, Luisa Bentivogli, Marcely Zanon Boito, Ondřej Bojar, Roldano Cattoni, Anna Currey, Georgiana Dinu, Kevin Duh, Maha Elbayad, Clara Emmanuel, Yannick Estève, Marcello Federico, Christian Federmann, Souhir Gahbiche, Hongyu Gong, Roman Grundkiewicz, Barry Haddow, Benjamin Hsu, "Findings of the IWSLT 2022 Evaluation Campaign", In Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022), pages 98–157, 2022.
S. M. Lakew, Y. Virkar, P. Mathur, M. Federico, "Isometric MT: Neural Machine Translation for Automatic Dubbing", Proceedings of IEEE-ICASSP, 2022.
J. Effendi, Y. Virkar, R. Barra-Chicote, M. Federico, "Duration Modeling for Neural TTS for Automatic Dubbing", Proceedings of IEEE-ICASSP, 2022.
A. Anastasopoulos, O. Bojar, J. Bremerman, R. Cattoni, M. Elbayad, M. Federico, X. Ma, S. Nakamura, M. Negri, J. Niehues, J. Pino, E. Salesky, S. Stüker, K. Sudoh, M. Turchi, A. Waibel, C. Wang, "Findings of the IWSLT 2021 Evaluation Campaign", Proceedings of the International Conference on Spoken Language Translation (ACL-IWSLT), pp. 1-29, 2021.
Md Mahfuz Ibn Alam, I. Kvapilíková, A. Anastasopoulos, L. Besacier, G. Dinu, M. Federico, M. Gallé, K. Jung, P. Koehn and V. Nikoulina, "Findings of the WMT Shared Task on Machine Translation Using Terminologies", Proceedings of the 6th Conference on Machine Translation (WMT21), 2021.
D. Vilar, M. Federico, "A statistical extension of byte-pair encoding", Proceedings of the International Conference on Spoken Language Translation (ACL-IWSLT), pp. 263-275, 2021.
M. Sharma, Y. Virkar, M. Federico, R. Barra-Chicote, R. Enyedi, "Intra-sentential speaking rate control in neural text-to-speech for automatic dubbing", Proceedings of Interspeech, 2021.
Y. Wang, C. Hoang, M. Federico, "Toward Modeling the Style of Translators in Neural Machine Translation", Proceedings of NAACL, 2021.
Y. Virkar, M. Federico, R. Enyedi, R. Barra-Chicote, "Improvements to Prosodic Alignment for Automatic Dubbing", Proceedings of IEEE-ICASSP, 2021.
S. M. Lakew, M. Federico, Y. Wang, C. Hoang, Y. Virkar, R. Barra-Chicote, R. Enyedi, "Machine Translation Verbosity Control for Automatic Dubbing", Proceedings of IEEE-ICASSP, 2021.
M. Federico, Y. Virkar, R. Enyedi, R. Barra-Chicote, "Evaluating and Optimizing Prosodic Alignment for Automatic Dubbing", Proceedings of Interspeech, 2020.
A. Anastasopoulos, A. Cattelan, Zi-Yi Dou, M. Federico, C. Federman, D. Genzel, F. Guzmán, J. Hu, M. Hughes, P. Koehn, R. Lazar, W. Lewis, G. Neubig, M. Niu, A. Öktem, E. Paquin, G. Tang, S. Tur, "TICO-19: the Translation Initiative for Covid-19", Proceedings of the EMNLP NLP Covid-19 Workshop , 2020.
E. Ansari, A. Axelrod, N. Bach, O. Bojar, R. Cattoni, F. Dalvi, N. Durrani, M. Federico, C. Federmann, J. Gu, F. Huang, K. Knight, X. Ma, A. Nagesh, M. Negri, J. Niehues, J. Pino, E. Salesky, X. Shi, S. Stüker, M. Turchi, A. Waibel, C. Wang, "Findings of the IWSLT Evaluation Campaign", Proceedings of the International Conference on Spoken Language Translation (ACL-IWSLT), pp. 1-34, 2020.
M. Federico, R. Enyedi, R. Barra-Chicote, R. Giri, U. Isik, A. Krishnaswamy, H. Sawaf, "From Speech-toSpeech Translation to Automatic Dubbing", Proceedings of the International Conference on Spoken Language Translation (ACL-IWSLT), pp. 257-264, 2020.
G. Dinu, P. Mathur, M. Federico, S. Lauly and Y. Al-Onaizan, "Joint Translation and Unit Conversion for End-to-end Localization", Proceedings of the International Conference on Spoken Language Translation (ACL-IWSLT), pp. 265-271, 2020.
D. Ataman, O. Firat, M. Di Gangi, M. Federico, A. Birch, "On the Importance of Word Boundaries in Character-level Neural Machine Translation", Proceedings of the 3rd Workshop on Neural Generation and Translation (WGNT), Hong Kong, China, 2019.
J. Niehues, R. Cattoni, S. Stüker, M. Negri, M. Turchi, T. Ha, E. Salesky, R. Sanabria, L. Barrault, L. Specia, M. Federico, "The IWSLT 2019 Evaluation Campaign", Proceedings of the 16th International Workshop on Spoken Language Translation (IWSLT 2019), pp. 2-6, Hong Kong, China, 2019.
S. M. Lakew, A. Karakanta, M. Negri, M. Federico, M. Turchi, "Adapting Neural Machine Translation to Unseen Languages", Proceedings of the 16th International Workshop on Spoken Language Translation (IWSLT 2018), Hong Kong, China, 2019.
M. Di Gangi, R. Enyedi, A. Brusadin, M. Federico, "Robust Neural Machine Translation for Clean and Noisy Speech Transcripts", Proceedings of the 16th International Workshop on Spoken Language Translation (IWSLT 2018), Hong Kong, China, 2019.
S. M. Lakew, M. Di Gangi, M. Federico, "Controlling the Ouput Length of Neural Machine Translation", Proceedings of the 16th International Workshop on Spoken Language Translation (IWSLT 2018), Hong Kong, China, 2019.
G. Dinu, P. Mathur, M. Federico, Y. Al-Onaizan, "Training Neural Machine Translation To Apply Terminology Constraints", Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), Florence, Italy, 2019.
J. Niehues, R. Cattoni, S. Stüker, M. Cettolo, M. Turchi, M. Federico, "The IWSLT 2018 Evaluation Campaign", Proceedings of the 15th International Workshop on Spoken Language Translation (IWSLT 2018), pp. 2-6, Bruges, Belgium, 2018.
L. Bentivogli, M. Cettolo, M. Federico, C. Federman, "Machine Translation Human Evaluation: an investigation of evaluation based on Post-Editing and its relation with Direct Assessment", Proceedings of the 15th International Workshop on Spoken Language Translation (IWSLT 2018), pp. 62-69, Bruges, Belgium, 2018.
S. M. Lakew, M. Federico, "Adapting Multilingual NMT to Extremely Low Resource Languages, FBK’s Participation in the Basque-English Low Resource MT Task", Proceedings of the 15th International Workshop on Spoken Language Translation (IWSLT 2018), pp. 160-166, Brussels, Belgium, 2018.
S. M. Lakew, A. Aerofeeva, M. Negri, M. Federico, M. Turchi, "Transfer Learning in Multilingual Neural Machine Translation with Dynamic Vocabulary", Proceedings of the 15th International Workshop on Spoken Language Translation (IWSLT 2018), pp. 54-62, Brussels, Belgium, 2018.
S. M. Lakew, A. Aerofeeva, M. Federico, "Neural Machine Translation into Language Varieties", Proceedings of the Third Conference on Machine Translation (WMT), pp. 156-164, Brussels, Belgium, 2018.
D. Ataman, M. Federico, "Compositional Representation of Morphologically-Rich Input for Neural Machine Translation", Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 305-311, Melbourne, Australia, 2018.
S. M. Lakew, M. Cettolo, M. Federico, "A Comparison of Transformer and Recurrent Neural Networks on Multilingual Neural Machine Translation", Proceedings of the 27th International Conference on Computational Linguistics (COLING), pp. 641-652, Santa Fe, NM, USA, 2018.
M. A. Di Gangi, M. Federico, "Deep Neural Machine Translation with Weakly-Recurrent Units", Proceedings of the 21st Annual Conference of the European Association for Machine Translation (EAMT), pp.119-128, Alicante, Spain, 2018.
M. A. Farajian, N. Bertoldi, M. Negri, M. Turchi, M. Federico, "Evaluation of Terminology Translation in Instance-Based NMT Adaptation", Proceedings of the 21st Annual Conference of the European Association for Machine Translation (EAMT), pp. 149-158, Alicante, Spain, 2018.
D. Ataman, M. A. Di Gangi, M. Federico, "Compositional Source Word Representations for Neural Machine Translation", Proceedings of the 21st Annual Conference of the European Association for Machine Translation (EAMT), pp.31-40, Alicante, Spain, 2018.
D. Ataman, M. Federico, "An Evaluation of Two Vocabulary Reduction Methods for Neural Machine Translation", Proceedings of the 13th Conference of the Association for Machine Translation in the Americas (AMTA), pp. 97-110, Boston, MA, 2018.
M. Cettolo, M. Federico, L. Bentivogli, J. Niehues, S. Stüker, K. Sudoh, K. Yoshino, C. Federman, "Overview of the IWSLT 2017 Evaluation Campaign", Proceedings of the 14th International Workshop on Spoken Language Translation, pp. 2-14, Tokyo, Japan, 2017.
S. M. Lakew, Q. F. Lotito, M. Turchi, M. Negri, M. Federico, "FBK's Multilingual Machine Translation System for IWSLT 2017", Proceedings of the 14th International Workshop on Spoken Language Translation, pp. 35-41, Tokyo, Japan, 2017.
M. A. Di Gangi, M. Federico, "Monolingual Embeddings for Low Resourced Neural Machine Translation", Proceedings of the 14th International Workshop on Spoken Language Translation, pp. 97-104, Tokyo, Japan, 2017.
S. M. Lakew, Q. F. Lotito, M. Negri, M. Turchi, M. Federico, "Improving Zero-Shot Translation for Low-Resourced Languages", Proceedings of the 14th International Workshop on Spoken Language Translation, pp. 113-119, Tokyo, Japan, 2017. (Best student paper award)
S. M. Lakew, M. A. Di Gangi, M. Federico, "Multilingual Neural Machine Translation for Low Resource Languages", Proceedings of the 4th Italian Conference on Computational Linguistics (CLIC-IT), Rome, Dec 2017.
R. Scansani, L. Bentivogli, M. Federico, "Assessing the Use of Terminology in Phrase-Based Statistical Machine Translation for Academic Course Catalogues Translation", Proceedings of the 4th Italian Conference on Computational Linguistics (CLIC-IT), Rome, Dec 2017.
M. A. Di Gangi, M. Federico, "Can Monolingual Embeddings Improve Neural Machine Translation?", Proceedings of the 4th Italian Conference on Computational Linguistics (CLIC-IT), Rome, Dec 2017.
M. A. Di Gangi, N. Bertoldi, M. Federico, "FBK's Participation to the English-to-German News Translation Task of WMT 2017", Proceedings of the 2nd Conference on Machine Translation (WMT217), pp. 271-275, Copenhagen, 2017.
M. A. Farajian, M. Turchi, M. Negri and M. Federico, "Multi-Domain Neural Machine Translation through Unsupervised Adaptation", Proceedings of the 2nd Conference on Machine Translation (WMT17), pp. 127-137, Copenhagen, 2017.
R. Chatterjee, M. Negri, M. Turchi, M. Federico, L. Specia and F. Blain, "Guiding Neural Machine Translation Decoding with External Knowledge", Proceedings of the 2nd Conference on Machine Translation (WMT17), pp. 157-168, Copenhagen, 2017.
N. Ruiz, M. A. Di Gangi, N. Bertoldi, M. Federico, "Assessing the Tolerance of Neural Machine Translation Systems Against Speech Recognition Errors", Proceedings of Interspeech, pp. 2635-2639, Stockholm, 2017.
M. Turchi, M. Negri, A. Farajian and A. Federico, "Continuous Learning from Human Post-edits for Neural Machine Translation", Proceedings of the 20th Annual Conference of the European Association for Machine Translation, Prague, 2017.
D. Ataman, M. Negri and M. Federico, "Linguistically Motivated Vocabulary Reduction for Neural Machine Translation", Proceedings of the 20th Annual Conference of the European Association for Machine Translation, Prague, 2017.
N. Bertoldi, R. Cattoni, M. Cettolo, A. F., M. Federico, D. Caroselli, L. Mastrostefano, A. Rossi, M. Trombetti, U. Germann, D. Madl, "MMT: New Open Source MT for the Translation Industry", Proceedings of the 20th Annual Conference of the European Association for Machine Translation, Prague, 2017.
M. Amin Farajian, Marco Turchi, Matteo Negri, Nicola Bertoldi and Marcello Federico, "Neural vs. Phrase-Based Machine Translation in Multi-Domain Scenario", Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Volume 2, Short Papers, Valencia, Spain, pp. 280-284. 2017.
M. Cettolo, J. Niehues, S. Stüker, L. Bentivogli, M. Federico, "The IWSLT 2016 Evaluation Campaign", Proceedings of the 13th International Workshop on Spoken Language Translation, Seattle, pp. 14, WA, 2016.
M. A. Farajian, R. Chatterjee, C. Conforti, S. Jalalvand, V. Balaraman, M. A. Di Gangi, D. Ataman, M. Turchi, M. Negri, M. Federico, "FBK's Neural Machine Translation Systems for IWSLT 2016", Proceedings of the 13th International Workshop on Spoken Language Translation, pp. 8, Seattle, WA, 2016.
L. Bechberger, M. Schimdt, A. Waibel, M. Federico, "Personalized NewsEvent Retrieval for Small Talk in Social Dialog Systems", Proc. 12th ITG Symposium on Speech Communication, pp. 1-5, Padeborn, Germany, 2016.
L. Bentivogli, A. Bisazza, M. Cettolo, M. Federico, "Neural versus Phrase-based Machine Translation Quality: A Case Study", Proceedings of Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 257-267, Austin, Texas, USA, 2016.
M. J. Sabet, M. Negri, M. Turchi, J. G. C. de Souza, M. Federico, "TMOP: A Tool for Unsupervised Translation Memory Cleaning", Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, (Demo Papers), pp. 49-54, Berlin, Germany, 2016.
L. Bentivogli, M. Federico, S. Stüker, M. Cettolo, J. Niehues, "The IWSLT Evaluation Campaign: Challenges, Achievements, Future Directions", Proceedings of the LREC 2016 Workshop "Translation Evaluation - From Fragmented Tools and Data Sets to an Integrated Ecosystem" , pp. 14-19, Portorož, Slovenia, 2016.
L. Bentivogli, M. Cettolo, M. A. Farajian, M. Federico, "WAGS: A Beautiful English-Italian Benchmark Supporting Word Alignment Evaluation of Rare Words", Proceedings of the 10th Language Resources and Evaluation Conference, pp. 3535-3542, Portorož, Slovenia, 2016.
M. Cettolo, J. Niehues, S. Stüker, L. Bentivogli, R. Cattoni, M. Federico, "The IWSLT 2015 Evaluation Campaign", Proceedings of the 12th International Workshop on Spoken Language Translation, pp. 2-14, Da Nang, Vietnam, 2015.
N. Ruiz, M. Federico, "Phonetically-Oriented Word Error Alignment for Speech Recognition Error Analysis in Speech Translation", Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp. 296-302, Scottsdale, AZ, 2015.
P. Mathur, M. Federico, S. Köprü, S. Khadivi, H. Sawaf, "Topic Adaptation for Machine Translation of E-Commerce Content", Proceedings of MT Summit XV, vol. 1: MT Researchers' Track, pp. 270-281, Miami, FL, 2015.
J. G. C. de Souza, M. Federico, H. Sawaf, "MT Quality Estimation for E-Commerce Data", Proceedings of MT Summit XV, vol. 2: MT Users' Track, pp. 20-29, Miami, FL, 2015.
N. Ruiz, Q. Gao, W. Lewis, M. Federico, "Adapting Machine Translation Models towards Misrecognized Speech with Text-to-Speech Pronunciation Rules and Acoustic Confusability", Proceedings of Interspeech, Dresden, Germany, 2015. (Best student paper award)
M. Turchi, M. Negri, M. Federico, "MT Quality Estimation for Computer-assisted Translation: Does it Really Help?", Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics, (Short Papers), pp. 530-535, Beijing, China, 2015.
M. Cettolo, J. Niehues, S. Stüker, L. Bentivogli, M. Federico, "Report on the 11th IWSLT Evaluation Campaign", Proceedings of the 11th International Workshop on Spoken Language Translation, pp. 2-16, Lake Tahoe, California, 2014.
N. Bertoldi, P. Mathur, N. Ruiz, M. Federico, "FBK's Machine Translation and Speech Translation Systems for the IWSLT 2014 Evaluation Campaign", Proceedings of the 11th International Workshop on Spoken Language Translation, pp. 42-48, Lake Tahoe, California, 2014.
M. Freitag, J. Wuebker, S. Peitz, H. Ney, M. Huck, A. Birch, N. Durrani, P. Koehn, M. Mediani, I. Slawik, J, Niehues, E. Cho, A. Waibel, N. Bertoldi, M. Cettolo, M. Federico, "Combined Spoken Language Translation", Proceedings of the 11th International Workshop on Spoken Language Translation, pp. 57-64, Lake Tahoe, California, 2014.
P. Mathur, M. Cettolo, M. Federico, J. G. C. de Souza, "Online Multi-User Adaptive Statistical Machine Translation", Proceedings of the Eleventh Conference of the Association for Machine Translation of the Americas (AMTA), pp. 152-165, Vancouver, Canada, 2014.
M. Cettolo, N. Bertoldi, M. Federico, "The Repetition Rate of Text as a Predictor of the Effectiveness of Machine Translation Adaptation", Proceedings of the Eleventh Conference of the Association for Machine Translation of the Americas (AMTA), pp. 166-179, Vancouver, Canada, 2014.
N. Ruiz and M. Federico, "Assessing the Impact of Speech Recognition Errors on Machine Translation Quality", Proceedings of the Eleventh Conference of the Association for Machine Translation of the Americas (AMTA), pp. 261-274, Vancouver, Canada, 2014.
M. Federico, M. Negri, L. Bentivogli, M. Turchi, "Assessing the Impact of Translation Errors on Machine Translation Quality with Mixed-effects Models", Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 2014.
C. Girardi, L. Bentivogli, M. A. Farajian, M. Federico,"MT-EQuAl: a Toolkit for Human Assessment of Machine Translation Output", Proceedings of the 25th International Conference on Computational Linguistics: System Demonstrations (COLING), pp. 120--123, Dublin, Ireland, 2014.
M. Federico, N. Bertoldi, M. Cettolo, M. Negri, M. Turchi, M. Trombetti, A. Cattelan, A. Farina, D. Lupinetti, A. Martines, A. Massidda, H. Schwenk, L. Barrault, F. Blain, P. Koehn, C. Buck, U. Germann, "The MATECAT TOOL", Proceedings of the 25th International Conference on Computational Linguistics: System Demonstrations (COLING), pp. 129--132, Dublin, Ireland, 2014.
N. Ruiz, M. Federico, "Complexity of Spoken versus Written Language for Machine Translation", Proceedings of the 17th Annual Conference of the European Association for Machine Translation (EAMT), pp. 173-180, Dubrovnik, Croatia, 2014.
M. A. Farajian, N. Bertoldi, M. Federico, "Online Word Alignment for Online Adaptive Machine Translation", Proceedings of the EACL 2014 Workshop on Humans and Computer-assisted Translation, pp. 84-92, Gothenburg, Sweden, 2014.
M. Cettolo, J. Niehues, S. Stüker, L. Bentivogli, M. Federico: Report on the 10th IWSLT Evaluation Campaign, Proceedings of the 10th International Workshop on Spoken Language Translation, 18 pages, Heidelberg, Germany, 2013.
N. Bertoldi, M. A. Farajian, P. Mathur, N. Ruiz, M. Federico: FBK's Machine Translation Systems for the IWSLT 2013 Evaluation Campaign, Proceedings of the 10th International Workshop on Spoken Language Translation, 8 pages, Heidelberg, Germany, 2013.
M. Freitag, S. Peitz, J. Wuebker, H. Ney, N. Durrani, M. Huck, P. Koehn, T. Ha, J. Niehues, M. Mediani, T. Herrmann, A. Waibel, N. Bertoldi, M. Cettolo, and M. Federico, "EU-BRIDGE MT: Text Translation of Talks in the EU-BRIDGE Project", Proceedings of the 10th International Workshop on Spoken Language Translation, 8 pages, Heidelberg, Germany, 2013.
N. Bertoldi, M. Cettolo, M. Federico: Cache-based online adaptation for machine translation enhanced computer assisted translation. Proceedings of the XIV Machine Translation Summit, pp.35-42, Nice, France, 2013.
K. Wäschle, P. Simianer, N. Bertoldi, S. Riezler, M. Federico: Generative and discriminative methods for online adaptation in SMT. Proceedings of the XIV Machine Translation Summit, pp.11-18, Nice, France, 2013.
M. Cettolo, N. Bertoldi, M. Federico: Project adaptation for MT-enhanced computer-assisted translation. Proceedings of the XIV Machine Translation Summit, pp.27-34, Nice, France, 2013.
M. Cettolo, C. Servan, N. Bertoldi, M. Federico, Loïc Barrault, Holger Schwenk: Issues in incremental adaptation of statistical MT from human post-edits. Proceedings of MT Summit XIV Workshop on Post-editing Technology and Practice (WPTP-2), pp. 111-118, Nice, France, 2013.
A. Bisazza, A. Federico: Efficient solutions for word reordering in German-English phrase-based statistical machine translation. Proceedings of the 8th Workshop on Statistical Machine Translation, pp. 440-451, Sofia, Bulgaria, 2013.
P. Mathur, M. Cettolo, M. Federico: Online learning approaches in computer assisted translation. Proceedings of the 8th Workshop on Statistical Machine Translation, pp.301-308, Sofia, Bulgaria, 2013.
Marco Turchi, Matteo Negri, Marcello Federico: Coping with the subjectivity of human judgements in MT quality estimation. Proceedings of the 8th Workshop on Statistical Machine Translation, pp. 240-251, Sofia, Bulgaria, 2013.
M. Federico, A. Cattelan, M. Trombetti. “Measuring user productivity in machine translation enhanced computer assisted translation”. Proceedings of the Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA), San Diego, CA, October 28 – November 1, 2012.
A. Bisazza, M. Federico. “Modified Distortion Matrices for Phrase-Based Statistical Machine Translation”, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL), pp 478-487, Jeju, Korea, 2012.
Y. Mehdad, M. Negri, M. Federico. “Detecting Semantic Equivalence and Information Disparity in Cross-lingual Documents”, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL), pp 120-124, Jeju, Korea, 2012.
A. Bisazza, M. Federico. “Cutting the Long Tail: Hybrid Language Models for Translation Style Adaptation”, Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics (EACL), pp 439-448, Avignon, France, 2012.
M. Federico, S. Stüker, L. Bentivogli, M. Paul, M. Cettolo, T. Herrmann, J Niehues, G. Moretti, “The IWSLT 2011 evaluation campaign on automatic talk translation”. Proceedings of the 8th international conference on Language Resources and Evaluation (LREC), pp.3543-3540, Istanbul, Turkey, 21-27 May 2012.
M Cettolo, C. Girardi, M Federico. “WIT3: web inventory of transcribed and translated talks”, Proceedings of the 16th Annual Conference of the European Association for Machine Translation (EAMT), pp. 261-268, Trento, Italy, May 28-30, 2012.
M. Paul, E. Sumita, L. Bentivogli, M. Federico. “Crowd-based MT evaluation for non-English target languages”, Proceedings of the 16th Annual Conference of the European Association for Machine Translation, pp. 228-237, Trento, Italy, May 28-30 2012.
M. Federico, M. Cettolo, L. Bentivogli, M. Paul, S. Stüker, “Overview of the IWSLT 2012 evaluation campaign”, Proceedings of the 9th International Workshop on Spoken Language Translation, pp. 12-33, Hong Kong, China, 6-7 December, 2012.
N. Ruiz, M. Federico, “MDI adaptation for the lazy: avoiding normalization in LM adaptation for lecture translation”. Proceedings of the 9th International Workshop on Spoken Language Translation, pp. 244-251, Hong Kong, China, 6-7 December, 2012.
N.Ruiz, A.Bisazza, R.Cattoni, M.Federico. “FBK’s machine translation systems for IWSLT 2012’s TED lectures”, Proceedings of the 9th International Workshop on Spoken Language Translation, pp.61-68, Hong Kong, China, 6-7 December, 2012.
N. Bertoldi, M. Cettolo, M. Federico, C. Buck “Evaluating the learning curve of domain adaptive statistical machine translation systems”, Proceedings of the 7th Workshop on Statistical Machine Translation (WMT), pp. 433-441, Montreal, Canada, June 7-8, 2012.
Y. Mehdad, M. Negri, M. Federico, “Match without a referee: evaluating MT adequacy without reference translations”, Proceedings of the 7th Workshop on Statistical Machine Translation (WMT), pp. 171-181, Montreal, Canada, June 7-8, 2012.
Bisazza, N. Ruiz, M. Federico, “Fill-up versus Interpolation Methods for Phrase-based SMT Adaptation”, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), pp. 136-143. San Francisco, California, 8-9 December, 2011. (Best student paper award.)
M. Federico, L. Bentivogli, M. Paul, S. Stüker, “Overview of the IWSLT 2011 evaluation campaign”, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), pp. 11-27, San Francisco, California, 8-9 December 2011.
N. Ruiz, A. Bisazza, F. Brugnara, D. Falavigna, D. Giuliani, S. Jaber, R. Gretter, M. Federico, “FBK @ IWSLT 2011”, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), pp. 86-93, San Francisco, California, 8-9 December 2011.
N. Ruiz, M. Federico, “Topic Adaptation for Lecture Translation through Bilingual Latent Semantic Models”, Proceedings of the 6th Workshop on Statistical Machine Translation, pp. 294-302, Edinburgh, UK, 2011.
C. Hardmeier, J. Tiedemann, M. Saers, M. Federico, M. Prashant, “The Uppsala-FBK systems at WMT 2011”, Proceedings of the 6th Workshop on Statistical Machine Translation, pp. 372-378, Edinburgh, UK, 2011.
M. Cettolo, N. Bertoldi, M. Federico, “Methods for Smoothing the Optimizer Instability in SMT”, Proceedings of the Machine Translation Summit XIII, Xiamen, China, 2011.
L. Bentivogli, M. Federico, G. Moretti and M. Paul, “Getting Expert Quality from the Crowd for Machine Translation Evaluation,” Proceedings of the Machine Translation Summit XIII, Xiamen, China, 2011.
F. Brugnara, D. Falavigna, M. Federico, C. Girardi, D. Giuliani, R. Gretter, “NeMo: a Platform for Multilingual News Monitoring”, Proceedings of the 12thAnnual Conference of the International Speech Communication Association (Interspeech), Florence, Italy, 27-31 August, 2011.
Y. Mehdad, M. Negri, M. Federico, “Using Bilingual Parallel Corpora for Cross-Lingual Textual Entailment”, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 1336-1345, Portland, Oregon, 2011.
N. Bertoldi, M. Cettolo, M. Federico, “Bootstrapping Arabic-Italian SMT through comparable texts and pivot translation”, Proceedings of the 15th Conference of the European Association for Machine Translation (EAMT), Leuven, Belgium, 2011.
M. Paul, M. Federico, S. Stüker, “Overview of the IWSLT 2010 Evaluation Campaign“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), Paris, France, 2-3 December, 2010.
C. Hardmeier, M. Federico, “Modelling Pronominal Anaphora in Statistical Machine Translation“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), Paris, France, 2-3 December 2010.
M. Cettolo, M. Federico, N. Bertoldi, “Mining Parallel Fragments from Comparable Texts“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), Paris, France, 2-3 December 2010.
Bisazza, I. Klasinas, M. Cettolo, “FBK @ IWSLT 2010“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), Paris, France, 2-3 December 2010.
Bisazza, M. Federico, “Chunk-based Verb Reordering for Arabic-English Statistical Machine Translation“, Proceedings of the 5th Workshop on Statistical Machine Translation, Uppsala, Sweden, 2010.
Hardmeier, A. Bisazza, M. Federico, “FBK at WMT 2010: Word Lattices for Morphological Reduction and Chunk-based Reordering“, Proceedings of the 5th Workshop on Statistical Machine Translation, Uppsala, Sweden, 2010.
N. Bertoldi, M. Cettolo, M. Federico, “Statistical Machine Translation of Texts with Misspelled Words“, Proceedings of the Annual Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), pp. 412-419, Los Angeles, CA, 2010.
Y. Mehdad, M. Negri, M. Federico, “Towards Cross-Lingual Textual Entailment“, Proceedings of the Annual Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), pp. 321-324, Los Angeles, CA, 2010
M. Gerosa, M. Federico, “Coping with out-of-vocabulary words: open versus huge vocabulary ASR“, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan, 2009.
G. Sanchis-Trilles, M. Cettolo, N. Bertoldi and M. Federico. "Online LM Adaptation for Spoken Dialog Translation", In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), Tokyo, Japan, 1-2 December 2009.
N. Bertoldi, A. Bisazza, M. Cettolo, M. Federico and G. Sanchis-Trilles. "FBK @ IWSLT 2009", In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), Tokyo, Japan, 1-2 December 2009.
Bisazza, M. Federico. "Morphological Pre-Processing for Turkish to English Statistical Machine Translation". In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), Tokyo, Japan, 1-2 December 2009.
N. Bertoldi, M. Federico, “Domain Adaptation for Statistical Machine Translation with Monolingual Resources“, Proceedings of the 4th Workshop on Statistical Machine Translation, pp. 182-189, Athens, Greece, 2009.
M. Cettolo, M. Federico, D. Pighin and N. Bertoldi. "Shallow-Syntax Phrase-Based Translation: Joint versus Factored String-to-Chunk Models". In Proceedings of AMTA 2008, Honolulu, Hawaii, 2008.
N. Bertoldi, M. Federico, D. Falavigna and M. Gerosa. "Fast Speech Decoding through Phone Confusion Networks". In Proceedings of Interspeech, Brisbane, Australia, 2008.
M. Federico, N. Bertoldi and M. Cettolo. "IRSTLM: an Open Source Toolkit for Handling Large Scale Language Models". In Proceedings of Interspeech, pp. 1618-1621, Brisbane, Australia, 2008.
N. Bertoldi, M. Barbaiani, M. Federico. R. Cattoni, “Phrase-Based Statistical Machine Translation with Pivot Languages“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), Honolulu, HW, October 2008.
N. Bertoldi, M. Federico, R. Cattoni, M. Barbaiani, “FBK @ IWSLT 2008“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), Honolulu, HW, October 2008.
R. Cattoni, N. Bertoldi and M. Federico, “Punctuating Confusion Networks for Speech Translation“, In Proceedings of Interspeech - European Conference on Speech Communication and Technology, Antwerp, Belgium 2007.
B. Chen, M. Federico and M. Cettolo. “Better N-best Translations through Generative n-gram Language Models“, In Proceedings of the 11th Machine Translation Summit Conference, Copenhagen, Denmark, 2007.
D. Gupta, M. Cettolo and M. Federico, “POS-based Reordering Models for Statistical Machine Translation“, In Proceedings of the 11th Machine Translation Summit Conference, Copenhagen, Denmark, 2007.
D. Falavigna, N. Bertoldi, F. Brugnara, R. Cattoni, M. Cettolo, B. Chen, M. Federico, D. Giuliani, R. Gretter, D. Gupta and D. Seppi. “The IRST English-Spanish Translation System for European Parliament Speeches“. In Proceedings of Interspeech - European Conference on Speech Communication and Technology, Antwerp, Belgium 2007.
P. Koehn, H. Hoang, A. Birch, C. Callison-Burch, M. Federico, N. Bertoldi, B. Cowan, W. Shen, C. Moran, R. Zens, C. Dyer, O. Bojar, A. Constantin, and E. Herbst. “Moses: Open Source Toolkit for Statistical Machine Translation“, In Proceedings of ACL – Posters and Demos, pp. 177-180, Prague, Czech Republic, 2007.
N. Bertoldi, R. Zens, M. Federico, “Speech Translation by Confusion Network Decoding“, In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Honolulu, Hawaii, 2007.
N. Bertoldi, M. Cettolo, R. Cattoni, M. Federico,”FBK @ IWSLT 2007“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), Trento, Italy, October, 2007.
M. Federico, M. Cettolo, “Efficient Handling of N-gram Language Models for Statistical Machine Translation“, In Proceedings of the ACL Workshop on Statistical Machine Translation, pp. 88-95, Prague, Czech Republic, 2007.
M. Cettolo, M. Federico. “Text Segmentation Criteria for Statistical Machine Translation“. In Proceedings of the 5th International Conference on Natural Language Processing (FinTal). pp. 664-673. Springer Verlag, LNCS, Turku, Finland, 2006.
B. Chen, M. Federico. “Improving Phrase-Based Statistical Translation through Combination of Word Alignment.”. In Proceedings of the 5th International Conference on Natural Language Processing (FinTal). pp. 356-367. Springer Verlag, LNCS, Turku, Finland, 2006.
A. de Gispert, D. Gupta, M. Popovic, P. Lambert, J. Mariño, M. Federico, H. Ney, R. Banchs. “Improving Statistical Word Alignment with Morpho-syntactic Transformations“. In Proceedings of the 5th International Conference on Natural Language Processing (FinTal). Springer Verlag, LNCS, Turku, Finland, 2006.
R. Cattoni, N. Bertoldi, M. Cettolo, B. Chen, M. Federico. “A Web-based Interface to a Multi-lingual Phrase-based Translation System“. In Proceedings of European Conference of Artificial Intelligence (ECAI), Systems Demonstrations. Riva del Garda, Italy, 2006.
D. Gupta, M. Federico. “Exploiting word transformation in statistical machine translation from Spanish to English“. In Proceedings of the 11th Annual Conference of the European Association for Machine Translation (EAMT), pp. 75-80, Oslo, Norway 2006.
R. Cattoni, N. Bertoldi, M. Cettolo, B. Chen, M. Federico. “A Web-based Demonstrator of a Multi-lingual Phrase-based Translation System“. In Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Posters and Demonstrations. pp. 91-94. 2006.
Chen, M. Cettolo, M. Federico, “Reordering Rules from Phrase-Based Statistical Machine Translation“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), pp. 182-189, Kyoto, Japan, 2006.
W. Shen, R. Zens, N. Bertoldi, M. Federico, “The JHU Workshop 2006 IWSLT System“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), pp. 59-63, Kyoto, Japan, 2006.
B. Chen, R. Cattoni, N. Bertoldi, M. Cettolo, M. Federico, “The ITC-irst SMT System for IWSLT 2006“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), pp. 53-58, Kyoto, Japan, 2006.
M. Federico, N. Bertoldi, R. Cattoni, M. Cettolo, B. Chen, “ITC-irst’s System for the NIST 2006 MT Evaluation”, In Working Notes of the NIST MT Workshop, Washington, DC, 6-7 September, 2006.
M. Popovic, A. De Gispert, D. Gupta, P. Lambert, J. Marino, M Federico, H. Ney, R. Banchs. “Morpho-syntactic Information for Automatic Error Analysis of Statistical Machine Translation Output“. In Proceedings of the HLT-NAACL Workshop on Statistical Machine Translation, pp. 1-6, New York City, NY, 2006.
M. Federico and N. Bertoldi, “How Many Bits are Needed to Store Probabilities for Phrase-based Translation?“, In Proceedings of the HLT-NAACL Workshop on Statistical Machine Translation, pp. 94-101, New York City, NY, 2006.
V. Sandrini, M. Federico, M. Cettolo. “Maximum Entropy Tagging with Binary and Real-valued Features“, In Proceedings of the EACL Workshop on Learning Structured Information in Natural Language Applications, pp. 1-8, Trento, Italy, 2006.
N. Bertoldi, R. Cattoni, M. Cettolo, B. Chen, M. Federico, “ITC-irst at the 2006 TC-STAR SLT Evaluation Campaign“. In Proceedings of the TC-STAR Workshop on Speech-to-Speech Translation. pp. 18-24, Barcelona, Spain, 2006.
M. Cettolo, M. Federico, N. Bertoldi, R. Cattoni and B. Chen "A look inside the ITC-irst SMT System“, In Proceedings of the 10th Machine Translation Summit Conference, pp. 451-457, Phuket, Thailand, 2005.
V. H. Quan, M. Federico, M. Cettolo, “Integrated N-best Re-ranking for Spoken Language Translation“, In Proceedings of Interspeech - European Conference on Speech Communication and Technology, Lisbon, Portugal, 2005.
N. Bertoldi, M. Federico, "A New Decoder for Spoken Language Translation based on Confusion Networks", In Proceedings of the Automatic Speech Recognition and Understanding Workshop (ASRU), Cancun, Mexico, 2005.
N. Bertoldi, R. Cattoni, M. Cettolo, B. Chen, M. Federico, "The ITC-irst SMT System for IWSLT-2005", In Proceedings of the International Workshop Spoken Language Translation (IWSLT), Pittsburgh, PA, October 2005.
M. Federico, N. Bertoldi, R. Cattoni, M. Cettolo, B. Chen, “Chinese-English MT at ITC-irst”, In Working Notes of the NIST MT Workshop, Bethesda, MD, 20-21 June, 2005.
M. Federico, N. Bertoldi, G.-A. Levow, G. J. F. Jones:, “CLEF 2004 Cross-Language Spoken Document Retrieval Track”, In C. Peters, P. Clough, J. Gonzalo, G. J. F. Jones, M. Kluck, B. Magnini (Eds.): Multilingual Information Access for Text, Speech and Images, LNCS, pp. 816-820, Springer Verlag, 2005.
M. Cettolo, F. Brugnara, M. Federico, “Advances in the Automatic Transcription of Lectures“, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Montreal, Canada, 2004.
M. Paul, H. Nakaiwa, M. Federico, “Towards Innovative Evaluation Methodologies for Speech Translation“, Proceedings of the NTCIR-4 2004, Tokyo, Japan, 2004.
Yasuhiro, M. Federico, N. Kando, H. Nakaiwa, M. Paul, J. Tsujii, “Overview of the IWSLT04 Evaluation Campaign“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), pp. 1-112. September, Kyoto, Japan, 2004.
M. Cettolo, M. Federico, “Minimum Error Training of Log-Linear Translation Models“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), pp. 103-106. September, Kyoto, Japan, 2004.
N. Bertoldi, R. Cattoni, M. Cettolo, M. Federico, “The ITC-irst Statistical Machine Translation System for IWSLT-2004“, In Proceedings of the International Workshop on Spoken Language Translation (IWSLT), pp. 51-58. September. Kyoto, Japan, 2004.
M. Federico, G. F. Jones, “The CLEF 2003 Cross-Language Spoken Document Retrieval Track”, In Peters, C.; Gonzalo, J.; Braschler, M.; Kluck, M. (eds.), Comparative Evaluation of Multilingual Information Access Systems, LNCS, pp. 646-652, Springer Verlag, 2004.
N. Bertoldi, M. Federico, “ITC-irst at CLEF 2003: Cross-Language Spoken Document Retrieval”, In Peters, C.; Gonzalo, J.; Braschler, M.; Kluck, M. (eds.), Comparative Evaluation of Multilingual Information Access Systems, LNCS, pp. 672-675, Springer Verlag, 2004.
N. Bertoldi, M. Federico, “ITC-irst at CLEF 2003: Monolingual, Bilingual, and Multilingual Information Retrieval”, In Peters, C.; Gonzalo, J.; Braschler, M.; Kluck, M. (eds.), Comparative Evaluation of Multilingual Information Access Systems, LNCS, pp. 140-151, Springer Verlag, 2004.
M. Federico, “Evaluation Frameworks for Speech Translation Technologies“, In Proceedings of the European Conference on Speech Communication and Technology, Ginevra, Switzerland, 2003.
N. Bertoldi, F. Brugnara, M. Cettolo, M. Federico, D. Giuliani, E. Leeuwis, V. Sandrini, "The ITC-irst News on Demand Platform", Proceedings of the European Conference on Information Retrieval Research, Pisa, 2003.
V. Sandrini, M. Federico, "Spoken Information Extraction from Italian Broadcast News", Proceedings of the European Conference on Information Retrieval Research, Pisa, Italy 2003.
E. Leeuwis, M. Federico, and M. Cettolo, "Language Modeling and Transcription of TED Corpus Lectures", Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 232-235, Honk Kong, China, 2003.
G. F. Jones, M. Federico, "CLEF 2002 Cross-Language Spoken Document Retrieval Pilot Track Report", in C. Peters, M. Braschler, J. Gonzalo, M. Kluck (Ed.), Advances in Cross-Language Information Retrieval, pp. 446-457, LNCS 2785, Springer, Berlin, 2003.
N. Bertoldi, M. Federico, "ITC-irst at CLEF 2002: Using N-best Query Translations for CLIR", in C. Peters, M. Braschler, J. Gonzalo, M. Kluck (Ed.), Advances in Cross-Language Information Retrieval, pp. 49-58, LNCS 2785, Springer, Berlin, 2003.
N. Bertoldi, M. Federico, "Cross-Language Spoken Document Retrieval on the TREC SDR Collection", in C. Peters, M. Braschler, J. Gonzalo, M. Kluck (Ed.), Advances in Cross-Language Information Retrieval, pp. 476-481, LNCS 2785, Springer, Berlin, 2003.
N. Bertoldi, M. Federico, "ITC-irst at CLEF 2001: Monolingual and Bilingual Tracks", in C. Peters, M. Braschler, J. Gonzalo, M. Kluck (Ed.), Evaluation of Cross-Language Information Retrieval Systems, pp. 94-101, LNCS 2406, Springer, Berlin, 2002.
F. Brugnara, M. Cettolo, M. Federico, D. Giuliani, "Issues in Automatic Transcription of Historical Audio Data", In Proceedings of the International Conference on Spoken Language Processing, Denver, CO, 2002.
M. Federico, N. Bertoldi, V. Sandrini, “Bootstrapping Named Entity Recognition for Italian Broadcast News“, In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’) , Philadelphia, PA, July, 2002.
M. Federico, N. Bertoldi, “Cross-Language Information Retrieval through N-best Query Translations“, In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Tampere, Finland, August, 2002.
M. Federico, “LM Adaptation through Topic Decomposition and MDI Estimation“, In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, FL, May, 2002.
M. Federico, N. Bertoldi, "Broadcast News LM Adaptation using Contemporary Texts", In Proceedings of the European Conference on Speech Communication and Technology, Aarlborg, Denmark, 2001.
N. Bertoldi, M. Cettolo, F. Brugnara, M. Federico and D. Giuliani, "From Broadcast News to Spontaneous Dialogue Transcription: Portability Issues". In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Salt Lake City, UT, 2001.
R. Cattoni, M. Federico, and A. Lavie, “Robust Analysis of Spoken Input combining Statistical and Knowledge-based Information Sources“, In Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Madonna di Campiglio, Italy, 2001.
Giuliani, M. Federico, "Unsupervised Language and Acoustic Model Adaptation for Cross Domain Portability", In Proceedings of the ISCA ITR Workshop on Adaptation Methods, Sophia-Antipolis, France, 2001.
N. Bertoldi, M. Federico, "Lexical Adaptation for Broadcast News Transcription", In Proceedings of the ISCA ITR Workshop on Adaptation Methods, Sophia-Antipolis, France, 2001.
N. Bertoldi, M. Federico, "ITC-irst at CLEF 2000: Italian Monolingual Track", in C. Peters (Ed.), Cross-Language Information Retrieval and Evaluation, pp. 261-272, LNCS 2069, Springer, Berlin, 2001.
Cettolo, F. Brugnara, M. Federico and D. Giuliani, "Advances in Automatic Transcription of Italian Broadcast News". In Proceedings of the International Conference on Spoken Language Processing, Beijing, China, 2000.
M. Federico, D. Giordani, and P. Coletti, "Development and evaluation of an Italian broadcast news corpus", In Proceedings of the International Conference on Language Resources and Evaluation, Athens, Greece, 2000.
M. Cettolo, F. Brugnara, M. Federico and D. Giuliani, "A baseline for the transcription of Italian broadcast news". In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Istanbul, Turkey, 2000.
M. Cettolo, F. Brugnara, M. Federico and D. Giuliani, "A system for the segmentation and transcription of Italian radio news". In Proceedings of RIAO Content-Based Multimedia Information Access, Paris, France, 2000.
N. Bertoldi and M. Federico, "Italian Text Retrieval for CLEF 2000 at ITC-irst", Workshop on Cross-Language Information Retrieval and Evaluation, Lisbon, Portugal, 2000.
M. Cettolo and M. Federico, "Model Selection Criteria for Acoustic Segmentation", In Proceedings of the ISCA ITR Workshop ASR 2000, Paris, France, 2000.
M. Federico, F. Brugnara, and R. Gretter, "Usability Field-test of a Spoken Data-entry System". In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Phoenix, AZ, 1999.
M. Federico, "Efficient language model adaptation through MDI estimation". In Proceedings of the European Conference on Speech Communication and Technology, Budapest, Hungary, 1999.
M. Federico and P. Coletti, "A two-stage speech recognition method for information retrieval applications". In Proceedings of the European Conference on Speech Communication and Technology, Budapest, Hungary, 1999.
M. Federico, "Usability evaluation of a spoken data-entry interface". In Proceedings of the IEEE International Conference on Multimedia Computing and Systems (ICMCS), Firenze, Italy , 1999.
M. Cettolo, F. Brugnara, M. Federico and D. Giuliani, "A baseline for the transcription of Italian broadcast news". In Proceedings Workshop Annuale della Associazione Italiana di Intelligenza Artificiale, Trento, Italy,1999.
M. Federico, "In-the-field Evaluation of a speech based data-entry system". In Proceedings of the International Workshop Speech and Computer (SPECOM), pp. 67-72, San Petersburg, Russia. 1998
F. Brugnara and M. Federico, "Dynamic Language Models for Interactive Speech Applications". In Proceedings of the European Conference on Speech Communication and Technology, Rhodes, Greece. 1997.
U. Ackermann, B. Angelini, F. Brugnara, M. Federico, D. Giuliani, R. Gretter, and H. Niemann, "SpeeData: a Prototype for Multilingual Spoken Data-entry". In Proceedings of the European Conference on Speech Communication and Technology, Rhodes, Greece, 1997.
F. Brugnara and M. Federico, "Techniques for Approximating a Trigram Language Model". In Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, 1996.
M. Federico, "Bayesian estimation methods for n-gram language model adaptation". In Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, 1996.
U. Ackermann, B. Angelini, F. Brugnara, M. Federico, D. Giuliani, R. Gretter, G. Lazzari, and H. Niemann, "SpeeData: Multilingual Spoken Data Entry". ". In Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, 1996.
U. Ackermann, F. Brugnara, M. Federico, and H. Niemann, "Application of Speech Technology in the Multilingual Speedata Project". In Proceedings of the CRIM-FORWISS Workshop, Montreal, Canada, 1996.
M. Federico and F. Vernesoni, "A speech understanding architecture for an information query system". In Proceedings of the European Conference on Speech Communication and Technology, Madrid, Spain, 1995
G. Antoniol, F. Brugnara, M. Cettolo and M. Federico, "Language Model Representations for Beam-Search Decoding". In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Detroit, MI, 1995. [IEEE]
B. Angelini, F. Brugnara, D. Falavigna, M. Federico, D. Giuliani, M. Omologo, and P. Svaizer. "A Hands Free Dictation System for the Italian Language''. In Proceedings of the IEEE Workshop on Automatic Speech Recognition, Snowbird, Utah, 1995.
G. Antoniol, F. Brugnara, M. Cettolo and M. Federico, "Language Model Estimations and Representations for Real-Time Continuous Speech Recognition“. In Proceedings of the International Conference on Spoken Language Processing, Yokohama, Japan, 1994.
B. Angelini, G. Antoniol, F. Brugnara, M. Cettolo, M. Federico, R. Fiutem and G. Lazzari, "Radiological Reporting by Speech Recognition: the A.Re.S. System“. In Proceedings of the International Conference on Spoken Language Processing, Yokohama, Japan, 1994.
G. Antoniol, M. Cettolo and M. Federico. "Robust and Reliable Speech Understanding in Restricted Domains''. In Proceedings of the IEEE Workshop on Automatic Speech Recognition, Snowbird, Utah, 1993.
M. Federico. "Stacked Estimation of Interpolated N-gram Language Models''. In Proceedings of the IEEE Workshop on Automatic Speech Recognition, Snowbird, Utah, 1993.
G. Antoniol, R. Cattoni, M. Cettolo and M. Federico, "Robust Speech Understanding for Robot Telecontrol". In Proceedings of the International Conference on Advanced Robotics, Tokyo, Japan, 1993.
G. Antoniol, M. Cettolo and M. Federico. "Techniques for Robust Recognition in Restricted Domains". In Proceedings of the European Conference on Speech Communication and Technology, Berlin, Germany, 1993.
Book contributions
M. Federico, “La traduzione automatica: sviluppi recenti e prospettive”, In E. Ioratti (Ed.): La traduzione del diritto comunitario ed europeo: riflessioni metodologiche, Quaderni del Dipartimento di scienze giuridiche dell'Università degli studi di Trento, Vol. 63, p. 235-258, 2006.
R. DeMori and M. Federico, "Language Model Adaptation". In K. Pointing (ed.), Computational Models of Speech Pattern Processing, pp. 280-301, NATO ASI Series, Springer Verlag, 1999.
M. Federico and R. DeMori, "Language Modelling". In Renato De Mori (ed.), Spoken Dialogues with Computers, chapter 7, Academic Press, London, 1998.
U. Ackermann and M. Federico, "Bilinguale Sprachverarbeitung zur Dateneingabe mit SpeeData - Ansätze und erste Ergebnisse". In K. Fellbaum (ed.), Elektronische Sprachsignalverarbeitung, Studientexte zur Sprachkommunikation, BTU Cottbus, Germany, 1997.
G. Antoniol, F. Brugnara, M. Cettolo and M. Federico, "Language Models Comparison in a Robot Telecontrol Application". In A.J. Rubio Ayuso and J.M. Lopez Soler (eds.), Speech Recognition and Coding: New Advances and Trends, NATO ASI Series, Springer Verlag, 1995.
A. Corazza, M. Federico, R. Gretter and G.Lazzari, "Design and Acquisition of a Task-Oriented Spontaneous-Speech Data Base", In V. Roberto (ed.), Intelligent Perceptual Systems, Lecture Notes in Artificial Intelligence, Springer Verlag, 1993.
Proceedings
E. Salesky, M. Federico, M. Costa-jussà (Eds), Proceedings of the 19th International Conference on Spoken Language Translation, ACL 2022.
M. Federico, A. Waibel, M. R. Costa-jussà, J. Niehues, S. Stüker, E. Salesky (Eds), Proceedings of the 18th International Conference on Spoken Language Translation, ACL 2021.
M. Federico, A. Waibel, K. Knight, S. Nakamura, H. Ney, J. Niehues, S. Stüker, D. Wu, J. Mariani, F. Yvon (Eds), Proceedings of the 17th International Conference on Spoken Language Translation, ACL 2020.
J. E. Ortega, M. Federico, C. Orasan, M. Popović, Proceedings of the 1st Workshop on Post-Editing in Modern-Day Translation, AMTA, 2020.
M. Turchi, J. Niehues, M. Federico (Eds.), Proceedings of the 15th International Workshop on Spoken Language Translation, Bruges, Belgium 2018.
C. Orasan, C. Parra, E. Barbu, M. Federico (Eds), Proceedings of the 2nd Workshop on Natural Language Processing for Translation Memories (NLP4TM), Portorož, Slovenia, 2016.
G. Rehm, et al. (Eds), Proceedings of the LREC 2016 Workshop "Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem", Portorož, Slovenia, 2016.
M. Federico, S. Stüker, J. Niehues (Eds.), Proceedings of the International Workshop on Spoken Language Translation, Da Nang, Vietnam, 2015.
M. Federico, S. Stüker, F. Yvon (Eds.), Proceedings of the International Workshop on Spoken Language Translation, Lake Tahoe, California, 2014.
F. Casacuberta, M. Federico, P. Koehn (Eds), Proceedings of the AMTA 2014 Workshop on Interactive and Adaptive Machine Translation, Vancouver, Canada, 2014.
M. Cettolo, M. Federico, L. Specia, A. Way (Eds), Proceedings of the Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, 2012
M. Federico, M.-Y. Hwang, M. Rödder, S. Stüker (Eds.), Proocedings of the InternationalWorkshop on Spoken Language Translation, San Francisco, California, 2011.
M. Federico, I. Lane, M. Paul, F. Yvon, J. Mariani (Eds.), Proceedings of the International Workshop on Spoken Language Translation, Paris, France, 2010.
M. Cettolo, M. Federico (Eds.), Proceedings of the International Workshop on Spoken LanguageTranslation, Trento, Italy, 2007.
Patents
M. Federico, et al. “Multilingual Speech Translation with Adaptive Speech Synthesis and Adaptive Physiognomy”, US Patent No. 11,545,134, 2023.
M Federico, et al. “Systems, methods, and apparatuses for controlling output length in neural machine translation”, US Patent No. 11,295,081, 2022.
C. Hoang, et al. “Fast adaptation with fuzzy match augmented translation”, P76883-US01, 2022. (Pending)
A. Upadhyay, et al. “System and Methods for real time dynamic adaptation for Neural Machine Translation”, P72108-US01, 2020 (Pending)
N. Bertoldi, D. Caroselli, M. A. Farajian, M. Federico, M. Negri, M. Trombetti, M. Turchi, "Translation System and Method", US Patent No. 11,036,940 B2, 2021.
F. Brugnara, M. Federico. "A system and method for automatic speech recognition", EP0903727A1, 1997
G. Antoniol, F. Brugnara, M. Cettolo, M. Federico. "Continuous Speech Recognition System", IT1279171B1, 1995.
Reviews
Cross-Language Information Retrieval, Jian-Yun Nie (University of Montreal) San Rafael, CA: Morgan & Claypool (Synthesis Lectures on Human Language Technologies, edited by Graeme Hirst, volume 8), 2010. Marcello Federico, Computational Linguistics, Vol. 37, No. 2, Pages 411-412, June 2011. (pdf)
Mathematical Foundations of Speech and Language Processing , Mark Johnson, Sanjeev P. Khudanpur, Mari Ostendorf and Roni Rosenfeld (eds). Springer Verlag, 2004. Marcello Federico, Machine Translation, Volume 20, Number 3, pp. 223-225, 2006.
My First Machine Learning Paper
M. Federico, S. Messelodi and L. Stringa, Optimum Classifier Approximation for Network-Based Handwritten Recognition, ITC-irst Technical Report #9201-16, January 1992