In 2015, I was asked to design a postgraduate course on machine translation (MT) and post-editing. Following a preliminary theoretical part, the module concentrated on the building and practical use of custom machine translation (CMT) engines. This was a particularly ambitious proposition since it was not certain that students with undergraduate degrees in languages, translation and interpreting, without particular knowledge of computer science or computational linguistics, would succeed in assembling the necessary corpora and building a CMT engine. This paper looks at how the task was successfully achieved using KantanMT to build the CMT engines and Wordfast Anywhere to convert and align the training data. The course was clearly a success since all students were able to train a working CMT engine and assess its output. The majority agreed their raw CMT engine output was better than Google Translate’s for the kinds of text it was trained for, and better than the raw output (pre-translation) from a translation memory tool. There was some initial scepticism among the students regarding the effective usefulness of MT, but the mood clearly changed at the end of the course with virtually all students agreeing that post-edited MT has a legitimate role to play.

Building a Custom Machine Translation Engine as part of a Postgraduate University Course: a Case Study, 2017-11-17.

Building a Custom Machine Translation Engine as part of a Postgraduate University Course: a Case Study

Farrell, Michael
2017-11-17

Abstract

In 2015, I was asked to design a postgraduate course on machine translation (MT) and post-editing. Following a preliminary theoretical part, the module concentrated on the building and practical use of custom machine translation (CMT) engines. This was a particularly ambitious proposition since it was not certain that students with undergraduate degrees in languages, translation and interpreting, without particular knowledge of computer science or computational linguistics, would succeed in assembling the necessary corpora and building a CMT engine. This paper looks at how the task was successfully achieved using KantanMT to build the CMT engines and Wordfast Anywhere to convert and align the training data. The course was clearly a success since all students were able to train a working CMT engine and assess its output. The majority agreed their raw CMT engine output was better than Google Translate’s for the kinds of text it was trained for, and better than the raw output (pre-translation) from a translation memory tool. There was some initial scepticism among the students regarding the effective usefulness of MT, but the mood clearly changed at the end of the course with virtually all students agreeing that post-edited MT has a legitimate role to play.
Inglese
https://www.researchgate.net/publication/363281353_Building_a_Custom_Machine_Translation_Engine_as_part_of_a_Postgraduate_University_Course_a_Case_Study
Translating and the Computer
39
London
2017
internazionale
Proceedings of the 39th Conference Translating and the Computer
35
39
978-2-9701095-3-2
Online
Settore INF/01 - Informatica
1
File in questo prodotto:
File Dimensione Formato  
Building-a-Custom-Machine-Translation-Engine-as-part-of-a-Postgraduate-University-Course-a-Case-Study.pdf

Open Access

Dimensione 197.94 kB
Formato Adobe PDF
197.94 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10808/47327
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • ???jsp.display-item.citation.isi??? ND
social impact