Preliminary evaluation of ChatGPT as a machine translation engine and as an automatic post-editor of raw machine translation output from other machine translation engines
Michael Farrell
Writing – Review & Editing
2023-07-09
Abstract
This preliminary study consisted of two experiments. The first aimed to gauge the translation quality obtained from the free-plan version of ChatGPT in comparison with the free versions of DeepL Translator and Google Translate through human evaluation, and the second consisted of using the free-plan version of ChatGPT as an automatic post-editor of raw output from the pay-for version of DeepL Translator (both monolingual and bilingual full machine translation post-editing). The experiments were limited to a single language pair (from English to Italian) and only one text genre (Wikipedia articles). In the first experiment, DeepL Translator was judged to have performed best, Google Translate came second, and ChatGPT, last. In the second experiment, the free-plan version of ChatGPT equalled average human translation (HT) levels of lexical variety in automatic monolingual machine translation post-editing (MTPE) and exceeded average HT lexical variety levels in automatic bilingual MTPE. However, only one MT marker was considered, and the results of the post-editing were not quality-assessed for other features of MTPE that distinguish it from HT. It would therefore be unadvisable to generalize these findings at present. The author intends to carry out new translation experiments during the next academic year with ChatGPT Plus, instead of the free-plan version, both as an MT engine and as an automatic post-editor. The plan is to continue to evaluate the results manually and not automatically.
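The abstract does not state how lexical variety was measured, so the snippet below is only a minimal sketch, assuming a simple type-token ratio, of how a human translation and ChatGPT's post-edited MT output could be compared on this single marker. The function names, the naive tokenizer, and the placeholder texts are illustrative assumptions, not details taken from the study.

```python
# Minimal sketch: lexical variety as a type-token ratio (TTR).
# Assumption: the study may well use a different or more robust measure
# (e.g. one that corrects for text length); this is illustrative only.

import re


def tokenize(text: str) -> list[str]:
    """Lowercase the text and extract word tokens (naive English/Italian tokenizer)."""
    return re.findall(r"\w+", text.lower())


def type_token_ratio(text: str) -> float:
    """Ratio of distinct word forms (types) to total word forms (tokens)."""
    tokens = tokenize(text)
    return len(set(tokens)) / len(tokens) if tokens else 0.0


# Hypothetical usage: compare an HT reference with ChatGPT's post-edited MT output.
human_translation = "..."   # placeholder for a human translation
post_edited_mt = "..."      # placeholder for ChatGPT's MTPE output
print(f"HT lexical variety:   {type_token_ratio(human_translation):.3f}")
print(f"MTPE lexical variety: {type_token_ratio(post_edited_mt):.3f}")
```

A higher ratio indicates a larger proportion of distinct word forms; comparable ratios for HT and MTPE would correspond to the "equalled average HT levels of lexical variety" finding reported above, bearing in mind that raw TTR is sensitive to text length.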
File | Description | Type | Size | Format | Access
---|---|---|---|---|---
Paper2.proceedings.pdf | Preliminary evaluation of ChatGPT as a machine translation engine and as an automatic post-editor of raw machine translation output from other machine translation engines | Post-print document | 558.82 kB | Adobe PDF | Open Access