Conférence donnée par Shuly Wintner dans le cadre du séminaire "Language, cognition and computational models".
Translation is a text production mode that imposes cognitive (and cultural) constraints on the text producer. The product of this process, known as *translationese*, reflects these constraints; translated texts are therefore ontologically different from texts written originally in the same language. Many of the special properties of translationese are believed to be universal, in that they are manifest in any translated text regardless of the source and target languages. In fact, a cognitive framework has been suggested as the explanation of translation universals.
In this work we test several Translation Studies hypotheses using a computational methodology that is based on supervised machine learning. Casting the problem in the paradigm of authorship attribution, we define dozens of classifiers that implement various linguistically-informed features that reflect translation universals. While the practical task of distinguishing original from translated texts is easy, we focus not on improving the accuracy of classification, but rather on designing linguistically meaningful features and assessing their contribution to the task. We demonstrate that some feature sets are indeed good indicators of translationese, thereby corroborating some hypotheses, whereas others perform much worse (sometimes at chance level), indicating that some `universal' assumptions have to be reconsidered.
While our results are limited to the case of translationese, this methodology can be adopted for studying other kinds of texts produced under different cognitive constraints, such as texts produced by non-native speakers, by people with learning disabilities or medical problems, or by children acquiring a language.
Shuly Wintner is an associate professor of computer science at the University of Haifa, Israel. His research spans various areas of computational linguistics and natural language processing, including formal grammars, morphology, syntax, language resources, machine translation, and child language acquisition. He was the editor-in-chief of Springer's Research on Language and Computation, a program co-chair of EACL-2006, and is now the general chair of EACL-2014. He was among the founders, and twice (6 years) the chair, of ACL SIG Semitic. Currently, he serves as the Head of the Department of Computer Science in Haifa.Cliquer ICI pour fermer
Dernière mise à jour : 23/07/2013