DML Translations blog: 9-18-15

There's a difference between computer assisted translation (CAT) or machine-assisted translation, and machine translation.

CAT Tools: SDL Trados, WordFast

Translation Management Systems: XTRF,

On machine translation:

Word-sense disambiguation concerns finding a suitable translation when a word can have more than one meaning. The problem was first raised in the 1950s by Yehoshua Bar-Hillel.^[12]He pointed out that without a "universal encyclopedia", a machine would never be able to distinguish between the two meanings of a word.^[13] Today there are numerous approaches designed to overcome this problem. They can be approximately divided into "shallow" approaches and "deep" approaches.

Shallow approaches assume no knowledge of the text. They simply apply statistical methods to the words surrounding the ambiguous word. Deep approaches presume a comprehensive knowledge of the word. So far, shallow approaches have been more successful.^{[citation needed]}

C laude Piron, a long-time translator for the United Nations and the World Health Organization, wrote that machine translation, at its best, automates the easier part of a translator's job; the harder and more time-consuming part usually involves doing extensive research to resolve ambiguities in the source text, which the grammatical and lexical exigencies of the target language require to be resolved:

Why does a translator need a whole workday to translate five pages, and not an hour or two? ..... About 90% of an average text corresponds to these simple conditions. But unfortunately, there's the other 10%. It's that part that requires six [more] hours of work. There are ambiguities one has to resolve. For instance, the author of the source text, an Australian physician, cited the example of an epidemic which was declared during World War II in a "Japanese prisoner of war camp". Was he talking about an American camp with Japanese prisoners or a Japanese camp with American prisoners? The English has two senses. It's necessary therefore to do research, maybe to the extent of a phone call to Australia.^[14]

The ideal deep approach would require the translation software to do all the research necessary for this kind of disambiguation on its own; but this would require a higher degree of AI than has yet been attained. A shallow approach which simply guessed at the sense of the ambiguous English phrase that Piron mentions (based, perhaps, on which kind of prisoner-of-war camp is more often mentioned in a given corpus) would have a reasonable chance of guessing wrong fairly often. A shallow approach that involves "ask the user about each ambiguity" would, by Piron's estimate, only automate about 25% of a professional translator's job, leaving the harder 75% still to be done by a human.

Machine translation seems to be workiing well when the original document is written in "controlled language" but that takes a certain style of writing: https://en.wikipedia.org/wiki/Controlled_natural_language

INTERESTING ARTICLE FOR REVIEW by Mathias Winther Madsen:

"The Limits of Machine Translation"

https://drive.google.com/file/d/0B7-4xydn3MXJZjFkZTllZjItN2Q5Ny00YmUxLWEzODItNTYyMjhlNTY5NWIz/view?ddrp=1#

When you translate a law, a job application, a fire emergency instruction, a military order, or a medical prescription, you do not want the translation to be “fairly clear” and “almost accurate,” but clearand accurate. As two more recent researchers dryly commented, “a 95% system in the worst case produces a translated text analogous to a jar of cookies, only 5% of which are poisoned” (Carbonell and Tomita, 1987, p. 69).

DML Translations blog

Friday, September 18, 2015

9-18-15

No comments:

Post a Comment