Automatic Translation in Multilingual Electronic Meetings

Volume 13, No. 3
July 2009

Milam Aiken

Mina Park

Lakisha Simmons

Tobin Lindblom

Tobin Lindblom is an Assistant Professor of Management Information Systems at Northern State University in South Dakota. He received his Ph.D. in Management Information Systems from the University of Mississippi. His research interests include Group Support Systems and Open-Source technologies, including clustered computer environments.
Tobin Lindblom can be reached at Tobin.Lindblom@northern.edu

by Milam Aiken, Mina Park, Lakisha Simmons, and Tobin Lindblom

Abstract

Electronic meetings, e.g., chat rooms and bulletin boards, can be more efficient and effective than traditional oral discussions, but until recently, online groups speaking many languages could not benefit from machine translation (MT). Although linguists can provide translations for group members as they read comments during a multilingual discussion, this is not feasible for large groups and many languages. As a solution, we propose a fully automated multilingual meeting system. An example meeting, in which comments typed in English are translated to Dutch and Russian, illustrates its potential to reduce many multinational communication barriers.

Introduction

In the past, oral meetings involving speakers of multiple languages required participants to adopt a common language, e.g., English, or use interpreters. In the former case, some participants might not be fluent in the common language and could be uncomfortable speaking it. In the latter case, human interpreters could be expensive and difficult to schedule.

Many studies have shown that people speaking a single language can use computer-based group support systems to improve the efficiency and effectiveness of meetings focused on sharing ideas among many participants. In these electronic meetings, group members can type and read comments simultaneously while all text is automatically recorded to a file. Because these meetings often provide anonymous input of ideas, experiments have shown that people participate more, generate more solutions to problems, take less time, and are more satisfied with the meeting process (Adkins, et al., 2003; Fjermestad & Hiltz, 2001).

Electronic meetings can support multiple languages with the integration of machine translation (Fügen, et al., 2007; Lim & Yang, 2008). Using such a system, group members could contribute typed comments in their own native languages while others' comments typed in different languages are translated automatically for presentation on the appropriate screens.

In this paper, we describe six Web-based machine translation services that can be used to assist with the understanding of foreign text and eight electronic chat systems that provide translations between language pairs. Then, we introduce a new, locally developed multilingual electronic meeting system that provides automatic translation among 41 languages. Finally, in a test of comprehension accuracy, we rank the languages using five simple phrases.

Web-based machine translation

Since the introduction of Babelfish on the Web in the late 1990s (Yang & Lange, 1998), free, online translators have been available for use on text, documents, and Web pages. Currently, there are at least six free services (shown in Table 1) that provide support for different numbers of language pairs (e.g., English to Spanish, French to Russian, and Chinese to English).

Table 1: Free Web-based translation services

Service	URL	Underlying MT	Language Pairs
Babelfish	http://babelfish.yahoo.com	Systran	38
Freetranslation	www.freetranslation.com	SDL	19
Google Translate	http://translate.google.com	Google	1,640
Online-translator	www.online-translator.com	PROMT	24
Reverso	www.reverso.net	Reverso	19
Worldlingo	http://www2.worldlingo.com	Worldlingo	225

Using these Web sites, chat room participants could translate foreign comments, but conducting these translations can be confusing to group members (Flanagan, 1997). Group members are not likely to put forth the effort when faced with many comments in different languages, and a meeting facilitator providing translations for the discussion using these systems will be overwhelmed with the task once the group size reaches 5 or 6 with more than 2 or 3 languages (O'Hagan & Ashworth, 2002). More staff members could be added to help with the translations, but coordination would be difficult as they lost track of which comments were translated and which were not. Instead, automated translation is needed in multilingual electronic meetings.

Automated multilingual meetings

Although the first multilingual application, which automatically translated between English and Spanish in an electronic meeting, appeared in the early 1990s (Aiken, et al., 1992; Aiken, et al., 1994), at least two United States patents filed several years later claimed to do essentially the same thing:

1. US Patent 5966685 - System for parallel foreign language communication over a computer network (Flanagan, et al., 1999)

US Patent Issued on October 12, 1999

Abstract: A system is disclosed which allows for an electronic discussion group user to communicate with another user who speaks a different language. Machine translators and other software are incorporated to translate messages, thereby creating parallel discussion groups in different languages.

2. US Patent 5987401 - Language translation for real-time text-based conversations (Trudeau, 1999)

US Patent Issued on November 16, 1999

Abstract: A real-time language translation technique for text-based conversations. The messages forming the text-based conversation amongst a plurality of participants to the conversation are translated in real-time either from a user language to a conversation language of the conversation, or from the conversation language to the user language. The result is that the user is able to seamlessly converse in a text-based conversation (in the conversation language) using a language other than the conversation language. The invention is particularly advantageous for on-line text-based conversations, wherein users of on-line text-based conversations are able to seamlessly converse with each other in different languages.

Subsequently, at least seven applications (shown in Table 2) were developed that provide automatic translation for instant messaging between pairs of individuals.

Table 2: Online chat systems with automatic translation

Application	URL	Languages
Amikai	http://www.riskworld.com/PressRel/2000/00q3/PR00a076.htm	9
Annochat	http://www.langrid.org/association/pangaeasupport/indexe.html	4
ChatTranslator	http://www.chattranslator.com/	7
Free2IM	http://openaimblog.aol.com/2008/05/06/instant-language-translation-with-free2im	13
Hab.la Realtime Chat	http://www.programmableweb.com/mashup/hab.la-realtime-chat-translation	41
IBM Lotus Sametime	http://my.advisor.com/doc/07484 http://www-01.ibm.com/software/lotus/sametime/	7
MeGlobe	http://meglobe.com/	15
WorldLingo Chat	http://www.worldlingo.com/en/products/chat_translator.html	15

However, multilingual meetings usually involve more than two people, and they often use more than two languages. We believe there is no system available that can accommodate such a group by automatically translating among several languages at once, but there is a clear need for such an application.

Polyglot

To provide support for large, multilingual groups in an electronic meeting, we developed a prototype system that allows participants to type comments anonymously and simultaneously while reading all others' comments. The software is called Polyglot ("many tongues"), and it uses the Google Translate application programming interface (API) to translate between any of 41 languages (1,640 language pairs). Unlike many other MT systems, Google Translate is based upon a statistical translation system in which a language model is trained on billions of words of equivalent text in many different languages, e.g., by comparing a German edition of the Bible with a Russian edition, and comprehension results have been very good (Geer, 2005). Relatively few evaluations of free, online MT systems have been conducted (e.g., Aiken, et al., 2006; Bezhanova, et al., 2005), but one study (NIST, 2005) of 20 free, commercial, and research MT systems showed that Google Translate was the most accurate in three of four tests.
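
The figure of 1,640 language pairs follows from counting each translation direction separately: each of the 41 languages can serve as the source for the other 40. A minimal check of that arithmetic, with a function name chosen here purely for illustration:

```python
def directed_pairs(n_languages: int) -> int:
    """Count ordered (source, target) pairs among n distinct languages."""
    # Each language can be translated into every other language,
    # and English->Dutch is counted separately from Dutch->English.
    return n_languages * (n_languages - 1)


pairs_for_41 = directed_pairs(41)
print(pairs_for_41)  # 41 * 40 = 1640
```

The same count explains why a pairwise chat translator does not scale to a meeting: the number of directions grows roughly with the square of the number of languages in the room.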

At least seven mashups, i.e., software that combines two or more tools to create new services (Ennals & Garofalakis, 2007), have already been developed with the Google Translate API, including MultiTranslator, which provides translation in many languages at the same time (Programmableweb, 2009), and researchers are developing new applications every day (Grubinger, et al., 2008).

Figures 1 to 3 show an example of how Polyglot can be used by a multilingual group discussing the G20 protests in London (Telegraph, 2009) in English, Dutch, and Russian. Each user types a comment in the top textbox, and within two or three seconds, it is available for the other participants to read in their native languages at their own computers in the lower textbox.

Figure 1: English participants' view of multilingual meeting

Figure 2: Dutch participants' view of multilingual meeting

Figure 3: Russian participants' view of multilingual meeting
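
The flow illustrated in the figures can be sketched as a simple fan-out: one anonymous comment comes in, and one version per meeting language goes out. This is not Polyglot's actual code. The `Meeting` class, its `join` and `post` methods, and the stand-in `fake_translate` function are all hypothetical, and a real system would call a translation service in place of the stub.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical translator signature: (text, source_lang, target_lang) -> text
Translator = Callable[[str, str, str], str]


@dataclass
class Meeting:
    translate: Translator
    # One shared transcript per language code; authors are not recorded,
    # which keeps comments anonymous.
    transcripts: Dict[str, List[str]] = field(default_factory=dict)

    def join(self, lang: str) -> None:
        """Register a language so its speakers receive a transcript."""
        self.transcripts.setdefault(lang, [])

    def post(self, text: str, source_lang: str) -> None:
        """Append the comment to every transcript, translating as needed."""
        for lang, transcript in self.transcripts.items():
            if lang == source_lang:
                transcript.append(text)  # no translation for the author's language
            else:
                transcript.append(self.translate(text, source_lang, lang))


def fake_translate(text: str, source: str, target: str) -> str:
    """Stub that only tags the text; it performs no real translation."""
    return f"[{source}->{target}] {text}"


meeting = Meeting(translate=fake_translate)
for code in ("en", "nl", "ru"):
    meeting.join(code)
meeting.post("Pleased to meet you.", "en")
print(meeting.transcripts["nl"])
```

Translating once per language rather than once per participant keeps the number of translation requests tied to the number of languages in the meeting, not to the group size.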

Translation comprehension is still far from perfect because Google Translate's accuracy varies with sentence and vocabulary complexity and by language. In an attempt to judge Polyglot's potential performance, two objective, English-speaking evaluators ranked the 40 non-English languages supported by Google Translate based on the scales provided by Guyon (2003):

Comprehension

1. The text is clear, easy to understand and grammatically correct and does not require any corrections.
2. The text contains minor errors such as incorrect prepositions or articles, but is otherwise impeccable.
3. The text is a mixture of minor errors and incorrect terms, but the meaning is still understandable.
4. The text is a mixture of minor errors and incorrect terms, and it takes a definite effort to understand the meaning.
5. The text is incomprehensible gibberish.

Acceptability

1. The text is perfectly acceptable.
2. The reader notices slight anomalies in the text.
3. The reader feels somewhat uncomfortable reading the text.
4. The reader has the impression that the text is not very serious.
5. The reader feels insulted to have been presented with such a text.

Meaning

1. The translation conveys the meaning of the original exactly.
2. Minor nuances are missing.
3. The translation more or less conveys the meaning of the original.
4. The translation does not convey the meaning of the original very accurately.
5. The translation does not convey the meaning of the original at all.

The equivalents for the five English sentences below were obtained for each of the 40 languages from http://www.omniglot.com/language/phrases/index.htm and translated back to English with Google Translate.

Pleased to meet you.
My hovercraft is full of eels.
One language is never enough.
I don't understand.
I love you.

As shown in Table 3, the evaluators were able to understand all of the translations back into English, but the five lowest-ranked languages (Latvian, Hindi, Arabic, Lithuanian, and Galician) took more effort.

Table 3: Ranking of 40 non-English languages supported by Google Translate

(lower score better)

Language	Comprehension Rank	Comprehension Score	Acceptability Rank	Acceptability Score	Meaning Rank	Meaning Score
Dutch	1	1.3	1	1.3	3	1.5
Hungarian	1	1.3	1	1.3	3	1.5
Czech	3	1.4	3	1.4	1	1.4
Estonian	3	1.4	3	1.4	1	1.4
Chinese	5	1.5	5	1.5	9	1.8
Italian	5	1.5	5	1.5	9	1.8
Korean	5	1.5	5	1.5	3	1.5
Portuguese	5	1.5	5	1.5	3	1.5
French	9	1.7	9	1.7	9	1.8
German	9	1.7	9	1.7	19	2.0
Russian	9	1.7	9	1.7	7	1.7
Slovak	9	1.7	9	1.7	9	1.8
Slovenian	9	1.7	9	1.7	7	1.7
Danish	14	1.8	14	1.8	9	1.8
Norwegian	14	1.8	14	1.8	9	1.8
Spanish	14	1.8	14	1.8	9	1.8
Bulgarian	17	1.9	17	1.9	16	1.9
Finnish	17	1.9	17	1.9	16	1.9
Polish	17	1.9	17	1.9	16	1.9
Filipino	20	2.0	20	2.0	21	2.2
Hebrew	20	2.0	20	2.0	19	2.0
Swedish	20	2.0	20	2.0	21	2.2
Turkish	20	2.0	20	2.0	21	2.2
Croatian	24	2.2	24	2.2	28	2.5
Catalan	25	2.3	25	2.3	28	2.5
Japanese	25	2.3	25	2.3	24	2.3
Maltese	25	2.3	25	2.3	25	2.4
Serbian	28	2.4	28	2.4	25	2.4
Ukrainian	28	2.4	28	2.4	25	2.4
Vietnamese	28	2.4	28	2.4	30	2.6
Greek	31	2.5	31	2.5	30	2.6
Indonesian	31	2.5	31	2.5	30	2.6
Romanian	33	2.6	33	2.6	33	2.7
Albanian	34	2.7	34	2.7	34	2.8
Thai	35	2.8	35	2.8	35	3.1
Latvian	36	3.1	36	3.1	35	3.1
Hindi	37	3.2	37	3.2	39	3.6
Arabic	38	3.4	38	3.4	37	3.4
Lithuanian	38	3.4	38	3.4	37	3.4
Galician	40	3.5	40	3.5	40	3.8
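
The ranks in Table 3 are consistent with standard competition ranking, in which tied scores share the same rank and the following rank is skipped (1, 1, 3, 3, 5, and so on). The sketch below reproduces that pattern for the first five comprehension scores in the table. The function name is illustrative, and the original study does not state how the ranks were computed.

```python
from typing import Dict, Optional


def competition_rank(scores: Dict[str, float]) -> Dict[str, int]:
    """Rank items by ascending score; ties share the lowest rank in their group."""
    ordered = sorted(scores.items(), key=lambda item: item[1])
    ranks: Dict[str, int] = {}
    previous_score: Optional[float] = None
    current_rank = 0
    for position, (name, score) in enumerate(ordered, start=1):
        if score != previous_score:
            # A new score starts a new rank equal to its position in the list.
            current_rank = position
            previous_score = score
        ranks[name] = current_rank
    return ranks


# Comprehension scores copied from the first five rows of Table 3.
comprehension = {
    "Dutch": 1.3,
    "Hungarian": 1.3,
    "Czech": 1.4,
    "Estonian": 1.4,
    "Chinese": 1.5,
}
ranks = competition_rank(comprehension)
print(ranks)
```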

Conclusion

The multilingual meeting prototype described here can support large groups using up to 41 languages with translations provided automatically within a few seconds via a link with Google Translate. Early results indicate a high level of comprehension for many translated comments, and future research will investigate the accuracy of more complex sentence translations as well as how the prototype performs with other languages.

References

Adkins, M., Burgoon, M., and Nunamaker, J. (2003). Using group support systems for strategic planning with the United States Air Force. Decision Support Systems, 34(3), 315-337.
Aiken, M., Martin, J., Paolillo, J., and Shirani, A. (1994). A group decision support system for multilingual groups. Information and Management, 26, 155-161.
Aiken, M., Martin, J., Reithel, B., Shirani, A., and Singleton, T. (1992). Using a group decision support system for multicultural and multilingual communication. Proceedings of the 23rd Annual Meeting of the Decision Sciences Institute, November, 792-794, San Francisco, California.
Aiken, M., Vanjani, M., and Wong, Z. (2006). Measuring the accuracy of Spanish-to-English translations. Issues in Information Systems, 7(2), 125-128.
Bezhanova, O., Byezhanova, M., and Landry, O. (2005). Comparative analysis of the translation quality produced by three MT systems. McGill University, Montreal, Canada.
Ennals, R. and Garofalakis, M. (2007). MashMaker: Mashups for the masses. Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, Beijing, China, 1116 - 1118.
Estival, D. (2005). The Language translation interface: A perspective from the users. Machine Translation, 19(2), 175-192.
Fjermestad, J. and Hiltz, S. (2001). Group support systems: A descriptive evaluation of case and field studies. Journal of Management Information Systems, 17(3), 115-160.
Flanagan, M. (1997). Machine translation of interactive texts. Machine Translation Summit VI Proceedings, Washington, DC (AMTA) p. 50. Retrieved April 6, 2009 from http://www.mt-archive.info/MTS-1997-Flanagan.pdf
Flanagan, M., Trevor, A., and Jensen, P. (1999). System for parallel foreign language communication over a computer network. US Patent 5966685. PatentStorm. Retrieved April 6, 2009 from http://www.patentstorm.us/patents/5966685.html
Fügen, C., Waibel, A., and Kolss, M. (2007). Simultaneous translation of lectures and speeches. Machine Translation, 21(4), 209-252.
Geer, D. (2005). Computer, October, 18-21. Retrieved April 6, 2009 from http://www.geercom.com/rx018.pdf
Grubinger, M., Clough, P., Hanbury, A., and Müller, H. (2008). Overview of the ImageCLEFphoto 2007 Photographic Retrieval Task. In Gao, X., Müller, H., Loomes, M. (et al.) (Eds.), Advances in Multilingual and Multimodal Information Retrieval. Springer: Berlin.
Guyon, A. (2003). Analysis of machine translation for the virtual museum of Canada (VMC). Retrieved April 6, 2009 from http://www.chin.gc.ca/English/Digital_Content/Machine_Translation/phase1_rating.html
Kim, K. and Bonk, C. (2002). Cross-cultural comparisons of online communication. Journal of Computer-mediated Communication, 8(1). Retrieved April 6, 2009 from http://jcmc.indiana.edu/vol8/issue1/kimandbonk.html
Lim, J. and Yang, Y. (2008). Exploring computer-based multilingual negotiation support for English-Chinese dyads: Can we negotiate in our native languages? Behaviour and Information Technology, 27(2), 139-151.
NIST (2005). Machine Translation Evaluation Official Results. Retrieved April 6, 2009 from http://www.itl.nist.gov/iad/mig//tests/mt/2005/doc/mt05eval_official_results_release_20050801_v3.html
O'Hagan, M. and Ashworth, D. (2002). Translation-mediated Communication in a Digital World: Facing the Challenges of Globalization and Localization (Topics in Translation, 23). Multilingual Matters: Clevedon, England.
Ogura, K., Hayashi, Y., Nomura, S., and Ishida, I. (2004). User adaptation in MT-mediated communication. The First International Joint Conference on Natural Language Processing, 596-601.
Programmableweb (2009). Mashups Tag Search: Translation. Retrieved April 6, 2009 from http://www.programmableweb.com/tag/Translation
Telegraph (2009). G20 London protests: How are they for you? London Telegraph. Retrieved April 6, 2009 from http://www.telegraph.co.uk/finance/financetopics/g20-summit/5084566/G20-London-protests-How-are-they-for-you.html
Trudeau, J. (1999). Language translation for real-time text-based conversations. US Patent 5987401. PatentStorm. Retrieved April 6, 2009 from http://www.patentstorm.us/patents/5987401.html
Yang, J. and Lange, E. (1998). SYSTRAN on AltaVista: A User study on real-time machine translation on the Internet. In G. Carbonell and J. Siekmann (eds.) Machine Translation and the Information Soup, Springer: Berlin.