Google Translate 

Google Translate is a service provided by Google Inc. to translate a section of text, or a webpage, into another language, with limits to the number of paragraphs, or range of technical terms, translated. For some languages, users are asked for alternate translations, such as for technical terms, to be included for future updates to the translation process.

Unlike other translation services such as Babel Fish, AOL, and Yahoo which use SYSTRAN, Google uses its own translation software.

Contents

Functions

Google Translate, like other automatic translation tools, has its limitations. While it can help the reader to understand the general content of a foreign language text, it does not deliver accurate translations and does not produce publication-standard content, for example it often translates words out of context and is deliberately not applying any grammatical rules, since its algorithms are based on statistical analysis rather than traditional rule-based analysis.1

Approach

Google translate is based on an approach called statistical machine translation, and more specifically, on research by Franz-Josef Och who won the DARPA contest for speed machine translation in 2003. Och is now the head of Google's machine translation department.

According to Och,2 a solid base for developing a usable statistical machine translation system for a new pair of languages from scratch, would consist in having a bilingual text corpus (or parallel collection) of more than a million words and two monolingual corpora of each more than a billion words. Statistical models from this data are then used to translate between those languages.

To acquire this huge amount of linguistic data, Google used United Nations documents. The same document is normally available in all six official UN languages, thus Google now has a 7-language corpus of 20 billion words' worth of human translations.citation needed

The availability of Arabic and Chinese as official UN languages is probably one of the reasons why Google Translate initially focused on the development of translation between English and those languages, and not, for example, Japanese and German, which are not official languages at the UN.

Google representatives have been very active at domestic conferences in Japan in the field asking researchers to provide them with bilingual corpora.3

Options

(by chronological order)

References

  1. ^ Franz-Josef Och confirmed this during his keynote speech at the MT Summit 2005, stating that "We do not need rules any more."
  2. ^ Keynote speech at the Machine Translation Summit 2005
  3. ^ Google was an official sponsor of the annual Computational Linguistics in Japan Conference ("Gengoshorigakkai") in 2007. Google also sent a delegate from its headquarters to the meeting of the members of the Computational Linguistic Society of Japan in march 2005, promising funding to researchers who would be willing to share text data.

See also

External links