Panlingual Translation

Scaling the existing translation technology to all the language-pairs in the world is not feasible, due to lack of aligned parallel corpora and other resources needed by the statistical machine translation algorithms. This project aims at combining all existing translation dictionaries present in the world into a single resource, translation graph; and performing probabilistic inference on this graph to automatically infer translations between language- pairs for which no dictionary exists.


We have compiled the largest translation dictionary, PanDictionary, that contains over 4 times the number of translations compared to the English Wiktionary.

Publications

Panlingual Lexical Translation via Probabilistic Inference Panlingual Lexical Translation via Probabilistic Inference
Mausam, Stephen Soderland and Oren Etzioni
AAAI Conference on Artificial Intelligence, 2010. Full Paper (PDF)
Panlingual Lexical Translation via Probabilistic Inference Panlingual Lexical Translation via Probabilistic Inference
Mausam, Stephen Soderland, Oren Etzioni, Daniel S. Weld, Kobi Reiter, Michael Skinner, Marcus Sammer and Jeff A Bilmes
Artificial Intelligence, 2010. Journal Article (PDF)
Lemmatic Machine Translation Lemmatic Machine Translation
Stephen Soderland, Christopher Lim, Mausam, Bo Qin, Oren Etzioni and Jonathan Pool
Machine Translation Summit, 2009. Full Paper (PDF)
A Rose is a Roos is a Ruusu: Querying Translations for Web Image Search A Rose is a Roos is a Ruusu: Querying Translations for Web Image Search
Janara Christensen, Mausam and Oren Etzioni
Annual Meeting of the Association for Computational Linguistics, 2009. Full Paper (PDF)
Compiling a Massive, Multilingual Dictionary via Probabilistic Inference Compiling a Massive, Multilingual Dictionary via Probabilistic Inference
Mausam, Stephen Soderland, Oren Etzioni, Daniel S. Weld, Michael Skinner and Jeff A Bilmes
Annual Meeting of the Association for Computational Linguistics, 2009. Full Paper (PDF)
Lexical Translation with Application to Image Search on the Web Lexical Translation with Application to Image Search on the Web
Oren Etzioni, Kobi Reiter, Stephen Soderland and Marcus Sammer
Machine Translation Summit, 2007. Full Paper (PDF)
Ambiguity Reduction for Machine Translation: Human-Computer Collaboration Ambiguity Reduction for Machine Translation: Human-Computer Collaboration
Marcus Sammer, Kobi Reiter, Stephen Soderland, Katrin Kirchhoff and Oren Etzioni
biennial conference of the Association for Machine Translation in the Americas, 2006. Full Paper (PDF)