TUFS Media Corpus

TUFS Media Corpus is a collection of parallel news articles translated into Japanese from various languages. The corpus is a product of the "TUFS Media Project," an ongoing project at Tokyo University of Foreign Studies (TUFS) that produces Japanese translations of news articles in eight languages (Arabic, Bengali, Burmese, Indonesian, Persian, Turkish, Urdu, and Vietnamese).
The corpus is available here.