XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization

1 · Google AI Research · April 13, 2020, 8:18 p.m.
Posted by Melvin Johnson, Senior Software Engineer, Google Research and Sebastian Ruder, Research Scientist, DeepMind One of the key challenges in natural language processing (NLP) is building systems that not only work in English but in all of the world’s ~6,900 languages. Luckily, while most of the world’s languages are data sparse and do not have enough data available to train robust models on their own, many languages do share a considerable amount of underlying structure. On the vocabulary ...