Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models Paper • 2310.00840 • Published Oct 2, 2023
No Language Left Behind: Scaling Human-Centered Machine Translation Paper • 2207.04672 • Published Jul 11, 2022
The FLoRes Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English Paper • 1902.01382 • Published Feb 4, 2019
Embedding-Enhanced Giza++: Improving Alignment in Low- and High-Resource Scenarios Using Embedding Space Geometry Paper • 2104.08721 • Published Apr 18, 2021