ConLID: Supervised Contrastive Learning for Low-Resource Language Identification Paper • 2506.15304 • Published Jun 18 • 1
Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 • 74
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 14 items • Updated Feb 25 • 19
Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 259