A multilingual collection of datasets modeling the language a person observes from birth until they acquire a native language.