Great, what about different languages? What minimal mix is required for bilingual, trilingual, ...?
Also, what about adding a new language to an existing model? Or coding skills?
If you took your GPT2 model and tried to make it speak 16+ languages, would continued training on a new static mix work, or would it need to be started from scratch? Is there a risk of catastrophic forgetting?
Thanks