torch transformers sentence-transformers spacy nltk pymupdf gradio spacy[transformers] datasets faker tqdm pandas scikit-learn sentencepiece