Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mkurman 's Collections
Medical Pre-Training Datasets
Medical QA Datasets

Medical Pre-Training Datasets

updated Aug 23

A collection of medical datasets suitable for LLMs pretraining

Upvote
1

  • openmed-community/TheBlueScrubs-v1-fixed

    Viewer • Updated Aug 29 • 11.1M • 269 • 12

  • mkurman/hindawi-journals-2007-2023

    Viewer • Updated Jun 9 • 298k • 1.6k • 3

  • epfl-llm/guidelines

    Viewer • Updated Mar 7, 2024 • 38k • 945 • 141

  • ncbi/Open-Patients

    Viewer • Updated May 11 • 180k • 260 • 22

  • AGBonnet/augmented-clinical-notes

    Viewer • Updated Jan 24, 2024 • 30k • 1.61k • 59

  • harishnair04/mtsamples

    Viewer • Updated Nov 7, 2024 • 5k • 229 • 1

  • Tonic/Health-Bench-Eval-OSS-2025-07

    Viewer • Updated May 17 • 9.67k • 178 • 2

  • zeroshot/arxiv-biology

    Viewer • Updated Jan 5, 2023 • 1.28k • 262 • 14
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs