A Reality Check of Vision-Language Pre-training in Radiology: Have We Progressed Using Text?

About "CXR_Unimodal_CM" weights:

  • A vision encoder for CXR pre-trained using only a vision encoder, via labels extracted trough NER extraction methods.
  • Pre-trained on CheXpert and MIMIC data.

If you find this repository useful, please consider citing this paper:

@inproceedings{dlilp,
    title={A Reality Check of Vision-Language Pre-training in Radiology: Have We Progressed Using Text?},
    author={Julio Silva-Rodríguez and Jose Dolz and Ismail {Ben Ayed}},
    booktitle={Information Processing in Medical Imaging (IPMI)},
    year={2025}
}
Downloads last month
33
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support