## DPO dataset construction
```
# Make sure the MIMIC files are in place, at a path like MedOmni_DPO/data/xray/dpo/images/physionet.org
cd MedOmni_DPO/data/xray/dpo/utils
python process_mask.py # Mask DPO from https://arxiv.org/abs/2406.06496
cd - # return to the starting directory; both cd paths are relative to it
cd MedOmni_DPO/utils
python build_dataset.py
```
This produces two JSON files, `train.json` and `val.json`, in `MedOmni_DPO/json_files/medinterp/`. If you prefer not to build the dataset yourself, both JSON files are already provided.
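Before fine-tuning, it can help to confirm that both files exist and parse. The following is a minimal sketch, not part of the repository: the helper name `summarize_split` is hypothetical, and it assumes only that each file is valid JSON, making no assumption about the record schema.

```python
import json
from pathlib import Path


def summarize_split(path):
    """Load one JSON file and return its top-level type name and entry count."""
    with open(path, "r", encoding="utf-8") as f:
        data = json.load(f)
    # Only the top-level container is inspected; the record schema is not assumed.
    count = len(data) if isinstance(data, (list, dict)) else None
    return type(data).__name__, count


if __name__ == "__main__":
    # Directory taken from the text above; adjust it if your checkout differs.
    json_dir = Path("MedOmni_DPO/json_files/medinterp")
    for name in ("train.json", "val.json"):
        path = json_dir / name
        if not path.is_file():
            print(f"{name}: missing at {path}")
            continue
        kind, count = summarize_split(path)
        print(f"{name}: top-level {kind} with {count} entries")
```

Run it from the directory that contains `MedOmni_DPO`. A missing file or a JSON parse error at this point indicates that the construction step did not finish.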
## Fine-tuning the MedVersa checkpoint
```
sh run.sh
```
## Evaluation
```
sh eval.sh # Need to specify --epoch-id and --model-id (medomni/output/medomni_v0)
```