ClinAlign
ClinAlign is a clinician-grounded healthcare alignment framework that scales from instance rubrics to a reusable library of clinical principles, enabling robust preference alignment and inference-time self-revision for medical LLMs.
Paper
Highlights
- Clinician-verified rubrics (HealthRubrics): a physician-validated preference dataset built by having clinicians revise and finalize LLM-drafted, checkable rubrics.
- Reusable principle library (HealthPrinciples): distilled clinician consensus as 119 broadly reusable principles organized by clinical dimensions (urgency / uncertainty / expertise / task type).
- Scalable supervision: principles can be converted into per-question rubrics for new, unlabeled medical queries, scaling training data without per-instance clinician authoring.
- Inference-time alignment tool: retrieve matched principles → generate rubric references → guide iterative self-revision at test time.
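
The inference-time loop described in the last highlight (retrieve, generate rubric, self-revise) can be sketched roughly as follows. This is an illustrative sketch only, not the ClinAlign implementation: the `Principle` class, the tag format, and the functions `retrieve`, `to_rubric`, and `self_revise` are hypothetical names, simple tag overlap stands in for whatever retrieval the framework actually uses, and the `revise` and `satisfied` callables are placeholders for LLM calls.

```python
from dataclasses import dataclass
from typing import Callable, FrozenSet, List


@dataclass(frozen=True)
class Principle:
    """Hypothetical record: one reusable principle plus its dimension tags."""
    text: str
    tags: FrozenSet[str]  # assumed format, e.g. {"urgency:high", "expertise:lay"}


def retrieve(principles: List[Principle], query_tags: FrozenSet[str],
             top_k: int = 3) -> List[Principle]:
    """Rank principles by tag overlap with the query (a stand-in retriever)."""
    scored = [(len(p.tags & query_tags), p) for p in principles]
    matched = [sp for sp in scored if sp[0] > 0]  # drop principles with no overlap
    matched.sort(key=lambda sp: sp[0], reverse=True)  # stable: ties keep input order
    return [p for _, p in matched[:top_k]]


def to_rubric(matched: List[Principle]) -> List[str]:
    """Turn matched principles into checkable rubric items.

    Assumption: in practice an LLM would specialize each principle to the
    question; here the principle text is used as the rubric item directly.
    """
    return [p.text for p in matched]


def self_revise(question: str, draft: str, rubric: List[str],
                revise: Callable[[str, str, List[str]], str],
                satisfied: Callable[[str, str], bool],
                max_rounds: int = 3) -> str:
    """Revise the draft until every rubric item is met or rounds run out."""
    for _ in range(max_rounds):
        unmet = [item for item in rubric if not satisfied(draft, item)]
        if not unmet:
            break
        draft = revise(question, draft, unmet)
    return draft
```

In this sketch `revise` and `satisfied` are injected as callables so the loop can be exercised with simple stubs before wiring in a real model; `max_rounds` bounds the number of revision passes.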
Results
Models: ClinAlign-4B • ClinAlign-30B-A3B
Acknowledgement