Vision-language models fine-tuned from Qwen2.5-VL for medical report OCR. Trained on curated medical report datasets to accurately recognize complex textual content within medical report images, output structured Markdown, and answer questions based on extracted content. Available in 7B and 72B variants.

Model Details

Variants

Name Parameters Notes
BaichuanMed-OCR-7B 7B
BaichuanMed-OCR-72B 72B
open-weightbiologyvision