1 The Chinese University of Hong Kong
2 Shanghai AI Laboratory
2 Shanghai AI Laboratory
Visual Self-Refine (VSR) turns chart parsing into a pixel-guided feedback loop: predict anchors, render them, inspect the rendered chart, and refine the final parsing from the verified pixels.
From the repository root:
pip install -r requirements.txtcd chart_generation
python setup_fonts.py
bash gen.shcd data_pipeline
python resize_and_remap.py
python render_anchors.pycd benchmark
python eval_api.py \
--input-json <path/to/chartp_annotations.json> \
--image-folder <path/to/chartp_images> \
--output-json outputs/chartp_gpt-4o.json@article{li2026visual,
title={Visual self-refine: A pixel-guided paradigm for accurate chart parsing},
author={Li, Jinsong and Dong, Xiaoyi and Zang, Yuhang and Cao, Yuhang and Wang, Jiaqi and Lin, Dahua},
journal={arXiv preprint arXiv:2602.16455},
year={2026}
}

