Skip to content

InternLM/VSR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🏆 ICLR 2026

Visual Self-Refine: A Pixel-Guided Paradigm
for Accurate Chart Parsing


1 The Chinese University of Hong Kong
2 Shanghai AI Laboratory

arXiv


Visual Self-Refine (VSR) turns chart parsing into a pixel-guided feedback loop: predict anchors, render them, inspect the rendered chart, and refine the final parsing from the verified pixels.

💻 Overview

🎈 Quick Start

From the repository root:

pip install -r requirements.txt

Chart generation

cd chart_generation
python setup_fonts.py
bash gen.sh

Data pipeline

cd data_pipeline
python resize_and_remap.py
python render_anchors.py

Benchmark evaluation

cd benchmark
python eval_api.py \
  --input-json <path/to/chartp_annotations.json> \
  --image-folder <path/to/chartp_images> \
  --output-json outputs/chartp_gpt-4o.json

📎 Citation

@article{li2026visual,
  title={Visual self-refine: A pixel-guided paradigm for accurate chart parsing},
  author={Li, Jinsong and Dong, Xiaoyi and Zang, Yuhang and Cao, Yuhang and Wang, Jiaqi and Lin, Dahua},
  journal={arXiv preprint arXiv:2602.16455},
  year={2026}
}

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors