metadata
license: mit
base_model:
  - CodeGoat24/UnifiedReward-2.0-qwen3vl-4b

Model Summary

UnifiedReward-Edit-qwen3vl-4b is a unified reward model for both Text-to-Image and Image-to-Image generation. For the image editing reward task, our models support:

  1. Pairwise Rank: directly judge which of two edited images is better.

  2. Pairwise Score: assign a separate score to each image in a pair.

  3. Pointwise Score: rate a single image on two axes: instruction-following and overall image quality.
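
To make the relationship between the three modes concrete, here is a minimal Python sketch of how their results could be represented on the caller's side. The class names, field names, and the tie-handling rule are hypothetical and chosen only for illustration; the model's actual output format and score ranges are determined by its inference code, not by this sketch.

```python
from dataclasses import dataclass
from typing import Literal, Optional


@dataclass
class PairwiseRank:
    """Mode 1 (hypothetical container): which of two edited images is better."""
    winner: Literal["image_1", "image_2"]


@dataclass
class PairwiseScore:
    """Mode 2 (hypothetical container): one separate score per image in a pair."""
    score_1: float
    score_2: float

    def to_rank(self) -> Optional[PairwiseRank]:
        # Assumption for illustration: equal scores mean no preference.
        if self.score_1 == self.score_2:
            return None
        winner = "image_1" if self.score_1 > self.score_2 else "image_2"
        return PairwiseRank(winner)


@dataclass
class PointwiseScore:
    """Mode 3 (hypothetical container): one image rated on two axes."""
    instruction_following: float
    image_quality: float
```

A pairwise score carries strictly more information than a pairwise rank, so a ranking can be derived from it, as `to_rank` does here, whereas the pointwise mode needs no second image at all.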

🚀 The image editing reward inference code is available in the UnifiedReward-Edit/ directory, while the T2I inference code is unchanged from previous models. The editing training data is preprocessed from EditScore, EditReward, and Pico-Nano-Banana. We sincerely appreciate all contributors!

For further details, please refer to the following resources:

Citation

@article{unifiedreward,
  title={Unified reward model for multimodal understanding and generation},
  author={Wang, Yibin and Zang, Yuhang and Li, Hao and Jin, Cheng and Wang, Jiaqi},
  journal={arXiv preprint arXiv:2503.05236},
  year={2025}
}