## ✨ Updates

- **[2025-08-08]** 🔥 We release the [HPDv3](https://huggingface.co/datasets/MizzenAI/HPDv3) dataset!
- **[2025-08-06]** 🔥 We release HPSv3: inference code, training code, CoHP code, [HPSv3 model weights](https://huggingface.co/MizzenAI/HPSv3), and a [PyPI package](https://pypi.org/project/hpsv3/).

## 📋 Table of Contents
1. [🚀 Quick Start](#🚀-quick-start)

<img src="assets/gradio.png" alt="Gradio Demo" width="500"/>
</p>

## 📊 Dataset

### Human Preference Dataset v3

Human Preference Dataset v3 (HPD v3) comprises 1.08M text-image pairs and 1.17M annotated pairwise samples. To model the wide spectrum of human preference, we introduce the newest state-of-the-art generative models and high-quality real photographs while retaining older models and lower-quality real images.

| Curated HPDv2 | - | 327763 | - | Train |
</details>

### Download HPDv3

```bash
huggingface-cli download --repo-type dataset MizzenAI/HPDv3 --local-dir /your-local-dataset-path
```
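
If you prefer to download from Python, a minimal sketch using the `huggingface_hub` API is shown below; it mirrors the CLI command above, and the local directory is a placeholder you should replace.

```python
# Minimal sketch: download the HPDv3 dataset snapshot via the huggingface_hub API
# (equivalent to the CLI command above; replace the local_dir placeholder).
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="MizzenAI/HPDv3",
    repo_type="dataset",
    local_dir="/your-local-dataset-path",
)
```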

### Pairwise Training Data Format

**Important note: for simplicity, the image at `path1` is always the preferred one.**

#### All Annotated Pairs (`all.json`)

**Important note: in HPDv3, the preferred sample is always placed first (`path1`).**

`all.json` contains **all** annotated pairs except the test pairs.

```bash
[
  # Samples from the HPDv3 annotation pipeline
  {
    "prompt": "Description of the visual content or the generation prompt.",
    "choice_dist": [12, 7],      # Distribution of annotator votes (12 votes for image1, 7 votes for image2)
    "confidence": 0.9999907,     # Confidence score reflecting preference reliability, based on annotators' capabilities (independent of choice_dist)
    "path1": "images/uuid1.jpg", # File path to the preferred image
    "path2": "images/uuid2.jpg", # File path to the non-preferred image
    "model1": "flux",            # Model used to generate the preferred image (path1)
    "model2": "infinity"         # Model used to generate the non-preferred image (path2)
  },
  # Samples from Midjourney
  {
    "prompt": "Description of the visual content or the generation prompt.",
    "choice_dist": null,         # No vote-distribution information available from Discord
    "confidence": null,          # No confidence information available from Discord
    "path1": "images/uuid1.jpg", # File path to the preferred image
    "path2": "images/uuid2.jpg", # File path to the non-preferred image
    "model1": "midjourney",      # Both images in these pairs are generated by Midjourney
    "model2": "midjourney"
  },
  # Samples from Curated HPDv2
  {
    "prompt": "Description of the visual content or the generation prompt.",
    "choice_dist": null,         # No vote-distribution information in the original HPDv2 training set
    "confidence": null,          # No confidence information in the original HPDv2 training set
    "path1": "images/uuid1.jpg", # File path to the preferred image
    "path2": "images/uuid2.jpg", # File path to the non-preferred image
    "model1": "hpdv2",           # No specific model name in the original HPDv2 training set, so set to hpdv2
    "model2": "hpdv2"
  }
]
```
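
A minimal sketch for reading these pairs in Python is shown below; it assumes `all.json` and the `images/` folder sit at the root of your local dataset path, and the confidence threshold is only an illustrative filter.

```python
# Minimal sketch: iterate the HPDv3 pairwise annotations in all.json.
# The dataset root is a placeholder; point it at your local download.
import json
from pathlib import Path

root = Path("/your-local-dataset-path")
with open(root / "all.json") as f:
    pairs = json.load(f)

high_confidence = []
for pair in pairs:
    # choice_dist and confidence are null (None) for Midjourney and HPDv2 pairs
    if pair["confidence"] is not None and pair["confidence"] < 0.99:
        continue  # example filter: drop low-confidence annotated pairs
    # path1 is always the preferred image, path2 the non-preferred one
    high_confidence.append((pair["prompt"], root / pair["path1"], root / pair["path2"]))

print(f"kept {len(high_confidence)} of {len(pairs)} pairs")
```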

#### Train Set (`train.json`)

We sample part of the training data from `all.json` to build the training set `train.json`. Moreover, to improve robustness, we integrate randomly sampled data from [Pick-a-Pic](https://huggingface.co/datasets/pickapic-anonymous/pickapic_v1) and [ImageRewardDB](https://huggingface.co/datasets/zai-org/ImageRewardDB), provided as `pickapic.json` and `imagereward.json`. For these two datasets we only provide the pair information; the corresponding images can be found in their official dataset repositories.
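
For reference, a minimal sketch of combining these pair lists for training is shown below; it assumes `pickapic.json` and `imagereward.json` are JSON lists with the same pair layout as `train.json`, and that image paths for those two datasets are resolved against their official repositories.

```python
# Minimal sketch: concatenate the HPDv3 training pairs with the auxiliary pair lists.
# Assumes all three files are JSON lists of pair records; adjust paths as needed.
import json
from pathlib import Path

root = Path("/your-local-dataset-path")
train_pairs = []
for name in ["train.json", "pickapic.json", "imagereward.json"]:
    with open(root / name) as f:
        train_pairs.extend(json.load(f))

print(f"total training pairs: {len(train_pairs)}")
```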

#### Test Set (`test.json`)

```bash
[
  {
    "prompt": "Description of the visual content",
    "path1": "images/uuid1.jpg", # Preferred sample
    "path2": "images/uuid2.jpg", # Non-preferred sample
    "model1": "flux",            # Model used to generate the preferred sample (path1)
    "model2": "infinity"         # Model used to generate the non-preferred sample (path2)
  }
]
```
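
Because `path1` always holds the preferred sample, pairwise accuracy on `test.json` reduces to checking how often a scorer ranks `path1` above `path2`. The sketch below assumes a generic `score_image` placeholder for whatever preference scorer you plug in; it is not a specific HPSv3 API.

```python
# Minimal sketch: pairwise accuracy on test.json.
# `score_image` is a placeholder for your preference scorer; it should return
# a higher value for images that better match the prompt.
import json
from pathlib import Path

def score_image(prompt: str, image_path: Path) -> float:
    raise NotImplementedError  # plug in your scorer here

root = Path("/your-local-dataset-path")
with open(root / "test.json") as f:
    pairs = json.load(f)

correct = 0
for pair in pairs:
    s1 = score_image(pair["prompt"], root / pair["path1"])  # preferred
    s2 = score_image(pair["prompt"], root / pair["path2"])  # non-preferred
    correct += int(s1 > s2)

print(f"pairwise accuracy: {correct / len(pairs):.4f}")
```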

## 🏋️ Training

### 🚀 Training Command

```bash