Fine-grained Image Quality Assessment for Perceptual Image Restoration

1School of Artificial Intelligence, Xidian University
*Equal Contribution, Corresponding Author

Abstract

Recent years have witnessed remarkable achievements in perceptual image restoration (IR), creating an urgent demand for accurate image quality assessment (IQA), which is essential for both performance comparison and algorithm optimization. Unfortunately, the existing IQA metrics exhibit inherent weakness for IR task, particularly when distinguishing fine-grained quality differences among restored images. To address this dilemma, we contribute the first-of-its-kind fine-grained image quality assessment dataset for image restoration, termed FGRestore, comprising 18,408 restored images across six common IR tasks. Beyond conventional scalar quality scores, FGRestore was also annotated with 30,886 fine-grained pairwise preferences. Based on FGRestore, a comprehensive benchmark was conducted on the existing IQA metrics, which reveal significant inconsistencies between score-based IQA evaluations and the fine-grained restoration quality. Motivated by these findings, we further propose FGResQ, a new IQA model specifically designed for image restoration, which features both coarse-grained score regression and fine-grained quality ranking. Extensive experiments and comparisons demonstrate that FGResQ significantly outperforms state-of-the-art IQA metrics. Data and code will be publicly available.

FGResQ model architecture

Preliminary Validation Analysis

In image restoration tasks, both algorithm comparison and optimization frequently involve evaluating images with subtle quality differences. Algorithm comparison requires distinguishing between restoration results with marginal quality differences, while parameter optimization involves incremental quality changes that demand sensitive assessment methods to identify optimal configurations. To investigate whether existing IQA methods can objectively capture fine-grained quality differences in IR task, we conduct a comprehensive computational analysis on established IQA datasets. Specifically, we evaluate state-of-the-art IQA methods to assess their fine-grained discrimination capabilities.

Performance comparison across different MOS ranges on PIPAL dataset.

Type Method [0.0,0.2)
SRCC
[0.2,0.4)
SRCC
[0.4,0.6)
SRCC
[0.6,0.8)
SRCC
[0.8,1.0]
SRCC
Overall
SRCC
FR PSNR0.3230.0820.2090.1610.0720.422
SSIM0.2930.1080.2580.2540.0490.530
LPIPS-0.0340.0770.3250.2870.1240.612
DISTS0.1680.1590.3100.2420.1650.585
NR NIQE-0.126-0.0020.1070.0010.0800.153
IL-NIQE-0.235-0.0980.1260.1280.0540.289
BRISQUE-0.1420.0250.1250.0350.1310.185
DB-CNN-0.1570.3210.3530.330-0.0160.636
HyperIQA0.1000.2740.3140.2920.0320.584
MetaIQA0.0370.1600.2040.174-0.1010.423
LIQE-0.2320.0530.1750.2990.1070.479
CLIP-IQA-0.1520.2110.2380.2930.0710.530
Q-Align0.2300.3010.3370.2130.1780.418
DeQA-Score0.5680.6760.6230.5160.3500.747

FGResQ

Based on FGRestore, we propose FGResQ, a new fine-grained image quality assessment model for perceptual image restoration evaluation. FGResQ consists of two main components: (a) Degradation-aware Feature Learning that incorporates restoration task knowledge to enable unified evaluation across multiple IR tasks, and (b) Dual-branch Quality Prediction that simultaneously handles both coarse-grained score regression and fine-grained pairwise ranking.

FGResQ Framework

Experiments

Performance comparison on FGRestore dataset across different IR tasks. "-" indicates no reference images available.

Type Method Pub. Deblurring Denoising Deraining Dehazing MixtureRestoration SuperResolution Average
SRCCPLCCACC SRCCPLCCACC SRCCPLCCACC SRCCPLCCACC SRCCPLCCACC SRCCPLCCACC SRCCPLCCACC
FRPSNR-0.1870.1670.6340.4870.4820.775------0.2800.2480.6430.2960.3030.6240.3130.3000.669
SSIMTIP'040.4410.3480.6950.6420.6520.789------0.4210.3790.6840.3510.3610.6410.4640.4350.702
LPIPSCVPR'180.7760.7000.7550.6730.6800.765------0.4660.4750.6580.4480.4600.6660.5910.5790.711
DISTSTPAMI'200.9070.9010.8450.6790.6720.739------0.4950.4640.6440.4820.4880.6580.6400.6310.721
NRNIQESPL'120.3820.4100.5290.2400.1510.5640.0300.0570.6940.0360.0630.4460.0410.0270.5410.1760.0720.5030.1510.1300.546
BRISQUETIP'120.3540.3540.5550.2540.1140.5690.1120.1020.6030.1320.0880.4360.2010.0640.5750.1420.0400.5140.1990.1270.542
DB-CNNTCSVT'220.7880.7860.6880.4780.4310.6110.2430.2590.4370.6430.6450.5240.4150.4600.6140.4590.4540.6430.5040.5060.586
HyperIQACVPR'250.8710.8870.4020.6050.6250.6750.2640.2940.4840.6430.6740.4090.5230.5350.4990.4370.4330.5380.5570.5740.501
CLIP-IQAAAAI'230.8670.7850.7650.4740.4400.6250.2410.2210.3490.5470.4990.5460.3020.3000.5800.2440.1860.5710.4460.4050.573
Q-AlignICML'240.7670.8040.7950.6760.6870.7310.4330.4210.4550.7150.7650.5840.5690.5710.6580.3760.3660.6620.5890.6030.648
DeQA-ScoreCVPR'250.8150.8430.8190.7540.7710.7780.5070.5760.4260.7620.8030.6440.6690.6790.6970.5610.5730.7180.6780.7070.680
Compare2ScoreNeurIPS'240.7690.8130.7570.6610.6790.6870.0740.1080.7900.3340.3810.4360.4940.5190.6350.3170.3150.6070.4410.4690.652
FGResQ-0.9260.9100.8730.7590.7770.7600.4960.5180.7780.8210.8540.6690.6980.7060.7130.5210.5360.7210.7030.7170.752

Underline indicates runner-up, bold indicates best.

Fine-grained Examples

Qualitative Analysis

BibTeX

@article{sheng2025fg,
  title={Fine-grained Image Quality Assessment for Perceptual Image Restoration},
  author={Sheng, Xiangfei and Pan, Xiaofeng and Yang, Zhichao and Chen, Pengfei and Li, Leida},
  journal={arXiv preprint arXiv:2508.14475},
  year={2025}
}