Image Quality Assessment

← Back to Research


Dual-Branch Vision Transformer for Blind Image Quality Assessment

Dual-branch vision transformer for BIQA
Fig. 1. The proposed dual-branch vision transformer for blind image quality assessment (BIQA).

Blind image quality assessment (BIQA) aims to predict the perceptual quality of an image without access to a reference. We propose a dual-branch vision transformer that simultaneously considers both local distortions and global semantic information. Dual-scale features (S-Feature and L-Feature) are extracted from a ResNet-50 backbone and fed into separate transformer encoder branches. Each branch captures scale-variant local distortions through local feature embeddings, and jointly models global distortion context via content-aware IQA (CA-IQA) embeddings. The outputs of both branches are combined through feed-forward blocks to predict the final image quality score.

TABLE 1. Average SRCC results on six IQA databases. Best and second-best results are bold and underlined, respectively.
MethodSRCC
LIVECTID2013LIVECSIQLIVE MDKADID-10k
BRISQUE0.6080.6040.9390.7460.8860.528
M30.6070.6890.9510.7950.892-
FRIQUEE0.6820.6800.9400.8350.923-
CORNIA0.6290.6780.9470.6780.899-
HOSA0.6400.7350.9460.7410.913-
Le-CNN--0.956---
BIECON0.5950.7170.9610.8150.9090.623
DIQaM-NR0.6060.8350.960---
WaDIQaM-NR0.6710.7610.954--0.739
ResNet-ft0.8190.7120.9500.8760.909-
IW-CNN0.6630.8000.9630.8120.914-
DBCNN0.8510.8160.9680.9460.9270.851
HyperIQA0.8590.7970.9620.9230.8980.852
TReS0.8460.8630.9690.9220.9160.859
BIQA, M.D.-0.8350.9690.903--
RNSA0.8710.8490.9690.931-0.855
Proposed0.8620.8770.9760.9420.9350.970
TABLE 2. Average PLCC results on six IQA databases. Best and second-best results are bold and underlined, respectively.
MethodPLCC
LIVECTID2013LIVECSIQLIVE MDKADID-10k
BRISQUE0.6290.6940.9350.8290.9170.567
M30.6300.7710.9500.8390.919-
FRIQUEE0.7050.7530.9440.8740.934-
CORNIA0.6710.7680.9500.7760.921-
HOSA0.6780.8150.9470.8230.926-
Le-CNN--0.953---
BIECON0.6130.7620.9620.8230.9330.648
DIQaM-NR0.6010.8550.972---
WaDIQaM-NR0.6800.7870.963--0.752
ResNet-ft0.8490.7560.9540.9050.920-
IW-CNN0.7050.8020.9640.7910.929-
DBCNN0.8690.8650.9710.9590.8690.856
HyperIQA0.8820.8230.9660.9420.9240.845
TReS0.8770.8830.9680.9420.9210.858
BIQA, M.D.-0.8590.9780.925--
RNSA0.8830.8610.9720.959-0.859
Proposed0.8820.8940.9760.9520.9350.971

Publications

  • Se-Ho Lee and Seung-Wook Kim, “Dual-branch vision transformer for blind image quality assessment,” Journal of Visual Communication and Image Representation, vol. 94, pp. 103850, Jun. 2023. [DOI]