
Critical Analysis on the Reproducibility of Visual Quality Assessment Using Deep Features

F. Götz-Hahn, V. Hosu, and D. Saupe.
PLoS ONE, 17(8), 2022. Article Number: e0269715.
DOI: 10.1371/journal.pone.0269715

Abstract

Data used to train supervised machine learning models are commonly split into independent training, validation, and test sets. This paper illustrates that complex data leakage cases have occurred in the no-reference image and video quality assessment literature. Recently, papers in several journals reported performance results well above the best in the field. However, our analysis shows that information from the test set was inappropriately used in the training process in different ways and that the claimed performance results cannot be achieved. When correcting for the data leakage, the performances of the approaches drop even below the state-of-the-art by a large margin. Additionally, we investigate end-to-end variations to the discussed approaches, which do not improve upon the original.

BibTeX key: GotzHahn2022Criti-59105
entry type: article
year: 2022
journal: PLoS ONE
number: 8
volume: 17
DOI: 10.1371/journal.pone.0269715
url: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0269715
note: Article Number: e0269715
