Image Quality Estimation

Images and videos are ubiquitous today. The share of bits representing visual signals is huge and ever growing. Storage and transmission of these signals require bit rate reductions through compression. However, with decreasing bit rate, distortions that are visible for the human eye are introduced into the signal. In order to automatically evaluate and optimize the performance of compression and transmission systems, a metric for image or video quality is necessary. As humans are typically the ultimate receiver of visual signals, it is crucial for such a metric to relate to human visual perception and to predict the visual distortion perceived by humans reliably.

In our research we investigate and develop computational models to reliably predict visual quality as perceived by humans. Methodologically, we work with traditional signal processing techniques, but we also pioneered the usage modern deep learning approaches in the visual quality domain. Our past successes for instance include the development of a convolutional neural network that jointly learns the relative importance of local and global visual image quality or the conceptual identification, introduction and mathematical modeling of distortion sensitivity which bridges from psychophysically motivated human visual sensitivity to natural image statistics.

Our approaches were shown to be highly feasible to optimize visual computing systems. When incorporated into modern video compression schemes such as VVC for bit allocation, our methods and models can reduce the bit rate by 16% over conventional bit allocation techniques.

Publications

S. Becker, T. Wiegand, S. Bosse, “Curiously Effective Features for Image Quality Prediction”, Proceedings of the IEEE International Conference on Image Processing (ICIP), Anchorage, Alaska, USA, September 2021. (to appear)
A. Perkis, C. Timmerer, S. Barakovic, J. Barakovic Husic, S. Bech, S. Bosse, J. Botev, K. Brunnström, L. da Silva Cruz, K. Moor, A. Saibanti, W. Durnez, S. Egger-Lampl, U. Engelke, T. Falk, A. Hameed, A. Hines, T. Kojic, D. Kukolj, S. Zadtootaghaj, “QUALINET White Paper on Definitions of Immersive Media Experience (IMEx)”, arXiv preprint arXiv:2007.07032, June 2020
M. Utke, S. Zadtootaghaj, S. Schmidt, S. Bosse, S. Möller, “NDNetGaming-development of a no-reference deep CNN for gaming video quality prediction”, Multimedia Tools and Applications, July 2020
C. R. Helmrich, M. Siekmann, S. Becker, S. Bosse, D. Marpe, T. Wiegand, “XPSNR: A Low-Complexity Extension of The Perceptually Weighted Peak Signal-To-Noise Ratio For High-Resolution Video Quality Assessment”, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2727-2731, May 2020
S. Becker, K.-R. Müller, T. Wiegand, S. Bosse, “A neural network model of spatial distortion sensitivity for video quality estimation”, Proceedings of the 29th IEEE International Workshop on Machine Learning for Signal Processing (MLSP), pp. 1-8, Pittsburgh, PA, USA, October 201
S. Bosse, M. Dietzel, S. Becker, C. R. Helmrich, M. Siekmann, H. Schwarz, D. Marpe, T. Wiegand, “Neural Network Guided Perceptually Optimized Bit-Allocation for Block-Based Image and Video Compression”, Proceedings of the IEEE International Conference on Image Processing (ICIP), pp. 126-130, Taipei, Taiwan, September 2019
J. Erfurt, C.R. Helmrich, S. Bosse, H. Schwarz, D. Marpe, T. Wiegand, “A study of the perceptually weighted peak signal-to-noise ratio (WPSNR) for image compression”, Proceedings of the IEEE International Conference on Image Processing (ICIP), pp. 2339-2343, Taipei, Taiwan, September 2019
S. Bosse, S. Becker, K.-R. Müller, W. Samek, T. Wiegand, “Estimation of distortion sensitivity for visual quality prediction using a convolutional neural network”, Digital Signal Processing, vol. 91, pp. 54-65, 2019
C.R. Helmrich, S. Bosse, M. Siekmann, H. Schwarz, D. Marpe, T. Wiegand, “Perceptually optimized bit-allocation and associated distortion measure for block-based image or video coding”, Proceedings of the Data Compression Conference (DCC), pp. 172-181, Snowbird, Utah, USA, 2019
S. Bosse, S. Becker, Z.V. Fisches, W. Samek, T. Wiegand, “Neural network-based estimation of distortion sensitivity for image quality prediction”, Proceedings of the IEEE International Conference on Image Processing (ICIP), pp. 629-633, Athens, Greece, October 2018
R. Reisenhofer, S. Bosse, G. Kutyniok, T. Wiegand, “A Haar wavelet-based perceptual similarity index for image quality assessment”, Signal Processing: Image Communication, vol. 61, pp. 33-43, 2018
S. Bosse, D. Maniry, K.-R. Müller, T. Wiegand, and W. Samek, "Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment", IEEE Transactions on Image Processing, September 2017.
S. Bosse, M. Siekmann, W. Samek and T. Wiegand, "A perceptually relevant shearlet-based adaptation of the PSNR," 2017 IEEE International Conference on Image Processing (ICIP), Bejing, China, pp. 315-319, September 2017.
S. Bosse, D. Maniry, T. Wiegand, and W. Samek, "A Deep Neural Network for Image Quality Assessment", Proceedings of the IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, pp. 3773-3777, September 2016.
S. Bosse, Q. Chen, M. Siekmann, W. Samek and T. Wiegand, "Shearlet-based reduced reference image quality assessment," 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, pp. 2052-2056, September 2016.
S. Bosse, D. Maniry, K.-R. Müller, T. Wiegand, and W. Samek, "Neural Network-Based Full- Reference Image Quality Assessment", Proceedings of the Picture Coding Symposium (PCS), Nürnberg, Germany, pp. 1-5, December 2016.
A. S. Dias, M. Siekmann, S. Bosse, H. Schwarz, D. Marpe and M. Mrak, "Rate-distortion optimised quantisation for HEVC using spatial just noticeable distortion," 2015 23rd European Signal Processing Conference (EUSIPCO), Nice, Italy, pp. 110-114, September 2015