
MIEGAN: Mobile Image Enhancement via a Multi-Module Cascade Neural Network

Journal Article


Abstract


  • The visual quality of images captured by mobile devices is often inferior to that of images captured by a digital single-lens reflex (DSLR) camera. This paper presents a novel generative adversarial network-based mobile image enhancement method, referred to as MIEGAN. It consists of a novel multi-module cascade generative network and a novel adaptive multi-scale discriminative network. The multi-module cascade generative network is built upon a two-stream encoder, a feature transformer, and a decoder. In the two-stream encoder, a luminance-regularizing stream is proposed to help the network focus on low-light areas. In the feature transformation module, two networks capture both the global and the local information of an image. To further help the generative network produce images of high visual quality, a multi-scale discriminator is used instead of a regular single discriminator to distinguish whether an image is fake or real, both globally and locally. To balance the global and local discriminators, an adaptive weight allocation is proposed. In addition, a contrast loss is proposed, and a new mixed loss function is developed to improve the visual quality of the enhanced images. Extensive experiments on the popular DSLR photo enhancement dataset and the MIT-FiveK dataset have verified the effectiveness of the proposed MIEGAN.
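
The abstract does not say how the luminance-regularizing stream identifies low-light areas. As a rough illustration of the general idea only, the sketch below computes a per-pixel weight that grows as a pixel gets darker, using the standard Rec. 601 luma coefficients. The function names `luminance` and `low_light_map` and the "one minus luma" weighting are assumptions made for this example; they are not taken from the paper and should not be read as MIEGAN's actual method.

```python
# Hypothetical illustration only; this is NOT the MIEGAN implementation.
# Assumption: "focus on low-light areas" is modelled as a weight of 1 - luma.


def luminance(rgb):
    """Rec. 601 luma of an (r, g, b) pixel with channels in [0, 1]."""
    r, g, b = rgb
    return 0.299 * r + 0.587 * g + 0.114 * b


def low_light_map(image):
    """Return per-pixel weights in [0, 1] that are larger for darker pixels.

    `image` is a list of rows; each row is a list of (r, g, b) tuples
    with channel values in [0, 1].
    """
    return [[1.0 - luminance(px) for px in row] for row in image]


# A tiny 2x2 example image: black, white, dark gray, and a saturated red.
image = [
    [(0.0, 0.0, 0.0), (1.0, 1.0, 1.0)],
    [(0.2, 0.2, 0.2), (0.8, 0.1, 0.1)],
]
weights = low_light_map(image)
```

In this toy setup the black pixel receives the largest weight and the white pixel the smallest, so a loss or attention term multiplied by `weights` would emphasize dark regions. A real network would typically learn such a map rather than fix it by hand.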

Publication Date


  • 2022

Citation


  • Pan, Z., Yuan, F., Lei, J., Li, W., Ling, N., & Kwong, S. (2022). MIEGAN: Mobile Image Enhancement via a Multi-Module Cascade Neural Network. IEEE Transactions on Multimedia, 24, 519-533. doi:10.1109/TMM.2021.3054509

Scopus Eid


  • 2-s2.0-85100486546

Start Page


  • 519

End Page


  • 533

Volume


  • 24
