Abstract
Aesthetic image analysis is essential for improving the performance of multimedia image retrieval systems, especially for repositories of social media and multimedia content stored on mobile devices. This paper presents a novel method for classifying aesthetic natural scene images that studies the naturalness of image content using statistical features and reads text in the images using semantic features. Unlike existing methods, which focus only on image quality and rely on human-provided information, the proposed approach combines image features with text-based semantic features and requires no human intervention, which reduces the gap between subjectivity and objectivity in the classification. Four aesthetic classes are considered: (i) Very Pleasant, (ii) Pleasant, (iii) Normal and (iv) Unpleasant. Naturalness is represented by features of focus, defocus, perceived brightness, perceived contrast, blurriness and noisiness, while semantics are represented by text recognition, descriptions of the images, and labels of images, profile pictures and banner images. A deep learning model is proposed to fuse the statistical and semantic features in a novel way for the classification of aesthetic natural scene images. Experiments show that the proposed approach achieves average classification rates of 92.74%, 88.67% and 83.22% on our own dataset, the AVA dataset and the CUHKPQ dataset, respectively. A comparative study with existing methods shows that the proposed method is effective for the classification of aesthetic social media images.
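The abstract does not define the statistical features, so the following is only a minimal sketch of how three of the named naturalness cues (perceived brightness, perceived contrast and blurriness) are commonly approximated. The Rec. 601 luma weights, the RMS contrast and the variance-of-Laplacian sharpness score are assumptions chosen for illustration, not the authors' formulation, and all function names are hypothetical.

```python
from statistics import mean, pstdev, pvariance


def to_gray(image):
    """Convert rows of (R, G, B) pixels (0-255) to Rec. 601 luma values.

    Assumption: luma is used as a stand-in for perceived intensity.
    """
    return [[0.299 * r + 0.587 * g + 0.114 * b for (r, g, b) in row]
            for row in image]


def perceived_brightness(gray):
    """Mean luma scaled to [0, 1]."""
    return mean([v for row in gray for v in row]) / 255.0


def perceived_contrast(gray):
    """RMS contrast: population standard deviation of luma, scaled by 255."""
    return pstdev([v for row in gray for v in row]) / 255.0


def laplacian_variance(gray):
    """Variance of the 4-neighbour Laplacian over interior pixels.

    Low values suggest a blurry image. Requires at least a 3x3 image.
    """
    h, w = len(gray), len(gray[0])
    responses = [
        gray[y - 1][x] + gray[y + 1][x] + gray[y][x - 1] + gray[y][x + 1]
        - 4 * gray[y][x]
        for y in range(1, h - 1)
        for x in range(1, w - 1)
    ]
    return pvariance(responses)


def statistical_features(image):
    """Collect the illustrative features into one dictionary."""
    gray = to_gray(image)
    return {
        "brightness": perceived_brightness(gray),
        "contrast": perceived_contrast(gray),
        "sharpness": laplacian_variance(gray),
    }


# A flat grey patch and a black/white checkerboard as toy inputs.
flat = [[(128, 128, 128)] * 4 for _ in range(4)]
checker = [[(255, 255, 255) if (x + y) % 2 == 0 else (0, 0, 0)
            for x in range(4)] for y in range(4)]

flat_features = statistical_features(flat)
checker_features = statistical_features(checker)
```

In a pipeline like the one described, such a feature vector would be computed per image and passed, together with the semantic features, to the fusion model. In the toy inputs above, the checkerboard yields a much higher `sharpness` and `contrast` than the flat patch, as expected for these measures.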
Acknowledgments
Palaiahnakote Shivakumara received partial support for this work from the Faculty Grant: GPF096A-2020, GPF096B-2020 and GPF096C-2020, University of Malaya, Malaysia. This work is also partly supported by TIH, ISI.
Ethics declarations
Conflicts of interests/competing interests
The authors have no conflicts of interest to declare that are relevant to the content of this article.
About this article
Cite this article
Biswas, K., Shivakumara, P., Pal, U. et al. Classification of aesthetic natural scene images using statistical and semantic features. Multimed Tools Appl 82, 13507–13532 (2023). https://doi.org/10.1007/s11042-022-13924-7
DOI: https://doi.org/10.1007/s11042-022-13924-7