short-paper

Combining Holistic and Part-based Deep Representations for Computational Painting Categorization

Authors:
Rao Muhammad Anwer

Aalto University, Espoo, Finland

Aalto University, Espoo, Finland
View Profile

,
Fahad Shahbaz Khan

Linkoping University, Linkoping, Sweden

Linkoping University, Linkoping, Sweden
View Profile

,
Joost van de Weijer

Universitat Autonoma de Barcelona, barcelona, Spain

Universitat Autonoma de Barcelona, barcelona, Spain
View Profile

,
Jorma Laaksonen

Aalto University, espoo, Finland

Aalto University, espoo, Finland
View Profile

ICMR '16: Proceedings of the 2016 ACM on International Conference on Multimedia RetrievalJune 2016Pages 339–342https://doi.org/10.1145/2911996.2912063

Published:06 June 2016Publication History

ICMR '16: Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval

Pages 339–342

ABSTRACT

Automatic analysis of visual art, such as paintings, is a challenging inter-disciplinary research problem. Conventional approaches only rely on global scene characteristics by encoding holistic information for computational painting categorization. We argue that such approaches are sub-optimal and that discriminative common visual structures provide complementary information for painting classification.

We present an approach that encodes both the global scene layout and discriminative latent common structures for computational painting categorization. The region of interests are automatically extracted, without any manual part labeling, by training class-specific deformable part-based models. Both holistic and region-of-interests are then described using multi-scale dense convolutional features. These features are pooled separately using Fisher vector encoding and concatenated afterwards in a single image representation. Experiments are performed on a challenging dataset with 91 different painters and 13 diverse painting styles. Our approach outperforms the standard method, which only employs the global scene characteristics. Furthermore, our method achieves state-of-the-art results outperforming a recent multi-scale deep features based approach by $6.4\%$ and $3.8\%$ respectively on artist and style classification.

References

L. Bourdev, S. Maji, and J. Malik. Describing people: A poselet-based approach to attribute classification. In ICCV, 2011. Google ScholarDigital Library
G. Carneiro, N. Silva, A. Bue, and J. Costeira. Artistic image classification: An analysis on the printart database. In ECCV, 2012. Google ScholarDigital Library
M. Cimpoi, S. Maji, and A. Vedaldi. Deep filter banks for texture recognition and segmentation. In CVPR, 2015.Google ScholarCross Ref
P. Felzenszwalb, R. Girshick, D. McAllester, and D. Ramanan. Object detection with discriminatively trained part-based models. PAMI, 32(9):1627--1645, 2010. Google ScholarDigital Library
Y. Gong, L. Wang, R. Guo, and S. Lazebnik. Multi-scale orderless pooling of deep convolutional activation features. In ECCV, 2014.Google ScholarCross Ref
F. S. Khan, S. Beigpour, J. van de Weijer, and M. Felsberg. Painting-91: a large scale database for computational painting categorization. MVA, 25(6):1385--1397, 2014. Google ScholarDigital Library
F. S. Khan, J. Xu, J. van de Weijer, A. Bagdanov, R. M. Anwer, and A. Lopez. Recognizing actions through action-specific person detection. TIP, 24(11):4422--4432, 2015.Google ScholarDigital Library
T. Mensink and J. Gemert. The rijksmuseum challenge: Museum-centered visual recognition. In ICMR, 2014. Google ScholarDigital Library
M. Pandey and S. Lazebnik. Scene recognition and weakly supervised object localization with deformable part-based models. In ICCV, 2011. Google ScholarDigital Library
K.-C. Peng and T. Chen. Cross-layer features in convolutional neural networks for generic classification tasks. In ICIP, 2015.Google ScholarDigital Library
K.-C. Peng and T. Chen. A framework of extracting multi-scale features using multiple convolutional neural networks. In ICME, 2015.Google ScholarCross Ref
A. Quattoni and A. Torralba. Recognizing indoor scenes. In CVPR, 2009.Google ScholarCross Ref
J. Sanchez, F. Perronnin, T. Mensink, and J. Verbeek. Image classification with the fisher vector: Theory and practice. IJCV, 105(3):222--245, 2013. Google ScholarDigital Library
K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.Google Scholar
N. Zhang, R. Farrell, F. Iandola, and T. Darrell. Deformable part descriptors for fine-grained recognition and attribute prediction. In ICCV, 2013. Google ScholarDigital Library

Index Terms

Combining Holistic and Part-based Deep Representations for Computational Painting Categorization
1. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Multimedia and multimodal retrieval
        Image search

Recommendations

Multi-camera Based Human Tracking with Non-overlapping Fields of View
ICIG '09: Proceedings of the 2009 Fifth International Conference on Image and Graphics

This paper presents approach for an automated surveillance system which performs human detection and tracking across multiple non-overlapping cameras. Emphasis is put at single camera level where motion based segmentation is achieved using optical flow ...
Read More
A statistical and computational theory for the art of painting
Read More
Multitask painting categorization by deep multibranch neural network
Highlights
- A novel deep multibranch multitask neural network architecture.
- The different ...
Abstract
We propose a novel deep multibranch and multitask neural network for artist, style, and genre painting categorization. The multibranch approach allows us to exploit at the same time the coarse layout of the painting and the fine-...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ICMR '16: Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval
June 2016
452 pages
ISBN:9781450343596
DOI:10.1145/2911996
General Chairs:
John R. Kender
Columbia University, USA
,
John R. Smith
IBM Research, USA
,
Program Chairs:
Jiebo Luo
University of Rochester, USA
,
Susanne Boll
University of Oldenburg, Germany
,
Winston Hsu
National Taiwan University, Taiwan
Copyright © 2016 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 6 June 2016
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
computer vision
image processing
Qualifiers
- short-paper
Conference

Acceptance Rates
ICMR '16 Paper Acceptance Rate20of120submissions,17%Overall Acceptance Rate254of830submissions,31%
More
Upcoming Conference
ICMR '24

Sponsor:

sigmm

International Conference on Multimedia Retrieval

June 10 - 14, 2024

Phuket , Thailand
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 11
  Total Citations
  View Citations
- 256
  Total Downloads
- Downloads (Last 12 months)12
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Combining Holistic and Part-based Deep Representations for Computational Painting Categorization

ICMR '16: Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval

ABSTRACT

References

Cited By

Index Terms

Recommendations

Multi-camera Based Human Tracking with Non-overlapping Fields of View

A statistical and computational theory for the art of painting

Multitask painting categorization by deep multibranch neural network