Efficient Object Pixel-Level Categorization Using Bag of Features

Aldavert, David; Ramisa, Arnau; Toledo, Ricardo; Lopez de Mantaras, Ramon

doi:10.1007/978-3-642-10331-5_5

David Aldavert³⁰,
Arnau Ramisa³¹,
Ricardo Toledo³⁰ &
…
Ramon Lopez de Mantaras³¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5875))

Included in the following conference series:

International Symposium on Visual Computing

1622 Accesses
3 Citations

Abstract

In this paper we present a pixel-level object categorization method suitable to be applied under real-time constraints. Since pixels are categorized using a bag of features scheme, the major bottleneck of such an approach would be the feature pooling in local histograms of visual words. Therefore, we propose to bypass this time-consuming step and directly obtain the score from a linear Support Vector Machine classifier. This is achieved by creating an integral image of the components of the SVM which can readily obtain the classification score for any image sub-window with only 10 additions and 2 products, regardless of its size. Besides, we evaluated the performance of two efficient feature quantization methods: the Hierarchical K-Means and the Extremely Randomized Forest. All experiments have been done in the Graz02 database, showing comparable, or even better results to related work with a lower computational cost.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. Journal of Computer Vision 60, 91–110 (2004)
Article Google Scholar
Csurka, G., Bray, C., Dance, C., Fan, L.: Visual categorization with bags of keypoints. In: Workshop on Stat. Learning in Computer Vision, ECCV, pp. 1–22 (2004)
Google Scholar
Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: Proc. of Computer Vision and Pattern Recognition, pp. 2161–2168 (2006)
Google Scholar
Fulkerson, B., Vedaldi, A., Soatto, S.: Localizing objects with smart dictionaries. In: Proc. of European Conference on Computer Vision, pp. 179–192 (2008)
Google Scholar
Sastre, R., Tuytelaars, T., Bascon, S.: Class representative visual words for category-level object recognition. In: IbPRIA 2009: Proceedings of the 4th Iberian Conference on Pattern Recognition and Image Analysis, pp. 184–191 (2009)
Google Scholar
Moosmann, F., Nowak, E., Jurie, F.: Randomized clustering forests for image classification. IEEE Trans. on Pat. Anal. and Machine Intel. 30, 1632–1646 (2008)
Article Google Scholar
Shotton, J., Johnson, M., Cipolla, R., Center, T., Kawasaki, J.: Semantic texton forests for image categorization and segmentation. In: Proc. of Computer Vision and Pattern Recognition, pp. 1–8 (2008)
Google Scholar
Ramisa, A.: Localization and Object Recognition for Mobile Robots. PhD thesis, Universitat Autonoma de Barcelona (2009)
Google Scholar
Lampert, C.H., Blaschko, M.B., Hofmann, T.: Beyond sliding windows: Object localization by efficient subwindow search. In: Proc. of Computer Vision and Pattern Recognition, pp. 1–8 (2008)
Google Scholar
Nowak, E., Jurie, F., Triggs, B.: Sampling strategies for bag-of-features image classification. In: Proc. of European Conference on Computer Vision, pp. 490–503 (2006)
Google Scholar
Opelt, A., Pinz, A., Fussenegger, M., Auer, P.: Generic object recognition with boosting. IEEE Trans. on Pat. Anal. and Machine Intel. 28(3), 416–431 (2006)
Article Google Scholar
Zhu, Q., Yeh, M.C., Cheng, K.T., Avidan, S.: Fast human detection using a cascade of histograms of oriented gradients. In: Proc. of Computer Vision and Pattern Recognition, pp. 1491–1498 (2006)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proc. of Computer Vision and Pattern Recognition, pp. 886–893 (2005)
Google Scholar
Zhang, J., Marszalek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: A comprehensive study. Int. Journal of Computer Vision 73, 213–238 (2007)
Article Google Scholar
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proc. of Computer Vision and Pattern Recognition, pp. 511–518 (2001)
Google Scholar
Bay, H., Ess, A., Tuytelaars, T., Gool, L.V.: Surf: Speeded up robust features. Computer Vision and Image Understanding (CVIU) 110, 346–359 (2008)
Article Google Scholar
Porikli, F.: Integral histogram: A fast way to extract histograms in cartesian spaces. In: Proc. of Computer Vision and Pattern Recognition, pp. 829–836 (2005)
Google Scholar
Marszalek, M., Schmid, C.: Accurate object localization with shape masks. In: Proc. of Computer Vision and Pattern Recognition, pp. 1–8 (2007)
Google Scholar
Lin, C.-J., Weng, R.C., Keerthi, S.S.: Trust region newton methods for large-scale logistic regression. In: Int. Conf. on Machine Learning, pp. 561–568 (2007)
Google Scholar
Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., Lin, C.-J.: Liblinear: A library for large linear classification. J. Mach. Learn. Res. 9, 1871–1874 (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Vision Center (CVC), Dept. Ciències de la Computació, Universitat Autònoma de Barcelona (UAB), 08193, Bellaterra, Spain
David Aldavert & Ricardo Toledo
Artificial Intelligence Research Institute (IIIA-CSIC), Campus de la UAB, 08193, Bellaterra, Spain
Arnau Ramisa & Ramon Lopez de Mantaras

Authors

David Aldavert
View author publications
You can also search for this author in PubMed Google Scholar
Arnau Ramisa
View author publications
You can also search for this author in PubMed Google Scholar
Ricardo Toledo
View author publications
You can also search for this author in PubMed Google Scholar
Ramon Lopez de Mantaras
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, University of Nevada, Reno, USA
George Bebis
NASA Ames Research Center, Moffett Field, CA, USA
Richard Boyle
Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Bahram Parvin
Desert Research Institute, Reno, NV, USA
Darko Koracin
Graduate School of Science and Engineering, Saitama University, 255 Shimo-Okubo, Sakura-ku, Saitama-shi, 338-8570, Saitama, Japan
Yoshinori Kuno
Institute for Infocomm Research, 21 Heng Mui Keng Terrace, P.O. Box, 119613, Singapore
Junxian Wang
Department of Computer Science & Information Engineering, Tamkang University, Tamsui, Taipei, Taiwan, R.O.C.
Jun-Xuan Wang
Microsoft Research, Redmond, WA, USA
Junxian Wang
Department of Informatics, Univ. of Zurich, Winterthurerstr. 190, P.O. Box, 8057, Zurich, Switzerland
Renato Pajarola
Lawrence Livermore National Laboratory, 94550, Livermore, CA, USA
Peter Lindstrom
University of Applied Sciences Bonn-Rhein-Sieg, 53754, Sankt Augustin, Germany
André Hinkenjann
Humana Inc., 40202, Louisville, KY, USA
Miguel L. Encarnação
SCI Institute & School of Computing, University of Utah, 84112, Salt Lake City, UT, USA
Cláudio T. Silva
Desert Research Institute, 89512, Reno, NV, USA
Daniel Coming

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Aldavert, D., Ramisa, A., Toledo, R., Lopez de Mantaras, R. (2009). Efficient Object Pixel-Level Categorization Using Bag of Features. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2009. Lecture Notes in Computer Science, vol 5875. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10331-5_5

Download citation

DOI: https://doi.org/10.1007/978-3-642-10331-5_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10330-8
Online ISBN: 978-3-642-10331-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics