Publication:
Learning ordered pooling weights in image classification

Consultable a partir de

2022-10-21

Date

2020

Director

Publisher

Elsevier
Acceso abierto / Sarbide irekia
Artículo / Artikulua
Versión aceptada / Onetsi den bertsioa

Project identifier

ES/1PE/TIN2016-77356-P

Abstract

Spatial pooling is an important step in computer vision systems like Convolutional Neural Networks or the Bag-of-Words method. The spatial pooling purpose is to combine neighbouring descriptors to obtain a single descriptor for a given region (local or global). The resultant combined vector must be as discriminant as possible, in other words, must contain relevant information, while removing irrelevant and confusing details. Maximum and average are the most common aggregation functions used in the pooling step. To improve the aggregation of relevant information without degrading their discriminative power for image classification, we introduce a simple but effective scheme based on Ordered Weighted Average (OWA) aggregation operators. We present a method to learn the weights of the OWA aggregation operator in a Bag-of-Words framework and in Convolutional Neural Networks, and provide an extensive evaluation showing that OWA based pooling outperforms classical aggregation operators.

Keywords

Pooling, Ordered weighted aggregation, Image classification, Bag-of-words, Mid-level features, Convolutional neural networks, Global pooling

Department

Estadística, Informática y Matemáticas / Estatistika, Informatika eta Matematika

Faculty/School

Degree

Doctorate program

Editor version

Funding entities

This work is partially supported by the research services of Universidad Pública de Navarra and by the project TIN2016-77356-P (AEI/FEDER, UE).

© 2020 Elsevier B.V. This manuscript version is made available under the CC-BY-NC-ND 4.1

Los documentos de Academica-e están protegidos por derechos de autor con todos los derechos reservados, a no ser que se indique lo contrario.