Generalizing max pooling via (a, b)-grouping functions for convolutional neural networks

Rodríguez Martínez, Iosu; Da Cruz Asmus, Tiago; Pereira Dimuro, Graçaliz; Herrera, Francisco; Takáč, Zdenko; Bustince Sola, Humberto

Generalizing max pooling via (a, b)-grouping functions for convolutional neural networks

Files

Rodriguez_GeneralizingMax.pdf (1.02 MB)

Date

2023

Authors

Rodríguez Martínez, Iosu

Da Cruz Asmus, Tiago

Pereira Dimuro, Graçaliz

Herrera, Francisco

Takáč, Zdenko

Bustince Sola, Humberto

Publisher

Elsevier

Acceso abierto / Sarbide irekia

Artículo / Artikulua

Versión publicada / Argitaratu den bertsioa

Project identifier

AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-108392GB-I00/ES/
Gobierno de Navarra/

Impacto

8

No disponible en Scopus

Abstract

Due to their high adaptability to varied settings and effective optimization algorithm, Convolutional Neural Networks (CNNs) have set the state-of-the-art on image processing jobs for the previous decade. CNNs work in a sequential fashion, alternating between extracting significant features from an input image and aggregating these features locally through ‘‘pooling" functions, in order to produce a more compact representation. Functions like the arithmetic mean or, more typically, the maximum are commonly used to perform this downsampling operation. Despite the fact that many studies have been devoted to the development of alternative pooling algorithms, in practice, ‘‘max-pooling" still equals or exceeds most of these possibilities, and has become the standard for CNN construction. In this paper we focus on the properties that make the maximum such an efficient solution in the context of CNN feature downsampling and propose its replacement by grouping functions, a family of functions that share those desirable properties. In order to adapt these functions to the context of CNNs, we present (𝑎, 𝑏)- grouping functions, an extension of grouping functions to work with real valued data. We present different construction methods for (𝑎, 𝑏)-grouping functions, and demonstrate their empirical applicability for replacing max-pooling by using them to replace the pooling function of many well-known CNN architectures, finding promising results.

Description

Versión resumida en castellano: https://academica-e.unavarra.es/handle/2454/53324

Keywords

Convolutional neural networks, Grouping functions, Pooling functions, Image classification

Department

Estadística, Informática y Matemáticas / Estatistika, Informatika eta Matematika / Institute of Smart Cities - ISC

URI

https://academica-e.unavarra.es/handle/2454/46368

https://doi.org/10.1016/j.inffus.2023.101893

item.page.cita

Rodríguez-Corbo, F. A., Celaya-Echarri, M., Shubair, R. M., Falcone, F., & Azpilicueta, L. (2023). An enhanced approach to virtually increase quasi-stationarity regions within geometric channel models for vehicular communications. IEEE Antennas and Wireless Propagation Letters, 22(9), 2180-2184. https://doi.org/10.1109/LAWP.2023.3281081

item.page.rights

Collections

Artículos de revista DEIM - EIMS Aldizkari artikuluak
Artículos de revista - Aldizkari artikuluak
Artículos de revista ISC - ISC aldizkari artikuluak

Full item page

Generalizing max pooling via (a, b)-grouping functions for convolutional neural networks

Files

Date

Authors

Director

Publisher

Project identifier

Impacto

Abstract

Description

Keywords

Department

Faculty/School

Degree

Doctorate program

URI

item.page.cita

item.page.rights

Collections