Publication:
Non-symmetric over-time pooling using pseudo-grouping functions for convolutional neural networks

Consultable a partir de

2026-07-01

Date

2024

Director

Publisher

Elsevier
Acceso embargado / Sarbidea bahitua dago
Artículo / Artikulua
Versión aceptada / Onetsi den bertsioa

Project identifier

AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-136627NB-I00/ES/
Gobierno de Navarra//1%2F0193%2F22

Abstract

Convolutional Neural Networks (CNNs) are a family of networks that have become state-of-the-art in several fields of artificial intelligence due to their ability to extract spatial features. In the context of natural language processing, they can be used to build text classification models based on textual features between words. These networks fuse local features to generate global features in their over-time pooling layers. These layers have been traditionally built using the maximum function or other symmetric functions such as the arithmetic mean. It is important to note that the order of input local features is significant (i.e. the symmetry is not an inherent characteristic of the model). While this characteristic is appropriate for image-oriented CNNs, where symmetry might make the network robust to image rigid transformations, it seems counter-productive for text processing, where the order of the words is certainly important. Our proposal is, hence, to use non-symmetric pooling operators to replace the maximum or average functions. Specifically, we propose to perform over-time pooling using pseudo-grouping functions, a family of non-symmetric aggregation operators that generalize the maximum function. We present a construction method for pseudo-grouping functions and apply different examples of this family to over-time pooling layers in text-oriented CNNs. Our proposal is tested on seven different models and six different datasets in the context of engineering applications, e.g. text classification. The results show an overall improvement of the models when using non-symmetric pseudo-grouping functions over the traditional pooling function.

Keywords

Pseudo-grouping function, Over-time pooling, Text classification, Feature fusion, Aggregation function

Department

Estadística, Informática y Matemáticas / Estatistika, Informatika eta Matematika / Institute of Smart Cities - ISC

Faculty/School

Degree

Doctorate program

Editor version

Funding entities

The authors acknowledge with thanks to their universities. Furthermore, this work was supported by the Brazilian funding agency CNPq (Brazilian Research Council) under Projects 311429/2020-3 and 200282/2022-0, by the project PID2022-136627NB-I00 founded by MCIN/AEI/10.13039/501100011033/FEDER, UE, of the Spanish Government, Project VEGA 1/0193/22 and by Tracasa Instrumental and the Immigration Policy and Justice Department of the Government of Navarre.

© 2024 Elsevier Ltd. This manuscript version is made available under the CC-BY-NC-ND 4.0

Los documentos de Academica-e están protegidos por derechos de autor con todos los derechos reservados, a no ser que se indique lo contrario.