USP Electronic Research Repository

Global-first training strategy with Convolutional Neural Networks to Improve Scale Invariance

Kumar, Dinesh and Sharma, Dharmendra P. (2022) Global-first training strategy with Convolutional Neural Networks to Improve Scale Invariance. In: Communications in Computer and Information Science. Springer Nature, Switzerland. ISBN TBC

Full text not available from this repository. (Request a copy)

Abstract

Modelled closely on the feedforward conical structure of the primate vision system - Convolutional Neural Networks (CNNs) learn by adopting a local to global feature extraction strategy. This makes them view-specific models and results in poor invariance encoding within its learnt weights to adequately identify objects whose appearance is altered by various transformations such as rotations, translations, and scale. Recent physiological studies reveal the visual system first views the scene globally for subsequent processing in its ventral stream leading to a global-first response strategy in its recognition function. Conventional CNNs generally use small filters, thus losing the global view of the image. A trainable module proposed by Kumar & Sharma [24] called Stacked Filters Convolution (SFC) models this approach by using a pyramid of large multi-scale filters to extract features from wider areas of the image, which is then trained by a normal CNN. The end-to-end model is referred to as Stacked Filter CNN (SFCNN). In addition to improved test results, SFCNN showed promising results on scale invariance classification. The experiments, however, were performed on small resolution datasets and small CNN as backbone. In this paper, we extend this work and test SFC integrated with the VGG16 network on larger resolution datasets for scale invariance classification. Our results confirm the integration of SFC, and standard CNN also shows promising results on scale invariance on large resolution datasets.

Item Type: Book Chapter
Uncontrolled Keywords: Convolutional Neural Network, Feature Map, Filter Pyramid, Global Feature, Scale Invariance, Visual System
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
Divisions: Faculty of Science, Technology and Environment (FSTE) > School of Computing, Information and Mathematical Sciences
Depositing User: Dinesh Kumar
Date Deposited: 28 Nov 2022 07:02
Last Modified: 28 Nov 2022 07:02
URI: https://repository.usp.ac.fj/id/eprint/13830

Actions (login required)

View Item View Item