Wavelet-Based Feature Extraction for Efficient High-Resolution Image Classification
Albert Dede, Henry Nunoo-Mensah, Emmanuel Kofi Akowuah, Kwame Osei Boateng, Prince Ebenezer Adjei, Francisca Adoma Acheampong, Isaac Acquah, Jerry John Kponyo
Convolutional neural networks (CNNs) typically compress high-resolution images to minimize computational requirements. However, this can lead to loss of information and reduced accuracy in classification tasks. This paper introduces WaveNet, a novel approach for processing high-resolution images using wavelet-domain inputs in CNNs. We address the challenge of maintaining classification accuracy with high-resolution inputs while minimizing computational complexity. Our method employs wavelet packet transform (WPT) for image pre-processing, extracting detailed multi-scale and directional information from high-resolution images. We propose a wavelet-adaptive efficient channel attention (WAECA) module to dynamically select the most informative wavelet subbands. Popular CNN architectures like ResNet-50 and MobileNetV2 are adapted by replacing their initial convolutional layers with wavelet-transformed inputs, enabling direct learning in the wavelet domain. Experiments conducted on the Caltech-256 and ALOT datasets demonstrate that WaveNet improves classification accuracy while reducing computational complexity. For instance, our wavelet-enhanced ResNet-50 achieves a Top-1 accuracy of 72.47% on Caltech-256, outperforming the baseline (70.65%) while reducing FLOPs from 16.52G to 3.98G. Similar improvements are observed across different architectures and datasets. We also evaluate various wavelet filters and ResNet backbones, finding that the bior1.1 filter and ResNet-50 provide optimal performance. This work presents a practical solution for developing more accurate and efficient models for high-resolution inputs without extensive computational resources or complex architectural modifications.