CCAP: Cooperative Context Aware Pruning for Neural Network Model Compression
2021 IEEE International Symposium on Multimedia (ISM)
In this paper, we propose a new cross-domain model compression technique to yield a compact target model. We utilize a Cooperative Context-Aware Pruning (CCAP) module to produce sparse attention maps. They are then used to transmit the source models’ parameters to the target model precisely. We also leverage a weight-regular loss to minimize the difference between the source models’ and the target models’ parameters. Our quantitatively empirical evaluation shows that our CCAP module plus the weight-regular loss achieves lower model complexity without having serious performance decreasing.
Copyright IEEE 2021
Locate the Document
Wang, L. Y., & Akhtar, Z. (2021, November). CCAP: Cooperative Context Aware Pruning for Neural Network Model Compression. In 2021 IEEE International Symposium on Multimedia (ISM) (pp. 257-260). IEEE.