Back to Search Start Over

CORSD: Class-Oriented Relational Self Distillation

Authors :
Yu, Muzhou
Tan, Sia Huat
Wu, Kailu
Dong, Runpei
Zhang, Linfeng
Ma, Kaisheng
Publication Year :
2023

Abstract

Knowledge distillation conducts an effective model compression method while holding some limitations:(1) the feature based distillation methods only focus on distilling the feature map but are lack of transferring the relation of data examples; (2) the relational distillation methods are either limited to the handcrafted functions for relation extraction, such as L2 norm, or weak in inter- and intra- class relation modeling. Besides, the feature divergence of heterogeneous teacher-student architectures may lead to inaccurate relational knowledge transferring. In this work, we propose a novel training framework named Class-Oriented Relational Self Distillation (CORSD) to address the limitations. The trainable relation networks are designed to extract relation of structured data input, and they enable the whole model to better classify samples by transferring the relational knowledge from the deepest layer of the model to shallow layers. Besides, auxiliary classifiers are proposed to make relation networks capture class-oriented relation that benefits classification task. Experiments demonstrate that CORSD achieves remarkable improvements. Compared to baseline, 3.8%, 1.5% and 4.5% averaged accuracy boost can be observed on CIFAR100, ImageNet and CUB-200-2011, respectively.<br />Comment: 4 pages, 4 figures, accepted to ICASSP2023

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2305.00918
Document Type :
Working Paper