AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression

Published in ACL, 2023

Recommended citation: Siyue Wu, Hongzhan Chen, Xiaojun Quan, Qifan Wang, Rui Wang. (2023). "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression." ACL 2023.
Download Paper