Dynamic Knowledge Distillation for Pre-trained Language Models

Publication
EMNLP 2021