Closed
Description
Hi @roymiles
Congratulations for the paper acceptance!
For the ImageNet experiment, self.embed
in OurDistillationLoss
class looks like a random projector for ResNet-18's embeddings and seems not updated as it's not included in optimizer. Is it intentional? If so, why is it required?
https://github.com/roymiles/Simple-Recipe-Distillation/blob/main/imagenet/torchdistill/losses/single.py#L140
Metadata
Metadata
Assignees
Labels
No labels