tinyCLAP: distilling language-audio pretrained models