Text this: Multimodal Knowledge Distillation for Emotion Recognition