CAG-MoE: Multimodal Emotion Recognition with Cross-Attention Gated Mixture of Experts