Text this: Emotion recognition and forecasting from wearable data via cluster-guided attention with cross-species pretraining