Text this: A Multi-Modal Panoramic Attentional Model for Robots and Applications