Leveraging Bird Eye View Video and Multimodal Large Language Models for Real-Time Intersection Control and Reasoning