Multi-Stage Video Analysis Framework