Automatic Detection and Tracking of Objects of Interest in Video Data with Global Motion