Text this: Modelling and Understanding of Speech and Speaker Recognition