Text this: Comparative analysis of audio-MAE and MAE-AST models for real-time audio classification