Text this: Data preparation in crowdsourcing for pedagogical purposes