Text this: Entropy and type-token ratio in gigaword corpora