Tviterasi, tviteraši or twitteraši? Producing and analysing a normalised dataset of Croatian and Serbian tweets

In this paper we discuss the parallel manual normalisation of samples extracted from Croatian and Serbian Twitter corpora. We describe the datasets, outline the unified guidelines provided to annotators, and present a series of analyses of standard-to-non-standard transformations found in the Twitte...

Full description

Saved in:
Bibliographic Details
Main Authors: Maja Miličević, Nikola Ljubešić
Format: Article
Language:English
Published: University of Ljubljana Press (Založba Univerze v Ljubljani) 2016-09-01
Series:Slovenščina 2.0: Empirične, aplikativne in interdisciplinarne raziskave
Subjects:
Online Access:https://journals.uni-lj.si/slovenscina2/article/view/7007
Tags: Add Tag
No Tags, Be the first to tag this record!