Corpus available
The Swiss WhatsApp corpus is now available as an open access resource with more than 5 mio tokens in all four national languages of Switzerland. You find the documentation and the access to the corpus here.
Latest publication
Ruzsics, Tatiana; Lusetti, Massimo; Göhring, Anne; Samardžić, Tanja; Stark, Elisabeth (2019): Neural Text Normalization with Adapted Decoding and PoS Features. Natural Language Engineering.
Ueberwasser, Simone/Stark, Elisabeth (2017). What’s up, Switzerland? A corpus-based research project in a multilingual country. Linguistik online 84/5, 105-126.
Upcoming talks