Abstract
We report on the collection of social media messages - from Twitter in particular - in the Italian language that is continuously going on since 2012 at the University of Turin. A number of smaller datasets have been extracted from the main collection and enriched with different kinds of annotations for linguistic purposes. Moreover, a few extra datasets have been collected independently and are now in the process of being merged with the main collection. We aim at making the resource available to the community to the best of our possibility, in accordance with the Terms of Service provided by the platforms where data have been gathered from.
| Lingua originale | Inglese |
|---|---|
| Rivista | CEUR Workshop Proceedings |
| Volume | 2253 |
| DOI | |
| Stato di pubblicazione | Pubblicato - 2018 |
| Pubblicato esternamente | Sì |
| Evento | 5th Italian Conference on Computational Linguistics, CLiC-it 2018 - Torino, Italy Durata: 10 dic 2018 → 12 dic 2018 |
Fingerprint
Entra nei temi di ricerca di 'Long-term social media data collection at the University of Turin'. Insieme formano una fingerprint unica.Cita questo
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver