Long-term social media data collection at the University of Turin

  • Valerio Basile
  • , Mirko Lai
  • , Manuela Sanguinetti

Risultato della ricerca: Contributo su rivistaArticolo da conferenzapeer review

Abstract

We report on the collection of social media messages - from Twitter in particular - in the Italian language that is continuously going on since 2012 at the University of Turin. A number of smaller datasets have been extracted from the main collection and enriched with different kinds of annotations for linguistic purposes. Moreover, a few extra datasets have been collected independently and are now in the process of being merged with the main collection. We aim at making the resource available to the community to the best of our possibility, in accordance with the Terms of Service provided by the platforms where data have been gathered from.

Lingua originaleInglese
RivistaCEUR Workshop Proceedings
Volume2253
DOI
Stato di pubblicazionePubblicato - 2018
Pubblicato esternamente
Evento5th Italian Conference on Computational Linguistics, CLiC-it 2018 - Torino, Italy
Durata: 10 dic 201812 dic 2018

Fingerprint

Entra nei temi di ricerca di 'Long-term social media data collection at the University of Turin'. Insieme formano una fingerprint unica.

Cita questo