Abstract
In this paper we describe the iTACOS submission to the Stance and Gender Detection in Tweets on Catalan Independence shared task. In stance detection, we ranked first in both languages, outperforming the baselines; in gender detection, we ranked fourth for Catalan and third for Spanish. Our approach is based on three diverse groups of features: stylistic, structural and context-based. We introduced two novel features that exploit significant characteristics conveyed by the presence of Twitter marks and URLs. The results of our experiments are promising and will lead to future tailoring of these two features in a finer-grained manner.
| Original language | English |
|---|---|
| Pages (from-to) | 185-192 |
| Number of pages | 8 |
| Journal | CEUR Workshop Proceedings |
| Volume | 1881 |
| Publication status | Published - 2017 |
| Published externally | Yes |
| Event | 2nd Workshop on Evaluation of Human Language Technologies for Iberian Languages, IberEval 2017 - Murcia, Spain. Duration: 19 Sep 2017 → … |