Abstract
The paper describes the design and development of a textual corpus representative of the historical varieties of Italian during the Great War, enriched with linguistic (lemmatization and POS tagging) and meta-linguistic annotation.
After a manual revision of the linguistic annotation, the corpus was used to specialize existing NLP tools for processing historical texts, with promising results.
Original language | English |
---|---|
Pages | 160-164 |
Number of pages | 5 |
DOI | |
Publication status | Published - 2018 |
Event | Fifth Italian Conference on Computational Linguistics (CLiC-it 2018) - Torino, Università di Torino. Duration: 1 Jan 2018 → … |
Conference
Conference | Fifth Italian Conference on Computational Linguistics (CLiC-it 2018) |
---|---|
City | Torino, Università di Torino |
Period | 1/01/18 → … |
Keywords
- Great War
- historical corpus
- Italian
- Universal Dependencies