Abstract
The paper illustrates the design and development of a textual corpus representative of the historical variants of Italian during the Great War, which was enriched with linguistic (lemmatization and pos-tagging) and meta-linguistic annotation.
The corpus, after a manual revision of the linguistic annotation, was used for
specializing existing NLP tools to process historical texts with promising results.
Original language | English |
---|---|
Pages | 160-164 |
Number of pages | 5 |
DOIs | |
Publication status | Published - 2018 |
Event | Fifth Italian Conference on Computational Linguistics (CLiC-it 2018) - Torino, Università di Torino Duration: 1 Jan 2018 → … |
Conference
Conference | Fifth Italian Conference on Computational Linguistics (CLiC-it 2018) |
---|---|
City | Torino, Università di Torino |
Period | 1/01/18 → … |
Keywords
- Great War
- historical corpus
- Italian
- Universal Dependencies