Back to Search
Start Over
When silver glitters more than gold: Bootstrapping an Italian part-of-speech tagger for Twitter
- Source :
- EVALITA 2016, CLiC 2016, Scopus-Elsevier, CLiC-it/EVALITA
- Publication Year :
- 2016
-
Abstract
- We bootstrap a state-of-the-art part-of-speech tagger to tag Italian Twitter data, in the context of the Evalita 2016 PoSTWITA shared task. We show that training the tagger on native Twitter data enriched with little amounts of specifically selected gold data and additional silver-labelled data scraped from Facebook, yields better results than using large amounts of manually annotated data from a mix of genres.<br />Proceedings of the 5th Evaluation Campaign of Natural Language Processing and Speech Tools for Italian (EVALITA 2016)
- Subjects :
- FOS: Computer and information sciences
Computer science
Speech recognition
entité appelée rEcognition et liens dans le tweets italien
sentiment polarity classification
event factuality annotation
Context (language use)
computer.software_genre
etichettare per messaggi social media
classificazione polarità sentimenti
tagging for italian social media texts
computational linguistics
LAN009000
linguistica computazionale
Computer Science - Computation and Language
reconnaissance téléphonique articulatoire
business.industry
articulatory phone recognition
Linguistics
CF
annotazione fattualità degli eventi
Part of speech
riconoscimento telefonico articolare
named entity rEcognition and linking in italian tweets
linguistique computationelle
Bootstrapping (electronics)
classement polarité sentiments
Artificial intelligence
entità chiamata rEcognition e collegamenti nei tweet italiani
business
étiqueter les messages des médias sociaux
computer
Computation and Language (cs.CL)
Natural language processing
annotation de facturation de l'événement
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- EVALITA 2016, CLiC 2016, Scopus-Elsevier, CLiC-it/EVALITA
- Accession number :
- edsair.doi.dedup.....6ed362f55b16924d69c0c2a51726036b