1. #WhoAmI in 160 Characters? Classifying Social Identities Based on Twitter Profile Descriptions
- Author
-
Djoerd Hiemstra, Michel Léon Ehrenhard, Anna Priante, Aaqib Saeed, Ariana Need, Tijs van den Broek, Public Administration, Databases (Former), and Faculty of Behavioural, Management and Social Sciences
- Subjects
EWI-27810 ,business.industry ,05 social sciences ,Ethnic group ,Identity (social science) ,050109 social psychology ,computer.software_genre ,Test (assessment) ,Politics ,METIS-319140 ,IR-102356 ,0502 economics and business ,0501 psychology and cognitive sciences ,Artificial intelligence ,Social identity theory ,business ,Psychology ,computer ,050203 business & management ,Natural language processing ,Social theory - Abstract
We combine social theory and NLP methods to classify English-speaking Twitter users’ online social identity in profile descriptions. We conduct two text classification experiments. In Experiment 1 we use a 5-category online social identity classification based on identity and self-categorization theories. While we are able to automatically classify two identity categories (Relational and Occupational), automatic classification of the other three identities (Political, Ethnic/religious and Stigmatized) is challenging. In Experiment 2 we test a merger of such identities based on theoretical arguments. We find that by combining these identities we can improve the predictive performance of the classifiers in the experiment. Our study shows how social theory can be used to guide NLP methods, and how such methods provide input to revisit traditional social theory that is strongly consolidated in offline settings.
- Published
- 2016
- Full Text
- View/download PDF