Author Analysis
I am interested automatic analysis of authorship, both in terms of determining whether two texts are written by the same person or not (authorship attribution), as well as in terms of identifying some characteristics of the author, such as gender and age (author profiling). A few pointers:- A demo of both profiling and attribution using our systems that participated to the PAN competitions. At PAN 2016 and 2017 our systems won the competitions.
- A short video where I explain how we do authorship attribution.
- I co-organise the first truly cross-genre task on author profiling in Italian.
Sentiment and Emotion Analysis
I use language processing tools to analyse and predict the way people express themselves on social media. I have pioneered this work on Italian, but I also work with other languages. I am also one of the initiators and scientific coordinators of the Social Media Sensing group at the University of Groningen.- We have created TWITA, a corpus of Italian tweets, tokenised, POS-tagged, and (automatically) sentiment annotated.
- I am the co-organiser of the first and second campaigns for sentiment analysis in Italian: SENTIPOLC 2014 and SENTIPOLC 2016, run within the framework of EVALITA.
- I co-organise the PEOPLES workshop, in 2018 at its second edition. PEOPLES is a forum for discussing the interplay of various aspects of profiling/sentiment/emotions in social media, and is co-located with major events (COLING 2016, NAACL 2018).
- I have also worked with emotion and controversy detection, exploiting Facebook reactions as distant silver labels for training.