Social Language Machines

With a focus on Twitter data, the Social Language Machines build tools to monitor public opinion and events (floods, traffic jams, social events), detect the emotion from a text and predict the next characters that someone wants to type.

Research Projects

ADNEXT

ADNEXT

Dec. 12, 2011 -- July 31, 2016 http://www.commit-nl.nl/projects/wp-packages/adaptive-information-extraction-over-time-adnext Antal van den Bosch , Florian Kunneman , Ali Hürriyetoğlu , Mustafa Erkan Başar , Matje van de Camp

The objective of ADNEXT (ADaptive informatioN EXtraction over Time) is to develop trainable, adaptable Dutch language information extraction technology for named entity recognition, event detection, and time identification. The technology has a broad coverage “default” mode and retrains dynamically to new domains upon being confronted with new (clusters of) news or user-generated data (such as Twitter).

Vox Populi

Vox Populi

Jan. 1, 2015 -- Eric Sanders

In the Vox Populi project we try to monitor the opinion of the people by scanning social media (primarily Twitter). We are interested in political views and especially in predicting the outcome of elections on the basis of tweets. A preliminary study showed that by just counting tweets in which political parties are mentioned, we can approximate the polls. Because the posters on Twitter are not necessarily a cross-section of the population, we will research a number of filters to adjust this possible discrepancy. The filters we will be looking into are about: Clustering and topic detection (what are posts about?), User profiling (who is posting?), Sentiment analysis (what is the sentiment of messages?), External event detection (are there specific events that influence the people’s posts?), Time windowing (should we put relative weights on counts of tweets near a certain event (election)?)

Publications

E. Sanders, M. de Gier, and A. van den Bosch
Using Demographics in Predicting Election Results with Twitter
Proceedings of International Conference on Social Informatics. Springer International Publishing, 2016, 259-268, 2016
Full text (pdf), RIS, BibTex
F. Kunneman, C. Liebrecht, M. Van Mulken, and A. Van den Bosch
Signaling sarcasm: From hyperbole to hashtag
Information Processing \& Management, 51(4), 2015
RIS, BibTex
F. Kunneman and A. van den Bosch
Automatically identifying periodic social events from Twitter
Proceedings of Recent Advances in Natural Language Processing 2015, 2015
Full text (external), RIS, BibTex
F. Kunneman, C. Liebrecht, and A. van den Bosch
The (un)predictability of emotional hashtags in Twitter
Proceedings of the 5th Workshop on Language Analysis for Social Media (LASM), Association for Computational Linguistics, 2014
Full text (external), RIS, BibTex
A. Hürriyetoğlu, N. Oostdijk, and A. van den Bosch
Estimating time to event from tweets using temporal expressions
Proceedings of the 5th Workshop on Language Analysis for Social Media (LASM), Association for Computational Linguistics, 2014
Full text (external), RIS, BibTex
F. Kunneman, A. Hürriyetoglu, N. Oostdijk, and A. van den Bosch
Timely identification of event start dates from Twitter
Computational Linguistics in the Netherlands Journal, 4, 2014
RIS, BibTex
F. Kunneman and A. van den Bosch
Event detection in Twitter: A machine-learning approach based on term pivoting
Proceedings of the 26th Benelux Conference on Artificial Intelligence, 2014
RIS, BibTex
C. Liebrecht, F. Kunneman, and A. van den Bosch
The perfect solution for detecting sarcasm in tweets #not
Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Association for Computational Linguistics, 2013
Full text (external), RIS, BibTex
A. Hürriyetoglu, F. Kunneman, and A. van den Bosch
Estimating the time between Twitter messages and future events"
Proceedings of the 13th Dutch-Belgian Information Retrieval Workshop, 2013
RIS, BibTex
H. Tops, A. van den Bosch, and F. Kunneman
Predicting time-to-event from Twitter messages
Proceedings of the 25th Benelux Artificial Intelligence Conference, 2013
RIS, BibTex
F. Kunneman and A. van den Bosch
Leveraging unscheduled event prediction through mining scheduled event tweets
Proceedings of the 24th Benelux Conference on Artficial Intelligence, 2012
RIS, BibTex

Software & Demos

Lama Events

Lama Events

by Antal van den Bosch , Florian Kunneman , Ali Hürriyetoğlu , Mustafa Erkan Başar http://applejack.science.ru.nl/lamaevents/

Lama Events is a calendar application listing events in the near future. The events are detected and selected by a fully automatic procedure in the Dutch Twitter stream (courtesy of Twiqs.nl). Tweets referring to the same future events are clustered based on the frequent co-occurrence of words (names, phrases) and temporal expressions that characterize the event. The date and time of the event is automatically determined based on direct and indirect time references in the texts of the tweets in a cluster. The demo shows a day-by-day ranked list of automatically detected events in the Dutch language area (Netherlands and Flanders).