Antal van den Bosch

Professor & Principal Investigator, Centre for Language Studies, since Sept. 1, 2011
E4.05 +31 24 3611647 a.vandenbosch@let.ru.nl http://antalvandenbosch.ruhosting.nl/

Research Projects

ADNEXT

Dec. 12, 2011 -- July 31, 2016 http://www.commit-nl.nl/projects/wp-packages/adaptive-information-extraction-over-time-adnext Antal van den Bosch, Florian Kunneman, Ali Hürriyetoğlu, Mustafa Erkan Başar, Matje van de Camp

The objective of ADNEXT (ADaptive informatioN EXtraction over Time) is to develop trainable, adaptable Dutch language information extraction technology for named entity recognition, event detection, and time identification. The technology has a broad coverage “default” mode and retrains dynamically to new domains upon being confronted with new (clusters of) news or user-generated data (such as Twitter).

Dream research

June 2, 2014 -- Antal van den Bosch, Maarten van Gompel, Florian Kunneman, Ali Hürriyetoğlu, Folgert Karsdorp, Iris Hendrickx, Martin Reynaert, Wessel Stoop, Louis Onrust

Dreams, the involuntary perceptions that occur in our minds during sleep, have been studied in many fields of research, including psychiatry, psychology, neurobiology, and religious studies. Their narrative content also links dreams to other forms of storytelling, with sharp distinctions (such as the focus on one's personal life and the typical personal perspective) but also interesting overlaps with genres such as orally transmitted folktales. We present a study aimed at the large-scale analysis of dreams using text analytics.

FutureTDM

Nov. 1, 2015 -- Oct. 31, 2016 http://project.futuretdm.eu/ Antal van den Bosch, Maria Eskevich

The FutureTDM project identifies current barriers through policy analysis and consultation with researchers, developers, publishers, and SMEs, and will formulate Europe-wide recommendations that address and reduce these barriers at the legal, policy, and organizational levels.

ISHER

June 1, 2012 -- Dec. 1, 2013 http://www.nactem.ac.uk/DID-ISHER/ Antal van den Bosch, Kalliopi Zervanou

Integrated Social History Environment for Research: Digging into Social Unrest, a Digging Into Data project

TraMOOC

Feb. 1, 2015 -- Feb. 1, 2018 http://www.tramooc.eu/ Antal van den Bosch, Iris Hendrickx

TraMOOC (Translation for Massive Open Online Courses) is a Horizon 2020 collaborative project that aims to provide reliable machine translation for Massive Open Online Courses (MOOCs). The main result of the project will be an online translation platform, which will use a wide set of linguistic infrastructure tools and resources to provide accurate and coherent translation to its end users.

Publications

L. Onrust, A. van den Bosch, and H. Van hamme
Improving cross-domain n-gram language modelling with skipgrams
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Association for Computational Linguistics, 2016
E. Sanders, M. de Gier, and A. van den Bosch
Using Demographics in Predicting Election Results with Twitter
Proceedings of the International Conference on Social Informatics, Springer International Publishing, 259-268, 2016
B. Sommerdijk, E. Sanders, and A. van den Bosch
Can Tweets Predict TV Ratings?
Proceedings of the International Conference on Language Resources and Evaluation (LREC) 2016, 3965-3970, 2016
M. Reynaert, M. van Gompel, K. van der Sloot, and A. van den Bosch
PICCL: Philosophical Integrator of Computational and Corpus Libraries
Proceedings of CLARIN Annual Conference 2015 -- Book of Abstracts, CLARIN ERIC, 2015
F. Kunneman, C. Liebrecht, M. Van Mulken, and A. Van den Bosch
Signaling sarcasm: From hyperbole to hashtag
Information Processing & Management, 51(4), 2015
F. Kunneman and A. van den Bosch
Automatically identifying periodic social events from Twitter
Proceedings of Recent Advances in Natural Language Processing 2015, 2015
S. Wubben, S. Verberne, E. Krahmer, and A. Van den Bosch
Facilitating online discussions by automatic summarization
Proceedings of the 27th Benelux Conference on Artificial Intelligence (BNAIC-2015), 2015
A. Van den Bosch and J. Bresnan
Modeling dative alternations of individual children
Proceedings of the Sixth Workshop on Cognitive Aspects of Computational Language Learning (COGACLL-2015), 2015
A. Van den Bosch, T. Bogers, and M. De Kunder
A longitudinal analysis of estimating search engine index size
Proceedings of the 15th International Society of Scientometrics and Informetrics Conference (ISSI-2015), 2015
R. Willems, S. Frank, A. Nijhof, P. Hagoort, and A. Van den Bosch
Prediction during natural language comprehension
Cerebral Cortex, 2015
M. Koolen, T. Bogers, A. Van Den Bosch, and J. Kamps
Looking for Books in Social Media: An Analysis of Complex Search Requests
Advances in Information Retrieval - 37th European Conference on IR Research, ECIR 2015, Vienna, Austria, March 29 - April 2, 2015. Proceedings, 2015
F. Karsdorp, M. Kestemont, C. Schöch, and A. Van den Bosch
The love equation: Computational modeling of romantic relationships in French classical drama
Proceedings of the 6th Workshop on Computational Models of Narrative (CMN-2015), 2015
F. Karsdorp, M. Van der Meulen, T. Meder, and A. Van den Bosch
Animacy detection in stories
Proceedings of the 6th Workshop on Computational Models of Narrative (CMN-2015), 2015
F. Karsdorp, M. Van der Meulen, T. Meder, and A. Van den Bosch
MOMFER: A search engine of Thompson's Motif-Index of Folk Literature
Folklore, 126(1), 2015
M. van Gompel and A. van den Bosch
Translation Assistance by Translation of L1 Fragments in an L2 Context
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Association for Computational Linguistics, 2014
S. Wubben, A. van den Bosch, and E. Krahmer
Creating and Using Large Monolingual Parallel Corpora for Sentential Paraphrase Generation
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), European Language Resources Association (ELRA), 2014, ISBN 978-2-9517408-8-4
W. Stoop and A. van den Bosch
Using idiolects and sociolects to improve word prediction
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, Association for Computational Linguistics, 2014
F. Kunneman, C. Liebrecht, and A. van den Bosch
The (un)predictability of emotional hashtags in Twitter
Proceedings of the 5th Workshop on Language Analysis for Social Media (LASM), Association for Computational Linguistics, 2014
A. Hürriyetoğlu, N. Oostdijk, and A. van den Bosch
Estimating time to event from tweets using temporal expressions
Proceedings of the 5th Workshop on Language Analysis for Social Media (LASM), Association for Computational Linguistics, 2014
S. Verberne, E. D’hondt, A. van den Bosch, and M. Marx
Automatic thematic classification of election manifestos
Information Processing & Management, 50(4), 2014
H. P. Maat, R. Kraf, A. van den Bosch, N. Dekker, M. van Gompel, S. Kleijn, T. Sanders, and K. van der Sloot
T-Scan: a new tool for analyzing Dutch text
Computational Linguistics in the Netherlands Journal, 4, 2014
F. Kunneman, A. Hürriyetoğlu, N. Oostdijk, and A. van den Bosch
Timely identification of event start dates from Twitter
Computational Linguistics in the Netherlands Journal, 4, 2014
F. Kunneman and A. van den Bosch
Event detection in Twitter: A machine-learning approach based on term pivoting
Proceedings of the 26th Benelux Conference on Artificial Intelligence, 2014
F. Kunneman, C. Liebrecht, M. van Mulken, and A. van den Bosch
Signaling sarcasm: From hyperbole to hashtag
Information Processing and Management, 2014
M. van Gompel, A. van den Bosch, and A. Dykstra
Oersetter: Frisian-Dutch statistical machine translation
Philologia Frisica anno 2012, 2014
M. van Gompel, I. Hendrickx, A. van den Bosch, E. Lefever, and V. Hoste
Semeval-2014 Task 5: L2 writing assistant
Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), 2014
S. Wubben, E. Krahmer, and A. van den Bosch
Using character overlap to improve language transformation
Proceedings of the 7th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, Association for Computational Linguistics, 2013
A. van den Bosch and P. Berck
Memory-based grammatical error correction
Proceedings of the Seventeenth Conference on Computational Natural Language Learning: Shared Task, Association for Computational Linguistics, 2013
C. Liebrecht, F. Kunneman, and A. van den Bosch
The perfect solution for detecting sarcasm in tweets #not
Proceedings of the 4th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Association for Computational Linguistics, 2013
S. Verberne, E. D’hondt, A. van den Bosch, and M. Marx
Automatic Thematical Classification of Party Programmes
Book of Abstracts of the 23rd Meeting of Computational Linguistics in the Netherlands: CLIN 2013, 2013
A. Hürriyetoğlu, F. Kunneman, and A. van den Bosch
Estimating the time between Twitter messages and future events
Proceedings of the 13th Dutch-Belgian Information Retrieval Workshop, 2013
E. Sanders and A. van den Bosch
Relating Political Party Mentions on Twitter with Polls and Election Results
Proceedings of 13th Dutch-Belgian Workshop on Information Retrieval, 2013
K. Zervanou, M. Düring, I. Hendrickx, and A. van den Bosch
Documenting Social Unrest: Detecting Strikes in Historical Daily Newspapers
Proceedings of the 1st International Workshop on Histoinformatics, 2013
H. Tops, A. van den Bosch, and F. Kunneman
Predicting time-to-event from Twitter messages
Proceedings of the 25th Benelux Artificial Intelligence Conference, 2013
R. Berendsen, M. de Rijke, K. Balog, T. Bogers, and A. van den Bosch
On the assessment of expertise profiles
Journal of the American Society for Information Science and Technology, 64(10), 2013
A. van den Bosch and T. Bogers
Memory-based named entity recognition in tweets
#MSM2013 Workshop Concept Extraction Challenge Proceedings, 2013
A. van den Bosch and W. Daelemans
Implicit Schemata and Categories in Memory-based Language Processing
Language and Speech, 56(3), 2013
A. van den Bosch, R. Morante, and S. Canisius
Joint learning of dependency parsing and semantic role labeling
Computational Linguistics in the Netherlands Journal, 2, 2013
M. van Gompel and A. van den Bosch
WSD2: parameter optimisation for memory-based cross-lingual word-sense disambiguation
Proceedings of the 7th International Workshop on Semantic Evaluation (SemEval 2013), in conjunction with the Second Joint Conference on Lexical and Computational Semantics, New Brunswick, NJ: Association for Computational Linguistics, 2013
S. Verberne, A. Van Den Bosch, H. Strik, and L. Boves
The effect of domain and text type on text prediction quality
Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, 2012
M. van de Camp and A. van den Bosch
The socialist network
Decision Support Systems, 53(4), 2012
F. Kunneman and A. van den Bosch
Leveraging unscheduled event prediction through mining scheduled event tweets
Proceedings of the 24th Benelux Conference on Artificial Intelligence, 2012
A. van den Bosch and P. Berck
Memory-based text correction for preposition and determiner errors
Proceedings of the 7th Workshop on the Innovative Use of NLP for Building Educational Applications, ACL, 2012
R. Haque, S. K. Naskar, A. van den Bosch, and A. Way
Integrating source-language context into phrase-based statistical machine translation
Machine Translation, 25(3), 2011
S. Wubben, E. Marsi, A. van den Bosch, and E. Krahmer
Comparing Phrase-based and Syntax-based Paraphrase Generation
Proceedings of the Workshop on Monolingual Text-To-Text Generation, Association for Computational Linguistics, 2011
M. van de Camp and A. van den Bosch
A Link to the Past: Constructing Historical Social Networks
Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis (WASSA 2.011), Association for Computational Linguistics, 2011
P. Vossen, A. Görög, F. Laan, M. van Gompel, R. Izquierdo-Bevia, and A. van den Bosch
DutchSemCor: building a semantically annotated corpus for Dutch
Electronic lexicography in the 21st century: New Applications for New Users: Proceedings of eLex 2011, Bled, 10-12 November 2011, 2011
M. van Gompel, A. van den Bosch, and P. Berck
Extending memory-based machine translation to phrases
Proceedings of the Third Workshop on Example-Based Machine Translation, 2009

Software

Fowlt.net

by Antal van den Bosch, Wessel Stoop http://fowlt.net/

Fowlt is a spelling correction system for English.

Frog

by Antal van den Bosch, Maarten van Gompel, Ko van der Sloot https://languagemachines.github.io/frog

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on TiMBL, the Tilburg memory-based learning software package. Most modules were created in the 1990s at the ILK Research Group (Tilburg University, the Netherlands) and the CLiPS Research Centre (University of Antwerp, Belgium). Over the years they have been integrated into a single text processing tool, which is currently maintained and developed by the Language Machines Research Group and the Centre for Language and Speech Technology at Radboud University Nijmegen. A dependency parser, a base phrase chunker, and a named-entity recognizer module were added more recently. Where possible, Frog makes use of multi-processor support to run subtasks in parallel.

Lama Events

by Antal van den Bosch, Florian Kunneman, Ali Hürriyetoğlu, Mustafa Erkan Başar http://applejack.science.ru.nl/lamaevents/

Lama Events is a calendar application listing events in the near future. The events are detected and selected from the Dutch Twitter stream (courtesy of Twiqs.nl) by a fully automatic procedure. Tweets referring to the same future event are clustered based on the frequent co-occurrence of words (names, phrases) and temporal expressions that characterize the event. The date and time of the event are determined automatically from direct and indirect time references in the texts of the tweets in a cluster. The demo shows a day-by-day ranked list of automatically detected events in the Dutch language area (Netherlands and Flanders).
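
As a rough illustration of how indirect time references in a cluster of tweets could be turned into a single event date, the sketch below resolves a few relative Dutch expressions against each tweet's posting date and takes a majority vote. This is not the actual Lama Events procedure: the lexicon `RELATIVE_OFFSETS`, the function names, and the example tweets are all hypothetical, and direct references (explicit dates) are left out for brevity.

```python
from collections import Counter
from datetime import date, timedelta

# Hypothetical, tiny lexicon of relative Dutch time expressions -> day offsets.
RELATIVE_OFFSETS = {"vandaag": 0, "morgen": 1, "overmorgen": 2}


def resolve_dates(text, posted_on):
    """Return the candidate dates referred to in one tweet."""
    candidates = []
    for token in text.lower().split():
        if token in RELATIVE_OFFSETS:
            candidates.append(posted_on + timedelta(days=RELATIVE_OFFSETS[token]))
    return candidates


def event_date(cluster):
    """Pick the date referred to most often in a cluster of (text, posting date) pairs."""
    votes = Counter()
    for text, posted_on in cluster:
        votes.update(resolve_dates(text, posted_on))
    return votes.most_common(1)[0][0] if votes else None


# Made-up cluster of tweets about the same event.
cluster = [
    ("Morgen naar het concert in Nijmegen", date(2015, 5, 1)),
    ("Overmorgen concert in Nijmegen !", date(2015, 4, 30)),
    ("vandaag kaartjes gekocht voor het concert", date(2015, 4, 29)),
]
predicted = event_date(cluster)  # two of the three references point to 2 May 2015
```

Voting across the cluster makes the estimate robust against tweets whose time reference concerns something other than the event itself, such as the third tweet above.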

MBT: Memory-based tagger generator and tagging

by Antal van den Bosch, Ko van der Sloot https://github.com/LanguageMachines/mbt/

MBT is a memory-based tagger-generator and tagger in one. The tagger-generator part can generate a sequence tagger on the basis of a training set of tagged sequences; the tagger part can tag new sequences. MBT can, for instance, be used to generate part-of-speech taggers or chunkers for natural language processing. It has also been used for named-entity recognition, information extraction in domain-specific texts, and disfluency chunking in transcribed speech.

TiMBL: Tilburg Memory-Based Learner

by Antal van den Bosch, Maarten van Gompel, Ko van der Sloot, Walter Daelemans, Jakub Zavrel https://languagemachines.github.io/timbl

TiMBL is an open-source software package implementing several memory-based learning algorithms, including IB1-IG, an implementation of k-nearest neighbor classification with feature weighting suitable for symbolic feature spaces, and IGTree, a decision-tree approximation of IB1-IG. All implemented algorithms store some representation of the training set explicitly in memory. During testing, new cases are classified by extrapolation from the most similar stored cases. For over fifteen years TiMBL has been used mostly in natural language processing as a machine learning classifier component, but its use extends to virtually any supervised machine learning domain. Due to its particular decision-tree-based implementation, TiMBL is in many cases far more efficient in classification than a standard k-nearest neighbor algorithm would be.
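
To make the idea behind IB1-IG concrete, here is a minimal sketch of k-nearest neighbor classification over symbolic features, using an overlap distance weighted by each feature's information gain. It is not TiMBL's code or API: the function names and the toy data are made up for illustration, the search is a plain linear scan rather than TiMBL's tree-based index, and k counts individual neighbors.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(instances, labels, feature_index):
    """Reduction in class entropy obtained by knowing one symbolic feature."""
    by_value = {}
    for features, label in zip(instances, labels):
        by_value.setdefault(features[feature_index], []).append(label)
    remainder = sum(len(subset) / len(labels) * entropy(subset)
                    for subset in by_value.values())
    return entropy(labels) - remainder


def classify(instances, labels, weights, query, k=1):
    """Majority vote among the k stored instances closest to the query.

    Distance is the weighted overlap metric: the summed weights of the
    features on which the stored instance and the query disagree.
    """
    scored = []
    for features, label in zip(instances, labels):
        distance = sum(w for w, a, b in zip(weights, features, query) if a != b)
        scored.append((distance, label))
    scored.sort(key=lambda pair: pair[0])
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Toy, made-up data: (previous tag, word suffix) -> tag of the current word.
train_x = [("DET", "og"), ("DET", "at"), ("PRON", "ks"), ("PRON", "ps"), ("DET", "ks")]
train_y = ["NOUN", "NOUN", "VERB", "VERB", "NOUN"]

weights = [information_gain(train_x, train_y, i) for i in range(2)]
prediction = classify(train_x, train_y, weights, ("PRON", "og"), k=1)
```

In this toy set the previous tag is the more informative feature, so it receives the larger weight and the query `("PRON", "og")` is matched to a stored `PRON` instance rather than to the instance that shares only its suffix.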

Valkuil.net

by Antal van den Bosch, Maarten van Gompel http://valkuil.net/

Valkuil is a Dutch spelling correction system.