Colibri: Constructions as Linguistic Bridges

Sept. 1, 2011 Sept. 1, 2016

Research into the modelling of source-side context in Machine Translation

Publications

The following publications regarding this project have been published:

M. van Gompel and A. van den Bosch
Translation Assistance by Translation of L1 Fragments in an L2 Context
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Association for Computational Linguistics, 2014
Full text (external), RIS, BibTex
M. van Gompel, A. van den Bosch, and A. Dykstra
Oersetter: Frisian-Dutch statistical machine translation
Philologia Frisica anno 2012, 2014
RIS, BibTex
M. van Gompel, I. Hendrickx, A. van den Bosch, E. Lefever, and V. Hoste
Semeval-2014 Task 5: L2 writing assistant
Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), 2014
RIS, BibTex
M. van Gompel and A. van den Bosch
WSD2: parameter optimisation for memory-based cross-lingual word-sense disambiguation
Proceedings of the 7th International Workshop on Semantic Evaluation ({SemEval} 2013), in conjunction with the Second Joint Conference on Lexical and Computational Semantics, New Brunswick, NJ: Association for Computational Linguistics, 2013
RIS, BibTex

Software

The following software was used or developed in the scope of this project:

Colibri Core

Colibri Core

by Maarten van Gompel https://proycon.github.io/colibri-core

Colibri Core is software, consisting of command line tools as well as programming libraries. to quickly and efficiently count and extract patterns from large corpus data, to extract various statistics on the extracted patterns, and to compute relations between the extracted patterns.

Colibri MT

Colibri MT

by Maarten van Gompel https://github.com/proycon/colibri-mt

A Machine Translation framework that wraps around the Moses Decoder and enables k-NN classifier techniques to be used for modelling source-side-context

Colibrita

Colibrita

by Maarten van Gompel https://github.com/proycon/colibrita

Colibrita is a proof-of-concept translation assistance system, translating L1 fragments in an L2 context, using machine learning and statistical machine translation techniques.

PyNLPl: Python Natural Language Processing Library

PyNLPl: Python Natural Language Processing Library

by Maarten van Gompel https://github.com/proycon/pynlpl/

PyNLPl, pronounced as "pineapple", is a Python (2 & 3) library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks.