Thursday, February 18, 2016

Coptic Scriptorium News: Annotation tools now include DDGLC Greek Loanword List

Annotation tools now include DDGLC Greek Loanword List
We are pleased to announce the release of our newest versions of some of our natural language processing tools for Coptic which incorporate the lemma list of loanwords developed by the Database and Dictionary of Greek Loanwords in Coptic (DDGLC).

The DDGLC is part of the KELLIA partnership between American and German digital Coptic projects funded by the NEH Office of Digital Humanities and the DFG.  The DDGLC, under the direction of Prof. Dr. Tonio Sebastian Richter, has been building a database of Greek loanwords in Coptic in order to facilitate the study of language contact, language borrowing, and multilingualism in Egypt.

We have integrated the Greek lemma list into our language of origin tagger, tokenizer and morphology analysis, and lemmatizer.

Our online natural language processing web service (which bundles together all of our NLP tools into one web application) also includes this new data from the DDGLC.

The Greek loanword list should greatly increase the accuracy of many of our tools.  If you use them, please let us know how it goes!

We at Coptic SCRIPTORIUM are grateful for this partnership and the generosity of the DDGLC team.

No comments:

Post a Comment