Thursday, November 22, 2018

LEMLAT 3.0: Morphological analysis of the Latin

LEMLAT 3.0
LEMLAT 3.0 is the latest development of the morphological analyser and lemmatiser for Latin LEMLAT.

This update consists in porting the code from C++ to Java .You can read details about past versions of LEMLAT in the Credits page.

The lexical base of LEMLAT is the result of the collation of three Latin dictionaries (Georges and Georges, 1913-1918; Glare, 1982; Gradenwitz, 1904). It counts 40,014 lexical entries and 43,432 lemmas. This version has also been enriched by (a) 26,250 lemmas out of 28,178 from the Onomasticon by Forcellini (1940) and (b) 82,556 lemmas out of 85,999 from Glossarium Mediae et Infimae Latinitatis by Du Cange (1883-1887).

The lexical base of LEMLAT is the result of the collation of three Latin dictionaries (Georges and Georges, 1913-1918; Glare, 1982; Gradenwitz, 1904). It counts 40,014 lexical entries and 43,432 lemmas. This version has also been enriched by (a) 26,250 lemmas out of 28,178 from the Onomasticon by Forcellini (1940) and (b) 82,556 lemmas out of 85,999 from Glossarium Mediae et Infimae Latinitatis by Du Cange (1883-1887).

LEMLAT 3.0 provides lemmatisation and morphological features for single word forms or text files.

The lexical basis of LEMLAT is used also in the derivational morphology dictionary Word Formation Latin. Funded by the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 658332-WFL and developed at CIRCSE Research Centre in Milan, Word Formation Latin (WFL) shows derivation relations between lemmas. Further details on WFL can be found here. To use WFL go to wfl.marginalia.it

No comments:

Post a Comment