KELLIA: The Koptische/Coptic Electronic Language and Literature International Alliance
KELLIA: The Koptische/Coptic Electronic Language and Literature International Alliance
KELLIA (the Koptische/Coptic Electronic Language and Literature
International Alliance) is a partnership between digital Coptic projects
and Copticists in Germany and the United States.
Kellia’s goals are to establish standards for digital
Coptic projects, including for transcription of Coptic and metadata
curation, and to create or adapt open-source tools for linguistic
analysis and annotation. Kellia is funded by a bilateral NEH/DFG grant.
KELLIA Products
White Papers
A report with recommendations and best practices
for digital projects in Coptic language and literature regarding the
transcription and encoding of Coptic text. Authors: Caroline T.
Schroeder, Ulrich Schmid, So Miyagawa, Elizabeth Platte, Amir Zeldes
A report with recommendations and best practices
for digital projects involving Coptic texts regarding how to catalog,
index, and describe digital text objects. Authors: Ulrich Schmid, So
Miyagawa, Caroline T. Schroeder
A report with recommendations, best practices,
and practical options for digial projects in Coptic studies regarding
linking data between projects and websites. Authors: Elizabeth Platte,
Caroline T. Schroeder, Ulrich Schmid
Describes all grant activities and outcomes, including publications.
KELLIA members Frank Feder and Maxim Kupreyev of
the The Berlin-Brandenberg Academy of Sciences and Julien Delhez of the
University of Göttingen created a digital lexicon of Coptic. Amir Zeldes
and his student Emma Manning created a web interface for the lexicon,
which is available as a standalone website and is linked to Coptic SCRIPTORIUM texts in the normalized visualization available through the web application and ANNIS tool.
KELLIA member Amir Zeldes created a web
application and a machine actionable API that simultaneously runs Coptic
SCRIPTORIUM's Natural Language Processing tools. Users can input text,
including XML tags, and the online NLP pipeline will tokenize,
normalize, lemmatize, and tag the text for part-of-speech and language
of origin in SGML output.
KELLIA member So Miyagawa is creating Coptic OCR
using Artificial Neural Networks collaborating with Kirill Bulert
(eTRAP/Max Planck Insitute for Biophysical Chemistry), Marco Büchler
(eTRAP) and Eliese-Sophia Lincke (Humboldt Universität zu Berlin/eTRAP).
KELLIA member Uwe Sikora built a converter of XML structrues of Coptic SCRIPTORIUM and VMR
Integration of data from the Database and Dictionary of Greek Loanwords in Coptic (DDGLC) into the NLP tools
Information from the DDGLC from the University of Leipzig has been added to NLP tools, improving the language of origin tagger, the tokenizer and morphology analysis, and the lemmatizer and part-of-speech tagger. All of these tools are also available from the online NLP service.
Growing Besa Corpus
KELLIA partner So Miyagawa has edited and translated Besa's On Lack of Food, which is available on the Coptic SCRIPTORIUM web application for viewing texts and is searchable in ANNIS
GitDox: Online multilayer corpus annotation editing tool
GitDox
is a light-weight transcription and annotation tool customizable for
individual projects and for multiple languages. Created by researchers
at Georgetown University during the KELLIA project, it currently
contains a transcription/text editor with customizable encoding
validation options and a spreadsheet editor for collaborative editing of
a multi-layer annotated document. The tool can be adapted to different
languages; the Coptic Scriptorium version is linked to the online
Coptic Natural Language Processing Service. After researchers
transcribe a Coptic text with light XML markup for structural
information (i.e. page breaks, missing text, etc.), they can click a
button to run the text through the NLP tool pipeline; this annotated
text is presented to the researcher in a multilayer format in
spreadsheet mode. Researchers commit the data and subsequent edits to
repositories on GitHub. The tool includes space for document metadata
and customizable validation mechanisms. The tool is open-source (Apache
2.0 license) and available for download and installation.
Coptic WordNet
KELLIA member So Miyagawa is building
Coptic WordNet with Laura Slaughter (University of Oslo) and Luís
Morgado da Costa (Nanyang Technological University, Singapore). One of
their papers is now available online.
KELLIA member Uwe Sikora, Tiffany Ziegler
and So Miyagawa created a link and description website about DH projects
of digital editions.
No comments:
Post a Comment