Digital Corpus of Sanskrit (DCS)

Tuesday, March 19, 2019

The Digital Corpus of Sanskrit (DCS) is a corpus of Sanskrit texts with resolved sandhi and full morphological and lexical analysis.

The DCS is designed for text-historical research in Sanskrit linguistics and philology. Users can search for lexical units (words) and their collocations in a corpus of about 4,000,000 manually tagged words in 560,000 sentences or text lines.

The DCS offers two main entry points for research:

Words can be retrieved from the dictionary through a simple query or a dictionary page. For each lexical unit contained in the corpus, the DCS provides the complete set of occurrences and a statistical evaluation based on historical principles.

The text interface shows all texts contained in the corpus along with their interlinear lexical and morphological analysis.
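
To make the dictionary-style lookup more concrete, here is a minimal Python sketch of how occurrences and collocates of a lexical unit could be retrieved from a lemmatized corpus. It is not the DCS implementation or its API: the three toy sentences, the (surface form, lemma) layout, and the function names build_index, occurrences, and collocates are all assumptions made up for illustration. Collocation is simplified here to co-occurrence within the same sentence.

```python
from collections import Counter, defaultdict

# Hypothetical toy corpus, invented for illustration (not DCS data).
# Each sentence is a list of (surface form, lemma) pairs.
CORPUS = [
    [("rāmo", "rāma"), ("vanaṃ", "vana"), ("gacchati", "gam")],
    [("sītā", "sītā"), ("vanaṃ", "vana"), ("gacchati", "gam")],
    [("rāmaḥ", "rāma"), ("sītāṃ", "sītā"), ("paśyati", "dṛś")],
]


def build_index(corpus):
    """Map each lemma to the (sentence id, position) pairs where it occurs."""
    index = defaultdict(list)
    for sent_id, sentence in enumerate(corpus):
        for pos, (_surface, lemma) in enumerate(sentence):
            index[lemma].append((sent_id, pos))
    return index


def occurrences(index, lemma):
    """Return every occurrence of a lemma, or an empty list if it is absent."""
    return index.get(lemma, [])


def collocates(corpus, index, lemma):
    """Count the lemmas that share a sentence with the given lemma."""
    counts = Counter()
    sentence_ids = {sent_id for sent_id, _pos in index.get(lemma, [])}
    for sent_id in sentence_ids:
        for _surface, other in corpus[sent_id]:
            if other != lemma:
                counts[other] += 1
    return counts


index = build_index(CORPUS)
print(occurrences(index, "rāma"))
print(collocates(CORPUS, index, "vana").most_common())
```

Indexing by lemma rather than by surface form is what lets a single query find "rāmo" and "rāmaḥ" together, which is the practical benefit of a corpus with resolved sandhi and lemmatization.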