Corpus of Gāndhārī Texts

Sunday, December 8, 2024

Corpus of Gāndhārī Texts

Stefan Baums and Andrew Glass

The foundation for our work on this site is a digital corpus of all published Gāndhārī texts. We started jointly compiling this corpus in 2002, and brought coverage to completion in 2014 (see Baums & Glass 2013 for a brief history of our work, as well as our Blog). Going forward, we continue to keep our corpus updated as new material is discovered and published, add improved documentation to each text, and carry out a comprehensive linguistic analysis of the full range of Gāndhārī texts that we assembled. Especially the Gāndhārī manuscripts discovered in the last twenty years (most of which remain unpublished) are making significant contributions to the corpus, but also new inscriptions and coins from Gandhāra and wooden documents from Central Asia continue to be found.

Sources: Hultzsch 1925, British Library, British Library, Stefan Baums.

The current numbers of items and word tokens in our corpus are as follows:

388 manuscripts and manuscript fragments

1,166 inscriptions

901 administrative documents

337 coin legends

= 2,792 items (119,314 word tokens)