Tuesday, November 28, 2017

Proto-Indo-European Lexicon: The generative etymological dictionary of Indo-European languages

[First posted in AWOL 6 August 2015, updated 28 November 2017]

Proto-Indo-European Lexicon: The generative etymological dictionary of Indo-European languages
Welcome to PIE Lexicon Pilot 1.1: The generative etymological dictionary of Anatolian languages.
Proto-Indo-European Lexicon is the generative etymological dictionary of Indo-European languages.

The current version, PIE Lexicon Pilot 1.1, presents digitally generated data of hundred most ancient Indo-European languages with three hundred new etymologies for Old Anatolian languages, Hitttite, Palaic, Cuneiform Luwian and Hieroglyphic Luwian, arranged under two hundred Indo-European roots.

The correspondences contain data of all fourteen sub-branches of the Indo-European languages, Albanian, Anatolian, Armenian, Baltic, Celtic, Germanic, Greek, Indo-Aryan, Iranian, Italic, Old Balkan (Satem), Old Balkan (Centum), Slavic and Tocharian.

The Anatolian etymologies have been chosen due to their particularly problematic nature, and in the absence of other criteria the selection is random. However, as all the oldest forms of the languages are represented in the data, the ancient languages still absent in PIE Lexicon are supplemented in the near future.

We use a set of sound laws of the Neogrammarians, the laryngeal theory and monolaryngealism critically revised and selected to form a consistent system in Pyysalo (2013), which allows a computer-generated derivation of the most important (ancient) Indo-European languages.
The derivation has been digitized by means of Foma, a programming language developed by Mans Hulden (2009). In Foma the Indo-European sound laws are equipped with their digital counterparts, which are then arranged in chronological order. After this a PIE reconstruction results in a respective Indo-European form, when the Foma script of that language is applied.

All Indo-European forms of PIE Lexicon are automatically generated from the proto-language, and if marked with black the generation is successful. Any errors in the derivation are marked with red, and the derivation of a form through successive sound laws is generated by clicking its PIE reconstruction (in blue).
The Proto-Indo-European Phoneme Inventory
Show the entire data in a single page   |   Show all mismatches

1 comment:

  1. One must bear in mind, especially those of us who are less familiar with this field of study, that Proto-Indo-European is not a single, undifferentiated point-source of all its attested descendants, but has a partly reconstructable internal history stretching back through several thousand years, resulting in regional variations and irregular paradigms within the notional "parent language". Some of the earlier phases can be glimpsed via the way the accent alternated between phonemic pitch and phonemic stress, producing the complexities of apophony that dominate the verb and nominal paradigms in their latest reconstructed PIE shapes.