The aim with the present series, The Quran: Word List, is to present the Quran as raw data with as little interpretation as possible. The digital text used is the Uthmani text of the Tanzil Quran Text. This text is widely used, and we have conducted some comparison to pre-digital age printed editions of the Quran. This series is divided into three volumes. In the first volume, each attested word form in the Quran--adjectives, nouns, proper nouns and verbs--is listed with no parsing and no alteration whatsoever to the orthography of the Tanzil Uthmani text. They are listed as follows: word form < lemma < root. In the second volume, the lemmas assigned to each attested word form are listed as follows: lemma < root. In the third volume, the assigned roots are listed. Each list is sorted alphabetically according to Arabic and UNICODE order. In assigning each word a root and lemma, Classical dictionaries and Quran commentaries, as well as modern Quran dictionaries have been consulted. Deciding on these is not always obvious, since classical dictionaries and commentaries sometimes either disagree or present divergent variant readings or root and lemma attributions. No attention has been given to the semantics of each word. Only on rare occasion have similar forms of words been separated in order to avoid morphological confusion. All vocalized Arabic text is quoted unaltered in any shape or form from the Tanzil Uthmani text.
Elie Wardini is professor of Arabic at Stockholm University, Sweden. His research has focused on Semitic languages, especially Aramaic-Arabic contact from a diachronic and sociolinguistic perspective.