the turkic languages in a nutshell - türkçe Öğretimi · 2019. 3. 12. · site adresi: the...
TRANSCRIPT
-
Site adresi: The Internal Classification & Migration of Turkic languages
1
THE TURKIC LANGUAGES IN A NUTSHELL
The Internal Classification & Migration of Turkic languages Version 8.1
v.1 (04/2009) (first online, phonological studies) > v.4.3 (12/2009) (major update, lexicostatistics added) >
v.5.0 (11/2010) (major changes, the discussion of grammar added) > v.6.0 (11-12/2011) (major corrections to the text; maps, illustrations, references added) > v.7.0 (02-04/2012) (corrections to Yakutic, Kimak, the lexicostatistical part; the chapter on Turkic Urheimat was transferred into a separate article; grammatical and logical corrections) > v.8 (01/2013) (grammatical corrections to increase logical consistency and readability, additions to the chapter on Uzbek-Uyghur, Yugur)
Abstract
The internal classification of the Turkic languages has been rebuilt from scratch based upon the phonological, grammatical, lexical, geographical and historical evidence. The resulting linguistic phylogeny is largely consistent with the most prevalent taxonomic systems but contains many novel points.
Contents
1. Introduction
1.1 Preliminary notes on the reconstruction of Proto-Turkic
2. Collecting factual material
2.1 An overview of the lexicostatistical research in Turkic languages
2.2 Dissimilar basic lexemes in the Turkic languages
2.3 The comparison of phonological and grammatical features
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
2
3. Making Taxonomic Conclusions
Bulgaric
Some of the exclusive Bulgaric features
Yakutic
Where does Sakha actually belong?
How did Sakha actually get there?
On the origins of Turkic ethnonymy
Altay-Sayan
Tofa and Soyot closely related to Tuva
The Khakas languages
Khakas and Tuvan share no exclusive innovations
Altay, Khakas and Tuvan form the Altay-Sayan subgroup
Great-Steppe
Kimak-Kypchak-Tatar, Kyrgyz-Kazakh, and Chagatai-Uzbek-Uyghur seem to form a genetic unity
Great-Steppe and Altay-Sayan seem to be closer to each other than to Oghuz-Seljuk
Kyrgyz-Chagatai
Kazakh is closely related to Kyrgyz
Altay-Kyrgyz isolexemes
Chagatai looks like Karakhanid affected by Kyrgyz
Kimak-Kypchak-Tatar
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
3
The Kimak subtaxon
The relationship between Oghuz and Kimak
On the origins of the ethnonym Tatar
Bashkir is closely related to Kazan Tatar
On the origins of Nogai
Karachay-Balkar, an atypical Kimak language
Oghuz-Seljuk
Oghuz is still a valid subtaxon
Seljuk as a subtaxon of Oghuz
Oghuz-Seljuk is indirectly related to Orkhon-Karakhanid
Notes on the confusion about y-/j- in Oghuz and Kimak
Orkhon-Karakhanid
Orkhon-Karakhanid as a valid subtaxon
Khalaj is probably an offshoot of South Karakhanid
Yugur-Salar
Yugur seems to be ancient
Salar has little to do with Oghuz, but quite a lot with Yugur and Uyghur
4.The Resulting Internal Classification of Bulgaro-Turkic languages
4.1 The Genealogical Classification of Bulgaro-Turkic languages
4.2 The taxonomic Classification of Bulgaro-Turkic languages
4.3 The Geographical Tree of Bulgaro-Turkic languages
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
4
5. References and sources
1. Introduction
The present study of the Turkic languages (2009-2012) was started as brief online notes that gradually grew into a series of online publications. The study is mostly an original research with relatively few references to previous theories. Most analysis was based upon factual evidence collected from dictionaries, grammars, language textbooks, native speakers on the web, sound and video fragments, books and articles containing detailed descriptions of specific languages. The resulting conclusions rarely draw from historically accepted opinions or assumptions produced by other researchers, rather attempting to build a logically consistent view of the spread of Turkic languages and their internal classification grounded in the nearly independent and relatively comprehensive step-by-step analysis.
Nevertheless, the author deeply appreciates the extensive input from people who worked on the vast amount of Turkological literature dedicated to the numerous Turkic languages, as well as those who helped directly or indirectly by providing corrections and valuable notes by email or through web forums, without whose interest and collaboration this work would never have come to life.
The present article provides all the linguistic argumentation concerning the internal classification of Bulgaro-Turkic languages. Furthermore, there are three other separate articles which can be regarded as part of the same work.
The Lexicostatistics and Glottochronology of the Turkic languages (2009-2012) is a detailed research of Swasdesh-210 wordlists, which dates the Turkic Proper split to about 300-400 BC, and the Bulgaro-Turkic split to about 1000 BC.
The Proto-Turkic Urheimat & The Early Migrations of the Turkic Peoples (2012-13) is a detailed analysis of the early Bulgaro-Turkic migrations largely based upon the results obtained in the glottochronological analysis above and the present classification. The Proto-Turkic Proper Urheimat area was positioned northwest of the Altai Mountains, and the earlier Proto-Bulgaro-Turkic Urheimat in northern Kazakhstan. The work explores the associations with the major archaeological cultures of the Bronze and Iron Age period in West Siberia.
The Turkic languages in a Nutshell (2009-2012) embraces the final classification, trying to focus on the most well-established conclusions from various works including the present investigation. It also contains multiple illustrations, notes on history, ethnography, geography and the most typical linguistic features, which essentially makes it a basic introduction into Turkology for beginners.
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BChttp://turkic-languages.scienceontheweb.net/Turkic_languages_glottochronology.htmlhttp://turkic-languages.scienceontheweb.net/Proto_Turkic_Urheimat.htmlhttp://turkic-languages.scienceontheweb.net/Proto_Turkic_Urheimat.htmlhttp://turkic-languages.scienceontheweb.net/
-
Site adresi: The Internal Classification & Migration of Turkic languages
5
1.1 Preliminary notes on the reconstruction of Proto-Turkic
Before we proceed with the main analysis, let us consider the reconstruction of the Proto-Bulgaro-Turkic word-initial *j/*y, which has become a long-standing issue in Turkological studies, and which may affect certain conclusions in the main part of this publication.
Many proto-language reconstructions in various branches of historical linguistics are often based entirely on the supposed readings of the ancient texts from the oldest family representatives. For instances, in the Indo-European studies we can avail ourselves of the wonderful attestations of Ancient Greek, Latin and Avestan. However, when the oldest representatives are poorly read and interpreted, such an approach can result in errors.
Generally speaking, an ancient extinct language can only be seen suitable for reconstruction purposes, only if it meets several conditions, namely: (1) it is a uniquely preserved language closely related to a proto-state without the existence of any alternative sibling branches; (2) it is so well-attested that its data are completely reliable and no significant misinterpretations can occur from occasional mistakes in ancient writing, reading (e.g., from abraded petroglyphs), copying of the material, translation, interpretation, etc; (3) the scriptclosely and adequately reflects the original pronunciation and we know full well how to correctly reconstruct that pronunciation from that script; (4) the linguistic material should should be dialectically uniform, in other word it should constitute just one language, not a mixture of various dialects or languages gathered by numerous contributors during generally unknown periods or from unknown areas [which is referred herein as the Sanskrit dictionary syndrome].
Obviously, the situation in Turkology does not meet these criteria. Orkhon Old Turkic, the oldest Turkic language attested in the inscriptions from Mongolia, fails to meet the first point (see details below), it barely gets in with the second one, and raises many objections with the third one. In other words, Orkhon Old Turkic may just be insufficiently old or much too geographically off-centered to be considered close enough to the proto-state. Moreover, there may be just not enough correctly interpeted material for the solid attestation and interpretation of ancient phonology. Orkhon Old Turkic is not as well reconstructed as, say, Latin and Greek in the Indo-European studies, so many readings are quite ambiguous. And finally, it often gets mixed in literature with Old Karakhanid, Old Uyghur and generally unknown Old Yenisei Kyrgyz dialects (given that not all of the Old Turkic inscription were made in Mongolia). Therefore one should not confuse the methodological basis established for the Indo-European reconstruction with the methods convenient for other language branches, such as Turkic. An old language is not always just good enough.
As a result, the reconstruction of Proto-Turkic should be conducted by means of a completely different approach, namely using materials from the well-attested modern representativesof Turkic languages. In that case, we should build a reconstruction using a lineal formula with separately determined lineal coefficients representing contributions for each particular language branch. This method is drastically different from the old-fashioned old-language-for-all model. As an example, when reconstructing Bulgaro-Turkic, we could roughly assign about 50% to Chuvash and about 50% to Proto-Turkic Proper, and then more or less equally divide the second half among the most archaic representatives from the main
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
6
branches, e.g. (1) Proto-Sakha, (2) Proto-Altay-Sayan + Proto-Great-Steppe, and (3) Proto-Oghuz-Orkhon-Karakhanid , hence each one of the main Turkic branches would receive only about 50% /3 = 17% (see the classification dendrogram at the end of this article).
This example has been provided as a first-approximation approach to address the potential Old-Turkic-centristic attitude, which supposedly claims that "nothing that's not in Old Turkic could exist in Proto-Turkic" or that "Old Turkic is an ancient language, therefore it is more suitable for historical reconstruction". By contrast, the current revised method requires that Gökturk Old Turkic be considered as just one of several early Turkic branches, and it is hardly any more important for reconstruction purposes than about 17% or less.
However, the figures for the lineal coefficients depend on the genealogical topology of the most basic shoots in the internal classification dendrogram. Therefore, using Turkic languages as an example, we come to a general conclusion that a consistent internal tree-like language group classification must be built before proceeding with the reconstrution of a proto-language. In other words, an internal classification should be constructed prior to further linguistic or geomigrational analysis.
An example from the Revised Model: the reconstruction of the Proto-Bulgaro-Turkic *S-
The above reasoning can be exemplified by the following reconstruction of the Proto-Bulgaro-Turkic *S- (the S-symbol should be seen herein as just an arbitrary way to designate the *y-/ *j-phoneme as in Turkic yer / jer "place, earth", yol /jol "way", etc ). A very common error resulting from the Turkish-for-all or Karakhanid-for-all model is the conclusion that the words with the y- were pronounced exactly the same way in Proto-Bulgaro-Turkic. This idea is very common even among Turkologists outside Turkey, and seems to go as far back as the Mahmud al-Kashgari's classical Compendium of the Turkic languages (1073).
Note: Before proceeding with the further argumentation, we should confine ourselves only to the material internal to the Turkic languages, the Altaic and Nostratic languages being a completely separate issue that cannot be regarded herein at any length. This method can generally be called as an internally-based reconstruction vs. full reconstruction.
Note: We try to consistently use the Anglophone-based transcription throughout all the articles as opposed to the German-based transciption that goes back to the 19th century's tradition, therefore /y-/ denotes a semivowel as in "year" and /j-/ or /J-/ an affricate as in "Jack". To avoid occasional confusion, the capital denotation /J-/ has been used in some places for additional emphasis. The digraph /zh/ or monograph /ž/ are approximately similar to the voiced sibilant in French "je" or English "pleasure", "treasure". The use of complex UTF signs was avoided for reasons of readability and technical compatibility. For further details on transcription see The Turkic languages in a Nutshell.
The following table summerizes the pronunciation of the Turkic *S- in the most important branches:
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BChttp://turkic-languages.scienceontheweb.net/
-
Site adresi: The Internal Classification & Migration of Turkic languages
7
The Reconstruction of the Proto-Bulgaro-Turkic *S
Subgroup Phoneme
Remarks
Bulgaric
Dunai-Bulgar, Kuban-Bulgar d'; zh-/ch-;
j'-/ sh'-
The Dunai-Bulgar texts were written in Cyrillic, though their originals had possibly been written in Greek.
The Bulgaric words in Hungarian are written with the digraph , which should be read as /J-/ (as in Italian that provided basis for the orthography) (see Rona-Tash, and A. Dybo). Some of the Hungarian words have the initial sh-, such as shel (shelet) "wind" (cf. Chuvash s'il). Also, cf. the borrowing zhenchugê "pearls" into Old Russian (attested in 1161) and gyongy into Hungarian.
Chuvash s'- palatalized, soft
Turkic Proper
Yakut, Dolgan s-,
s- > h-
Aspirated between vowels,
hence /h/ in Dolgan due to the Evenk substratum.
Tuvan, Tofa ch'- slightly palatalized
Khakas, Shor, Chulym ch'-, n'- slightly palatalized;
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
8
sometimes an irregular /n-/ before /-i, -ï/
Kumandy (North Altai) ch'-, n'- as in Khakas
Standard South Altai d'-/ j- a palatalized soft /d'/ in writing, though pronounced much like English /j-/, maybe just shorter and with more palatalization.
Karakalpak, Kazakh, Kyrgyz
zh- < j-
(west to east);
j- (Kyrgyz)
An English-type /j-/ affricate in the eastern dialect of Kazakh probably due to the contact with the Altai-type /d'-/, but a /zh-/ sibilant in the western dialects apparently due to a contact with y-type languages.
Although at least one speaker suggested that /j-/ (the voiced /ch-/) was in fact original even in central Kazakhstan, whereas /zh-/ developed in the course of the 20th cent. due to a Russified spelling and pronunciation. That can be true in some cases due to mass bilingualism in Kazakhstan.
Similarly, this suggestion is partly corroboarted in Melioransky's textbook of Kazakh (1894), who wrote that this sound would be similar in pronunciation to the Russian /dzh/ with "a weak beginning", whereas "the pre-sound ("d") entirely disappears in the western part of the steppe". Consequently, */j-/ rather than /y-/ is reconstructed for the early Kazakh.
Also, note /J-/ but /-VzhV-/ between the vowels;
An English-type /J/ in Kyrgyz
Kazan Tatar
and most other Kimak-Kypchak
j'- before -e,-i
y- before -a, -o, -u
Many Kimak-Kypchak languages may have been influenced by the written Kazan Tatar standard in the course of the 20th century, whereas speakers often report a /j-/-type affricate in their native dialects.
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
9
E.g., a speaker of Kazan Tatar insists that his dialect (South Eastern Tatarstan) has a soft /j-/and /y-/ in an allophonic distribution.
Al-Kashgari (1072) reports /j-/ for Kypchak.
Ural Tatar
j-
The Ural Tatar is a poorly researched dialect located in the Urals, presumably a result of the Kazan Tatars immigration from the 15th-16th to the 19th centuries and thus retaining the early characteristics of Kazan Tatar.
North Crimean Tatar j-, sometimes y-
Mostly, always /j-/ in the northern (steppe) dialect, though /y-/ in numbers and a few other common words (such as yaxshi), probably due to borrowings at marketplaces.
Moreover, a /j-/ is reported in Yevpatorian Crimean Tatar.
Karachay-Balkar (1) j- and ch-;
(2) z- and ts-
There are two different dialects in Karachay-Balkar.
No signs of /y-/ even in marginal dialects is reported.
Early Kypchak y- Attested as /y-/ in the Armenian and Mamluk sources.
Yughur y-, sometimes tsh'- There are a few reports from Tenishev about /tsh'-/, as if in Mandarin, but
mostly /y-/ (which could be either an allophonic distribution or an unknown dialect of Yugur)
Salar y-, sometimes dzh'-
Just as in Yugur, Poppe mentions a few words from Potanin's materials, where /y-/ is irregularly rendered as /dzh'-/ in the Russophone transcription, which roughly equivalent to the English /j-/, e.g. dzhigirme, jigirme as opposed to the usual igermi"twenty".
Transoxanian Oghuz (c. 11th century) j- and y-
Confusingly attested as both /j-/ and /y-/ by al-Kashgari, but /j-/ is more certain.
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
10
Turkmen y- < *j-(?)
Because of the attestation of /j-/ in Transoxanian Oghuz, the accepted source of the Seljuk languages, we should deduce that /y-/ may in fact be a later development in Proto-Seljuk, for instance, due to the Karakhanid, Chagatai and Uzbek influence.
Azeri 0- < y- A regular loss of /y-/, as in üræk < yürek "heart"
Turkish y- In some instances, /y-/ may even be weakened further or disappear, as in Azeri, e.g. /biliyor/ "he knows" > /bilior/ in the real pronunciation.
Orkhon Old Turkic (c. 9th century) y- (?) Commonly interpreted as /y-/, but no exact evidence
Karakhanid (11th c.) y- Clearly attested as /y-/ in al-Kashgari's work
Uzbek, Uyghur
y- < *zh-;
j- (Kypchak Uzbek)
j-, y- (Uyghur)
Presently, written as /y-/ probably due to the Karakhanid influence; originally, probably/zh-/ or /j-/ because of the close relatedness to the early Kazakh-Kyrgyz-Kypchak (see below). The /j-/ phoneme is found in the Kypchak dialect of Uzbek (e.g. jaxshï as opposed to the usual yaxshï "good").
Interestingly, Uyghur mostly uses /j-/ and /y-/ interchangeably, so they must be in an allophonic distribution.
This table shows that the pure /y-/ pronunciation is attested only within the following subtaxa:
(1) in the languages historically connected with the Orkhon-Karakhanid and Oghuz-Seljuk subgroups, even though there seems to exist some /y-/-to-/j-/ allophonic distribution in Uyghur, some Uzbek dialects and some Oghuz dialects;
(2) partly, in Yugur and Salar, which also belong to the southern Orkhon-Karakhanid habitat and may have been contaminated by it, considering they are located along the Silk Road outposts, where migrations were a very common phenomenon.
(3) partly, in the /ya-/, /yu-/, /yo-/ syllables, in the languages descending from the late expansion of the Golden Horde, such as Kazan Tatar (but not the Kimak languages with an early separation, such as Karachay-Balkar). Nevertheless, even in Kazan Tatar, many speakers still report
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
11
an allophonic distribution of this phoneme, therefore a clear-cut /y-/ exists mostly in the written standard, produced more or less artificially after the 1920's, as well as in the recently Russified speech, rather than in older dialects or geographically marginal languages, such as North Crimean Tatar, Eastern Bashkir, etc. Moreover, we still have /jil/, not /yil/ "wind" before a high vowel even in the standard Kazan Tatar.
Consequently, we may conclude:
(1) Only the languages related or adjacent to the Oghuz-Orkhon-Karakhanid branch seem to have a clear-cut historical attestation of the /y-/ semi-vowel, whereas the majority of other branches with an early separation and long isolation either get jumbled data or seem to be clearly going back to something like a strongly palatalized sibilant /s'-/, /j-/, /d'-/, /ch-/ or a similar consonant sound.
This provides a purely statistical argument for our conclusion: there are more separate language branches that originally had an /s'-/- or /j-/-type phoneme than those that finally developed the /y/-phoneme. To put it in other words, it is statistically implausible that the supposed /y-/ > /j-/ mutation would have occurred simultaneously and independently in so many separately existing archaic branches.
(2) As we can see in the fig. below, the distribution of the y-type phoneme seems to be located outside of the main historical diversification area of Turkic languages, therefore it appears to be a recent phonological mutation, apparently linked to the migration of the Orkhon-Karakhanid and Oghuz languages, which again implies that the development of /y/ might have been a rather unique phonological innovation in Orkhon-Karakhanid Old Turkic. This provides us with a second phono-geographical argument: only the J-type phoneme seems to be distributed near the putative homeland area of Turkic languages, not the y- semivowel.
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
12
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
13
As to the existence of the allophonic /y-/-to-/j-/ phonological variation in the Kimak-Kypchak-Tatar languages of the Golden Horde, such as Kazan Tatar, the existence of /y-/ may be explained as an early Oghuz influence. As we will show below, the Golden Horde languages and Oghuz share many linguistic features at several levels, therefore this type of borrowing is well corroborated by other evidence of mutual interaction.
(3) Moreover, if /y-/ were present in the proto-form, we would rather observe phonological variations of the semi-vowel /y-/ (not /J-/): e.g. we would find something like /y-/, /i-/, /0-/, /ê-/, /l'-/, /J-/, /zh-/ in the most archaic and diversified Siberian branches in the east (near the historical homeland of the Turkic languages), but what we do see in that area are the phonological variations of the palatalized consonant /s'-/: /s'-/, /s-/, /h-/, /ch'-/, /J-/, /zh-/, /d'-/, /ni-/, /y-/. On the other hand, the expected zero phoneme resulting from the loss of /y-/ is only present in the westernmost languages, such as Azeri (e.g. ulduz < yulduz "star", il < yil "year"), and, partly, in Turkish (cf. ïlïk, but Turkmen yïlï "warm"), which marks the /y-/-phoneme as a relatively recent and rather westernmost phenomenon connected with the spread of the Oghuz-Seljuk languages. This provides us with a phonological diversification argument: if the /y-/ semi-vowel were original, there would be a range of predictable sound changes in the most early diversified branches, but nothing of the kind is found there.
Therefore, from the evidence internal to the Turkic languages alone, we may conclude that the *S- proto-phoneme in question can be placed somewhere within the range of sibilants {/s'-/, /s-/, /h-/, /ch'-/, /J-/, /zh-/, /d'-/}, and it could not have been similar to the /y-/ semivowel as in modern Oghuz-Seljuk languages.
Actually, this conclusion concerning the reconstruction of the Proto-Turkic *S- is hardly novel and has been expounded several times by different authors, such as A.N. Bernshtam (1938), S.E. Malov (1952), N. A. Baskakov (1955), A.M. Scherbak (1970), as well as by the authors of the authoritative Russian publication, sometimes abbreviated as SIGTY, namelyin its volume [Pratyurkskiy yazyk-osnova. Kartina mira pratyurkskogo etnosa po dannym yazyka. (The Proto-Turkic language. The Worldview of the Proto-Turkic ethnicity based on the linguistic data.), Moscow (2006)].
Note: Generally speaking, SIGTY [Sravnintelno-istoricheskaya grammatka tyurkskikh yazykov ("The Comparative Historical Grammar of the Turkic languages")] is a large and verbose multi-volume Moscow compehensive publication with detailed cross-comparative analysis of morphology, syntax, vocabulary, semiotics and other aspects of Turkic languages, produced between the 1970's and the 2000's.
As an additional quite interesting argument, the authors of SIGTY suggest that, since other sonants, such as *r- and *l-, were absent or atypical in the word-initial position, there is no reason to believe that the /*y-/ semi-vowel, phonetically similar to a sonant, could be there either.
The opposite view, which mostly goes back to Radlov's work in the end of the 19th century is usually based on the following incorrect presumptions: (1) that the Karakhanid Old Turkic of Makhmud al-Kashgari is equal to all of the Turkic languages (in other words, that Middle Turkic = late Proto-Turkic); (2) that Orkhon Old Turkic has been correctly and uncontroversially reconstructed from the script and it reflects /y-
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BChttp://en.wikipedia.org/wiki/Radlov
-
Site adresi: The Internal Classification & Migration of Turkic languages
14
/, even though we hardly know the actual pronunciation in the Orkhon inscriptions; (3) that the high level of differentiation among different Turkic subgroups can be ignored, including the evidence for the maximum differencies in the Siberian languages and Chuvash — in this approach the evidence from the Kimak-Kypchak-Tatar languages, for instance, may play the same role as the evidence from Sakha, and indeed this was the situation in Russian and European Turkology until the beginning of the 20th century, when most Turkic languages were officially viewed as merely dialects of each other. Even in SIGTY, Chuvash is still unreasonably included into the mainstream Turkic languages, at least as far as the phonological reconstructions are concerned.
As a final touch, we can describe a phonological calculation based on the above-postulated formula used in the reconstruction of the S-phoneme:
1/2 Proto-Chuvash /s'-/ + 1/2 [1/3 Proto-Yakutic /s-/ + 1/3 (1/2 (1/2 Proto-Altay-Sayan /ch'-/ + 1/2 (1/2 Proto-Kimak-Kypchak /j'-/ + 1/2 Proto-Kyrgyz-Kazakh-Chagatai /j-/)) + 1/3 Proto-Oghuz-Orkhon-Karakhanid /y-/)] =
1/2 Proto-Chuvash /s'-/ + 1/2 [1/3 Proto-Yakutic /s-/ + 1/3 (1/2 Proto-Altay-Sayan /ch'-/ + 1/2 Proto-Great-Steppe /j'-/ ) + 1/3 Proto-Oghuz-Orkhon-Karakhanid /y-/)] =
1/2 Proto-Chuvash /s'-/ + 1/2 [1/3 Proto-Yakutic /s-/ + 1/3 Proto-Central /ch'-/ + 1/3 Proto-Oghuz-Orkhon-Karakhanid /y-/]
It follows from this expression that the original Proto-Bulgaro-Turkic *S-phoneme was most likely similar to a soft palatalized /s'-/ as in modern Chuvash /s'/, Russian /sh'/ or Japanese , hence for instance */s'etti/ "seven" as in the Indo-European *septem, not *yetti, as it perhaps follows from Turkish, Azeri, Uzbek, Karakhanid and other widespread Turkic languages.
At a later stage, the phoneme began to change into a soft palatalized unvoiced /ch'/ or voiced /j'/ after the separation of Proto-Yakutic, whereas the mutation to /y-/ was a relatively recent innovative phenomenon typical only of the sourthern branch of Turkic languages.
2. Collecting factual material
Comprehensive research in Turkology was often hindered by the large number of languages and dialects (somewhere over 50 when all the major dialects are counted) and the lack of detailed grammars and dictionaries for some of them. In many cases, the language descriptions were composed only after the 1920's or even after World War II.
As a result, most of the 19th century's Turkological classifications had originally been built upon phonological criteria alone. The grammatical features were slowly added in in the course of the 20th century, whereas detailed lexcicostatistical and glottochronological analysis seems to be the thing of the recent past that appeared mostly in the 1990's.
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
15
In the present chapter, we will briefly summarize the essential lexical, grammatical and phonological evidence collected as the basis for further examination in the next chapters.
2.1 An overview of the lexicostatistical research in Turkic languages
In the beginning of the 21st century, several authors attempted to conduct some purely statistical studies of the Turkic languages, in most cases without any manual analysis of grammar or vocabulary.
Starostin (1991)
Sergey Starostin [STAH-res-tin] included some very detailed 110-word Swadesh-Yakhontov wordlists for 21 Turkic language in his book [Altajskaja problema i proiskhozhdenije japonskogo jazyka (The Altaic Problem and the Origins of the Japanese language), Moscow (1991)]. These lists were apparently later reintegrated into the Starling database.
Dyachok (2001)
A work conducted by M. Dyachok [pronounced: d-yah-CHOK] was published online as brief preliminary notes. In the introduction to his concise article, the author reminds the reader of the old geography-based classification by Samoylovich [sah-moy-LAW-vich] (1922), which had similar results, and then performs the lexicostatistical and glottochronological analysis of the 13 major Turkic languages. As a result, the Turkic languages were subdivided roughly into merely four basic subgroups (1) Bulgaric (2) Yakut, (3) Tuvan, (4) Western (= any other), which conforms to the idea that their area of maximum diversification was located somewhere in the east.
Dybo (2002, 2007)
The study by Anna Dybo [AHN-nah deh-BAW] was first published in 2001 as part of the articles collected in SIGTY [( Sravnitelnaja grammatika tyurkskikh jazykov (The Comparative Grammar of the Turkic languages)]. Then, it was republished in 2007 in a separate book [Anna Dybo, Lingvisticheskije kontakty rannikh tyurkov. Leksicheskij fond. (The Linguistic Contacts of the Early Turks: the Lexical Fund), Moscow (2007)].
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BChttp://en.wikipedia.org/wiki/Sergei_Anatolyevich_Starostinhttp://starling.rinet.ru/cgi-bin/main.cgi?root=config
-
Site adresi: The Internal Classification & Migration of Turkic languages
16
The study cites Dyachok as a recent lexicostatistical publication and then briefly describes its own methodology, "All the languages, for which the 100-Swadesh wordlists could be collected from written sources, were included into our investigation. The 100-word Yakhontov-Starostin wordlists were employed, taken that they allow better accuracy [= than the classical Swadish-100]; they were processed according to Starostin's methodology by excluding the recognizable borrowings and employing the STARLING program [...]"
As a result, the following dendrogram was obtained:
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
17
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
18
Dybo, Anna, The Chronology of the Turkic languages and the Linguistic Contacts of the Early Turks (2006)
There also exists a second version of this dendrogram that drastically differs from the first one, because of some kind of unexplained procedure that was applied to synonyms. This is slightly confusing and may result in the underestimation of the dendrogram's significance, however the first tree above (with the synonyms included) partly matches the outcome obtained in other investigations. Apart from such unconventional points as (1) the splitting of Turkmen and Turkish between two different taxa, (2) the positions of Yugur and Salar, (3) the slightly misplaced Kazakh (which cannot be directly related to Uzbek) and Uzbek position (which is known historically to be related to Uyghur), it is in fact in relatively good correspondence with other studies. However, the glottochronological part based on Starostin's formulas should be taken with a grain of salt.
It should also be noted that the use of shorter 110-word lists results in lower statistical robustness than in the current series of publications that uses larger 215-word lists. Nevertheless, this work has an advantage of representing a greater set of languages, especially those of the Altay-Sayan area, which are normally underestimated or omitted in other studies.
ASJP (2009)
Another example of a phonostatistical research that merits mentioning is the automated dendrogram built by the Automated Similarity Judgment Program for most languages of the world. Here's a preliminary an simplified first-approximation phonostatistical dendrogram of Turkic languages (gif) from 04/2009.
The study was based on a simple 40-word list. Many branches seem to be mispositioned, apparently due to certain limitations of the ASJP's initial approach, however you can see the early separation of Proto-Chuvash, then Proto-Oghuz, and then the rest of the languages, which is partly consistent with the conclusions obtained in the present work and other studies.
Herein (2009, 2012)
To prepare a lexicostatistical research for this publication, it was decided to use the readily available 200-word Swadesh lists from Wiktionary.org.
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BChttp://email.eva.mpg.de/%7Ewichmann/ASJPHomePage.htmhttp://email.eva.mpg.de/%7Ewichmann/ASJPHomePage.htmhttp://turkic-languages.scienceontheweb.net/turkic_asjp.gifhttp://turkic-languages.scienceontheweb.net/turkic_asjp.gif
-
Site adresi: The Internal Classification & Migration of Turkic languages
19
After verifying and correcting the available materials, building some new lists for absent languages (such as Khakas, Tuvan, Altai) (2009), composing a php-program to do all the routine calculations, performing some additional meticulous examinations and adding some new lexical material thus expanding the lists to 215 entries (2012), another lexicostatistical study named The Lexicostatistics and Glottochronology of the Turkic languages was finally produced.
It should be noted that the lexicostatistical figures obtained in 2009 and 2012 sometimes differed significantly from each other, because of different approaches used to account for the unavoidable synonymy. The 2009 approach had been much too basic and consequently was significantly enhanced in 2011-12, which included both reexamining the original lists and introducing changes into the program application, so the present version is to be considered more correct.
Most borrowings (Persian, Arabic, Mongolian, Russian, etc) were excluded wherever possible, so only the verified cognates were counted in the final glottochronological section of the study. In the doubtful cases the cognacy was determined according to the [Etymologicheskij slovar chuvashskego jazyka (The etymological Dictionary of Chuvash), by M. Fedotov; volume 1-2, Cheboksary (1996)] and sometimes using the [Etymologicheskij slovar tyurkskikh jazykov (The etymological Dictionary of the Turkic languages), E. V. Sevortyan, Vol. 1-7, Moscow (1974-2003)].
The lexical lists presently differ from the Wiktionary.org materials and are available online as a Word document
As the final outcome of the study, several lexicostatistical matrices of Turkic languages were built.
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BChttp://turkic-languages.scienceontheweb.net/Turkic_languages_glottochronology.htmlhttp://turkic-languages.scienceontheweb.net/Turkic_languages_glottochronology.html
-
Site adresi: The Internal Classification & Migration of Turkic languages
20
The Lexicostatistical Matrix of Turkic languages, Swadesh-215 (02.2012), borrowings excluded
Chuvash Sakha Tuvan Khakas Standard Altay Kyrgyz Kazakh Uzbek Uyghur Karachay Bashkir Tatar Turkmen Azeri
Sakha 51.9%
Tuvan 49.3% 57%
Khakas 52.8% 61.3% 71.9%
Standard Altay 50.9% 55.9% 69.3% 75.6%
Kyrgyz 57.9% 59.6% 63.3% 70.3% 74.6%
Kazakh 58.2% 59.4% 61.6% 68.1% 69.9% 92%
Uzbek 61.1% 57.8% 58.2% 65.3% 66.3% 82.9% 82.8%
Uyghur 59.2% 59% 61.7% 65.7% 70.2% 83.8% 81.9% 86.3%
Karachay 57.5% 60.8% 58.7% 65.1% 65.2% 77.8% 78.3% 74.6% 77.1%
Bashkir 58.3% 59.4% 59.9% 67.1% 69% 82% 79.9% 76.1% 78.5% 77.4%
Tatar 59.4% 60.7% 60.2% 68.2% 70.1% 83.9% 82.1% 78% 79.6% 79.2% 94.9%
Turkmen 55.6% 55% 54.7% 61.2% 59.5% 71.2% 71.9% 75.9% 71.7% 69.2% 71.9% 69.8%
Azeri 55.6% 51.8% 51.8% 56.4% 58.4% 66.9% 67.8% 70% 68.8%. 66.9% 66% 68.4% 78.2%
Turkish 54.9% 52% 50% 53.8% 54.4% 64.9% 64.8% 67.2% 66.7% 64.2% 62.8% 65.6% 73.6% 86%
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
21
Considering that an accurate analysis is supposed to include phonological, grammatical, historical and other non-lexical evidence, the lexicostatistical data alone are most likely insufficient to build a complete dendrogram of the Turkic languages at this point,
However, we can use the values in the table to build a wave model of Turkic languages that would reflect the mutual language intelligibility through the calculated relationships in the basic vocabulary. The wave model should be based on the borrowings-included matrix, because it is supposed to represent the mutual intelligibility as it is, without any exclusions, for this reason you may notice some small discrepancy in percentages with the table above.
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
22
The wave model of the Turkic languages with borrowings included from
[The Lexicostatistics and Glottochronology of the Turkic languages (2009-2012)]
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
23
2.2 Dissimilar basic lexemes in the Turkic languages
Another brief lexical table prepared in 2009 included a visual overview of certain lexemes that are known to be dissimilar within the core Turkic languages. These lexical data help to pick up dissimilarities between otherwise closely related groups and assist in identifying large supertaxa.
Dissimilar Basic Words in the Turkic languages
Red is a more ancient layer associated with the Siberian Turkic languages, brown marks the Oghuz-Turkmen innovations; blue is a more recent layer
probably connected with the spread of the Gökturks; green marks probable "Central Turkic" innovations; orange marks the Altay-Sayan (Tuvan + Khakas + Altai) innovations; purple marks the Yakutic innovations or otherwise differentiated Yakutic words; gray and black are "other" or unclassified. Borrowings may be included.
Turkish
Azeri Turkmen
Uzbek
Uyghur
Karakhanid
Kazan Tatar
Karachay Kazakh Kyrgyz Khakas Tuvan Sakha
Seljuk Oghuz Karkhanid-Chagatai Kimak-Kypchak Kazakh Kyrgyz Yenisei-Kyrgyz Yakutic
not (adj, nouns)
Tk. deGil;
Az. deyil däl
Uz. emas;
Uy. emes
Kh. ermes
KT. tügel;
KB. tüyse emes emes
Kh. nimes; chox
Al. emes; d'ok eves; chok suox
here Tk. burada;
Az. burada < *bu ara-da
shu tayda;
bäri
Uz. buyerda;
Uy. buyerde; mana
K. munda
KT. monda, bireda;
KB. mïnda, blaida mûnda mïnda
Kh. mïnda
Al. mïnda mïnda manna
there
Tk. orada;
Az. orada
< *o ara-da
o tayda;
ol yerde
Uz. uyerda;
Uy. uyerde;
KT. anda, shul zherde;
KB. anda, alaida onda anda
Kh. anda
Al. anda aNa: onno
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
24
how Tk. nasïl;
Az. nechê nähili
Uz. qanday
Uy. qandaq
KT. nichek;
KB. qalay qalay qanday
Kh. xaidi
Al. kandïy qandïg xaidax
many Tk. chok;
Az. chox köp
Uz. kûp
Uy. köp
Kh. talim; kûp
KT. küp
KB. köp köp köp
Kh. köp
Al. köp xöy elbex, ügüs
wide Tk. genish;
Az. genish giNish; giN
Uz. keN
Uy. keN
Kh. keN
KT. kiN
KB. keN keN keN
Kh. chalbax
Al. d'albak kalbak, chalbak kieN
forest Tk. orman;
Az. orman tokay
Uz. ûrmon
Uy. ormanliq
KT. urman
B. aGach
toGay;
orman
tokoy;
orman
Kh. agas;
Al. arka arga, arïg tïa
root Tk. kök;
Az. kök kök
Uz. ildiz
Uy. iltiz
Kh. yildiz
KT. tamïr
KB. tamïr tamïr tamïr
Kh. tazïl; chilige
Al. tazïl t.azïl silis
bark (n) Tk. kabuk;
Az. qabïq gabïk
Uz. qobuq
Uy. qovzaq
KT. kabïk
KB. qabuq qabïq qabïq
Kh. xabïx
Al. chobra chövure: xatïrïq
flower
Tk. gül "rose";chichek
Az. gül; chichêk
gül
Uz. gül; chichak
Uy. gül; chichek
Kh. chichek
KT. göl; chêchêk
KB. gül; gokka
gül;
shêshêk gül
Kh. chaxayax
Al. chechek chechek sibekki
fat (n) Tk. yaG; yaG Uz. yoG; may KT. may; may may Kh. üs, zhaG üs, chaG sïa
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
25
Az. yaG; Uy. yaG; may
KB. jau Al. üs
nose Tk. burun;
Az. burun burun
Uz. burun;
Uy. burun
KT. borïn;
KB. burun mûrïn murun
Kh. purun, tumzux;
Al. tumchuq t.umchuq murun
hand
Tk. el;
Az. êl el
Uz. qûl
Uy. qol
Kh. elig
KT. kul;
KB. qol qol qol
Kh. xol
Al. qol xol ili:
liver Tk. (kara) chiGer
Az. chiyer bagïr
Uz. zhigar; baGir;
Uy. jiger; beGir
Kh. baGir
KT. bawïr;
KB. baur bawïr boor
Kh. paar
Al. buur p.aar bïar
think Tk. düshün-
Az. düshün- öyt-
Uz. ûyla-;
Uy. oyli-
KT. uyla-;
KB. oymla- oyl- oyl-
Kh. sagïn-
Al. sanan p.od- sana:
live Tk. yasha-
Az. yasha- yasha-
Uz. yasha-;
Uy. yashi-
KT. yashê-;
KB. jasha- zhas- zhash-
Kh. churt-
Al. d'ür- churtt- olor; sïrït
say Tk. de-
Az. de- diy
Uz. ayt-; de-
Uy. eyt-; de-
Kh. ay-; de-
KT. êyt-
KB. ayt- ait-; de- ait-; desh
Kh.cho:xt-
Al. ayt- chug-; t.e:- die, et
sky Tk. gök
Az. göy gök
Uz. kûk; asman
Uy. kök; asman
KT. kük
KB. kük kök (rare);aspan
kök (rare);asman
Kh. tigir
Al. teNeri t.e:r xalla:n
burn (intr.) Tk. yan- öt-; yan- Uz. yon- KT. yan- zhan- köy-; zhan- Kh. köy- kïv- umai
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
26
Az. yan- Uy. yan-; köy- KB. jan- Al. küy-
night Tk. geche
Az. geche gije
Uz. tün
Uy. tün
Kh. tün; kecha
KT. tün
KB. köche tün tün
Kh. tün
Al. tün t.ün tü:n
yesterday Tk. dün
Az. dünên düyn
Uz. kech
Uy. tünügün
KT. kichê
KB. tünene keshe keche
Kh. kiche
Al. keche t.ü:n beHehe:
evening Tk. aksham
Az. axsham agsham
Uz. okshom; kecha
Uy. axsham; keche;
Kh. axsham
KT. kich
KB. ingir kesh kech
Kh.i:r
Al. engir kezhe: kiehe
big Tk. büyük
Az. böyük ulï; chishik
Uz. büyük; katta
Uy. büyük;yoGan,zor;chong
Kh. uluG
KT. zur
KB. ullu
ülken;
zor chong
Kh. ulug;
Al. d'a:n ulug ulaxan
child Tk. choJuk
Az. ushaq, chaga
chaga Uz. bola;
Uy. bala
KT. bala; sabii
KB. sabii bala bala
Kh. pala;
Al. bala urug oGo
face Tk. yüz;
Az. üz yüz
Uz. yuz
Uy. yüz
KT. bit; yöz;
KB. bet
bet;
zhüz;
shïray
bet Kh. sïray;
Al. d'üs; chïray shïray sirey
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
27
island Tk. ada;
Az. ada ada
Uz. orol;
Uy. aral;
Kh. utruG
KT. utrau;
KB. ayrïmkan aral aral
Kh. oltïrïx;
Al. ortolïk ortuluk arï
owl Tk. baykush
Az. baykush baygush
Uz. boygoli;
Uy. baykux
KT. yabalaq; ökö (dial.)
KB. uku üki ükü
Kh. tasxa;
Al. mechirtke mezhergen mekchirge
tomorrow Tk. yarïn
Az. sabah ertir
Uz. ertaga
Uy. ete
KT. irtêgê;
KB. tambla erteN erteN
Kh. taNda;
Al. erten erten; t.a:rta sarsïn
voice Tk. ses
Az. sês ses
Uz. ovoz
Uy. awaz
KT. tavïsh, avaz
KB. auaz dawïs ün
Kh. ün
Al. ün ün kuolas, saNa
wet Tk . yash
Az. yash öl
Uz. ho'l
Uy. höl
KT. yuesh, dïmlï
KB. mïlï, Jibigen
dïmqïl;
su nïm, nïmdu:
Kh. öl
Al. ülüsh , chïqtu
öl, mö:n, shal incheGey, u:la:x, si:kte:x
2.3 The comparison of phonological and grammatical features
Mudrak (2002, 2009)
The multivolume Moscow edition SIGTY. Regionalnyiye rekonstruktsii ("The Comparative Grammar of Turkic languages. Regional Reconstructions.") (2002) included an abbreviated article by Russian Turkologist Oleg Mudrak [aw-LEG moo-DRAHK; the name is etymologically akin to mudryj "wise, sagacious"] Ob utochnenii klassifikatsii tyurkskikh yazykov s pomosch'yu morphologicheskoy lingvostatistiki (On the clarification of the Turkic languages classification by means of morphological linguostatistics). It was subsequently republished in full as a separate book in 2009, and then briefly reviewed in a public lecture on the history of Turkic languages (available at youtube.com and as an magazine article).
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
28
The study uses a unique statistical analysis of 96 morphological and phonological features counted up for as many as 42 Turkic languages and major dialects, and builds up trees with glottochronological dates (though based again on the apparently incorrect Starostin's glottochronological formulas), checking them for consistency with the major historical events. This purely morphostatistical analysis is an extremely interesting and apparently completely novel approach in historical linguistics. The obtained dendrograms roughly coincided with the present study by about 80%, though differed in certain aspects.
* * *
The purely grammatical approach by Mudrak prompted us to take a closer look at the morphological features, which are well-known to be more resistant to borrowings than common words thus providing more robust results. Finally, a similar study of phono-morphological differences within the Turkic languages was conducted (2009).
The following table contains a list of certain phonological and grammatical features known to be different across Turkic languages, so studying them helps to establish the exact order of their taxonomic diversification.
It should be acknowledged that the former analysis of phono-morphological features by Mudrak (2009) seems to be more detailed, particularly as far as the number of included languages is concerned. However, even though many additional grammatical and phonological characteristics are not explicitly mentioned in the table of phonological and morphological differences, they are often described below under paragraphs for specific Turkic languages.
Much of the morphological and phonological data in the table have been collected from the encyclopedic edition [Jazyki mira: Tyurkskije jazyki (The Languages of the World: The Turkic Languages); editorial board: E. Tenishev, E. Potselujevskij, I. Kormushin, A. Kibrik, et al; The Russian Academy of Sciences (1996)], which is a detailed, comprehensive and authoritative publication consisting of articles by specific authors and brief phonetical and grammatical descriptions of each Turkic language. Other data were collected directly from grammar books on specific languages.
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
29
Some of the phonological and morphological differences within the Turkic languages
The table may contain simplifications in transcribing vocal harmony
y-/
J-
-G-/
-w-
-d-/
-y-
b-/p-
t-/d-
g-/k-
G/q-
Instrum
ental
case
Other
cases Plural
Dative
"Perfect"
Participle
Negation of adjectives, nouns
"We did"
ending
"We do"
Aorist
ending
"I do"
Aorist
ending
Use of
tur- or
any other copula
Future
Tense
someone,
somewhere,
no one,
nowhere
you
(plural)
Chu
vash
s'- -v- -r- p-, t-,
k-, x- -pa, -pe
Goal-directed
-shan,
-shen
-sem -a, -e – mar -r-âmâr,
-r-êmêr
-âpâr
-êpêr
-âm
-êm –
-at-, -et-
-0-
ta-kam; tashta;
nikam ta; nishta ta
esir
Sakh
a
s- -0:- -t-
b-, t-,
k-, k-
-nan
Partial
-ta;
Compar.
-ta:Gar;
-lar, -ler, -lor, -lör, -nar, -ner,
-dar, der,
-tar, etc
-ga -bit, -bït suox;
buol-batax
-ti-bït/bit,
-li-bït/bit
-bït/bit, -pït/pit
-bïn/bin, -pïn/pin
verb-an+ tur + pronoun = past tense
-ïah-;
-a:ya- /
-eye-i = optative
(apprehen-
sive)
kim ere,
xanna ere,
kim da + negative,
xanna da + negat.
ehigi
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
30
Tuv
an
ch- -0:- -d-
weak semivoiced
: strong unvoiced:
*q > x
–
Directive
-dïva,
-dive,-duva,
-düve,-tïva, etc
-lar, -ler, -nar, -ner, -tar, -ter, -dar, -der
-ga/ge,
-ka/ke -gan, etc eves; chok -dï-vïs
-vïs, -vis -vüs, -vus
men
verb + p + tur (chïdïr, olur) + pronoun =Present
-ïr-;
Gai/gei,
qai/kei = optative
bir-(le) kizhi;
bir-(le) cherde;
kïm-da: + negativ;
kaida-da: + negative
siler
Tof
alar
ch- -0:- -d-
weak semivoiced
: strong unvoiced
–
Partial
-da, -de,
-ta, -te
-lar, -ler, -nar, -ner, -tar, -ter
-Ga/Ge,
-qa/qe
-Gan/Gen,
-qan/ qen emes -dï-vïs -bis men
verb + p + turu (chïêtïrï, oluru) + pronoun = Present tense
-ar/er/ïr/ir-;
Gai/gei,
qai/kei = optative
--
qum-ta: + negat.
--
siler
Kha
kas ch-,
n'- -0:- -z-
p-, t-,
k-, x-
-naN,
-neN
Directive
-za, -zer,
-sar, -ser,
-nzar, -nzer
-lar, -ler,
-nar, -ner,
-tar, -ter
-ga/ge
-xa/ke, -na/ne, -a/e
-Gan/gen,
-xan/ken nimes; chox -dï-bïs
-bïs/bis
-pïs/pis
-mïs/mis
-bïn/bin
-pïn/pin
-mïn/min;
-ïm, -am
verb + (p) + tur + pronoun = Audative or Archaic past;
-ar/er/r-;
Gai/gei,
qai/kei = optative
kem-de,
xayda-da;
kem-de + negat.
xayda-da + negat.
sirer
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
31
Kum
andy
ch-,
n'- -0:- -y-
b/p-, t-,
k-,
k(q)-
–
Directive
-za, -ze,
-sa, -se
-lar, -ler,
-nar, -ner,
-dar, der,
-tar, -ter,
-ga, -ge, -ka, -ke
-a, -e,
etc
-gan, -gen, -kan, -ken
eves, emes;
chok, chox
-dï-bïs,
-di-bis, -dï-vïs
-bïs, -bis,
-pïs, -pis
-ïm, -am
verb + ïp + tur + pronoun = Audative past;
verb + a/e + tur + -ar + pers ending = Present Future;
-ar/er/r-;
-ad, -ed
Gai/gei,
qai/kei = Optative
kem-de,
kayda-da;
---
sner,
snir
Stan
dard
A
ltai
d'- -0:- -y- b-, t-,
k-, q- – –
-lar, -ler, -lor, -lör,
-dar, der,
-dor, dör,
-tar, -ter,
-tor, -tör
-ga, -ge,
-go, -gö, etc
-gan/gên, -kan/kên emes; d'ok
-(ï)bïs/(i)bis,
ïs/is, -ïk/ik
-bïs,
-bis,
-bïn/bin
-pïn/pin
-mïn/min
verb + dïr + pers ending = audative past;
verb + a/e + dïr + pers ending = Present Continuous;
verb + ïp/ip + tur + d + pers ending = Past Continuous;
-ar/er/r-;
-at/et-;
Gai/gei,
qai/kei = Optative
kem-de,
*kayda-da;
---
slerler
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
32
Kyr
gyz
J- -0:- -y- b-, t-,
k-, q- – –
-lar, -ler, -lor, -lör,
-dar, der,
-dor, dör,
-tar, -ter,
-tor, -tör
——————
-ga, -ge, -go, -gö, -ka, etc
-gan- emes -dik, etc -(ï)bïz -mïn
verb + ïptïr = audative past;
verb + ïp + tur (otur, Jat, Jur) + pronoun = Present Continuos;
-ar;
Gai/gei,
qai/kei = Optative
(kimdir) birö:,
kayda-dïr (bir Jerde);
ech kim;
ech kaida, ech Jerde
siler,
sizder
siz (polite)
Kaz
akh J-,
zh- -w- -y- b-, t-,
k-, q- -men, -pen –
-lar, -ler,
-dar, der,
-tar, -ter,
-Ga, -ge,
-qa, -qe
-Gan, -Gen
-qan, -qen emes -dïq, -dik -mïz, -miz
-bïn/bin
-pïn/pin
-mïn/min
verb + ïp + tûr (otur, Jatïrt, Jür) + pronoun = Present Continuos;
-ar/er/r;
-baq/bek-,
-paq/pek-,
-maq/mek-
êlde-bireu, êldekim
bir Jerde
esh kim;
esh kaida, esh Jerde
sender;
siz,
sizder(polite)
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
33
Uzb
ek
y- -G- -y- b-, t-,
k-, q- – – -lar -ga
-gan, -qan,
-mïsh- emas
-dik; -dimiz
(dialectical
variation)
-(i)miz -man
verb + ïp + tûr (ûtir, yot, yür) + pronoun = Present Continuos;
-a-, -y-;
-ar/r;
allakim, kimdir
--
hech kim;
hech qayerda;
siz
Uyg
hur
y- -G- -y- b-, t-,
k-, q- – – -lar, -lêr
-gê, -qa, -ka,-kê,-qê
-Gan êmês -duk, -tuq -(i)miz -mên
verb + ïp + tur (oltur, yat, yür) + pronoun = Present Continuos;
-i--;
-ar/r;
kimdu, biri
--
hech qaysi, hech kim;
hech yerde;
silêr,
siz (polite)
Cha
gata
i
y- -G- -y-
b-, t-,
k-, q-
– – -lAr
-Ga, -gä,
-qa, -kä
-Gan, -Gän
-mïsh- (rare)
e(r)mäs, yoq
-dïq (or similar) -(i)bïz
-men
(-Am)
noun + dur(ur);
verb + -A + dur-pronoun;
verb +Yp + -dur;
-Gu- kishi,
siz,
sizlär
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
34
Bar
aba
y- -y- b-, t-,
k-, q- – –
-lar, -nar, -tar
-qa -Gan tügil -dïq, etc -bïs, -mïn,
-Am
verb + ïp + tur (otïr, yat) + pronoun = Present Continuos (rare);
-ïr;
-
silär;
siz (polite)
Kar
acha
y
J-, ch- -w- -y-
b-, t-,
k-, q- – – -la, -lê
-ga/-xa/ -ge, -na/ -ne, -a/e
-Gan/gen tüyül -diq, -duk, -dük, etc
-bïz, -biz, etc
-ma, -me
verb + a/e + tur + pronouns = Present Continuous;
-ïr;
-rïq/nïq/lïq;
kim ese da,
qaida ese da,
--
siz
Tat
ar y-
,Ji-, Je-
-w- -y- b-, t-,
k-, q- –
Comparat.
-day, -tay,
-dêy, -dïy,
etc.
Locat-Temp.
-dagï, -tagï,
-dêge
-lar, -lêr, -nar, -nêr
-ga, -gê, -ka, -kê; -na/nê, -a/ê
-gan, -kên
tügel;
participle + pers. ending + yuk
-dïk, etc -bïz, etc -m(ïn) noun (3rd pers) + -dYr, -tYr
-ïr;
-achak;
kemder; kaidadïr;
berkaida;
ber kem (dê), hichkem;
(ber) kaida da
hich ber Jirdê;
sez
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
35
Cum
an-
Polo
vtsi
an
-y- b-, t-,
k-, q- – – -lar, -ler
-Ga, -ge, -qa, -ke; -a, -ê
-mYsh- -bïz -man,
-men
noun (3rd pers) + -dYr, -tYr
-Gai/-gei,
-kai/-kei siz
Tur
kmen
y- -G- -y-
b-, d-,
g-, G-
– – -lar, -ler
-a, -ä, -e;
-na, -ne
-mYsh
Used only as audative particle
dêl,
participle + pers. ending + -ok
-dYk -Ys
-ïn,
-in,
-un,
-ün
verb + ïp + dur (otïr, yat) + pronoun = Present Continuos;
verb + ïp + tïr + pronoun = Past Audative;
verb, noun (3rd pers) + -dYr, -tYr
-ar, -ïr;
-Jak, -Jek (no endings)
siz
Aze
ri
y- -G- -y- b-, d-,
g-, G- – – -lar, -ler -a, -ê
-mYsh-
Used as audative particle and perfect tense
deyil -dYg -Yg -êm;
-am
verb, noun (3rd pers) + -dYr, -tYr
-(y)acak(G-,
-(y)ecek(G-)
hech kim siz
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
36
Tur
kish
y- -G- -y- b-, d-,
g-, G- – – -lar, -ler
-(y)a, -(y)e
-mYsh-
Used as audative particle and perfect tense
deil, de(G)il -dYk -Yz
-ïm,
-im,
-um,
-üm
verb, noun (3rd pers) + -dYr, -tYr
-ar, -ïr;
-acak(G-),
-ecek(G-)
kimse,
bir shey;
hich kimse,
hich bir shey
siz
Kha
laj
y- -G- -d- b-, t-,
k-, q- -la
Locative
-cha -lar
-ka, -qa, -yä
-mYsh- daG -dimiz,
-dYk < Azeri
-(ï)mïz,
-uq < Azeri
-Vm
är (conjugated
copula)
-(ï)Ga siz
Kar
akha
nid
y- -G- -ð- b-, t-,
k-, q-
-ïn, -in, -un, -ün, -nïn,-nin
– -lar, -lär
-qa, -kê,
-Ga, -gê,
-a, -ê,
-Garu, -gerü
-mïsh-, -mish;
-Gan-,
-gen-, -qan,
-ken-
ärmês;
yok
-dimiz,
-duk -biz, -miz
ol (3rd pers. copula)
-Gay, -gey, -qay, -kêy
siz
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
37
Kho
rezm
ian
y- b-, t-,
k-, q-
-n, -ïn, -in, -un, -ün, -an, -än
-lar -qa, -kä, -a, -ä
-mïsh-,
-mish-
ärmäz, ärmäs;
däGül, dügül (rare); yok
-duq, -dïq -biz -män
er-;
-b turur = perfect past;
-a turur = repetetive present
-Gay, -gäy, -qay, -käy, -Ga, -gä, -qa, -kä
(siz)
Old
Uyg
hur
(Koj
o)
y- -
-ð-,
-d-,
-z-,
b-, t-,
k-, q-
-ïn, -in, -un, -ün, -nïn,-nin
Equative
-cha -lar, -lär
-qa, -kä,
-Ga, -gê, -Na, Nä;
-Garu, -gärü
-mïsh-,
-mish-
täGül;
ärmäz
-tïmïz,
-dimiz
-biz, -miz, -bïz -mïz
-män ärür (copula)
-Gay, -gäy
-Galïr;
-tachï, -dachï
siz
Ork
hon
Old
Tur
kic
y-? -G-,
-G -ð-
b-, t-,
k-, q- -ïn, -in
Equative
-cha
-lar, -lär
-qa, -gä,
-ya, -yä;
-Garu, -gärü
-mïsh-, -mish;
-Gan- –; jok
-timiz,
-dïmïz -biz -män er- -tachï, -dachï siz
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
38
Sala
r
y- -G- -t-,
-y-
weak semivoiced
: strong unvoiced
– – -lar, -lär, -ner
-Ga, -ge,
-qa, -ke,
-a, -e
-Gan, -gen;
-mïsh-
emes,
emes-tïr, emes-ar,
yox-tïr
– – –
noun + dïr (idïr-, oN;irar); adj + dïr (idïr + oN;irar);
verb + p + o(r) + (tur) = Present I;
verb + qu(r) +
( tur) = Future I;
verb + q/Gan + dïr = Past II;
-ar/er/ïr/ir;
-qur/Gur
k'em-ter
--
niNgi
--
seler
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
39
Yug
ur y-
tsh-, -G- -d-
weak semivoiced
: strong unvoiced
–
Compar.
-daG, -deg,
-taG,
-teg
-lar, -ler, -nar, -ner,
-dar, -der,
-tar, -ter
-Ga, -ge,
-qa, -qe
-Gan
emes-tro;
yoqer,
yok-tro,
yoq-pe-tro
– – –
i:re = copula;
verb + Gan + tïr = Present Tense;
verb + qïsh + tro = Future;
verb + Gan + tro = Past II;
verb + ïp/ip + tro = Past III;
-ar;
-qïsh-tro,
-Gïsh-tro
-qïsh-ere;
-Gu, -gu, -Go, -go; -Gï, -ge, -kï, -ke
-qïr/Gïr
qïm-er, nier
--
qïm-ma,
nima
siller
seler
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
40
3. Making Taxonomic Conclusions
With all the lexical and grammatical material collected in the previous chapter, we can finally get down to the analysis of each Turkic branch. Then, we will be able to attempt to make taxonomic conclusions concerning the position of each language in the phylogenetic dendrogram.
Note: Taxon is a general concept of classification science borrowed from biology which encompasses other subdivisions, such as group, family, macrofamily, etc. However for all practical purposes, we do not usually dinstinguish between (sub)group and (sub)taxon in this article. The usage of expression "the (Name) taxon" is thought to be equivalent to "the (Name) languages". The term "family" cannot be used except for the language taxa of high order with a temporal separation of more than 5000 years, e.g. "the Indo-European family", but hardly "the Turkic family", except maybe in the context where it would be necessary to underline the early separation of Proto-Bulgaro-Turkic from Proto-Altaic.
The Bulgaric subgroup
Chuvash, the only modern-day representative of Volga Bulgaric within the Bulgaric taxon, was definitively shown to be related to Turkic by Nicholas Poppe [Chuvashskij jazyk i jego otnoshenije k mongolskomu i tyurkskim jazykam (Chuvash and its relatedness to Mongolian and the Turkic languages), Nicholas Poppe (1924)]. Poppe established regular phonological correspondences between Chuvash and other Turkic languages. In his work, he listed several influential Turkologists (Adelung (1820), Rask (1834), Ramstedt (1922-23)) who had understood and accepted the Turkic origins of Chuvash long before his publication. Moreover, according to Alexander Samoylovich, Poppe had shown that "the Chuvash and Bulgaric languages do not stem from "Proto-Turkish" (z-group), but rather from the common progenitor of both of these groups", thus setting Chuvash aside from the rest of the Turkic languages. [Alexander Samoylovich, K voprosu o klassifikatsiji turetskikh jazykov (Towards the question of the classification of Turkish languages // The Bulletin of the 1st Turkological Congress of the Soviet Union (1926); reprinted in the collection of his works (2005)].
This positioning of Chuvash within the Turkic tree has changed little ever since. For this reason, Chuvash has not been considered herein in much detail, mostly because of its evidently early separation that does not cause much controversy among scholars.
Some of the exclusive Bulgaric features
Bulgaric phonology
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BChttp://en.wikipedia.org/wiki/Nicholas_Poppe
-
Site adresi: The Internal Classification & Migration of Turkic languages
41
(1) The famous Bulgaric rhotacism vs. the Turkic Proper zetacism, or the persistent use of /–r/ where other Turkic languages normally have /-z/ (though in some cases –r- can also be found in certain positions in Turkic Proper as well, for instance apparently in in the Aorist Tense). An intermediate pronunciation of /r/ and /z/ is found in Czech.
(2) Chuvash /-l/ vs. Turkic Proper /-sh/;
We have noted several times that the correspondant proto-Bulgaro-Turkic l/s- liquid seems to survive in modern Khalka Mongolian, cf. the pronunciation of ula:n "red" as /ush'a:n, uLa:n/, where /L/ denotes this unique liquid affricate.
Practically speaking, the huge phonological difference between Chuvash and any other Turkic language can be easily observed by comparing almost any Chuvash word, such as 1-10 numbers, to its Turkic Proper equivalent.
Bulgaric grammar
(1) the peculiar plural marker –sem in Chuvash (of seemingly unknown origin), absent not only in Turkic but apparently in other Altaic languages. It has been conjectured by a Soviet scholar in a separte article that the Chuvash -sem, which rather regularly goes back to *-sen, may only be similar to Kamassian (South Samoyedic) -saN. [Kamassian located in the East Sayan Mountains could be in contact with the early Turkic languages, however there is no clear explanation for this phenomenon.]
(2) a peculiar goal-directed case expressed by –shan, -shen;
(3) many contracted grammatical forms and a rather simplified grammar in Chuvash (generally typical of contact or "creolized" languages);
Bulgaric lexis
The lexical difference between Chuvash and any other Turkic language amounts to an average of 54.5% (Swadesh-215, borrowings excluded).
That is roughly equivalent or a little lower than to the lexicostatistical difference between English and any other Germanic language. A similar conclusion has been made by Talat Tekin in [Türk Dilleri Ailesi (The Turkic Language Family) // Genel Dilbilim Dergisi, Vol. 2, pp. 7-8, Ankara (1979)], who compared the actual difference between Chuvash and Turkish to the difference between English and German, the latter two,
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BChttp://www.dilimiz.com/dil/turkdiliailesi.htm
-
Site adresi: The Internal Classification & Migration of Turkic languages
42
of course, apart from formally belonging to the same Germanic group and sharing a number of common basic words, are far from being closely related or mutually intelligible.
There is a considerable number of Kazan Tatar lexemes found in the Chuvash basic vocabulary. These lexemes are normally recognizable by their typical non-Bulgaric phonological shape similar to Kazan Tatar or/and the existence of a parallel native word, e.g. yapâx "bad", yeshêl "green (about grass)", tinês "sea", chechek "flower", vârlâx "seed", kashkâr "wolf",kuyan "hare", utrav "island", yêbe "wet" (cf. Tatar jeben-, Bashkir yeben- "to get wet"), têrês "right, correct", etc.
Such common words as kus' "eye" and pus' "head" may in fact be too the Tatar borrowings, taken that they lack the r-ending that is expected in the Proto-Volga-Bulgaric reconstructions *xêl and *pul.
The abbreviated grammar and the considerable number of Kazan Tatar loanwords should be taken into consideration when making conclusions about the origins of Chuvash. Could the early Chuvash be strongly impacted by the Golden Horde language in the past? However, the number of borrowings in Chuvash is hardly much greater than in many other Turkic languages.
Bulgaric glottochronology
Glottochronologically, the separation of a language with the 55% of lexicostatistical differentiation should roughly correspond to anything between 900-1100 BC on the temporal scale. Note that this number has been calculated according to the local temporal calibration, which is neither the standard textbook figure, nor Starostin's method, see again The Glottochronology of the Turkic languages.
However, there is some uncertainty concerning this value, because of the logarithmic and statistical nature of the glottochronological principles that makes them prone to errors, particularly in the cases of standalone languages. Indeed, the lack of any present-day Chuvash siblings that could allow for a statistical averaging to cancel out any fluctuations, raises doubts about the robustness of this figure. As a result, a relatively small error, which may be due, for instance, to the infiltration of Tatar borrowings, may result in even greater discrepancy when extrapolated beyond the calibration interval, logarithmically modified and projected onto the temporal axis.
At any rate, despite these doubts, the number of about 54-55% is relatively stable, and nearly all the previous estimations performed between 2009-2012 (with the borrowings excluded or included, with different ways to treat synonymy, etc.) have pointed to the early separation of Chuvash, at least as early as 500 BC, but with 1000-1100 BC being a more likely period. Archaeologically, this era of 800-300 BC coincides with the onset
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BChttp://turkic-languages.scienceontheweb.net/Turkic_languages_glottochronology.html
-
Site adresi: The Internal Classification & Migration of Turkic languages
43
of the early Iron Age in West Siberia, so we may further attempt to support this date by making tentative assumptions about the active use of iron weapons and horse harness during that period, which might somehow have contributed to the Proto-Bulgaric and Proto-Turkic separation.
As it has been mentioned several times, the presence of relatively late dates for the Chuvash separation in other parallel works [Dyachok (2001), Dybo (2006), Mudrak (2009)] is most likely rooted in the application of Starostin's non-logarithmic formulas.
Bulgaric history and geography
In geography, a rather unique European position of Chuvash west of the Urals, a long way from the supposed Turkic homeland near the Altai Mountains (let alone Mongolia, as assumed in certain alternative Urheimat theories) is evident at the very first glance, which again indirectly corroborates the hypothesis of its early separation, given that longer distances presumably correlate with longer migration time.
By the 13th century, Volga Bulgaria must have extended approximately within the 200-km (120-mile) radius from the confluence of the Volga and Kama River. It was probably almost entirely destroyed during the Mongol invasion, making the Volga Bulgarians take refuge in the forested areas of the Volga's right (western) bank, situated within the same 120-mile circle. There, near the forests of Chuvashia, the legacy of Mongolian and Tatar raids must have been less pronounced.
These refugium-type Chuvash settlements in a small area along the Sura (=a tributary of the Volga) are very similar to those of the Mari in the forests and hills of the Volga's left and right bank in the nearby area north of Chuvashia. Unsurprisingly, both ethnicities seem to share certain common ethnological and lexical features (usually seen as Proto-Mari borrowings from Volga Bulgarian).
Consequently, the Chuvash people seem to be those Volga Bulgarians that survived the 13th century's invasion or any later military and cultural interventions by confining themselves to the woodland of Chuvashia and ceding their former territory to the ancestors of Kazan Tatars. The latter ones were clearly first attested in the proximity of the Volga-Kama confluence by Ibn-Fadlan as "al-Bashkird" as early as 922, so their settlement was running almost parallel to that of Volga Bulgarians.
The participation of Kazan Tatar people in the migrational seclusion of Chuvash is obscure. The Kazan Tatars did not necessarily occupy the Volga Bulgarian region by force as part of the Mongolian army in the 1230-40's, rather their settlement in the area of the present-day Tatarstan, though inevitably catalyzed by the disastrous Mongolian invasion, could have resulted from a long and slow migration and linguistic assimilation of Volga Bulgaria extending over a period of many centuries.
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
44
It should also be noted that the Chuvash people were first attested in the historical sources only in 1508, and then in 1551, during the rule of Ivan the Terrible and the siege of Kazan by his army. The association of Chuvash with Volga Bulgarians has mostly been the outcome of the historical and linguistic analysis of the 19th century's Turkologists (Kunik, Radlov, Amsharin, etc.) [see the Brockhaus and Efron Encyclopedic Dictionary (1906)], however this conjecture is now considered to be well-demonstrated.
Note: The ethnonym Chuvash is evidently a Tataricized pronunciation of S'uval, since the sounds in the former variant may not even exist in Proto-Bulgaric. The city named Suva:r is attested near the Etil River (=the Volga), for instance, on the map by Mahmud al-Kashgari (1072-74). He also noted, "As for the language of Bulgar, Suvar and Bajanak [= Pecheneg],approaching Rum [= that is, from north to south], it is Turkic of a peculiar type with clipped ends.[= apparently meaning the rather simplified Bulgaric morphology.]
Conclusion:
The discrepancy between Chuvash and other Turkic languages is so pronounced and its geographical position is so detached from the area of maximum diversification of other Turkic languages that it would be appropriate to separate Chuvash as part of a special Bulgaric taxon within the larger Bulgaro-Turkic supertaxon or family. For most practical purposes, we may assume the date of about 800-1100 BC to be a plausible period for the separation of Proto-Bulgaric from the rest of the Turkic languages.
An important terminological innovation that is suggested in the present study is the usage of the term Bulgaro-Turkic instead of just Turkic for the two major groupings. This terminology modification seems to be reasonable, and arises from the practical need to avoid the continual use of periphrastic expressions like "Turkic Proper", "the Turkic languages outside Chuvash", "the Proto-Turkic homeland excluding Proto-Bulgaric", etc.
The Yakutic subgroup
Where does Sakha actually belong?
It has been widely accepted since the 19th century's research work, that Sakha, the language of the Yakuts, is almost as distant from other Turkic languages as Chuvash.
Nevertheless, the matter is not that simple. It has also occurred to several researchers that the Yakuts may actually be directly related to other Turkic ethnic groups of Siberia, such as Tuvan, Khakas or Altay.
http://www.turkceogretimi.com/dil-%C3%BCzerine/the-internal-classification-migration-of-turkic-languages-t%C3%BCrk-dillerinin-kendi-i%C3%A7inde-s%C4%B1n%C4%B1fland%C4%B1rmas%C4%B1-ve-t%C3%BCrk-dillerinin-g%C3%B6%C3%A7%C3%BC
-
Site adresi: The Internal Classification & Migration of Turkic languages
45
So instead of positioning Sakha and Dolgan into a stand-alone sub-group, the alternative hypothesis suggests the existence of a "Siberian" taxon which would include most of the Turkic languages east of the Irtysh River line.
Trying to prove the existence of this "Siberian" taxon turns into a complicated Turkological problem. At first glance, Sakha differs drastically not only from any other Turkic language, but also from its closest potential Siberian neighbors. But in other respects, it seems to share with them certain linguistic features that are hard to delineate from common archaisms. Below we will study some of these shared "Siberian" features in detail.
Yakutic phonology
In phonology, the Yakutic subgroup is characterized by the following local innovations not shared by any other branches:
(1) the loss of the Proto-Turkic perhaps aspirated *sH as in Old Turkic sekiz "eight" > Sakha aGïs; Old Turkic sen > Sakha en "you"; Old Turkic suNok [N=ng] > Sakha uNuok "bone";
(2) the stabilization