Method (English)
Method of project implementation
The project consists of several stages:
- Forming language groups.
- Selecting the alphabet (using 35-letter Ortatil standard).
- Selecting grammatical rules (8-way harmony).
- Selecting words (Mean Intelligibility Index).
- Translating one book from each language into Ortatil.
The alphabet consists of 35 letters, incorporating diplomatic archiconsonants:
| A a | Ä ä | B b | Ḅ ḅ |
| C c | Ç ç | D d | Ḍ ḍ |
| E e | F f | G g | Ğ ğ |
| H h | I ı | İ i | J j |
| K k | Ḳ ḳ | L l | M m |
| N n | Ñ ñ | O o | Ö ö |
| P p | Q q | R r | S s |
| Ş ş | T t | U u | Ü ü |
| W w | Y y | Z z |
Preserving phonetic and pronunciation features (Diplomatic Initial Dots)
To bridge dialectal divides, we use archiconsonants only on the first letter of a root word to mark historical sound splits:
- Ḳelmäk — Oghuz speakers read it as Gelmäk, Karluk/Kipchak speakers as Kelmäk.
- Ḍurmaq — Oghuz speakers read it as Durmaq, Karluk/Kipchak speakers as Turmaq.
- Qardaş — Turks read as Kardaş, Azeris as Gardaş, East as Qardaş.
- Ḅar — Oghuz reads as Var, East as Bar.
- Jol — Oghuz/Karluk read as Yol, Kipchak as Jol.
- Wätän — Oghuz reads as Vätän, East as Wätän.
Selection method (Average Intelligibility Index)
Decisions on accepting a vocabulary root are based on the AII (Average Intelligibility Index).
The AII evaluates candidate words across ten core Turkic languages divided into three main linguistic branches:
- Oghuz: Turkish, Azerbaijani, Turkmen (3 languages)
- Qarluq: Uzbek, Uyghur (2 languages)
- Qipchaq: Kazakh, Kyrgyz, Karakalpak, Tatar, Bashkir (5 languages)
The baseline score is calculated by determining the presence (✅ = 1) or absence (❌ = 0) of a word variant in each language, calculating an average subtotal for each branch, and summing them together.
An additional Cross-Branch Coverage Bonus is then applied:
- "In All" Bonus (+1.0): If a variant is present in at least one language across all three branches.
- "In Oghuz/Qipchaq" Bonus (+0.5): If a variant spans Oghuz and Qipchaq groups but misses Qarluq.
AII = Oghuz Subtotal + Qarluq Subtotal + Qipchaq Subtotal + Coverage Bonus
AII Qualifying Thresholds
- AII >= 2.3: High-Intelligibility Winner. Accepted automatically and exported directly to the dictionary framework.
- AII < 2.3: Low-Intelligibility Variant. Kept inside the internal dataset, but skipped for primary vocabulary inclusion unless no other higher-scoring variants exist for that semantic concept.
Real-World Example: The Concept of "Wind"
Below is the live execution matrix assessing the root candidates jel, şamal, and daıl to determine the final accepted vocabulary items.
Comprehensibility Evaluation Matrix
| Metric / Language | jel | şamal | daıl |
|---|---|---|---|
| Turkish | ✅ | ❌ | ❌ |
| Azerbaijani | ✅ | ❌ | ❌ |
| Turkmen | ✅ | ✅ | ❌ |
| Uzbek | ✅ | ✅ | ❌ |
| Uyghur | ✅ | ✅ | ❌ |
| Kazakh | ✅ | ❌ | ✅ |
| Kyrgyz | ✅ | ✅ | ❌ |
| Karakalpak | ✅ | ✅ | ❌ |
| Tatar | ✅ | ❌ | ✅ |
| Bashkir | ✅ | ❌ | ✅ |
| Oghuz Subtotal | 1.00 (3/3) | 0.33 (1/3) | 0.00 (0/3) |
| Qarluq Subtotal | 1.00 (2/2) | 1.00 (2/2) | 0.00 (0/2) |
| Qipchaq Subtotal | 1.00 (5/5) | 0.40 (2/5) | 0.60 (3/5) |
| In All (Bonus) | ✅ (+1.0) | ✅ (+1.0) | ❌ (+0.0) |
| Final AII Score | 4.00 | 2.73 | 0.60 |
Final Analysis and Results
- jel (AII: 4.00): This root achieves absolute saturation across all ten languages (1.00 + 1.00 + 1.00) and receives a +1.0 bonus for full family coverage. It is accepted instantly as a primary choice.
- şamal (AII: 2.73): While missing in Turkish, Azerbaijani, Kazakh, Tatar, and Bashkir, it has strong coverage in the remaining languages. Because it successfully reaches into at least one language within all three branches, it earns the +1.0 cross-branch coverage bonus (0.33 + 1.00 + 0.40 + 1.0). With a score above 2.3, şamal is safely accepted alongside jel.
- daıl (AII: 0.60): This variant only appears inside a sub-cluster of the Qipchaq branch (0.60). It fails to span across branches, receives zero bonus points, and falls well below the 2.3 filter limit. It is discarded from the winning results.
Conclusion: Both jel and şamal are accepted without dispute as official Ortatil variants representing the concept of "Wind."