Understanding the Behaviour of Consonant Clusters in the Konkani Language
Converting Konkani text from Kannada, Romi, or Malayalam script to Devanagari requires a detailed understanding of how consonant clusters behave in Devanagari compared with the other three scripts. To this end, the World Institute of Konkani Language conducted a detailed study of consonant clusters to identify the behaviour pattern of each cluster occurrence. The following cases were identified after studying 1156 probable consonant clusters.

Konkani Consonant Cluster Behavioural Study
This means that consonant clusters belonging to CASE A need to be separated by inserting a SCHWA when converting Konkani text from Kannada script to Devanagari, whereas consonant clusters belonging to CASE B need to be retained as clusters. The challenging part is deciding what to do with the 40 CASE C clusters, which need to be separated in some words and retained as conjuncts in others. In this beta version of the KONKANVERTER, each CASE C cluster has been assigned the behaviour it shows in the majority of its occurrences. Hence, incorrect transliterations may be observed in some words. This issue will be addressed through lexical lookup based on a comprehensive wordlist, or through the development of a machine learning system for KONKANVERTER.
The above rules have also been applied to conversions from Roman and Malayalam script to Devanagari.
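The case-based handling described above can be sketched in Python. This is only an illustration under stated assumptions, not the KONKANVERTER implementation: it assumes an intermediate Devanagari string in which every cluster is still written as a conjunct (consonant + virama + consonant), so that "inserting a schwa" amounts to dropping the virama and letting the inherent vowel surface. The function name `resolve_clusters` and the contents of `CLUSTER_CASE` are hypothetical; the study's actual CASE A/B/C lists are not reproduced here. The sketch also keeps word-initial clusters intact, as the rules for Devanagari require.

```python
VIRAMA = "\u094D"  # Devanagari sign virama (halant)

# Hypothetical placeholder assignments, invented for illustration only.
# "A" = separate with schwa, "B" = retain as conjunct. A CASE C cluster
# would be stored here under whichever behaviour is in the majority.
CLUSTER_CASE = {
    ("र", "त"): "A",
    ("स", "त"): "B",
}


def is_consonant(ch: str) -> bool:
    """True for the basic Devanagari consonants U+0915 (क) to U+0939 (ह)."""
    return "\u0915" <= ch <= "\u0939"


def resolve_clusters(word: str, cluster_case=CLUSTER_CASE, default="B") -> str:
    """Drop the virama inside CASE A clusters; keep every other cluster.

    Only two-consonant clusters are treated as word-initial here.
    """
    out = []
    for i, ch in enumerate(word):
        if ch == VIRAMA and 0 < i < len(word) - 1:
            left, right = word[i - 1], word[i + 1]
            if is_consonant(left) and is_consonant(right):
                word_initial = (i == 1)  # cluster starts the word
                case = cluster_case.get((left, right), default)
                if case == "A" and not word_initial:
                    continue  # omit virama: the inherent schwa separates the pair
        out.append(ch)
    return "".join(out)
```

With the placeholder table, `resolve_clusters("कर्ता")` drops the virama between र and त, while `resolve_clusters("स्नेह", {("स", "न"): "A"})` leaves the word unchanged because the cluster is word-initial. A lexical lookup for CASE C words could be layered on top by consulting a wordlist before falling back to this table.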
OTHER TRANSLITERATION RULES FOR DEVANAGARI
1 | Conjuncts (of any case) appearing at the beginning of a word, i.e., word-initial clusters, shall be retained. | ಮ್ಹಜೀ/mhoji = म्हजी, ಸ್ನೇಹ/sneh = स्नेह, ಸ್ರಾವ್/srav = स्राव |
2 | Always convert the following clusters to conjuncts | ಕ್+ಶ್ = ಕ್ಶ್/kxo = क्ष, ಕ್+ಷ್ = ಕ್ಷ್/kxo = क्ष, ಜ್+ಞ್ = ಜ್ಞ್/gneo = ज्ञ |
3 | Convert every word-final इ to ई | ಮ್ಹಜಿ/mhoji = म्हजी, ತಿ/ti = ती, ಆನಿ/ani = आनी, ಬಡಿ/bodi = बडी, ಸಾಡಿ/sadi = साडी |
4 | Convert every word-final उ to ऊ | ತುಂ/tum = तूं, ಕುರು/kuru = कुरू, ಗುರು/guru = गुरू, ದುಡು/duddu = दुडू |
5 | Convert every word-final ऒ to ओ | ಬರೊ = बरो, ಕಾಂಟೊ = कांटो, ಪಿಟೊ = पिटो, ಮ್ಹಾತಾರೊ = म्हातारो |
6 | Convert every ऎ to ए | ದೆವಾಕ್/devak = देवाक, ನೆವನ್/nevon = नेवन, ಮೆಂದು/mendu = मेंदू |
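The vowel rules in the table lend themselves to a short post-processing pass. The following Python sketch is an illustration, not the KONKANVERTER code: it assumes an intermediate Devanagari word in which short vowels have been carried over one-to-one from the source script, and the function name `apply_vowel_rules` is hypothetical. Treating a vowel followed by a final anusvara as word-final is inferred from the example तूं; applying the same treatment to ऒ is an assumption.

```python
ANUSVARA = "\u0902"

# Word-final lengthening: independent vowels and their dependent signs (matras).
FINAL_LENGTHEN = {
    "\u0907": "\u0908",  # इ -> ई
    "\u093F": "\u0940",  # ि -> ी
    "\u0909": "\u090A",  # उ -> ऊ
    "\u0941": "\u0942",  # ु -> ू
    "\u0912": "\u0913",  # ऒ -> ओ
    "\u094A": "\u094B",  # ॊ -> ो
}

# Short e is replaced in every position, not only word-finally.
SHORT_E = str.maketrans({
    "\u090E": "\u090F",  # ऎ -> ए
    "\u0946": "\u0947",  # ॆ -> े
})


def apply_vowel_rules(word: str) -> str:
    """Apply the always-convert cluster rule and the vowel rules to one word."""
    word = word.translate(SHORT_E)
    # क + virama + श is always written as क + virama + ष (क्ष).
    word = word.replace("\u0915\u094D\u0936", "\u0915\u094D\u0937")
    # Assumption: a trailing anusvara does not block word-final lengthening.
    tail = ""
    if word.endswith(ANUSVARA):
        word, tail = word[:-1], ANUSVARA
    if word and word[-1] in FINAL_LENGTHEN:
        word = word[:-1] + FINAL_LENGTHEN[word[-1]]
    return word + tail
```

A translation table is used for the short e rule because it applies everywhere in the word, whereas the lengthening rules inspect only the last vowel. Words that already end in a long vowel pass through unchanged.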