The Power of Afan Oromo Qubee: from Transliteration to Generating Amharic Text
Demessie Girma
PhD CEng MIET SMIEEE, Cloud Solutions Architect ? Azure Cloud Solutions ? Digital Transformation Consultant ? Tech Evangelist
Introduction
Qubee is the Afan Oromo system of writing based on the Latin alphabet. Introduced over three decades ago in 1991 for formal education, it has experienced rapid growth, contributing to significant advancements in broader areas such as education, linguistics, literature, and both printed and digital media. For detailed information on the Qubee orthography, readers are encouraged to refer to references [1] and [2].
One prominent feature of the Qubee writing system is its reliance on phonetic writing. In simple terms, words are transcribed as they are pronounced in the native Afan Oromo language, ensuring a clear and unambiguous pronunciation of the written word. This orthographic property proves to be a powerful tool for transliteration—the process of converting text from one script into another, preserving the original pronunciation while using characters from a different script.
For example, the text "Qubee yeAmaarinyaa nibaabin lemetsaf yiiredal." represents the transliteration of Amharic text: ?? ????? ???? ???? ????? In this case, transliteration occurs by employing the Qubee phonetic system for writing. This is feasible because the unambiguous nature of phonetic writing allows for the representation of words from any given language in phonetics, as long as the phonetics of the transliterated language are available in the host language.
This article provides an in-depth exploration of the foundational concepts behind achieving transliteration and script generation through phonetic writing. Subsequently, it introduces a practical implementation in the form of a software tool called Q2A—an application available for download on the Google Play app store. Emphasizing the algorithmic robustness of the Qubee writing system, the article underscores its evolution into a content generation tool, with the potential to extend its application to various other languages.
Transliteration to Native Script—A Linguistics Detour
If a transliterated language permits, it is also feasible to convert the transliterated text into a native script for that language. For this to occur, a transliterated language should facilitate the mapping of transliterated phonetics to unambiguous orthography or a written representation.
Although no formal proof is provided, the concept can be easily understood through examples. Therefore, consider the following as an axiomatic statement.
If a language permits a one-to-one or several-to-one transliteration (T) to orthography (O) mapping, denoted as {T => O}, then the native script of the transliterated language can be easily generated. The Afan Oromo Qubee system of writing, owing to its phonetic writing system, inherently embodies a one-to-one mapping. To facilitate discussion, we introduce the notation O2O, representing a one-to-one mapping of {T => O}. The O2O mapping defines the Qubee writing system.
On the other hand, the Amharic system of writing exhibits a several-to-one or S2O mapping of {T => O}. For instance, consider the frequently cited dual pronunciation of the Amharic word ??, leading to the mapping {(ale, alle) => ??}. Here, (ale, alle) represents Qubee phonetic transliterations of the Amharic words: ?? (he said) and ?? (there is or there exists), both converging to a single orthographic representation. Thus, the S2O mapping defines the Amharic writing system.
As a side note, the English writing system is characterized by a several-to-several or S2S mapping. This arises from its inclusion of homophones, where the same-sounding words have different spellings, as exemplified by {tu => (to, two, too)}, and homographs, featuring the same spelling but different sounds, as seen in the mapping {(miinet, maynuut) => minute}. The English language, in fact, encompasses several homographs, including words like bow, lead, tear, and tire, to name a few.
To ensure completeness, it is essential to state that, for transliteration, the writing system must adhere to the O2O mapping, as demonstrated by the Qubee writing system. The International Phonetic Alphabet (IPA) serves as one standardized system of phonetic notation designed to represent the sounds of a diverse array of spoken languages. Therefore, IPA should fulfil?the O2O mapping with a much richer set of phonetics than Qubee can accommodate. It is noteworthy that IPA is a purpose-built writing tool designed to cater to several languages worldwide and is not an inherent part of any specific language, as is the case with Qubee.
Many languages, such as Spanish, Italian, Finnish, Korean, etc., employ a phonetic writing system. While recognizing this fact, it is beyond the scope of this article to digress and analyse?to what extent these languages satisfy the O2O mapping. Contextualizing the discussion within the Qubee versus Amharic writing— the focus of this article—renders such a digression irrelevant.
Why Phonetic Writing in the First Place
While providing this overview of the linguistic detour, it is appropriate to comment on why phonetic writing was the chosen path for Afan Oromo. This discussion may serve to dispel some of the misunderstandings surrounding the use of Qubee phonetic writing in the first place.
It is well recognized that (1) tonal characteristics (pitch variation) and (2) phonemic length variations in a language can influence the difficulty of learning and mastering pronunciation. This is due to the introduction of an additional layer of linguistic complexity for the learner brought about by tonal and phonemic length variations.
Tonal languages, such as Mandarin and Cantonese Chinese, Vietnamese, and Thai, among others, can alter the meanings of words based on the pitch or tone in which a syllable is pronounced. For instance, Vietnamese has six distinct tones, while Thai has five. These tonal languages utilize pitch variations to convey different meanings, emphasizing the significance of tonal distinctions in their phonetic systems.
In contrast, Afan Oromo and Amharic are non-tonal languages, but they exhibit the second aspect of phonemic length variations. In the case of Amharic, this is exemplified by the dual pronunciation of the word ??, with the variation attributed to phonemic length.
In Afan Oromo, phonemic length variations are highly prevalent, as illustrated in the example below. Within the framework of Qubee phonetic writing, we can characterize phonemic length variation as a matrix of short/long vowels versus soft/hard consonants. In linguistic terms, the soft/hard consonants are referred to as lenis (soft) and fortis (hard), resulting in the following matrix:
(lenis, fortis) × (short, long) Matrix
This linguistic property is also applicable to Amharic, although not as widespread as in Afan Oromo. In fact, it is the prevalence of phonemic length variations in Afan Oromo that necessitates the use of phonetic writing. This can be easily understood through the several-to-one (S2O) mapping of Amharic orthography ?? from various Afan Oromo words, as illustrated in the Table below.
Observe the phonetic words that map to the Amharic orthography ??. Now, readers should be able to appreciate the role of phonetic writing in disambiguating multiple Afan Oromo words. In this example, there are as many as eight Afan Oromo words, all represented by a single Amharic orthography ??.
Note 1 (from Table above): This illustrates a good example of a slight tonal difference for the word qoree. Although Afan Oromo is a non-tonal language, a negligibly small tonal variation can be observed, particularly in interrogative sentences. It might be tempting to write the interrogative version as qoreee (with three e’s, and why not?) to disambiguate it from qoree (meaning thorn). This is a case of a tiny percentage where phonetic representation may be incomplete but can still be easily mitigated.
The passage vividly illustrates the challenges faced by non-Amharic speakers when decoding the non-phonetic writing of Amharic. The variability in pronunciation based on phonemic nuances makes it essential to have a prerequisite knowledge of the language to accurately decipher and pronounce words. This situation places non-Amharic speakers at a significant disadvantage.
领英推荐
An intriguing example from a flower shop further emphasizes the intricacies of Amharic pronunciation. The word ?????! displayed across a captivating rose picture on the wall presents a puzzle for non-Amharic speakers. Through mental decoding, the author interprets it as "yiggebbashal!"—conveying the message of "you deserve it!" to the recipient of a rose flower, cleverly employed as a familiar marketing tactic.
The author reflects on the challenges encountered during elementary education, attempting to pronounce words accurately without a complete grasp of the language. Even for Amharic speakers, the challenge persists if they lack the context to decipher the intended meaning. Is it "yiggebbashal!" or "yigebashal!"? The latter leads to a word maze with subtly different meanings, which is left to the reader to explore.
Phonetic writing, as exemplified by "yiggebbashal!", alleviates such complications by firmly tying pronunciation to the intended word. In this way, one doesn't need to be an Amharic speaker to pronounce it correctly, highlighting the efficiency of phonetic writing in overcoming these linguistic challenges.
Implementation of Amharic Transliteration and Script Generation
Central to the transliteration and generation of Amharic script is the phonetic algorithm embedded in the Afan Oromo Qubee writing system. In the context of transliteration into Amharic, the Qubee phonetic system offers a precise O2O mapping algorithm that can be easily translated into a computer program. During transliteration and script generation, the operation may appear as if one is using a keyboard for Amharic text. However, it is not an ordinary keyboard; the input keyboard is, in fact, based on Latin letters as the user edits the Qubee phonetic text to generate the Amharic text.
How it all started:
It was the summer of 2019 when I reached out to a friend—an avid blogger and commentator on Facebook—to inquire about the keyboard app he used for writing in Amharic. While most of his writings were in Afan Oromo and English, I noticed occasional posts in Amharic to reach a wider audience. He suggested an Amharic keyboard app, which I installed and tried out. Unfortunately, the experience didn't bring me joy.
Wearing the hat of a software developer and well-acquainted with the phonetic algorithm as the unique feature of the Afan Oromo Qubee writing system, I was convinced there could be a better and quicker way to write Amharic text using the familiar Latin keyboard. The non-ambiguous phonetic algorithm embedded in the Qubee writing as discussed above, made it suitable for a straightforward computer code implementation.
In a matter of days, this conviction led to the birth of the Q2A app, with the acronym denoting—yes, you guessed it—"Qubee to Amharic". The app was made available on Google Play and Microsoft app stores as far back as 2019. Now, with the Multiplatform Application User Interface (MAUI) being my preferred development framework [3] , I've upgraded the Q2A app with a fresh look and feel. This journey has also motivated me to pen down this article, which, along with the two previous ones [1], [2], thoroughly documents various facets of Computational Linguistics as applied to Qubee.
You can install the Q2A app from Google Play app store by clicking or tapping here.
Conclusions
The inception of my journey was sparked by a straightforward desire: the ease of writing in Amharic. This quest resulted in the development of the Q2A app, unlocking the potential of Qubee. Today, individuals equipped with a smartphone and basic typing skills can effortlessly engage in "Qubee to Amharic" writing.
Having embarked on a linguistic detour, we've sought to elucidate why phonetic writing is imperative for Afan Oromo. The pervasive phonemic variations, vividly illustrated through a real example, aim to persuade readers that phonetic writing serves as a crucial tool in alleviating the linguistic complexities that could otherwise significantly hinder the pace of learning in both the written and spoken aspects of Afan Oromo.
The foundation of phonetic writing sets the stage for Qubee's O2O or one-to-one mapping between sounds and letters. This transparent logic enables the development of an algorithm capable of accurately translating phonetic Qubee text into script. Picture it as a smart keyboard attuned to the inherent sounds of both languages, seamlessly bridging the gap between phonetics and script.
This marks merely the initial phase. As technology advances, the potential of Qubee continues to broaden. Envision voice-to-text translation or automated content generation, driven by the straightforward elegance of phonetic writing. The prospects for transliterating other languages appear increasingly promising, owing much to the innovative spirit encapsulated by Qubee.
It is important to note that, in its present state, Q2A functions as a modest productivity tool rather than a robust word-processing application. Its primary focus lies in generating concise text snippets, easily transferable to various applications such as messaging platforms. The Q2A app offers users the flexibility to store up to 10 snippets, and while this number is arbitrary, it can be adjusted to accommodate 100 or more, providing enhanced versatility.
Despite the current intentional limitations of the Q2A app, there's no inherent reason for these constraints to endure. Leveraging the same phonetic algorithms, it is entirely feasible to develop a heavyweight content processing and word-processing application that caters to diverse languages..
Q2A also serves as a testament to how languages can coexist harmoniously, enabling both to flourish in the technological era. Qubee has matured to facilitate Amharic writing, illustrating the potential for linguistic harmony. This reflection brought to mind Dr?Andargachew Tsige's recent suggestion to replace Qubee with the Amharic script for Afan Oromo. However, it is crucial to dismiss such thoughts, as Qubee writing holds immense potential, not only for Afan Oromo but also for Amharic, and potentially numerous other languages that remain unexplored.
Postscript:
By sheer coincidence, as I have been authoring this article, news reports have surfaced about certain individuals at universities proposing unwarranted changes to the Qubee writing system, including altering its name. At least two digital channels have extensively covered this matter, providing clarity on the ongoing situation. I am deeply appreciative of their efforts.
Needless to say, I have been disheartened by these developments, and I take this opportunity to express my humble opinion regarding the Qubee writing system:
References:
Associate Professor of ECE |SMIEEE |IEW Ambassador
9 个月So insightful. Thank you for sharing. "Qubee has matured to facilitate Amharic writing, illustrating the potential for linguistic harmony." Yeah, Qubee already matured!
Agricultural Economist. Teaching is the most rewarding job as teachers create intelligent minds. R&D is at the center of human civilization.
9 个月Thank you Dr Girma for this scentfic and deep analysis of Qubee Afan Oromo and its potential. The App Q2A is an amazing and very useful tool. Fayyaa argadhu.