Read Aloud the Text Content
This audio was created by Woord's Text to Speech service by content creators from all around the world.
Text Content or SSML code:
Unicode As computers speak in the language of 1’s and 0’s, they assign and store characters of a script with a unique numeric value. This process is known as character encoding and is the bedrock of communication in information technology. Character encoding gives meaning and allows the computer to differentiate and interpret data such as letters, numbers and symbols. For text to be accurately exchanged on a global scale, regardless of the software or its origin, the characters of a script must be encoded using a consistent and universally applied method to avoid malfunction. The Unicode Standard provides this standardisation of encoding characters. For a script to gain a published table of encoded characters, in other words, gain Unicode support, the organisation must first approve a formal proposal. (Unicode, With a character encoding table, type designers can map each glyph within a font to its respective character code, thus reinforcing encoding standardisation and generating usability of encoded characters. Unicode has become an essential component to achieve typographic stability when using the Khmer script on-screen. It has been the recognised encoding standard for Khmer in Cambodia since 2001. Before this, an estimated 30 competing formats were used, “which simply painted Khmer letters over existing Roman alphabet characters” (Phnom Penh Post, 2008). Multiple encoding formats are a breeding ground for inconsistencies, notably in glyph-to-character mapping. Without Unicode standardisation, Khmer text would be typographically unstable. For example, switching from one font to another with different character encoding properties would change the text—not just the font as the user intended (Open Forum of Cambodia, 2005). Text Layout Text layout is a crucial aspect of typography, assisting in the delivery and readability of a message. It is often synonymous with design considerations such as composition, rhythm, proportion and hierarchy. However, it is also fundamental to software development and text processing as it is responsible for how the integral structure of a writing system is displayed on-screen. This aspect of text layout is particularly crucial when handling complex scripts, such as Khmer, which has contextual and non-linear typographic requirements. It is less apparent and less of a high-level priority when dealing with simple scripts such as Latin, one of the easiest to display (Unicode, ICU Documentation, n.d). Latin characters can be processed from left to right, corresponding to how they are stored, through the Unicode character encoding table (Unicode, n.d). The script responds in a what-you-see-is-what-you-get-like fashion, and contextual analysis is unnecessary. Therefore, its rendering behaviour and layout requirements are forthright. Complex scripts, such as Khmer, require measures beyond the straightforward transformation of character code points into glyphs. These measures are needed for around half of the world’s writing systems and are handled collaboratively by layout engines, shaping engines, multiple algorithms and fonts (Hudson, 2016) (Windows, 2015). The following is a simplified diagram using the OpenType model, presented by John Hudson of Tiroworks, highlighting that collaboration.