Transliteration
The process of converting text from one writing system to another while preserving phonetics.
Transliteration is the process of converting text written in one writing system to another. Unlike translation, which replaces meaning in another language, transliteration preserves phonetics (pronunciation) while replacing only the characters. For example, writing the Japanese "東京" as "Tokyo" in Roman letters is a form of transliteration. It is used in many contexts including internationalization, search engines, and passport name notation.
Several standard systems exist for transliteration. Japanese romanization has Hepburn and Kunrei systems, where "し" is written as "shi" in Hepburn but "si" in Kunrei. Cyrillic-to-Latin conversion follows ISO 9, and Chinese Pinyin follows ISO 7098. Passport names use Hepburn romanization, requiring attention to long vowel handling, such as whether "おおの" becomes "ONO" or "OHNO." explore enema on Amazon provide systematic coverage.
For programming implementations, the ICU (International Components for Unicode) library is the de facto standard. ICU's Transliterator class provides conversion rules between numerous writing systems, such as Latin-to-Katakana and Cyrillic-to-Latin. Python has the unidecode library, JavaScript has the transliteration package, and similar tools exist for other languages. Search engines use transliteration to convert romanized search queries into Japanese for matching.
Transliteration and transcription are often confused. Transliteration is based on character-to-character correspondence and is generally reversible. Transcription is based on pronunciation and may be irreversible. For example, "ふじさん" romanized as "Fujisan" is transliteration, while an English speaker writing the heard sound as "Foojeesan" is closer to transcription.
A common pitfall is that transliteration does not always produce one-to-one character mappings. Japanese "ん" is romanized as "n," but becomes "m" before "b," "m," or "p" (e.g., "新橋" becomes "Shimbashi"), making simple character substitution insufficient. Additionally, tonal marks in Chinese or vowel marks in Arabic may be lost during Latin transliteration.
Regarding character counting, transliteration can significantly change character counts. A single kanji character may become multiple Roman letters ("東" becomes "higashi" with 7 characters), so the impact on character limits must be considered before and after transliteration. For social media posts and meta descriptions with character limits, checking post-transliteration character counts is practically important. check out foie gras on Amazon are also helpful references.