How Many Characters Is an Emoji? - Why One Emoji Is Not Always One Character
Is 😀 one character? What about 👨👩👧👦 (the family emoji)? Some emoji that look like a single symbol are actually multiple characters joined together. Ever had "I should have room for one more character" on X but could not fit it? Emoji might be the reason.
Emoji Character Count Breakdown
| Emoji | Appearance | Unicode Characters | UTF-16 Length | UTF-8 Bytes |
|---|---|---|---|---|
| 😀 | 1 emoji | 1 | 2 | 4 bytes |
| 👍 | 1 emoji | 1 | 2 | 4 bytes |
| 👍🏽 | 1 emoji | 2 | 4 | 8 bytes |
| 👨👩👧👦 | 1 emoji | 7 | 11 | 25 bytes |
| 🏳️🌈 | 1 emoji | 4 | 6 | 14 bytes |
| 🇯🇵 | 1 emoji | 2 | 4 | 8 bytes |
The family emoji 👨👩👧👦 looks like one symbol but is internally composed of "man + joiner + woman + joiner + girl + joiner + boy" - 7 characters total.
Why Emoji Character Counts Vary
| Mechanism | Explanation | Example |
|---|---|---|
| Basic emoji | One code point = 1 character | 😀 (U+1F600) |
| Skin tone | Base emoji + skin tone modifier = 2 characters | 👍🏽 = 👍 + 🏽 |
| ZWJ sequences | Multiple emoji joined by invisible characters | 👨👩👧👦 = 👨 + ZWJ + 👩 + ZWJ + 👧 + ZWJ + 👦 |
| Flags | Two regional indicator symbols combined | 🇯🇵 = 🇯 + 🇵 |
ZWJ stands for "Zero Width Joiner" - an invisible character that does not display on screen but counts toward the character total.
How Social Platforms Count Emoji
| Service | 😀 Count | 👨👩👧👦 Count | Method |
|---|---|---|---|
| X (formerly Twitter) | 2 characters | 7-11 characters | Custom weighting |
| 1 character | 1 character | Visual emoji count | |
| LINE | 1 character | 1 character | Visual emoji count |
| SMS | Switches to 70-char limit | Same | Encoding switch |
On X, each emoji counts as 2 characters. Heavy emoji use in a 140-character limit quickly eats into your text budget. As covered in the history of X's character limit, emoji handling exemplifies the complexity of character counting.
SMS and the Emoji Surprise
| Condition | Characters per SMS | Encoding |
|---|---|---|
| ASCII only | 160 characters | GSM-7 (7-bit) |
| Japanese only | 70 characters | UCS-2 (16-bit) |
| Contains emoji | 70 characters | Switches to UCS-2 |
An English SMS holds 160 characters, but adding just one emoji forces a switch to UCS-2 encoding, dropping the limit to 70. One emoji costs you 90 characters of capacity.
Emoji Keep Growing
| Year | Unicode Version | Total Emoji | Notable Additions |
|---|---|---|---|
| 2010 | 6.0 | ~720 | Basic faces, animals, food |
| 2015 | 8.0 | ~1,620 | Skin tone variations |
| 2020 | 13.0 | ~3,300 | Ninja, bubble tea, beaver |
| 2024 | 16.0 | ~3,790 | Acorn, fingerprint, splatter |
Emoji have grown from 176 (NTT DoCoMo's original set in 1999) to about 3,790 in 2024 - over 20x in 25 years.
Emoji Length in Programming Languages
| Language | "👨👩👧👦".length Result | Reason |
|---|---|---|
| JavaScript | 11 | Counts UTF-16 code units |
| Python | 7 | Counts Unicode code points |
| Swift | 1 | Counts grapheme clusters (visual characters) |
The same emoji returns lengths of 1, 7, or 11 depending on the programming language. As an advanced case of the character vs byte distinction, emoji are among the most complex examples.
Books on emoji and Unicode can be found on Amazon.