Emoji Sequence Analyzer: Decode ZWJ, Skin Tones, and Keycaps

What Is an Emoji SequenceEmoji Sequence
An ordered set of one or more Unicode code points that together represent a single emoji character.
?

Many emojis are not single characters — they are sequences of multiple UnicodeUnicode
Universal character encoding standard that assigns a unique number to every character across all writing systems and symbol sets, including emoji.
code points that combine to form one visible glyph. When you paste 👩‍💻 into a text editor, what looks like one emojiEmoji
A Japanese word (絵文字) meaning 'picture character' — small graphical symbols used in digital communication to express ideas, emotions, and objects.
is actually three separate code points joined together. Understanding this structure is essential for developers, designers, and anyone curious about how emoji actually work.

The EmojiFYI Sequence Analyzer takes any emoji or emoji sequence and breaks it apart, showing you each component code pointCode Point
A unique numerical value assigned to each character in the Unicode standard, written in the format U+XXXX (e.g., U+1F600 for 😀).
, its name, and the type of sequence being used.

Types of Emoji Sequences

ZWJ Sequences

ZWJ stands for Zero Width JoinerZero Width Joiner (ZWJ)
An invisible Unicode character (U+200D) used to join multiple emoji into a single composite emoji, such as combining people and objects into profession emoji.
(U+200D). It is an invisible character that tells rendering engines to combine the emojis on either side of it into a single glyph.

Example: 👩‍💻 Woman Technologist is composed of: 1. 👩 Woman (U+1F469) 2. U+200D (Zero Width Joiner) 3. 💻 Laptop (U+1F4BB)

The ZWJ signals that these should be fused into a single image. On platforms that support it, you see 👩‍💻. On platforms that do not, you see the three components side by side: 👩💻.

More complex ZWJ sequences chain multiple emojis together:

👨‍👩‍👧‍👦 Family: Man, Woman, Girl, Boy is built from: - 👨 Man + ZWJ + 👩 Woman + ZWJ + 👧 Girl + ZWJ + 👦 Boy

That is seven code points producing one emoji.

Skin Tone ModifierSkin Tone Modifier
Five Unicode modifier characters based on the Fitzpatrick scale that change the skin color of human emoji (U+1F3FB to U+1F3FF).
Sequences

The Unicode StandardUnicode Standard
The complete character encoding system maintained by the Unicode Consortium, defining characters, properties, algorithms, and encoding forms.
defines five skin tone modifiers (based on the Fitzpatrick scale):

Modifier Code Point Sample
Light U+1F3FB 👍🏻
Medium-Light U+1F3FC 👍🏼
Medium U+1F3FD 👍🏽
Medium-Dark U+1F3FE 👍🏾
Dark U+1F3FF 👍🏿

When you pick a skin tone for 👍 Thumbs Up, the result is a two-code-point sequence: the base emoji followed immediately by a modifier. The Sequence Analyzer shows both components with their code points.

Some ZWJ sequences combine skin tones on multiple people. 👫🏽 Couple holding hands with medium skin tone uses skin tone modifiers on both figures inside the ZWJ chain.

Keycap Sequences

Keycap emojis like 1️⃣ 2️⃣ #️⃣ *️⃣ are three-code-point sequences:

  1. The base character (a digit, #, or *)
  2. U+FE0F (Variation Selector-16, which signals emoji presentationEmoji Presentation
    The default rendering of a character as a colorful emoji glyph, either inherently or when triggered by Variation Selector-16.
    )
  3. U+20E3 (Combining Enclosing Keycap)

For example, 1️⃣ is: - U+0031 (Digit One: 1) - U+FE0F (Variation Selector-16) - U+20E3 (Combining Enclosing Keycap)

This is why keycap emojis sometimes appear as plain text digits in environments that do not support full emoji rendering — the base character is just the number.

Flag Sequences

Country flag emojis (🇺🇸, 🇯🇵, 🇰🇷) use Regional IndicatorRegional Indicator (RI)
Paired Unicode letters (U+1F1E6 to U+1F1FF) that form country flag emoji when combined according to ISO 3166-1 alpha-2 codes.
Symbols
. Each flag is two code points: one for each letter of the ISO 3166-1 alpha-2 country code.

🇺🇸 (United States) is: - 🇺 Regional Indicator U (U+1F1FA) - 🇸 Regional Indicator S (U+1F1F8)

The Sequence Analyzer identifies the country code and tells you which country the flag represents.

Subdivision flags (🏴󠁧󠁢󠁥󠁮󠁧󠁿 England) use tag sequences — a different, more complex mechanism involving invisible tag characters (U+E0000 range).

How to Use the Sequence Analyzer

Step 1: Open the Tool

Navigate to /tools/sequence-analyzer/.

Step 2: Paste or Type an Emoji

Click the input field and paste an emoji. You can paste a single emoji, a ZWJ sequence, or even a string containing multiple emojis — the analyzer processes each one.

Good sources for complex emoji sequences to analyze: - Copy a family emoji (👨‍👩‍👦) from any messaging app - Copy a profession emoji (🧑‍🚀 Astronaut, 👩‍🎤 Woman Singer) - Copy a couple emoji (💑, 👫) - Type a keycap: 0️⃣ 1️⃣ #️⃣

Step 3: Read the Breakdown

The analyzer displays a table showing:

  • Position — the order of each code point in the sequence
  • Code Point — the hexadecimal Unicode value (e.g., U+1F469)
  • Character — the rendered glyph or description for invisible characters
  • Name — the official Unicode name
  • Type — whether it is a base emoji, ZWJ, modifier, variation selectorVariation Selector (VS)
    Unicode characters (VS-15 U+FE0E and VS-16 U+FE0F) that modify whether a character renders in text (monochrome) or emoji (colorful) presentation.
    , or other component

For ZWJ sequences, the tool also names the full sequence (e.g., "Woman Technologist") and links to the emoji detail page.

Step 4: Explore Component Emojis

Clicking any component in the breakdown opens its own emoji detail page. This is useful for understanding which base emojis combine to form a ZWJ sequence, and whether those bases are available as standalone emojis on all platforms.

Practical Uses

For Developers

  • Verify that a string contains exactly the code points you expect before storing it in a database
  • Debug rendering issues where an emoji displays as multiple separate characters
  • Understand why emoji.length in JavaScript returns unexpected values for ZWJ sequences (each code point counts separately in UTF-16UTF-16
    A variable-width Unicode encoding that uses 2 or 4 bytes per character, used internally by JavaScript, Java, and Windows.
    )

For Content Creators

  • Confirm that a complex family emoji is a recognized Unicode sequence rather than a platform-specific hack
  • Check which component emojis make up a sequence before using it in a context where rendering support may be limited

For the Curious

  • Discover that 🏳️‍🌈 Rainbow Flag is built from 🏳️ White Flag + ZWJ + 🌈 Rainbow
  • See that 🧑‍🤝‍🧑 People Holding Hands chains together three emojis
  • Understand why 🇨🇳 and 🇹🇼 are just pairs of invisible regional indicator letters

Combining the Analyzer with Other Tools

The Sequence Analyzer works well alongside the Compare tool. After breaking down a ZWJ sequence into its parts, you can check each component in the Compare tool to see how platforms render the base emojis — giving insight into why the combined sequence might look different on Samsung versus Apple.

For a deeper look at ZWJ sequences in general, see the EmojiFYI glossary article on ZWJ Sequences.

Related Tools

🔀 Platform Compare Platform Compare
Compare how emojis render across Apple, Google, Samsung, Microsoft, and more. See visual differences side by side.
🔍 Sequence Analyzer Sequence Analyzer
Decode ZWJ sequences, skin tone modifiers, keycap sequences, and flag pairs into individual components.

Glossary Terms

Code Point Code Point
A unique numerical value assigned to each character in the Unicode standard, written in the format U+XXXX (e.g., U+1F600 for 😀).
Emoji Emoji
A Japanese word (絵文字) meaning 'picture character' — small graphical symbols used in digital communication to express ideas, emotions, and objects.
Emoji Presentation Emoji Presentation
The default rendering of a character as a colorful emoji glyph, either inherently or when triggered by Variation Selector-16.
Emoji Sequence Emoji Sequence
An ordered set of one or more Unicode code points that together represent a single emoji character.
Regional Indicator (RI) Regional Indicator (RI)
Paired Unicode letters (U+1F1E6 to U+1F1FF) that form country flag emoji when combined according to ISO 3166-1 alpha-2 codes.
Skin Tone Modifier Skin Tone Modifier
Five Unicode modifier characters based on the Fitzpatrick scale that change the skin color of human emoji (U+1F3FB to U+1F3FF).
UTF-16 UTF-16
A variable-width Unicode encoding that uses 2 or 4 bytes per character, used internally by JavaScript, Java, and Windows.
Unicode Unicode
Universal character encoding standard that assigns a unique number to every character across all writing systems and symbol sets, including emoji.
Unicode Standard Unicode Standard
The complete character encoding system maintained by the Unicode Consortium, defining characters, properties, algorithms, and encoding forms.
Variation Selector (VS) Variation Selector (VS)
Unicode characters (VS-15 U+FE0E and VS-16 U+FE0F) that modify whether a character renders in text (monochrome) or emoji (colorful) presentation.
Zero Width Joiner (ZWJ) Zero Width Joiner (ZWJ)
An invisible Unicode character (U+200D) used to join multiple emoji into a single composite emoji, such as combining people and objects into profession emoji.

Related Stories