BOM

Encoding & Standards

The byte order mark (U+FEFF), placed at the start of a text file to indicate the byte order (endianness) in UTF-16/UTF-32 encodings.

The BOM is a special Unicode character used to signal the byte order of a text stream. In UTF-16, it distinguishes between little-endian (FF FE) and big-endian (FE FF) formats.
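As a rough illustration of how a reader might use these leading bytes, the sketch below checks a byte string against the BOM constants in Python's standard `codecs` module. The function name `detect_bom` and the returned encoding labels are choices made for this example, not part of any standard API.

```python
import codecs

# Longer signatures are checked first: the UTF-32 LE BOM (FF FE 00 00)
# begins with the same two bytes as the UTF-16 LE BOM (FF FE).
BOM_SIGNATURES = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def detect_bom(data: bytes):
    """Return (encoding, bom_length), or (None, 0) if no BOM is found."""
    for bom, encoding in BOM_SIGNATURES:
        if data.startswith(bom):
            return encoding, len(bom)
    return None, 0


# Hypothetical sample: "hi" encoded as UTF-16 little-endian with a BOM.
sample = b"\xff\xfeh\x00i\x00"
encoding, bom_length = detect_bom(sample)
text = sample[bom_length:].decode(encoding)
```

This is a heuristic only: UTF-16 LE text whose first character after the BOM is U+0000 would be reported as UTF-32 LE, and data without a BOM needs another way to determine its encoding.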

In UTF-8, a BOM (EF BB BF) is sometimes added but is not recommended: UTF-8 has no byte-order ambiguity, and the extra bytes can cause issues with scripts, JSON parsing, and Unix tools that don't expect them. Some text editors add a UTF-8 BOM by default, which can lead to subtle bugs.
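One way to guard against an unexpected UTF-8 BOM is sketched below, assuming Python and its built-in `utf-8-sig` codec, which drops a leading BOM when decoding and otherwise behaves like `utf-8`. The helper names `load_json_bytes` and `strip_utf8_bom` are hypothetical and exist only for this example.

```python
import codecs
import json


def load_json_bytes(raw: bytes):
    """Parse JSON from UTF-8 bytes, tolerating an optional leading BOM."""
    # "utf-8-sig" removes a leading EF BB BF if present.
    return json.loads(raw.decode("utf-8-sig"))


def strip_utf8_bom(raw: bytes) -> bytes:
    """Return raw without a leading UTF-8 BOM; other input is unchanged."""
    if raw.startswith(codecs.BOM_UTF8):
        return raw[len(codecs.BOM_UTF8):]
    return raw


# Hypothetical payload as an editor might save it, with a BOM prepended.
payload = codecs.BOM_UTF8 + b'{"name": "test"}'

# Decoding as plain "utf-8" keeps the BOM as a leading U+FEFF character,
# which json.loads rejects when given a str.
try:
    json.loads(payload.decode("utf-8"))
    bom_rejected = False
except ValueError:
    bom_rejected = True

data = load_json_bytes(payload)
```

Decoding with `utf-8-sig` on input and writing with plain `utf-8` on output is a common way to accept files that carry a BOM while not producing new ones.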

Modern best practice: use UTF-8 without BOM for web content and data files.

Related articles

Emoji Security: Homoglyphs, Spoofing, Invisible Characters, and Filtering

Security risks from emoji in user input: homoglyph spoofing, invisible Unicode characters, emoji in SQL/code injection, and how to filter and sanitize safely.
