Quantcast
Channel: What characters must be escaped in HTML 5? - Stack Overflow
Viewing all articles
Browse latest Browse all 4

Answer by Sylvain Leroux for What characters must be escaped in HTML 5?

$
0
0

Adding my voice to insist that things are not that easy -- strictly speaking:

Case 1 : HTML serialization

(the most common)

If you serialize your HTML5 as HTML, "the text must not contain the character U+003C LESS-THAN SIGN (<) or an ambiguous ampersand."

An ambiguous ampersand is an "ampersand followed by one or more alphanumeric ASCII characters, followed by a U+003B SEMICOLON character (;)"

Furthermore, "the parsing of certain named character references in attributes happens even with the closing semicolon being omitted."

So, in that case editable && copy (notice the spaces around &&) is valid HTML5 serialized as HTML construction as none of the ampersands is followed by a letter.

As a counter example: editable&&copy is not safe (even if this might work) as the last sequence &copy might be interpreted as the entity reference for ©

Case 1 : XML serialization

(the less common)

Here the classic XML rules apply. For example, each and every ampersand either in the text or in attributes should be escaped as &amp;.

In that case && (with or without spaces) is invalid XML. You should write &amp;&amp;

Tricky, isn't it ?


Viewing all articles
Browse latest Browse all 4

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>