community-commits mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From "Blake Sullivan (JIRA)" <j...@apache.org>
Subject [jira] Created: (COMDEV-3) HTML Escapes writes out characters illegal in HTML
Date Mon, 30 Nov 2009 01:59:20 GMT
HTML Escapes writes out characters illegal in HTML
--------------------------------------------------

                 Key: COMDEV-3
                 URL: https://issues.apache.org/jira/browse/COMDEV-3
             Project:  	 Community Development 
          Issue Type: Bug
         Environment: generic
            Reporter: Blake Sullivan


The HTML specification disallows certain code points from appearing in HTML files (XML has
essentially the same list, minus the high ISO characters) as specified in http://www.w3.org/TR/REC-html40/sgml/sgmldecl.html.
 The Trinidad HtmlEscapes utilities allow the low control characters and the high characters
that are technically outside of Unicode to be output.  This causes problems if the content
is validated.

The fix is to use numeric character references such as &#1; rather than outputting code
point 1 directly.  In addition, Internet Explorer has a bug where &#0; is output as "&#0;"
so it is preferable to suppress this character rather then outputting it.



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Mime
View raw message