
Bug 219070 - Some pages in repository are missing or have incorrect encoding
Status: NEW
Alias: None
Product: www
Classification: Unclassified
Component: Web Content
Version: 7.3
Hardware: PC Mac OS X
Importance: P3 normal
Assignee: Jan Pirek
URL:
Keywords:
Depends on:
Blocks: 219063
Reported: 2012-09-26 15:23 UTC by Kenneth Ganfield
Modified: 2012-10-08 08:35 UTC (History)

See Also:
Issue Type: DEFECT
Exception Reporter:


Description Kenneth Ganfield 2012-09-26 15:23:33 UTC
Some HTML files in the repository are missing the meta tag that declares the encoding, or declare an encoding other than UTF-8. Files without an encoding declaration could end up with corrupted characters when they are edited, and could cause problems for the translation team.

The script on the site assigns a character encoding of UTF-8 to all pages regardless of the encoding of the original HTML file.
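
To make the failure mode concrete, here is a minimal sketch of what happens when bytes saved as ISO-8859-1 are served with a UTF-8 label. The sample word is an assumption chosen for illustration and is not taken from any file in the repository.

```python
# Hypothetical page text containing one non-ASCII character.
original = "café"

# Bytes as an editor would store them when saving in ISO-8859-1.
stored = original.encode("iso-8859-1")

# What a client gets when those same bytes are interpreted as UTF-8.
# errors="replace" substitutes U+FFFD for each byte sequence that is
# not valid UTF-8 instead of raising UnicodeDecodeError.
shown = stored.decode("utf-8", errors="replace")

print(stored)
print(shown)
```

Pure ASCII pages are unaffected, which is probably why the problem only shows up on pages with accented or otherwise non-ASCII text.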
Comment 1 Jan Pirek 2012-09-27 10:52:16 UTC
Ken, I am not sure this is a problem with the publishing system. All source pages should be in UTF-8. I could optionally add a hook to publishing that would try to inspect each page and convert it on the fly, but that does not come for free in terms of final page load speed. It also does not resolve the root cause, which is that the source file in the content repo is not UTF-8.

Or what do you actually expect from this issue?
Comment 2 Kenneth Ganfield 2012-09-27 11:49:23 UTC
I agree that the root cause is that some files in the repository are not UTF-8 encoded. So the issue is really how to fix that. Is there some way to at least find the files that are missing the encoding tag or that specify something other than UTF-8?

We can then decide how important it is to modify those files.

I think this mainly impacts the files that are translated.
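
One possible way to find such files is sketched below. It is only an illustration: the function names `declared_charset` and `audit` are hypothetical, and it assumes the files end in `.html` and declare their encoding in a `<meta>` tag. It checks only what each file declares, not whether the bytes really are in that encoding.

```python
import re
from pathlib import Path

# Matches the charset value inside a <meta ...> tag. Covers both
# <meta charset="x"> and
# <meta http-equiv="Content-Type" content="text/html; charset=x">,
# including tags whose attributes are split across lines.
META_CHARSET = re.compile(
    rb"<meta\b[^>]*?charset\s*=\s*[\"']?\s*([A-Za-z0-9._:-]+)",
    re.IGNORECASE,
)


def declared_charset(data: bytes):
    """Return the lower-cased charset declared in a meta tag, or None."""
    match = META_CHARSET.search(data)
    return match.group(1).decode("ascii").lower() if match else None


def audit(root):
    """Yield (path, charset) for .html files whose declaration is
    missing (charset is None) or is something other than utf-8."""
    for path in sorted(Path(root).rglob("*.html")):
        charset = declared_charset(path.read_bytes())
        if charset != "utf-8":
            yield path, charset
```

Calling `audit("kb")` from the repository root and printing each pair would give a list of candidates to review, with `None` marking files that have no declaration at all.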
Comment 3 Jan Pirek 2012-10-08 08:35:36 UTC
This is the list of kb/ files with non-UTF-8 charsets:

kb/articles/CreateWinkDemo.html:<META HTTP-EQUIV="CONTENT-TYPE" CONTENT="text/html; charset=windows-1252">
kb/articles/netbeans_platform/index.html:<meta http-equiv="Content-Type" content="text/html;  charset=ISO-8859-1">
kb/articles/netbeans_platform/player.html:<meta http-equiv="Content-Type" content="text/html;  charset=ISO-8859-1">
kb/articles/quickstart-collaboration.html:<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
kb/articles/codebeamer-for-netbeans.html: content="text/html; charset=windows-1252">
kb/samples/gui-form-examples.html:    <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
kb/samples/webjpa.html:    <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
kb/samples/javaee-stateless.html:    <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
kb/samples/jsfjpa.html:    <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
kb/samples/paint-application.html:    <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
kb/samples/master-index.html:        <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
kb/samples/mobile-svg.html:    <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
kb/samples/gui-client.html:    <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
kb/samples/feedreader.html:    <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
kb/samples/mobile-game-builder.html:    <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
kb/samples/javaee-crud.html:    <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
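
If the decision is to fix the source files, a conversion could look roughly like the sketch below. It is an assumption-laden illustration rather than an agreed procedure: `to_utf8` and `convert_file` are hypothetical names, and the code trusts that each file's bytes really are in the charset its meta tag declares, which should be checked by eye before committing anything.

```python
import re
from pathlib import Path

# Captures everything up to the charset value in a <meta> tag (group 1)
# and the value itself (group 2), so only the value is replaced.
CHARSET_ATTR = re.compile(
    r"(<meta\b[^>]*?charset\s*=\s*[\"']?\s*)([A-Za-z0-9._:-]+)",
    re.IGNORECASE,
)


def to_utf8(data: bytes, source_charset: str) -> bytes:
    """Decode data from source_charset, point the first meta charset
    declaration at utf-8, and return the result encoded as UTF-8."""
    # Strict decoding: raises UnicodeDecodeError if the bytes do not
    # fit the assumed source charset, rather than corrupting silently.
    text = data.decode(source_charset)
    text = CHARSET_ATTR.sub(lambda m: m.group(1) + "utf-8", text, count=1)
    return text.encode("utf-8")


def convert_file(path, source_charset: str) -> None:
    """Rewrite one file in place as UTF-8 (assumes it is under version
    control, so the change can be reviewed and reverted)."""
    path = Path(path)
    path.write_bytes(to_utf8(path.read_bytes(), source_charset))
```

For example, `convert_file("kb/samples/webjpa.html", "iso-8859-1")` would rewrite that file, after which the diff can be inspected for any unexpected character changes.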