Bug 176176 - International characters can not be displayed in project web site
Status: RESOLVED FIXED
Alias: None
Product: www
Classification: Unclassified
Component: Admin
Version: 6.x
Hardware: All All
Importance: P1 normal
Assignee: fredjean
URL:
Keywords:
Duplicates: 177434
Depends on:
Blocks:
 
Reported: 2009-11-09 02:49 UTC by Keiichi Oono
Modified: 2009-12-08 16:49 UTC
CC List: 2 users

See Also:
Issue Type: DEFECT
Exception Reporter:


Attachments
ja project web site screenshot (53.94 KB, image/png)
2009-11-11 02:24 UTC, leawang
HTML text page to see corrupted Japanese characters (153.84 KB, image/png)
2009-11-11 18:53 UTC, Keiichi Oono

Description Keiichi Oono 2009-11-09 02:49:38 UTC
Hello Web team,

I manage the 'ja' project (http://ja.netbeans.org/). Since the migration, all the Japanese characters are corrupted when we access the web site at http://ja.netbeans.org/.

Would you check it?
If there is anything I can do to fix it, please advise.

Because Japanese characters are displayed correctly under 'www' (e.g. http://www.netbeans.org/index_ja.html), I guess the infrastructure itself can display Japanese characters, but they are not displayed correctly in the 'ja' project.

Thank you.
Keiichi
Comment 1 Keiichi Oono 2009-11-09 02:51:22 UTC
Let me set this to P1, since the project web site does not work and we have no workaround.
Comment 2 Petr Blaha 2009-11-09 06:31:54 UTC
This bug must be fixed in Kenai because the original page is validly encoded as UTF-8.
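A minimal sketch of how one might confirm that a page's raw bytes are valid UTF-8, as claimed above. The helper name `is_valid_utf8` and the sample strings are illustrative assumptions, not part of Kenai or the NetBeans site tooling.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the byte string decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


# Hypothetical sample content: the same Japanese text in two encodings.
utf8_bytes = "日本語".encode("utf-8")
sjis_bytes = "日本語".encode("shift_jis")

print(is_valid_utf8(utf8_bytes))  # True
print(is_valid_utf8(sjis_bytes))  # False
```

If the source file passes a check like this and the served page is still corrupted, the damage is likely introduced somewhere between the repository and the browser.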
Comment 3 leawang 2009-11-11 02:24:19 UTC
Created attachment 90774
ja project web site screenshot
Comment 4 leawang 2009-11-11 02:27:50 UTC
Hi Keiichi-san,

Do you still experience issues with the ja project web site (http://ja.netbeans.org/)? The Japanese characters look OK when I use Firefox 3.0.12 on Mac OS X 10.5.8. Please find the screenshot attached.

Thanks,
-Lea
Comment 5 leawang 2009-11-11 03:51:56 UTC
Please provide more info if the problem persists. Thanks.
Comment 6 Keiichi Oono 2009-11-11 18:52:19 UTC
Hi Lea,
I've modified the top page to direct users to the wiki site. All the characters on it are embedded in PNG images, not text. The problem still exists: all text on the other pages is corrupted, e.g. http://ja.netbeans.org/downloads/
Please see the attached image.

Thank you.
Keiichi
Comment 7 Keiichi Oono 2009-11-11 18:53:33 UTC
Created attachment 90858
HTML text page to see corrupted Japanese characters
Comment 8 Petr Blaha 2009-11-13 04:20:02 UTC
Assign to Lea.
Comment 9 Keiichi Oono 2009-11-16 02:10:28 UTC
As a workaround, I've converted all Japanese UTF-8 characters to numeric character references (http://www.w3.org/TR/html4/charset.html#entities). The numeric character references are displayed correctly on the current web site.

But the website still cannot display UTF-8 characters, as reported. Please check the following test page to see the problem.
http://ja.netbeans.org/testfile.html
All other pages no longer contain UTF-8 characters, since I've converted them to numeric character references.
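The workaround above can be sketched as follows. This is only an illustration of the technique, not the script actually used for the 'ja' pages; the function name `to_numeric_refs` and the sample text are assumptions.

```python
import html


def to_numeric_refs(text: str) -> str:
    """Replace each non-ASCII character with a decimal numeric character reference."""
    # The built-in "xmlcharrefreplace" error handler emits &#NNNN; for every
    # character that cannot be represented in ASCII.
    return text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")


sample = "日本語 NetBeans"
encoded = to_numeric_refs(sample)
print(encoded)  # &#26085;&#26412;&#35486; NetBeans

# A browser (or html.unescape) turns the references back into the original text.
assert html.unescape(encoded) == sample
```

Because the result is pure ASCII, it should display correctly regardless of how the server handles the page encoding.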

Thank you.
Keiichi
Comment 10 Keiichi Oono 2009-11-16 18:42:48 UTC
I've changed the subject because we are facing this problem not only in the 'ja' project.
Please see the following pages:

Examples of corrupted pages:
http://nblocalization.netbeans.org/index_ru.html
http://nblocalization.netbeans.org/index_fr.html
http://nblocalization.netbeans.org/index_zh.html
http://nblocalization.netbeans.org/index_sq.html
http://nblocalization.netbeans.org/index_ca_ES.html
http://nblocalization.netbeans.org/index_zh.htm
http://nblocalization.netbeans.org/index_cs.html
http://nblocalization.netbeans.org/index_gl_ES.html

Find all team pages linked on http://wiki.netbeans.org/TFLanguageTeams under "Team Page". 

For example, Czech characters are being corrupted in the following page:
http://nblocalization.netbeans.org/index_cs.html
Comment 11 leawang 2009-11-16 18:51:32 UTC
Thanks, Keiichi, for the additional info. Please see Kenai bug http://kenai.com/jira/browse/KENAI-1622. We will work on it in the 20091204 sprint.
Comment 12 Keiichi Oono 2009-11-16 19:03:24 UTC
Thank you for filing the Kenai bug!
Comment 13 Petr Blaha 2009-11-23 07:58:31 UTC
*** Bug 177434 has been marked as a duplicate of this bug. ***
Comment 14 fredjean 2009-12-08 16:26:56 UTC
Kenai was using an older version of Hpricot that had difficulties with UTF-8 characters. Upgrading Hpricot to 0.8.2 is known to fix the problem.
Comment 15 fredjean 2009-12-08 16:28:17 UTC
(In reply to comment #14)
> Kenai was using an older version of Hpricot that had difficulties with UTF-8
> characters. Upgrading Hpricot to 0.8.2 is known to fix the problem.

I forgot to mention that this will be resolved as part of the 20091211 release.
Comment 16 fredjean 2009-12-08 16:49:31 UTC
Resolved in commit f28cea6 of the Kenai git repo. It will be deployed to testnetbeans.org tonight.

This will be deployed to production with the 20091211 release.