strange utf8 phenomenon - Mozilla

This is a discussion on strange utf8 phenomenon - Mozilla ; Hi, I use Bugzilla storing hungarian texts as bugs. In our alphabet there are 2 letters which are not included in latin1 charset, but they are in latin2. I have interesting problems using these 2 letters in Bugzilla. The problematic ...

+ Reply to Thread
Results 1 to 2 of 2

Thread: strange utf8 phenomenon

  1. strange utf8 phenomenon

    Hi,

    I use Bugzilla storing hungarian texts as bugs. In our alphabet there
    are 2 letters which are not included in latin1 charset, but they are in
    latin2. I have interesting problems using these 2 letters in Bugzilla.

    The problematic vowels (small and capital ones) are
    őűŐŰ

    In my example I try to store a bug with the following summary (contains
    all hungarian vowels with some tilde):
    áéóöőúüűÁÉÍÓÖŐÚÜŰ

    Case A:
    Browsers use ISO-8859-1 (latin1) page encoding in default.

    Result in Bugzilla:
    áé*óöőúüűÁÉÍÓÖŐÚÜŰ

    Result hexdump in MySQL:
    C3A1C3A9C3ADC3B3C3B626233333373BC3BAC3BC2623333639 3BC381C389C38DC393C3962623333363BC39AC39C262333363 83B


    As you can see, the hexdump is a correct UTF8 code implementation of
    Bugzilla result, just it cannot display correctly my 4 letters.

    Case B:
    When I switched manually utf8 page encoding on in all pages
    (
    in bugzilla\template\en\default\global\header.html.tm pl file
    by inserting a new line after tag:

    )
    there are some change in the operation.

    Result in Bugzilla:
    áé*óöőúüűÁÉÍÓÖŐÚÜŰ

    Result hexdump in MySQL:
    C383C2A1C383C2A9C383C2ADC383C2B3C383C2B6C385E28098 C383C2BAC383C2BCC385C2B1C383
    C281C383E280B0C383C28DC383E2809CC383E28093C385C290 C383C5A1C383C593C385C2B0

    Bugzilla result is excellent, but MySQL contains (probably) double
    encoded UTF8 codes.
    E.g. the first letter is á, it is E1 in latin1.
    E1 -> C3 A1 -> C383C2A1
    Since Bugzilla result is good, there must be some own utf8
    encoding/decoding in Bugzilla.

    How can I stop this double UTF8 encoding in Bugzilla when UTF8 page
    encoding is on in all pages?

    Regards
    Zsolt

  2. Re: strange utf8 phenomenon

    On Fri, 2006-03-03 at 12:44 +0100, Zsolt Branyiczky wrote:
    > How can I stop this double UTF8 encoding in Bugzilla when UTF8 page
    > encoding is on in all pages?


    Bugzilla 2.22 is our first release that will really support UTF-8. You
    might want to try upgrading to 2.22rc1. If the problem still happens
    there, you can let us know in a bug.

    -Max
    --
    http://www.everythingsolved.com/
    Competent, Friendly Bugzilla Services. And Everything Else, too.


+ Reply to Thread