View Single Post

   
  #8 (permalink)  
Old 02-28-2008, 10:47 AM
Joachim Durchholz
 
Posts: n/a
Default Re: reverse mutliple charset convert

f-x.p@laposte.net schrieb:
> I have noticed that the result of the double conversion ends with the
> two characters 'ƒÂ' being added in every accented code:
>
> At least, this works for french characters, I'm not sure it is the
> same for german characters (Ö or ß), nor for asian characters


If it solves your problem: great!

It won't work in general, though.

> Therefore, I simply made an update query on the columns where the
> problem happens. I simply erase the 'ƒÂ' characters.
>
> UPDATE page SET `page_title` = REPLACE(page_title,'ƒÂ','');
> UPDATE pagelinks SET `pl_title` = REPLACE(pl_title,'ƒÂ','');


In German, we'd probably have something like

UPDATE page SET `page_title` = REPLACE(page_title,'ÃÂ','Ö');

This kind of mapping will always work (with the "right" character sets
in place), because UTF8-to-anything-else is never ambiguous (at least
not for the character sets that are covered by UTF).

Regards,
Jo
Reply With Quote