Posted by Rik Wasmus on June 11, 2008, 2:59 am
On Sun, 08 Jun 2008 01:44:50 +0200, Lawrence Krubner wrote:
> Rik Wasmus wrote:
>> On Thu, 05 Jun 2008 22:16:08 +0200, Lawrence Krubner wrote:
>>> Once upon a time, there were no garbage characters on this page:
>>>
>>> http://www.teamlalala.com/blog/category/css/
>>>
>>> Now there are. For instance:
>>>
>>> The 2nd paragraph from page 114 of “The Zen Of CSS Design�
>>>
>>>
>>> For me, there are garbage characters before "The" and after "Design".
>>>
>>> The page has always, always been served as UTF-8.
>>>
>>> I'm having trouble figuring out what might have changed to cause these
>>> garbage characters. At a stretch, I think back to an incident a few
>>> months ago, when our server was hacked, and we had to do a re-install,
>>> with upgraded versions of stuff like Apache. So I could almost imagine
>>> Apache sending new headers, except that, in my case, the meta tag
>>> indicates UTF-8 and when I look at it in Firefox, Firefox correctly
>>> reads it as UTF-8.
>>>
>>> Anything else that could cause this?
>>>
>>> I cannot find a character encoding that renders this page without
>>> garbage characters.
>> One of the top reasons for double UTF-8 encoding is an improper
>> database export/import.
>
> That must be it, then. Is there an automated way to undo the damage? Or
> do I have to fix every post by hand?
I am not aware of a general, quick, easy fix. Ask in a group dedicated to
the database of your choice; it isn't an uncommon problem.
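
For a single string, the damage can sometimes be reversed by hand. Below is a minimal sketch, assuming the text was double-encoded exactly once through MySQL's latin1 (which behaves like Windows-1252, with the few undefined bytes passed through as control characters). The function names are made up for illustration, and this is not a general fix: test it on a copy of the data first.

```python
def to_mysql_latin1_bytes(text: str) -> bytes:
    """Encode text as cp1252, passing through characters below U+0100
    that cp1252 cannot encode (assumed to match MySQL's latin1)."""
    out = bytearray()
    for ch in text:
        try:
            out += ch.encode("cp1252")
        except UnicodeEncodeError:
            if ord(ch) > 0xFF:
                raise  # not the kind of damage this sketch handles
            out.append(ord(ch))
    return bytes(out)


def undo_double_utf8(text: str) -> str:
    """Reverse one round of UTF-8 -> latin1 -> UTF-8 damage.

    Raises UnicodeError if the text was not damaged in that way."""
    return to_mysql_latin1_bytes(text).decode("utf-8")


# The garbled title from the page, written with escapes so the
# control character at the end stays visible in the source.
mangled = "\u00e2\u20ac\u0153The Zen Of CSS Design\u00e2\u20ac\u009d"
repaired = undo_double_utf8(mangled)
print(repaired)
```

Text that was never double-encoded but contains accented characters will usually raise an error here instead of being silently changed, which makes it possible to skip rows that are already correct.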
> Also, any tips on import/export, for the next time I have to do this?
If you are using MySQL, be sure to set the connection character set to
the proper value. In that case, the first statement in the file to be
imported should have been:
SET NAMES utf8;
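
Why that one statement matters can be shown with a toy model. The sketch below is not MySQL code: it assumes the server treats incoming bytes as latin1 (Windows-1252-like) unless told otherwise, and converts them to UTF-8 for storage. The function names are hypothetical.

```python
def from_mysql_latin1_bytes(raw: bytes) -> str:
    """Decode bytes as cp1252, mapping bytes that cp1252 leaves undefined
    to the control character with the same number (assumed MySQL behaviour)."""
    chars = []
    for b in raw:
        try:
            chars.append(bytes([b]).decode("cp1252"))
        except UnicodeDecodeError:
            chars.append(chr(b))
    return "".join(chars)


def simulate_import(dump_bytes: bytes, connection_charset: str) -> bytes:
    """Return the bytes a utf8 column would end up holding (toy model)."""
    if connection_charset == "utf8":
        text = dump_bytes.decode("utf-8")
    elif connection_charset == "latin1":
        text = from_mysql_latin1_bytes(dump_bytes)
    else:
        raise ValueError("unsupported charset: " + connection_charset)
    return text.encode("utf-8")


dump = "\u201cThe Zen Of CSS Design\u201d".encode("utf-8")
good = simulate_import(dump, "utf8")    # as if SET NAMES utf8 came first
bad = simulate_import(dump, "latin1")   # as if it was left out
print(good.decode("utf-8"))
print(repr(bad.decode("utf-8")))
```

In the second case every multi-byte character is stored as two or three separate characters, which is the kind of garbage the page now shows even though it is served as UTF-8.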
HTH,
--
Rik Wasmus
...spamrun finished