Click here to get back home

garbage characters are now on the site, although they weren't there originally

 HomeNewsGroups | Search | About
 comp.infosystems.www.authoring.html    Post an article   get this group's latest topics as an RSS feed add this group's latest topics to your My MSN content add this group's latest topics to your My Yahoo content
Subject Author Date
garbage characters are now on the site, although they weren't there originally Lawrence Krubner 06-05-2008
Get Chitika Premium
Posted by Andreas Prilop on June 10, 2008, 11:50 am
Please log in for more thread options
On Tue, 10 Jun 2008, Keith Hughitt wrote:

> Somewhat off-topic question, but, when you copy-and-paste text in
> windows/unix, is the encoding included in that information?

What is "windows/unix"?

> I.e. if you saved a document in latin1 and wanted to get it to utf-8,
> could you just coipy and paste the text into a new document
> and save it as utf-8?

It depends on the program you use.
On Unix, it depends also on your locale settings.

--
In memoriam Alan J. Flavell
http://groups.google.com/groups/search?q=author:Alan.J.Flavell

Posted by Blinky the Shark on June 10, 2008, 6:34 pm
Please log in for more thread options
Andreas Prilop wrote:

> On Tue, 10 Jun 2008, Keith Hughitt wrote:
>
>> Somewhat off-topic question, but, when you copy-and-paste text in
>> windows/unix, is the encoding included in that information?
>
> What is "windows/unix"?

s/\// or /


--
Blinky
Killing all posts from Google Groups
The Usenet Improvement Project --> http://improve-usenet.org
Found 5/08: a free GG-blocking news *feed* --> http://usenet4all.se


Posted by Keith Hughitt on June 11, 2008, 8:50 am
Please log in for more thread options
Hehe, what I meant was on either Windows or Unix (Linux). I'd be
interested to know how it works
on both systems.


> On Tue, 10 Jun 2008, Keith Hughitt wrote:
> > Somewhat off-topic question, but, when you copy-and-paste text in
> > windows/unix, is the encoding included in that information?
>
> What is "windows/unix"?
>
> > I.e. if you saved a document in latin1 and wanted to get it to utf-8,
> > could you just coipy and paste the text into a new document
> > and save it as utf-8?
>
> It depends on the program you use.
> On Unix, it depends also on your locale settings.
>
> --
> In memoriam Alan J. Flavellhttp://groups.google.com/groups/search?q=3Dauth=
or:Alan.J.Flavell


Posted by Andreas Prilop on June 11, 2008, 11:12 am
Please log in for more thread options
Differently.

> interested to know how it works on both systems.
> Hehe, what I meant was on either Windows or Unix (Linux). I'd be
>
>> What is "windows/unix"?
>>
>>> windows/unix, is the encoding included in that information?
>>> Somewhat off-topic question, but, when you copy-and-paste text in

On Wed, 11 Jun 2008, Keith Hughitt wrote:

--
Top-posting.
What's the most irritating thing on Usenet?

Posted by Rik Wasmus on June 11, 2008, 2:59 am
Please log in for more thread options
On Sun, 08 Jun 2008 01:44:50 +0200, Lawrence Krubner

> Rik Wasmus wrote:
>> On Thu, 05 Jun 2008 22:16:08 +0200, Lawrence Krubner
>>> Once upon a time, there were no garbage characters on this page:
>>>
>>> http://www.teamlalala.com/blog/category/css/
>>>
>>> Now there are. For instance:
>>>
>>> The 2nd paragraph from page 114 of “The Zen Of CSS
Design�
>>>
>>>
>>> For me, there are garbage characters before "The" and after "Design".
>>>
>>> The page has always, always been served as UTF-8.
>>>
>>> I'm having trouble what might have changed, which would cause these
>>> garbage characters. At a stretch, I think back to an incident a few
>>> months ago, when our server was hacked, and we had to do a re-install,
>>> with upgraded versions of stuff like Apache. So I could almost imagine
>>> Apache sending new headers, except that, in my case, the meta tag
>>> indicates UTF-8 and when I look at it in FireFox, FireFox correctly
>>> reads it as UTF-8.
>>>
>>> Anything else that could cause this?
>>>
>>> I can not find a character encoding that renders this page without
>>> garbage characters.
>> Among the top reasons for double utf-8 encoding is an improper
>> database export/import.
>
> That must be it, then. Is there an automated way to undo the damage? Or
> do I have to fix every post by hand?

I am not aware of a general quick easy fix, ask in a group dedicated to
the database of your choice, it isn't an uncommon problem.

> Also, any tips on import/export, for the next time I have to do this?

If MySQL, be sure to set your connection characteristics to the proper
values. The first statement in your file to be imported in that case
should've been:

SET NAMES utf8;

HTH,
--
Rik Wasmus
...spamrun finished

Similar ThreadsPosted
UTF-8 garbage characters October 1, 2004, 1:34 am
UTF-8 garbage characters May 27, 2005, 1:19 pm
web site to translate spanish characters into corresponding html codes? April 14, 2008, 10:16 pm
UTF-8 and Latin-1 characters July 11, 2004, 10:04 pm
& special characters May 29, 2006, 6:48 am
Accented characters March 13, 2008, 1:37 am
Recommendations for good Web site hosting company (for personal Web site) April 16, 2006, 7:49 pm
Special characters used in Excel January 28, 2005, 4:29 am
Bad characters error, help requested October 19, 2006, 10:48 am
Re: Bad characters error, help requested October 19, 2006, 8:34 pm

Our other projects:

Art Dolls, Fairies and Mermaids - Sunnyfaces.net

Roy's Linux, Programming and Search Engines messages

1-Script XML SitemapXML Sitemap