Click here to get back home

unicode and numeric character reference in html

 HomeNewsGroups | Search | About
 comp.infosystems.www.authoring.html    Post an article   get this group's latest topics as an RSS feed add this group's latest topics to your My MSN content add this group's latest topics to your My Yahoo content
Subject Author Date
unicode and numeric character reference in html stefaniefauconnier 10-18-2007
Get Chitika Premium
Posted by Andy Dingley on October 20, 2007, 6:42 am
Please log in for more thread options
On Sat, 20 Oct 2007 10:20:26 +0300, "Jukka K. Korpela"

>Scripsit stefaniefauconnier@gmail.com:

>>> Do you want to convert these characters to HTML character references
>>> such as &#N; where N is a number in decimal notation?

>Here it would increase the
>amount of bytes needed to represent a Greek character, though it would be
>more serious that the HTML code would become tedious to read and change.

I sometimes convert Czech numeric entities. Our English-speaking coders
can proof read bug fixes to mis-accented Czech letters if they're
presented as numbers, but they can't do this reliably with the
characters themselves.

I may even convert them back again for archival storage afterwards.

Posted by Pierre Goiffon on October 22, 2007, 4:12 am
Please log in for more thread options
Andy Dingley wrote:
>>>> Do you want to convert these characters to HTML character references
>>>> such as &#N; where N is a number in decimal notation?
>
>> Here it would increase the
>> amount of bytes needed to represent a Greek character, though it would be
>> more serious that the HTML code would become tedious to read and change.
>
> I sometimes convert Czech numeric entities. Our English-speaking coders
> can proof read bug fixes to mis-accented Czech letters if they're
> presented as numbers, but they can't do this reliably with the
> characters themselves.

Lots of text editor are able to give the code point of a given character
- really nice feature I use a lot in such cases.

Posted by stefaniefauconnier on October 31, 2007, 7:07 pm
Please log in for more thread options
You ask me why? Well, I wrote those HTML files for someone who uses
them with some kind of search engine. This search engine can only
handle numeric character reference. I know this is weird, but that's
the way it is. So I need to convert the utf-8 symbols in my HTML files
to numeric character reference. I do know this means that the files
will become more difficult to read etc, but it doesn't matter at all.

I managed to solve the problem by downloading the Unicode Converter
add-on for Firefox.


> Scripsit stefaniefauconn...@gmail.com:
>
> >> To see the Unicode code points, use an hex editor.
> >> Do you want to convert these characters to HTML character references
> >> such as &#N; where N is a number in decimal notation?
>
> > yes, I do...
>
> Why? There's always someone who rewrites working code, to clean it up, or to
> speed it up, or (as here?) just to recode it. Here it would increase the
> amount of bytes needed to represent a Greek character, though it would be
> more serious that the HTML code would become tedious to read and change.
>
> I can imagine a few cases where the recoding might make sense, but I can
> also imagine many cases where people want that for no good reason and just
> cause confusion. Remember that in any such change, there's the risk of
> messing things up, even if the conversion is as such trivial.
>
> (My guess is that you would do the recoding in order to be able to work with
> the files using an editor that cannot handle UTF-8. In that case, you should
> have stated that and perhaps asked the right question "How can I find a
> UTF-8 capable editor for my ... system?" - in the right group.)
>
> --
> Jukka K. Korpela ("Yucca")http://www.cs.tut.fi/~jkorpela/



Posted by Jukka K. Korpela on November 2, 2007, 3:43 am
Please log in for more thread options
Scripsit stefaniefauconnier@gmail.com:

> You ask me why?

Thank you for Upside-down Fullquoting, the standard cluelessness indicator.

> This search engine can only
> handle numeric character reference. I know this is weird,

No, it's simply crappy software and should not be used for anything related
to HTML authoring for the WWW. There you have the real problem.

> I managed to solve the problem

No, you just created the illusion of being able to live with the problem.
The problem, I repeat, is a "search engine" that is not useful, in a world
with free search engines available around the globe.

--
Jukka K. Korpela ("Yucca")
http://www.cs.tut.fi/~jkorpela/


Posted by André Gillibert on November 2, 2007, 3:17 pm
Please log in for more thread options
Jukka K. Korpela wrote:

> Scripsit stefaniefauconnier@gmail.com:
>> This search engine can only
>> handle numeric character reference. I know this is weird,
>
> No, it's simply crappy software and should not be used for anything
> related to HTML authoring for the WWW. There you have the real problem.
>

He didn't specify what the search engine searches.
It may (or may not) be a very specific search engine, with highly complex
underlying technologies so that, there may be no alternative. For
instance, a search engine looking for specific english grammatical
constructs with a true english language grammar parser.
In that case, he may have to live with the quirks and bugs of this search
engine, or ask for support from the software developers, if there's still
support for this software.

--
If you've a question that doesn't belong to Usenet, contact me at

Similar ThreadsPosted
represent any Unicode character by means of a markup string coded in us-ascii May 27, 2005, 10:08 pm
reference in html page August 20, 2004, 11:25 am
HTML reference guide July 10, 2006, 1:31 am
why does unicode.org offer many scripts if unicode is a single code for all characters? May 27, 2005, 6:03 pm
Amazon: "The Ultimate HTML Reference" (2008) July 14, 2008, 11:23 am
Can an HTML source file be specified in unicode ? March 13, 2005, 12:29 pm
Unicode and html - help for simple web site August 24, 2005, 6:44 pm
Can an HTML source file be specified in unicode ? October 12, 2006, 8:08 am
character to HTML ampersand escape sequence converter December 17, 2004, 11:17 am
how to reference an html page from another html page March 7, 2008, 12:43 pm

Our other projects:

Art Dolls, Fairies and Mermaids - Sunnyfaces.net

Roy's Linux, Programming and Search Engines messages

1-Script XML SitemapXML Sitemap