Click here to get back home

unicode and numeric character reference in html

 HomeNewsGroups | Search | About
 comp.infosystems.www.authoring.html    Post an article   get this group's latest topics as an RSS feed add this group's latest topics to your My MSN content add this group's latest topics to your My Yahoo content
Subject Author Date
unicode and numeric character reference in html stefaniefauconnier 10-18-2007
Posted by stefaniefauconnier on October 18, 2007, 4:12 pm
Please log in for more thread options
I've got some html-files with a lot of greek characters in utf-8. When
I open my source code, I don't see the numeric character reference but
the character itself. How can I change this?
I really need source codes with numeric character references.


Posted by Janne Tuukkanen on October 18, 2007, 4:37 pm
Please log in for more thread options
Thu, 18 Oct 2007 20:12:35 +0000, stefaniefauconnier kirjoitti:

> I've got some html-files with a lot of greek characters in utf-8. When I
> open my source code, I don't see the numeric character reference but the
> character itself. How can I change this? I really need source codes with
> numeric character references.

You have to convert characters to html entities.

Recode ( http://recode.progiciels-bpi.ca/ ) should do the job.


        JanneT

--
http://jannetuukkanen.net/


Posted by André Gillibert on October 18, 2007, 5:20 pm
Please log in for more thread options
stefaniefauconnier@gmail.com wrote:

> I've got some html-files with a lot of greek characters in utf-8. When
> I open my source code, I don't see the numeric character reference but
> the character itself.

Most probably, you used a text editor supporting UTF-8 and the greek
characters are encoded in UTF-8. This is perfectly fine.

> How can I change this?
> I really need source codes with numeric character references.

To see the Unicode code points, use an hex editor.
Do you want to convert these characters to HTML character references such
as &#N; where N is a number in decimal notation?

--
If you've a question that doesn't belong to Usenet, contact me at

Posted by stefaniefauconnier on October 19, 2007, 4:11 am
Please log in for more thread options
> To see the Unicode code points, use an hex editor.
> Do you want to convert these characters to HTML character references such
> as &#N; where N is a number in decimal notation?

yes, I do...

i'll have a look at this recode program.


Posted by Jukka K. Korpela on October 20, 2007, 3:20 am
Please log in for more thread options
Scripsit stefaniefauconnier@gmail.com:

>> To see the Unicode code points, use an hex editor.
>> Do you want to convert these characters to HTML character references
>> such as &#N; where N is a number in decimal notation?
>
> yes, I do...

Why? There's always someone who rewrites working code, to clean it up, or to
speed it up, or (as here?) just to recode it. Here it would increase the
amount of bytes needed to represent a Greek character, though it would be
more serious that the HTML code would become tedious to read and change.

I can imagine a few cases where the recoding might make sense, but I can
also imagine many cases where people want that for no good reason and just
cause confusion. Remember that in any such change, there's the risk of
messing things up, even if the conversion is as such trivial.

(My guess is that you would do the recoding in order to be able to work with
the files using an editor that cannot handle UTF-8. In that case, you should
have stated that and perhaps asked the right question "How can I find a
UTF-8 capable editor for my ... system?" - in the right group.)

--
Jukka K. Korpela ("Yucca")
http://www.cs.tut.fi/~jkorpela/


Similar ThreadsPosted
represent any Unicode character by means of a markup string coded in us-ascii May 27, 2005, 10:08 pm
reference in html page August 20, 2004, 11:25 am
HTML reference guide July 10, 2006, 1:31 am
why does unicode.org offer many scripts if unicode is a single code for all characters? May 27, 2005, 6:03 pm
Amazon: "The Ultimate HTML Reference" (2008) July 14, 2008, 11:23 am
Can an HTML source file be specified in unicode ? March 13, 2005, 12:29 pm
Unicode and html - help for simple web site August 24, 2005, 6:44 pm
Can an HTML source file be specified in unicode ? October 12, 2006, 8:08 am
character to HTML ampersand escape sequence converter December 17, 2004, 11:17 am
how to reference an html page from another html page March 7, 2008, 12:43 pm

Our other projects:

Art Dolls, Fairies and Mermaids - Sunnyfaces.net

Roy's Linux, Programming and Search Engines messages

1-Script XML SitemapXML Sitemap