|
Posted by lkrubner on May 27, 2005, 10:08 pm
Please log in for more thread options
>Alan J. Flavell Oct 7 2004, 1:44 pm show options
>>On Thu, 7 Oct 2004, Shmuel (Seymour J.) Metz wrote:
>> >I think you mean "multiple character encoding schemes".
>> Yes, although a different character set would imply a different
>> encoding scheme.
>
>Absolutely not. That's the whole point!
>
>In (X)HTML you can (if you so choose) represent any Unicode character
>by means of a markup string coded in us-ascii, even. The use of other
>encoding schemes is merely a convenience when the desired character
>repertoire fits a particular pattern, but whichever encoding scheme
>you choose, you still - in principle - have access to any other
>Unicode character you need, by means of &-notation.
I could change any Unicode character to its html notation, if only I
had a way to find out the Unicode value of the characters in the string
I'm given. But given a random set of string inputs, possibly copy and
pasted from WordPerfect or Microsoft Word or BBedit on a Mac, I don't
know how to find the Unicode value of those characters.
|