|
Posted by jm on March 9, 2008, 3:08 pm
Please log in for more thread options
mathias wündisch a écrit :
> Hi,
>
> i look for informations about the use of the perl pragma "locale". the
> reason why: i have old input data coded with codepage 437. and i want
> to use the normal matchingcharacter \b (non alphabetic char) and \w
> (alphabetic char).
> i found a lot of site about posix under linux but i work under windows
> xp. am i the first with this problem?
> i want to tell perl what byte between 32 and 255 is a allphabetical
> character (\w) and what is not (\b).
> thanks for ideas.
>
> bye, mathias
>
What's about using Encode module, with decode function to transform your
IBM/CP 437 data to utf-8, and then using utf-8 (transparently) behaviour
of perl regex?
|