Prettifying ASCII etc text (-> Unicode, TeX etc)

I find that I write very similar code again and again, for example
to convert Latin-1 (or ASCII, or Win-Cyrillic, etc.) text to
book-quality TeX, or to do the inverse conversion, Unicode -> Latin-1.

[For example, I have a rule like

    $in =~ s/\bDvorák\b/Dvo\\v rák/g;

to restore the háček (TeX \v) missing from the Latin-1 spelling.]

Thinking about it more, the process should/could be split into several
phases (some of which may be omitted):

  a) Guess the encoding of the text (like Mozilla does);

  b) Restore Unicode information lost due to a limited character set;

  c) Pretty-print Unicode to the needed output format.
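
To make the three phases concrete, here is a minimal sketch using only the
core Encode module.  Everything in it is an assumption for illustration: the
sub names, the two tiny lookup tables, and above all the "guess", which merely
tries strict UTF-8 and falls back to Latin-1 (nowhere near what a browser
does).

```perl
use strict;
use warnings;
use Encode qw(decode FB_CROAK LEAVE_SRC);

# Phase a: crude guess.  If the bytes decode as strict UTF-8, take that;
# otherwise assume Latin-1 (an assumption, not real detection).
sub guess_and_decode {
    my ($bytes) = @_;
    my $text = eval { decode('UTF-8', $bytes, FB_CROAK | LEAVE_SRC) };
    return defined $text ? $text : decode('iso-8859-1', $bytes);
}

# Phase b: hypothetical table of words whose spelling was flattened by a
# limited character set, mapped to their full Unicode spelling.
my %restore = ( "Dvor\x{E1}k" => "Dvo\x{159}\x{E1}k" );

sub restore_unicode {
    my ($text) = @_;
    for my $damaged (keys %restore) {
        $text =~ s/\b\Q$damaged\E\b/$restore{$damaged}/g;
    }
    return $text;
}

# Phase c: hypothetical per-character table for TeX output.  Characters
# without an entry are passed through unchanged.
my %tex = ( "\x{159}" => '\v r', "\x{E1}" => q{\'a} );

sub to_tex {
    my ($text) = @_;
    $text =~ s/([^\x00-\x7F])/exists $tex{$1} ? $tex{$1} : $1/ge;
    return $text;
}
```

Chained together, to_tex(restore_unicode(guess_and_decode($bytes))) runs all
three phases; any phase can be skipped by leaving its call out.  Keeping the
word table and the character table separate means phase b can be reused with
a different output format in phase c.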

I do not recall seeing modules for any of these tasks...  Did you?

