Tales of the Parodyverse >> View Post
Post By
HH

In Reply To
Manga Shoggoth

Member Since: Fri Jan 02, 2004
Posts: 391
Subj: Out of character
Posted: Tue Dec 15, 2015 at 07:04:55 pm EST (Viewed 1 times)
Reply Subj: Re: Nobody should question your life choices.
Posted: Tue Dec 15, 2015 at 06:28:11 pm EST (Viewed 768 times)



    Quote:
    If you are using basic ASCII you use a single byte per character, and the letters of the alphabet all map on to specific numbers. To use foreign characters you have to muck around with code pages whereby a subset of the characters are changed depending on which code page you are using.


I got some way with this then stalled after a frustrating two hours.

And then I discovered something even weirder. Take one of the pages that doesn't have a character conversion problem but does have the white instead of black background problem. Say http://www.chillwater.org.uk/HH/hhstories/untold%20tales%20of%20ll%20341.htm If I manually correct the html file using Notepad ([body] to [body bgcolor="#000000" text="#ffffff" link="lightblue" vlink="white" alink="gray"]) the thing is fixed nicely. If I try a batch text program it fixes the background but actually replicates the character conversion problem of the other files!



    Quote:

      Quote:
      How can that help with the details? Is it something I could use?



    Quote:
    It would allow you to identify the sequence(s) of bytes being used (as above), which you could then feed in to the editing software. It is easy enough to use, and you would only be reading the files, not trying to edit them.



    Quote:
    For example, you would want to replace the byte string C3 A2 E2 82 AC E2 84 A2 with an apostrophe.


Do these programs work on batches of files or would I have to do each one seperately?