Let's describe the details of ptexenc here! For the Japanese language, three different character encodings—EUC-JP, ISO-2022-JP, and Shift_JIS (for short, EUC, JIS, and SJIS, respectively)—have been used, depending on system (used in traditional Linux® distributions, email transport system, and Windows®, resp.). Nowadays UTF-8 encoding is becoming popular. UTF-8 encoding is used as the default Japanese encoding for Fedora™, Ubuntu™, Mac OS® X, and so on. ASCII Japanese pTeX originally had three executable files for each of three traditional character encodings. Default encoding of your binary was determined when the binary was built (by ./configure script's option). If you would like to typeset the non-default encoding file, please execute ptex with -kanji=<the file's encoding> as 'ptex -kanji=sjis'. More precisely speaking, ISO-2022-JP files can handle well in any three default enconding package. You need to be careful not to mix files in EUC and SJIS encodings. Some improvements were made so that one executable file could support three encodings, but they were not sufficient. To solve this issue, we have created a new library, named ‘ptexenc,’ which makes the encoding functions clear. We will illustrate what we did below. Now, ptexenc provides UTF-8 encoding support. Internally, ptexenc change UTF-8 encoding texts to EUC-JP encoding if the letters are included in JIS X 0208 characters(?); if not, ptexenc expands the text in the form of "^^ab". This is because pTeX can only handle JIS X 0208 character set. If you use pTeX + ptexenc with inputenc package (`utf8' option) and UTF/OTF package, you can typeset Unocode characters (e.g. Hangul, some Kanji variations) more directly. ... Below are memos for editorial purpose, possibly in Japanese.
|