Museum

Home

Lab Overview

Retrotechnology Articles

⇒ Online Manual

Media Vault

Software Library

Restoration Projects

Artifacts Sought

Related Articles

locale(1)

ascii(5)

deckanji(5)

eucJP(5)

i18n_intro(5)

i18n_printing(5)

iconv_intro(5)

iso2022(5)

Japanese(5)

jiskanji(5)

l10n_intro(5)

sdeckanji(5)

shiftjis(5)

iso2022jp(5)  —  Macro Packages and Conventions

NAME

iso2022jp, iso-2022-jp, ISO-2022-JP − A character encoding system (codeset) for Japanese

DESCRIPTION

The ISO-2022-JP codeset consists of the following character sets:

       •ASCII

For information on the ASCII character set, refer to ascii(5). 

       •JIS X0201-1976

Only the Roman letters in this character set are included. For details, refer to deckanji(5). 

       •JIS X0208-1978

       •JIS X0208-1983

JIS X0208-1983 is a revised version of JIS X0208-1978 and remapped some characters of JIS X0208-1978 to other positions. 

Before a character is used, its corresponding character set must be designated. In ISO-2022-JP, the designation of a character set is done by using an escape sequence as follows:

Escape Sequence Character Set
ESC ( B ACSII
ESC ( J JIS X0201-1976 (left-hand part)
ESC $ @ JIS X0208-1978
ESC $ B JIS X0208-1983

It is assumed that the starting code of a line is ASCII (including CR alone and LF alone, but not including the combination CRLF). If there are JIS X0208 characters on a line, there must be a switch to ASCII or to the left-hand part of JIS X0201 (Roman letters) before the end of the line (in other words, before the CRLF, or carriage return and line feed). 

For example, if a line starts with the ASCII character 9, followed by the JIS X0208-1978 character at row 16 column 1, the line is encoded as follows:

39h ESC $ @ 30h 21h .... ESC ( B .... CRLF

If a line starts with the JIS X0208-1978 character at row 16 column 1, followed by the ASCII character 9, then the line is encoded as follows:

ESC $ @ 30h 21h ESC ( B 39h .... CRLF

Once a character set is designated, there is no need to redesignate the character set if the adjacent character belongs to the same character set. For example, the following practice is not recommended:

ESC $ B .... ESC $ B ....

Currently, the operating system supports the iso2022jp codeset only through codeset conversion. 

For general information about printing non-English text, refer to i18n_printing(5). 

SEE ALSO

Commands: locale(1)

Others: ascii(5), deckanji(5), eucJP(5), i18n_intro(5), i18n_printing(5), iconv_intro(5), iso2022(5), Japanese(5), jiskanji(5), l10n_intro(5), sdeckanji(5), shiftjis(5)

Typewritten Software • bear@typewritten.org • Edmonds, WA 98026