Museum

Home

Lab Overview

Retrotechnology Articles

⇒ Online Manual

Media Vault

Software Library

Restoration Projects

Artifacts Sought

Related Articles

locale(1)

ascii(5)

big5(5)

Chinese(5)

dechanyu(5)

dechanzi(5)

eucTW(5)

i18n_intro(5)

i18n_printing(5)

l10n_intro(5)

sbig5(5)

telecode(5)

GBK(5)  —  Macro Packages and Conventions

NAME

GBK, gbk − A character encoding system (codeset) for Simplified Chinese

DESCRIPTION

The GBK character set is an extension to the GB 2312−80 character set. (The “K” in “GBK” is the first sound in the Chinese word “Kuo Zhan,” which means “extension.”) GBK includes all the Hanzi characters specified by the ISO 10646−1:1993 standard (characters also known as the GB 13000.1.93 character set) that are not included in GB 2312−80. GBK is therefore defined as a normative annex of GB13000.1−93. 

GBK Value Ranges and Code Points

The GBK codeset is divided into five levels, as follows:

Level Encoding Range Code Points Characters
GBK/1 0xA1A1−0xA9FE 846 717
GBK/2 0xB0A1−0xF7FE 6,768 6,763
GBK/3 0x8140−0xA0FE 6,080 6,080
GBK/4 0xAA40−0xFE40 8,160 8,160
GBK/5 0xA840−0xA9A0 192 166

In addition, GBK includes code points for user-defined characters, as follows:

Encoding Range Code Points
0xAAA1−0xAFFE 564
0xF8A1−0xFEFE 658
0xA140−0xA7A0 672

GBK therefore provides a total of 23,940 code points, 21,886 of which are assigned. 

Each row in the GBK code table consists of 190 characters. ASCII characters, which are single-byte characters, are defined in the range 0x21−0x7E. Encoding ranges for two-byte characters are as follows:

Encoding range for the first byte: 0x81-0xFE

Encoding ranges for the second byte: 0x40-0x7E and 0x80-0xFE

Note

In terms of character-to-code allocation, the sub-range for GB2321−80 characters (0xA1A1-0xFEFE) in GBK is the same encoding range defined for these characters in Extended UNIX Code (EUC). GBK is therefore backward compatible with Chinese EUC encoding as well as forward compatible with the encoding as defined by ISO 10646−1:1993. 

GBK is the standard character set and encoding used in the Simplified Chinese version of Windows 95. 

Codeset Converters for GBK

The following codeset converters are available for GBK:

       •GBK_UCS-2

       •GBK_UCS-4

       •GBK_UTF-8

       •UCS-2_GBK

       •UCS-4_GBK

       •UTF-8_GBK

See iconv_intro(5) for more information about codeset converters and Unicode(5) for information about the UCS-2, UCS-4, and UTF-8 encoding formats. 

Fonts for GBK

The operating system provides the following TrueType fonts for GBK:

FangSong

-huatian-fangsong-medium-r-normal--0-0-0-0-c-0-gb2312.1980-0
-huatian-fangsong-medium-r-normal--0-0-0-0-c-0-gb2312.1980-1
-huatian-fangsong-medium-r-normal--0-0-0-0-c-0-gbk-1
-huatian-fangsong-medium-r-normal--0-0-0-0-m-0-iso8859-1

HeiTi

-huatian-heiti-medium-r-normal--0-0-0-0-c-0-gb2312.1980-0
-huatian-heiti-medium-r-normal--0-0-0-0-c-0-gb2312.1980-1
-huatian-heiti-medium-r-normal--0-0-0-0-c-0-gbk-1
-huatian-heiti-medium-r-normal--0-0-0-0-m-0-iso8859-1

KaiTi

-huatian-kaiti-medium-r-normal--0-0-0-0-c-0-gb2312.1980-0
-huatian-kaiti-medium-r-normal--0-0-0-0-c-0-gb2312.1980-1
-huatian-kaiti-medium-r-normal--0-0-0-0-c-0-gbk-1
-huatian-kaiti-medium-r-normal--0-0-0-0-m-0-iso8859-1

SongTi

-huatian-songti-medium-r-normal--0-0-0-0-c-0-gb2312.1980-0
-huatian-songti-medium-r-normal--0-0-0-0-c-0-gb2312.1980-1
-huatian-songti-medium-r-normal--0-0-0-0-c-0-gbk-1
-huatian-songti-medium-r-normal--0-0-0-0-m-0-iso8859-1

These fonts can be used for printing only with Chinese text printers.  The SongTi fonts are the default screen fonts for the GBK codeset. 

SEE ALSO

Commands: locale(1)

Others: ascii(5), big5(5), Chinese(5), dechanyu(5), dechanzi(5), eucTW(5), i18n_intro(5), i18n_printing(5), l10n_intro(5), sbig5(5), telecode(5)

Typewritten Software • bear@typewritten.org • Edmonds, WA 98026