CHINESE CHARACTER


A 'Chinese character' or 'Han character' () is a logogram used in writing Chinese, Japanese, sometimes Korean, and formerly Vietnamese.
The number of Chinese characters contained in the Kangxi dictionary is approximately 47,035, although a large number of these are rarely-used variants accumulated throughout history. Studies carried out in China have shown that full literacy requires a knowledge of between three and four thousand characters.[1]
In Chinese tradition, each character corresponds to a single syllable. A majority of words in all modern varieties of Chinese are polysyllabic and thus require two or more characters to write. Cognates in the various Chinese languages/dialects which have the same or similar meaning but different pronunciations can be written with the same character. In addition, many characters were adopted according to their meaning by the Japanese and Korean languages to represent native words, disregarding pronunciation altogether. The loose relationship between phonetics and characters has thus made it possible for them to be used to write very different and probably unrelated languages.
Chinese characters are also known as ''sinographs'', and the Chinese writing system as ''sinography''. Non-Chinese languages which have adopted sinography — and, with the orthography, a large number of loanwords from the Chinese language — are known as Sinoxenic languages, whether or not they still use the characters. The term does not imply any genetic affiliation with Chinese. The major Sinoxenic languages are generally considered to be Japanese, Korean and Vietnamese.

Contents
History
Neolithic signs
Written styles
Formation of characters
1. Pictograms (象形字 ''xiàngxíngzì'')
2. Pictophonetic compounds (形聲字/形声字, ''Xíngshēngzì'')
3. Ideograph (指事字, ''zhǐshìzì'')
4. Logical aggregates (會意字/会意字, ''Huìyìzì'')
5. Associate Transformation (轉注字/转注字, ''Zhuǎnzhùzì'')
6. Borrowing (假借字, ''Jiǎjièzì'')
Written variants
Orthography
Common typefaces
Reforms: Simplification
Simplification in China
Japanese kanji
Southeast Asian Chinese communities
Dictionaries
Sinoxenic languages
Number of Chinese characters
Chinese
Japanese
Korean
Vietnamese
Rare and complex characters
Chinese calligraphy
See also
References
External links

History


Areas using only Chinese characters in green; in conjunction with other scripts, dark green; maximum extent of historic usage, light green. (does not include other territories annexed by Japan in WWII)

The oldest Chinese inscriptions that are indisputably writing are the Oracle bone script (). The oracle bone script is a well-developed writing system, attested from the late Shang Dynasty (1200-1050 B.C.)William G. Boltz, Early Chinese Writing, World Archaeology, Vol. 17, No. 3, Early Writing Systems. (Feb., 1986), pp. 420-436 (436)David N. Keightley, Art, Ancestors, and the Origins of Writing in China, Representations, No. 56, Special Issue: The New Erudition. (Autumn, 1996), pp.68-95 (68)John DeFrancis: Visible Speech. The Diverse Oneness of Writing Systems: Chinese from Anyang, and from Zhengzhou, dated 1600 BC. In addition, there are very few logographs found on pottery shards and cast in bronzes, known as the Bronze script (), which is very similar to but more complex and pictorial than the Oracle Bone Script. Only about 1,400 of the 2,500 known Oracle Bone logographs can be identified with later Chinese characters and therefore easily read. However, it should be noted that these 1,400 logographs include most of the commonly used ones.
According to legend, Chinese characters were invented earlier by Cangjie (c. 2650 BC), a bureaucrat under the legendary emperor, Huangdi. The legend tells that Cangjie was hunting on Mount Yangxu (today Shanxi) when he saw a tortoise whose veins caught his curiosity. Inspired by the possibility of a logical relation of those veins, he studied the animals of the world, the landscape of the earth, and the stars in the sky, and invented a symbolic system called ''zì'' — Chinese characters. It was said that on the day the characters were born, Chinese heard the devil mourning, and saw crops falling like rain, as it marked the beginning of civilization, for good and for bad.
A complete writing system in Chinese characters appeared in China 3,200 years ago during the Shang Dynasty,William G. Boltz, Early Chinese Writing, World Archaeology, Vol. 17, No. 3, Early Writing Systems. (Feb., 1986), pp. 420-436 (436)David N. Keightley, Art, Ancestors, and the Origins of Writing in China, Representations, No. 56, Special Issue: The New Erudition. (Autumn, 1996), pp.68-95 (68) John DeFrancis: Visible Speech. The Diverse Oneness of Writing Systems: Chinese making it what is believed to be the oldest “surviving” writing system. However, as the symbols used are predominantly pictographs, the linkages to the modern Chinese writing system would be decipherable only to linguistic archaeologists. The oracle bone inscriptions were discovered at what is now called the Yin Ruins near Anyang city in 1899. Sumerian cuneiform is currently regarded as being the oldest known writing system having originated about 3200 B.C. In a 2003 archeological dig at Jiahu in Henan province in western China, various Neolithic signs were found inscribed on tortoise shells which date back as early as the 7th millennium BC, and may represent possible precursors of the Chinese script, although there has been no link established so far.[2]
Neolithic signs

Main articles: Neolithic signs in China


The earliest Neolithic signs come from ''Jiahu'', a Neolithic site in the basin of the Yellow River in Henan province, dated to c. 6500 BC,[3] known as the Jiahu script. It has yielded turtle carapaces that were pitted and inscribed with symbols. By the discoveries at Jiahu reported here Neolithic sign use in China must now be extended backward another two millennia to c. 6500 cal BC. Sign use, however, should not be easily equated with writing, although it may represent a formative stage.
Although the earliest forms of primitive Chinese writing are no more than individual symbols and therefore cannot be considered a true written script, the inscriptions found on bones (dated to 2500–1900 BC) used for the purposes of divination from the late Neolithic Longshan () culture (c. 3200–1900 BC) are thought by some to be a proto-written script, similar to the earliest forms of writing in Mesopotamia and Egypt. It is possible that these inscriptions are ancestral to the later Oracle bone script of the Shang Dynasty and therefore the modern Chinese script, since late Neolithic culture found in Longshan is widely accepted by historians and archaeologists to be ancestral to the Bronze Age Erlitou culture and the later Shang and Zhou dynasties.
At Damaidi in Ningxia, 3172 cliff carvings dating to 6000–5000 BC have been discovered "featuring 8453 individual characters such as the sun, moon, stars, gods and scenes of hunting or grazing." These pictographs are reputed to be similar to the earliest characters confirmed to be written Chinese.[4][5]

Written styles


Sample of the cursive script by Chinese Tang Dynasty calligrapher Sun Guoting, c. 650 CE.

There are numerous styles, or scripts, in which Chinese characters can be written, deriving from various calligraphic and historical models. Most of these originated in China and are now common, with minor variations, in all countries where Chinese characters are used. These characters were used over 3,000 years ago.
The Oracle Bone and Bronzeware scripts being no longer used, the oldest script that is still in use today is the Seal Script (). It evolved organically out of the Zhou bronze script, and was adopted in a standardized form under the first Emperor of China, Qin Shi Huang. The seal script, as the name suggests, is now only used in artistic seals. Few people are still able to read it effortlessly today, although the art of carving a traditional seal in the script remains alive; some calligraphers also work in this style.
Scripts that are still used regularly are the "Clerical Script" () of the Qin Dynasty to the Han Dynasty, the Weibei (), the "Regular Script" () used for most printing, and the "Semi-cursive Script" () used for most handwriting.
The Cursive Script () is not in general use, and is a purely artistic calligraphic style. The basic character shapes are suggested, rather than explicitly realized, and the abbreviations are extreme. Despite being cursive to the point where individual strokes are no longer differentiable and the characters often illegible to the untrained eye, this script (also known as ''draft'') is highly revered for the beauty and freedom that it embodies. Some of the Simplified Chinese characters adopted by the People's Republic of China, and some of the simplified characters used in Japan, are derived from the Cursive Script. The Japanese hiragana script is also derived from this script.
There also exist scripts created outside China, such as the Japanese Edomoji styles; these have tended to remain restricted to their countries of origin, rather than spreading to other countries like the standard scripts described above.

'Oracle Bone Script' 'Seal Script' 'Clerical Script' 'Semi-Cursive Script' 'Cursive Script' 'Regular Script (Traditional)' 'Regular Script (Simplified)' 'Pinyin' 'Sino-Vietnamese' 'Meaning'
Character Ri Oracle.png
Character Ri Seal.png
Character Ri Cler.png
Character Ri Semi.png
Character Ri Cur.png
Character Ri Trad.png
Nhật Sun
Character Yuue Oracle.png
Character Yuue Seal.png
Character Yuue Cler.png
Character Yuue Semi.png
Character Yuue Cur.png
Character Yuue Trad.png
yuè Nguyệt Moon
Character Shan Oracle.png
Character Shan Seal.png
Character Shan Cler.png
Character Shan Semi.png
Character Shan Cur.png
Character Shan Trad.png
shān Sơn Mountain
Character Shui Oracle.png
Character Shui Seal.png
Character Shui Cler.png
Character Shui Semi.png
Character Shui Cur.png
Character Shui Trad.png
shuǐ Thuỷ Water
Character Yuu Oracle.png
Character Yuu Seal.png
Character Yuu Cler.png
Character Yuu Semi.png
Character Yuu Cur.png
Character Yuu Trad.png
Rain
Character Mu4 Oracle.png
Character Mu4 Seal.png
Character Mu4 Cler.png
Character Mu4 Semi.png
Character Mu4 Cur.png
Character Mu4 Trad.png
Mộc Wood
Character He Oracle.png
Character He Seal.png
Character He Cler.png
Character He Semi.png
Character He Cur.png
Character He Trad.png
Rice Plant
Character Ren Oracle.png
Character Ren Seal.png
Character Ren Cler.png
Character Ren Semi.png
Character Ren Cur.png
Character Ren Trad.png
rén Nhân Human
Character Nuu Oracle.png
Character Nuu Seal.png
Character Nuu Cler.png
Character Nuu Semi.png
Character Nuu Cur.png
Character Nuu Trad.png
Nữ Woman
Character Mu Oracle.png
Character Mu Seal.png
Character Mu Cler.png
Character Mu Semi.png
Character Mu Cur.png
Character Mu Trad.png
Mẫu Mother
Character Eye Oracle.png
Character Eye Seal.png
Character Eye Cler.png
Character Eye Semi.png
Character Eye Cur.png
Character Eye Trad.png
Mục Eye
Character Niu Oracle.png
Character Niu Seal.png
Character Niu Cler.png
Character Niu Semi.png
Character Niu Cur.png
Character Niu Trad.png
niú Ngưu Ox
Character Yang Oracle.png
Character Yang Seal.png
Character Yang Cler.png
Character Yang Semi.png
Character Yang Cur.png
Character Yang Trad.png
yáng Dương Sheep
Character Ma Oracle.png
Character Ma Seal.png
Character Ma Cler.png
Character Ma Semi.png
Character Ma Cur.png
Character Ma Trad.png
Character Ma Simp.png
Horse
Character Niao Oracle.png
Character Niao Seal.png
Character Niao Cler.png
Character Niao Semi.png
Character Niao Cur.png
Character Niao Trad.png
Character Niao Simp.png
niǎo Điểu Bird
Character Gui Oracle.png
Character Gui Seal.png
Character Gui Cler.png
Character Gui Semi.png
Character Gui Cur.png
Character Gui Trad.png
Character Gui Simp.png
guī Quy Tortoise
Character Long Oracle.png
Character Long Seal.png
Character Long Cler.png
Character Long Semi.png
Character Long Cur.png
Character Long Trad.png
Character Long Simp.png
lóng Long Chinese Dragon
Character Feng Oracle.png
Character Feng Seal.png
Character Feng Cler.png
Character Feng Semi.png
Character Feng Cur.png
Character Feng Trad.png
Character Feng Simp.png
fèng Phụng (Phượng) Chinese Phoenix


Formation of characters


Main articles: Chinese character classification, radical (Chinese character)

The early stages of the development of characters were dominated by pictograms, in which meaning was expressed directly by the shapes. The development of the script, both to cover words for abstract concepts and to increase the efficiency of writing, has led to the introduction of numerous non-pictographic characters.
The various types of character were first classified c. 100 CE by the Chinese linguist Xu Shen, whose etymological dictionary ''Shuowen Jiezi'' (說文解字/说文解字) divides the script into six categories, the ''liùshū'' (六書/六书). While the categories and classification are occasionally problematic and arguably fail to reflect the complete nature of the Chinese writing system, the system has been perpetuated by its long history and pervasive use.[1]
Excerpt from a 1436 primer on Chinese characters

Four percent of Chinese characters are derived directly from individual pictograms (), and in most of those cases the relationship is not necessarily clear to the modern reader. Of the remaining 96%, some are logical aggregates (), which are characters combined from multiple parts indicative of meaning. But most characters are pictophonetics (), characters containing two parts where one indicates a general category of meaning and the other the sound. The sound in such characters is often only approximate to the modern pronunciation because of changes over time and differences between source languages.
1. Pictograms (象形字 ''xiàngxíngzì'')

Contrary to popular belief, pictograms make up only a small portion of Chinese characters. While characters in this class derive from pictures, they have been standardized, simplified, and stylized to make them easier to write, and their derivation is therefore not always obvious. Examples include 日 (rì) for "sun", 月 (yuè) for "moon", and 木 (mù) for "tree".
There is no concrete number for the proportion of modern characters that are pictographic in nature; however, Xu Shen (c. 100 CE) estimated that 4% of characters fell into this category.
2. Pictophonetic compounds (形聲字/形声字, ''Xíngshēngzì'')

Also called ''semantic-phonetic compounds'', or ''phono-semantic compounds'', this category represents the largest group of characters in modern Chinese. Characters of this sort are composed of two parts: a pictograph, which suggests the general meaning of the character, and a phonetic part, which is derived from a character pronounced in the same way as the word the new character represents.
Examples are 河 (hé) ''river'', 湖 (hú) ''lake'', 流 (liú) ''stream'', 冲 (chōng) ''riptide'', 滑 (huá) ''slippery''. All these characters have on the left a radical of three dots, which is a simplified pictograph for a water drop, indicating that the character has a semantic connection with water; the right-hand side in each case is a phonetic indicator. For example, in the case of 冲 (chōng), the phonetic indicator is 中 (zhōng), which by itself means ''middle''. In this case it can be seen that the pronunciation of the character has diverged from that of its phonetic indicator; this process means that the composition of such characters can sometimes seem arbitrary today. Further, the choice of radicals may also seem arbitrary in some cases; for example, the radical of 貓 (māo) ''cat'' is 豸 (zhì), originally a pictograph for worms, but in characters of this sort indicating an animal of any sort.
Xu Shen (c. 100 CE) placed approximately 82% of characters into this category, while in the Kangxi Dictionary (1716 CE) the number is closer to 90%, due to the extremely productive use of this technique to extend the Chinese vocabulary.
3. Ideograph (指事字, ''zhǐshìzì'')

Also called a ''simple indicative'', ''simple ideograph'', or ''ideogram'', characters of this sort either add indicators to pictographs to make new meanings, or illustrate abstract concepts directly. For instance, while 刀 (dāo) is a pictogram for "knife", placing an indicator in the knife makes 刃 (rèn), an ideogram for "blade". Other common examples are 上 (shàng) for "up" and 下 (xià) for "down". This category is small, as most concepts can be represented by characters in other categories.
4. Logical aggregates (會意字/会意字, ''Huìyìzì'')

Also translated as ''associative compounds'', characters of this sort combine pictograms to symbolize an abstract concept. For instance, 木 (mu) is a pictogram of a tree, and putting two 木 together makes 林 (lin), meaning ''forest''. Combining 日 (rì) ''sun'' and 月 (yuè) ''moon'' makes 明 (míng) ''bright'', which is traditionally interpreted as symbolizing the combination of sun and moon as the natural sources of light.
Xu Shen estimated that 13% of characters fall into this category.
Some scholars flatly reject the existence of this category, opining that failure of modern attempts to identify a phonetic in an alleged logical aggregate is due simply to our not looking at ancient so-called secondary readings.[6] These are readings that were once common but have since been lost as the script evolved over time. Commonly given as a logical aggregate is ān 安 "peace" which is popularly said to be a combination of "building" 宀 and "woman" 女, together yielding something akin to "all is peaceful with the woman at home". However, 女 was in olden days most likely a polyphone with a secondary reading of
★ an, as may be gleaned from the set yàn 妟 "tranquil", nuán 奻 "to quarrel", jiān 姦 "licentious".
Adding weight to this argument is the fact that characters claimed to belong to this "group" are almost invariably interpreted from modern forms rather than the archaic versions which as a rule are vastly different and often far more graphically complex. However, interpretations differ greatly, as can be evidenced from thorough studies of different sources.[7]
5. Associate Transformation (轉注字/转注字, ''Zhuǎnzhùzì'')

Characters in this category originally didn't represent the same meaning but have bifurcated through orthographic and often semantic drift. For instance, 考 (kǎo) ''to verify'' and 老 (lǎo) ''old'' were once the same character, meaning "elderly person", but detached into two separate words. Characters of this category are rare, so in modern systems this group is often omitted or combined with others.
6. Borrowing (假借字, ''Jiǎjièzì'')

Main articles: Jiajie

Also called ''phonetic loan characters'', this category covers cases where an existing character is used to represent an unrelated word with similar pronunciation; sometimes the old meaning is then lost completely, as with characters such as 自 (zì), which has lost its original meaning of ''nose'' completely and exclusively means ''oneself'', or 萬 (wan), which originally meant ''scorpion'' but is now used only in the sense of ''ten thousand''.
This technique has become uncommon, since there is considerable resistance to changing the meaning of existing characters. However, it has been used in the development of written forms of dialects, notably Cantonese and Taiwanese in Hong Kong and Taiwan, due to the amount of dialectal vocabulary which historically has had no written form and thus lacks characters of its own.

Written variants


Just as Roman letters have a characteristic shape (lower-case letters occupying a roundish area, with ascenders or descenders on some letters), Chinese characters occupy a more or less square area. Characters made up of multiple parts squash these parts together in order to maintain a uniform size and shape — this is the case especially with characters written in the Sòngtǐ style. Because of this, beginners often practise on squared graph paper, and the Chinese sometimes use the term "Square-Block Characters" ().
The actual shape of many Chinese characters varies in different cultures. Mainland China adopted simplified characters in 1956, but Traditional Chinese characters are still used in Hong Kong, Macau and Taiwan. Singapore has also adopted simplified Chinese characters. Postwar Japan has used its own less drastically simplified characters since 1946, while South Korea has limited its use of Chinese characters, and Vietnam and North Korea have completely abolished their use in favour of romanized Vietnamese and Hangul, respectively.
Orthography

The nature of Chinese characters makes it very easy to produce allographs for any character, and there have been many efforts at orthographical standardization throughout history. The widespread usage of the characters in several different nations has prevented any one system becoming universally adopted; consequently, the standard shape of any given character in Chinese usage may differ subtly from its standard shape in Japanese or Korean usage, even where no simplification has taken place.
Usually, each Chinese character takes up the same amount of space, due to their block-like square nature. Beginners therefore typically practice writing with a grid as a guide. In addition to strictness in the amount of space a character takes up, Chinese characters are written with very precise rules. The three most important rules are the strokes employed, stroke placement, and the order in which they are written (stroke order). Most words can be written with just one stroke order, though some words also have variant stroke orders, which may occasionally result in different stroke counts; certain characters are also written with different stroke orders in different languages.
Common typefaces

Serif (top) and sans-serif (bottom) typefaces exist for Chinese characters in the regular script.

There are two common typefaces based on the regular script for Chinese characters akin to serif and sans-serif fonts in the West. The most popular for body text is a family of fonts called the Song typeface (宋体), also known as Minchō (明朝) in Japan, and Ming typeface (明體) in Taiwan and Hong Kong. The names of these fonts come from the Song and Ming dynasties, when block printing flourished in China. Because the wood grain on printing blocks ran horizontally, it was fairly easy to carve horizontal lines with the grain. However, carving vertical or slanted patterns was difficult because those patterns intersect with the grain and break easily. This resulted in a typeface that has thin horizontal strokes and thick vertical strokes. To prevent wear and tear, the ending of horizontal strokes are also thickened. These design forces resulted in the current Song typeface characterized by thick vertical strokes contrasted with thin horizontal strokes; triangular ornaments at the end of single horizontal strokes; and overall geometrical regularity. This typeface is similar to Western serif fonts such as Times New Roman in both appearance and function.
The other common group of fonts is called the black typeface (黑体/體) in Chinese and Gothic typeface (ゴシック体) in Japanese. This group is characterized by straight lines of even thickness for each stroke, akin to sans-serif styles such as Arial and Helvetica in Western typography. This group of fonts, first introduced on newspaper headlines, is commonly used on headings, websites, signs and billboards.

Reforms: Simplification


:''Main articles: Simplified Chinese character, Shinjitai''
Simplification in China

The use of traditional characters versus simplified characters varies greatly, and can depend on both the local customs and the medium. Because character simplifications were not officially sanctioned and generally a result of caoshu writing or idiosyncratic reductions, traditional, standard characters were mandatory in printed works, while the (unofficial) simplified characters would be used in everyday writing, or quick scribblings. Since the 1950s, and especially with the publication of the 1964 list, the PRC has officially adopted a simplified script, while Hong Kong, Macau, and the ROC retain the use of the traditional characters. There is no absolute rule for using either system, and often it is determined by what the target audience understands, as well as the upbringing of the writer. In addition there is a special system of characters used for writing numerals in financial contexts; these characters are modifications or adaptations of the original, simple numerals, deliberately made complicated to prevent forgeries or unauthorized alterations.
Although most often associated with the PRC, character simplification predates the 1949 communist victory. Caoshu, cursive written text, almost always includes character simplification, and simplified forms have always existed in print, albeit not for the most formal works. In the 1930s and 1940s, discussions on character simplification took place within the Kuomintang government, and a large number of Chinese intellectuals and writers have long maintained that character simplification would help boost literacy in China. Indeed, this desire by the Kuomintang to simplify the Chinese writing system (inherited and implemented by the CCP) also nursed aspirations of some for the adoption of a phonetic script, in imitation of the Roman alphabet, and spawned such inventions as the Gwoyeu Romatzyh.
The PRC issued its first round of official character simplifications in two documents, the first in 1956 and the second in 1964. A second round of character simplifications (known as ''erjian'', or "second round simplified characters") was promulgated in 1977. It was poorly received, and in 1986 the authorities rescinded the second round completely, while making six revisions to the 1964 list, including the restoration of three traditional characters that had been simplified: 叠 ''dié'', 覆 ''fù'', 像 ''xiàng''.
Many of the simplifications adopted had been in use in informal contexts for a long time, as more convenient alternatives to their more complex standard forms. For example, the traditional character 來 ''lái'' (come) was written with the structure 来 in the clerical script (隸書 lìshū) of the Han dynasty. This clerical form uses two fewer strokes, and was thus adopted as a simplified form. The character 雲 ''yún'' (cloud) was written with the structure 云 in the oracle bone script of the Shāng dynasty, and had remained in use later as a phonetic loan in the meaning of ''to say''. The simplified form reverted to this original structure.
Japanese kanji

Main articles: Kanji

In the years after World War II, the Japanese government also instituted a series of orthographic reforms. Some characters were given simplified forms called ''Shinjitai'' 新字体 (lit. "new character forms"; the older forms were then labelled the ''Kyūjitai'' 旧字体 , lit. "old character forms"). The number of characters in common use was restricted, and formal lists of characters to be learned during each grade of school were established, first the 1850-character ''Tōyō kanji'' 当用漢字 list in 1945, and later the 1945-character ''Jōyō kanji'' 常用漢字 list in 1981. Many variant forms of characters and obscure alternatives for common characters were officially discouraged. This was done with the goal of facilitating learning for children and simplifying kanji use in literature and periodicals. These are simply guidelines, hence many characters outside these standards are still widely known and commonly used, especially those used for personal and place names (for the former, see Jinmeiyō kanji).
Southeast Asian Chinese communities

Singapore underwent three successive rounds of character simplification. These resulted in some simplifications that differed from those used in mainland China. It ultimately adopted the reforms of the PRC in their entirety as official, and has implemented them in the educational system.
Malaysia promulgated a set of simplified characters in 1981, which were also completely identical to the Mainland China simplifications; here, however, the simplifications were not generally widely adopted, as the Chinese educational system fell outside the purview of the federal government. However, with the advent of the PRC as an economic powerhouse, simplified characters are taught at school, and the simplified characters are more commonly, if not almost universally, used. However, a large majority of the older Chinese literate generation use the traditional characters. Chinese newspapers are published in either set of characters, with some even incorporating special Cantonese characters when publishing about the canto celebrity scene of Hong Kong.
'Comparisons of Traditional characters, Simplified Chinese characters, and Simplified Japanese characters'
Traditional Chinese simp. Japanese simp. meaning
Simplified in Chinese, not Japanese electricity
open
east
car, vehicle
red
nothing
bird
hot
Simplified in Japanese, not Chinese Buddha
favour
kowtow, pray to, worship
black
ice
rabbit
older/elder sister
jealousy
Simplified in both, but differently picture, diagram
turn
广 wide, broad
bad, evil
绿 green
brain
fun
air
Simplified in both in the same way learn
body
dot, point
cat
insect
yellow
thief
country

''Note:'' this table is merely a brief sample, not a complete listing.

Dictionaries


Dozens of indexing schemes have been created for arranging Chinese characters in Chinese dictionaries. The great majority of these schemes have appeared in only a single dictionary; only one such system has achieved truly widespread use. This is the system of radicals.
Chinese character dictionaries often allow users to locate entries in several different ways. Many Chinese, Japanese, and Korean dictionaries of Chinese characters list characters in radical order: characters are grouped together by radical, and radicals containing fewer strokes come before radicals containing more strokes. Under each radical, characters are listed by their total number of strokes. It is often also possible to search for characters by sound, using pinyin (in Chinese dictionaries), zhuyin (in Taiwanese dictionaries), kana (in Japanese dictionaries) or hangul (in Korean dictionaries). Most dictionaries also allow searches by total number of strokes, and individual dictionaries often allow other search methods as well.
For instance, to look up the character where the sound is not known, e.g., 松 (pine tree), the user first determines which part of the character is the radical (here 木), then counts the number of strokes in the radical (four), and turns to the radical index (usually located on the inside front or back cover of the dictionary). Under the number "4" for radical stroke count, the user locates 木, then turns to the page number listed, which is the start of the listing of all the characters containing this radical. This page will have a sub-index giving remainder stroke numbers (for the non-radical portions of characters) and page numbers. The right half of the character also contains four strokes, so the user locates the number 4, and turns to the page number given. From there, the user must scan the entries to locate the character he or she is seeking. Some dictionaries have a sub-index which lists every character containing each radical, and if the user knows the number of strokes in the non-radical portion of the character, he or she can locate the correct page directly.
Another dictionary system is the four corner method, where characters are classified according to the "shape" of each of the four corners.
Most modern Chinese dictionaries and Chinese dictionaries sold to English speakers use the traditional radical-based character index in a section at the front, while the main body of the dictionary arranges the main character entries alphabetically according to their pinyin spelling. To find a character with unknown sound using one of these dictionaries, the reader finds the radical and stroke number of the character, as before, and locates the character in the radical index. The character's entry will have the character's pronunciation in pinyin written down; the reader then turns to the main dictionary section and looks up the pinyin spelling alphabetically.

Sinoxenic languages


Besides Japanese and Korean, a number of Asian languages have historically been written using Han characters, with characters modified from Han characters, or using Han characters in combination with native characters. They include:

Iu Mien language

Jurchen language

Khitan language

Miao language

Nakhi (Naxi) language (Geba script)

Tangut language [2], [3]

Vietnamese language (Chữ nôm)

Zhuang language (using Zhuang logograms, or "sawndip")
In addition, the Yi script is similar to Han, but is not known to be directly related to it.

Number of Chinese characters


The total number of Chinese characters from past to present remains unknowable because new ones are developed all the time. Chinese characters are theoretically an open set. The number of entries in major Chinese dictionaries is the best means of estimating the historical growth of character inventory.
'Number of characters in Chinese dictionaries'[8]DateName of dictionaryNumber of characters
100 ''Shuowen Jiezi'' 9,353
543? ''Yupian'' 12,158
601 ''Qieyun'' 16,917
1011 ''Guangyun'' 26,194
1039 ''Jiyun'' 53,525
1615 ''Zihui'' 33,179
1716 ''Kangxi Zidian'' 47,035
1916 ''Zhonghua Da Zidian'' 48,000
1989 ''Hanyu Da Zidian'' 54,678
1994 ''Zhonghua Zihai'' 85,568

|}
Comparing the ''Shuowen Jiezi'' and ''Hanyu Da Zidian'' reveals that the overall number of characters has increased 577 percent over 1,900 years. Depending upon how one counts variants, 50,000+ is a good approximation for the current total number. This correlates with the most comprehensive Japanese and Korean dictionaries of Chinese characters; the ''Dai Kan-Wa Jiten'' has some 50,000 entries, and the ''Han-Han Dae Sajeon'' has over 57,000. The latest behemoth, the ''Zhonghua Zihai'', records a staggering 85,568 single characters, although even this fails to list all characters known, ignoring the roughly 1,500 Japanese-made ''kokuji'' given in the ''Kokuji no Jiten''[9] as well as the Chu Nom inventory only used in Vietnam in past days.
Modified radicals and obsolete variants are two common reasons for the ever-increasing number of characters. There are about 300 radicals and 100 are in common use. Creating a new character by modifying the radical is an easy way to disambiguate homographs among ''xíngshēngzì'' pictophonetic compounds. This practice began long before the standardization of Chinese script by Qin Shi Huang and continues to the present day. The traditional 3rd-person pronoun ''tā'' (他 "he; she; it"), which is written with the "person radical," illustrates modifying significs to form new characters. In modern usage, there is a graphic distinction between ''tā'' (她 "she") with the "woman radical", ''tā'' (牠 "it") with the "animal radical", ''tā'' (它 "it") with the "roof radical", and ''tā'' (祂 "He") with the "deity radical", One consequence of modifying radicals is the fossilization of rare and obscure variant logographs, some of which are not even used in Classical Chinese. For instance, ''he'' 和 "harmony; peace", which combines the "grain radical" with the "mouth radical", has infrequent variants 咊 with the radicals reversed and 龢 with the "flute radical".
Chinese

It is usually said that about 3,000 characters are needed for basic literacy in Chinese (for example, to read a Chinese newspaper), and a well-educated person will know well in excess of 4,000 to 5,000 characters. Note that Chinese characters should not be confused with Chinese words, as the majority of modern Chinese words, unlike their Ancient Chinese and Middle Chinese counterparts, are multi-morphemic and multi-syllabic compounds, that is, most Chinese words are written with two or more characters; each character representing one syllable. Knowing the meanings of the individual characters of a word will often allow the general meaning of the word to be inferred, but this is not invariably the case.
In the People's Republic of China, which uses Simplified Chinese characters, the ''Xiàndài Hànyǔ Chángyòng Zìbiǎo'' (现代汉语常用字表; Chart of Common Characters of Modern Chinese) lists 2,500 common characters and 1,000 less-than-common characters, while the ''Xiàndài Hànyǔ Tōngyòng Zìbiǎo'' (现代汉语通用字表; Chart of Generally Utilized Characters of Modern Chinese) lists 7,000 characters, including the 3,500 characters already listed above. GB2312, an early version of the national encoding standard used in the People's Republic of China, has 6,763 code points. GB18030, the modern, mandatory standard, has a much higher number. The Hànyǔ Shuǐpíng Kǎoshì proficiency test covers approximately 5,000 characters.
In the ROC, which uses Traditional Chinese characters, the Ministry of Education's ''Chángyòng Guózì Biāozhǔn Zìtǐ Biǎo'' (常用國字標準字體表; Chart of Standard Forms of Common National Characters) lists 4,808 characters; the ''Cì Chángyòng Guózì Biāozhǔn Zìtǐ Biǎo'' (次常用國字標準字體表; Chart of Standard Forms of Less-Than-Common National Characters) lists another 6,341 characters. The ''Chinese Standard Interchange Code'' (CNS11643)—the official national encoding standard—supports 48,027 characters, while the most widely-used encoding scheme, BIG-5, supports only 13,053.
In Hong Kong, which uses Traditional Chinese characters, the Education and Manpower Bureau's ''Soengjung Zi Zijing Biu'' (常用字字形表), intended for use in elementary and junior secondary education, lists a total of 4,759 characters.
In addition, there is a large corpus of ''dialect characters'', which are not used in formal written Chinese but represent colloquial terms in non-Mandarin Chinese spoken forms. One such variety is Written Cantonese, in widespread use in Hong Kong even for certain formal documents, due to the former British colonial administration's recognition of Cantonese for use for official purposes. In Taiwan, there is also an informal body of characters used to represent the spoken Hokkien (Min Nan) dialect.
Japanese

Main articles: Kanji

In Japanese there are 1945 ''Jōyō kanji'' (常用漢字 lit. "frequently used kanji") designated by the Japanese Ministry of Education; these are taught during primary and secondary school. The list is a recommendation, not a restriction, and many characters missing from it are still in common use.
The one area where character usage is officially restricted is in names, which may contain only government-approved characters. Since the ''Jōyō kanji'' list excludes many characters which have been used in personal and place names for generations, an additional list, referred to as the ''Jinmeiyō kanji'' (人名用漢字 lit. "kanji for use in personal names"), is published. It currently contains 983 characters, bringing the total number of government-endorsed characters to 2928. (See also the Names section of the Kanji article.)
Today, a well-educated Japanese person may know upwards of 3500 kanji. The Kanji kentei (日本漢字能力検定試験 ''Nihon Kanji Nōryoku Kentei Shiken'' or ''Test of Japanese Kanji Aptitude'') tests a speaker's ability to read and write kanji. The highest level of the ''Kanji kentei'' tests on 6000 kanji, though in practice few people attain (or need to attain) this level.
Korean

Main articles: Hanja

In times past, until the 15th century, in Korea, Chinese was the only form of written communication, prior to the creation of Hangul, the Korean alphabet. Much of the vocabulary, especially in the realms of science and sociology, comes directly from Chinese. However, due the lack of tones in Korean, as the words were imported from Chinese, many dissimilar characters took on identical sounds, and subsequently identical spelling in Hangul. Chinese characters are sometimes used to this day for either clarification in a practical manner, or to give a distinguished appearance, as knowledge of Chinese characters is considered a high class attribute and an indispensable part of a classical education.
In Korea, 한자 ''Hanja'' have become a politically contentious issue, with some Koreans urging a "purification" of the national language and culture by totally abandoning their use. These individuals encourage the exclusive use of the native Hangul alphabet throughout Korean society and the end to character education in public schools.
In South Korea, educational policy on characters has swung back and forth, often swayed by education ministers' personal opinions. At times, middle and high school students have been formally exposed to 1,800 to 2,000 basic characters, albeit with the principal focus on recognition, with the aim of achieving newspaper-literacy. Since there is little need to use Hanja in everyday life, young adult Koreans are often unable to read more than a few hundred characters.
There is a clear trend toward the exclusive use of Hangul in day-to-day South Korean society. Hanja are still used to some extent, particularly in newspapers, weddings, place names and calligraphy. Hanja is also extensively used in situations where ambiguity must be avoided, such as academic papers, high-level corporate reports, government documents, and newspapers; this is due to the large number of homonyms that have resulted from extended borrowing of Chinese words.
The issue of ambiguity is the main hurdle in any effort to "cleanse" the Korean language of Chinese characters. Characters convey meaning visually, while alphabets convey guidance to pronunciation, which in turn hints at meaning. As an example, in Korean dictionaries, the phonetic entry for 기사 ''gisa'' yields more than 30 different entries. In the past, this ambiguity had been efficiently resolved by parenthetically displaying the associated hanja.
In the modern Korean writing system based on Hangul, Chinese characters are not used any more to represent native morphemes.
In North Korea, the government, wielding much tighter control than its sister government to the south, has banned Chinese characters from virtually all public displays and media, and mandated the use of Hangul in their place.
Vietnamese

Although now nearly extinct in Vietnamese, varying scripts of Chinese characters (''hán tự'') were once in widespread use to write the language, although ''hán tự'' became limited to ceremonial uses beginning in the 19th century. Similarly to Japan and Korea, Chinese (especially Classical Chinese) was used by the ruling classes, and the characters were eventually adopted to write Vietnamese. To express native Vietnamese words which had different pronunciations from the Chinese, Vietnamese developed the Chu Nom script which used various methods to distinguish native Vietnamese words from Chinese. Vietnamese is currently exclusively written in the Vietnamese alphabet, a derivative of the Latin alphabet.

Rare and complex characters


'', "verbose"]]
'', "poor enunciation due to snuffle"]]
" - the appearance of a dragon in flight]]
"Biáng," a kind of noodle

Often a character not commonly used (a "rare" or "variant" character) will appear in a personal or place name in Chinese, Japanese, Korean, and Vietnamese (see Chinese name, Japanese name, Korean name, and Vietnamese name, respectively). This has caused problems as many computer encoding systems include only the most common characters and exclude the less oft-used characters. This is especially a problem for personal names which often contain rare or classical, antiquated characters.
People who have run into this problem include Taiwanese politicians Wang Chien-shien (王建煊, pinyin ''Wáng Jiànxuān'') and Yu Shyi-kun (游錫堃, pinyin ''Yóu Xīkūn''), ex-PRC Premier Zhu Rongji (朱镕基 ''Zhū Róngjī''), and Taiwanese singer David Tao (陶喆 ''Táo Zhé''). Newspapers have dealt with this problem in varying ways, including using software to combine two existing, similar characters, including a picture of the personality, or, especially as is the case with Yu Shyi-kun, simply substituting a homophone for the rare character in the hope that the reader would be able to make the correct inference. Japanese newspapers may render such names and words in katakana instead of kanji, and it is accepted practice for people to write names for which they are unsure of the correct kanji in katakana instead.
There are also some extremely complex characters which have understandably become rather rare. According to Bellassen (1989), the most complex Chinese character is 𪚥 (U+2A6A5) ''zhé'' (pictured right, top), meaning "verbose" and boasting sixty-four strokes; this character fell from use around the 5th century. It might be argued, however, that while boasting the most strokes, it is not necessarily the most complex character (in terms of difficulty), as it simply requires writing the same sixteen-stroke character 龍 ''lóng'' (lit. "dragon") four times in the space for one.
The most complex character found in modern Chinese dictionaries is 齉 (U+9F49) ''nàng'' (pictured right, middle), meaning "snuffle" (that is, a pronunciation marred by a blocked nose), with "just" thirty-six strokes. The most complex character that can be input using the Microsoft New Phonetic IMA 2002a for Traditional Chinese is 龘 ''tà'' "the appearance of a dragon in flight"; it is composed of the dragon radical represented three times, for a total of 16 × 3 = 48.
In Japanese, an 84-stroke ''kokuji'' exists [4]— it is composed of three "cloud" (雲) characters on top of the abovementioned triple "dragon" character (龘). Also meaning "the appearance of a dragon in flight", it is pronounced おとど ''otodo'', ''taito'', and だいと ''daito''.
The most complex Chinese character still in use may be 'biáng' (pictured right, bottom), with 57 strokes, which refers to Biang biang noodles, a type of noodle from China's Shaanxi province. This character along with syllable ''biang'' cannot be found in dictionaries. The fact that it represents a syllable that does not exist in any Standard Mandarin word means that it could be classified as a dialectal character.
In contrast, the simplest character is 一 ''yī'' ("one") with just one horizontal stroke. The most common character in Chinese is 的 ''de'', a grammatical particle functioning as an adjectival marker and as a clitic genitive case analogous to the English ''’s'', with eight strokes. The average number of strokes in a character has been calculated as 9.8;[10] it is unclear, however, whether this average is weighted, or whether it includes traditional characters.
Another very simple Chinese logograph is the character 〇 (líng), which simply refers to the number zero. For instance, the year 2000 would be 二〇〇〇年. However, there is another way to write zero which would be 零. The logograph 〇 is a native Chinese character, and its earliest documented use is in 1247 AD during the Southern Song dynasty period, found in a mathematical text called 數術九章 (''Shǔ Shù Jiǔ Zhāng'' "Mathematical Treatise in Nine Sections"). It is not directly derived from the Hindi-Arabic numeral "0".[11] Interestingly, being round, the character does not contain any traditional strokes.

Chinese calligraphy


Chinese calligraphy of mixed styles written by Song Dynasty (1051-1108 CE) poet Mifu. For centuries, the Chinese literati were expected to master the art of calligraphy.

Main articles: Chinese calligraphy

The art of writing Chinese characters is called 'Chinese calligraphy'. It is usually done with ink brushes. In ancient China, Chinese calligraphy is one of the Four Arts of the Chinese Scholars. There is a minimalist set of rules of Chinese calligraphy. Every character from the Chinese scripts is built into a uniform shape by means of assigning it a geometric area in which the character must occur. Each character has a set number of brushstrokes, none must be added or taken away from the character to enhance it visually, lest the meaning be lost. Finally, strict regularity is not required, meaning the strokes may be accentuated for dramatic effect of individual style. Calligraphy was the means by which scholars could mark their thoughts and teachings for immortality, and as such, represent some of the more precious treasures that can be found from ancient China.

See also



List of langauges written in Chinese characters and derivatives of Chinese characters

Simplified Chinese

Traditional Chinese

Romanization of Chinese





Chinese character encoding

Chinese input methods for computers

Chinese language

Chinese world

Han unification

Chinese written language

Transliteration into Chinese characters

Chinese characters for chemical elements

Xiandai Hanyu changyong zibiao (现代汉语常用字表, List of Frequently-Used Characters in Modern Chinese)

Stroke order

Eight Principles of Yong

Earthly Branches

Heavenly Stems

East Asian calligraphy

Horizontal and vertical writing in East Asian scripts

Blissymbols (an international auxiliary logographic script)

Sinoxenic

Devanagari

Hanja Test Wikipedia

Chu-Nom Test Wikipedia

References



1. Chinese Writing: Transitions and Transformations Norman, Jerry
2. http://news.bbc.co.uk/2/hi/science/nature/2956925.stm
3. 'Earliest writing' found in China
4. http://news.bbc.co.uk/2/hi/asia-pacific/6669569.stm
5. Carvings may rewrite history of Chinese characters
6. The Origin and Early Development of the Chinese Writing System, William G. Boltz, pp. 104-110, ISBN 0-940490-18-8
7. Sound Business: The Reality of Chinese Characters, Philip Philipsen, pp. 49-76, ISBN 0-595-35629-X
8. Updated from Norman, Jerry. ''Chinese''. New York: Cambridge University Press. 1988, p. 72. ISBN 0521296536
9. Hida & Sugawara, 1990, Tokyodo Shuppan
10. Bellassen, Joël & Zhang Pengpeng (1989). ''Méthode d'Initiation à la Langue et à l'Écriture chinoises''. La Compagnie. ISBN 2-9504135-1-X
11. Joseph Needham, Science and Civilisation in China, Volume III


External links



History of Chinese writing

Unihan Database: Chinese, Japanese, and Korean references, readings, and meanings for all the Chinese and Chinese-derived characters in the Unicode character set

IRG Page

IRG working documents – many big size pdfs, some of them with details of CJK extensions

Welcome To Mojikyo Institute!- big size downloadable Mojikyo program files

Hanzi Browser: Chinese Character reference searchable by semantic/phonetic component

Chinese Characters: Suggestions about How to Learn Chinese Character

Pinyin Annotator Add pinyin on ''top'' of any Chinese text, traditional or simplified. Save output to OpenOffice Writer format. Prints nicely. Also adds pinyin to any Chinese web page.

This article provided by Wikipedia. To edit the contents of this article, click here for original source.

psst.. try this: add to faves