Chinese Character Article Index for
Chinese Character
Articles about
Chinese Character
Website Links For
Chinese
 

Information About

Chinese Character




Chinese character () are Logogram s used in writing Chinese , a portion of Japanese , Korean , and ancient Vietnamese . Its possible precursors appeared as early as 8000 years ago and a complete writing system in Chinese character was developed 3500 years ago, namely the oldest surviving writing system.

In Chinese tradition, characters only correspond to single syllable, which is different from phonemes represented by a letter in an alphabet. Contrary to popular belief, only 4% of Chinese characters are Pictograms (象形字), the rest 95% are Compound Indicatives (會意字) and Phono-semantic Compound (形聲字), which are characters containing two parts, one indicates meanings and the other the sound. Each Word is often composed of two or more characters.

Numbers of chinese characters are nearly 50000, although large number is rarely used variants accumulated through the history. In China, literacy for working citizen is defined as knowledge of 2000 characters. {Link without Title}

Though under the same name, the shape of Chinese characters is varied in different culture. China adopted Simplified Characters in 1956, but Traditional Chinese Characters are used in Taiwan and Hong Kong . Japan has also its own simplfied chinese since 1946 while Korean has limited the use of Chinese characters and Vietnamese completely abolished the use of characters in romanized Vietnamese.

"]]


HISTORY

According to legend, Chinese characters was invented by Cangjie (c. 2650 BC), bureaucrat under Chinese legendary emperor, Huangdi . The fable told that Changjie was hunting in Mount Yangxu (Shanxi today) there he saw a tortoise whose veins catched his curiosity. Inspired by those viens, he studied animals of the world, landscape of the earth, stars of the sky, and invented a symbol system called '''zi''-- Chinese characters. On the day of his success, ghost mourned and crop rained, as it marked the new stage for civilization, for good or bad.

Modern archaeological evidence, however, suggested an earlier root of Chinese characters. The earliest evidence for what might be writing comes from ''Jiahu'' (賈湖), a Neolithic site in the basin of the Yellow River in Henan Province, dated to c. 6500 BC.

It has yielded turtle carapaces that were pitted and inscribed with symbols. The ''Chengziyai'' (城子崖) site in Longshan Townshi , Shandong has produced fragments of inscribed bones used to divine the future, dating to 2500 - 1900 BC, and symbols on pottery vessels from Dinggong are thought by some scholars to be an early form of writing. Symbols of a similar nature have also been found on pottery shards from the Liangzhu Culture of the lower Yangtze valley.

Although the earliest forms of primitive Chinese writing are no more than individual symbols and therefore cannot be considered a true written script, the inscriptions found on bones (dated to 2500 - 1900 BC) used for the purposes of divination from the late Neolithic Longshan Culture (c. 3200 - 1900 BC) are thought by some to be a proto-written script, similar to the earliest forms of writing in Mesopotamia and Egypt . It is possible that these inscriptions are ancestral to the later Oracle Bone Script of the Shang Dynasty and therefore the modern Chinese script, since late Neolithic culture found in Longshan is widely accepted by historians and archaeologists to be ancestral to the bronze age Erlitou Culture and the later Shang and Zhou Dynasties.

The oldest Chinese inscriptions that are indisputably writing are the Oracle Bone Script (甲骨文 ''jiǎgǔwén'', lit. shell-bone-script), a well-developed writing system of the Shang Dynasty (or Yin (殷) Dynasty), attested from about 1600 BC (from Zhengzhou ) and 1300 BC (from Anyang ), along with a very few logographs found on pottery shards and cast in bronzes, known as the Bronze Script , which is very similar to but more complex and pictorial than the Oracle Bone Script. Only about 1,400 of the 2,500 known Oracle Bone logographs can be identified with later Chinese characters and therefore easily read. However, it should be noted that these 1,400 logographs include most of the commonly used ones.


WRITTEN STYLES

, 220 BC ]]
The earliest Chinese characters are the Oracle Bone Script of the late Shang Dynasty and the Bronze Scripts (金文 ''jīnwén'') of the Shang and Zhou Dynasties. These scripts are no longer in use, and are of purely academic interest.

The first script that is still in use today, albeit restricted use, is the 篆書, 篆书 ''zhuànshū'' (" Seal Script "). It evolved organically out of the Zhou bronze script, and was adopted in a standardized form under the first emperor of China, Qin Shi Huang . The seal script, as the name suggests, is now only used in artistic seals. Few people are still able to read it effortlessly today, although the art of carving a traditional seal in the seal script remains alive in China and Japan today; some calligraphers also work in this style.

Scripts that are still used regularly are the " Clerical Script " (隸書, 隶书 ''lìshū'') of the Qin to Han dynasties, the "Wei monumental script" (魏碑 ''wèibēi'', see Weibei ), the " Regular Script " (楷書, 楷书 ''kǎishū'') used for most printing, and the " Semi-cursive Script " (行書, 行书 ''xíngshū'') used for most handwriting.

The ''cursive'' (草書, 草书 ''cǎoshū'', literally " Grass Script ") is not in general use, and is a purely artistic calligraphic style. The basic character shapes are suggested, rather than explicitly realized, and the abbreviations are extreme. Despite being cursive to the point where individual strokes are no longer differentiable and the characters often illegible to the untrained eye, this script (also known as ''draft'') is highly revered for the beauty and freedom that it embodies. Many Simplified Chinese Characters according to the CCP 1964 list are derived from ''cursive'' simplifications of traditional characters.

Just as Roman Letter s have a characteristic shape (lower-case letters occupying a roundish area, with ascenders or descenders on some letters), Chinese characters occupy a more or less square area. Characters made up of multiple parts squash these parts together in order to maintain a uniform size and shape. Because of this, beginners often practise on squared graph paper, and the Chinese sometimes call Han characters ''fāngkuài zì'' (lit. "square-block-characters").


FORMATION OF CHARACTERS


, ca. 800 BC ]]
''Main article: Radical ''

Main Article: Chinese Character Classification


The Six-Principles Theory of Chinese Script

Chinese scholars have traditionally classified Han characters according to six principles (六書 liùshū) by structural composition or usage. This six-way classification is historical and problematic; it is no longer the focus of modern scholars. However, its pervasiveness results in its deserving an explanation.

#The first type, and the type most often associated with Chinese writing, are Pictogram s (象形字 ''xiàngxíngzì''), which are pictorial representations of the Morpheme represented, e.g. 山 ''shān'' for "mountain". However, these are only a tiny fraction of Chinese characters.
Contrary to popular belief, only a small number of Chinese characters are at the time the character was created —the word "按" ''àn'' meaning "to press down" contains "安" ''ān'' (peace), which serves as its phonetic component, and "手" ''shǒu'' (hand), which indicates that the action is done using one's hand.

#The second type are Simple Indicatives or ''' Ideogram s''' (指事字 ''zhǐshìzì'') that attempt to graphicalize abstract concepts, such as 上 ''shàng'' "up" and 下 ''xià'' "down". Also considered ideograms are pictograms with an ideographic indicator; for instance, 刀 ''dāo'' is a pictogram meaning "knife", while 刃 ''rèn'' is an ideogram meaning "blade". Many scholars object to the term '''ideogram''' or '''ideograph''' on the grounds that these words suggest such graphs convey ideas directly, without association to spoken words. The terms ''indicative'' or ''simple indicative'' avoid this concern.
#''Huìyìzì'' (會意字 lit. "get-meaning-words"), associative compounds, '''semantic-semantic compounds''' or '''compound indicatives''' are those in which each element (semantic component) of the character hints at the meaning or the two interact to show meaning.
#''Xíngshēngzì'' (形聲字) are semantic-phonetic compounds, '''phono-semantic compounds''' or '''pictophonetic compounds''' in which one component (the semantic component) indicates the general meaning of the character, and the other (the phonetic) hints at the pronunciation. An example is 樑 ''liáng'', where the phonetic 梁 ''liáng'' indicates the pronunciation of the character and the semantic 木 ''mù'' ("wood") its meaning of "supporting beam". Characters of this type constitute the vast majority, over 90%, of Chinese logograms.
#''Zhuǎnzhùzì'' (轉註字 changed-annotation characters) are characters which were originally the same character but have bifurcated through Orthographic and often Semantic drift. For instance, 考 ''kǎo'' (to test) and 老 ''lǎo'' (old) were once the same character, meaning "elder person".
#''Jiǎjièzì'' (假借字 lit. "improvised-borrowed-words") are known as Phonetic Loan Characters , and come into use when a character with the same or a similar sound is "borrowed"; occasionally, the new meaning can supplant the old meaning. 自 ''zì'' used to be a pictographic word meaning "nose", but was borrowed to mean "self". It is now used almost exclusively to mean "self", while the "nose" meaning survives only in set phrases and more archaic compounds. Because of their derivational process, the entire set of Japanese Kana can in purely formal terms be considered to be Jiǎjièzì, hence the name ''kana'' (仮名; 仮 is a simplified form of 假).

The first two types above are ''dútǐ'' (獨體 single-body), meaning that the character was created independently of other Chinese characters. Although the perception of most Westerners is that most characters were derived in ''single-body'' fashion, pictograms and simple indicatives actually take up but a small proportion of Chinese Logogram s. More productive for the Chinese script were the next two, ''hétǐ'' (合體 '''compound'''), methods, i.e. the character was created from assembling different characters. Despite being called "compounds", these logograms are still single characters, and are written to take up the same amount of space as any other logogram. Note also that the first four types refer to how a character's structure comes about; the fifth is a comment on the relationship between two characters, while the sixth refers to the loan usage of a character. The six types are thus not mutually exclusive, as a character could be created in one way, and yet borrowed phonetically, i.e., belonging to two types.

The most productive method of Chinese writing, the semantic-phonetic compound (''xíngshēngzì'' 形聲字), was made possible because the phonetic system of Chinese allowed for generous ( Homonymy ), and because in consideration of phonetic similarity Tone was generally ignored, as were the final consonants of the characters in consideration, at least according to theory following from reconstructed Middle Chinese pronunciation. Note that due to the long period of language evolution, such component "hints" within characters as provided by the '''semantic-phonetic compounds''' are sometimes useless and may be misleading in modern usage. This is particularly true in non-Chinese languages (with Japanese being a notable exception for its phonological conservatism).


Section Headers

Each character has a component called a ''section header'' (部首 bùshǒu) under which it is traditionally indexed in Chinese dictionaries. These are often called radicals in English, a misnomer arising from the mistaken conception that every ''bùshǒu'' was a kind of semantic root, or "radix" (when in fact some are not, while some play that role in only a portion of the characters listed thereunder). For clarity, one may avoid the term "radical" entirely, instead using '''bùshǒu''' for the dictionary index component, and '''semantic''' component for any portion of a character bearing meaning.

The number of section headers in dictionaries is usually 214. Full characters are ordered according to their section header, and are then subcategorised by their total number of Stroke s.

In modern dictionaries, characters are generally classified in sections based on shared graphic components called 部首 ''bùshǒu'', literally "section headers". These are often also called radicals. However, the term 'radical' has also been used to refer to (and literally means) the semantic root of a character. Since not every section header plays a role as semantic root in every character listed under it, the translation of 部首 ''bùshǒu'' as ''radical'' is logically incorrect. Furthermore, it is not correct to automatically assume the remainder of a compound is its phonetic element, though according to research done by John DeFrancis , more than 95% of all characters were formed on the "radical + phonetic" principle.

For example, 各 gè is listed under the 部首 ''bùshǒu'' of 口 kǒu ''mouth''. However, the compound has nothing to do with the concept of, or graph for, ''mouth''. The oracle bone form of this compound, although very similar to the modern graph, clearly shows 夂, a ''foot'' (inverted form of 止 zhǐ, originally a ''foot'') arriving at or entering a U- or 口-shaped object, representing perhaps a dwelling or a walled city. The original attested meaning is ''to arrive'' (the graph was only later borrowed for the modern meaning of ''each''). It is clear that 口 is merely a convenient graphic extraction for the purpose of dictionary classification, and that even in its original function as perhaps a dwelling, it was not ''the radix'' or semantic root of the entire graph in the manner of European languages. Rather, the compound is formed by the principle of 會意 ''huìyì'', variously translated as ''compound indicative'', ''associative compound'', ''logical aggregate'', ''joined meaning'', etc.

Some 部首 ''bùshǒu'' are not even etymologically components of many of the graphs listed under them at all. Even a cursory examination of characters listed under the first six 部首 ''bùshǒu'' (一丨丶丿乙亅) will show that they do not play the role of ''radix'' in most of them. Under 一 yī 'one', for example, are listed 上 下 且 丘 丙 丑 and 並, all of which are present in the oracle bones as monosomatic graphs completely unrelated to the meaning of 'one'. 一 clearly is merely a convenient graphical extraction under which to list them. 部首 ''bùshǒu'' cannot therefore be rationally translated as ''radical'' (meaning semantic 'radix' or root), even though this is the most common term now.


ORTHOGRAPHY

Usually Chinese characters each take up the same amount of space, due to their block, square nature. One of the easiest ways for beginners to ensure a proper push-off is, hence, to practise writing with a grid as a guide, which is indeed standard practice in primary schools for both normal exercises and calligraphy training. In addition to strictness in the amount of space a character takes up, Chinese characters are written with very precise rules. The three most important rules are the strokes employed, stroke placement, and the order in which they are written ()''. Most Punctuation marks were adopted from the West, but there are a few exceptions: for example, names of books are marked with a wavy line drawn to their right in vertical text, or enclosed in a special double pointed bracket in horizontal text.

Common errors while writing Chinese characters include incorrect stroke direction, incorrect stroke order, incorrect stroke length relative to other strokes, and incorrect placement of strokes relative to other strokes, as well as the weight given to the different parts of a stroke. Each mistake is highly visible to the literate eye, and such mistakes are often shunned, being marks of illiteracy or incompetence. In a culture that values scholarship as its highest virtue, such attributions are highly undesirable. Because of this strictness in not only the image of the character, but how the image is produced, it is considered by many the most difficult to learn properly.

Due to the long history of China, as well as many stylistic variations that have developed and the many attempts by past rulers to standardise writing, some characters have multiple forms. The characters themselves can be considered separate, but often are merely derivatives of each other in that their composition is of the same root. They are often not considered simplifications, as their stroke count is sometimes the same, and often lessened only but a slight amount. Probably the most famous today is the character for sword (劍), where the radical (on the right) is knife (刀). The same word can be written with different forms for the radical, including using 刃 or 刀 itself.

The use of traditional characters versus simplified characters varies greatly, and can depend on both the local customs and the medium. Because character simplifications were not officially sanctioned and generally a result of Caoshu writing or idiosyncratic reductions, traditional, standard characters were mandatory in printed, and especially official, works, while the (unofficial) simplified characters would be used in everyday writing, or quick scribblings. Since the 1950's and especially with the publication of the 1964 list, the PRC has officially adopted a simplified script, while Hong Kong , Macau , and Taiwan retain the use of the traditional characters. There is no absolute rule for using either system, and often, it is determined by what the target audience understands, as well as the upbringing of the writer. In addition there is a special System Of Characters Used For Writing Numerals in financial contexts; these characters are modifications or adaptations of the original, simple numerals, deliberately made complicated to prevent forgeries or unauthorised alterations.


REFORMS: SIMPLIFICATION




Although most often associated with the PRC , character simplification predates the 1949 communist victory. Caoshu, cursive written text, almost always includes character simplification, and simplified forms have always existed in print, albeit not for the most formal works. In the 1930s and 1940s , discussions on character simplification took place within the Kuomintang government, and a large number of Chinese intellectuals and writers have long maintained that character simplification would help boost literacy in China. Indeed, this desire by the Kuomintang to simplify the Chinese writing system (inherited and implemented by the CCP ) also nursed aspirations of some for the adoption of a phonetic script, in imitation of the Roman Alphabet , and spawned such inventions as the Gwoyeu Romatzyh .

The PRC issued its first round of official character simplifications in two documents, the first in 1956 and the second in 1964 . A second round of character simplifications (known as ''erjian'', or "second round simplified characters"), were promulgated in 1977 . It was poorly received, and in 1986 the authorities rescinded the second round completely, while making six revisions to the 1964 list, including the restoration of three traditional characters that had been simplified: 叠 ''dié'', 覆 ''fù'', 像 ''xiàng''.

Many of the simplifications adopted had been in use in informal contexts for a long time, as more convenient alternatives to their more complex standard forms. For example, the traditional character 來 ''lái'' (come) was written with the structure 来 in the Clerical Script (隸書 lìshū) of the Han dynasty. This clerical form uses two fewer strokes, and was thus adopted as a simplified form. And the character 雲 ''yún'' (cloud) was written with the structure 云 in the Oracle Bone Script of the Shāng Dynasty , and had remained in use later as a phonetic loan in the meaning of ''to say''. The simplified form reverted to this original structure.


Southeast Asian Chinese communities

Singapore underwent three successive rounds of character simplification. These resulted in some simplifications that differed from those used in mainland China. It ultimately adopted the reforms of the PRC in their entirety as official, and has implemented them in the Educational System .

Malaysia promulgated a set of simplified characters in 1981 , which were also completely identical to the Mainland China simplifications; here, however, the simplifications were not generally widely adopted, as the Chinese educational system fell outside the purview of the Federal Government . However, with the advent of the PRC as an economic powerhouse, simplified characters are taught at school, and the simplified characters are more commonly, if not almost universally, used. However, a large majority of the older Chinese literate generation use the traditional characters. Chinese newspapers are published in either set of characters, with some even incorporating special Cantonese characters when publishing about the canto celebrity scene of Hong Kong.


Japanese Kanji

In the years after World War II , the Japanese government also instituted a series of Orthographic reforms. Some characters were given simplified forms called '' Shinjitai '' 新字体 (lit. "new character forms"; the older forms were then labelled the ''Kyūjitai'' 旧字体 , lit. "old character forms"). The number of characters in common use was restricted, and formal lists of characters to be learned during each grade of school were established, first the 1850-character '' Toyo Kanji '' 当用漢字 list in 1945, and later the 1945-character '' Jōyō Kanji '' 常用漢字 list in 1981. Many variant forms of characters and obscure alternatives for common characters were officially discouraged. This was done with the goal of facilitating learning for children and simplifying kanji use in literature and periodicals. These are simply guidelines, hence many characters outside these standards are still widely known and commonly used, especially those used for personal and place names (for the former, see Jinmeiyō Kanji ).

''Note:'' this table is merely cursory, and is not a complete listing.


DICTIONARIES

The design and use of a Dictionary of Chinese characters presents interesting problems. Dozens of indexing schemes have been created for the Chinese characters. The great majority of these schemes — beloved by their inventors but nobody else — have appeared in only a single dictionary; only one such system has achieved truly widespread use. This is the system of Radicals .

Chinese character dictionaries often allow users to locate entries in several different ways. Many Chinese, Japanese, and Korean dictionaries of Chinese characters list characters in radical order: characters are grouped together by radical, and radicals containing fewer Stroke s come before radicals containing more strokes. Under each radical, characters are listed by their total number of strokes. In Japanese and Korean dictionaries, it is usually possible to search for characters by sound, using Kana and Hangul . Most dictionaries also allow searches by total number of strokes, and individual dictionaries often allow other search methods as well.

For instance, to look up the character 松 (pine tree) in a typical dictionary, the user first determines which part of the character is the radical (here 木), then counts the number of strokes in the radical (four), and turns to the radical index (usually located on the inside front or back cover of the dictionary). Under the number "4" for radical stroke count, the user locates 木, then turns to the page number listed, which is the start of the listing of all the characters containing this radical. This page will have a sub-index giving remainder stroke numbers (for the non-radical portions of characters) and page numbers. The right half of the character also contains four strokes, so the user locates the number 4, and turns to the page number given. From there, the user must scan the entries to locate the character he or she is seeking. Some dictionaries have a sub-index which lists every character containing each radical, and if the user knows the number of strokes in the non-radical portion of the character, he or she can locate the correct page directly.



Another popular dictionary system is the Four Corner Method , where characters are classified according to the "shape" of each of the four corners.

Most Chinese-English dictionaries and Chinese dictionaries sold to English speakers use the radical lookup method combined with an alphabetical listing of characters based on their Pinyin Romanization system. To use one of these dictionaries, the reader finds the radical and stroke number of the character, as before, and locates the character in the radical index. The character's entry will have the character's pronunciation in pinyin written down; the reader then turns to the main dictionary section and looks up the pinyin spelling alphabetically, just as if it were an English dictionary.

This system has also been reborrowed by Chinese-language dictionary editors, giving rise to dictionaries with the traditional radical-based character listings in a section at the front, while the main body of the dictionary carries character listings by their pronunciation listed alphabetically according to their pinyin spelling.


DERIVATIVES

Besides Korean and Japanese, a number of Asian languages have historically been written with Han characters, or with characters modified from Han characters. They include:


The ) used an ideographic script consisted of original characters with a few Han borrowings.

In addition, the Yi Script is similar to Han, but is not known to be directly related to it.


NUMBER OF CHARACTERS

The question of how many characters there are is still the subject of debate. In the 18th century, European scholars claimed the total tally to be about 80,000. This number, however, is thought to be exaggerated as the character count varies by dictionary and its comprehensiveness. For example, the '' Kangxi Dictionary '' lists about 40,000 characters, while the modern ''Zhonghua Zihai'' lists in excess of 80,000 (the most comprehensive Korean hanja dictionary '' Han-Han Dae Sajeon '' consists about 60,000 characters, while Japanese competing kanji dictionary '' Dai Kan-Wa Jiten '' lists 50,000 entries). One reason for the overwhelming number of characters is due to the existence of rarely-occurring variant and obscure characters (many of which are unused, even in Classical Chinese ). Note, however, that no two characters are ever Contextually identical.

The large number of Chinese characters is due to their Logograph ic nature — for every Morpheme a glyph is required, and variant characters have at times developed for the same morpheme. Furthermore, in the centuries after the standardisation of the Chinese script by Qin Shi Huang to the '' Zhuanshu '', the Literati multiplied the total stock of characters by modifying extant characters ''à la'' xíngshēngzì (形聲字) method—by altering the radical of a homonym character to provide a distinct glyph for either new words or words that had till then been Homograph s. It has also been claimed that the sheer number of characters is used as a way to separate scholars from the ordinary, and perhaps even to keep certain texts from being read by all but the most scholarly.


ACHIEVEMENT OF LITERACY

It is usually said that about 2,000 characters are needed for basic literacy in Chinese (for example, to read a Chinese newspaper), and a well-educated person will know well in excess of 4,000 to 5,000 characters. (According to modern statistics, if one has learned just the 100 most commonly used Chinese logographs, one can read as much as 40% of all modern texts on average. If one has learned the 200 most commonly used Chinese logographs, then one can understand on average 54% of all modern texts. If one learned the 1000 most commonly used Chinese logographs, then one can on average read 89% of all modern texts. With 2000 logographs one can read 97% of all moder