The Future of Computer Characters
Shi-zhao Zhang
Meteorological Bureau of Shaanxi Province
(20th CODATA International Conference POSTER ID:IT-8)
It is commonly held that language and characters serve only to exchange information. In fact they have a more important function: they are essential tools of human thought. Since the birth of humanity, language and characters have developed through four stages. In each stage the form of the carrier and its function were greatly enhanced, yet the new stage did not replace or remove the preceding one; it remained compatible with it and coexisted with its carrier. Moreover, the carrier of each new stage keeps being simplified while its function grows day by day. The following table shows the development across all stages:
| Stage | Carrier | Characteristic | New function |
| Body language | Body movement | Limited to what can be seen; transient, as if written in water | Information exchange |
| Spoken language | Air vibration | Limited to what the ear can hear; transient, as if written in water | Tool of thought |
| Character | Two-dimensional graph | Limited to what can be seen; can be transmitted and saved | Enhanced power of thought |
| Computer character | Electromagnetic state | Easily duplicated in bulk; transmitted extremely fast | Mechanization of thought |
Within the character stage, characters were first written with a pen on paper; in the age of printing, large numbers of characters were printed on paper with type. Printing improved the character's capacity for storage and dissemination, but it lost the signature and calligraphy functions that express individuality. In the information age the mainstream character has become an invisible electromagnetic state, and we must use a monitor to read it. The computer character is a reproduction of the printed character; in other words, it follows the method of "storage and code", in which each type corresponds to a different code. The shortcomings of the printed character remain as before. Furthermore, reading a printed character does not require the type, but reading a computer character requires the complete set of types (the character storage).
Written character:  Pen ──────→ article ────→ read
Printed character:  Type ──────→ article ────→ read
Computer character: storage ─→ code ─→ article ────→ read ←─ storage
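The "storage and code" chain can be illustrated with a minimal sketch. The tiny bitmap table and the function names `encode` and `render` are hypothetical, chosen only for illustration; the point is that the reader, not just the writer, needs the storage.

```python
# Hypothetical 3x5 bitmaps standing in for a character storage (font).
GLYPH_STORAGE = {
    0x4C: ["#..", "#..", "#..", "#..", "###"],  # 'L'
    0x54: ["###", ".#.", ".#.", ".#.", ".#."],  # 'T'
}


def encode(text):
    """Writing side: each character is reduced to its numeric code."""
    return [ord(ch) for ch in text]


def render(codes, storage):
    """Reading side: every code must be looked up in the storage."""
    pictures = []
    for code in codes:
        glyph = storage.get(code)
        if glyph is None:
            # The code is valid, but it cannot be read without its type.
            raise KeyError(f"no glyph stored for code U+{code:04X}")
        pictures.append("\n".join(glyph))
    return "\n\n".join(pictures)


article = encode("LT")
print(render(article, GLYPH_STORAGE))
```

Rendering a code that is absent from the storage, for example `render(encode("X"), GLYPH_STORAGE)`, raises an error even though the article itself is intact.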
If there were only alphabetic scripts (English, for example), this would not matter. But Chinese has a great variety of types, so the problem appeared there first, and in the Internet age it has grown more and more serious. The need to use the characters of different countries on the same page makes Unicode necessary. Today's hardware can easily afford the growing code length and storage. The real problem is that there are too many kinds of characters and they keep developing, so they can never be collected once and for all. Because the code table cannot be arranged in advance, and because codes that have already been assigned must stay unchanged so that existing character data does not become invalid, the table has been disordered from the beginning.
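A minimal sketch of why an append-only code table ends up ordered by collection history. The class `CodeTable` and its numbering are hypothetical and are not how any real standard assigns codes.

```python
class CodeTable:
    """Hypothetical append-only code table: assigned codes never change."""

    def __init__(self, first_code=1):
        self._next = first_code
        self._code_of = {}

    def collect(self, char):
        """Return the existing code, or assign the next free one."""
        if char not in self._code_of:
            self._code_of[char] = self._next
            self._next += 1
        return self._code_of[char]

    def in_code_order(self):
        """Characters sorted by code, i.e. by the order they were collected."""
        return sorted(self._code_of, key=self._code_of.get)


table = CodeTable()
for ch in "林水火":   # first collection round
    table.collect(ch)
for ch in "木森":     # related characters discovered later
    table.collect(ch)

print(table.in_code_order())
```

Here 木 is collected after 林, so it receives a later code than the character built from it, and because old data must stay valid the table cannot be rearranged afterwards.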
On the other hand, as the number of types in storage grows, it becomes harder to select the character one needs. Even Chinese alone depends entirely on special input methods, and although input methods abound ("ten thousand codes galloping"), some rare types still cannot be entered; using several kinds of characters at once is harder still. There are about 100,000 Chinese types, and storage has now reached about 80,000 of them, yet missing characters still occur. Because the quantity is so large, mistakes are easy to make, and so-called computer wrong characters and duplicate codes have been discovered. We find that the huge Unicode is not an efficient method, so methods such as IDS and CDL have been put forward. These methods construct new characters from old ones; in principle they need nothing more than a few etymons, or merely strokes. But they require additional structural characters and data on the positions of the components. Because the resulting string for a new character is too long, it cannot be used as the character's code.
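A minimal sketch of IDS-style composition, showing how the description string grows as components are expanded. The decomposition table and the function `expand` are hypothetical; the structural characters U+2FF0 (left to right) and U+2FF1 (top to bottom) are the Unicode ideographic description characters.

```python
# Hypothetical decomposition table: character -> description string.
# U+2FF0 (⿰) places components left to right, U+2FF1 (⿱) top to bottom.
DECOMPOSITION = {
    "林": "⿰木木",
    "森": "⿱木林",
}


def expand(ids, table):
    """Recursively replace every decomposable component with its description."""
    return "".join(
        expand(table[ch], table) if ch in table else ch
        for ch in ids
    )


full = expand("森", DECOMPOSITION)
print(full, len(full))
```

Even for this simple character the fully expanded string takes five code points, two of them structural characters, against one code point for the character itself.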
          Should be digitized here          Digitized here now
            ┌──────────→ letter ──────────↓
Carve ──→ stroke ──→ etymon ──→ Chinese character ───→ article
            └──────────→ pattern ──────────↑
The storage-free Chinese system that I developed in 1984 needs only 240 etymons and no additional characters (only a few Chinese characters need a structural character); the etymon string itself is the code of the Chinese character. In 2003 a demonstration program for Chinese characters on the PC was developed; it uses only 50 kinds of strokes to build up all Chinese characters, including ones that appear in no large dictionary. I think the way forward for computer Chinese is to build up characters from strokes, and then all the characters of the whole world can be built up from a few element strokes. In a first test, 7 element strokes were used to build up all ASCII characters and many maps and charts. In fact the 7 element strokes are 7 subroutines. A character is not a picture; it is made up of single lines (strokes). With a foundation of research into the strokes of every country's characters, and with advanced software and hardware, it will become a reality that we can write all character types with a keyboard rather than an electronic pen. This is the future of the computer character.
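A minimal sketch of the idea that element strokes are subroutines and that a character is a list of stroke calls with no stored bitmap. The text does not list the 7 element strokes, so the three strokes, the grid size, and the character definitions below are hypothetical placeholders.

```python
def horizontal(x, y, n):
    """Stroke subroutine: n cells to the right, starting at (x, y)."""
    return [(x + i, y) for i in range(n)]


def vertical(x, y, n):
    """Stroke subroutine: n cells downward, starting at (x, y)."""
    return [(x, y + i) for i in range(n)]


def falling(x, y, n):
    """Stroke subroutine: n cells diagonally down and to the right."""
    return [(x + i, y + i) for i in range(n)]


STROKES = {"h": horizontal, "v": vertical, "f": falling}

# Each character is a sequence of (stroke, x, y, length) calls.
CHARACTERS = {
    "L": [("v", 0, 0, 5), ("h", 0, 4, 3)],
    "T": [("h", 0, 0, 3), ("v", 1, 0, 5)],
    "H": [("v", 0, 0, 5), ("v", 2, 0, 5), ("h", 0, 2, 3)],
    "N": [("v", 0, 0, 5), ("v", 4, 0, 5), ("f", 0, 0, 5)],
}


def draw(name, width=3, height=5):
    """Run the stroke calls for one character and return a text picture."""
    grid = [["."] * width for _ in range(height)]
    for stroke, x, y, n in CHARACTERS[name]:
        for cx, cy in STROKES[stroke](x, y, n):
            grid[cy][cx] = "#"
    return "\n".join("".join(row) for row in grid)


print(draw("L"))
print()
print(draw("N", width=5))
```

Adding a new character in this sketch means adding one entry of stroke calls to `CHARACTERS`; the stroke subroutines themselves stay the same.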
[Figure: characters built from strokes that are not included in any large dictionary]
[Figure: the 50 kinds of strokes]

Key words: language and characters, Unicode, encoding, storage of character types, computer characters
About the author: Shi-zhao Zhang, male, born 22 April 1937, retired senior engineer. He developed a Chinese system without type storage in 1984, and his research argues that this is the correct method for handling Chinese on computers. His website was built in 2001; for details see http://www.chancezoo.net and http://www.chancezoo.org
Telephone: 029-86239494
Email: mzsgls@pub.xaonline.com