One document matched: draft-rfced-info-demchenko-00.txt
INTERNET DRAFT EXPIRES APRIL 1998 INTERNET DRAFT
Network Working Group Yu. Demchenko
INTERNET DRAFT KPI
Category: Informational August 1997
Registration of a Ukrainian Cyrillic Character Set KOI8-RU
(as extention to Russian KOI8-R and ISO-IR-111)
<draft-rfced-info-demchenko-00.txt>
Status of This Memo
This document is an Internet-Draft. Internet-Drafts are working
documents of the Internet Engineering Task Force (IETF), its
areas, and its working groups. Note that other groups may also
distribute working documents as Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six
months and may be updated, replaced, or obsoleted by other
documents at any time. It is inappropriate to use Internet-
Drafts as reference material or to cite them other than as
"work in progress."
To learn the current status of any Internet-Draft, please check
the "1id-abstracts.txt" listing contained in the Internet-
Drafts Shadow Directories on ftp.is.co.za (Africa),
nic.nordu.net (Europe), munnari.oz.au (Pacific Rim),
ds.internic.net (US East Coast), or ftp.isi.edu (US West Coast).
Distribution of this document is unlimited.
1. Introduction
This document provides information about widely
used in Ukrainian Internet community character set for mail and news
exchange as well as for presentation WWW information resources in
Ukrainian language.
Though the proposed character set "KOI8-RU" is not currently an
international standard, there is large Internet user community
(including Ukraine and worldwide Ukrainian speaking community)
supporting it.
"KOI8-RU" is de-facto standard accepted by all Ukrainian community in
the Internet and unofficially published at many sites (F.E.,
ftp://ftp.ua.net/pub/info/encodings/koi8-u/ukr_chars_in_koi8-
u_and_others.txt; ftp://ftp.gu.kiev.ua/pub/koi8-u/ukr_chars_in_koi8-
u_and_others.txt; http://cad.ntu-kpi.kiev.ua/multiling/KOI8-U.html).
Ukrainian language is the 20th among the world's languages (http://
www.isoc.org:8080/langues/iso639.htm) and supported not only in
Ukraine as national state but among Ukrainian communities over the
world.
KOI8-RU should be registered to support and facilitate general and
cultural infromation content development and dessimination. Support
of Ukrainian language in new software product is restrained by absent
of oficially registered and widely published de-facto used Ukrainian
charset.
One of the problem now is that all old codepages ISO-IR-111, ISO
8859-5 doesn't include new Ukr. letter GHE (with upturn). Now it's
registered in UNICODE 2.0.14 as Cyrillic GHE with upturn (0490 -
capital, 0491 - small). It is used in more than 25 ukrainian words
and carry in some cases specific national features.
Demchenko [Page 1]
I/D
MIME character set name: koi8-ru
Published specification:
This standard is unpublished but based on several published
standards: first of all, RFC1489 (it is fully complaint in all
russian letters), ISO 8859-5, ISO-IR-111, UNICODE 2.0.14.
KOI8-RU is compatible with KOI8-R in all Cyrillic Letters and
completes it with four Ukrainian (#164, #180 - ukr. ie, #166, #182 -
ukr. i, #167, #183 - ukr. yi, #173, #189 - ukr. ghe with upturn) and
one Byelorussian (#174, #190 - byelorussian short u) letters which
locations are complaint with ISO-IR-111.
All FORMS except positions ocupied by Ukrainian and Byelorussian
letters and Bullets in positions #148, #149, #158 coincide with KOI8-
R.
Positions #147, #150-153, #155-#157, #159 are used for important
characters which are currently missing from ISO-IR-111.
The description of all characters from the upper half of the table is
compliance with ISO 10646 (Unicode). All Russian letters places have
been left at their original KOI8-R places. Introduced new ukrainian
letters ocupy positions where they are used as standard-de-facto in
Ukrainian language applications and newsgroups exchange accepted all
Ukrainian language community.
Demchenko [Page 2]
I/D
<decimal> <hex-code> <Unicode> <description>
128 80 U2500 FORMS LIGHT HORIZONTAL
129 81 U2502 FORMS LIGHT VERTICAL
130 82 U250C FORMS LIGHT DOWN AND RIGHT
131 83 U2510 FORMS LIGHT DOWN AND LEFT
132 84 U2514 FORMS LIGHT UP AND RIGHT
133 85 U2518 FORMS LIGHT UP AND LEFT
134 86 U251C FORMS LIGHT VERTICAL AND RIGHT
135 87 U2524 FORMS LIGHT VERTICAL AND LEFT
136 88 U252C FORMS LIGHT DOWN AND HORIZONTAL
137 89 U2534 FORMS LIGHT UP AND HORIZONTAL
138 8A U253C FORMS LIGHT VERTICAL AND HORIZONTAL
139 8B U2580 UPPER HALF BLOCK
140 8C U2584 LOWER HALF BLOCK
141 8D U2588 FULL BLOCK
142 8E U258C LEFT HALF BLOCK
143 8F U2590 RIGHT HALF BLOCK
144 90 U2591 LIGHT SHADE
145 91 U2592 MEDIUM SHADE
146 92 U2593 DARK SHADE
147 93 U201C LEFT DOUBLE QUOTATION MARK
148 94 U25A0 BLACK SQUARE
149 95 U2219 BULLET OPERATOR
150 96 U201D RIGHT DOUBLE QUOTATION MARK
151 97 U2014 EM DASH
152 98 U2116 NUMERO SIGN
153 99 U2122 TRADE MARK SIGN
154 9A U00A0 NONBREAKING SPACE
155 9B U00BB RIGHT-POINTING DOUBLE ANGLE QUOTATION
MARK
156 9C U00AE REGISTERED SIGN
157 9D U00AB LEFT-POINTING DOUBLE ANGLE QUOTATION
MARK
158 9E U00B7 MIDDLE DOT
159 9F U00A4 CURRENCY SIGN
160 A0 U2550 FORMS DOUBLE HORIZONTAL
161 A1 U2551 FORMS DOUBLE VERTICAL
162 A2 U2552 FORMS DOWN SINGLE AND RIGHT DOUBLE
163 A3 U0451 CYRILLIC SMALL LETTER IO
164 A4 U0454 CYRILLIC SMALL LETTER UKRAINIAN IE
UKR
165 A5 U2554 FORMS DOUBLE DOWN AND RIGHT
166 A6 U0456 CYRILLIC SMALL LETTER BELORUSSIAN-
UKRAINIAN I UKR
167 A7 U0457 CYRILLIC SMALL LETTER YI (UKRAINIAN)
UKR
168 A8 U2557 FORMS DOUBLE DOWN AND LEFT
Demchenko [Page 3]
I/D
169 A9 U2558 FORMS UP SINGLE AND RIGHT DOUBLE
170 AA U2559 FORMS UP DOUBLE AND RIGHT SINGLE
171 AB U255A FORMS DOUBLE UP AND RIGHT
172 AC U255B FORMS UP SINGLE AND LEFT DOUBLE
173 AD U0491 CYRILLIC SMALL LETTER UKRAINIAN GHE
(WITH UPTURN) UKR
174 AE U045E CYRILLIC SMALL LETTER BELORUSSIAN
SHORT U BYEL
175 AF U255E FORMS VERTICAL SINGLE AND RIGHT
DOUBLE
176 *B0 U255F FORMS VERTICAL DOUBLE AND RIGHT
SINGLE
177 B1 U2560 FORMS DOUBLE VERTICAL AND RIGHT
178 B2 U2561 FORMS VERTICAL SINGLE AND LEFT DOUBLE
179 B3 U0401 CYRILLIC CAPITAL LETTER IO
180 B4 U0403 CYRILLIC CAPITAL LETTER UKRAINIAN
IE
UKR
181 B5 U2563 FORMS DOUBLE VERTICAL AND LEFT
182 B6 U0406 CYRILLIC CAPITAL LETTER BELORUSSIAN-
UKRAINIAN I UKR
183 B7 U0407 CYRILLIC CAPITAL LETTER YI
(UKRAINIAN) UKR
184 B8 U2566 FORMS DOUBLE DOWN AND HORIZONTAL
185 B9 U2567 FORMS UP SINGLE AND HORIZONTAL DOUBLE
186 BA U2568 FORMS UP DOUBLE AND HORIZONTAL SINGLE
187 BB U2569 FORMS DOUBLE UP AND HORIZONTAL
188 BC U256A FORMS VERTICAL SINGLE AND HORIZONTAL
DOUBLE
189 BD U0490 CYRILLIC CAPITAL LETTER UKRAINIAN GHE
(WITH UPTURN) UKR
190 BE U040E CYRILLIC CAPITAL LETTER BELORUSSIAN
SHORT U BYEL
191 BF U00A9 COPYRIGHT SIGN
192 C0 U044E CYRILLIC SMALL LETTER IU
193 C1 U0430 CYRILLIC SMALL LETTER A
194 C2 U0431 CYRILLIC SMALL LETTER BE
195 C3 U0446 CYRILLIC SMALL LETTER TSE
196 C4 U0434 CYRILLIC SMALL LETTER DE
197 C5 U0435 CYRILLIC SMALL LETTER IE
198 C6 U0444 CYRILLIC SMALL LETTER EF
199 C7 U0433 CYRILLIC SMALL LETTER GE
200 C8 U0445 CYRILLIC SMALL LETTER KHA
201 C9 U0438 CYRILLIC SMALL LETTER II
202 CA U0439 CYRILLIC SMALL LETTER SHORT II
203 CB U043A CYRILLIC SMALL LETTER KA
204 CC U043B CYRILLIC SMALL LETTER EL
205 CD U043C CYRILLIC SMALL LETTER EM
206 CE U043D CYRILLIC SMALL LETTER EN
207 CF U043E CYRILLIC SMALL LETTER O
208 D0 U043F CYRILLIC SMALL LETTER PE
209 D1 U044F CYRILLIC SMALL LETTER IA
210 D2 U0440 CYRILLIC SMALL LETTER ER
Demchenko [Page 4]
I/D
211 D3 U0441 CYRILLIC SMALL LETTER ES
212 D4 U0442 CYRILLIC SMALL LETTER TE
213 D5 U0443 CYRILLIC SMALL LETTER U
214 D6 U0436 CYRILLIC SMALL LETTER ZHE
215 D7 U0432 CYRILLIC SMALL LETTER VE
216 D8 U044C CYRILLIC SMALL LETTER SOFT SIGN
217 D9 U044B CYRILLIC SMALL LETTER YERI
218 DA U0437 CYRILLIC SMALL LETTER ZE
219 DB U0448 CYRILLIC SMALL LETTER SHA
220 DC U044D CYRILLIC SMALL LETTER REVERSED E
221 DD U0449 CYRILLIC SMALL LETTER SHCHA
222 DE U0447 CYRILLIC SMALL LETTER CHE
223 DF U044A CYRILLIC SMALL LETTER HARD SIGN
224 E0 U042E CYRILLIC CAPITAL LETTER IU
225 E1 U0410 CYRILLIC CAPITAL LETTER A
226 E2 U0411 CYRILLIC CAPITAL LETTER BE
227 E3 U0426 CYRILLIC CAPITAL LETTER TSE
228 E4 U0414 CYRILLIC CAPITAL LETTER DE
229 E5 U0415 CYRILLIC CAPITAL LETTER IE
230 E6 U0424 CYRILLIC CAPITAL LETTER EF
231 E7 U0413 CYRILLIC CAPITAL LETTER GE
232 E8 U0425 CYRILLIC CAPITAL LETTER KHA
233 E9 U0418 CYRILLIC CAPITAL LETTER II
234 EA U0419 CYRILLIC CAPITAL LETTER SHORT II
235 EB U041A CYRILLIC CAPITAL LETTER KA
236 EC U041B CYRILLIC CAPITAL LETTER EL
237 ED U041C CYRILLIC CAPITAL LETTER EM
238 EE U041D CYRILLIC CAPITAL LETTER EN
239 EF U041E CYRILLIC CAPITAL LETTER O
240 F0 U041F CYRILLIC CAPITAL LETTER PE
241 F1 U042F CYRILLIC CAPITAL LETTER IA
242 F2 U0420 CYRILLIC CAPITAL LETTER ER
243 F3 U0421 CYRILLIC CAPITAL LETTER ES
244 F4 U0422 CYRILLIC CAPITAL LETTER TE
245 F5 U0423 CYRILLIC CAPITAL LETTER U
246 F6 U0416 CYRILLIC CAPITAL LETTER ZHE
247 F7 U0412 CYRILLIC CAPITAL LETTER VE
248 F8 U042C CYRILLIC CAPITAL LETTER SOFT SIGN
249 F9 U042B CYRILLIC CAPITAL LETTER YERI
250 FA U0417 CYRILLIC CAPITAL LETTER ZE
251 FB U0428 CYRILLIC CAPITAL LETTER SHA
252 FC U042D CYRILLIC CAPITAL LETTER REVERSED E
253 FD U0429 CYRILLIC CAPITAL LETTER SHCHA
254 FE U0427 CYRILLIC CAPITAL LETTER CHE
255 FF U042A CYRILLIC CAPITAL LETTER HARD SIGN
Legend
UKR - New included Ukrainian letters
BYEL - New included Byelorusian letters
Demchenko [Page 5]
I/D
APPENDIX A
DIFFERENCE OF KOI8-RU from EXISTING KOI8-R and ISO-IR-111
KOI8-RU is compatible with KOI8-R in all Cyrillic Letters and
completes it with Ukrainian letters UKRAINIAN IE #164, #180,
CYRILLIC SMALL LETTER BELORUSSIAN-UKRAINIAN I #166, #182,
UKRAINIAN YI #167, #183, UKRAINIAN GHE (WITH UPTURN) #173,
#189, BELORUSSIAN SHORT U #174, #190.
Positions #147, #150 - #153, #155-#157, #159 are used for
important characters which are currently missing from ISO-IR-111.
In all other positions FORMS coincide with KOI8-R.
147 93 U201C LEFT DOUBLE QUOTATION MARK
150 96 U201D RIGHT DOUBLE QUOTATION MARK
151 97 U2014 EM DASH
152 98 U2116 NUMERO SIGN
153 99 U2122 TRADE MARK SIGN
155 9B U00BB RIGHT-POINTING DOUBLE ANGLE QUOTATION
MARK
156 9C U00AE REGISTERED SIGN
157 9D U00AB LEFT-POINTING DOUBLE ANGLE QUOTATION
MARK
159 9F U00A4 CURRENCY SIGN
164 A4 U0454 CYRILLIC SMALL LETTER UKRAINIAN
IE
UKR
166 A6 U0456 CYRILLIC SMALL LETTER BELORUSSIAN-
UKRAINIAN I UKR
167 A7 U0457 CYRILLIC SMALL LETTER YI
(UKRAINIAN)
UKR
173 AD U0491 CYRILLIC SMALL LETTER UKRAINIAN GHE
(WITH UPTURN) UKR
174 AE U045E CYRILLIC SMALL LETTER BELORUSSIAN
SHORT U BYEL
180 B4 U0403 CYRILLIC CAPITAL LETTER UKRAINIAN
IE
UKR
182 B6 U0406 CYRILLIC CAPITAL LETTER BELORUSSIAN-
UKRAINIAN I UKR
183 B7 U0407 CYRILLIC CAPITAL LETTER YI
(UKRAINIAN) UKR
189 BD U0490 CYRILLIC CAPITAL LETTER UKRAINIAN GHE
(WITH UPTURN) UKR
190 BE U040E CYRILLIC CAPITAL LETTER BELORUSSIAN
SHORT U BYEL
191 BF U00A9 COPYRIGHT SIGN
Demchenko [Page 6]
I/D
KOI8-RU compatible with ISO-IR-111 in all Russian, Ukrainian and
Belorussian letters but differs in positions of one additional
Ukrainian letter GHE WITH UPTURN, non-specified in ISO-IR-111
positions #128-#159 are used for FORMS elements from KOI8-R and other
important characters which are currently missing from ISO-IR-111 and
KOI8-R.
128 80 U2500 FORMS LIGHT HORIZONTAL
129 81 U2502 FORMS LIGHT VERTICAL
130 82 U250C FORMS LIGHT DOWN AND RIGHT
131 83 U2510 FORMS LIGHT DOWN AND LEFT
132 84 U2514 FORMS LIGHT UP AND RIGHT
133 85 U2518 FORMS LIGHT UP AND LEFT
134 86 U251C FORMS LIGHT VERTICAL AND RIGHT
135 87 U2524 FORMS LIGHT VERTICAL AND LEFT
136 88 U252C FORMS LIGHT DOWN AND HORIZONTAL
137 89 U2534 FORMS LIGHT UP AND HORIZONTAL
138 8A U253C FORMS LIGHT VERTICAL AND HORIZONTAL
139 8B U2580 UPPER HALF BLOCK
140 8C U2584 LOWER HALF BLOCK
141 8D U2588 FULL BLOCK
142 8E U258C LEFT HALF BLOCK
143 8F U2590 RIGHT HALF BLOCK
144 90 U2591 LIGHT SHADE
145 91 U2592 MEDIUM SHADE
146 92 U2593 DARK SHADE
147 93 U201C LEFT DOUBLE QUOTATION MARK
148 94 U25A0 BLACK SQUARE
149 95 U2219 BULLET OPERATOR
150 96 U201D RIGHT DOUBLE QUOTATION MARK
151 97 U2014 EM DASH
152 98 U2116 NUMERO SIGN
153 99 U2122 TRADE MARK SIGN
154 9A U00A0 NONBREAKING SPACE
155 9B U00BB RIGHT-POINTING DOUBLE ANGLE QUOTATION
MARK
156 9C U00AE REGISTERED SIGN
157 9D U00AB LEFT-POINTING DOUBLE ANGLE QUOTATION
MARK
158 9E U00B7 MIDDLE DOT
159 9F U00A4 CURRENCY SIGN
160 A0 U2550 FORMS DOUBLE HORIZONTAL
161 A1 U2551 FORMS DOUBLE VERTICAL
162 A2 U2552 FORMS DOWN SINGLE AND RIGHT DOUBLE
165 A5 U2554 FORMS DOUBLE DOWN AND RIGHT
168 A8 U2557 FORMS DOUBLE DOWN AND LEFT
169 A9 U2558 FORMS UP SINGLE AND RIGHT DOUBLE
170 AA U2559 FORMS UP DOUBLE AND RIGHT SINGLE
Demchenko [Page 7]
I/D
171 AB U255A FORMS DOUBLE UP AND RIGHT
172 AC U255B FORMS UP SINGLE AND LEFT DOUBLE
173 AD U0491 CYRILLIC SMALL LETTER UKRAINIAN GHE
(WITH UPTURN) UKR
175 AF U255E FORMS VERTICAL SINGLE AND RIGHT
DOUBLE
176 B0 U255F FORMS VERTICAL DOUBLE AND RIGHT
SINGLE
177 B1 U2560 FORMS DOUBLE VERTICAL AND RIGHT
178 B2 U2561 FORMS VERTICAL SINGLE AND LEFT DOUBLE
181 B5 U2563 FORMS DOUBLE VERTICAL AND LEFT
184 B8 U2566 FORMS DOUBLE DOWN AND HORIZONTAL
185 B9 U2567 FORMS UP SINGLE AND HORIZONTAL DOUBLE
186 BA U2568 FORMS UP DOUBLE AND HORIZONTAL SINGLE
187 BB U2569 FORMS DOUBLE UP AND HORIZONTAL
188 BC U256A FORMS VERTICAL SINGLE AND HORIZONTAL
DOUBLE
189 BD U0490 CYRILLIC CAPITAL LETTER UKRAINIAN GHE
(WITH UPTURN) UKR
191 BF U00A9 COPYRIGHT SIGN
Security Considerations
Security issues are not discussed in this memo.
References
[1] Chernov, A., "Registration of a Cyrillic Character Set", RFC
1589, Network Working Group, July 1993.
[2] UNICODE 2.0 CHARACTER DATABASE. - ftp://unicode.org/pub/2.0-
Update/UnicodeData-2.0.14.txt
[3] Ukrainian letters in koi8-u and other character sets
ftp://ftp.ua.net/pub/info/encodings/koi8-u/ukr_chars_in_koi8-
u_and_others.txt, June 1995.
[4] ECMA-CYRILLIC. - ftp://dkuug.dk/i18n/charmaps.all/ECMA-
CYRILLIC
Author's Address
Yuri Demchenko
Kiev Polytechnic Institute
Kiev, Ukraine
EMail: demch@cad.ntu-kpi.kiev.ua
INTERNET DRAFT EXPIRES APRIL 1998 INTERNET DRAFT
| PAFTECH AB 2003-2026 | 2026-04-23 15:48:49 |