One document matched: draft-rfced-info-demchenko-00.txt


INTERNET DRAFT		EXPIRES APRIL 1998		INTERNET DRAFT

Network Working Group                                      Yu. Demchenko
INTERNET DRAFT                                                       KPI
Category: Informational                                      August 1997



         Registration of a Ukrainian Cyrillic Character Set KOI8-RU
              (as extention to Russian KOI8-R and ISO-IR-111)
			<draft-rfced-info-demchenko-00.txt>


Status of This Memo

This document is an Internet-Draft.  Internet-Drafts are working
documents of the Internet Engineering Task Force (IETF), its
areas, and its working groups.  Note that other groups may also
distribute working documents as Internet-Drafts.

Internet-Drafts are draft documents valid for a maximum of six
months and may be updated, replaced, or obsoleted by other
documents at any time.  It is inappropriate to use Internet-
Drafts as reference material or to cite them other than as
"work in progress."

To learn the current status of any Internet-Draft, please check
the "1id-abstracts.txt" listing contained in the Internet-
Drafts Shadow Directories on ftp.is.co.za (Africa),
nic.nordu.net (Europe), munnari.oz.au (Pacific Rim),
ds.internic.net (US East Coast), or ftp.isi.edu (US West Coast).

Distribution of this document is unlimited.


1.  Introduction

   This document provides information about widely 
   used in Ukrainian Internet community character set for mail and news
   exchange as well as for presentation WWW information resources in
   Ukrainian language.

   Though the proposed character set "KOI8-RU" is not currently an 
   international standard, there is large Internet user community 
   (including Ukraine and worldwide Ukrainian speaking community) 
   supporting it. 

   "KOI8-RU" is de-facto standard accepted by all Ukrainian community in 
   the Internet and unofficially published at many sites (F.E., 
   ftp://ftp.ua.net/pub/info/encodings/koi8-u/ukr_chars_in_koi8-
   u_and_others.txt; ftp://ftp.gu.kiev.ua/pub/koi8-u/ukr_chars_in_koi8-
   u_and_others.txt; http://cad.ntu-kpi.kiev.ua/multiling/KOI8-U.html). 

   Ukrainian language is the 20th among the world's languages (http:// 
   www.isoc.org:8080/langues/iso639.htm) and supported not only in 
   Ukraine as national state but among Ukrainian communities over the 
   world.

   KOI8-RU should be registered to support and facilitate general and 
   cultural infromation content development and dessimination. Support 
   of Ukrainian language in new software product is restrained by absent 
   of oficially registered and widely published de-facto used Ukrainian 
   charset.

   One of the problem now is that all old codepages ISO-IR-111, ISO 
   8859-5 doesn't include new Ukr. letter GHE (with upturn). Now it's 
   registered in UNICODE 2.0.14 as Cyrillic GHE with upturn (0490 - 
   capital, 0491 - small). It is used in more than 25 ukrainian words 
   and carry in some cases specific national features. 



Demchenko                                                       [Page 1]




I/D


   MIME character set name: koi8-ru

   Published specification: 

   This standard is unpublished but based on several published 
   standards: first of all, RFC1489 (it is fully complaint in all 
   russian letters), ISO 8859-5, ISO-IR-111, UNICODE 2.0.14.

   KOI8-RU is compatible with KOI8-R in all Cyrillic Letters and 
   completes it with four Ukrainian (#164, #180 - ukr. ie, #166, #182 - 
   ukr. i, #167, #183 - ukr. yi, #173, #189 - ukr. ghe with upturn) and 
   one Byelorussian (#174, #190 - byelorussian short u) letters which 
   locations are complaint with ISO-IR-111.

   All FORMS except positions ocupied by Ukrainian and Byelorussian 
   letters and Bullets in positions #148, #149, #158 coincide with KOI8-
   R.

   Positions #147, #150-153, #155-#157, #159 are used for important 
   characters which are currently missing from ISO-IR-111.

   The description of all characters from the upper half of the table is 
   compliance with ISO 10646 (Unicode). All Russian letters places have 
   been left at their original KOI8-R places. Introduced new ukrainian 
   letters ocupy positions where they are used as standard-de-facto in 
   Ukrainian language applications and newsgroups exchange accepted all 
   Ukrainian language community.


Demchenko                                                       [Page 2]




I/D


   <decimal> <hex-code> <Unicode>  <description> 

      128       80      U2500      FORMS LIGHT HORIZONTAL 
      129       81      U2502      FORMS LIGHT VERTICAL 
      130       82      U250C      FORMS LIGHT DOWN AND RIGHT 
      131       83      U2510      FORMS LIGHT DOWN AND LEFT 
      132       84      U2514      FORMS LIGHT UP AND RIGHT 
      133       85      U2518      FORMS LIGHT UP AND LEFT 
      134       86      U251C      FORMS LIGHT VERTICAL AND RIGHT 
      135       87      U2524      FORMS LIGHT VERTICAL AND LEFT 
      136       88      U252C      FORMS LIGHT DOWN AND HORIZONTAL 
      137       89      U2534      FORMS LIGHT UP AND HORIZONTAL 
      138       8A      U253C      FORMS LIGHT VERTICAL AND HORIZONTAL 
      139       8B      U2580      UPPER HALF BLOCK 
      140       8C      U2584      LOWER HALF BLOCK 
      141       8D      U2588      FULL BLOCK 
      142       8E      U258C      LEFT HALF BLOCK 
      143       8F      U2590      RIGHT HALF BLOCK 
      144       90      U2591      LIGHT SHADE 
      145       91      U2592      MEDIUM SHADE 
      146       92      U2593      DARK SHADE 
      147       93      U201C      LEFT DOUBLE QUOTATION MARK
      148       94      U25A0      BLACK SQUARE 
      149       95      U2219      BULLET OPERATOR 
      150       96      U201D      RIGHT DOUBLE QUOTATION MARK
      151       97      U2014      EM DASH
      152       98      U2116      NUMERO SIGN
      153       99      U2122      TRADE MARK SIGN
      154       9A      U00A0      NONBREAKING SPACE
      155       9B      U00BB      RIGHT-POINTING DOUBLE ANGLE QUOTATION 
                                   MARK
      156       9C      U00AE      REGISTERED SIGN
      157       9D      U00AB      LEFT-POINTING DOUBLE ANGLE QUOTATION 
                                   MARK
      158       9E      U00B7      MIDDLE DOT 
      159       9F      U00A4      CURRENCY SIGN 
      160       A0      U2550      FORMS DOUBLE HORIZONTAL 
      161       A1      U2551      FORMS DOUBLE VERTICAL 
      162       A2      U2552      FORMS DOWN SINGLE AND RIGHT DOUBLE
      163       A3      U0451      CYRILLIC SMALL LETTER IO 
      164       A4      U0454      CYRILLIC SMALL LETTER UKRAINIAN IE 
                                                                     UKR
      165       A5      U2554      FORMS DOUBLE DOWN AND RIGHT 
      166       A6      U0456      CYRILLIC SMALL LETTER BELORUSSIAN-
                                   UKRAINIAN I  UKR
      167       A7      U0457      CYRILLIC SMALL LETTER YI (UKRAINIAN) 
                                                                     UKR
      168       A8      U2557      FORMS DOUBLE DOWN AND LEFT 


Demchenko                                                       [Page 3]




I/D

      169       A9      U2558      FORMS UP SINGLE AND RIGHT DOUBLE
      170       AA      U2559      FORMS UP DOUBLE AND RIGHT SINGLE
      171       AB      U255A      FORMS DOUBLE UP AND RIGHT 
      172       AC      U255B      FORMS UP SINGLE AND LEFT DOUBLE
      173       AD      U0491      CYRILLIC SMALL LETTER UKRAINIAN GHE 
                                   (WITH UPTURN)                     UKR
      174       AE      U045E      CYRILLIC SMALL LETTER BELORUSSIAN 
                                   SHORT U                          BYEL
      175       AF      U255E      FORMS VERTICAL SINGLE AND RIGHT 
                                   DOUBLE
      176       *B0      U255F     FORMS VERTICAL DOUBLE AND RIGHT 
                                   SINGLE
      177       B1      U2560      FORMS DOUBLE VERTICAL AND RIGHT
      178       B2      U2561      FORMS VERTICAL SINGLE AND LEFT DOUBLE
      179       B3      U0401      CYRILLIC CAPITAL LETTER IO
      180       B4      U0403      CYRILLIC CAPITAL LETTER UKRAINIAN
IE   
                                   UKR
      181       B5      U2563      FORMS DOUBLE VERTICAL AND LEFT
      182       B6      U0406      CYRILLIC CAPITAL LETTER BELORUSSIAN-
                                   UKRAINIAN I                       UKR
      183       B7      U0407      CYRILLIC CAPITAL LETTER YI 
                                   (UKRAINIAN) UKR
      184       B8      U2566      FORMS DOUBLE DOWN AND HORIZONTAL 
      185       B9      U2567      FORMS UP SINGLE AND HORIZONTAL DOUBLE
      186       BA      U2568      FORMS UP DOUBLE AND HORIZONTAL SINGLE
      187       BB      U2569      FORMS DOUBLE UP AND HORIZONTAL 
      188       BC      U256A      FORMS VERTICAL SINGLE AND HORIZONTAL 
                                   DOUBLE
      189       BD      U0490      CYRILLIC CAPITAL LETTER UKRAINIAN GHE 
                                   (WITH UPTURN)                     UKR
      190       BE      U040E      CYRILLIC CAPITAL LETTER BELORUSSIAN 
                                   SHORT U                          BYEL
      191       BF      U00A9      COPYRIGHT SIGN 
      192       C0      U044E      CYRILLIC SMALL LETTER IU 
      193       C1      U0430      CYRILLIC SMALL LETTER A 
      194       C2      U0431      CYRILLIC SMALL LETTER BE 
      195       C3      U0446      CYRILLIC SMALL LETTER TSE 
      196       C4      U0434      CYRILLIC SMALL LETTER DE 
      197       C5      U0435      CYRILLIC SMALL LETTER IE 
      198       C6      U0444      CYRILLIC SMALL LETTER EF 
      199       C7      U0433      CYRILLIC SMALL LETTER GE  
      200       C8      U0445      CYRILLIC SMALL LETTER KHA 
      201       C9      U0438      CYRILLIC SMALL LETTER II 
      202       CA      U0439      CYRILLIC SMALL LETTER SHORT II 
      203       CB      U043A      CYRILLIC SMALL LETTER KA 
      204       CC      U043B      CYRILLIC SMALL LETTER EL 
      205       CD      U043C      CYRILLIC SMALL LETTER EM 
      206       CE      U043D      CYRILLIC SMALL LETTER EN 
      207       CF      U043E      CYRILLIC SMALL LETTER O 
      208       D0      U043F      CYRILLIC SMALL LETTER PE 
      209       D1      U044F      CYRILLIC SMALL LETTER IA 
      210       D2      U0440      CYRILLIC SMALL LETTER ER 

Demchenko                                                       [Page 4]




I/D

      211       D3      U0441      CYRILLIC SMALL LETTER ES 
      212       D4      U0442      CYRILLIC SMALL LETTER TE 
      213       D5      U0443      CYRILLIC SMALL LETTER U 
      214       D6      U0436      CYRILLIC SMALL LETTER ZHE 
      215       D7      U0432      CYRILLIC SMALL LETTER VE 
      216       D8      U044C      CYRILLIC SMALL LETTER SOFT SIGN 
      217       D9      U044B      CYRILLIC SMALL LETTER YERI 
      218       DA      U0437      CYRILLIC SMALL LETTER ZE 
      219       DB      U0448      CYRILLIC SMALL LETTER SHA 
      220       DC      U044D      CYRILLIC SMALL LETTER REVERSED E 
      221       DD      U0449      CYRILLIC SMALL LETTER SHCHA 
      222       DE      U0447      CYRILLIC SMALL LETTER CHE 
      223       DF      U044A      CYRILLIC SMALL LETTER HARD SIGN 
      224       E0      U042E      CYRILLIC CAPITAL LETTER IU 
      225       E1      U0410      CYRILLIC CAPITAL LETTER A 
      226       E2      U0411      CYRILLIC CAPITAL LETTER BE 
      227       E3      U0426      CYRILLIC CAPITAL LETTER TSE 
      228       E4      U0414      CYRILLIC CAPITAL LETTER DE 
      229       E5      U0415      CYRILLIC CAPITAL LETTER IE 
      230       E6      U0424      CYRILLIC CAPITAL LETTER EF 
      231       E7      U0413      CYRILLIC CAPITAL LETTER GE  
      232       E8      U0425      CYRILLIC CAPITAL LETTER KHA 
      233       E9      U0418      CYRILLIC CAPITAL LETTER II 
      234       EA      U0419      CYRILLIC CAPITAL LETTER SHORT II 
      235       EB      U041A      CYRILLIC CAPITAL LETTER KA 
      236       EC      U041B      CYRILLIC CAPITAL LETTER EL 
      237       ED      U041C      CYRILLIC CAPITAL LETTER EM 
      238       EE      U041D      CYRILLIC CAPITAL LETTER EN 
      239       EF      U041E      CYRILLIC CAPITAL LETTER O 
      240       F0      U041F      CYRILLIC CAPITAL LETTER PE 
      241       F1      U042F      CYRILLIC CAPITAL LETTER IA 
      242       F2      U0420      CYRILLIC CAPITAL LETTER ER 
      243       F3      U0421      CYRILLIC CAPITAL LETTER ES 
      244       F4      U0422      CYRILLIC CAPITAL LETTER TE 
      245       F5      U0423      CYRILLIC CAPITAL LETTER U 
      246       F6      U0416      CYRILLIC CAPITAL LETTER ZHE 
      247       F7      U0412      CYRILLIC CAPITAL LETTER VE 
      248       F8      U042C      CYRILLIC CAPITAL LETTER SOFT SIGN 
      249       F9      U042B      CYRILLIC CAPITAL LETTER YERI 
      250       FA      U0417      CYRILLIC CAPITAL LETTER ZE 
      251       FB      U0428      CYRILLIC CAPITAL LETTER SHA 
      252       FC      U042D      CYRILLIC CAPITAL LETTER REVERSED E 
      253       FD      U0429      CYRILLIC CAPITAL LETTER SHCHA 
      254       FE      U0427      CYRILLIC CAPITAL LETTER CHE 
      255       FF      U042A      CYRILLIC CAPITAL LETTER HARD SIGN 


Legend 

UKR - New included Ukrainian letters 

BYEL - New included Byelorusian letters 


Demchenko                                                       [Page 5]




I/D


   APPENDIX A

   DIFFERENCE OF KOI8-RU from EXISTING KOI8-R and ISO-IR-111 


   KOI8-RU is compatible with KOI8-R in all Cyrillic Letters and 
   completes it with Ukrainian letters UKRAINIAN IE #164, #180, 
   CYRILLIC SMALL LETTER BELORUSSIAN-UKRAINIAN I #166, #182, 
   UKRAINIAN YI #167, #183, UKRAINIAN GHE (WITH UPTURN) #173, 
   #189, BELORUSSIAN SHORT U #174, #190.

   Positions #147, #150 - #153, #155-#157, #159 are used for 
   important characters which are currently missing from ISO-IR-111.
   In all other positions FORMS  coincide with KOI8-R.


      147       93      U201C      LEFT DOUBLE QUOTATION MARK
      150       96      U201D      RIGHT DOUBLE QUOTATION MARK
      151       97      U2014      EM DASH
      152       98      U2116      NUMERO SIGN
      153       99      U2122      TRADE MARK SIGN
      155       9B      U00BB      RIGHT-POINTING DOUBLE ANGLE QUOTATION 
                                   MARK
      156       9C      U00AE      REGISTERED SIGN
      157       9D      U00AB      LEFT-POINTING DOUBLE ANGLE QUOTATION 
                                   MARK
      159       9F      U00A4      CURRENCY SIGN 
      164       A4      U0454      CYRILLIC SMALL LETTER UKRAINIAN
IE     
                                   UKR
      166       A6      U0456      CYRILLIC SMALL LETTER BELORUSSIAN-
                                   UKRAINIAN I  UKR
      167       A7      U0457      CYRILLIC SMALL LETTER YI
(UKRAINIAN)   
                                   UKR
      173       AD      U0491      CYRILLIC SMALL LETTER UKRAINIAN GHE 
                                   (WITH UPTURN)         UKR
      174       AE      U045E      CYRILLIC SMALL LETTER BELORUSSIAN 
                                   SHORT U                          BYEL
      180       B4      U0403      CYRILLIC CAPITAL LETTER UKRAINIAN
IE   
                                   UKR
      182       B6      U0406      CYRILLIC CAPITAL LETTER BELORUSSIAN-
                                   UKRAINIAN I                       UKR
      183       B7      U0407      CYRILLIC CAPITAL LETTER YI 
                                   (UKRAINIAN)                       UKR
      189       BD      U0490      CYRILLIC CAPITAL LETTER UKRAINIAN GHE 
                                   (WITH UPTURN)                     UKR
      190       BE      U040E      CYRILLIC CAPITAL LETTER BELORUSSIAN 
                                   SHORT U                          BYEL
      191       BF      U00A9      COPYRIGHT SIGN 


Demchenko                                                       [Page 6]




I/D


   KOI8-RU compatible with ISO-IR-111 in all Russian, Ukrainian and 
   Belorussian letters but differs in positions of one additional 
   Ukrainian letter GHE WITH UPTURN, non-specified in ISO-IR-111 
   positions #128-#159 are used for FORMS elements from KOI8-R and other 
   important characters which are currently missing from ISO-IR-111 and 
   KOI8-R.

      128       80      U2500      FORMS LIGHT HORIZONTAL 
      129       81      U2502      FORMS LIGHT VERTICAL 
      130       82      U250C      FORMS LIGHT DOWN AND RIGHT 
      131       83      U2510      FORMS LIGHT DOWN AND LEFT 
      132       84      U2514      FORMS LIGHT UP AND RIGHT 
      133       85      U2518      FORMS LIGHT UP AND LEFT 
      134       86      U251C      FORMS LIGHT VERTICAL AND RIGHT 
      135       87      U2524      FORMS LIGHT VERTICAL AND LEFT 
      136       88      U252C      FORMS LIGHT DOWN AND HORIZONTAL 
      137       89      U2534      FORMS LIGHT UP AND HORIZONTAL 
      138       8A      U253C      FORMS LIGHT VERTICAL AND HORIZONTAL 
      139       8B      U2580      UPPER HALF BLOCK 
      140       8C      U2584      LOWER HALF BLOCK 
      141       8D      U2588      FULL BLOCK 
      142       8E      U258C      LEFT HALF BLOCK 
      143       8F      U2590      RIGHT HALF BLOCK 
      144       90      U2591      LIGHT SHADE 
      145       91      U2592      MEDIUM SHADE 
      146       92      U2593      DARK SHADE 
      147       93      U201C      LEFT DOUBLE QUOTATION MARK
      148       94      U25A0      BLACK SQUARE 
      149       95      U2219      BULLET OPERATOR 
      150       96      U201D      RIGHT DOUBLE QUOTATION MARK
      151       97      U2014      EM DASH
      152       98      U2116      NUMERO SIGN
      153       99      U2122      TRADE MARK SIGN
      154       9A      U00A0      NONBREAKING SPACE
      155       9B      U00BB      RIGHT-POINTING DOUBLE ANGLE QUOTATION 
                                   MARK
      156       9C      U00AE      REGISTERED SIGN
      157       9D      U00AB      LEFT-POINTING DOUBLE ANGLE QUOTATION 
                                   MARK
      158       9E      U00B7      MIDDLE DOT 
      159       9F      U00A4      CURRENCY SIGN 
      160       A0      U2550      FORMS DOUBLE HORIZONTAL 
      161       A1      U2551      FORMS DOUBLE VERTICAL 
      162       A2      U2552      FORMS DOWN SINGLE AND RIGHT DOUBLE
      165       A5      U2554      FORMS DOUBLE DOWN AND RIGHT 
      168       A8      U2557      FORMS DOUBLE DOWN AND LEFT 
      169       A9      U2558      FORMS UP SINGLE AND RIGHT DOUBLE
      170       AA      U2559      FORMS UP DOUBLE AND RIGHT SINGLE


Demchenko                                                      [Page 7]




I/D


      171       AB      U255A      FORMS DOUBLE UP AND RIGHT 
      172       AC      U255B      FORMS UP SINGLE AND LEFT DOUBLE
      173       AD      U0491      CYRILLIC SMALL LETTER UKRAINIAN GHE 
                                   (WITH UPTURN)                    UKR
      175       AF      U255E      FORMS VERTICAL SINGLE AND RIGHT 
                                   DOUBLE
      176       B0      U255F      FORMS VERTICAL DOUBLE AND RIGHT 
                                   SINGLE
      177       B1      U2560      FORMS DOUBLE VERTICAL AND RIGHT
      178       B2      U2561      FORMS VERTICAL SINGLE AND LEFT DOUBLE
      181       B5      U2563      FORMS DOUBLE VERTICAL AND LEFT
      184       B8      U2566      FORMS DOUBLE DOWN AND HORIZONTAL 
      185       B9      U2567      FORMS UP SINGLE AND HORIZONTAL DOUBLE
      186       BA      U2568      FORMS UP DOUBLE AND HORIZONTAL SINGLE
      187       BB      U2569      FORMS DOUBLE UP AND HORIZONTAL 
      188       BC      U256A      FORMS VERTICAL SINGLE AND HORIZONTAL 
                                   DOUBLE
      189       BD      U0490      CYRILLIC CAPITAL LETTER UKRAINIAN GHE 
                                   (WITH UPTURN)                     UKR
      191       BF      U00A9      COPYRIGHT SIGN 


   Security Considerations

   Security issues are not discussed in this memo.

   References

     [1]  Chernov, A., "Registration of a Cyrillic Character Set", RFC 
          1589, Network Working Group, July 1993.
     [2]  UNICODE 2.0 CHARACTER DATABASE. - ftp://unicode.org/pub/2.0-
          Update/UnicodeData-2.0.14.txt
     [3]  Ukrainian letters in koi8-u and other character sets 
          ftp://ftp.ua.net/pub/info/encodings/koi8-u/ukr_chars_in_koi8-
          u_and_others.txt, June 1995.
     [4]  ECMA-CYRILLIC. - ftp://dkuug.dk/i18n/charmaps.all/ECMA-
          CYRILLIC


   Author's Address

   Yuri Demchenko
   Kiev Polytechnic Institute
   Kiev, Ukraine

   EMail: demch@cad.ntu-kpi.kiev.ua

INTERNET DRAFT		EXPIRES APRIL 1998		INTERNET DRAFT


PAFTECH AB 2003-20262026-04-23 15:48:49