Commit Graph

56 Commits

Author SHA1 Message Date
tnozaki 3acfcd1374 add iconv GB18030 <-> Unicode BMP conversion table. 2006-09-09 15:43:04 +00:00
tnozaki c43c0703e2 * updated 0xFE50-0xFEA0 mapping to Unicode3.0 later.
* added fallback mapping of 0xA989-0xA995 <-> U+E7E7-U+E7F3.
* some TTF - NSimSun(Microsoft GB18030 Support Package) and
  FZSongTi(Solaris GB18030 locale) - assign Vertical form(U+FE10-U+FE19)
  to UDA/PUA(0x2659-0x2673/U+E78D-U+E796), so comment about this.
2006-08-29 18:52:29 +00:00
tnozaki 2af744a1fb filled 0x577A-0x577E hole with PUA. 2006-08-25 15:27:41 +00:00
tnozaki 4ba9b09748 remove *.mps entry that doesn't exists. 2006-08-19 11:13:18 +00:00
tnozaki 2ad099c737 1. added iconv support of Chinese Simplefied encoding, GBK.
2. added Unicode mapping of 2byte GB18030 User Defined Area.
2006-08-19 10:58:40 +00:00
tnozaki 41efa2e2cd 1. added CNS11643 plane 3 <-> UCS iconv data.
2. zh_TW.eucTW locale now supports CNS11643 plane 3 ~ 7
2006-07-16 06:13:29 +00:00
tnozaki 3ce59d0949 ISO-8859-7:1987 -> ISO-8859-7:2003 2006-07-12 16:56:35 +00:00
tnozaki d1116a2478 fix wrong alias, s/iso646-gb/iso646-cn/ is correct. 2006-07-05 15:31:50 +00:00
tnozaki 2a35d342e7 added fallback U+2015 -> 1-1-29. 2006-07-05 15:27:53 +00:00
tnozaki d9cfca81ba fix typo. 2006-06-22 16:18:45 +00:00
tnozaki e476537d55 correct the code point incompatible with Big5. 2006-06-22 16:11:55 +00:00
tnozaki 2bcfe3b4c8 added Chinese Tradisional Big5 family,
Big5-2003, Big5-ETen, Big5-IBM, Big-5E, Big-5+.

``Big5 is now the alias of Big5-ETen,
if you want Unicode.org's obsolete mappings, use Big5-IBM instead.
2006-06-19 17:28:24 +00:00
tnozaki 57b95db2c8 add iconv support for CP1131(Belarus DOS code). 2006-06-16 20:09:23 +00:00
tnozaki a302e2a921 added iconv support for georgian-ps, georgian-academy. 2006-06-15 15:34:20 +00:00
tnozaki f52a3ac4b6 1-29(0x213D) is not HORIZONTAL BAR(U+2015) but EM DASH(U+2014). 2006-06-07 17:04:37 +00:00
tnozaki ffcda0dc45 fix wrong token, s/DST_ILSEQ/DST_INVALID/. 2006-04-08 15:47:39 +00:00
tnozaki 451ff41269 fix typo, s/ISO-5427/KOI7/ 2006-04-08 14:26:45 +00:00
tnozaki a3e63d71ca add following encoding support to iconv:
* KOI-7(iso-ir-37, iso-5427)
	* KOI-8(old KOI8)
	* KOI7-switched(iso-2022 variant)
	* KOI8-C -- Pablo Saratxaga's  X_LOCALE KOI8-C(Caucasus)
	* KOI8-CyrillicOld -- Serge Winitzki's KOI8-C(Cyrillic-Old)
	  ** not exported yet **
	* KOI8-E(iso-ir-111, ecma-cyrillic)
	* KOI8-RU
	* KOI8-T
2006-04-08 14:15:48 +00:00
tnozaki b37120b682 add iconv support for Armenian State Standard encoding:
ARMSCII-7, ARMSCII-8, ARMSCII-8A
2006-04-01 04:13:21 +00:00
tnozaki 307ce80709 add kk_KZ.PT154 locale and iconv support for PTCP154. 2006-03-28 14:44:00 +00:00
tnozaki e01718f7f1 do not use LS0/LS1 but SI/SO. 2006-03-17 16:37:10 +00:00
tnozaki c0f03c8796 suppress annoying warning messages.
noticed by hannken AT netbsd DOT org, thanks.
2006-03-16 14:28:19 +00:00
tnozaki 274950e65d added macintosh encodings(single-byte only). 2006-03-13 19:45:35 +00:00
tnozaki 71d1b0a7b1 CP936/GBK mapping table incomplete. 2006-02-15 19:33:06 +00:00
tnozaki 30d806de60 remove codepoint overlaps with KS5601. 2006-02-11 17:31:43 +00:00
tnozaki 254e94eef6 remove codepoint overlaps KS5601.
see src/share/i18n/esdb/CP/CP949.src
2006-02-11 16:26:46 +00:00
tnozaki 0f4c8d531d Fix lib/32749: iconv(3): CP949 mapping table is incomplete.
reported by Sergey Svishchev, thanks.
2006-02-11 14:49:50 +00:00
tshiozak 87a9a60239 add lacked correspondences between CP932 and UCS.
This is reported by Dave Huang on lib/25795,
MORIYAMA Masayuki <msyk _at_ mtg.biglobe.ne.jp> and
"NARUSE, Yui" <naruse _at_ airemix.com>, and fixed by MORIYAMA-san.
2005-10-18 06:49:28 +00:00
wiz 2198bef95e Add unicode, unicodelittle, and unicodebig aliases. Suggested by tshiozak@. 2005-10-16 19:49:16 +00:00
tshiozak 3095745573 change OOB_MODE to INVALID; this mapper is only used to be layered on
other mappers, thus ILSEQ is irrelevance.
2005-09-08 09:40:20 +00:00
tshiozak 3d40ace0a6 attach tables to CP932<->UCS mappers for NEC-defined VDC zone. 2005-09-07 07:17:59 +00:00
tnozaki a3b248100e add csmapper:CNS11643-1,2 and esdb:ISO-2022-CN,
integrate esdb:EUC-TW, locale:zh_TW.eucTW.
2005-03-27 22:30:05 +00:00
christos dd554c952a PR/29658: Joerg Sonnenberger: incorrect DST_UNIT_BITS and DST_INVALID in
character set maps:

A number of Code Pages specify a 8bit character mapping, but use
16bit runes, ma king them inaccessible for the actual conversion.
Similiary do some Code Pages specify an invalid rune e.g. of 0x00,
which is actually used in the mapping table and makes that rune
mapping invalid.
2005-03-11 23:38:02 +00:00
tshiozak 16758dd815 remove harmful 0x0000 to non-zero mappings.
reported by Joerg Sonnenberger on tech-userlevel@.
2005-03-09 11:31:40 +00:00
tnozaki fe05f588fb add new citrus iconv module UTF-7.
thanks advice, yamt-san.
2005-03-05 18:05:14 +00:00
jdolecek b6df8b1e96 add windows-12?? aliases for CP12?? 2005-02-25 22:00:50 +00:00
tnozaki af5bc0e0b7 add iso-2022-kr support to iconv. 2005-02-10 18:33:00 +00:00
tnozaki a966c55826 fix unicode 0xFF80 page mapping, lib/28325.
OK'ed by tshiozak AT netbsd DOT org.
2004-11-16 18:36:15 +00:00
tshiozak 94244ec3de add missed entry for CP866. (lib/27060) 2004-09-28 06:51:37 +00:00
lukem 130ab7336b Rework how MAKEVERBOSE operates:
*	Don't bother prefixing commands with a line of  ${_MKCMD}\
	and instead rely upon "make -s".  This is less intrusive on
	all the Makefiles than the former.  Idea from David Laight.

    *	Rename the variables use to print messages.  The scheme now is:
	    _MKMSG_FOO		Run  _MKMSG 'foo'
	    _MKTARGET_FOO	Run  _MKMSG_FOO ${.TARGET}
	From discussion with Alistair Crooks.
2003-10-21 10:01:19 +00:00
itojun 566d91a6d8 add GB2312 after JIS0208/0213, then there's no confusion.
some chinese chars may get converted into JIS0208/0213 on
"iconv -f utf-8 -t iso-2022-jp-2".  but anyway, it's iso-2022-*jp*-2 so
japanese chars should take precedence.
XXX what should we do about X11 ctext?

i really hate unicode han unification.
do we really need to support language tags?
2003-10-20 08:00:33 +00:00
itojun 4f2eafbac7 add iso-2022-jp-2.
XXX somehow adding GB2312 causes iconv to mishandle jis x0208.
2003-10-20 01:26:44 +00:00
lukem 87700a22c8 Implement MAKEVERBOSE.
Rework a bunch of make .for loops into sh for loops.

(XXX: This stuff could be implemented in a more efficient manner that
does a lot less of  SOMEMAKEVAR!=some shell command  , etc etc)
2003-10-19 14:08:50 +00:00
tshiozak 8bf28f7014 update conversion tables for KSC5601(KSX1001) <-> UCS.
reported by bjy.
2003-09-26 17:58:42 +00:00
tshiozak ad49f181fd To ensure the mapping identical, change the unicode codepoint for the reverse
solidus on JIS X 0208.
2003-08-18 15:28:25 +00:00
tshiozak 6c85e3334a add KOI8-U encoding support for iconv. 2003-07-20 19:26:55 +00:00
tshiozak 173e5850f7 - add converter for EUC-JP-MS.
- apply MS conversion rule to JISX0208 part of CP932.
c.f.:
  http://www.opengroup.or.jp/jvc/cde/ucs-conv-e.html
2003-07-19 20:20:39 +00:00
itojun 128a9bd8d5 add X11 CTEXT encoding
XXX add more charsets
2003-07-16 08:25:17 +00:00
tshiozak a08c1c4533 use GETA (Japanese typographical "lacked piece of type") for INVALID. 2003-07-16 06:57:34 +00:00
tshiozak 809243d0fb add aliases; ucs-{2,4}* -> utf-{16,32}*.
this hopefully fixes lib/22142.
2003-07-16 05:10:29 +00:00