Commit Graph

61 Commits

Author SHA1 Message Date
tnozaki
17e6581dbd add encoding alias CP949 -> UHC. 2006-10-19 14:47:58 +00:00
tnozaki
cf5d987754 add new iconvdata ATARIST, HP-ROMAN8, NEXTSTEP. 2006-10-19 14:45:02 +00:00
tnozaki
503c123619 switch the conversion table from rfc1345 to unicode.org's one. 2006-10-19 14:06:14 +00:00
tnozaki
2e2fc44e22 add new iconv module libJOHAB,
this supports S.Korean character encoding scheme ``JOHAB''.
2006-10-18 17:54:54 +00:00
tnozaki
138067b448 - rename KOI8-BASE to a more suitable name, GOST19768-74.
- fix wrong conversion table KOI7 and KOI7-switched.
2006-09-16 08:38:20 +00:00
tnozaki
3acfcd1374 add iconv GB18030 <-> Unicode BMP conversion table. 2006-09-09 15:43:04 +00:00
tnozaki
c43c0703e2 * updated 0xFE50-0xFEA0 mapping to Unicode 3.0 or later.
* added fallback mapping of 0xA989-0xA995 <-> U+E7E7-U+E7F3.
* some TTFs - NSimSun (Microsoft GB18030 Support Package) and
  FZSongTi (Solaris GB18030 locale) - assign Vertical forms (U+FE10-U+FE19)
  to UDA/PUA (0x2659-0x2673/U+E78D-U+E796), so add a comment about this.
2006-08-29 18:52:29 +00:00
tnozaki
2af744a1fb filled 0x577A-0x577E hole with PUA. 2006-08-25 15:27:41 +00:00
tnozaki
4ba9b09748 remove *.mps entry that doesn't exist. 2006-08-19 11:13:18 +00:00
tnozaki
2ad099c737 1. added iconv support of Chinese Simplified encoding, GBK.
2. added Unicode mapping of 2byte GB18030 User Defined Area.
2006-08-19 10:58:40 +00:00
tnozaki
41efa2e2cd 1. added CNS11643 plane 3 <-> UCS iconv data.
2. zh_TW.eucTW locale now supports CNS11643 plane 3 ~ 7
2006-07-16 06:13:29 +00:00
tnozaki
3ce59d0949 ISO-8859-7:1987 -> ISO-8859-7:2003 2006-07-12 16:56:35 +00:00
tnozaki
d1116a2478 fix wrong alias, s/iso646-gb/iso646-cn/ is correct. 2006-07-05 15:31:50 +00:00
tnozaki
2a35d342e7 added fallback U+2015 -> 1-1-29. 2006-07-05 15:27:53 +00:00
tnozaki
d9cfca81ba fix typo. 2006-06-22 16:18:45 +00:00
tnozaki
e476537d55 correct the code point incompatible with Big5. 2006-06-22 16:11:55 +00:00
tnozaki
2bcfe3b4c8 added Chinese Traditional Big5 family,
Big5-2003, Big5-ETen, Big5-IBM, Big-5E, Big-5+.

``Big5'' is now the alias of Big5-ETen,
if you want Unicode.org's obsolete mappings, use Big5-IBM instead.
2006-06-19 17:28:24 +00:00
tnozaki
57b95db2c8 add iconv support for CP1131(Belarus DOS code). 2006-06-16 20:09:23 +00:00
tnozaki
a302e2a921 added iconv support for georgian-ps, georgian-academy. 2006-06-15 15:34:20 +00:00
tnozaki
f52a3ac4b6 1-29(0x213D) is not HORIZONTAL BAR(U+2015) but EM DASH(U+2014). 2006-06-07 17:04:37 +00:00
tnozaki
ffcda0dc45 fix wrong token, s/DST_ILSEQ/DST_INVALID/. 2006-04-08 15:47:39 +00:00
tnozaki
451ff41269 fix typo, s/ISO-5427/KOI7/ 2006-04-08 14:26:45 +00:00
tnozaki
a3e63d71ca add the following encoding support to iconv:
	* KOI-7(iso-ir-37, iso-5427)
	* KOI-8(old KOI8)
	* KOI7-switched(iso-2022 variant)
	* KOI8-C -- Pablo Saratxaga's  X_LOCALE KOI8-C(Caucasus)
	* KOI8-CyrillicOld -- Serge Winitzki's KOI8-C(Cyrillic-Old)
	  ** not exported yet **
	* KOI8-E(iso-ir-111, ecma-cyrillic)
	* KOI8-RU
	* KOI8-T
2006-04-08 14:15:48 +00:00
tnozaki
b37120b682 add iconv support for Armenian State Standard encoding:
ARMSCII-7, ARMSCII-8, ARMSCII-8A
2006-04-01 04:13:21 +00:00
tnozaki
307ce80709 add kk_KZ.PT154 locale and iconv support for PTCP154. 2006-03-28 14:44:00 +00:00
tnozaki
e01718f7f1 do not use LS0/LS1 but SI/SO. 2006-03-17 16:37:10 +00:00
tnozaki
c0f03c8796 suppress annoying warning messages.
noticed by hannken AT netbsd DOT org, thanks.
2006-03-16 14:28:19 +00:00
tnozaki
274950e65d added macintosh encodings(single-byte only). 2006-03-13 19:45:35 +00:00
tnozaki
71d1b0a7b1 CP936/GBK mapping table incomplete. 2006-02-15 19:33:06 +00:00
tnozaki
30d806de60 remove codepoint overlaps with KS5601. 2006-02-11 17:31:43 +00:00
tnozaki
254e94eef6 remove codepoint overlaps with KS5601.
see src/share/i18n/esdb/CP/CP949.src
2006-02-11 16:26:46 +00:00
tnozaki
0f4c8d531d Fix lib/32749: iconv(3): CP949 mapping table is incomplete.
reported by Sergey Svishchev, thanks.
2006-02-11 14:49:50 +00:00
tshiozak
87a9a60239 add missing correspondences between CP932 and UCS.
This is reported by Dave Huang on lib/25795,
MORIYAMA Masayuki <msyk _at_ mtg.biglobe.ne.jp> and
"NARUSE, Yui" <naruse _at_ airemix.com>, and fixed by MORIYAMA-san.
2005-10-18 06:49:28 +00:00
wiz
2198bef95e Add unicode, unicodelittle, and unicodebig aliases. Suggested by tshiozak@. 2005-10-16 19:49:16 +00:00
tshiozak
3095745573 change OOB_MODE to INVALID; this mapper is only used to be layered on
other mappers, thus ILSEQ is irrelevant.
2005-09-08 09:40:20 +00:00
tshiozak
3d40ace0a6 attach tables to CP932<->UCS mappers for NEC-defined VDC zone. 2005-09-07 07:17:59 +00:00
tnozaki
a3b248100e add csmapper:CNS11643-1,2 and esdb:ISO-2022-CN,
integrate esdb:EUC-TW, locale:zh_TW.eucTW.
2005-03-27 22:30:05 +00:00
christos
dd554c952a PR/29658: Joerg Sonnenberger: incorrect DST_UNIT_BITS and DST_INVALID in
character set maps:

A number of Code Pages specify an 8bit character mapping, but use
16bit runes, making them inaccessible for the actual conversion.
Similarly, some Code Pages specify an invalid rune, e.g. 0x00,
which is actually used in the mapping table and makes that rune
mapping invalid.
2005-03-11 23:38:02 +00:00
tshiozak
16758dd815 remove harmful 0x0000 to non-zero mappings.
reported by Joerg Sonnenberger on tech-userlevel@.
2005-03-09 11:31:40 +00:00
tnozaki
fe05f588fb add new citrus iconv module UTF-7.
thanks for the advice, yamt-san.
2005-03-05 18:05:14 +00:00
jdolecek
b6df8b1e96 add windows-12?? aliases for CP12?? 2005-02-25 22:00:50 +00:00
tnozaki
af5bc0e0b7 add iso-2022-kr support to iconv. 2005-02-10 18:33:00 +00:00
tnozaki
a966c55826 fix unicode 0xFF80 page mapping, lib/28325.
OK'ed by tshiozak AT netbsd DOT org.
2004-11-16 18:36:15 +00:00
tshiozak
94244ec3de add missing entry for CP866. (lib/27060) 2004-09-28 06:51:37 +00:00
lukem
130ab7336b Rework how MAKEVERBOSE operates:
*	Don't bother prefixing commands with a line of  ${_MKCMD}\
	and instead rely upon "make -s".  This is less intrusive on
	all the Makefiles than the former.  Idea from David Laight.

*	Rename the variables used to print messages.  The scheme now is:
	    _MKMSG_FOO		Run  _MKMSG 'foo'
	    _MKTARGET_FOO	Run  _MKMSG_FOO ${.TARGET}
	From discussion with Alistair Crooks.
2003-10-21 10:01:19 +00:00
itojun
566d91a6d8 add GB2312 after JIS0208/0213, then there's no confusion.
some chinese chars may get converted into JIS0208/0213 on
"iconv -f utf-8 -t iso-2022-jp-2".  but anyway, it's iso-2022-*jp*-2 so
japanese chars should take precedence.
XXX what should we do about X11 ctext?

i really hate unicode han unification.
do we really need to support language tags?
2003-10-20 08:00:33 +00:00
itojun
4f2eafbac7 add iso-2022-jp-2.
XXX somehow adding GB2312 causes iconv to mishandle jis x0208.
2003-10-20 01:26:44 +00:00
lukem
87700a22c8 Implement MAKEVERBOSE.
Rework a bunch of make .for loops into sh for loops.

(XXX: This stuff could be implemented in a more efficient manner that
does a lot less of  SOMEMAKEVAR!=some shell command  , etc etc)
2003-10-19 14:08:50 +00:00
tshiozak
8bf28f7014 update conversion tables for KSC5601(KSX1001) <-> UCS.
reported by bjy.
2003-09-26 17:58:42 +00:00
tshiozak
ad49f181fd To ensure the mapping is identical, change the Unicode codepoint for the reverse
solidus on JIS X 0208.
2003-08-18 15:28:25 +00:00