Commit Graph

113 Commits

Author SHA1 Message Date
tnozaki fe9a7f2aa1 0x2A7E is safe character. 2006-10-26 15:24:00 +00:00
tnozaki f0e9414b43 added supports for chinese encoding ISO-2022-CN-EXT. 2006-10-26 15:06:17 +00:00
tnozaki 17e6581dbd add encoding alias CP949 -> UHC. 2006-10-19 14:47:58 +00:00
tnozaki cf5d987754 add new iconvdata ATARIST, HP-ROMAN8, NEXTSTEP. 2006-10-19 14:45:02 +00:00
tnozaki 503c123619 switch the conversion table from rfc1345 to unicode.org's one. 2006-10-19 14:06:14 +00:00
tnozaki 2e2fc44e22 add new iconv module libJOHAB,
this supports S.Korean character encoding scheme ``JOHAB''.
2006-10-18 17:54:54 +00:00
tnozaki 138067b448 - rename KOI8-BASE more suitable name, GOST19768-74.
- fix wrong conversion table KOI7 and KOI7-switched.
2006-09-16 08:38:20 +00:00
tnozaki 3acfcd1374 add iconv GB18030 <-> Unicode BMP conversion table. 2006-09-09 15:43:04 +00:00
tnozaki c43c0703e2 * updated 0xFE50-0xFEA0 mapping to Unicode3.0 later.
* added fallback mapping of 0xA989-0xA995 <-> U+E7E7-U+E7F3.
* some TTF - NSimSun(Microsoft GB18030 Support Package) and
  FZSongTi(Solaris GB18030 locale) - assign Vertical form(U+FE10-U+FE19)
  to UDA/PUA(0x2659-0x2673/U+E78D-U+E796), so comment about this.
2006-08-29 18:52:29 +00:00
tnozaki 2af744a1fb filled 0x577A-0x577E hole with PUA. 2006-08-25 15:27:41 +00:00
tnozaki 4ba9b09748 remove *.mps entry that doesn't exists. 2006-08-19 11:13:18 +00:00
tnozaki 2ad099c737 1. added iconv support of Chinese Simplefied encoding, GBK.
2. added Unicode mapping of 2byte GB18030 User Defined Area.
2006-08-19 10:58:40 +00:00
tnozaki 41efa2e2cd 1. added CNS11643 plane 3 <-> UCS iconv data.
2. zh_TW.eucTW locale now supports CNS11643 plane 3 ~ 7
2006-07-16 06:13:29 +00:00
tnozaki 3ce59d0949 ISO-8859-7:1987 -> ISO-8859-7:2003 2006-07-12 16:56:35 +00:00
tnozaki d1116a2478 fix wrong alias, s/iso646-gb/iso646-cn/ is correct. 2006-07-05 15:31:50 +00:00
tnozaki 2a35d342e7 added fallback U+2015 -> 1-1-29. 2006-07-05 15:27:53 +00:00
tnozaki d9cfca81ba fix typo. 2006-06-22 16:18:45 +00:00
tnozaki e476537d55 correct the code point incompatible with Big5. 2006-06-22 16:11:55 +00:00
tnozaki 2bcfe3b4c8 added Chinese Tradisional Big5 family,
Big5-2003, Big5-ETen, Big5-IBM, Big-5E, Big-5+.

``Big5 is now the alias of Big5-ETen,
if you want Unicode.org's obsolete mappings, use Big5-IBM instead.
2006-06-19 17:28:24 +00:00
tnozaki 57b95db2c8 add iconv support for CP1131(Belarus DOS code). 2006-06-16 20:09:23 +00:00
tnozaki a302e2a921 added iconv support for georgian-ps, georgian-academy. 2006-06-15 15:34:20 +00:00
tnozaki f52a3ac4b6 1-29(0x213D) is not HORIZONTAL BAR(U+2015) but EM DASH(U+2014). 2006-06-07 17:04:37 +00:00
tnozaki ffcda0dc45 fix wrong token, s/DST_ILSEQ/DST_INVALID/. 2006-04-08 15:47:39 +00:00
tnozaki 451ff41269 fix typo, s/ISO-5427/KOI7/ 2006-04-08 14:26:45 +00:00
tnozaki a3e63d71ca add following encoding support to iconv:
* KOI-7(iso-ir-37, iso-5427)
	* KOI-8(old KOI8)
	* KOI7-switched(iso-2022 variant)
	* KOI8-C -- Pablo Saratxaga's  X_LOCALE KOI8-C(Caucasus)
	* KOI8-CyrillicOld -- Serge Winitzki's KOI8-C(Cyrillic-Old)
	  ** not exported yet **
	* KOI8-E(iso-ir-111, ecma-cyrillic)
	* KOI8-RU
	* KOI8-T
2006-04-08 14:15:48 +00:00
tnozaki b37120b682 add iconv support for Armenian State Standard encoding:
ARMSCII-7, ARMSCII-8, ARMSCII-8A
2006-04-01 04:13:21 +00:00
tnozaki 307ce80709 add kk_KZ.PT154 locale and iconv support for PTCP154. 2006-03-28 14:44:00 +00:00
tnozaki e01718f7f1 do not use LS0/LS1 but SI/SO. 2006-03-17 16:37:10 +00:00
tnozaki c0f03c8796 suppress annoying warning messages.
noticed by hannken AT netbsd DOT org, thanks.
2006-03-16 14:28:19 +00:00
tnozaki 274950e65d added macintosh encodings(single-byte only). 2006-03-13 19:45:35 +00:00
tnozaki 71d1b0a7b1 CP936/GBK mapping table incomplete. 2006-02-15 19:33:06 +00:00
tnozaki 30d806de60 remove codepoint overlaps with KS5601. 2006-02-11 17:31:43 +00:00
tnozaki 254e94eef6 remove codepoint overlaps KS5601.
see src/share/i18n/esdb/CP/CP949.src
2006-02-11 16:26:46 +00:00
tnozaki 0f4c8d531d Fix lib/32749: iconv(3): CP949 mapping table is incomplete.
reported by Sergey Svishchev, thanks.
2006-02-11 14:49:50 +00:00
tshiozak 87a9a60239 add lacked correspondences between CP932 and UCS.
This is reported by Dave Huang on lib/25795,
MORIYAMA Masayuki <msyk _at_ mtg.biglobe.ne.jp> and
"NARUSE, Yui" <naruse _at_ airemix.com>, and fixed by MORIYAMA-san.
2005-10-18 06:49:28 +00:00
wiz 2198bef95e Add unicode, unicodelittle, and unicodebig aliases. Suggested by tshiozak@. 2005-10-16 19:49:16 +00:00
tshiozak 3095745573 change OOB_MODE to INVALID; this mapper is only used to be layered on
other mappers, thus ILSEQ is irrelevance.
2005-09-08 09:40:20 +00:00
tshiozak 3d40ace0a6 attach tables to CP932<->UCS mappers for NEC-defined VDC zone. 2005-09-07 07:17:59 +00:00
tnozaki a3b248100e add csmapper:CNS11643-1,2 and esdb:ISO-2022-CN,
integrate esdb:EUC-TW, locale:zh_TW.eucTW.
2005-03-27 22:30:05 +00:00
christos dd554c952a PR/29658: Joerg Sonnenberger: incorrect DST_UNIT_BITS and DST_INVALID in
character set maps:

A number of Code Pages specify a 8bit character mapping, but use
16bit runes, ma king them inaccessible for the actual conversion.
Similiary do some Code Pages specify an invalid rune e.g. of 0x00,
which is actually used in the mapping table and makes that rune
mapping invalid.
2005-03-11 23:38:02 +00:00
tshiozak 16758dd815 remove harmful 0x0000 to non-zero mappings.
reported by Joerg Sonnenberger on tech-userlevel@.
2005-03-09 11:31:40 +00:00
tnozaki fe05f588fb add new citrus iconv module UTF-7.
thanks advice, yamt-san.
2005-03-05 18:05:14 +00:00
jdolecek b6df8b1e96 add windows-12?? aliases for CP12?? 2005-02-25 22:00:50 +00:00
tnozaki af5bc0e0b7 add iso-2022-kr support to iconv. 2005-02-10 18:33:00 +00:00
tnozaki a966c55826 fix unicode 0xFF80 page mapping, lib/28325.
OK'ed by tshiozak AT netbsd DOT org.
2004-11-16 18:36:15 +00:00
tshiozak 94244ec3de add missed entry for CP866. (lib/27060) 2004-09-28 06:51:37 +00:00
lukem 130ab7336b Rework how MAKEVERBOSE operates:
*	Don't bother prefixing commands with a line of  ${_MKCMD}\
	and instead rely upon "make -s".  This is less intrusive on
	all the Makefiles than the former.  Idea from David Laight.

    *	Rename the variables use to print messages.  The scheme now is:
	    _MKMSG_FOO		Run  _MKMSG 'foo'
	    _MKTARGET_FOO	Run  _MKMSG_FOO ${.TARGET}
	From discussion with Alistair Crooks.
2003-10-21 10:01:19 +00:00
itojun 566d91a6d8 add GB2312 after JIS0208/0213, then there's no confusion.
some chinese chars may get converted into JIS0208/0213 on
"iconv -f utf-8 -t iso-2022-jp-2".  but anyway, it's iso-2022-*jp*-2 so
japanese chars should take precedence.
XXX what should we do about X11 ctext?

i really hate unicode han unification.
do we really need to support language tags?
2003-10-20 08:00:33 +00:00
itojun 4f2eafbac7 add iso-2022-jp-2.
XXX somehow adding GB2312 causes iconv to mishandle jis x0208.
2003-10-20 01:26:44 +00:00
lukem 87700a22c8 Implement MAKEVERBOSE.
Rework a bunch of make .for loops into sh for loops.

(XXX: This stuff could be implemented in a more efficient manner that
does a lot less of  SOMEMAKEVAR!=some shell command  , etc etc)
2003-10-19 14:08:50 +00:00
tshiozak 8bf28f7014 update conversion tables for KSC5601(KSX1001) <-> UCS.
reported by bjy.
2003-09-26 17:58:42 +00:00
tshiozak ad49f181fd To ensure the mapping identical, change the unicode codepoint for the reverse
solidus on JIS X 0208.
2003-08-18 15:28:25 +00:00
tshiozak 6c85e3334a add KOI8-U encoding support for iconv. 2003-07-20 19:26:55 +00:00
tshiozak 173e5850f7 - add converter for EUC-JP-MS.
- apply MS conversion rule to JISX0208 part of CP932.
c.f.:
  http://www.opengroup.or.jp/jvc/cde/ucs-conv-e.html
2003-07-19 20:20:39 +00:00
itojun 128a9bd8d5 add X11 CTEXT encoding
XXX add more charsets
2003-07-16 08:25:17 +00:00
tshiozak a08c1c4533 use GETA (Japanese typographical "lacked piece of type") for INVALID. 2003-07-16 06:57:34 +00:00
tshiozak 809243d0fb add aliases; ucs-{2,4}* -> utf-{16,32}*.
this hopefully fixes lib/22142.
2003-07-16 05:10:29 +00:00
tshiozak 99831f71fd set OOB_MODE to INVALID for the UCS -> any converters. 2003-07-14 06:03:36 +00:00
tshiozak 5428a0306b add OOB_MODE (out of bounds mode) and change DST_INVALID to DST_ILSEQ. 2003-07-12 16:11:01 +00:00
lukem 7b6e5e6602 NOxxx= variables must appear before <bsd.own.mk> 2003-07-10 11:27:45 +00:00
lukem 001c68bd94 Rename a large chunk of the make(1) variables which refer to a
program/tool from "FOO" to "TOOL_FOO".  The new variables are:
	TOOL_ASN1_COMPILE  TOOL_CAP_MKDB  TOOL_CAT  TOOL_CKSUM  TOOL_COMPILE_ET
	TOOL_CONFIG  TOOL_CRUNCHGEN  TOOL_CTAGS  TOOL_DB  TOOL_EQN  TOOL_FGEN
	TOOL_GENCAT  TOOL_GROFF  TOOL_HEXDUMP  TOOL_INDXBIB  TOOL_INSTALLBOOT
	TOOL_INSTALL_INFO  TOOL_M4  TOOL_MAKEFS  TOOL_MAKEINFO  TOOL_MAKEWHATIS
	TOOL_MDSETIMAGE  TOOL_MENUC  TOOL_MKCSMAPPER  TOOL_MKESDB
	TOOL_MKLOCALE  TOOL_MKMAGIC  TOOL_MKTEMP  TOOL_MSGC  TOOL_MTREE
	TOOL_PAX  TOOL_PIC  TOOL_PREPMKBOOTIMAGE  TOOL_PWD_MKDB  TOOL_REFER
	TOOL_ROFF_ASCII  TOOL_ROFF_DVI  TOOL_ROFF_HTML  TOOL_ROFF_PS
	TOOL_ROFF_RAW  TOOL_RPCGEN  TOOL_SOELIM  TOOL_SUNLABEL  TOOL_TBL
	TOOL_UUDECODE  TOOL_VGRIND  TOOL_ZIC

For each, provide default in <bsd.sys.mk> of the form:
	TOOL_FOO?=	foo
and for the ${USETOOLS}=="yes" case in <bsd.own.mk>, provide override:
	TOOL_FOO=	${TOOLDIR}/bin/${_TOOL_PREFIX}foo

Document all of these in bsd.README.

This cleans up a chunk of potential (and actual) namespace collision
within our build infrastructure, as well as improves consistency in
the share/mk documentation and provision of appropriate defaults for
each of these variables.
2003-07-10 10:33:58 +00:00
tshiozak 9badb7d5a5 add KOI8-R support for iconv.
(A simple example of adding a new conversion)
2003-07-01 12:16:23 +00:00
tshiozak 1d910e61f7 Citrus iconv support(5)
add iconv data files.  now ready to use iconv stuffs.
TODO:
- manpages.
- enrich mapper/esdb.
2003-06-27 08:39:59 +00:00