Commit Graph

112 Commits

Author SHA1 Message Date
maya 262c7ccc67 Add more aliases for Hebrew and Arabic ISO-8859-... encodings.
ISO-8859-8 is supposed to be visual order (i.e. legible if displayed ltr)
ISO-8859-8-i is supposed to be implicit logic order
ISO-8859-8-e is supposed to be explicit about order

In practice, ISO-8859-8 implying visual order is rare, and logic
order is used. ISO-8859-8-e is rarely used.

Same for Arabic, which uses ISO-8859-6-...

Mentioned in RFC 1555, RFC 1556.
2018-06-08 12:07:41 +00:00
mbalmer 64648650d8 missing files 2011-04-29 09:57:49 +00:00
mbalmer b2c3b2a905 add Yen symbol 2011-04-28 21:02:34 +00:00
mbalmer 9d34d3bbe5 Kazakh character code mappings reside in KAZAKH/, not MISC/. I really wonder
why nobody noticed this before...
2011-04-28 07:08:59 +00:00
mbalmer 5f971e5eac Add Japan CBM character codes. 2011-04-27 20:05:56 +00:00
tnozaki 591608f7a5 1. PR/42209 various problems with EBCDIC support in iconv(3)
part3: added CP114[0-9] support.
2. fix csmapper name separator.
2009-11-14 01:56:21 +00:00
tnozaki a316eb20e2 update Unicode 5.2.0 2009-11-10 15:06:06 +00:00
tnozaki 16bacb5c04 lib/42209 various problems with EBCDIC support in iconv(3)
1. RFC1345 have wrong conversion table, 0x00-0x1F is not EBCDIC but ISO-8.
  2. correct DST_INVALID.
CP1141 is not added yet.
2009-11-08 10:37:50 +00:00
tnozaki 4079756871 1. add new alias Window-31J -> CP932, IBM-932 -> CP942C.
2. change alias IBM-943 -> CP943 to CP943"C"(compatible).
for compatibility with AIX and glibc's iconv.
2009-10-31 15:15:42 +00:00
tnozaki 3c82a9bc85 lib/42209 various problems with EBCDIC support in iconv(3)
1. add EBCDIC-US conversion table.
2. RFC1345 have wrong conversion table, 0x00-0x1F is not EBCDIC but ISO-8.
(CP273 and CP1141 is not yet)
2009-10-31 15:06:52 +00:00
apb a02cace51f Use ${TOOL_SED} instead of plain "sed" in "!=" assignments.
This may need more work to prevent warning messages during
"make cleandir" when the commands in "!=" assignments are executed
even though tools may not have been built.
2008-10-25 22:35:36 +00:00
apb f46c1de7cb Use ${TOOL_SED} instead if plain sed in Makefiles. 2008-10-25 22:27:34 +00:00
tnozaki e8a3f4691d fix same problem with CP932, order should be NEC_IBM -> IBM.
rev1.11 and rev1.12 is completely mistake, sorry.
2008-09-13 13:08:36 +00:00
tnozaki d2502d21a7 revert previous change, there's no problem for CP943. 2008-09-13 13:05:09 +00:00
tnozaki 167ed5d7b2 fix same problem with CP932, oreder should be NEC_IBM -> IBM. 2008-09-13 12:56:25 +00:00
tnozaki aaf9b2042e PR/39445, fix wrong charsets priority for CP932.
reported and tested by NARUSE, Yui-san. thanks!
2008-09-03 15:42:45 +00:00
tnozaki 5bf0fcb63d remove redundant semicolon. 2008-07-23 13:44:02 +00:00
tnozaki f026a78601 fix wrong SRC_ZONE. 2008-04-12 13:13:50 +00:00
tnozaki 2f44c5eedb update Unicode 5.1.0 and adding some missing HKSCS-2004 charaters. 2008-04-12 13:11:56 +00:00
joerg 60a3119281 Alias UTF-8 to utf8 for symmetry with latin1. 2007-12-17 14:06:57 +00:00
tnozaki 082ee1f077 forgot to add dec-mcs entry to mapper.dir 2007-07-15 14:44:37 +00:00
tnozaki 33793b3134 add CP922 support to iconv. 2007-07-15 14:41:18 +00:00
tnozaki d260ecaa44 add CP116[12](Thai ISO-8859-11 variant), CP1129,CP1163(Vietnamese). 2007-06-18 16:19:30 +00:00
tnozaki 05df4b4fef add following encoding suuport of iconvdata:
CP1046(Arabic Extended)
	CP1124(Cyrillic, Ukraine)
	CP1125(PC, Cyrillic, Ukrainian)
2007-06-02 06:09:08 +00:00
tnozaki fd2dd8ec0d add new encoding support to iconv(3):
- RISCOS-LATIN1
	- DEC-MCS
	- DEC-HANYU(libDECHanyu)
2007-04-01 18:52:28 +00:00
tnozaki 9eae719a71 1. added iconv support of KZ-1048 encoding
2. move PTCP154 to KAZAKH directory.
2007-03-13 16:34:37 +00:00
tnozaki 64b5b95f38 fix wrong mapping koi8-c,koi8-e. 2007-03-07 15:12:41 +00:00
tnozaki b9412befd1 fix koi8-c mapping. 2007-03-05 17:06:18 +00:00
tnozaki 1bf1d71e3c iconv: add following CCS/CES support.
- CNS11643-[3-7] <-> UCS:BMP/SIP (EUC-TW, ISO-2022-CN-EXT)
- HKSCS <-> UCS:BMP/SIP (Big5-HKSCS)
- JISX0213-[1-2] <-> UCS:BMP/SIP (EUC-JIS-2004,Shift_JIS-2004,ISO-2022-JP-2004)
2007-03-05 16:57:06 +00:00
tnozaki be0ee7c9f3 citrus_csmapper.c: find_best_pivot_pvdb()
- don't break loop until all the src/dst combinations are tested.
charset.pivot.CP.src
	- redo rev 1.3.
2007-02-17 12:39:44 +00:00
tnozaki a7bd6df304 backout rev1.3's change, this broke cp932 -> eucjp-ms conversion. 2007-02-08 15:48:18 +00:00
tnozaki 902f215bd9 add iconv encoding alias: cp813, cp91[2-6], cp920, cp923, cp1089. 2006-12-13 15:46:34 +00:00
tnozaki e012779dec correct reference url. 2006-12-13 15:40:42 +00:00
tnozaki 57f0023ef8 catch up KS X 1001:2002: added U+327E - CIRCLED HANGUL IEUNG U. 2006-12-04 15:01:42 +00:00
tnozaki 787d7a82ef update CP942 iconv data - don't share Unicode mapping table with JISX0208:1978. 2006-12-03 14:28:46 +00:00
tnozaki da493f6484 - correct JISX0208:1978 <-> Unicode mappings.
- gives priority to JISX0212 more than JISX0208:1978.
2006-12-03 12:02:05 +00:00
tnozaki afa086e571 1. add IBM codepage 853 support to iconv.
2. correct JISX0208:1978 <-> Unicode mapping for IBM942.
2006-12-02 14:36:13 +00:00
tnozaki e105ab22c2 add iconv support following encoding:
CP858	- CP850 with euro sign
	CP942	- variant of CP932(JIS-ROMAN + JIS78)
	CP942C	- ditto(ASCII + JIS78)
	CP943	- ditto(JIS-ROMAN + JIS83)
	CP943C	- ditto(ASCII + JIS83)
	TDS565	- Turkmen Alphabet(ISO-IR-230)
2006-11-29 15:45:26 +00:00
tnozaki 16b9b1a8bf use mapper_zone to avoid pivot conversion by ucs. 2006-11-25 15:30:44 +00:00
tnozaki 0a9d990226 1. add iconv support for Lao language - CP1133, MULELAO-1.
2. add alias Thai language: TIS620, ISO-IR-166.
2006-11-25 10:27:22 +00:00
tnozaki dec2eb4c9d 1. move GB*.esdb from i18n/esdb/MISC to i18n/esdb/GB directory.
2. add iconv support for Chinese Tradisional GB12345.
2006-11-23 03:25:24 +00:00
tnozaki 8033a5b008 1. add iconv support for following encodings:
Chinese Simplefied
        HZ, HZ8 - 7/8bit stateful encoding, see RFC1842,1843. (libHZ)
        zW      - 7bit stateful encoding, see RFC1842. (libZW)
2. add citrus_prop.[ch] - parser for encoding module's init parameter strings.
2006-11-22 23:38:25 +00:00
tnozaki 9b1aff8258 add fallback mapping. 2006-11-22 20:22:30 +00:00
tnozaki 79a70a823d 1. add iconv support for followint encoding:
Vietnamese
	TCVN	8bit Viet Nam National Standard
	VISCII	8bit RFC1456
	VIQR	7bit RFC1456(libVIQR)
    Unicode Escape (GNU libiconv compatibility)
	C99, JAVA (libUES)
2. fix iconv_std module:
	add special treatment for POSIX Defect Report #288 case.
2006-11-13 15:16:28 +00:00
tnozaki f8989225cf change georgian-ps(geo8std)'s conversion table
from X-TrueType to Internet-Draft that more widely used.
2006-11-04 06:17:35 +00:00
tnozaki 2f07b9b3fb rollback unintended charnge. 2006-11-04 03:53:11 +00:00
tnozaki 6288c3fa08 add iconv support following encodings:
ISO-2022-JP-1(RFC2237)
	CP50220,CP50221,CP50222(ISO-2022-JP variant)
	CP51932(EUC-JP variant)
2006-11-04 03:46:19 +00:00
tnozaki 4b8989d796 remove unessesary 0x00-0x20, 0x60-0x7F mapping. 2006-10-26 16:19:02 +00:00
tnozaki e53a1bf203 DST_UNIT_BITS is 8, nuke leading 0x00. 2006-10-26 16:04:21 +00:00
tnozaki fe9a7f2aa1 0x2A7E is safe character. 2006-10-26 15:24:00 +00:00