mc/misc/mc.charsets.in

14 lines
278 B
Plaintext
Raw Normal View History

ASCII 7-bit ASCII
charset: rename IBM866 to CP866 for iconv and adjust charset names for codeset ``` linux $ iconv -l | grep 866 866// 866NAV// CP866// CP866NAV// CSIBM866// IBM866// IBM866NAV// solaris $ iconv -l | grep 866 CP866 (CP866, CP-866, CP_866, 866), IBM-866, macos % iconv -l | grep 866 CP866 866 CSIBM866 IBM866 MSCP866 musl/src/locale/codepages.h: "cp866\0" ``` On glibc-based systems, codeset will be set to charmap name, on most other systems it seems to be taken from locale name. ## Linux ``` zaytsev@fedora:~$ locale -a | grep ru ru_RU ru_RU.cp866 ru_RU.ibm866 ru_RU.iso88595 ru_RU.koi8r ru_RU.utf8 russian ru_UA ru_UA.koi8u ru_UA.utf8 zaytsev@fedora:~/src$ LC_ALL=ru_RU.cp866 LANG=ru_RU.cp866 ./a.out IBM866 zaytsev@fedora:~/src$ LC_ALL=ru_RU.koi8r LANG=ru_RU.koi8r ./a.out KOI8-R zaytsev@fedora:~/src$ LC_ALL=ru_RU.iso88595 LANG=ru_RU.iso88595 ./a.out ISO-8859-5 ``` ## macOS ``` ru_RU.ISO8859-5 ru_RU.CP866 ru_RU.CP1251 ru_RU.UTF-8 ru_RU.KOI8-R ru_RU zaytsev@Yurys-MBP mc % LANG=ru_RU.CP866 LC_ALL=ru_RU.CP866 ./a.out CP866 zaytsev@Yurys-MBP mc % LANG=ru_RU.ISO8859-5 LC_ALL=ru_RU.ISO8859-5 ./a.out ISO8859-5 ``` ## FreeBSD ``` ru_RU.CP1251 ru_RU.CP866 ru_RU.ISO8859-5 ru_RU.KOI8-R ru_RU.UTF-8 zaytsev@cfarm240:~ $ LANG=ru_RU.ISO8859-5 LC_ALL=ru_RU.ISO8859-5 ./a.out ISO8859-5 zaytsev@cfarm240:~ $ LANG=ru_RU.CP866 LC_ALL=ru_RU.CP866 ./a.out CP866 ``` ## Solaris ``` ru ru.UTF-8 ru.koi8-r ru_RU ru_RU.ANSI1251 ru_RU.ISO8859-5 ru_RU.KOI8-R ru_RU.UTF-8 zaytsev@gcc-solaris10:~/src$ LANG=ru_RU.ISO8859-5 LC_ALL=ru_RU.ISO8859-5 ./a.o> ISO8859-5 zaytsev@gcc-solaris10:~/src$ LANG=ru.koi8-r LC_ALL=ru.koi8-r ./a.out KOI8-R ``` ## AIX ``` zaytsev@gcc111:[/home/zaytsev]locale -a C POSIX en_US.8859-15 en_US.IBM-858 en_US.ISO8859-1 en_US -bash-5.1$ LANG=en_US.ISO8859-1 LC_ALL=en_US.ISO8859-1 ./a.out ISO8859-1 ``` Signed-off-by: Yury V. Zaytsev <yury@shurup.com>
2024-09-13 11:18:11 +03:00
@ENCODING_ISO8859@-1 ISO 8859-1
@ENCODING_ISO8859@-2 ISO 8859-2
@ENCODING_ISO8859@-5 ISO 8859-5
CP1250 Windows 1250
charset: rename IBM866 to CP866 for iconv and adjust charset names for codeset ``` linux $ iconv -l | grep 866 866// 866NAV// CP866// CP866NAV// CSIBM866// IBM866// IBM866NAV// solaris $ iconv -l | grep 866 CP866 (CP866, CP-866, CP_866, 866), IBM-866, macos % iconv -l | grep 866 CP866 866 CSIBM866 IBM866 MSCP866 musl/src/locale/codepages.h: "cp866\0" ``` On glibc-based systems, codeset will be set to charmap name, on most other systems it seems to be taken from locale name. ## Linux ``` zaytsev@fedora:~$ locale -a | grep ru ru_RU ru_RU.cp866 ru_RU.ibm866 ru_RU.iso88595 ru_RU.koi8r ru_RU.utf8 russian ru_UA ru_UA.koi8u ru_UA.utf8 zaytsev@fedora:~/src$ LC_ALL=ru_RU.cp866 LANG=ru_RU.cp866 ./a.out IBM866 zaytsev@fedora:~/src$ LC_ALL=ru_RU.koi8r LANG=ru_RU.koi8r ./a.out KOI8-R zaytsev@fedora:~/src$ LC_ALL=ru_RU.iso88595 LANG=ru_RU.iso88595 ./a.out ISO-8859-5 ``` ## macOS ``` ru_RU.ISO8859-5 ru_RU.CP866 ru_RU.CP1251 ru_RU.UTF-8 ru_RU.KOI8-R ru_RU zaytsev@Yurys-MBP mc % LANG=ru_RU.CP866 LC_ALL=ru_RU.CP866 ./a.out CP866 zaytsev@Yurys-MBP mc % LANG=ru_RU.ISO8859-5 LC_ALL=ru_RU.ISO8859-5 ./a.out ISO8859-5 ``` ## FreeBSD ``` ru_RU.CP1251 ru_RU.CP866 ru_RU.ISO8859-5 ru_RU.KOI8-R ru_RU.UTF-8 zaytsev@cfarm240:~ $ LANG=ru_RU.ISO8859-5 LC_ALL=ru_RU.ISO8859-5 ./a.out ISO8859-5 zaytsev@cfarm240:~ $ LANG=ru_RU.CP866 LC_ALL=ru_RU.CP866 ./a.out CP866 ``` ## Solaris ``` ru ru.UTF-8 ru.koi8-r ru_RU ru_RU.ANSI1251 ru_RU.ISO8859-5 ru_RU.KOI8-R ru_RU.UTF-8 zaytsev@gcc-solaris10:~/src$ LANG=ru_RU.ISO8859-5 LC_ALL=ru_RU.ISO8859-5 ./a.o> ISO8859-5 zaytsev@gcc-solaris10:~/src$ LANG=ru.koi8-r LC_ALL=ru.koi8-r ./a.out KOI8-R ``` ## AIX ``` zaytsev@gcc111:[/home/zaytsev]locale -a C POSIX en_US.8859-15 en_US.IBM-858 en_US.ISO8859-1 en_US -bash-5.1$ LANG=en_US.ISO8859-1 LC_ALL=en_US.ISO8859-1 ./a.out ISO8859-1 ``` Signed-off-by: Yury V. Zaytsev <yury@shurup.com>
2024-09-13 11:18:11 +03:00
@ENCODING_CP1251@ Windows 1251
CP437 CP 437
CP850 CP 850
CP852 CP 852
charset: rename IBM866 to CP866 for iconv and adjust charset names for codeset ``` linux $ iconv -l | grep 866 866// 866NAV// CP866// CP866NAV// CSIBM866// IBM866// IBM866NAV// solaris $ iconv -l | grep 866 CP866 (CP866, CP-866, CP_866, 866), IBM-866, macos % iconv -l | grep 866 CP866 866 CSIBM866 IBM866 MSCP866 musl/src/locale/codepages.h: "cp866\0" ``` On glibc-based systems, codeset will be set to charmap name, on most other systems it seems to be taken from locale name. ## Linux ``` zaytsev@fedora:~$ locale -a | grep ru ru_RU ru_RU.cp866 ru_RU.ibm866 ru_RU.iso88595 ru_RU.koi8r ru_RU.utf8 russian ru_UA ru_UA.koi8u ru_UA.utf8 zaytsev@fedora:~/src$ LC_ALL=ru_RU.cp866 LANG=ru_RU.cp866 ./a.out IBM866 zaytsev@fedora:~/src$ LC_ALL=ru_RU.koi8r LANG=ru_RU.koi8r ./a.out KOI8-R zaytsev@fedora:~/src$ LC_ALL=ru_RU.iso88595 LANG=ru_RU.iso88595 ./a.out ISO-8859-5 ``` ## macOS ``` ru_RU.ISO8859-5 ru_RU.CP866 ru_RU.CP1251 ru_RU.UTF-8 ru_RU.KOI8-R ru_RU zaytsev@Yurys-MBP mc % LANG=ru_RU.CP866 LC_ALL=ru_RU.CP866 ./a.out CP866 zaytsev@Yurys-MBP mc % LANG=ru_RU.ISO8859-5 LC_ALL=ru_RU.ISO8859-5 ./a.out ISO8859-5 ``` ## FreeBSD ``` ru_RU.CP1251 ru_RU.CP866 ru_RU.ISO8859-5 ru_RU.KOI8-R ru_RU.UTF-8 zaytsev@cfarm240:~ $ LANG=ru_RU.ISO8859-5 LC_ALL=ru_RU.ISO8859-5 ./a.out ISO8859-5 zaytsev@cfarm240:~ $ LANG=ru_RU.CP866 LC_ALL=ru_RU.CP866 ./a.out CP866 ``` ## Solaris ``` ru ru.UTF-8 ru.koi8-r ru_RU ru_RU.ANSI1251 ru_RU.ISO8859-5 ru_RU.KOI8-R ru_RU.UTF-8 zaytsev@gcc-solaris10:~/src$ LANG=ru_RU.ISO8859-5 LC_ALL=ru_RU.ISO8859-5 ./a.o> ISO8859-5 zaytsev@gcc-solaris10:~/src$ LANG=ru.koi8-r LC_ALL=ru.koi8-r ./a.out KOI8-R ``` ## AIX ``` zaytsev@gcc111:[/home/zaytsev]locale -a C POSIX en_US.8859-15 en_US.IBM-858 en_US.ISO8859-1 en_US -bash-5.1$ LANG=en_US.ISO8859-1 LC_ALL=en_US.ISO8859-1 ./a.out ISO8859-1 ``` Signed-off-by: Yury V. Zaytsev <yury@shurup.com>
2024-09-13 11:18:11 +03:00
@ENCODING_CP866@ CP 866
KOI8-R KOI8-R
KOI8-U KOI8-U
2009-01-13 17:18:40 +03:00
UTF-8 UTF-8