Added Mac OS Roman encoding test files for further UTF-8 testing.

git-svn-id: file:///fltk/svn/fltk/branches/branch-1.3@7655 ea41ed52-d2ee-0310-a9c1-e6b18d33e121
This commit is contained in:
Albrecht Schlosser 2010-06-25 13:47:43 +00:00
parent 142cefd106
commit 0e22a7cb26
3 changed files with 52 additions and 14 deletions

BIN
misc/MacRoman_utf-8.png Normal file

Binary file not shown.

After

Width:  |  Height:  |  Size: 103 KiB

26
misc/MacRoman_utf-8.txt Normal file
View File

@ -0,0 +1,26 @@
+---------------+-------------------------------------------------+
| octal --> | 0 1 2 3 4 5 6 7 10 11 12 13 14 16 16 17 |
| | dec. -> | 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 |
| v | v |hex| 0 1 2 3 4 5 6 7 8 9 A B C D E F |
+-----+-----+---+-------------------------------------------------+
| 0 | 0 | 0 | |
| 20 | 16 | 1 | |
| 40 | 32 | 2 | ! "" # $ % & ' ( ) * + , - . / |
| 60 | 48 | 3 | 0 1 2 3 4 5 6 7 8 9 : ; < = > ? |
| 100 | 64 | 4 | @ A B C D E F G H I J K L M N O |
| 120 | 80 | 5 | P Q R S T U V W X Y Z [ \ ] ^ _ |
| 140 | 96 | 6 | ` a b c d e f g h i j k l m n o |
| 160 | 112 | 7 | p q r s t u v w x y z { | } ~ |
| 200 | 128 | 8 | Ä Å Ç É Ñ Ö Ü á à â ä ã å ç é è |
| 220 | 144 | 9 | ê ë í ì î ï ñ ó ò ô ö õ ú ù û ü |
| 240 | 160 | A | † ° ¢ £ § • ¶ ß ® © ™ ´ ¨ ≠ Æ Ø |
| 260 | 176 | B | ∞ ± ≤ ≥ ¥ µ ∂ ∑ ∏ π ∫ ª º Ω æ ø |
| 300 | 192 | C | ¿ ¡ ¬ √ ƒ ≈ ∆ « » …   À Ã Õ Œ œ |
| 320 | 208 | D | — “ ” ÷ ◊ ÿ Ÿ fi fl |
| 340 | 224 | E | ‡ · „ ‰ Â Ê Á Ë È Í Î Ï Ì Ó Ô |
| 360 | 240 | F |  Ò Ú Û Ù ı ˆ ˜ ¯ ˘ ˙ ˚ ¸ ˝ ˛ ˇ |
+-----+-----+---+-------------------------------------------------+
File: MacRoman_utf-8.txt
Encoding: UTF-8
Note: 0xF0 (MacRoman) = U+F8FF (Unicode) = Apple logo

View File

@ -27,50 +27,62 @@ or an editor with the corresponding codepage or locale.
The UTF-8 encoded files should be usable with FLTK 1.3, FLTK 2, or FLTK 3
(future). They contain UTF-8 encoded characters at the table positions,
where the corresponding ISO-8859-1 or Windows copepage 1252 characters would
be. They should look identical as the image file (cp1252.png) on all UTF-8
capable systems.
where the corresponding ISO-8859-1, Windows copepage 1252, or Mac OS Roman
characters would be. They should look identical as the image file (cp1252.png)
or MacRoman_utf-8.png, resp., on all UTF-8 capable systems.
The files with names iso-8859-1_* contain only the ISO-8859-1 subset, i.e.
the colums with character codes x'80' - x'9F' (U+0080 - U+009F) are empty.
the colums with character codes 0x80 - 0x9F (U+0080 - U+009F) are empty.
Mac OS Roman is the Apple Mac native character set before it became Unicode
by default (OS X). The corresponding files are ordered as they would appear
with the native character set / encoding.
Special Notes:
x'22' (U+0022: "quotation mark") is doubled intentionally. This has been
0x22 (U+0022: "quotation mark") is doubled intentionally. This has been
done for better results in test/editor, because otherwise the rest of the
file would be shown in blue color (as a comment).
x'98' (U+02DC: "small tilde") may not display correctly on Windows systems.
0x98 (U+02DC: "small tilde") may not display correctly on Windows systems.
I saw the same effect with other Windows editors as well. The following
characters may be displayed "shifted left" by one position. Seems to work
correctly on Linux (with UTF-8 encoding).
Mac OS Roman: 0xF0 (U+F8FF: "Apple logo") may not be printable, depending
on installed fonts on other systems.
There may currently be drawing artefacts when moving the cursor forwards
and/or backwards over some "special" characters.
References:
http://unicode.org/Public/MAPPINGS/ISO8859/8859-1.TXT
http://unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1252.TXT
http://unicode.org/Public/MAPPINGS/VENDORS/APPLE/ROMAN.TXT
http://en.wikipedia.org/wiki/Mac_OS_Roman
Files:
The first file is an image of how it should look:
The first two files are images of how it should look:
cp1252.png full Windows Codepage 1252 (Western)
cp1252.png full Windows Codepage 1252 (Western)
MacRoman_utf-8.png full Mac OS Roman character set (w/o Apple logo)
The following two files contain the full Windows Codepage 1252:
cp1252.txt native (Windows) encoding, suitable for FLTK 1.1
cp1252_utf-8.txt UTF-8 encoding, suitable for FLTK 1.3 and greater
cp1252.txt native (Windows) encoding, suitable for FLTK 1.1
cp1252_utf-8.txt UTF-8 encoding, suitable for FLTK 1.3 and greater
The following two files contain only the ISO-8859-1 characters:
The following two files contain only the ISO-8859-1 subset:
iso-8859-1.txt native (Windows) encoding, suitable for FLTK 1.1
iso-8859-1_utf-8.txt UTF-8 encoding, suitable for FLTK 1.3 and greater
iso-8859-1.txt native (Windows) encoding, suitable for FLTK 1.1
iso-8859-1_utf-8.txt UTF-8 encoding, suitable for FLTK 1.3 and greater
The following file contains the full Mac OS Roman character set:
MacRoman_utf-8.txt UTF-8 encoding, suitable for FLTK 1.3 and greater
(use cp1252.txt with FLTK 1.1 on Mac OS instead)
Other developer support files:
================================================================================