| author | Tom Tromey | 2013-03-17 05:17:24 -0600 |
|---|---|---|
| committer | Tom Tromey | 2013-03-17 05:17:24 -0600 |
| commit | 6bd488cd8d05aa3983ca55f70ee384732d8c0085 (patch) | |
| tree | 5645fc7b882638d6c0eb3f61fd55bde1a63fc190 /admin/notes | |
| parent | 71f91792e3013b397996905224f387da5cc539a9 (diff) | |
| parent | 9c44569ea2a18099307e0571d523d8637000a153 (diff) | |
merge from trunk
Diffstat (limited to 'admin/notes')
| -rw-r--r-- | admin/notes/unicode | 62 |
1 files changed, 57 insertions, 5 deletions
diff --git a/admin/notes/unicode b/admin/notes/unicode
index 0654036d364..7b5e21c864b 100644
--- a/admin/notes/unicode
+++ b/admin/notes/unicode
| @@ -104,12 +104,15 @@ Source file encoding | |||
| 104 | 104 | ||
| 105 | Most Emacs source files are encoded in UTF-8 (or in ASCII, which is a | 105 | Most Emacs source files are encoded in UTF-8 (or in ASCII, which is a |
| 106 | subset), but there are a few exceptions, listed below. Perhaps | 106 | subset), but there are a few exceptions, listed below. Perhaps |
| 107 | someday these files will be converted to UTF-8, for convenience when | 107 | someday many of these files will be converted to UTF-8, for |
| 108 | using tools like 'grep -r', but this might need nontrivial changes to | 108 | convenience when using tools like 'grep -r', but this might need |
| 109 | the build process. | 109 | nontrivial changes to the build process. |
| 110 | 110 | ||
| 111 | * chinese-big5 | 111 | * chinese-big5 |
| 112 | 112 | ||
| 113 | These are verbatim copies of files taken from external sources. | ||
| 114 | They haven't been converted to UTF-8. | ||
| 115 | |||
| 113 | leim/CXTERM-DIC/4Corner.tit | 116 | leim/CXTERM-DIC/4Corner.tit |
| 114 | leim/CXTERM-DIC/ARRAY30.tit | 117 | leim/CXTERM-DIC/ARRAY30.tit |
| 115 | leim/CXTERM-DIC/ECDICT.tit | 118 | leim/CXTERM-DIC/ECDICT.tit |
| @@ -123,6 +126,9 @@ the build process. | |||
| 123 | 126 | ||
| 124 | * chinese-iso-8bit | 127 | * chinese-iso-8bit |
| 125 | 128 | ||
| 129 | These are verbatim copies of files taken from external sources. | ||
| 130 | They haven't been converted to UTF-8. | ||
| 131 | |||
| 126 | leim/CXTERM-DIC/CCDOSPY.tit | 132 | leim/CXTERM-DIC/CCDOSPY.tit |
| 127 | leim/CXTERM-DIC/Punct.tit | 133 | leim/CXTERM-DIC/Punct.tit |
| 128 | leim/CXTERM-DIC/QJ.tit | 134 | leim/CXTERM-DIC/QJ.tit |
| @@ -132,28 +138,74 @@ the build process. | |||
| 132 | leim/MISC-DIC/CTLau.html | 138 | leim/MISC-DIC/CTLau.html |
| 133 | leim/MISC-DIC/ziranma.cin | 139 | leim/MISC-DIC/ziranma.cin |
| 134 | 140 | ||
| 141 | * cp850 | ||
| 142 | |||
| 143 | This file contains non-ASCII characters in unibyte strings. When | ||
| 144 | editing a keyboard layout it's more convenient to see 'é' than | ||
| 145 | '\202', and the MS-DOS compiler requires the single byte if a | ||
| 146 | backslash escape is not being used. | ||
| 147 | |||
| 148 | src/msdos.c | ||
| 149 | |||
| 150 | * iso-2022-cn-ext | ||
| 151 | |||
| 152 | This file is externally generated from leim/MISC-DIC/cangjie-table.b5 | ||
| 153 | by Big5->CNS converter. It hasn't been converted to UTF-8. | ||
| 154 | |||
| 155 | leim/MISC-DIC/cangjie-table.cns | ||
| 156 | |||
| 135 | * iso-latin-2 | 157 | * iso-latin-2 |
| 136 | 158 | ||
| 159 | These files are processed by csplain, a program that requires | ||
| 160 | Latin-2 input. In 2012 the csplain maintainers started | ||
| 161 | recommending UTF-8, but these files haven't been converted yet. | ||
| 162 | |||
| 163 | etc/refcards/cs-dired-ref.tex | ||
| 137 | etc/refcards/cs-refcard.tex | 164 | etc/refcards/cs-refcard.tex |
| 138 | etc/refcards/sk-survival.tex | ||
| 139 | etc/refcards/cs-survival.tex | 165 | etc/refcards/cs-survival.tex |
| 140 | etc/refcards/cs-dired-ref.tex | ||
| 141 | etc/refcards/sk-dired-ref.tex | 166 | etc/refcards/sk-dired-ref.tex |
| 142 | etc/refcards/sk-refcard.tex | 167 | etc/refcards/sk-refcard.tex |
| 168 | etc/refcards/sk-survival.tex | ||
| 143 | 169 | ||
| 144 | * japanese-iso-8bit | 170 | * japanese-iso-8bit |
| 145 | 171 | ||
| 172 | SKK-JISYO.L is a verbatim copy of a file taken from an external source. | ||
| 173 | ja-dic.el is generated automatically by skkdic-convert; this process | ||
| 174 | hasn't been converted to use UTF-8. | ||
| 175 | |||
| 146 | leim/SKK-DIC/SKK-JISYO.L | 176 | leim/SKK-DIC/SKK-JISYO.L |
| 147 | leim/ja-dic/ja-dic.el | 177 | leim/ja-dic/ja-dic.el |
| 148 | 178 | ||
| 149 | * japanese-shift-jis | 179 | * japanese-shift-jis |
| 150 | 180 | ||
| 181 | This is a verbatim copy of a file taken from an external source. | ||
| 182 | It hasn't been converted to UTF-8. | ||
| 183 | |||
| 151 | admin/charsets/mapfiles/cns2ucsdkw.txt | 184 | admin/charsets/mapfiles/cns2ucsdkw.txt |
| 152 | 185 | ||
| 153 | * no-conversion | 186 | * no-conversion |
| 154 | 187 | ||
| 188 | This file purposely contains arbitrary bytes interspersed within text, | ||
| 189 | to test whether the Emacs distribution is corrupted. | ||
| 190 | |||
| 155 | lib-src/testfile | 191 | lib-src/testfile |
| 156 | 192 | ||
| 193 | * iso-2022-7bit | ||
| 194 | |||
| 195 | This file contains significant charset information, which is not | ||
| 196 | encoded in UTF-8. | ||
| 197 | |||
| 198 | etc/HELLO | ||
| 199 | |||
| 200 | These files contain characters that cannot be encoded in UTF-8. | ||
| 201 | |||
| 202 | leim/quail/tibetan.el | ||
| 203 | leim/quail/ethiopic.el | ||
| 204 | lisp/international/titdic-cnv.el | ||
| 205 | lisp/language/tibetan.el | ||
| 206 | lisp/language/tibet-util.el | ||
| 207 | lisp/language/ind-util.el | ||
| 208 | |||
| 157 | 209 | ||
| 158 | This file is part of GNU Emacs. | 210 | This file is part of GNU Emacs. |
| 159 | 211 | ||
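
The cp850 entry in the diff above says that 'é' is stored in src/msdos.c as a single byte, which would otherwise be written as the escape '\202'. The following Python sketch is only an illustration of that byte-level claim and of why such a file is not valid UTF-8. It is not part of the Emacs sources or build process, and the helper name `is_valid_utf8` is hypothetical.

```python
# Illustrative sketch only; not taken from the Emacs tree.
# Assumes Python's built-in "cp850" codec matches the code page the note refers to.


def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8 (hypothetical helper)."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


# U+00E9 is 'é'; in code page 850 it occupies exactly one byte.
e_acute_cp850 = "\u00e9".encode("cp850")

# Render that byte as a C-style octal escape, as the note writes it.
octal_escape = "\\{:o}".format(e_acute_cp850[0])

# The same character in UTF-8 takes two bytes, so the encodings differ on disk.
e_acute_utf8 = "\u00e9".encode("utf-8")

print(e_acute_cp850, octal_escape, is_valid_utf8(e_acute_cp850))
print(e_acute_utf8, is_valid_utf8(e_acute_utf8))
```

A lone cp850 byte for 'é' is a UTF-8 continuation byte with no lead byte, so a strict UTF-8 decoder rejects it. This is consistent with the note listing src/msdos.c among the files that are not UTF-8.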