author Tom Tromey 2013-03-17 05:17:24 -0600
committer Tom Tromey 2013-03-17 05:17:24 -0600
commit 6bd488cd8d05aa3983ca55f70ee384732d8c0085
tree 5645fc7b882638d6c0eb3f61fd55bde1a63fc190 /admin/notes
parent 71f91792e3013b397996905224f387da5cc539a9
parent 9c44569ea2a18099307e0571d523d8637000a153
merge from trunk
Diffstat (limited to 'admin/notes')
-rw-r--r-- admin/notes/unicode 62
1 files changed, 57 insertions, 5 deletions
diff --git a/admin/notes/unicode b/admin/notes/unicode
index 0654036d364..7b5e21c864b 100644
--- a/admin/notes/unicode
+++ b/admin/notes/unicode
@@ -104,12 +104,15 @@ Source file encoding
 
 Most Emacs source files are encoded in UTF-8 (or in ASCII, which is a
 subset), but there are a few exceptions, listed below. Perhaps
-someday these files will be converted to UTF-8, for convenience when
-using tools like 'grep -r', but this might need nontrivial changes to
-the build process.
+someday many of these files will be converted to UTF-8, for
+convenience when using tools like 'grep -r', but this might need
+nontrivial changes to the build process.
 
  * chinese-big5
 
+     These are verbatim copies of files taken from external sources.
+     They haven't been converted to UTF-8.
+
      leim/CXTERM-DIC/4Corner.tit
      leim/CXTERM-DIC/ARRAY30.tit
      leim/CXTERM-DIC/ECDICT.tit
@@ -123,6 +126,9 @@ the build process.
 
  * chinese-iso-8bit
 
+     These are verbatim copies of files taken from external sources.
+     They haven't been converted to UTF-8.
+
      leim/CXTERM-DIC/CCDOSPY.tit
      leim/CXTERM-DIC/Punct.tit
      leim/CXTERM-DIC/QJ.tit
@@ -132,28 +138,74 @@ the build process.
      leim/MISC-DIC/CTLau.html
      leim/MISC-DIC/ziranma.cin
 
+ * cp850
+
+     This file contains non-ASCII characters in unibyte strings. When
+     editing a keyboard layout it's more convenient to see 'é' than
+     '\202', and the MS-DOS compiler requires the single byte if a
+     backslash escape is not being used.
+
+     src/msdos.c
+
+ * iso-2022-cn-ext
+
+     This file is externally generated from leim/MISC-DIC/cangjie-table.b5
+     by Big5->CNS converter. It hasn't been converted to UTF-8.
+
+     leim/MISC-DIC/cangjie-table.cns
+
  * iso-latin-2
 
+     These files are processed by csplain, a program that requires
+     Latin-2 input. In 2012 the csplain maintainers started
+     recommending UTF-8, but these files haven't been converted yet.
+
+     etc/refcards/cs-dired-ref.tex
      etc/refcards/cs-refcard.tex
-     etc/refcards/sk-survival.tex
      etc/refcards/cs-survival.tex
-     etc/refcards/cs-dired-ref.tex
      etc/refcards/sk-dired-ref.tex
      etc/refcards/sk-refcard.tex
+     etc/refcards/sk-survival.tex
 
  * japanese-iso-8bit
 
+     SKK-JISYO.L is a verbatim copy of a file taken from an external source.
+     ja-dic.el is generated automatically by skkdic-convert; this process
+     hasn't been converted to use UTF-8.
+
      leim/SKK-DIC/SKK-JISYO.L
      leim/ja-dic/ja-dic.el
 
  * japanese-shift-jis
 
+     This is a verbatim copy of a file taken from an external source.
+     It hasn't been converted to UTF-8.
+
      admin/charsets/mapfiles/cns2ucsdkw.txt
 
  * no-conversion
 
+     This file purposely contains arbitrary bytes interspersed within text,
+     to test whether the Emacs distribution is corrupted.
+
      lib-src/testfile
 
+ * iso-2022-7bit
+
+     This file contains significant charset information, which is not
+     encoded in UTF-8.
+
+     etc/HELLO
+
+     These files contain characters that cannot be encoded in UTF-8.
+
+     leim/quail/tibetan.el
+     leim/quail/ethiopic.el
+     lisp/international/titdic-cnv.el
+     lisp/language/tibetan.el
+     lisp/language/tibet-util.el
+     lisp/language/ind-util.el
+
 
 This file is part of GNU Emacs.
 
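
The cp850 note for src/msdos.c can be illustrated with a short Python sketch. It is not part of the Emacs tree, and `is_valid_utf8` is a hypothetical helper name chosen for this example. Assuming Python's standard `cp850` codec, it shows that the octal escape '\202' denotes the single byte 0x82, which reads as 'é' under cp850 but is not valid UTF-8 on its own, so a file holding that raw byte cannot simply be relabeled as UTF-8.

```python
# Illustrative sketch only; nothing here comes from the Emacs sources.

def is_valid_utf8(data: bytes) -> bool:
    """Hypothetical helper: return True if the bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


# The C escape '\202' is octal, i.e. the single byte 0x82.
raw_byte = bytes([0o202])

# Under cp850 (the MS-DOS code page named in the note) this byte is 'é'.
as_cp850 = raw_byte.decode("cp850")

# The same character in UTF-8 takes two bytes, so the raw cp850 byte
# is not a valid UTF-8 sequence by itself.
as_utf8 = as_cp850.encode("utf-8")

print(repr(as_cp850), raw_byte.hex(), as_utf8.hex(), is_valid_utf8(raw_byte))
```

A check like `is_valid_utf8` applied to each file's bytes is one way to find files that are not yet UTF-8, though it cannot tell which legacy encoding a failing file actually uses.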