merge from trunk

author: Tom Tromey 2013-03-17 05:17:24 -0600
committer: Tom Tromey 2013-03-17 05:17:24 -0600
commit: 6bd488cd8d05aa3983ca55f70ee384732d8c0085 (patch)
tree: 5645fc7b882638d6c0eb3f61fd55bde1a63fc190 /admin/notes
parent: 71f91792e3013b397996905224f387da5cc539a9 (diff)
parent: 9c44569ea2a18099307e0571d523d8637000a153 (diff)
download: emacs-6bd488cd8d05aa3983ca55f70ee384732d8c0085.tar.gz
emacs-6bd488cd8d05aa3983ca55f70ee384732d8c0085.zip
1 files changed, 57 insertions, 5 deletions
diff --git a/admin/notes/unicode b/admin/notes/unicode
index 0654036d364..7b5e21c864b 100644
--- a/admin/notes/unicode
+++ b/admin/notes/unicode
@@ -104,12 +104,15 @@ Source file encoding
 Most Emacs source files are encoded in UTF-8 (or in ASCII, which is a
 subset), but there are a few exceptions, listed below.  Perhaps
-someday these files will be converted to UTF-8, for convenience when
+someday many of these files will be converted to UTF-8, for
-using tools like 'grep -r', but this might need nontrivial changes to
+convenience when using tools like 'grep -r', but this might need
-the build process.
+nontrivial changes to the build process.
 * chinese-big5
+     These are verbatim copies of files taken from external sources.
+     They haven't been converted to UTF-8.
        leim/CXTERM-DIC/4Corner.tit
        leim/CXTERM-DIC/ARRAY30.tit
        leim/CXTERM-DIC/ECDICT.tit
@@ -123,6 +126,9 @@ the build process.
 * chinese-iso-8bit
+     These are verbatim copies of files taken from external sources.
+     They haven't been converted to UTF-8.
        leim/CXTERM-DIC/CCDOSPY.tit
        leim/CXTERM-DIC/Punct.tit
        leim/CXTERM-DIC/QJ.tit
@@ -132,28 +138,74 @@ the build process.
        leim/MISC-DIC/CTLau.html
        leim/MISC-DIC/ziranma.cin
+ * cp850
+     This file contains non-ASCII characters in unibyte strings.  When
+     editing a keyboard layout it's more convenient to see 'é' than
+     '\202', and the MS-DOS compiler requires the single byte if a
+     backslash escape is not being used.
+        src/msdos.c
+ * iso-2022-cn-ext
+     This file is externally generated from leim/MISC-DIC/cangjie-table.b5
+     by Big5->CNS converter.  It hasn't been converted to UTF-8.
+        leim/MISC-DIC/cangjie-table.cns
 * iso-latin-2
+     These files are processed by csplain, a program that requires
+     Latin-2 input.  In 2012 the csplain maintainers started
+     recommending UTF-8, but these files haven't been converted yet.
+        etc/refcards/cs-dired-ref.tex
        etc/refcards/cs-refcard.tex
-        etc/refcards/sk-survival.tex
        etc/refcards/cs-survival.tex
-        etc/refcards/cs-dired-ref.tex
        etc/refcards/sk-dired-ref.tex
        etc/refcards/sk-refcard.tex
+        etc/refcards/sk-survival.tex
 * japanese-iso-8bit
+     SKK-JISYO.L is a verbatim copy of a file taken from an external source.
+     ja-dic.el is generated automatically by skkdic-convert; this process
+     hasn't been converted to use UTF-8.
        leim/SKK-DIC/SKK-JISYO.L
        leim/ja-dic/ja-dic.el
 * japanese-shift-jis
+     This is a verbatim copy of a file taken from an external source.
+     It hasn't been converted to UTF-8.
        admin/charsets/mapfiles/cns2ucsdkw.txt
 * no-conversion
+     This file purposely contains arbitrary bytes interspersed within text,
+     to test whether the Emacs distribution is corrupted.
        lib-src/testfile
+ * iso-2022-7bit
+     This file contains significant charset information, which is not
+     encoded in UTF-8.
+        etc/HELLO
+     These files contain characters that cannot be encoded in UTF-8.
+        leim/quail/tibetan.el
+        leim/quail/ethiopic.el
+        lisp/international/titdic-cnv.el
+        lisp/language/tibetan.el
+        lisp/language/tibet-util.el
+        lisp/language/ind-util.el
 This file is part of GNU Emacs.
author	Tom Tromey	2013-03-17 05:17:24 -0600
committer	Tom Tromey	2013-03-17 05:17:24 -0600
commit	6bd488cd8d05aa3983ca55f70ee384732d8c0085 (patch)
tree	5645fc7b882638d6c0eb3f61fd55bde1a63fc190 /admin/notes
parent	71f91792e3013b397996905224f387da5cc539a9 (diff)
parent	9c44569ea2a18099307e0571d523d8637000a153 (diff)
download	emacs-6bd488cd8d05aa3983ca55f70ee384732d8c0085.tar.gz emacs-6bd488cd8d05aa3983ca55f70ee384732d8c0085.zip

diff --git a/admin/notes/unicode b/admin/notes/unicode index 0654036d364..7b5e21c864b 100644 --- a/admin/notes/unicode +++ b/admin/notes/unicode
@@ -104,12 +104,15 @@ Source file encoding
104		104
105	Most Emacs source files are encoded in UTF-8 (or in ASCII, which is a	105	Most Emacs source files are encoded in UTF-8 (or in ASCII, which is a
106	subset), but there are a few exceptions, listed below. Perhaps	106	subset), but there are a few exceptions, listed below. Perhaps
107	someday these files will be converted to UTF-8, for convenience when	107	someday many of these files will be converted to UTF-8, for
108	using tools like 'grep -r', but this might need nontrivial changes to	108	convenience when using tools like 'grep -r', but this might need
109	the build process.	109	nontrivial changes to the build process.
110		110
111	* chinese-big5	111	* chinese-big5
112		112
		113	These are verbatim copies of files taken from external sources.
		114	They haven't been converted to UTF-8.
		115
113	leim/CXTERM-DIC/4Corner.tit	116	leim/CXTERM-DIC/4Corner.tit
114	leim/CXTERM-DIC/ARRAY30.tit	117	leim/CXTERM-DIC/ARRAY30.tit
115	leim/CXTERM-DIC/ECDICT.tit	118	leim/CXTERM-DIC/ECDICT.tit
@@ -123,6 +126,9 @@ the build process.
123		126
124	* chinese-iso-8bit	127	* chinese-iso-8bit
125		128
		129	These are verbatim copies of files taken from external sources.
		130	They haven't been converted to UTF-8.
		131
126	leim/CXTERM-DIC/CCDOSPY.tit	132	leim/CXTERM-DIC/CCDOSPY.tit
127	leim/CXTERM-DIC/Punct.tit	133	leim/CXTERM-DIC/Punct.tit
128	leim/CXTERM-DIC/QJ.tit	134	leim/CXTERM-DIC/QJ.tit
@@ -132,28 +138,74 @@ the build process.
132	leim/MISC-DIC/CTLau.html	138	leim/MISC-DIC/CTLau.html
133	leim/MISC-DIC/ziranma.cin	139	leim/MISC-DIC/ziranma.cin
134		140
		141	* cp850
		142
		143	This file contains non-ASCII characters in unibyte strings. When
		144	editing a keyboard layout it's more convenient to see 'é' than
		145	'\202', and the MS-DOS compiler requires the single byte if a
		146	backslash escape is not being used.
		147
		148	src/msdos.c
		149
		150	* iso-2022-cn-ext
		151
		152	This file is externally generated from leim/MISC-DIC/cangjie-table.b5
		153	by Big5->CNS converter. It hasn't been converted to UTF-8.
		154
		155	leim/MISC-DIC/cangjie-table.cns
		156
135	* iso-latin-2	157	* iso-latin-2
136		158
		159	These files are processed by csplain, a program that requires
		160	Latin-2 input. In 2012 the csplain maintainers started
		161	recommending UTF-8, but these files haven't been converted yet.
		162
		163	etc/refcards/cs-dired-ref.tex
137	etc/refcards/cs-refcard.tex	164	etc/refcards/cs-refcard.tex
138	etc/refcards/sk-survival.tex
139	etc/refcards/cs-survival.tex	165	etc/refcards/cs-survival.tex
140	etc/refcards/cs-dired-ref.tex
141	etc/refcards/sk-dired-ref.tex	166	etc/refcards/sk-dired-ref.tex
142	etc/refcards/sk-refcard.tex	167	etc/refcards/sk-refcard.tex
		168	etc/refcards/sk-survival.tex
143		169
144	* japanese-iso-8bit	170	* japanese-iso-8bit
145		171
		172	SKK-JISYO.L is a verbatim copy of a file taken from an external source.
		173	ja-dic.el is generated automatically by skkdic-convert; this process
		174	hasn't been converted to use UTF-8.
		175
146	leim/SKK-DIC/SKK-JISYO.L	176	leim/SKK-DIC/SKK-JISYO.L
147	leim/ja-dic/ja-dic.el	177	leim/ja-dic/ja-dic.el
148		178
149	* japanese-shift-jis	179	* japanese-shift-jis
150		180
		181	This is a verbatim copy of a file taken from an external source.
		182	It hasn't been converted to UTF-8.
		183
151	admin/charsets/mapfiles/cns2ucsdkw.txt	184	admin/charsets/mapfiles/cns2ucsdkw.txt
152		185
153	* no-conversion	186	* no-conversion
154		187
		188	This file purposely contains arbitrary bytes interspersed within text,
		189	to test whether the Emacs distribution is corrupted.
		190
155	lib-src/testfile	191	lib-src/testfile
156		192
		193	* iso-2022-7bit
		194
		195	This file contains significant charset information, which is not
		196	encoded in UTF-8.
		197
		198	etc/HELLO
		199
		200	These files contain characters that cannot be encoded in UTF-8.
		201
		202	leim/quail/tibetan.el
		203	leim/quail/ethiopic.el
		204	lisp/international/titdic-cnv.el
		205	lisp/language/tibetan.el
		206	lisp/language/tibet-util.el
		207	lisp/language/ind-util.el
		208
157		209
158	This file is part of GNU Emacs.	210	This file is part of GNU Emacs.
159		211