aboutsummaryrefslogtreecommitdiffstats
path: root/admin/notes/unicode
diff options
context:
space:
mode:
authorTom Tromey2013-03-08 11:57:29 -0700
committerTom Tromey2013-03-08 11:57:29 -0700
commit71f91792e3013b397996905224f387da5cc539a9 (patch)
tree4c3d3ba909e76deea1cdf73b73fca67a57149465 /admin/notes/unicode
parent6f4de085f065e11f4df3195d47479f28f5ef08ba (diff)
parentb5426561089d39f18b42bed9dbfcb531f43ed562 (diff)
downloademacs-71f91792e3013b397996905224f387da5cc539a9.tar.gz
emacs-71f91792e3013b397996905224f387da5cc539a9.zip
merge from trunk
Diffstat (limited to 'admin/notes/unicode')
-rw-r--r--admin/notes/unicode66
1 files changed, 61 insertions, 5 deletions
diff --git a/admin/notes/unicode b/admin/notes/unicode
index 21704c78a00..0654036d364 100644
--- a/admin/notes/unicode
+++ b/admin/notes/unicode
@@ -1,4 +1,4 @@
1 -*-mode: text; coding: latin-1;-*- 1 -*-mode: text; coding: utf-8;-*-
2 2
3Copyright (C) 2002-2013 Free Software Foundation, Inc. 3Copyright (C) 2002-2013 Free Software Foundation, Inc.
4See the end of the file for license conditions. 4See the end of the file for license conditions.
@@ -12,9 +12,9 @@ regard to completeness.
12 12
13 * SINGLE_BYTE_CHAR_P returns true for Latin-1 characters, which has 13 * SINGLE_BYTE_CHAR_P returns true for Latin-1 characters, which has
14 undesirable effects. E.g.: 14 undesirable effects. E.g.:
15 (multibyte-string-p (let ((s "x")) (aset s 0 ?£) s)) => nil 15 (multibyte-string-p (let ((s "x")) (aset s 0 ?£) s)) => nil
16 (multibyte-string-p (concat [?£])) => nil 16 (multibyte-string-p (concat [?£])) => nil
17 (text-char-description ?£) => "M-#" 17 (text-char-description ?£) => "M-#"
18 18
19 These examples are all fixed by the change of 2002-10-14, but 19 These examples are all fixed by the change of 2002-10-14, but
20 there still exist questionable SINGLE_BYTE_CHAR_P in the 20 there still exist questionable SINGLE_BYTE_CHAR_P in the
@@ -77,7 +77,7 @@ regard to completeness.
77 spelling and calendar, but that's not a Unicode issue.) 77 spelling and calendar, but that's not a Unicode issue.)
78 78
79 * Handle Unicode combining characters usefully, e.g. diacritics, and 79 * Handle Unicode combining characters usefully, e.g. diacritics, and
80 handle more scripts specifically (à la Devanagari). There are 80 handle more scripts specifically (à la Devanagari). There are
81 issues with canonicalization. 81 issues with canonicalization.
82 82
83 * We need tabular input methods, e.g. for maths symbols. (Not 83 * We need tabular input methods, e.g. for maths symbols. (Not
@@ -98,6 +98,62 @@ regard to completeness.
98 * Old auto-save files, and similar files, such as Gnus drafts, 98 * Old auto-save files, and similar files, such as Gnus drafts,
99 containing non-ASCII characters probably won't be re-read correctly. 99 containing non-ASCII characters probably won't be re-read correctly.
100 100
101
102Source file encoding
103--------------------
104
105Most Emacs source files are encoded in UTF-8 (or in ASCII, which is a
106subset), but there are a few exceptions, listed below. Perhaps
107someday these files will be converted to UTF-8, for convenience when
108using tools like 'grep -r', but this might need nontrivial changes to
109the build process.
110
111 * chinese-big5
112
113 leim/CXTERM-DIC/4Corner.tit
114 leim/CXTERM-DIC/ARRAY30.tit
115 leim/CXTERM-DIC/ECDICT.tit
116 leim/CXTERM-DIC/ETZY.tit
117 leim/CXTERM-DIC/PY-b5.tit
118 leim/CXTERM-DIC/Punct-b5.tit
119 leim/CXTERM-DIC/QJ-b5.tit
120 leim/CXTERM-DIC/ZOZY.tit
121 leim/MISC-DIC/CTLau-b5.html
122 leim/MISC-DIC/cangjie-table.b5
123
124 * chinese-iso-8bit
125
126 leim/CXTERM-DIC/CCDOSPY.tit
127 leim/CXTERM-DIC/Punct.tit
128 leim/CXTERM-DIC/QJ.tit
129 leim/CXTERM-DIC/SW.tit
130 leim/CXTERM-DIC/TONEPY.tit
131 leim/MISC-DIC/pinyin.map
132 leim/MISC-DIC/CTLau.html
133 leim/MISC-DIC/ziranma.cin
134
135 * iso-latin-2
136
137 etc/refcards/cs-refcard.tex
138 etc/refcards/sk-survival.tex
139 etc/refcards/cs-survival.tex
140 etc/refcards/cs-dired-ref.tex
141 etc/refcards/sk-dired-ref.tex
142 etc/refcards/sk-refcard.tex
143
144 * japanese-iso-8bit
145
146 leim/SKK-DIC/SKK-JISYO.L
147 leim/ja-dic/ja-dic.el
148
149 * japanese-shift-jis
150
151 admin/charsets/mapfiles/cns2ucsdkw.txt
152
153 * no-conversion
154
155 lib-src/testfile
156
101 157
102This file is part of GNU Emacs. 158This file is part of GNU Emacs.
103 159