aboutsummaryrefslogtreecommitdiffstats
path: root/doc
diff options
context:
space:
mode:
authorChong Yidong2009-10-31 19:30:23 +0000
committerChong Yidong2009-10-31 19:30:23 +0000
commit18430066df0ddaab670cf1beb8701d235b0f8179 (patch)
tree1ca34ebef194da6c57656b61f74c65e73bb014d1 /doc
parent26581f0e2bdc4d93c2da9978cd0516503c000d22 (diff)
downloademacs-18430066df0ddaab670cf1beb8701d235b0f8179.tar.gz
emacs-18430066df0ddaab670cf1beb8701d235b0f8179.zip
* mule.texi (Charsets): Numerous copyedits. Don't discuss the
`charset' property, which is irrelevant to the user manual (Bug#3526).
Diffstat (limited to 'doc')
-rw-r--r--doc/emacs/ChangeLog5
-rw-r--r--doc/emacs/mule.texi67
2 files changed, 38 insertions, 34 deletions
diff --git a/doc/emacs/ChangeLog b/doc/emacs/ChangeLog
index f8bd8252e10..82e9dcb5958 100644
--- a/doc/emacs/ChangeLog
+++ b/doc/emacs/ChangeLog
@@ -1,3 +1,8 @@
12009-10-31 Chong Yidong <cyd@stupidchicken.com>
2
3 * mule.texi (Charsets): Numerous copyedits. Don't discuss the
4 `charset' property, which is irrelevant to the user manual (Bug#3526).
5
12009-10-14 Juanma Barranquero <lekktu@gmail.com> 62009-10-14 Juanma Barranquero <lekktu@gmail.com>
2 7
3 * trouble.texi (DEL Does Not Delete): Fix typo. 8 * trouble.texi (DEL Does Not Delete): Fix typo.
diff --git a/doc/emacs/mule.texi b/doc/emacs/mule.texi
index cca3b15e0f6..b924e3b62ca 100644
--- a/doc/emacs/mule.texi
+++ b/doc/emacs/mule.texi
@@ -1608,51 +1608,50 @@ Use @kbd{C-x 8 C-h} to list all the available @kbd{C-x 8} translations.
1608@section Charsets 1608@section Charsets
1609@cindex charsets 1609@cindex charsets
1610 1610
1611 Emacs defines most of popular character sets (e.g. ascii, 1611 In Emacs, @dfn{charset} is short for ``character set''. Emacs
1612iso-8859-1, cp1250, big5, unicode) as @dfn{charsets} and a few of its 1612supports most popular charsets (such as @code{ascii},
1613own charsets (e.g. emacs, unicode-bmp, eight-bit). All supported 1613@code{iso-8859-1}, @code{cp1250}, @code{big5}, and @code{unicode}), in
1614characters belong to one or more charsets. Usually you don't have to 1614addition to some charsets of its own (such as @code{emacs},
1615take care of ``charset'', but knowing about it may help understanding 1615@code{unicode-bmp}, and @code{eight-bit}). All supported characters
1616the behavior of Emacs in some cases. 1616belong to one or more charsets.
1617 1617
1618 One example is a font selection. In each language environment, 1618 Emacs normally ``does the right thing'' with respect to charsets, so
1619charsets have different priorities. Emacs, at first, tries to use a 1619that you don't have to worry about them. However, it is sometimes
1620font that matches with charsets of higher priority. For instance, in 1620helpful to know some of the underlying details about charsets.
1621Japanese language environment, the charset @code{japanese-jisx0208} 1621
1622has the highest priority (@pxref{Describe Language Environment}). So, 1622 One example is font selection (@pxref{Font X}). Each language
1623Emacs tries to use a font whose @code{registry} property is 1623environment (@pxref{Language Environments}) defines a ``priority
1624``JISX0208.1983-0'' for characters belonging to that charset. 1624list'' for the various charsets. When searching for a font, Emacs
1625 1625initially attempts to find one that can display the highest-priority
1626 Another example is a use of @code{charset} text property. When 1626charsets. For instance, in the Japanese language environment, the
1627Emacs reads a file encoded in a coding systems that uses escape 1627charset @code{japanese-jisx0208} has the highest priority, so Emacs
1628sequences to switch charsets (e.g. iso-2022-int-1), the buffer text 1628tries to use a font whose @code{registry} property is
1629keep the information of the original charset by @code{charset} text 1629@samp{JISX0208.1983-0}.
1630property. By using this information, Emacs can write the file with
1631the same byte sequence as the original.
1632 1630
1633@findex list-charset-chars 1631@findex list-charset-chars
1634@cindex characters in a certain charset 1632@cindex characters in a certain charset
1635@findex describe-character-set 1633@findex describe-character-set
1636 There are two commands for obtaining information about Emacs 1634 There are two commands that can be used to obtain information about
1637charsets. The command @kbd{M-x list-charset-chars} prompts for a 1635charsets. The command @kbd{M-x list-charset-chars} prompts for a
1638charset name, and displays all the characters in that character set. 1636charset name, and displays all the characters in that character set.
1639The command @kbd{M-x describe-character-set} prompts for a charset 1637The command @kbd{M-x describe-character-set} prompts for a charset
1640name and displays information about that charset, including its 1638name, and displays information about that charset, including its
1641internal representation within Emacs. 1639internal representation within Emacs.
1642 1640
1643@findex list-character-sets 1641@findex list-character-sets
1644 To display a list of all the supported charsets, type @kbd{M-x 1642 To display a list of all supported charsets, type @kbd{M-x
1645list-character-sets}. The list gives the names of charsets and 1643list-character-sets}. The list gives the names of charsets and
1646additional information to identity each charset (see ISO/IEC's this 1644additional information to identity each charset (see
1647page <http://www.itscj.ipsj.or.jp/ISO-IR/> for the detail). In the 1645@url{http://www.itscj.ipsj.or.jp/ISO-IR/} for details). In this list,
1648list, charsets are categorized into two; the normal charsets are 1646charsets are divided into two categories: @dfn{normal charsets} are
1649listed first, and the supplementary charsets are listed last. A 1647listed first, followed by @dfn{supplementary charsets}. A
1650charset in the latter category is used for defining another charset 1648supplementary charset is one that is used to define another charset
1651(as a parent or a subset), or was used only in Emacs of the older 1649(as a parent or a subset), or to provide backward-compatibility for
1652versions. 1650older Emacs versions.
1653 1651
1654 To find out which charset a character in the buffer belongs to, 1652 To find out which charset a character in the buffer belongs to, put
1655put point before it and type @kbd{C-u C-x =}. 1653point before it and type @kbd{C-u C-x =} (@pxref{International
1654Chars}).
1656 1655
1657@ignore 1656@ignore
1658 arch-tag: 310ba60d-31ef-4ce7-91f1-f282dd57b6b3 1657 arch-tag: 310ba60d-31ef-4ce7-91f1-f282dd57b6b3