diff options
| author | Chong Yidong | 2009-10-31 19:30:23 +0000 |
|---|---|---|
| committer | Chong Yidong | 2009-10-31 19:30:23 +0000 |
| commit | 18430066df0ddaab670cf1beb8701d235b0f8179 (patch) | |
| tree | 1ca34ebef194da6c57656b61f74c65e73bb014d1 /doc | |
| parent | 26581f0e2bdc4d93c2da9978cd0516503c000d22 (diff) | |
| download | emacs-18430066df0ddaab670cf1beb8701d235b0f8179.tar.gz emacs-18430066df0ddaab670cf1beb8701d235b0f8179.zip | |
* mule.texi (Charsets): Numerous copyedits. Don't discuss the
`charset' property, which is irrelevant to the user manual (Bug#3526).
Diffstat (limited to 'doc')
| -rw-r--r-- | doc/emacs/ChangeLog | 5 | ||||
| -rw-r--r-- | doc/emacs/mule.texi | 67 |
2 files changed, 38 insertions, 34 deletions
diff --git a/doc/emacs/ChangeLog b/doc/emacs/ChangeLog index f8bd8252e10..82e9dcb5958 100644 --- a/doc/emacs/ChangeLog +++ b/doc/emacs/ChangeLog | |||
| @@ -1,3 +1,8 @@ | |||
| 1 | 2009-10-31 Chong Yidong <cyd@stupidchicken.com> | ||
| 2 | |||
| 3 | * mule.texi (Charsets): Numerous copyedits. Don't discuss the | ||
| 4 | `charset' property, which is irrelevant to the user manual (Bug#3526). | ||
| 5 | |||
| 1 | 2009-10-14 Juanma Barranquero <lekktu@gmail.com> | 6 | 2009-10-14 Juanma Barranquero <lekktu@gmail.com> |
| 2 | 7 | ||
| 3 | * trouble.texi (DEL Does Not Delete): Fix typo. | 8 | * trouble.texi (DEL Does Not Delete): Fix typo. |
diff --git a/doc/emacs/mule.texi b/doc/emacs/mule.texi index cca3b15e0f6..b924e3b62ca 100644 --- a/doc/emacs/mule.texi +++ b/doc/emacs/mule.texi | |||
| @@ -1608,51 +1608,50 @@ Use @kbd{C-x 8 C-h} to list all the available @kbd{C-x 8} translations. | |||
| 1608 | @section Charsets | 1608 | @section Charsets |
| 1609 | @cindex charsets | 1609 | @cindex charsets |
| 1610 | 1610 | ||
| 1611 | Emacs defines most of popular character sets (e.g. ascii, | 1611 | In Emacs, @dfn{charset} is short for ``character set''. Emacs |
| 1612 | iso-8859-1, cp1250, big5, unicode) as @dfn{charsets} and a few of its | 1612 | supports most popular charsets (such as @code{ascii}, |
| 1613 | own charsets (e.g. emacs, unicode-bmp, eight-bit). All supported | 1613 | @code{iso-8859-1}, @code{cp1250}, @code{big5}, and @code{unicode}), in |
| 1614 | characters belong to one or more charsets. Usually you don't have to | 1614 | addition to some charsets of its own (such as @code{emacs}, |
| 1615 | take care of ``charset'', but knowing about it may help understanding | 1615 | @code{unicode-bmp}, and @code{eight-bit}). All supported characters |
| 1616 | the behavior of Emacs in some cases. | 1616 | belong to one or more charsets. |
| 1617 | 1617 | ||
| 1618 | One example is a font selection. In each language environment, | 1618 | Emacs normally ``does the right thing'' with respect to charsets, so |
| 1619 | charsets have different priorities. Emacs, at first, tries to use a | 1619 | that you don't have to worry about them. However, it is sometimes |
| 1620 | font that matches with charsets of higher priority. For instance, in | 1620 | helpful to know some of the underlying details about charsets. |
| 1621 | Japanese language environment, the charset @code{japanese-jisx0208} | 1621 | |
| 1622 | has the highest priority (@pxref{Describe Language Environment}). So, | 1622 | One example is font selection (@pxref{Font X}). Each language |
| 1623 | Emacs tries to use a font whose @code{registry} property is | 1623 | environment (@pxref{Language Environments}) defines a ``priority |
| 1624 | ``JISX0208.1983-0'' for characters belonging to that charset. | 1624 | list'' for the various charsets. When searching for a font, Emacs |
| 1625 | 1625 | initially attempts to find one that can display the highest-priority | |
| 1626 | Another example is a use of @code{charset} text property. When | 1626 | charsets. For instance, in the Japanese language environment, the |
| 1627 | Emacs reads a file encoded in a coding systems that uses escape | 1627 | charset @code{japanese-jisx0208} has the highest priority, so Emacs |
| 1628 | sequences to switch charsets (e.g. iso-2022-int-1), the buffer text | 1628 | tries to use a font whose @code{registry} property is |
| 1629 | keep the information of the original charset by @code{charset} text | 1629 | @samp{JISX0208.1983-0}. |
| 1630 | property. By using this information, Emacs can write the file with | ||
| 1631 | the same byte sequence as the original. | ||
| 1632 | 1630 | ||
| 1633 | @findex list-charset-chars | 1631 | @findex list-charset-chars |
| 1634 | @cindex characters in a certain charset | 1632 | @cindex characters in a certain charset |
| 1635 | @findex describe-character-set | 1633 | @findex describe-character-set |
| 1636 | There are two commands for obtaining information about Emacs | 1634 | There are two commands that can be used to obtain information about |
| 1637 | charsets. The command @kbd{M-x list-charset-chars} prompts for a | 1635 | charsets. The command @kbd{M-x list-charset-chars} prompts for a |
| 1638 | charset name, and displays all the characters in that character set. | 1636 | charset name, and displays all the characters in that character set. |
| 1639 | The command @kbd{M-x describe-character-set} prompts for a charset | 1637 | The command @kbd{M-x describe-character-set} prompts for a charset |
| 1640 | name and displays information about that charset, including its | 1638 | name, and displays information about that charset, including its |
| 1641 | internal representation within Emacs. | 1639 | internal representation within Emacs. |
| 1642 | 1640 | ||
| 1643 | @findex list-character-sets | 1641 | @findex list-character-sets |
| 1644 | To display a list of all the supported charsets, type @kbd{M-x | 1642 | To display a list of all supported charsets, type @kbd{M-x |
| 1645 | list-character-sets}. The list gives the names of charsets and | 1643 | list-character-sets}. The list gives the names of charsets and |
| 1646 | additional information to identity each charset (see ISO/IEC's this | 1644 | additional information to identity each charset (see |
| 1647 | page <http://www.itscj.ipsj.or.jp/ISO-IR/> for the detail). In the | 1645 | @url{http://www.itscj.ipsj.or.jp/ISO-IR/} for details). In this list, |
| 1648 | list, charsets are categorized into two; the normal charsets are | 1646 | charsets are divided into two categories: @dfn{normal charsets} are |
| 1649 | listed first, and the supplementary charsets are listed last. A | 1647 | listed first, followed by @dfn{supplementary charsets}. A |
| 1650 | charset in the latter category is used for defining another charset | 1648 | supplementary charset is one that is used to define another charset |
| 1651 | (as a parent or a subset), or was used only in Emacs of the older | 1649 | (as a parent or a subset), or to provide backward-compatibility for |
| 1652 | versions. | 1650 | older Emacs versions. |
| 1653 | 1651 | ||
| 1654 | To find out which charset a character in the buffer belongs to, | 1652 | To find out which charset a character in the buffer belongs to, put |
| 1655 | put point before it and type @kbd{C-u C-x =}. | 1653 | point before it and type @kbd{C-u C-x =} (@pxref{International |
| 1654 | Chars}). | ||
| 1656 | 1655 | ||
| 1657 | @ignore | 1656 | @ignore |
| 1658 | arch-tag: 310ba60d-31ef-4ce7-91f1-f282dd57b6b3 | 1657 | arch-tag: 310ba60d-31ef-4ce7-91f1-f282dd57b6b3 |