diff options
| author | Richard M. Stallman | 2005-04-01 22:08:47 +0000 |
|---|---|---|
| committer | Richard M. Stallman | 2005-04-01 22:08:47 +0000 |
| commit | 8b9182147e2e2430eb2a6189444e966c6e121f71 (patch) | |
| tree | f46850a08fd5b31898564749b31ff2153ff8a81a | |
| parent | 1ee49a88dd20cda9baaaa5247c46c5356e485737 (diff) | |
| download | emacs-8b9182147e2e2430eb2a6189444e966c6e121f71.tar.gz emacs-8b9182147e2e2430eb2a6189444e966c6e121f71.zip | |
(Coding System Basics): Clarify previous change.
| -rw-r--r-- | lispref/ChangeLog | 4 | ||||
| -rw-r--r-- | lispref/nonascii.texi | 25 |
2 files changed, 16 insertions, 13 deletions
diff --git a/lispref/ChangeLog b/lispref/ChangeLog index 0d11d7c0e9e..8a34499507f 100644 --- a/lispref/ChangeLog +++ b/lispref/ChangeLog | |||
| @@ -1,3 +1,7 @@ | |||
| 1 | 2005-04-01 Richard M. Stallman <rms@gnu.org> | ||
| 2 | |||
| 3 | * nonascii.texi (Coding System Basics): Clarify previous change. | ||
| 4 | |||
| 1 | 2005-04-01 Kenichi Handa <handa@m17n.org> | 5 | 2005-04-01 Kenichi Handa <handa@m17n.org> |
| 2 | 6 | ||
| 3 | * nonascii.texi (Coding System Basics): Describe about rondtrip | 7 | * nonascii.texi (Coding System Basics): Describe about rondtrip |
diff --git a/lispref/nonascii.texi b/lispref/nonascii.texi index 91a47ea50f9..4e38c300a61 100644 --- a/lispref/nonascii.texi +++ b/lispref/nonascii.texi | |||
| @@ -628,11 +628,11 @@ characters; for example, there are three coding systems for the Cyrillic | |||
| 628 | conversion, but some of them leave the choice unspecified---to be chosen | 628 | conversion, but some of them leave the choice unspecified---to be chosen |
| 629 | heuristically for each file, based on the data. | 629 | heuristically for each file, based on the data. |
| 630 | 630 | ||
| 631 | In general, a coding system doesn't guarantee a roundtrip identity, | 631 | In general, a coding system doesn't guarantee roundtrip identity: |
| 632 | i.e. decoding followed by encoding in the same coding system can | 632 | decoding text then encoding the result in the same coding system can |
| 633 | result in the different byte sequence. But there are several coding | 633 | produce a different byte sequence from the one you originally decoded. |
| 634 | systems that go guarantee that the result will be the same as what you | 634 | However, the following coding systems do guarantee that the result |
| 635 | originally decoded. They are: | 635 | will be the same as what you originally decoded: |
| 636 | 636 | ||
| 637 | @quotation | 637 | @quotation |
| 638 | chinese-big5 chinese-iso-8bit cyrillic-iso-8bit emacs-mule | 638 | chinese-big5 chinese-iso-8bit cyrillic-iso-8bit emacs-mule |
| @@ -641,14 +641,13 @@ iso-latin-4 iso-latin-5 iso-latin-8 iso-latin-9 iso-safe | |||
| 641 | japanese-iso-8bit japanese-shift-jis korean-iso-8bit raw-text | 641 | japanese-iso-8bit japanese-shift-jis korean-iso-8bit raw-text |
| 642 | @end quotation | 642 | @end quotation |
| 643 | 643 | ||
| 644 | Likewise, a coding systme doesn't guarantee the other way of roundtrip | 644 | Encoding buffer text and then decoding the result can also fail to |
| 645 | identity, i.e. encoding buffer text into a coding system followed by | 645 | reproduce the original text. For instance, when you encode Latin-2 |
| 646 | decoding again with the same coding system will produce the different | 646 | characters with @code{utf-8} and decode the result using the same |
| 647 | buffer text. For instance, when you encode Latin-2 characters by | 647 | coding system, you'll get Unicode characters (of charset |
| 648 | @code{utf-8} and decode it back by the same coding system, you'll get | 648 | @code{mule-unicode-0100-24ff}). When you encode Unicode characters |
| 649 | Unicode charactes (of charset @code{mule-unicode-0100-24ff}), and when | 649 | with @code{iso-latin-2} and decode them back with the same coding |
| 650 | you encode Unicode characters by @code{iso-latin-2} and decode it back | 650 | system, you'll get Latin-2 characters. |
| 651 | by the same coding system, you'll get Latin-2 characters. | ||
| 652 | 651 | ||
| 653 | @cindex end of line conversion | 652 | @cindex end of line conversion |
| 654 | @dfn{End of line conversion} handles three different conventions used | 653 | @dfn{End of line conversion} handles three different conventions used |