aboutsummaryrefslogtreecommitdiffstats
diff options
context:
space:
mode:
authorRichard M. Stallman2005-04-01 22:08:47 +0000
committerRichard M. Stallman2005-04-01 22:08:47 +0000
commit8b9182147e2e2430eb2a6189444e966c6e121f71 (patch)
treef46850a08fd5b31898564749b31ff2153ff8a81a
parent1ee49a88dd20cda9baaaa5247c46c5356e485737 (diff)
downloademacs-8b9182147e2e2430eb2a6189444e966c6e121f71.tar.gz
emacs-8b9182147e2e2430eb2a6189444e966c6e121f71.zip
(Coding System Basics): Clarify previous change.
-rw-r--r--lispref/ChangeLog4
-rw-r--r--lispref/nonascii.texi25
2 files changed, 16 insertions, 13 deletions
diff --git a/lispref/ChangeLog b/lispref/ChangeLog
index 0d11d7c0e9e..8a34499507f 100644
--- a/lispref/ChangeLog
+++ b/lispref/ChangeLog
@@ -1,3 +1,7 @@
12005-04-01 Richard M. Stallman <rms@gnu.org>
2
3 * nonascii.texi (Coding System Basics): Clarify previous change.
4
12005-04-01 Kenichi Handa <handa@m17n.org> 52005-04-01 Kenichi Handa <handa@m17n.org>
2 6
3 * nonascii.texi (Coding System Basics): Describe about rondtrip 7 * nonascii.texi (Coding System Basics): Describe about rondtrip
diff --git a/lispref/nonascii.texi b/lispref/nonascii.texi
index 91a47ea50f9..4e38c300a61 100644
--- a/lispref/nonascii.texi
+++ b/lispref/nonascii.texi
@@ -628,11 +628,11 @@ characters; for example, there are three coding systems for the Cyrillic
628conversion, but some of them leave the choice unspecified---to be chosen 628conversion, but some of them leave the choice unspecified---to be chosen
629heuristically for each file, based on the data. 629heuristically for each file, based on the data.
630 630
631In general, a coding system doesn't guarantee a roundtrip identity, 631In general, a coding system doesn't guarantee roundtrip identity:
632i.e. decoding followed by encoding in the same coding system can 632decoding text then encoding the result in the same coding system can
633result in the different byte sequence. But there are several coding 633produce a different byte sequence from the one you originally decoded.
634systems that go guarantee that the result will be the same as what you 634However, the following coding systems do guarantee that the result
635originally decoded. They are: 635will be the same as what you originally decoded:
636 636
637@quotation 637@quotation
638chinese-big5 chinese-iso-8bit cyrillic-iso-8bit emacs-mule 638chinese-big5 chinese-iso-8bit cyrillic-iso-8bit emacs-mule
@@ -641,14 +641,13 @@ iso-latin-4 iso-latin-5 iso-latin-8 iso-latin-9 iso-safe
641japanese-iso-8bit japanese-shift-jis korean-iso-8bit raw-text 641japanese-iso-8bit japanese-shift-jis korean-iso-8bit raw-text
642@end quotation 642@end quotation
643 643
644Likewise, a coding systme doesn't guarantee the other way of roundtrip 644Encoding buffer text and then decoding the result can also fail to
645identity, i.e. encoding buffer text into a coding system followed by 645reproduce the original text. For instance, when you encode Latin-2
646decoding again with the same coding system will produce the different 646characters with @code{utf-8} and decode the result using the same
647buffer text. For instance, when you encode Latin-2 characters by 647coding system, you'll get Unicode characters (of charset
648@code{utf-8} and decode it back by the same coding system, you'll get 648@code{mule-unicode-0100-24ff}). When you encode Unicode characters
649Unicode charactes (of charset @code{mule-unicode-0100-24ff}), and when 649with @code{iso-latin-2} and decode them back with the same coding
650you encode Unicode characters by @code{iso-latin-2} and decode it back 650system, you'll get Latin-2 characters.
651by the same coding system, you'll get Latin-2 characters.
652 651
653@cindex end of line conversion 652@cindex end of line conversion
654 @dfn{End of line conversion} handles three different conventions used 653 @dfn{End of line conversion} handles three different conventions used