(Coding System Basics): Clarify previous change.

author: Richard M. Stallman 2005-04-01 22:08:47 +0000
committer: Richard M. Stallman 2005-04-01 22:08:47 +0000
commit: 8b9182147e2e2430eb2a6189444e966c6e121f71 (patch)
tree: f46850a08fd5b31898564749b31ff2153ff8a81a
parent: 1ee49a88dd20cda9baaaa5247c46c5356e485737 (diff)
download: emacs-8b9182147e2e2430eb2a6189444e966c6e121f71.tar.gz
emacs-8b9182147e2e2430eb2a6189444e966c6e121f71.zip
2 files changed, 16 insertions, 13 deletions
diff --git a/lispref/ChangeLog b/lispref/ChangeLog
index 0d11d7c0e9e..8a34499507f 100644
--- a/lispref/ChangeLog
+++ b/lispref/ChangeLog
@@ -1,3 +1,7 @@
+2005-04-01  Richard M. Stallman  <rms@gnu.org>
+        * nonascii.texi (Coding System Basics): Clarify previous change.
 2005-04-01  Kenichi Handa  <handa@m17n.org>
        * nonascii.texi (Coding System Basics): Describe about rondtrip
diff --git a/lispref/nonascii.texi b/lispref/nonascii.texi
index 91a47ea50f9..4e38c300a61 100644
--- a/lispref/nonascii.texi
+++ b/lispref/nonascii.texi
@@ -628,11 +628,11 @@ characters; for example, there are three coding systems for the Cyrillic
 conversion, but some of them leave the choice unspecified---to be chosen
 heuristically for each file, based on the data.
-In general, a coding system doesn't guarantee a roundtrip identity,
+In general, a coding system doesn't guarantee roundtrip identity:
-i.e. decoding followed by encoding in the same coding system can
+decoding text then encoding the result in the same coding system can
-result in the different byte sequence.  But there are several coding
+produce a different byte sequence from the one you originally decoded.
-systems that go guarantee that the result will be the same as what you
+However, the following coding systems do guarantee that the result
-originally decoded.  They are:
+will be the same as what you originally decoded:
 @quotation
 chinese-big5 chinese-iso-8bit cyrillic-iso-8bit emacs-mule
@@ -641,14 +641,13 @@ iso-latin-4 iso-latin-5 iso-latin-8 iso-latin-9 iso-safe
 japanese-iso-8bit japanese-shift-jis korean-iso-8bit raw-text
 @end quotation
-Likewise, a coding systme doesn't guarantee the other way of roundtrip
+Encoding buffer text and then decoding the result can also fail to
-identity, i.e. encoding buffer text into a coding system followed by
+reproduce the original text.  For instance, when you encode Latin-2
-decoding again with the same coding system will produce the different
+characters with @code{utf-8} and decode the result using the same
-buffer text.  For instance, when you encode Latin-2 characters by
+coding system, you'll get Unicode characters (of charset
-@code{utf-8} and decode it back by the same coding system, you'll get
+@code{mule-unicode-0100-24ff}).  When you encode Unicode characters
-Unicode charactes (of charset @code{mule-unicode-0100-24ff}), and when
+with @code{iso-latin-2} and decode them back with the same coding
-you encode Unicode characters by @code{iso-latin-2} and decode it back
+system, you'll get Latin-2 characters.
-by the same coding system, you'll get Latin-2 characters.
 @cindex end of line conversion
  @dfn{End of line conversion} handles three different conventions used
author	Richard M. Stallman	2005-04-01 22:08:47 +0000
committer	Richard M. Stallman	2005-04-01 22:08:47 +0000
commit	8b9182147e2e2430eb2a6189444e966c6e121f71 (patch)
tree	f46850a08fd5b31898564749b31ff2153ff8a81a
parent	1ee49a88dd20cda9baaaa5247c46c5356e485737 (diff)
download	emacs-8b9182147e2e2430eb2a6189444e966c6e121f71.tar.gz emacs-8b9182147e2e2430eb2a6189444e966c6e121f71.zip

diff --git a/lispref/ChangeLog b/lispref/ChangeLog index 0d11d7c0e9e..8a34499507f 100644 --- a/lispref/ChangeLog +++ b/lispref/ChangeLog
@@ -1,3 +1,7 @@
		1	2005-04-01 Richard M. Stallman <rms@gnu.org>
		2
		3	* nonascii.texi (Coding System Basics): Clarify previous change.
		4
1	2005-04-01 Kenichi Handa <handa@m17n.org>	5	2005-04-01 Kenichi Handa <handa@m17n.org>
2		6
3	* nonascii.texi (Coding System Basics): Describe about rondtrip	7	* nonascii.texi (Coding System Basics): Describe about rondtrip


diff --git a/lispref/nonascii.texi b/lispref/nonascii.texi index 91a47ea50f9..4e38c300a61 100644 --- a/lispref/nonascii.texi +++ b/lispref/nonascii.texi
@@ -628,11 +628,11 @@ characters; for example, there are three coding systems for the Cyrillic
628	conversion, but some of them leave the choice unspecified---to be chosen	628	conversion, but some of them leave the choice unspecified---to be chosen
629	heuristically for each file, based on the data.	629	heuristically for each file, based on the data.
630		630
631	In general, a coding system doesn't guarantee a roundtrip identity,	631	In general, a coding system doesn't guarantee roundtrip identity:
632	i.e. decoding followed by encoding in the same coding system can	632	decoding text then encoding the result in the same coding system can
633	result in the different byte sequence. But there are several coding	633	produce a different byte sequence from the one you originally decoded.
634	systems that go guarantee that the result will be the same as what you	634	However, the following coding systems do guarantee that the result
635	originally decoded. They are:	635	will be the same as what you originally decoded:
636		636
637	@quotation	637	@quotation
638	chinese-big5 chinese-iso-8bit cyrillic-iso-8bit emacs-mule	638	chinese-big5 chinese-iso-8bit cyrillic-iso-8bit emacs-mule
@@ -641,14 +641,13 @@ iso-latin-4 iso-latin-5 iso-latin-8 iso-latin-9 iso-safe
641	japanese-iso-8bit japanese-shift-jis korean-iso-8bit raw-text	641	japanese-iso-8bit japanese-shift-jis korean-iso-8bit raw-text
642	@end quotation	642	@end quotation
643		643
644	Likewise, a coding systme doesn't guarantee the other way of roundtrip	644	Encoding buffer text and then decoding the result can also fail to
645	identity, i.e. encoding buffer text into a coding system followed by	645	reproduce the original text. For instance, when you encode Latin-2
646	decoding again with the same coding system will produce the different	646	characters with @code{utf-8} and decode the result using the same
647	buffer text. For instance, when you encode Latin-2 characters by	647	coding system, you'll get Unicode characters (of charset
648	@code{utf-8} and decode it back by the same coding system, you'll get	648	@code{mule-unicode-0100-24ff}). When you encode Unicode characters
649	Unicode charactes (of charset @code{mule-unicode-0100-24ff}), and when	649	with @code{iso-latin-2} and decode them back with the same coding
650	you encode Unicode characters by @code{iso-latin-2} and decode it back	650	system, you'll get Latin-2 characters.
651	by the same coding system, you'll get Latin-2 characters.
652		651
653	@cindex end of line conversion	652	@cindex end of line conversion
654	@dfn{End of line conversion} handles three different conventions used	653	@dfn{End of line conversion} handles three different conventions used