Fix nxml-get-inside (Bug#32003)

The change from 2016-01-16 "lisp/nxml: Use syntax-tables for comments" made nxml-get-inside return non-nil for any string or comment, including attribute strings. This caused incorrect and therefore indentation. * lisp/nxml/nxml-rap.el: Update commentary to reflect changes to nxml-mode parsing. (nxml-get-inside): Only return non-nil when inside comments and generic strings, not normal quote-delimited strings. * test/lisp/nxml/nxml-mode-tests.el: New tests.
author: Noam Postavsky 2019-04-18 23:36:04 -0400
committer: Noam Postavsky 2019-05-09 06:42:40 -0400
commit: ca14dd1d4628094dd33d5d94694dcf5f29e843b8 (patch)
tree: 3e875dde24b32704e647ba9853f52d341861a4a6 /lisp
parent: e7ab351caa884755c032fd9544ba67a3c953144f (diff)
download: emacs-ca14dd1d4628094dd33d5d94694dcf5f29e843b8.tar.gz
emacs-ca14dd1d4628094dd33d5d94694dcf5f29e843b8.zip
1 files changed, 20 insertions, 22 deletions
diff --git a/lisp/nxml/nxml-rap.el b/lisp/nxml/nxml-rap.el
index 2bd758be3a5..21dbaded25a 100644
--- a/lisp/nxml/nxml-rap.el
+++ b/lisp/nxml/nxml-rap.el
@@ -35,35 +35,25 @@
 ;;
 ;; Our strategy is to keep track of just the problematic things.
 ;; Specifically, we keep track of all comments, CDATA sections and
-;; processing instructions in the instance.  We do this by marking all
+;; processing instructions in the instance.  We do this by marking
-;; except the first character of these with a non-nil nxml-inside text
+;; the first character of these with the generic string syntax by setting
-;; property. The value of the nxml-inside property is comment,
+;; a 'syntax-table' text property in `sgml-syntax-propertize'.
-;; cdata-section or processing-instruction.  The first character does
-;; not have the nxml-inside property so we can find the beginning of
-;; the construct by looking for a change in a text property value
-;; (Emacs provides primitives for this).  We use text properties
-;; rather than overlays, since the implementation of overlays doesn't
-;; look like it scales to large numbers of overlays in a buffer.
-;;
-;; We don't in fact track all these constructs, but only track them in
-;; some initial part of the instance.
 ;;
 ;; Thus to parse some random point in the file we first ensure that we
-;; have scanned up to that point.  Then we search backwards for a
+;; have scanned up to that point.  Then we search backwards for a <.
-;; <. Then we check whether the < has an nxml-inside property. If it
+;; Then we check whether the < has the generic string syntax.  If it
-;; does we go backwards to first character that does not have an
+;; does we go backwards to first character of the generic string (this
-;; nxml-inside property (this character must be a <).  Then we start
+;; character must be a <).  Then we start parsing forward from the <
-;; parsing forward from the < we have found.
+;; we have found.
 ;;
 ;; The prolog has to be parsed specially, so we also keep track of the
 ;; end of the prolog in `nxml-prolog-end'. The prolog is reparsed on
 ;; every change to the prolog.  This won't work well if people try to
 ;; edit huge internal subsets. Hopefully that will be rare.
 ;;
-;; We keep track of the changes by adding to the buffer's
+;; We rely on the `syntax-propertize-function' machinery to keep track
-;; after-change-functions hook.  Scanning is also done as a
+;; of the changes in the buffer.  Fontification also relies on correct
-;; prerequisite to fontification by adding to fontification-functions
+;; `syntax-table' properties.  This means that scanning for these
-;; (in the same way as jit-lock).  This means that scanning for these
 ;; constructs had better be quick.  Fortunately it is. Firstly, the
 ;; typical proportion of comments, CDATA sections and processing
 ;; instructions is small relative to other things.  Secondly, to scan
@@ -79,7 +69,15 @@
  "Integer giving position following end of the prolog.")
 (defsubst nxml-get-inside (pos)
-  (save-excursion (nth 8 (syntax-ppss pos))))
+  "Return non-nil if inside comment, CDATA, or PI."
+  (let ((ppss (save-excursion (syntax-ppss pos))))
+    (or
+     ;; Inside comment.
+     (nth 4 ppss)
+     ;; Inside "generic" string which is used for CDATA, and PI.
+     ;; "Normal" double and single quoted strings are used for
+     ;; attribute values.
+     (eq t (nth 3 ppss)))))
 (defun nxml-inside-end (pos)
  "Return the end of the inside region containing POS.
author	Noam Postavsky	2019-04-18 23:36:04 -0400
committer	Noam Postavsky	2019-05-09 06:42:40 -0400
commit	ca14dd1d4628094dd33d5d94694dcf5f29e843b8 (patch)
tree	3e875dde24b32704e647ba9853f52d341861a4a6 /lisp
parent	e7ab351caa884755c032fd9544ba67a3c953144f (diff)
download	emacs-ca14dd1d4628094dd33d5d94694dcf5f29e843b8.tar.gz emacs-ca14dd1d4628094dd33d5d94694dcf5f29e843b8.zip