Some doc for cycle-spacing
[bpt/emacs.git] / doc / lispref / text.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990-1995, 1998-2014 Free Software Foundation, Inc.
4 @c See the file elisp.texi for copying conditions.
5 @node Text
6 @chapter Text
7 @cindex text
8
9 This chapter describes the functions that deal with the text in a
10 buffer. Most examine, insert, or delete text in the current buffer,
11 often operating at point or on text adjacent to point. Many are
12 interactive. All the functions that change the text provide for undoing
13 the changes (@pxref{Undo}).
14
15 Many text-related functions operate on a region of text defined by two
16 buffer positions passed in arguments named @var{start} and @var{end}.
17 These arguments should be either markers (@pxref{Markers}) or numeric
18 character positions (@pxref{Positions}). The order of these arguments
19 does not matter; it is all right for @var{start} to be the end of the
20 region and @var{end} the beginning. For example, @code{(delete-region 1
21 10)} and @code{(delete-region 10 1)} are equivalent. An
22 @code{args-out-of-range} error is signaled if either @var{start} or
23 @var{end} is outside the accessible portion of the buffer. In an
24 interactive call, point and the mark are used for these arguments.
25
26 @cindex buffer contents
27 Throughout this chapter, ``text'' refers to the characters in the
28 buffer, together with their properties (when relevant). Keep in mind
29 that point is always between two characters, and the cursor appears on
30 the character after point.
31
32 @menu
33 * Near Point:: Examining text in the vicinity of point.
34 * Buffer Contents:: Examining text in a general fashion.
35 * Comparing Text:: Comparing substrings of buffers.
36 * Insertion:: Adding new text to a buffer.
37 * Commands for Insertion:: User-level commands to insert text.
38 * Deletion:: Removing text from a buffer.
39 * User-Level Deletion:: User-level commands to delete text.
40 * The Kill Ring:: Where removed text sometimes is saved for later use.
41 * Undo:: Undoing changes to the text of a buffer.
42 * Maintaining Undo:: How to enable and disable undo information.
43 How to control how much information is kept.
44 * Filling:: Functions for explicit filling.
45 * Margins:: How to specify margins for filling commands.
46 * Adaptive Fill:: Adaptive Fill mode chooses a fill prefix from context.
47 * Auto Filling:: How auto-fill mode is implemented to break lines.
48 * Sorting:: Functions for sorting parts of the buffer.
49 * Columns:: Computing horizontal positions, and using them.
50 * Indentation:: Functions to insert or adjust indentation.
51 * Case Changes:: Case conversion of parts of the buffer.
52 * Text Properties:: Assigning Lisp property lists to text characters.
53 * Substitution:: Replacing a given character wherever it appears.
54 * Registers:: How registers are implemented. Accessing the text or
55 position stored in a register.
56 * Transposition:: Swapping two portions of a buffer.
57 * Base 64:: Conversion to or from base 64 encoding.
58 * Checksum/Hash:: Computing cryptographic hashes.
59 * Parsing HTML/XML:: Parsing HTML and XML.
60 * Atomic Changes:: Installing several buffer changes "atomically".
61 * Change Hooks:: Supplying functions to be run when text is changed.
62 @end menu
63
64 @node Near Point
65 @section Examining Text Near Point
66 @cindex text near point
67
68 Many functions are provided to look at the characters around point.
69 Several simple functions are described here. See also @code{looking-at}
70 in @ref{Regexp Search}.
71
72 In the following four functions, ``beginning'' or ``end'' of buffer
73 refers to the beginning or end of the accessible portion.
74
75 @defun char-after &optional position
76 This function returns the character in the current buffer at (i.e.,
77 immediately after) position @var{position}. If @var{position} is out of
78 range for this purpose, either before the beginning of the buffer, or at
79 or beyond the end, then the value is @code{nil}. The default for
80 @var{position} is point.
81
82 In the following example, assume that the first character in the
83 buffer is @samp{@@}:
84
85 @example
86 @group
87 (string (char-after 1))
88 @result{} "@@"
89 @end group
90 @end example
91 @end defun
92
93 @defun char-before &optional position
94 This function returns the character in the current buffer immediately
95 before position @var{position}. If @var{position} is out of range for
96 this purpose, either at or before the beginning of the buffer, or beyond
97 the end, then the value is @code{nil}. The default for
98 @var{position} is point.
99 @end defun
100
101 @defun following-char
102 This function returns the character following point in the current
103 buffer. This is similar to @code{(char-after (point))}. However, if
104 point is at the end of the buffer, then @code{following-char} returns 0.
105
106 Remember that point is always between characters, and the cursor
107 normally appears over the character following point. Therefore, the
108 character returned by @code{following-char} is the character the
109 cursor is over.
110
111 In this example, point is between the @samp{a} and the @samp{c}.
112
113 @example
114 @group
115 ---------- Buffer: foo ----------
116 Gentlemen may cry ``Pea@point{}ce! Peace!,''
117 but there is no peace.
118 ---------- Buffer: foo ----------
119 @end group
120
121 @group
122 (string (preceding-char))
123 @result{} "a"
124 (string (following-char))
125 @result{} "c"
126 @end group
127 @end example
128 @end defun
129
130 @defun preceding-char
131 This function returns the character preceding point in the current
132 buffer. See above, under @code{following-char}, for an example. If
133 point is at the beginning of the buffer, @code{preceding-char} returns
134 0.
135 @end defun
136
137 @defun bobp
138 This function returns @code{t} if point is at the beginning of the
139 buffer. If narrowing is in effect, this means the beginning of the
140 accessible portion of the text. See also @code{point-min} in
141 @ref{Point}.
142 @end defun
143
144 @defun eobp
145 This function returns @code{t} if point is at the end of the buffer.
146 If narrowing is in effect, this means the end of accessible portion of
147 the text. See also @code{point-max} in @xref{Point}.
148 @end defun
149
150 @defun bolp
151 This function returns @code{t} if point is at the beginning of a line.
152 @xref{Text Lines}. The beginning of the buffer (or of its accessible
153 portion) always counts as the beginning of a line.
154 @end defun
155
156 @defun eolp
157 This function returns @code{t} if point is at the end of a line. The
158 end of the buffer (or of its accessible portion) is always considered
159 the end of a line.
160 @end defun
161
162 @node Buffer Contents
163 @section Examining Buffer Contents
164
165 This section describes functions that allow a Lisp program to
166 convert any portion of the text in the buffer into a string.
167
168 @defun buffer-substring start end
169 This function returns a string containing a copy of the text of the
170 region defined by positions @var{start} and @var{end} in the current
171 buffer. If the arguments are not positions in the accessible portion
172 of the buffer, @code{buffer-substring} signals an
173 @code{args-out-of-range} error.
174
175 Here's an example which assumes Font-Lock mode is not enabled:
176
177 @example
178 @group
179 ---------- Buffer: foo ----------
180 This is the contents of buffer foo
181
182 ---------- Buffer: foo ----------
183 @end group
184
185 @group
186 (buffer-substring 1 10)
187 @result{} "This is t"
188 @end group
189 @group
190 (buffer-substring (point-max) 10)
191 @result{} "he contents of buffer foo\n"
192 @end group
193 @end example
194
195 If the text being copied has any text properties, these are copied into
196 the string along with the characters they belong to. @xref{Text
197 Properties}. However, overlays (@pxref{Overlays}) in the buffer and
198 their properties are ignored, not copied.
199
200 For example, if Font-Lock mode is enabled, you might get results like
201 these:
202
203 @example
204 @group
205 (buffer-substring 1 10)
206 @result{} #("This is t" 0 1 (fontified t) 1 9 (fontified t))
207 @end group
208 @end example
209 @end defun
210
211 @defun buffer-substring-no-properties start end
212 This is like @code{buffer-substring}, except that it does not copy text
213 properties, just the characters themselves. @xref{Text Properties}.
214 @end defun
215
216 @defun buffer-string
217 This function returns the contents of the entire accessible portion of
218 the current buffer, as a string.
219 @end defun
220
221 @defun filter-buffer-substring start end &optional delete
222 This function passes the buffer text between @var{start} and @var{end}
223 through the filter functions specified by the wrapper hook
224 @code{filter-buffer-substring-functions}, and returns the result. The
225 obsolete variable @code{buffer-substring-filters} is also consulted.
226 If both of these variables are @code{nil}, the value is the unaltered
227 text from the buffer, i.e., what @code{buffer-substring} would
228 return.
229
230 If @var{delete} is non-@code{nil}, this function deletes the text
231 between @var{start} and @var{end} after copying it, like
232 @code{delete-and-extract-region}.
233
234 Lisp code should use this function instead of @code{buffer-substring},
235 @code{buffer-substring-no-properties},
236 or @code{delete-and-extract-region} when copying into user-accessible
237 data structures such as the kill-ring, X clipboard, and registers.
238 Major and minor modes can add functions to
239 @code{filter-buffer-substring-functions} to alter such text as it is
240 copied out of the buffer.
241 @end defun
242
243 @c FIXME: `filter-buffer-substring-function' should be documented.
244 @defvar filter-buffer-substring-functions
245 This variable is a wrapper hook (@pxref{Running Hooks}), whose members
246 should be functions that accept four arguments: @var{fun},
247 @var{start}, @var{end}, and @var{delete}. @var{fun} is a function
248 that takes three arguments (@var{start}, @var{end}, and @var{delete}),
249 and returns a string. In both cases, the @var{start}, @var{end}, and
250 @var{delete} arguments are the same as those of
251 @code{filter-buffer-substring}.
252
253 The first hook function is passed a @var{fun} that is equivalent to
254 the default operation of @code{filter-buffer-substring}, i.e., it
255 returns the buffer-substring between @var{start} and @var{end}
256 (processed by any @code{buffer-substring-filters}) and optionally
257 deletes the original text from the buffer. In most cases, the hook
258 function will call @var{fun} once, and then do its own processing of
259 the result. The next hook function receives a @var{fun} equivalent to
260 this, and so on. The actual return value is the result of all the
261 hook functions acting in sequence.
262 @end defvar
263
264 @defvar buffer-substring-filters
265 This variable is obsoleted by
266 @code{filter-buffer-substring-functions}, but is still supported for
267 backward compatibility. Its value should should be a list of
268 functions which accept a single string argument and return another
269 string. @code{filter-buffer-substring} passes the buffer substring to
270 the first function in this list, and the return value of each function
271 is passed to the next function. The return value of the last function
272 is passed to @code{filter-buffer-substring-functions}.
273 @end defvar
274
275 @defun current-word &optional strict really-word
276 This function returns the symbol (or word) at or near point, as a
277 string. The return value includes no text properties.
278
279 If the optional argument @var{really-word} is non-@code{nil}, it finds a
280 word; otherwise, it finds a symbol (which includes both word
281 characters and symbol constituent characters).
282
283 If the optional argument @var{strict} is non-@code{nil}, then point
284 must be in or next to the symbol or word---if no symbol or word is
285 there, the function returns @code{nil}. Otherwise, a nearby symbol or
286 word on the same line is acceptable.
287 @end defun
288
289 @defun thing-at-point thing
290 Return the @var{thing} around or next to point, as a string.
291
292 The argument @var{thing} is a symbol which specifies a kind of syntactic
293 entity. Possibilities include @code{symbol}, @code{list}, @code{sexp},
294 @code{defun}, @code{filename}, @code{url}, @code{word}, @code{sentence},
295 @code{whitespace}, @code{line}, @code{page}, and others.
296
297 @example
298 ---------- Buffer: foo ----------
299 Gentlemen may cry ``Pea@point{}ce! Peace!,''
300 but there is no peace.
301 ---------- Buffer: foo ----------
302
303 (thing-at-point 'word)
304 @result{} "Peace"
305 (thing-at-point 'line)
306 @result{} "Gentlemen may cry ``Peace! Peace!,''\n"
307 (thing-at-point 'whitespace)
308 @result{} nil
309 @end example
310 @end defun
311
312 @node Comparing Text
313 @section Comparing Text
314 @cindex comparing buffer text
315
316 This function lets you compare portions of the text in a buffer, without
317 copying them into strings first.
318
319 @defun compare-buffer-substrings buffer1 start1 end1 buffer2 start2 end2
320 This function lets you compare two substrings of the same buffer or two
321 different buffers. The first three arguments specify one substring,
322 giving a buffer (or a buffer name) and two positions within the
323 buffer. The last three arguments specify the other substring in the
324 same way. You can use @code{nil} for @var{buffer1}, @var{buffer2}, or
325 both to stand for the current buffer.
326
327 The value is negative if the first substring is less, positive if the
328 first is greater, and zero if they are equal. The absolute value of
329 the result is one plus the index of the first differing characters
330 within the substrings.
331
332 This function ignores case when comparing characters
333 if @code{case-fold-search} is non-@code{nil}. It always ignores
334 text properties.
335
336 Suppose the current buffer contains the text @samp{foobarbar
337 haha!rara!}; then in this example the two substrings are @samp{rbar }
338 and @samp{rara!}. The value is 2 because the first substring is greater
339 at the second character.
340
341 @example
342 (compare-buffer-substrings nil 6 11 nil 16 21)
343 @result{} 2
344 @end example
345 @end defun
346
347 @node Insertion
348 @section Inserting Text
349 @cindex insertion of text
350 @cindex text insertion
351
352 @cindex insertion before point
353 @cindex before point, insertion
354 @dfn{Insertion} means adding new text to a buffer. The inserted text
355 goes at point---between the character before point and the character
356 after point. Some insertion functions leave point before the inserted
357 text, while other functions leave it after. We call the former
358 insertion @dfn{after point} and the latter insertion @dfn{before point}.
359
360 Insertion moves markers located at positions after the insertion
361 point, so that they stay with the surrounding text (@pxref{Markers}).
362 When a marker points at the place of insertion, insertion may or may
363 not relocate the marker, depending on the marker's insertion type
364 (@pxref{Marker Insertion Types}). Certain special functions such as
365 @code{insert-before-markers} relocate all such markers to point after
366 the inserted text, regardless of the markers' insertion type.
367
368 Insertion functions signal an error if the current buffer is
369 read-only (@pxref{Read Only Buffers}) or if they insert within
370 read-only text (@pxref{Special Properties}).
371
372 These functions copy text characters from strings and buffers along
373 with their properties. The inserted characters have exactly the same
374 properties as the characters they were copied from. By contrast,
375 characters specified as separate arguments, not part of a string or
376 buffer, inherit their text properties from the neighboring text.
377
378 The insertion functions convert text from unibyte to multibyte in
379 order to insert in a multibyte buffer, and vice versa---if the text
380 comes from a string or from a buffer. However, they do not convert
381 unibyte character codes 128 through 255 to multibyte characters, not
382 even if the current buffer is a multibyte buffer. @xref{Converting
383 Representations}.
384
385 @defun insert &rest args
386 This function inserts the strings and/or characters @var{args} into the
387 current buffer, at point, moving point forward. In other words, it
388 inserts the text before point. An error is signaled unless all
389 @var{args} are either strings or characters. The value is @code{nil}.
390 @end defun
391
392 @defun insert-before-markers &rest args
393 This function inserts the strings and/or characters @var{args} into the
394 current buffer, at point, moving point forward. An error is signaled
395 unless all @var{args} are either strings or characters. The value is
396 @code{nil}.
397
398 This function is unlike the other insertion functions in that it
399 relocates markers initially pointing at the insertion point, to point
400 after the inserted text. If an overlay begins at the insertion point,
401 the inserted text falls outside the overlay; if a nonempty overlay
402 ends at the insertion point, the inserted text falls inside that
403 overlay.
404 @end defun
405
406 @deffn Command insert-char character &optional count inherit
407 This command inserts @var{count} instances of @var{character} into the
408 current buffer before point. The argument @var{count} must be an
409 integer, and @var{character} must be a character.
410
411 If called interactively, this command prompts for @var{character}
412 using its Unicode name or its code point. @xref{Inserting Text,,,
413 emacs, The GNU Emacs Manual}.
414
415 This function does not convert unibyte character codes 128 through 255
416 to multibyte characters, not even if the current buffer is a multibyte
417 buffer. @xref{Converting Representations}.
418
419 If @var{inherit} is non-@code{nil}, the inserted characters inherit
420 sticky text properties from the two characters before and after the
421 insertion point. @xref{Sticky Properties}.
422 @end deffn
423
424 @defun insert-buffer-substring from-buffer-or-name &optional start end
425 This function inserts a portion of buffer @var{from-buffer-or-name}
426 into the current buffer before point. The text inserted is the region
427 between @var{start} (inclusive) and @var{end} (exclusive). (These
428 arguments default to the beginning and end of the accessible portion
429 of that buffer.) This function returns @code{nil}.
430
431 In this example, the form is executed with buffer @samp{bar} as the
432 current buffer. We assume that buffer @samp{bar} is initially empty.
433
434 @example
435 @group
436 ---------- Buffer: foo ----------
437 We hold these truths to be self-evident, that all
438 ---------- Buffer: foo ----------
439 @end group
440
441 @group
442 (insert-buffer-substring "foo" 1 20)
443 @result{} nil
444
445 ---------- Buffer: bar ----------
446 We hold these truth@point{}
447 ---------- Buffer: bar ----------
448 @end group
449 @end example
450 @end defun
451
452 @defun insert-buffer-substring-no-properties from-buffer-or-name &optional start end
453 This is like @code{insert-buffer-substring} except that it does not
454 copy any text properties.
455 @end defun
456
457 @xref{Sticky Properties}, for other insertion functions that inherit
458 text properties from the nearby text in addition to inserting it.
459 Whitespace inserted by indentation functions also inherits text
460 properties.
461
462 @node Commands for Insertion
463 @section User-Level Insertion Commands
464
465 This section describes higher-level commands for inserting text,
466 commands intended primarily for the user but useful also in Lisp
467 programs.
468
469 @deffn Command insert-buffer from-buffer-or-name
470 This command inserts the entire accessible contents of
471 @var{from-buffer-or-name} (which must exist) into the current buffer
472 after point. It leaves the mark after the inserted text. The value
473 is @code{nil}.
474 @end deffn
475
476 @deffn Command self-insert-command count
477 @cindex character insertion
478 @cindex self-insertion
479 This command inserts the last character typed; it does so @var{count}
480 times, before point, and returns @code{nil}. Most printing characters
481 are bound to this command. In routine use, @code{self-insert-command}
482 is the most frequently called function in Emacs, but programs rarely use
483 it except to install it on a keymap.
484
485 In an interactive call, @var{count} is the numeric prefix argument.
486
487 @c FIXME: This variable is obsolete since 23.1.
488 Self-insertion translates the input character through
489 @code{translation-table-for-input}. @xref{Translation of Characters}.
490
491 This command calls @code{auto-fill-function} whenever that is
492 non-@code{nil} and the character inserted is in the table
493 @code{auto-fill-chars} (@pxref{Auto Filling}).
494
495 @c Cross refs reworded to prevent overfull hbox. --rjc 15mar92
496 This command performs abbrev expansion if Abbrev mode is enabled and
497 the inserted character does not have word-constituent
498 syntax. (@xref{Abbrevs}, and @ref{Syntax Class Table}.) It is also
499 responsible for calling @code{blink-paren-function} when the inserted
500 character has close parenthesis syntax (@pxref{Blinking}).
501
502 @vindex post-self-insert-hook
503 The final thing this command does is to run the hook
504 @code{post-self-insert-hook}. You could use this to automatically
505 reindent text as it is typed, for example.
506
507 Do not try substituting your own definition of
508 @code{self-insert-command} for the standard one. The editor command
509 loop handles this function specially.
510 @end deffn
511
512 @deffn Command newline &optional number-of-newlines
513 This command inserts newlines into the current buffer before point.
514 If @var{number-of-newlines} is supplied, that many newline characters
515 are inserted.
516
517 @cindex newline and Auto Fill mode
518 This function calls @code{auto-fill-function} if the current column
519 number is greater than the value of @code{fill-column} and
520 @var{number-of-newlines} is @code{nil}. Typically what
521 @code{auto-fill-function} does is insert a newline; thus, the overall
522 result in this case is to insert two newlines at different places: one
523 at point, and another earlier in the line. @code{newline} does not
524 auto-fill if @var{number-of-newlines} is non-@code{nil}.
525
526 This command indents to the left margin if that is not zero.
527 @xref{Margins}.
528
529 The value returned is @code{nil}. In an interactive call, @var{count}
530 is the numeric prefix argument.
531 @end deffn
532
533 @defvar overwrite-mode
534 This variable controls whether overwrite mode is in effect. The value
535 should be @code{overwrite-mode-textual}, @code{overwrite-mode-binary},
536 or @code{nil}. @code{overwrite-mode-textual} specifies textual
537 overwrite mode (treats newlines and tabs specially), and
538 @code{overwrite-mode-binary} specifies binary overwrite mode (treats
539 newlines and tabs like any other characters).
540 @end defvar
541
542 @node Deletion
543 @section Deleting Text
544 @cindex text deletion
545
546 @cindex deleting text vs killing
547 Deletion means removing part of the text in a buffer, without saving
548 it in the kill ring (@pxref{The Kill Ring}). Deleted text can't be
549 yanked, but can be reinserted using the undo mechanism (@pxref{Undo}).
550 Some deletion functions do save text in the kill ring in some special
551 cases.
552
553 All of the deletion functions operate on the current buffer.
554
555 @deffn Command erase-buffer
556 This function deletes the entire text of the current buffer
557 (@emph{not} just the accessible portion), leaving it
558 empty. If the buffer is read-only, it signals a @code{buffer-read-only}
559 error; if some of the text in it is read-only, it signals a
560 @code{text-read-only} error. Otherwise, it deletes the text without
561 asking for any confirmation. It returns @code{nil}.
562
563 Normally, deleting a large amount of text from a buffer inhibits further
564 auto-saving of that buffer ``because it has shrunk''. However,
565 @code{erase-buffer} does not do this, the idea being that the future
566 text is not really related to the former text, and its size should not
567 be compared with that of the former text.
568 @end deffn
569
570 @deffn Command delete-region start end
571 This command deletes the text between positions @var{start} and
572 @var{end} in the current buffer, and returns @code{nil}. If point was
573 inside the deleted region, its value afterward is @var{start}.
574 Otherwise, point relocates with the surrounding text, as markers do.
575 @end deffn
576
577 @defun delete-and-extract-region start end
578 This function deletes the text between positions @var{start} and
579 @var{end} in the current buffer, and returns a string containing the
580 text just deleted.
581
582 If point was inside the deleted region, its value afterward is
583 @var{start}. Otherwise, point relocates with the surrounding text, as
584 markers do.
585 @end defun
586
587 @deffn Command delete-char count &optional killp
588 This command deletes @var{count} characters directly after point, or
589 before point if @var{count} is negative. If @var{killp} is
590 non-@code{nil}, then it saves the deleted characters in the kill ring.
591
592 In an interactive call, @var{count} is the numeric prefix argument, and
593 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
594 argument is supplied, the text is saved in the kill ring. If no prefix
595 argument is supplied, then one character is deleted, but not saved in
596 the kill ring.
597
598 The value returned is always @code{nil}.
599 @end deffn
600
601 @deffn Command delete-backward-char count &optional killp
602 @cindex deleting previous char
603 This command deletes @var{count} characters directly before point, or
604 after point if @var{count} is negative. If @var{killp} is
605 non-@code{nil}, then it saves the deleted characters in the kill ring.
606
607 In an interactive call, @var{count} is the numeric prefix argument, and
608 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
609 argument is supplied, the text is saved in the kill ring. If no prefix
610 argument is supplied, then one character is deleted, but not saved in
611 the kill ring.
612
613 The value returned is always @code{nil}.
614 @end deffn
615
616 @deffn Command backward-delete-char-untabify count &optional killp
617 @cindex tab deletion
618 This command deletes @var{count} characters backward, changing tabs
619 into spaces. When the next character to be deleted is a tab, it is
620 first replaced with the proper number of spaces to preserve alignment
621 and then one of those spaces is deleted instead of the tab. If
622 @var{killp} is non-@code{nil}, then the command saves the deleted
623 characters in the kill ring.
624
625 Conversion of tabs to spaces happens only if @var{count} is positive.
626 If it is negative, exactly @minus{}@var{count} characters after point
627 are deleted.
628
629 In an interactive call, @var{count} is the numeric prefix argument, and
630 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
631 argument is supplied, the text is saved in the kill ring. If no prefix
632 argument is supplied, then one character is deleted, but not saved in
633 the kill ring.
634
635 The value returned is always @code{nil}.
636 @end deffn
637
638 @defopt backward-delete-char-untabify-method
639 This option specifies how @code{backward-delete-char-untabify} should
640 deal with whitespace. Possible values include @code{untabify}, the
641 default, meaning convert a tab to many spaces and delete one;
642 @code{hungry}, meaning delete all tabs and spaces before point with
643 one command; @code{all} meaning delete all tabs, spaces and newlines
644 before point, and @code{nil}, meaning do nothing special for
645 whitespace characters.
646 @end defopt
647
648 @node User-Level Deletion
649 @section User-Level Deletion Commands
650
651 This section describes higher-level commands for deleting text,
652 commands intended primarily for the user but useful also in Lisp
653 programs.
654
655 @deffn Command delete-horizontal-space &optional backward-only
656 @cindex deleting whitespace
657 This function deletes all spaces and tabs around point. It returns
658 @code{nil}.
659
660 If @var{backward-only} is non-@code{nil}, the function deletes
661 spaces and tabs before point, but not after point.
662
663 In the following examples, we call @code{delete-horizontal-space} four
664 times, once on each line, with point between the second and third
665 characters on the line each time.
666
667 @example
668 @group
669 ---------- Buffer: foo ----------
670 I @point{}thought
671 I @point{} thought
672 We@point{} thought
673 Yo@point{}u thought
674 ---------- Buffer: foo ----------
675 @end group
676
677 @group
678 (delete-horizontal-space) ; @r{Four times.}
679 @result{} nil
680
681 ---------- Buffer: foo ----------
682 Ithought
683 Ithought
684 Wethought
685 You thought
686 ---------- Buffer: foo ----------
687 @end group
688 @end example
689 @end deffn
690
691 @deffn Command delete-indentation &optional join-following-p
692 This function joins the line point is on to the previous line, deleting
693 any whitespace at the join and in some cases replacing it with one
694 space. If @var{join-following-p} is non-@code{nil},
695 @code{delete-indentation} joins this line to the following line
696 instead. The function returns @code{nil}.
697
698 If there is a fill prefix, and the second of the lines being joined
699 starts with the prefix, then @code{delete-indentation} deletes the
700 fill prefix before joining the lines. @xref{Margins}.
701
702 In the example below, point is located on the line starting
703 @samp{events}, and it makes no difference if there are trailing spaces
704 in the preceding line.
705
706 @smallexample
707 @group
708 ---------- Buffer: foo ----------
709 When in the course of human
710 @point{} events, it becomes necessary
711 ---------- Buffer: foo ----------
712 @end group
713
714 (delete-indentation)
715 @result{} nil
716
717 @group
718 ---------- Buffer: foo ----------
719 When in the course of human@point{} events, it becomes necessary
720 ---------- Buffer: foo ----------
721 @end group
722 @end smallexample
723
724 After the lines are joined, the function @code{fixup-whitespace} is
725 responsible for deciding whether to leave a space at the junction.
726 @end deffn
727
728 @deffn Command fixup-whitespace
729 This function replaces all the horizontal whitespace surrounding point
730 with either one space or no space, according to the context. It
731 returns @code{nil}.
732
733 At the beginning or end of a line, the appropriate amount of space is
734 none. Before a character with close parenthesis syntax, or after a
735 character with open parenthesis or expression-prefix syntax, no space is
736 also appropriate. Otherwise, one space is appropriate. @xref{Syntax
737 Class Table}.
738
739 In the example below, @code{fixup-whitespace} is called the first time
740 with point before the word @samp{spaces} in the first line. For the
741 second invocation, point is directly after the @samp{(}.
742
743 @smallexample
744 @group
745 ---------- Buffer: foo ----------
746 This has too many @point{}spaces
747 This has too many spaces at the start of (@point{} this list)
748 ---------- Buffer: foo ----------
749 @end group
750
751 @group
752 (fixup-whitespace)
753 @result{} nil
754 (fixup-whitespace)
755 @result{} nil
756 @end group
757
758 @group
759 ---------- Buffer: foo ----------
760 This has too many spaces
761 This has too many spaces at the start of (this list)
762 ---------- Buffer: foo ----------
763 @end group
764 @end smallexample
765 @end deffn
766
767 @deffn Command just-one-space &optional n
768 @comment !!SourceFile simple.el
769 This command replaces any spaces and tabs around point with a single
770 space, or @var{n} spaces if @var{n} is specified. It returns
771 @code{nil}.
772 @end deffn
773
774 @c There is also cycle-spacing, but I cannot see it being useful in
775 @c Lisp programs, so it is not mentioned here.
776
777 @deffn Command delete-blank-lines
778 This function deletes blank lines surrounding point. If point is on a
779 blank line with one or more blank lines before or after it, then all but
780 one of them are deleted. If point is on an isolated blank line, then it
781 is deleted. If point is on a nonblank line, the command deletes all
782 blank lines immediately following it.
783
784 A blank line is defined as a line containing only tabs and spaces.
785 @c and the Newline character?
786
787 @code{delete-blank-lines} returns @code{nil}.
788 @end deffn
789
790 @node The Kill Ring
791 @section The Kill Ring
792 @cindex kill ring
793
794 @dfn{Kill functions} delete text like the deletion functions, but save
795 it so that the user can reinsert it by @dfn{yanking}. Most of these
796 functions have @samp{kill-} in their name. By contrast, the functions
797 whose names start with @samp{delete-} normally do not save text for
798 yanking (though they can still be undone); these are ``deletion''
799 functions.
800
801 Most of the kill commands are primarily for interactive use, and are
802 not described here. What we do describe are the functions provided for
803 use in writing such commands. You can use these functions to write
804 commands for killing text. When you need to delete text for internal
805 purposes within a Lisp function, you should normally use deletion
806 functions, so as not to disturb the kill ring contents.
807 @xref{Deletion}.
808
809 Killed text is saved for later yanking in the @dfn{kill ring}. This
810 is a list that holds a number of recent kills, not just the last text
811 kill. We call this a ``ring'' because yanking treats it as having
812 elements in a cyclic order. The list is kept in the variable
813 @code{kill-ring}, and can be operated on with the usual functions for
814 lists; there are also specialized functions, described in this section,
815 that treat it as a ring.
816
817 Some people think this use of the word ``kill'' is unfortunate, since
818 it refers to operations that specifically @emph{do not} destroy the
819 entities ``killed''. This is in sharp contrast to ordinary life, in
820 which death is permanent and ``killed'' entities do not come back to
821 life. Therefore, other metaphors have been proposed. For example, the
822 term ``cut ring'' makes sense to people who, in pre-computer days, used
823 scissors and paste to cut up and rearrange manuscripts. However, it
824 would be difficult to change the terminology now.
825
826 @menu
827 * Kill Ring Concepts:: What text looks like in the kill ring.
828 * Kill Functions:: Functions that kill text.
829 * Yanking:: How yanking is done.
830 * Yank Commands:: Commands that access the kill ring.
831 * Low-Level Kill Ring:: Functions and variables for kill ring access.
832 * Internals of Kill Ring:: Variables that hold kill ring data.
833 @end menu
834
835 @node Kill Ring Concepts
836 @subsection Kill Ring Concepts
837
838 The kill ring records killed text as strings in a list, most recent
839 first. A short kill ring, for example, might look like this:
840
841 @example
842 ("some text" "a different piece of text" "even older text")
843 @end example
844
845 @noindent
846 When the list reaches @code{kill-ring-max} entries in length, adding a
847 new entry automatically deletes the last entry.
848
849 When kill commands are interwoven with other commands, each kill
850 command makes a new entry in the kill ring. Multiple kill commands in
851 succession build up a single kill ring entry, which would be yanked as a
852 unit; the second and subsequent consecutive kill commands add text to
853 the entry made by the first one.
854
855 For yanking, one entry in the kill ring is designated the ``front'' of
856 the ring. Some yank commands ``rotate'' the ring by designating a
857 different element as the ``front''. But this virtual rotation doesn't
858 change the list itself---the most recent entry always comes first in the
859 list.
860
861 @node Kill Functions
862 @subsection Functions for Killing
863
864 @code{kill-region} is the usual subroutine for killing text. Any
865 command that calls this function is a ``kill command'' (and should
866 probably have @samp{kill} in its name). @code{kill-region} puts the
867 newly killed text in a new element at the beginning of the kill ring or
868 adds it to the most recent element. It determines automatically (using
869 @code{last-command}) whether the previous command was a kill command,
870 and if so appends the killed text to the most recent entry.
871
872 @deffn Command kill-region start end
873 This function kills the text in the region defined by @var{start} and
874 @var{end}. The text is deleted but saved in the kill ring, along with
875 its text properties. The value is always @code{nil}.
876
877 In an interactive call, @var{start} and @var{end} are point and
878 the mark.
879
880 If the buffer or text is read-only, @code{kill-region} modifies the kill
881 ring just the same, then signals an error without modifying the buffer.
882 This is convenient because it lets the user use a series of kill
883 commands to copy text from a read-only buffer into the kill ring.
884 @end deffn
885
886 @defopt kill-read-only-ok
887 If this option is non-@code{nil}, @code{kill-region} does not signal an
888 error if the buffer or text is read-only. Instead, it simply returns,
889 updating the kill ring but not changing the buffer.
890 @end defopt
891
892 @deffn Command copy-region-as-kill start end
893 This command saves the region defined by @var{start} and @var{end} on
894 the kill ring (including text properties), but does not delete the text
895 from the buffer. It returns @code{nil}.
896
897 The command does not set @code{this-command} to @code{kill-region}, so a
898 subsequent kill command does not append to the same kill ring entry.
899
900 @c FIXME Why is it better? Why isn't copy-region-as-kill obsolete then?
901 @c Why is it used in many places in Emacs?
902 In Lisp programs, it is better to use @code{kill-new} or
903 @code{kill-append} instead of this command. @xref{Low-Level Kill Ring}.
904 @end deffn
905
906 @node Yanking
907 @subsection Yanking
908
909 Yanking means inserting text from the kill ring, but it does not
910 insert the text blindly. The @code{yank} command, and related
911 commands, use @code{insert-for-yank} to perform special processing on
912 the text before it is inserted.
913
914 @defun insert-for-yank string
915 This function works like @code{insert}, except that it processes the
916 text in @var{string} according to the @code{yank-handler} text
917 property, as well as the variables @code{yank-handled-properties} and
918 @code{yank-excluded-properties} (see below), before inserting the
919 result into the current buffer.
920 @end defun
921
922 @defun insert-buffer-substring-as-yank buf &optional start end
923 This function resembles @code{insert-buffer-substring}, except that it
924 processes the text according to @code{yank-handled-properties} and
925 @code{yank-excluded-properties}. (It does not handle the
926 @code{yank-handler} property, which does not normally occur in buffer
927 text anyway.)
928 @end defun
929
930 @c FIXME: Add an index for yank-handler.
931 If you put a @code{yank-handler} text property on all or part of a
932 string, that alters how @code{insert-for-yank} inserts the string. If
933 different parts of the string have different @code{yank-handler}
934 values (comparison being done with @code{eq}), each substring is
935 handled separately. The property value must be a list of one to four
936 elements, with the following format (where elements after the first
937 may be omitted):
938
939 @example
940 (@var{function} @var{param} @var{noexclude} @var{undo})
941 @end example
942
943 Here is what the elements do:
944
945 @table @var
946 @item function
947 When @var{function} is non-@code{nil}, it is called instead of
948 @code{insert} to insert the string, with one argument---the string to
949 insert.
950
951 @item param
952 If @var{param} is present and non-@code{nil}, it replaces @var{string}
953 (or the substring of @var{string} being processed) as the object
954 passed to @var{function} (or @code{insert}). For example, if
955 @var{function} is @code{yank-rectangle}, @var{param} should be a list
956 of strings to insert as a rectangle.
957
958 @item noexclude
959 If @var{noexclude} is present and non-@code{nil}, that disables the
960 normal action of @code{yank-handled-properties} and
961 @code{yank-excluded-properties} on the inserted string.
962
963 @item undo
964 If @var{undo} is present and non-@code{nil}, it is a function that will be
965 called by @code{yank-pop} to undo the insertion of the current object.
966 It is called with two arguments, the start and end of the current
967 region. @var{function} can set @code{yank-undo-function} to override
968 the @var{undo} value.
969 @end table
970
971 @cindex yanking and text properties
972 @defopt yank-handled-properties
973 This variable specifies special text property handling conditions for
974 yanked text. It takes effect after the text has been inserted (either
975 normally, or via the @code{yank-handler} property), and prior to
976 @code{yank-excluded-properties} taking effect.
977
978 The value should be an alist of elements @code{(@var{prop}
979 . @var{fun})}. Each alist element is handled in order. The inserted
980 text is scanned for stretches of text having text properties @code{eq}
981 to @var{prop}; for each such stretch, @var{fun} is called with three
982 arguments: the value of the property, and the start and end positions
983 of the text.
984 @end defopt
985
986 @defopt yank-excluded-properties
987 The value of this variable is the list of properties to remove from
988 inserted text. Its default value contains properties that might lead
989 to annoying results, such as causing the text to respond to the mouse
990 or specifying key bindings. It takes effect after
991 @code{yank-handled-properties}.
992 @end defopt
993
994
995 @node Yank Commands
996 @subsection Functions for Yanking
997
998 This section describes higher-level commands for yanking, which are
999 intended primarily for the user but useful also in Lisp programs.
1000 Both @code{yank} and @code{yank-pop} honor the
1001 @code{yank-excluded-properties} variable and @code{yank-handler} text
1002 property (@pxref{Yanking}).
1003
1004 @deffn Command yank &optional arg
1005 @cindex inserting killed text
1006 This command inserts before point the text at the front of the kill
1007 ring. It sets the mark at the beginning of that text, using
1008 @code{push-mark} (@pxref{The Mark}), and puts point at the end.
1009
1010 If @var{arg} is a non-@code{nil} list (which occurs interactively when
1011 the user types @kbd{C-u} with no digits), then @code{yank} inserts the
1012 text as described above, but puts point before the yanked text and
1013 sets the mark after it.
1014
1015 If @var{arg} is a number, then @code{yank} inserts the @var{arg}th
1016 most recently killed text---the @var{arg}th element of the kill ring
1017 list, counted cyclically from the front, which is considered the
1018 first element for this purpose.
1019
1020 @code{yank} does not alter the contents of the kill ring, unless it
1021 used text provided by another program, in which case it pushes that text
1022 onto the kill ring. However if @var{arg} is an integer different from
1023 one, it rotates the kill ring to place the yanked string at the front.
1024
1025 @code{yank} returns @code{nil}.
1026 @end deffn
1027
1028 @deffn Command yank-pop &optional arg
1029 This command replaces the just-yanked entry from the kill ring with a
1030 different entry from the kill ring.
1031
1032 This is allowed only immediately after a @code{yank} or another
1033 @code{yank-pop}. At such a time, the region contains text that was just
1034 inserted by yanking. @code{yank-pop} deletes that text and inserts in
1035 its place a different piece of killed text. It does not add the deleted
1036 text to the kill ring, since it is already in the kill ring somewhere.
1037 It does however rotate the kill ring to place the newly yanked string at
1038 the front.
1039
1040 If @var{arg} is @code{nil}, then the replacement text is the previous
1041 element of the kill ring. If @var{arg} is numeric, the replacement is
1042 the @var{arg}th previous kill. If @var{arg} is negative, a more recent
1043 kill is the replacement.
1044
1045 The sequence of kills in the kill ring wraps around, so that after the
1046 oldest one comes the newest one, and before the newest one goes the
1047 oldest.
1048
1049 The return value is always @code{nil}.
1050 @end deffn
1051
1052 @defvar yank-undo-function
1053 If this variable is non-@code{nil}, the function @code{yank-pop} uses
1054 its value instead of @code{delete-region} to delete the text
1055 inserted by the previous @code{yank} or
1056 @code{yank-pop} command. The value must be a function of two
1057 arguments, the start and end of the current region.
1058
1059 The function @code{insert-for-yank} automatically sets this variable
1060 according to the @var{undo} element of the @code{yank-handler}
1061 text property, if there is one.
1062 @end defvar
1063
1064 @node Low-Level Kill Ring
1065 @subsection Low-Level Kill Ring
1066
1067 These functions and variables provide access to the kill ring at a
1068 lower level, but are still convenient for use in Lisp programs,
1069 because they take care of interaction with window system selections
1070 (@pxref{Window System Selections}).
1071
1072 @defun current-kill n &optional do-not-move
1073 The function @code{current-kill} rotates the yanking pointer, which
1074 designates the ``front'' of the kill ring, by @var{n} places (from newer
1075 kills to older ones), and returns the text at that place in the ring.
1076
1077 If the optional second argument @var{do-not-move} is non-@code{nil},
1078 then @code{current-kill} doesn't alter the yanking pointer; it just
1079 returns the @var{n}th kill, counting from the current yanking pointer.
1080
1081 If @var{n} is zero, indicating a request for the latest kill,
1082 @code{current-kill} calls the value of
1083 @code{interprogram-paste-function} (documented below) before
1084 consulting the kill ring. If that value is a function and calling it
1085 returns a string or a list of several string, @code{current-kill}
1086 pushes the strings onto the kill ring and returns the first string.
1087 It also sets the yanking pointer to point to the kill-ring entry of
1088 the first string returned by @code{interprogram-paste-function},
1089 regardless of the value of @var{do-not-move}. Otherwise,
1090 @code{current-kill} does not treat a zero value for @var{n} specially:
1091 it returns the entry pointed at by the yanking pointer and does not
1092 move the yanking pointer.
1093 @end defun
1094
1095 @defun kill-new string &optional replace
1096 This function pushes the text @var{string} onto the kill ring and
1097 makes the yanking pointer point to it. It discards the oldest entry
1098 if appropriate. It also invokes the value of
1099 @code{interprogram-cut-function} (see below).
1100
1101 If @var{replace} is non-@code{nil}, then @code{kill-new} replaces the
1102 first element of the kill ring with @var{string}, rather than pushing
1103 @var{string} onto the kill ring.
1104 @end defun
1105
1106 @defun kill-append string before-p
1107 This function appends the text @var{string} to the first entry in the
1108 kill ring and makes the yanking pointer point to the combined entry.
1109 Normally @var{string} goes at the end of the entry, but if
1110 @var{before-p} is non-@code{nil}, it goes at the beginning. This
1111 function also invokes the value of @code{interprogram-cut-function}
1112 (see below).
1113 @end defun
1114
1115 @defvar interprogram-paste-function
1116 This variable provides a way of transferring killed text from other
1117 programs, when you are using a window system. Its value should be
1118 @code{nil} or a function of no arguments.
1119
1120 If the value is a function, @code{current-kill} calls it to get the
1121 ``most recent kill''. If the function returns a non-@code{nil} value,
1122 then that value is used as the ``most recent kill''. If it returns
1123 @code{nil}, then the front of the kill ring is used.
1124
1125 To facilitate support for window systems that support multiple
1126 selections, this function may also return a list of strings. In that
1127 case, the first string is used as the ``most recent kill'', and all
1128 the other strings are pushed onto the kill ring, for easy access by
1129 @code{yank-pop}.
1130
1131 The normal use of this function is to get the window system's
1132 clipboard as the most recent kill, even if the selection belongs to
1133 another application. @xref{Window System Selections}. However, if
1134 the clipboard contents come from the current Emacs session, this
1135 function should return @code{nil}.
1136 @end defvar
1137
1138 @defvar interprogram-cut-function
1139 This variable provides a way of communicating killed text to other
1140 programs, when you are using a window system. Its value should be
1141 @code{nil} or a function of one required argument.
1142
1143 If the value is a function, @code{kill-new} and @code{kill-append} call
1144 it with the new first element of the kill ring as the argument.
1145
1146 The normal use of this function is to put newly killed text in the
1147 window system's clipboard. @xref{Window System Selections}.
1148 @end defvar
1149
1150 @node Internals of Kill Ring
1151 @subsection Internals of the Kill Ring
1152
1153 The variable @code{kill-ring} holds the kill ring contents, in the
1154 form of a list of strings. The most recent kill is always at the front
1155 of the list.
1156
1157 The @code{kill-ring-yank-pointer} variable points to a link in the
1158 kill ring list, whose @sc{car} is the text to yank next. We say it
1159 identifies the ``front'' of the ring. Moving
1160 @code{kill-ring-yank-pointer} to a different link is called
1161 @dfn{rotating the kill ring}. We call the kill ring a ``ring'' because
1162 the functions that move the yank pointer wrap around from the end of the
1163 list to the beginning, or vice-versa. Rotation of the kill ring is
1164 virtual; it does not change the value of @code{kill-ring}.
1165
1166 Both @code{kill-ring} and @code{kill-ring-yank-pointer} are Lisp
1167 variables whose values are normally lists. The word ``pointer'' in the
1168 name of the @code{kill-ring-yank-pointer} indicates that the variable's
1169 purpose is to identify one element of the list for use by the next yank
1170 command.
1171
1172 The value of @code{kill-ring-yank-pointer} is always @code{eq} to one
1173 of the links in the kill ring list. The element it identifies is the
1174 @sc{car} of that link. Kill commands, which change the kill ring, also
1175 set this variable to the value of @code{kill-ring}. The effect is to
1176 rotate the ring so that the newly killed text is at the front.
1177
1178 Here is a diagram that shows the variable @code{kill-ring-yank-pointer}
1179 pointing to the second entry in the kill ring @code{("some text" "a
1180 different piece of text" "yet older text")}.
1181
1182 @example
1183 @group
1184 kill-ring ---- kill-ring-yank-pointer
1185 | |
1186 | v
1187 | --- --- --- --- --- ---
1188 --> | | |------> | | |--> | | |--> nil
1189 --- --- --- --- --- ---
1190 | | |
1191 | | |
1192 | | -->"yet older text"
1193 | |
1194 | --> "a different piece of text"
1195 |
1196 --> "some text"
1197 @end group
1198 @end example
1199
1200 @noindent
1201 This state of affairs might occur after @kbd{C-y} (@code{yank})
1202 immediately followed by @kbd{M-y} (@code{yank-pop}).
1203
1204 @defvar kill-ring
1205 This variable holds the list of killed text sequences, most recently
1206 killed first.
1207 @end defvar
1208
1209 @defvar kill-ring-yank-pointer
1210 This variable's value indicates which element of the kill ring is at the
1211 ``front'' of the ring for yanking. More precisely, the value is a tail
1212 of the value of @code{kill-ring}, and its @sc{car} is the kill string
1213 that @kbd{C-y} should yank.
1214 @end defvar
1215
1216 @defopt kill-ring-max
1217 The value of this variable is the maximum length to which the kill
1218 ring can grow, before elements are thrown away at the end. The default
1219 value for @code{kill-ring-max} is 60.
1220 @end defopt
1221
1222 @node Undo
1223 @section Undo
1224 @cindex redo
1225
1226 Most buffers have an @dfn{undo list}, which records all changes made
1227 to the buffer's text so that they can be undone. (The buffers that
1228 don't have one are usually special-purpose buffers for which Emacs
1229 assumes that undoing is not useful. In particular, any buffer whose
1230 name begins with a space has its undo recording off by default;
1231 see @ref{Buffer Names}.) All the primitives that modify the
1232 text in the buffer automatically add elements to the front of the undo
1233 list, which is in the variable @code{buffer-undo-list}.
1234
1235 @defvar buffer-undo-list
1236 This buffer-local variable's value is the undo list of the current
1237 buffer. A value of @code{t} disables the recording of undo information.
1238 @end defvar
1239
1240 Here are the kinds of elements an undo list can have:
1241
1242 @table @code
1243 @item @var{position}
1244 This kind of element records a previous value of point; undoing this
1245 element moves point to @var{position}. Ordinary cursor motion does not
1246 make any sort of undo record, but deletion operations use these entries
1247 to record where point was before the command.
1248
1249 @item (@var{beg} . @var{end})
1250 This kind of element indicates how to delete text that was inserted.
1251 Upon insertion, the text occupied the range @var{beg}--@var{end} in the
1252 buffer.
1253
1254 @item (@var{text} . @var{position})
1255 This kind of element indicates how to reinsert text that was deleted.
1256 The deleted text itself is the string @var{text}. The place to
1257 reinsert it is @code{(abs @var{position})}. If @var{position} is
1258 positive, point was at the beginning of the deleted text, otherwise it
1259 was at the end.
1260
1261 @item (t . @var{time-flag})
1262 This kind of element indicates that an unmodified buffer became
1263 modified. A @var{time-flag} of the form
1264 @code{(@var{sec-high} @var{sec-low} @var{microsec}
1265 @var{picosec})} represents the visited file's modification time as of
1266 when it was previously visited or saved, using the same format as
1267 @code{current-time}; see @ref{Time of Day}.
1268 A @var{time-flag} of 0 means the buffer does not correspond to any file;
1269 @minus{}1 means the visited file previously did not exist.
1270 @code{primitive-undo} uses these
1271 values to determine whether to mark the buffer as unmodified once again;
1272 it does so only if the file's status matches that of @var{time-flag}.
1273
1274 @item (nil @var{property} @var{value} @var{beg} . @var{end})
1275 This kind of element records a change in a text property.
1276 Here's how you might undo the change:
1277
1278 @example
1279 (put-text-property @var{beg} @var{end} @var{property} @var{value})
1280 @end example
1281
1282 @item (@var{marker} . @var{adjustment})
1283 This kind of element records the fact that the marker @var{marker} was
1284 relocated due to deletion of surrounding text, and that it moved
1285 @var{adjustment} character positions. Undoing this element moves
1286 @var{marker} @minus{} @var{adjustment} characters.
1287
1288 @item (apply @var{funname} . @var{args})
1289 This is an extensible undo item, which is undone by calling
1290 @var{funname} with arguments @var{args}.
1291
1292 @item (apply @var{delta} @var{beg} @var{end} @var{funname} . @var{args})
1293 This is an extensible undo item, which records a change limited to the
1294 range @var{beg} to @var{end}, which increased the size of the buffer
1295 by @var{delta} characters. It is undone by calling @var{funname} with
1296 arguments @var{args}.
1297
1298 This kind of element enables undo limited to a region to determine
1299 whether the element pertains to that region.
1300
1301 @item nil
1302 This element is a boundary. The elements between two boundaries are
1303 called a @dfn{change group}; normally, each change group corresponds to
1304 one keyboard command, and undo commands normally undo an entire group as
1305 a unit.
1306 @end table
1307
1308 @defun undo-boundary
1309 This function places a boundary element in the undo list. The undo
1310 command stops at such a boundary, and successive undo commands undo
1311 to earlier and earlier boundaries. This function returns @code{nil}.
1312
1313 The editor command loop automatically calls @code{undo-boundary} just
1314 before executing each key sequence, so that each undo normally undoes
1315 the effects of one command. As an exception, the command
1316 @code{self-insert-command}, which produces self-inserting input
1317 characters (@pxref{Commands for Insertion}), may remove the boundary
1318 inserted by the command loop: a boundary is accepted for the first
1319 such character, the next 19 consecutive self-inserting input
1320 characters do not have boundaries, and then the 20th does; and so on
1321 as long as the self-inserting characters continue. Hence, sequences
1322 of consecutive character insertions can be undone as a group.
1323
1324 All buffer modifications add a boundary whenever the previous undoable
1325 change was made in some other buffer. This is to ensure that
1326 each command makes a boundary in each buffer where it makes changes.
1327
1328 Calling this function explicitly is useful for splitting the effects of
1329 a command into more than one unit. For example, @code{query-replace}
1330 calls @code{undo-boundary} after each replacement, so that the user can
1331 undo individual replacements one by one.
1332 @end defun
1333
1334 @defvar undo-in-progress
1335 This variable is normally @code{nil}, but the undo commands bind it to
1336 @code{t}. This is so that various kinds of change hooks can tell when
1337 they're being called for the sake of undoing.
1338 @end defvar
1339
1340 @defun primitive-undo count list
1341 This is the basic function for undoing elements of an undo list.
1342 It undoes the first @var{count} elements of @var{list}, returning
1343 the rest of @var{list}.
1344
1345 @code{primitive-undo} adds elements to the buffer's undo list when it
1346 changes the buffer. Undo commands avoid confusion by saving the undo
1347 list value at the beginning of a sequence of undo operations. Then the
1348 undo operations use and update the saved value. The new elements added
1349 by undoing are not part of this saved value, so they don't interfere with
1350 continuing to undo.
1351
1352 This function does not bind @code{undo-in-progress}.
1353 @end defun
1354
1355 @node Maintaining Undo
1356 @section Maintaining Undo Lists
1357
1358 This section describes how to enable and disable undo information for
1359 a given buffer. It also explains how the undo list is truncated
1360 automatically so it doesn't get too big.
1361
1362 Recording of undo information in a newly created buffer is normally
1363 enabled to start with; but if the buffer name starts with a space, the
1364 undo recording is initially disabled. You can explicitly enable or
1365 disable undo recording with the following two functions, or by setting
1366 @code{buffer-undo-list} yourself.
1367
1368 @deffn Command buffer-enable-undo &optional buffer-or-name
1369 This command enables recording undo information for buffer
1370 @var{buffer-or-name}, so that subsequent changes can be undone. If no
1371 argument is supplied, then the current buffer is used. This function
1372 does nothing if undo recording is already enabled in the buffer. It
1373 returns @code{nil}.
1374
1375 In an interactive call, @var{buffer-or-name} is the current buffer.
1376 You cannot specify any other buffer.
1377 @end deffn
1378
1379 @deffn Command buffer-disable-undo &optional buffer-or-name
1380 @cindex disabling undo
1381 This function discards the undo list of @var{buffer-or-name}, and disables
1382 further recording of undo information. As a result, it is no longer
1383 possible to undo either previous changes or any subsequent changes. If
1384 the undo list of @var{buffer-or-name} is already disabled, this function
1385 has no effect.
1386
1387 In an interactive call, BUFFER-OR-NAME is the current buffer. You
1388 cannot specify any other buffer. This function returns @code{nil}.
1389 @end deffn
1390
1391 As editing continues, undo lists get longer and longer. To prevent
1392 them from using up all available memory space, garbage collection trims
1393 them back to size limits you can set. (For this purpose, the ``size''
1394 of an undo list measures the cons cells that make up the list, plus the
1395 strings of deleted text.) Three variables control the range of acceptable
1396 sizes: @code{undo-limit}, @code{undo-strong-limit} and
1397 @code{undo-outer-limit}. In these variables, size is counted as the
1398 number of bytes occupied, which includes both saved text and other
1399 data.
1400
1401 @defopt undo-limit
1402 This is the soft limit for the acceptable size of an undo list. The
1403 change group at which this size is exceeded is the last one kept.
1404 @end defopt
1405
1406 @defopt undo-strong-limit
1407 This is the upper limit for the acceptable size of an undo list. The
1408 change group at which this size is exceeded is discarded itself (along
1409 with all older change groups). There is one exception: the very latest
1410 change group is only discarded if it exceeds @code{undo-outer-limit}.
1411 @end defopt
1412
1413 @defopt undo-outer-limit
1414 If at garbage collection time the undo info for the current command
1415 exceeds this limit, Emacs discards the info and displays a warning.
1416 This is a last ditch limit to prevent memory overflow.
1417 @end defopt
1418
1419 @defopt undo-ask-before-discard
1420 If this variable is non-@code{nil}, when the undo info exceeds
1421 @code{undo-outer-limit}, Emacs asks in the echo area whether to
1422 discard the info. The default value is @code{nil}, which means to
1423 discard it automatically.
1424
1425 This option is mainly intended for debugging. Garbage collection is
1426 inhibited while the question is asked, which means that Emacs might
1427 leak memory if the user waits too long before answering the question.
1428 @end defopt
1429
1430 @node Filling
1431 @section Filling
1432 @cindex filling text
1433
1434 @dfn{Filling} means adjusting the lengths of lines (by moving the line
1435 breaks) so that they are nearly (but no greater than) a specified
1436 maximum width. Additionally, lines can be @dfn{justified}, which means
1437 inserting spaces to make the left and/or right margins line up
1438 precisely. The width is controlled by the variable @code{fill-column}.
1439 For ease of reading, lines should be no longer than 70 or so columns.
1440
1441 You can use Auto Fill mode (@pxref{Auto Filling}) to fill text
1442 automatically as you insert it, but changes to existing text may leave
1443 it improperly filled. Then you must fill the text explicitly.
1444
1445 Most of the commands in this section return values that are not
1446 meaningful. All the functions that do filling take note of the current
1447 left margin, current right margin, and current justification style
1448 (@pxref{Margins}). If the current justification style is
1449 @code{none}, the filling functions don't actually do anything.
1450
1451 Several of the filling functions have an argument @var{justify}.
1452 If it is non-@code{nil}, that requests some kind of justification. It
1453 can be @code{left}, @code{right}, @code{full}, or @code{center}, to
1454 request a specific style of justification. If it is @code{t}, that
1455 means to use the current justification style for this part of the text
1456 (see @code{current-justification}, below). Any other value is treated
1457 as @code{full}.
1458
1459 When you call the filling functions interactively, using a prefix
1460 argument implies the value @code{full} for @var{justify}.
1461
1462 @deffn Command fill-paragraph &optional justify region
1463 This command fills the paragraph at or after point. If
1464 @var{justify} is non-@code{nil}, each line is justified as well.
1465 It uses the ordinary paragraph motion commands to find paragraph
1466 boundaries. @xref{Paragraphs,,, emacs, The GNU Emacs Manual}.
1467
1468 When @var{region} is non-@code{nil}, then if Transient Mark mode is
1469 enabled and the mark is active, this command calls @code{fill-region}
1470 to fill all the paragraphs in the region, instead of filling only the
1471 current paragraph. When this command is called interactively,
1472 @var{region} is @code{t}.
1473 @end deffn
1474
1475 @deffn Command fill-region start end &optional justify nosqueeze to-eop
1476 This command fills each of the paragraphs in the region from @var{start}
1477 to @var{end}. It justifies as well if @var{justify} is
1478 non-@code{nil}.
1479
1480 If @var{nosqueeze} is non-@code{nil}, that means to leave whitespace
1481 other than line breaks untouched. If @var{to-eop} is non-@code{nil},
1482 that means to keep filling to the end of the paragraph---or the next hard
1483 newline, if @code{use-hard-newlines} is enabled (see below).
1484
1485 The variable @code{paragraph-separate} controls how to distinguish
1486 paragraphs. @xref{Standard Regexps}.
1487 @end deffn
1488
1489 @deffn Command fill-individual-paragraphs start end &optional justify citation-regexp
1490 This command fills each paragraph in the region according to its
1491 individual fill prefix. Thus, if the lines of a paragraph were indented
1492 with spaces, the filled paragraph will remain indented in the same
1493 fashion.
1494
1495 The first two arguments, @var{start} and @var{end}, are the beginning
1496 and end of the region to be filled. The third and fourth arguments,
1497 @var{justify} and @var{citation-regexp}, are optional. If
1498 @var{justify} is non-@code{nil}, the paragraphs are justified as
1499 well as filled. If @var{citation-regexp} is non-@code{nil}, it means the
1500 function is operating on a mail message and therefore should not fill
1501 the header lines. If @var{citation-regexp} is a string, it is used as
1502 a regular expression; if it matches the beginning of a line, that line
1503 is treated as a citation marker.
1504
1505 @c FIXME: "That mode" is confusing. It isn't a major/minor mode.
1506 Ordinarily, @code{fill-individual-paragraphs} regards each change in
1507 indentation as starting a new paragraph. If
1508 @code{fill-individual-varying-indent} is non-@code{nil}, then only
1509 separator lines separate paragraphs. That mode can handle indented
1510 paragraphs with additional indentation on the first line.
1511 @end deffn
1512
1513 @defopt fill-individual-varying-indent
1514 This variable alters the action of @code{fill-individual-paragraphs} as
1515 described above.
1516 @end defopt
1517
1518 @deffn Command fill-region-as-paragraph start end &optional justify nosqueeze squeeze-after
1519 This command considers a region of text as a single paragraph and fills
1520 it. If the region was made up of many paragraphs, the blank lines
1521 between paragraphs are removed. This function justifies as well as
1522 filling when @var{justify} is non-@code{nil}.
1523
1524 If @var{nosqueeze} is non-@code{nil}, that means to leave whitespace
1525 other than line breaks untouched. If @var{squeeze-after} is
1526 non-@code{nil}, it specifies a position in the region, and means don't
1527 canonicalize spaces before that position.
1528
1529 In Adaptive Fill mode, this command calls @code{fill-context-prefix} to
1530 choose a fill prefix by default. @xref{Adaptive Fill}.
1531 @end deffn
1532
1533 @deffn Command justify-current-line &optional how eop nosqueeze
1534 This command inserts spaces between the words of the current line so
1535 that the line ends exactly at @code{fill-column}. It returns
1536 @code{nil}.
1537
1538 The argument @var{how}, if non-@code{nil} specifies explicitly the style
1539 of justification. It can be @code{left}, @code{right}, @code{full},
1540 @code{center}, or @code{none}. If it is @code{t}, that means to do
1541 follow specified justification style (see @code{current-justification},
1542 below). @code{nil} means to do full justification.
1543
1544 If @var{eop} is non-@code{nil}, that means do only left-justification
1545 if @code{current-justification} specifies full justification. This is
1546 used for the last line of a paragraph; even if the paragraph as a
1547 whole is fully justified, the last line should not be.
1548
1549 If @var{nosqueeze} is non-@code{nil}, that means do not change interior
1550 whitespace.
1551 @end deffn
1552
1553 @defopt default-justification
1554 This variable's value specifies the style of justification to use for
1555 text that doesn't specify a style with a text property. The possible
1556 values are @code{left}, @code{right}, @code{full}, @code{center}, or
1557 @code{none}. The default value is @code{left}.
1558 @end defopt
1559
1560 @defun current-justification
1561 This function returns the proper justification style to use for filling
1562 the text around point.
1563
1564 This returns the value of the @code{justification} text property at
1565 point, or the variable @var{default-justification} if there is no such
1566 text property. However, it returns @code{nil} rather than @code{none}
1567 to mean ``don't justify''.
1568 @end defun
1569
1570 @defopt sentence-end-double-space
1571 @anchor{Definition of sentence-end-double-space}
1572 If this variable is non-@code{nil}, a period followed by just one space
1573 does not count as the end of a sentence, and the filling functions
1574 avoid breaking the line at such a place.
1575 @end defopt
1576
1577 @defopt sentence-end-without-period
1578 If this variable is non-@code{nil}, a sentence can end without a
1579 period. This is used for languages like Thai, where sentences end
1580 with a double space but without a period.
1581 @end defopt
1582
1583 @defopt sentence-end-without-space
1584 If this variable is non-@code{nil}, it should be a string of
1585 characters that can end a sentence without following spaces.
1586 @end defopt
1587
1588 @defvar fill-paragraph-function
1589 This variable provides a way to override the filling of paragraphs.
1590 If its value is non-@code{nil}, @code{fill-paragraph} calls this
1591 function to do the work. If the function returns a non-@code{nil}
1592 value, @code{fill-paragraph} assumes the job is done, and immediately
1593 returns that value.
1594
1595 The usual use of this feature is to fill comments in programming
1596 language modes. If the function needs to fill a paragraph in the usual
1597 way, it can do so as follows:
1598
1599 @example
1600 (let ((fill-paragraph-function nil))
1601 (fill-paragraph arg))
1602 @end example
1603 @end defvar
1604
1605 @defvar fill-forward-paragraph-function
1606 This variable provides a way to override how the filling functions,
1607 such as @code{fill-region} and @code{fill-paragraph}, move forward to
1608 the next paragraph. Its value should be a function, which is called
1609 with a single argument @var{n}, the number of paragraphs to move, and
1610 should return the difference between @var{n} and the number of
1611 paragraphs actually moved. The default value of this variable is
1612 @code{forward-paragraph}. @xref{Paragraphs,,, emacs, The GNU Emacs
1613 Manual}.
1614 @end defvar
1615
1616 @defvar use-hard-newlines
1617 If this variable is non-@code{nil}, the filling functions do not delete
1618 newlines that have the @code{hard} text property. These ``hard
1619 newlines'' act as paragraph separators. @xref{Hard and Soft
1620 Newlines,, Hard and Soft Newlines, emacs, The GNU Emacs Manual}.
1621 @end defvar
1622
1623 @node Margins
1624 @section Margins for Filling
1625 @cindex margins, filling
1626
1627 @defopt fill-prefix
1628 This buffer-local variable, if non-@code{nil}, specifies a string of
1629 text that appears at the beginning of normal text lines and should be
1630 disregarded when filling them. Any line that fails to start with the
1631 fill prefix is considered the start of a paragraph; so is any line
1632 that starts with the fill prefix followed by additional whitespace.
1633 Lines that start with the fill prefix but no additional whitespace are
1634 ordinary text lines that can be filled together. The resulting filled
1635 lines also start with the fill prefix.
1636
1637 The fill prefix follows the left margin whitespace, if any.
1638 @end defopt
1639
1640 @defopt fill-column
1641 This buffer-local variable specifies the maximum width of filled lines.
1642 Its value should be an integer, which is a number of columns. All the
1643 filling, justification, and centering commands are affected by this
1644 variable, including Auto Fill mode (@pxref{Auto Filling}).
1645
1646 As a practical matter, if you are writing text for other people to
1647 read, you should set @code{fill-column} to no more than 70. Otherwise
1648 the line will be too long for people to read comfortably, and this can
1649 make the text seem clumsy.
1650
1651 The default value for @code{fill-column} is 70.
1652 @end defopt
1653
1654 @deffn Command set-left-margin from to margin
1655 This sets the @code{left-margin} property on the text from @var{from} to
1656 @var{to} to the value @var{margin}. If Auto Fill mode is enabled, this
1657 command also refills the region to fit the new margin.
1658 @end deffn
1659
1660 @deffn Command set-right-margin from to margin
1661 This sets the @code{right-margin} property on the text from @var{from}
1662 to @var{to} to the value @var{margin}. If Auto Fill mode is enabled,
1663 this command also refills the region to fit the new margin.
1664 @end deffn
1665
1666 @defun current-left-margin
1667 This function returns the proper left margin value to use for filling
1668 the text around point. The value is the sum of the @code{left-margin}
1669 property of the character at the start of the current line (or zero if
1670 none), and the value of the variable @code{left-margin}.
1671 @end defun
1672
1673 @defun current-fill-column
1674 This function returns the proper fill column value to use for filling
1675 the text around point. The value is the value of the @code{fill-column}
1676 variable, minus the value of the @code{right-margin} property of the
1677 character after point.
1678 @end defun
1679
1680 @deffn Command move-to-left-margin &optional n force
1681 This function moves point to the left margin of the current line. The
1682 column moved to is determined by calling the function
1683 @code{current-left-margin}. If the argument @var{n} is non-@code{nil},
1684 @code{move-to-left-margin} moves forward @var{n}@minus{}1 lines first.
1685
1686 If @var{force} is non-@code{nil}, that says to fix the line's
1687 indentation if that doesn't match the left margin value.
1688 @end deffn
1689
1690 @defun delete-to-left-margin &optional from to
1691 This function removes left margin indentation from the text between
1692 @var{from} and @var{to}. The amount of indentation to delete is
1693 determined by calling @code{current-left-margin}. In no case does this
1694 function delete non-whitespace. If @var{from} and @var{to} are omitted,
1695 they default to the whole buffer.
1696 @end defun
1697
1698 @defun indent-to-left-margin
1699 This function adjusts the indentation at the beginning of the current
1700 line to the value specified by the variable @code{left-margin}. (That
1701 may involve either inserting or deleting whitespace.) This function
1702 is value of @code{indent-line-function} in Paragraph-Indent Text mode.
1703 @end defun
1704
1705 @defopt left-margin
1706 This variable specifies the base left margin column. In Fundamental
1707 mode, @kbd{C-j} indents to this column. This variable automatically
1708 becomes buffer-local when set in any fashion.
1709 @end defopt
1710
1711 @defopt fill-nobreak-predicate
1712 This variable gives major modes a way to specify not to break a line
1713 at certain places. Its value should be a list of functions. Whenever
1714 filling considers breaking the line at a certain place in the buffer,
1715 it calls each of these functions with no arguments and with point
1716 located at that place. If any of the functions returns
1717 non-@code{nil}, then the line won't be broken there.
1718 @end defopt
1719
1720 @node Adaptive Fill
1721 @section Adaptive Fill Mode
1722 @c @cindex Adaptive Fill mode "adaptive-fill-mode" is adjacent.
1723
1724 When @dfn{Adaptive Fill Mode} is enabled, Emacs determines the fill
1725 prefix automatically from the text in each paragraph being filled
1726 rather than using a predetermined value. During filling, this fill
1727 prefix gets inserted at the start of the second and subsequent lines
1728 of the paragraph as described in @ref{Filling}, and in @ref{Auto
1729 Filling}.
1730
1731 @defopt adaptive-fill-mode
1732 Adaptive Fill mode is enabled when this variable is non-@code{nil}.
1733 It is @code{t} by default.
1734 @end defopt
1735
1736 @defun fill-context-prefix from to
1737 This function implements the heart of Adaptive Fill mode; it chooses a
1738 fill prefix based on the text between @var{from} and @var{to},
1739 typically the start and end of a paragraph. It does this by looking
1740 at the first two lines of the paragraph, based on the variables
1741 described below.
1742 @c The optional argument first-line-regexp is not documented
1743 @c because it exists for internal purposes and might be eliminated
1744 @c in the future.
1745
1746 Usually, this function returns the fill prefix, a string. However,
1747 before doing this, the function makes a final check (not specially
1748 mentioned in the following) that a line starting with this prefix
1749 wouldn't look like the start of a paragraph. Should this happen, the
1750 function signals the anomaly by returning @code{nil} instead.
1751
1752 In detail, @code{fill-context-prefix} does this:
1753
1754 @enumerate
1755 @item
1756 It takes a candidate for the fill prefix from the first line---it
1757 tries first the function in @code{adaptive-fill-function} (if any),
1758 then the regular expression @code{adaptive-fill-regexp} (see below).
1759 The first non-@code{nil} result of these, or the empty string if
1760 they're both @code{nil}, becomes the first line's candidate.
1761 @item
1762 If the paragraph has as yet only one line, the function tests the
1763 validity of the prefix candidate just found. The function then
1764 returns the candidate if it's valid, or a string of spaces otherwise.
1765 (see the description of @code{adaptive-fill-first-line-regexp} below).
1766 @item
1767 When the paragraph already has two lines, the function next looks for
1768 a prefix candidate on the second line, in just the same way it did for
1769 the first line. If it doesn't find one, it returns @code{nil}.
1770 @item
1771 The function now compares the two candidate prefixes heuristically: if
1772 the non-whitespace characters in the line 2 candidate occur in the
1773 same order in the line 1 candidate, the function returns the line 2
1774 candidate. Otherwise, it returns the largest initial substring which
1775 is common to both candidates (which might be the empty string).
1776 @end enumerate
1777 @end defun
1778
1779 @defopt adaptive-fill-regexp
1780 Adaptive Fill mode matches this regular expression against the text
1781 starting after the left margin whitespace (if any) on a line; the
1782 characters it matches are that line's candidate for the fill prefix.
1783
1784 The default value matches whitespace with certain punctuation
1785 characters intermingled.
1786 @end defopt
1787
1788 @defopt adaptive-fill-first-line-regexp
1789 Used only in one-line paragraphs, this regular expression acts as an
1790 additional check of the validity of the one available candidate fill
1791 prefix: the candidate must match this regular expression, or match
1792 @code{comment-start-skip}. If it doesn't, @code{fill-context-prefix}
1793 replaces the candidate with a string of spaces ``of the same width''
1794 as it.
1795
1796 The default value of this variable is @w{@code{"\\`[ \t]*\\'"}}, which
1797 matches only a string of whitespace. The effect of this default is to
1798 force the fill prefixes found in one-line paragraphs always to be pure
1799 whitespace.
1800 @end defopt
1801
1802 @defopt adaptive-fill-function
1803 You can specify more complex ways of choosing a fill prefix
1804 automatically by setting this variable to a function. The function is
1805 called with point after the left margin (if any) of a line, and it
1806 must preserve point. It should return either ``that line's'' fill
1807 prefix or @code{nil}, meaning it has failed to determine a prefix.
1808 @end defopt
1809
1810 @node Auto Filling
1811 @section Auto Filling
1812 @cindex filling, automatic
1813 @cindex Auto Fill mode
1814
1815 @c FIXME: I don't think any of the variables below is a/an normal/abnormal hook.
1816 Auto Fill mode is a minor mode that fills lines automatically as text
1817 is inserted. This section describes the hook used by Auto Fill mode.
1818 For a description of functions that you can call explicitly to fill and
1819 justify existing text, see @ref{Filling}.
1820
1821 Auto Fill mode also enables the functions that change the margins and
1822 justification style to refill portions of the text. @xref{Margins}.
1823
1824 @defvar auto-fill-function
1825 The value of this buffer-local variable should be a function (of no
1826 arguments) to be called after self-inserting a character from the table
1827 @code{auto-fill-chars}. It may be @code{nil}, in which case nothing
1828 special is done in that case.
1829
1830 The value of @code{auto-fill-function} is @code{do-auto-fill} when
1831 Auto-Fill mode is enabled. That is a function whose sole purpose is to
1832 implement the usual strategy for breaking a line.
1833 @end defvar
1834
1835 @defvar normal-auto-fill-function
1836 This variable specifies the function to use for
1837 @code{auto-fill-function}, if and when Auto Fill is turned on. Major
1838 modes can set buffer-local values for this variable to alter how Auto
1839 Fill works.
1840 @end defvar
1841
1842 @defvar auto-fill-chars
1843 A char table of characters which invoke @code{auto-fill-function} when
1844 self-inserted---space and newline in most language environments. They
1845 have an entry @code{t} in the table.
1846 @end defvar
1847
1848 @node Sorting
1849 @section Sorting Text
1850 @cindex sorting text
1851
1852 The sorting functions described in this section all rearrange text in
1853 a buffer. This is in contrast to the function @code{sort}, which
1854 rearranges the order of the elements of a list (@pxref{Rearrangement}).
1855 The values returned by these functions are not meaningful.
1856
1857 @defun sort-subr reverse nextrecfun endrecfun &optional startkeyfun endkeyfun predicate
1858 This function is the general text-sorting routine that subdivides a
1859 buffer into records and then sorts them. Most of the commands in this
1860 section use this function.
1861
1862 To understand how @code{sort-subr} works, consider the whole accessible
1863 portion of the buffer as being divided into disjoint pieces called
1864 @dfn{sort records}. The records may or may not be contiguous, but they
1865 must not overlap. A portion of each sort record (perhaps all of it) is
1866 designated as the sort key. Sorting rearranges the records in order by
1867 their sort keys.
1868
1869 Usually, the records are rearranged in order of ascending sort key.
1870 If the first argument to the @code{sort-subr} function, @var{reverse},
1871 is non-@code{nil}, the sort records are rearranged in order of
1872 descending sort key.
1873
1874 The next four arguments to @code{sort-subr} are functions that are
1875 called to move point across a sort record. They are called many times
1876 from within @code{sort-subr}.
1877
1878 @enumerate
1879 @item
1880 @var{nextrecfun} is called with point at the end of a record. This
1881 function moves point to the start of the next record. The first record
1882 is assumed to start at the position of point when @code{sort-subr} is
1883 called. Therefore, you should usually move point to the beginning of
1884 the buffer before calling @code{sort-subr}.
1885
1886 This function can indicate there are no more sort records by leaving
1887 point at the end of the buffer.
1888
1889 @item
1890 @var{endrecfun} is called with point within a record. It moves point to
1891 the end of the record.
1892
1893 @item
1894 @var{startkeyfun} is called to move point from the start of a record to
1895 the start of the sort key. This argument is optional; if it is omitted,
1896 the whole record is the sort key. If supplied, the function should
1897 either return a non-@code{nil} value to be used as the sort key, or
1898 return @code{nil} to indicate that the sort key is in the buffer
1899 starting at point. In the latter case, @var{endkeyfun} is called to
1900 find the end of the sort key.
1901
1902 @item
1903 @var{endkeyfun} is called to move point from the start of the sort key
1904 to the end of the sort key. This argument is optional. If
1905 @var{startkeyfun} returns @code{nil} and this argument is omitted (or
1906 @code{nil}), then the sort key extends to the end of the record. There
1907 is no need for @var{endkeyfun} if @var{startkeyfun} returns a
1908 non-@code{nil} value.
1909 @end enumerate
1910
1911 The argument @var{predicate} is the function to use to compare keys.
1912 If keys are numbers, it defaults to @code{<}; otherwise it defaults to
1913 @code{string<}.
1914
1915 As an example of @code{sort-subr}, here is the complete function
1916 definition for @code{sort-lines}:
1917
1918 @example
1919 @group
1920 ;; @r{Note that the first two lines of doc string}
1921 ;; @r{are effectively one line when viewed by a user.}
1922 (defun sort-lines (reverse beg end)
1923 "Sort lines in region alphabetically;\
1924 argument means descending order.
1925 Called from a program, there are three arguments:
1926 @end group
1927 @group
1928 REVERSE (non-nil means reverse order),\
1929 BEG and END (region to sort).
1930 The variable `sort-fold-case' determines\
1931 whether alphabetic case affects
1932 the sort order."
1933 @end group
1934 @group
1935 (interactive "P\nr")
1936 (save-excursion
1937 (save-restriction
1938 (narrow-to-region beg end)
1939 (goto-char (point-min))
1940 (let ((inhibit-field-text-motion t))
1941 (sort-subr reverse 'forward-line 'end-of-line)))))
1942 @end group
1943 @end example
1944
1945 Here @code{forward-line} moves point to the start of the next record,
1946 and @code{end-of-line} moves point to the end of record. We do not pass
1947 the arguments @var{startkeyfun} and @var{endkeyfun}, because the entire
1948 record is used as the sort key.
1949
1950 The @code{sort-paragraphs} function is very much the same, except that
1951 its @code{sort-subr} call looks like this:
1952
1953 @example
1954 @group
1955 (sort-subr reverse
1956 (function
1957 (lambda ()
1958 (while (and (not (eobp))
1959 (looking-at paragraph-separate))
1960 (forward-line 1))))
1961 'forward-paragraph)
1962 @end group
1963 @end example
1964
1965 Markers pointing into any sort records are left with no useful
1966 position after @code{sort-subr} returns.
1967 @end defun
1968
1969 @defopt sort-fold-case
1970 If this variable is non-@code{nil}, @code{sort-subr} and the other
1971 buffer sorting functions ignore case when comparing strings.
1972 @end defopt
1973
1974 @deffn Command sort-regexp-fields reverse record-regexp key-regexp start end
1975 This command sorts the region between @var{start} and @var{end}
1976 alphabetically as specified by @var{record-regexp} and @var{key-regexp}.
1977 If @var{reverse} is a negative integer, then sorting is in reverse
1978 order.
1979
1980 Alphabetical sorting means that two sort keys are compared by
1981 comparing the first characters of each, the second characters of each,
1982 and so on. If a mismatch is found, it means that the sort keys are
1983 unequal; the sort key whose character is less at the point of first
1984 mismatch is the lesser sort key. The individual characters are compared
1985 according to their numerical character codes in the Emacs character set.
1986
1987 The value of the @var{record-regexp} argument specifies how to divide
1988 the buffer into sort records. At the end of each record, a search is
1989 done for this regular expression, and the text that matches it is taken
1990 as the next record. For example, the regular expression @samp{^.+$},
1991 which matches lines with at least one character besides a newline, would
1992 make each such line into a sort record. @xref{Regular Expressions}, for
1993 a description of the syntax and meaning of regular expressions.
1994
1995 The value of the @var{key-regexp} argument specifies what part of each
1996 record is the sort key. The @var{key-regexp} could match the whole
1997 record, or only a part. In the latter case, the rest of the record has
1998 no effect on the sorted order of records, but it is carried along when
1999 the record moves to its new position.
2000
2001 The @var{key-regexp} argument can refer to the text matched by a
2002 subexpression of @var{record-regexp}, or it can be a regular expression
2003 on its own.
2004
2005 If @var{key-regexp} is:
2006
2007 @table @asis
2008 @item @samp{\@var{digit}}
2009 then the text matched by the @var{digit}th @samp{\(...\)} parenthesis
2010 grouping in @var{record-regexp} is the sort key.
2011
2012 @item @samp{\&}
2013 then the whole record is the sort key.
2014
2015 @item a regular expression
2016 then @code{sort-regexp-fields} searches for a match for the regular
2017 expression within the record. If such a match is found, it is the sort
2018 key. If there is no match for @var{key-regexp} within a record then
2019 that record is ignored, which means its position in the buffer is not
2020 changed. (The other records may move around it.)
2021 @end table
2022
2023 For example, if you plan to sort all the lines in the region by the
2024 first word on each line starting with the letter @samp{f}, you should
2025 set @var{record-regexp} to @samp{^.*$} and set @var{key-regexp} to
2026 @samp{\<f\w*\>}. The resulting expression looks like this:
2027
2028 @example
2029 @group
2030 (sort-regexp-fields nil "^.*$" "\\<f\\w*\\>"
2031 (region-beginning)
2032 (region-end))
2033 @end group
2034 @end example
2035
2036 If you call @code{sort-regexp-fields} interactively, it prompts for
2037 @var{record-regexp} and @var{key-regexp} in the minibuffer.
2038 @end deffn
2039
2040 @deffn Command sort-lines reverse start end
2041 This command alphabetically sorts lines in the region between
2042 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2043 is in reverse order.
2044 @end deffn
2045
2046 @deffn Command sort-paragraphs reverse start end
2047 This command alphabetically sorts paragraphs in the region between
2048 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2049 is in reverse order.
2050 @end deffn
2051
2052 @deffn Command sort-pages reverse start end
2053 This command alphabetically sorts pages in the region between
2054 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2055 is in reverse order.
2056 @end deffn
2057
2058 @deffn Command sort-fields field start end
2059 This command sorts lines in the region between @var{start} and
2060 @var{end}, comparing them alphabetically by the @var{field}th field
2061 of each line. Fields are separated by whitespace and numbered starting
2062 from 1. If @var{field} is negative, sorting is by the
2063 @w{@minus{}@var{field}th} field from the end of the line. This command
2064 is useful for sorting tables.
2065 @end deffn
2066
2067 @deffn Command sort-numeric-fields field start end
2068 This command sorts lines in the region between @var{start} and
2069 @var{end}, comparing them numerically by the @var{field}th field of
2070 each line. Fields are separated by whitespace and numbered starting
2071 from 1. The specified field must contain a number in each line of the
2072 region. Numbers starting with 0 are treated as octal, and numbers
2073 starting with @samp{0x} are treated as hexadecimal.
2074
2075 If @var{field} is negative, sorting is by the
2076 @w{@minus{}@var{field}th} field from the end of the line. This
2077 command is useful for sorting tables.
2078 @end deffn
2079
2080 @defopt sort-numeric-base
2081 This variable specifies the default radix for
2082 @code{sort-numeric-fields} to parse numbers.
2083 @end defopt
2084
2085 @deffn Command sort-columns reverse &optional beg end
2086 This command sorts the lines in the region between @var{beg} and
2087 @var{end}, comparing them alphabetically by a certain range of
2088 columns. The column positions of @var{beg} and @var{end} bound the
2089 range of columns to sort on.
2090
2091 If @var{reverse} is non-@code{nil}, the sort is in reverse order.
2092
2093 One unusual thing about this command is that the entire line
2094 containing position @var{beg}, and the entire line containing position
2095 @var{end}, are included in the region sorted.
2096
2097 Note that @code{sort-columns} rejects text that contains tabs, because
2098 tabs could be split across the specified columns. Use @kbd{M-x
2099 untabify} to convert tabs to spaces before sorting.
2100
2101 When possible, this command actually works by calling the @code{sort}
2102 utility program.
2103 @end deffn
2104
2105 @node Columns
2106 @section Counting Columns
2107 @cindex columns
2108 @cindex counting columns
2109 @cindex horizontal position
2110
2111 The column functions convert between a character position (counting
2112 characters from the beginning of the buffer) and a column position
2113 (counting screen characters from the beginning of a line).
2114
2115 These functions count each character according to the number of
2116 columns it occupies on the screen. This means control characters count
2117 as occupying 2 or 4 columns, depending upon the value of
2118 @code{ctl-arrow}, and tabs count as occupying a number of columns that
2119 depends on the value of @code{tab-width} and on the column where the tab
2120 begins. @xref{Usual Display}.
2121
2122 Column number computations ignore the width of the window and the
2123 amount of horizontal scrolling. Consequently, a column value can be
2124 arbitrarily high. The first (or leftmost) column is numbered 0. They
2125 also ignore overlays and text properties, aside from invisibility.
2126
2127 @defun current-column
2128 This function returns the horizontal position of point, measured in
2129 columns, counting from 0 at the left margin. The column position is the
2130 sum of the widths of all the displayed representations of the characters
2131 between the start of the current line and point.
2132
2133 For an example of using @code{current-column}, see the description of
2134 @code{count-lines} in @ref{Text Lines}.
2135 @end defun
2136
2137 @deffn Command move-to-column column &optional force
2138 This function moves point to @var{column} in the current line. The
2139 calculation of @var{column} takes into account the widths of the
2140 displayed representations of the characters between the start of the
2141 line and point.
2142
2143 When called interactively, @var{column} is the value of prefix numeric
2144 argument. If @var{column} is not an integer, an error is signaled.
2145
2146 @c This behavior used to be documented until 2013/08.
2147 @ignore
2148 If column @var{column} is beyond the end of the line, point moves to
2149 the end of the line. If @var{column} is negative, point moves to the
2150 beginning of the line.
2151 @end ignore
2152
2153 If it is impossible to move to column @var{column} because that is in
2154 the middle of a multicolumn character such as a tab, point moves to the
2155 end of that character. However, if @var{force} is non-@code{nil}, and
2156 @var{column} is in the middle of a tab, then @code{move-to-column}
2157 converts the tab into spaces so that it can move precisely to column
2158 @var{column}. Other multicolumn characters can cause anomalies despite
2159 @var{force}, since there is no way to split them.
2160
2161 The argument @var{force} also has an effect if the line isn't long
2162 enough to reach column @var{column}; if it is @code{t}, that means to
2163 add whitespace at the end of the line to reach that column.
2164
2165 The return value is the column number actually moved to.
2166 @end deffn
2167
2168 @node Indentation
2169 @section Indentation
2170 @cindex indentation
2171
2172 The indentation functions are used to examine, move to, and change
2173 whitespace that is at the beginning of a line. Some of the functions
2174 can also change whitespace elsewhere on a line. Columns and indentation
2175 count from zero at the left margin.
2176
2177 @menu
2178 * Primitive Indent:: Functions used to count and insert indentation.
2179 * Mode-Specific Indent:: Customize indentation for different modes.
2180 * Region Indent:: Indent all the lines in a region.
2181 * Relative Indent:: Indent the current line based on previous lines.
2182 * Indent Tabs:: Adjustable, typewriter-like tab stops.
2183 * Motion by Indent:: Move to first non-blank character.
2184 @end menu
2185
2186 @node Primitive Indent
2187 @subsection Indentation Primitives
2188
2189 This section describes the primitive functions used to count and
2190 insert indentation. The functions in the following sections use these
2191 primitives. @xref{Width}, for related functions.
2192
2193 @defun current-indentation
2194 @comment !!Type Primitive Function
2195 @comment !!SourceFile indent.c
2196 This function returns the indentation of the current line, which is
2197 the horizontal position of the first nonblank character. If the
2198 contents are entirely blank, then this is the horizontal position of the
2199 end of the line.
2200 @end defun
2201
2202 @deffn Command indent-to column &optional minimum
2203 @comment !!Type Primitive Function
2204 @comment !!SourceFile indent.c
2205 This function indents from point with tabs and spaces until @var{column}
2206 is reached. If @var{minimum} is specified and non-@code{nil}, then at
2207 least that many spaces are inserted even if this requires going beyond
2208 @var{column}. Otherwise the function does nothing if point is already
2209 beyond @var{column}. The value is the column at which the inserted
2210 indentation ends.
2211
2212 The inserted whitespace characters inherit text properties from the
2213 surrounding text (usually, from the preceding text only). @xref{Sticky
2214 Properties}.
2215 @end deffn
2216
2217 @defopt indent-tabs-mode
2218 @comment !!SourceFile indent.c
2219 If this variable is non-@code{nil}, indentation functions can insert
2220 tabs as well as spaces. Otherwise, they insert only spaces. Setting
2221 this variable automatically makes it buffer-local in the current buffer.
2222 @end defopt
2223
2224 @node Mode-Specific Indent
2225 @subsection Indentation Controlled by Major Mode
2226
2227 An important function of each major mode is to customize the @key{TAB}
2228 key to indent properly for the language being edited. This section
2229 describes the mechanism of the @key{TAB} key and how to control it.
2230 The functions in this section return unpredictable values.
2231
2232 @deffn Command indent-for-tab-command &optional rigid
2233 This is the command bound to @key{TAB} in most editing modes. Its
2234 usual action is to indent the current line, but it can alternatively
2235 insert a tab character or indent a region.
2236
2237 Here is what it does:
2238
2239 @itemize
2240 @item
2241 First, it checks whether Transient Mark mode is enabled and the region
2242 is active. If so, it called @code{indent-region} to indent all the
2243 text in the region (@pxref{Region Indent}).
2244
2245 @item
2246 Otherwise, if the indentation function in @code{indent-line-function}
2247 is @code{indent-to-left-margin} (a trivial command that inserts a tab
2248 character), or if the variable @code{tab-always-indent} specifies that
2249 a tab character ought to be inserted (see below), then it inserts a
2250 tab character.
2251
2252 @item
2253 Otherwise, it indents the current line; this is done by calling the
2254 function in @code{indent-line-function}. If the line is already
2255 indented, and the value of @code{tab-always-indent} is @code{complete}
2256 (see below), it tries completing the text at point.
2257 @end itemize
2258
2259 If @var{rigid} is non-@code{nil} (interactively, with a prefix
2260 argument), then after this command indents a line or inserts a tab, it
2261 also rigidly indents the entire balanced expression which starts at
2262 the beginning of the current line, in order to reflect the new
2263 indentation. This argument is ignored if the command indents the
2264 region.
2265 @end deffn
2266
2267 @defvar indent-line-function
2268 This variable's value is the function to be used by
2269 @code{indent-for-tab-command}, and various other indentation commands,
2270 to indent the current line. It is usually assigned by the major mode;
2271 for instance, Lisp mode sets it to @code{lisp-indent-line}, C mode
2272 sets it to @code{c-indent-line}, and so on. The default value is
2273 @code{indent-relative}. @xref{Auto-Indentation}.
2274 @end defvar
2275
2276 @deffn Command indent-according-to-mode
2277 This command calls the function in @code{indent-line-function} to
2278 indent the current line in a way appropriate for the current major mode.
2279 @end deffn
2280
2281 @deffn Command newline-and-indent
2282 This function inserts a newline, then indents the new line (the one
2283 following the newline just inserted) according to the major mode. It
2284 does indentation by calling @code{indent-according-to-mode}.
2285 @end deffn
2286
2287 @deffn Command reindent-then-newline-and-indent
2288 This command reindents the current line, inserts a newline at point,
2289 and then indents the new line (the one following the newline just
2290 inserted). It does indentation on both lines by calling
2291 @code{indent-according-to-mode}.
2292 @end deffn
2293
2294 @defopt tab-always-indent
2295 This variable can be used to customize the behavior of the @key{TAB}
2296 (@code{indent-for-tab-command}) command. If the value is @code{t}
2297 (the default), the command normally just indents the current line. If
2298 the value is @code{nil}, the command indents the current line only if
2299 point is at the left margin or in the line's indentation; otherwise,
2300 it inserts a tab character. If the value is @code{complete}, the
2301 command first tries to indent the current line, and if the line was
2302 already indented, it calls @code{completion-at-point} to complete the
2303 text at point (@pxref{Completion in Buffers}).
2304 @end defopt
2305
2306 @node Region Indent
2307 @subsection Indenting an Entire Region
2308
2309 This section describes commands that indent all the lines in the
2310 region. They return unpredictable values.
2311
2312 @deffn Command indent-region start end &optional to-column
2313 This command indents each nonblank line starting between @var{start}
2314 (inclusive) and @var{end} (exclusive). If @var{to-column} is
2315 @code{nil}, @code{indent-region} indents each nonblank line by calling
2316 the current mode's indentation function, the value of
2317 @code{indent-line-function}.
2318
2319 If @var{to-column} is non-@code{nil}, it should be an integer
2320 specifying the number of columns of indentation; then this function
2321 gives each line exactly that much indentation, by either adding or
2322 deleting whitespace.
2323
2324 If there is a fill prefix, @code{indent-region} indents each line
2325 by making it start with the fill prefix.
2326 @end deffn
2327
2328 @defvar indent-region-function
2329 The value of this variable is a function that can be used by
2330 @code{indent-region} as a short cut. It should take two arguments, the
2331 start and end of the region. You should design the function so
2332 that it will produce the same results as indenting the lines of the
2333 region one by one, but presumably faster.
2334
2335 If the value is @code{nil}, there is no short cut, and
2336 @code{indent-region} actually works line by line.
2337
2338 A short-cut function is useful in modes such as C mode and Lisp mode,
2339 where the @code{indent-line-function} must scan from the beginning of
2340 the function definition: applying it to each line would be quadratic in
2341 time. The short cut can update the scan information as it moves through
2342 the lines indenting them; this takes linear time. In a mode where
2343 indenting a line individually is fast, there is no need for a short cut.
2344
2345 @code{indent-region} with a non-@code{nil} argument @var{to-column} has
2346 a different meaning and does not use this variable.
2347 @end defvar
2348
2349 @deffn Command indent-rigidly start end count
2350 This function indents all lines starting between @var{start}
2351 (inclusive) and @var{end} (exclusive) sideways by @var{count} columns.
2352 This ``preserves the shape'' of the affected region, moving it as a
2353 rigid unit.
2354
2355 This is useful not only for indenting regions of unindented text, but
2356 also for indenting regions of formatted code. For example, if
2357 @var{count} is 3, this command adds 3 columns of indentation to every
2358 line that begins in the specified region.
2359
2360 If called interactively with no prefix argument, this command invokes
2361 a transient mode for adjusting indentation rigidly. @xref{Indentation
2362 Commands,,, emacs, The GNU Emacs Manual}.
2363 @end deffn
2364
2365 @deffn Command indent-code-rigidly start end columns &optional nochange-regexp
2366 This is like @code{indent-rigidly}, except that it doesn't alter lines
2367 that start within strings or comments.
2368
2369 In addition, it doesn't alter a line if @var{nochange-regexp} matches at
2370 the beginning of the line (if @var{nochange-regexp} is non-@code{nil}).
2371 @end deffn
2372
2373 @node Relative Indent
2374 @subsection Indentation Relative to Previous Lines
2375
2376 This section describes two commands that indent the current line
2377 based on the contents of previous lines.
2378
2379 @deffn Command indent-relative &optional unindented-ok
2380 This command inserts whitespace at point, extending to the same
2381 column as the next @dfn{indent point} of the previous nonblank line. An
2382 indent point is a non-whitespace character following whitespace. The
2383 next indent point is the first one at a column greater than the current
2384 column of point. For example, if point is underneath and to the left of
2385 the first non-blank character of a line of text, it moves to that column
2386 by inserting whitespace.
2387
2388 If the previous nonblank line has no next indent point (i.e., none at a
2389 great enough column position), @code{indent-relative} either does
2390 nothing (if @var{unindented-ok} is non-@code{nil}) or calls
2391 @code{tab-to-tab-stop}. Thus, if point is underneath and to the right
2392 of the last column of a short line of text, this command ordinarily
2393 moves point to the next tab stop by inserting whitespace.
2394
2395 The return value of @code{indent-relative} is unpredictable.
2396
2397 In the following example, point is at the beginning of the second
2398 line:
2399
2400 @example
2401 @group
2402 This line is indented twelve spaces.
2403 @point{}The quick brown fox jumped.
2404 @end group
2405 @end example
2406
2407 @noindent
2408 Evaluation of the expression @code{(indent-relative nil)} produces the
2409 following:
2410
2411 @example
2412 @group
2413 This line is indented twelve spaces.
2414 @point{}The quick brown fox jumped.
2415 @end group
2416 @end example
2417
2418 In this next example, point is between the @samp{m} and @samp{p} of
2419 @samp{jumped}:
2420
2421 @example
2422 @group
2423 This line is indented twelve spaces.
2424 The quick brown fox jum@point{}ped.
2425 @end group
2426 @end example
2427
2428 @noindent
2429 Evaluation of the expression @code{(indent-relative nil)} produces the
2430 following:
2431
2432 @example
2433 @group
2434 This line is indented twelve spaces.
2435 The quick brown fox jum @point{}ped.
2436 @end group
2437 @end example
2438 @end deffn
2439
2440 @deffn Command indent-relative-maybe
2441 @comment !!SourceFile indent.el
2442 This command indents the current line like the previous nonblank line,
2443 by calling @code{indent-relative} with @code{t} as the
2444 @var{unindented-ok} argument. The return value is unpredictable.
2445
2446 If the previous nonblank line has no indent points beyond the current
2447 column, this command does nothing.
2448 @end deffn
2449
2450 @node Indent Tabs
2451 @subsection Adjustable ``Tab Stops''
2452 @cindex tabs stops for indentation
2453
2454 This section explains the mechanism for user-specified ``tab stops''
2455 and the mechanisms that use and set them. The name ``tab stops'' is
2456 used because the feature is similar to that of the tab stops on a
2457 typewriter. The feature works by inserting an appropriate number of
2458 spaces and tab characters to reach the next tab stop column; it does not
2459 affect the display of tab characters in the buffer (@pxref{Usual
2460 Display}). Note that the @key{TAB} character as input uses this tab
2461 stop feature only in a few major modes, such as Text mode.
2462 @xref{Tab Stops,,, emacs, The GNU Emacs Manual}.
2463
2464 @deffn Command tab-to-tab-stop
2465 This command inserts spaces or tabs before point, up to the next tab
2466 stop column defined by @code{tab-stop-list}.
2467 @end deffn
2468
2469 @defopt tab-stop-list
2470 This variable defines the tab stop columns used by @code{tab-to-tab-stop}.
2471 It should be either @code{nil}, or a list of increasing integers,
2472 which need not be evenly spaced. The list is implicitly
2473 extended to infinity through repetition of the interval between the
2474 last and penultimate elements (or @code{tab-width} if the list has
2475 fewer than two elements). A value of @code{nil} means a tab stop
2476 every @code{tab-width} columns.
2477
2478 Use @kbd{M-x edit-tab-stops} to edit the location of tab stops interactively.
2479 @end defopt
2480
2481 @node Motion by Indent
2482 @subsection Indentation-Based Motion Commands
2483
2484 These commands, primarily for interactive use, act based on the
2485 indentation in the text.
2486
2487 @deffn Command back-to-indentation
2488 @comment !!SourceFile simple.el
2489 This command moves point to the first non-whitespace character in the
2490 current line (which is the line in which point is located). It returns
2491 @code{nil}.
2492 @end deffn
2493
2494 @deffn Command backward-to-indentation &optional arg
2495 @comment !!SourceFile simple.el
2496 This command moves point backward @var{arg} lines and then to the
2497 first nonblank character on that line. It returns @code{nil}.
2498 If @var{arg} is omitted or @code{nil}, it defaults to 1.
2499 @end deffn
2500
2501 @deffn Command forward-to-indentation &optional arg
2502 @comment !!SourceFile simple.el
2503 This command moves point forward @var{arg} lines and then to the first
2504 nonblank character on that line. It returns @code{nil}.
2505 If @var{arg} is omitted or @code{nil}, it defaults to 1.
2506 @end deffn
2507
2508 @node Case Changes
2509 @section Case Changes
2510 @cindex case conversion in buffers
2511
2512 The case change commands described here work on text in the current
2513 buffer. @xref{Case Conversion}, for case conversion functions that work
2514 on strings and characters. @xref{Case Tables}, for how to customize
2515 which characters are upper or lower case and how to convert them.
2516
2517 @deffn Command capitalize-region start end
2518 This function capitalizes all words in the region defined by
2519 @var{start} and @var{end}. To capitalize means to convert each word's
2520 first character to upper case and convert the rest of each word to lower
2521 case. The function returns @code{nil}.
2522
2523 If one end of the region is in the middle of a word, the part of the
2524 word within the region is treated as an entire word.
2525
2526 When @code{capitalize-region} is called interactively, @var{start} and
2527 @var{end} are point and the mark, with the smallest first.
2528
2529 @example
2530 @group
2531 ---------- Buffer: foo ----------
2532 This is the contents of the 5th foo.
2533 ---------- Buffer: foo ----------
2534 @end group
2535
2536 @group
2537 (capitalize-region 1 37)
2538 @result{} nil
2539
2540 ---------- Buffer: foo ----------
2541 This Is The Contents Of The 5th Foo.
2542 ---------- Buffer: foo ----------
2543 @end group
2544 @end example
2545 @end deffn
2546
2547 @deffn Command downcase-region start end
2548 This function converts all of the letters in the region defined by
2549 @var{start} and @var{end} to lower case. The function returns
2550 @code{nil}.
2551
2552 When @code{downcase-region} is called interactively, @var{start} and
2553 @var{end} are point and the mark, with the smallest first.
2554 @end deffn
2555
2556 @deffn Command upcase-region start end
2557 This function converts all of the letters in the region defined by
2558 @var{start} and @var{end} to upper case. The function returns
2559 @code{nil}.
2560
2561 When @code{upcase-region} is called interactively, @var{start} and
2562 @var{end} are point and the mark, with the smallest first.
2563 @end deffn
2564
2565 @deffn Command capitalize-word count
2566 This function capitalizes @var{count} words after point, moving point
2567 over as it does. To capitalize means to convert each word's first
2568 character to upper case and convert the rest of each word to lower case.
2569 If @var{count} is negative, the function capitalizes the
2570 @minus{}@var{count} previous words but does not move point. The value
2571 is @code{nil}.
2572
2573 If point is in the middle of a word, the part of the word before point
2574 is ignored when moving forward. The rest is treated as an entire word.
2575
2576 When @code{capitalize-word} is called interactively, @var{count} is
2577 set to the numeric prefix argument.
2578 @end deffn
2579
2580 @deffn Command downcase-word count
2581 This function converts the @var{count} words after point to all lower
2582 case, moving point over as it does. If @var{count} is negative, it
2583 converts the @minus{}@var{count} previous words but does not move point.
2584 The value is @code{nil}.
2585
2586 When @code{downcase-word} is called interactively, @var{count} is set
2587 to the numeric prefix argument.
2588 @end deffn
2589
2590 @deffn Command upcase-word count
2591 This function converts the @var{count} words after point to all upper
2592 case, moving point over as it does. If @var{count} is negative, it
2593 converts the @minus{}@var{count} previous words but does not move point.
2594 The value is @code{nil}.
2595
2596 When @code{upcase-word} is called interactively, @var{count} is set to
2597 the numeric prefix argument.
2598 @end deffn
2599
2600 @node Text Properties
2601 @section Text Properties
2602 @cindex text properties
2603 @cindex attributes of text
2604 @cindex properties of text
2605
2606 Each character position in a buffer or a string can have a @dfn{text
2607 property list}, much like the property list of a symbol (@pxref{Property
2608 Lists}). The properties belong to a particular character at a
2609 particular place, such as, the letter @samp{T} at the beginning of this
2610 sentence or the first @samp{o} in @samp{foo}---if the same character
2611 occurs in two different places, the two occurrences in general have
2612 different properties.
2613
2614 Each property has a name and a value. Both of these can be any Lisp
2615 object, but the name is normally a symbol. Typically each property
2616 name symbol is used for a particular purpose; for instance, the text
2617 property @code{face} specifies the faces for displaying the character
2618 (@pxref{Special Properties}). The usual way to access the property
2619 list is to specify a name and ask what value corresponds to it.
2620
2621 If a character has a @code{category} property, we call it the
2622 @dfn{property category} of the character. It should be a symbol. The
2623 properties of the symbol serve as defaults for the properties of the
2624 character.
2625
2626 Copying text between strings and buffers preserves the properties
2627 along with the characters; this includes such diverse functions as
2628 @code{substring}, @code{insert}, and @code{buffer-substring}.
2629
2630 @menu
2631 * Examining Properties:: Looking at the properties of one character.
2632 * Changing Properties:: Setting the properties of a range of text.
2633 * Property Search:: Searching for where a property changes value.
2634 * Special Properties:: Particular properties with special meanings.
2635 * Format Properties:: Properties for representing formatting of text.
2636 * Sticky Properties:: How inserted text gets properties from
2637 neighboring text.
2638 * Lazy Properties:: Computing text properties in a lazy fashion
2639 only when text is examined.
2640 * Clickable Text:: Using text properties to make regions of text
2641 do something when you click on them.
2642 * Fields:: The @code{field} property defines
2643 fields within the buffer.
2644 * Not Intervals:: Why text properties do not use
2645 Lisp-visible text intervals.
2646 @end menu
2647
2648 @node Examining Properties
2649 @subsection Examining Text Properties
2650
2651 The simplest way to examine text properties is to ask for the value of
2652 a particular property of a particular character. For that, use
2653 @code{get-text-property}. Use @code{text-properties-at} to get the
2654 entire property list of a character. @xref{Property Search}, for
2655 functions to examine the properties of a number of characters at once.
2656
2657 These functions handle both strings and buffers. Keep in mind that
2658 positions in a string start from 0, whereas positions in a buffer start
2659 from 1.
2660
2661 @defun get-text-property pos prop &optional object
2662 This function returns the value of the @var{prop} property of the
2663 character after position @var{pos} in @var{object} (a buffer or
2664 string). The argument @var{object} is optional and defaults to the
2665 current buffer.
2666
2667 If there is no @var{prop} property strictly speaking, but the character
2668 has a property category that is a symbol, then @code{get-text-property} returns
2669 the @var{prop} property of that symbol.
2670 @end defun
2671
2672 @defun get-char-property position prop &optional object
2673 This function is like @code{get-text-property}, except that it checks
2674 overlays first and then text properties. @xref{Overlays}.
2675
2676 The argument @var{object} may be a string, a buffer, or a window. If
2677 it is a window, then the buffer displayed in that window is used for
2678 text properties and overlays, but only the overlays active for that
2679 window are considered. If @var{object} is a buffer, then overlays in
2680 that buffer are considered first, in order of decreasing priority,
2681 followed by the text properties. If @var{object} is a string, only
2682 text properties are considered, since strings never have overlays.
2683 @end defun
2684
2685 @defun get-pos-property position prop &optional object
2686 This function is like @code{get-char-property}, except that it pays
2687 attention to properties' stickiness and overlays' advancement settings
2688 instead of the property of the character at (i.e. right after)
2689 @var{position}.
2690 @end defun
2691
2692 @defun get-char-property-and-overlay position prop &optional object
2693 This is like @code{get-char-property}, but gives extra information
2694 about the overlay that the property value comes from.
2695
2696 Its value is a cons cell whose @sc{car} is the property value, the
2697 same value @code{get-char-property} would return with the same
2698 arguments. Its @sc{cdr} is the overlay in which the property was
2699 found, or @code{nil}, if it was found as a text property or not found
2700 at all.
2701
2702 If @var{position} is at the end of @var{object}, both the @sc{car} and
2703 the @sc{cdr} of the value are @code{nil}.
2704 @end defun
2705
2706 @defvar char-property-alias-alist
2707 This variable holds an alist which maps property names to a list of
2708 alternative property names. If a character does not specify a direct
2709 value for a property, the alternative property names are consulted in
2710 order; the first non-@code{nil} value is used. This variable takes
2711 precedence over @code{default-text-properties}, and @code{category}
2712 properties take precedence over this variable.
2713 @end defvar
2714
2715 @defun text-properties-at position &optional object
2716 This function returns the entire property list of the character at
2717 @var{position} in the string or buffer @var{object}. If @var{object} is
2718 @code{nil}, it defaults to the current buffer.
2719 @end defun
2720
2721 @defvar default-text-properties
2722 This variable holds a property list giving default values for text
2723 properties. Whenever a character does not specify a value for a
2724 property, neither directly, through a category symbol, or through
2725 @code{char-property-alias-alist}, the value stored in this list is
2726 used instead. Here is an example:
2727
2728 @example
2729 (setq default-text-properties '(foo 69)
2730 char-property-alias-alist nil)
2731 ;; @r{Make sure character 1 has no properties of its own.}
2732 (set-text-properties 1 2 nil)
2733 ;; @r{What we get, when we ask, is the default value.}
2734 (get-text-property 1 'foo)
2735 @result{} 69
2736 @end example
2737 @end defvar
2738
2739 @node Changing Properties
2740 @subsection Changing Text Properties
2741
2742 The primitives for changing properties apply to a specified range of
2743 text in a buffer or string. The function @code{set-text-properties}
2744 (see end of section) sets the entire property list of the text in that
2745 range; more often, it is useful to add, change, or delete just certain
2746 properties specified by name.
2747
2748 Since text properties are considered part of the contents of the
2749 buffer (or string), and can affect how a buffer looks on the screen,
2750 any change in buffer text properties marks the buffer as modified.
2751 Buffer text property changes are undoable also (@pxref{Undo}).
2752 Positions in a string start from 0, whereas positions in a buffer
2753 start from 1.
2754
2755 @defun put-text-property start end prop value &optional object
2756 This function sets the @var{prop} property to @var{value} for the text
2757 between @var{start} and @var{end} in the string or buffer @var{object}.
2758 If @var{object} is @code{nil}, it defaults to the current buffer.
2759 @end defun
2760
2761 @defun add-text-properties start end props &optional object
2762 This function adds or overrides text properties for the text between
2763 @var{start} and @var{end} in the string or buffer @var{object}. If
2764 @var{object} is @code{nil}, it defaults to the current buffer.
2765
2766 The argument @var{props} specifies which properties to add. It should
2767 have the form of a property list (@pxref{Property Lists}): a list whose
2768 elements include the property names followed alternately by the
2769 corresponding values.
2770
2771 The return value is @code{t} if the function actually changed some
2772 property's value; @code{nil} otherwise (if @var{props} is @code{nil} or
2773 its values agree with those in the text).
2774
2775 For example, here is how to set the @code{comment} and @code{face}
2776 properties of a range of text:
2777
2778 @example
2779 (add-text-properties @var{start} @var{end}
2780 '(comment t face highlight))
2781 @end example
2782 @end defun
2783
2784 @defun remove-text-properties start end props &optional object
2785 This function deletes specified text properties from the text between
2786 @var{start} and @var{end} in the string or buffer @var{object}. If
2787 @var{object} is @code{nil}, it defaults to the current buffer.
2788
2789 The argument @var{props} specifies which properties to delete. It
2790 should have the form of a property list (@pxref{Property Lists}): a list
2791 whose elements are property names alternating with corresponding values.
2792 But only the names matter---the values that accompany them are ignored.
2793 For example, here's how to remove the @code{face} property.
2794
2795 @example
2796 (remove-text-properties @var{start} @var{end} '(face nil))
2797 @end example
2798
2799 The return value is @code{t} if the function actually changed some
2800 property's value; @code{nil} otherwise (if @var{props} is @code{nil} or
2801 if no character in the specified text had any of those properties).
2802
2803 To remove all text properties from certain text, use
2804 @code{set-text-properties} and specify @code{nil} for the new property
2805 list.
2806 @end defun
2807
2808 @defun remove-list-of-text-properties start end list-of-properties &optional object
2809 Like @code{remove-text-properties} except that
2810 @var{list-of-properties} is a list of property names only, not an
2811 alternating list of property names and values.
2812 @end defun
2813
2814 @defun set-text-properties start end props &optional object
2815 This function completely replaces the text property list for the text
2816 between @var{start} and @var{end} in the string or buffer @var{object}.
2817 If @var{object} is @code{nil}, it defaults to the current buffer.
2818
2819 The argument @var{props} is the new property list. It should be a list
2820 whose elements are property names alternating with corresponding values.
2821
2822 After @code{set-text-properties} returns, all the characters in the
2823 specified range have identical properties.
2824
2825 If @var{props} is @code{nil}, the effect is to get rid of all properties
2826 from the specified range of text. Here's an example:
2827
2828 @example
2829 (set-text-properties @var{start} @var{end} nil)
2830 @end example
2831
2832 Do not rely on the return value of this function.
2833 @end defun
2834
2835 @defun add-face-text-property start end face &optional appendp object
2836 This function acts on the text between @var{start} and @var{end},
2837 adding the face @var{face} to the @code{face} text property.
2838 @var{face} should be a valid value for the @code{face} property
2839 (@pxref{Special Properties}), such as a face name or an anonymous face
2840 (@pxref{Faces}).
2841
2842 If any text in the region already has a non-nil @code{face} property,
2843 those face(s) are retained. This function sets the @code{face}
2844 property to a list of faces, with @var{face} as the first element (by
2845 default) and the pre-existing faces as the remaining elements. If the
2846 optional argument @var{append} is non-@code{nil}, @var{face} is
2847 appended to the end of the list instead. Note that in a face list,
2848 the first occurring value for each attribute takes precedence.
2849
2850 For example, the following code would assign a italicized green face
2851 to the text between @var{start} and @var{end}:
2852
2853 @example
2854 (add-face-text-property @var{start} @var{end} 'italic)
2855 (add-face-text-property @var{start} @var{end} '(:foreground "red"))
2856 (add-face-text-property @var{start} @var{end} '(:foreground "green"))
2857 @end example
2858
2859 The optional argument @var{object}, if non-@code{nil}, specifies a
2860 buffer or string to act on, rather than the current buffer. If
2861 @var{object} is a string, then @var{start} and @var{end} are
2862 zero-based indices into the string.
2863 @end defun
2864
2865 The easiest way to make a string with text properties is with
2866 @code{propertize}:
2867
2868 @defun propertize string &rest properties
2869 This function returns a copy of @var{string} which has the text
2870 properties @var{properties}. These properties apply to all the
2871 characters in the string that is returned. Here is an example that
2872 constructs a string with a @code{face} property and a @code{mouse-face}
2873 property:
2874
2875 @smallexample
2876 (propertize "foo" 'face 'italic
2877 'mouse-face 'bold-italic)
2878 @result{} #("foo" 0 3 (mouse-face bold-italic face italic))
2879 @end smallexample
2880
2881 To put different properties on various parts of a string, you can
2882 construct each part with @code{propertize} and then combine them with
2883 @code{concat}:
2884
2885 @smallexample
2886 (concat
2887 (propertize "foo" 'face 'italic
2888 'mouse-face 'bold-italic)
2889 " and "
2890 (propertize "bar" 'face 'italic
2891 'mouse-face 'bold-italic))
2892 @result{} #("foo and bar"
2893 0 3 (face italic mouse-face bold-italic)
2894 3 8 nil
2895 8 11 (face italic mouse-face bold-italic))
2896 @end smallexample
2897 @end defun
2898
2899 @xref{Buffer Contents}, for the function
2900 @code{buffer-substring-no-properties}, which copies text from the
2901 buffer but does not copy its properties.
2902
2903 @node Property Search
2904 @subsection Text Property Search Functions
2905
2906 In typical use of text properties, most of the time several or many
2907 consecutive characters have the same value for a property. Rather than
2908 writing your programs to examine characters one by one, it is much
2909 faster to process chunks of text that have the same property value.
2910
2911 Here are functions you can use to do this. They use @code{eq} for
2912 comparing property values. In all cases, @var{object} defaults to the
2913 current buffer.
2914
2915 For good performance, it's very important to use the @var{limit}
2916 argument to these functions, especially the ones that search for a
2917 single property---otherwise, they may spend a long time scanning to the
2918 end of the buffer, if the property you are interested in does not change.
2919
2920 These functions do not move point; instead, they return a position (or
2921 @code{nil}). Remember that a position is always between two characters;
2922 the position returned by these functions is between two characters with
2923 different properties.
2924
2925 @defun next-property-change pos &optional object limit
2926 The function scans the text forward from position @var{pos} in the
2927 string or buffer @var{object} until it finds a change in some text
2928 property, then returns the position of the change. In other words, it
2929 returns the position of the first character beyond @var{pos} whose
2930 properties are not identical to those of the character just after
2931 @var{pos}.
2932
2933 If @var{limit} is non-@code{nil}, then the scan ends at position
2934 @var{limit}. If there is no property change before that point, this
2935 function returns @var{limit}.
2936
2937 The value is @code{nil} if the properties remain unchanged all the way
2938 to the end of @var{object} and @var{limit} is @code{nil}. If the value
2939 is non-@code{nil}, it is a position greater than or equal to @var{pos}.
2940 The value equals @var{pos} only when @var{limit} equals @var{pos}.
2941
2942 Here is an example of how to scan the buffer by chunks of text within
2943 which all properties are constant:
2944
2945 @smallexample
2946 (while (not (eobp))
2947 (let ((plist (text-properties-at (point)))
2948 (next-change
2949 (or (next-property-change (point) (current-buffer))
2950 (point-max))))
2951 @r{Process text from point to @var{next-change}@dots{}}
2952 (goto-char next-change)))
2953 @end smallexample
2954 @end defun
2955
2956 @defun previous-property-change pos &optional object limit
2957 This is like @code{next-property-change}, but scans back from @var{pos}
2958 instead of forward. If the value is non-@code{nil}, it is a position
2959 less than or equal to @var{pos}; it equals @var{pos} only if @var{limit}
2960 equals @var{pos}.
2961 @end defun
2962
2963 @defun next-single-property-change pos prop &optional object limit
2964 The function scans text for a change in the @var{prop} property, then
2965 returns the position of the change. The scan goes forward from
2966 position @var{pos} in the string or buffer @var{object}. In other
2967 words, this function returns the position of the first character
2968 beyond @var{pos} whose @var{prop} property differs from that of the
2969 character just after @var{pos}.
2970
2971 If @var{limit} is non-@code{nil}, then the scan ends at position
2972 @var{limit}. If there is no property change before that point,
2973 @code{next-single-property-change} returns @var{limit}.
2974
2975 The value is @code{nil} if the property remains unchanged all the way to
2976 the end of @var{object} and @var{limit} is @code{nil}. If the value is
2977 non-@code{nil}, it is a position greater than or equal to @var{pos}; it
2978 equals @var{pos} only if @var{limit} equals @var{pos}.
2979 @end defun
2980
2981 @defun previous-single-property-change pos prop &optional object limit
2982 This is like @code{next-single-property-change}, but scans back from
2983 @var{pos} instead of forward. If the value is non-@code{nil}, it is a
2984 position less than or equal to @var{pos}; it equals @var{pos} only if
2985 @var{limit} equals @var{pos}.
2986 @end defun
2987
2988 @defun next-char-property-change pos &optional limit
2989 This is like @code{next-property-change} except that it considers
2990 overlay properties as well as text properties, and if no change is
2991 found before the end of the buffer, it returns the maximum buffer
2992 position rather than @code{nil} (in this sense, it resembles the
2993 corresponding overlay function @code{next-overlay-change}, rather than
2994 @code{next-property-change}). There is no @var{object} operand
2995 because this function operates only on the current buffer. It returns
2996 the next address at which either kind of property changes.
2997 @end defun
2998
2999 @defun previous-char-property-change pos &optional limit
3000 This is like @code{next-char-property-change}, but scans back from
3001 @var{pos} instead of forward, and returns the minimum buffer
3002 position if no change is found.
3003 @end defun
3004
3005 @defun next-single-char-property-change pos prop &optional object limit
3006 This is like @code{next-single-property-change} except that it
3007 considers overlay properties as well as text properties, and if no
3008 change is found before the end of the @var{object}, it returns the
3009 maximum valid position in @var{object} rather than @code{nil}. Unlike
3010 @code{next-char-property-change}, this function @emph{does} have an
3011 @var{object} operand; if @var{object} is not a buffer, only
3012 text-properties are considered.
3013 @end defun
3014
3015 @defun previous-single-char-property-change pos prop &optional object limit
3016 This is like @code{next-single-char-property-change}, but scans back
3017 from @var{pos} instead of forward, and returns the minimum valid
3018 position in @var{object} if no change is found.
3019 @end defun
3020
3021 @defun text-property-any start end prop value &optional object
3022 This function returns non-@code{nil} if at least one character between
3023 @var{start} and @var{end} has a property @var{prop} whose value is
3024 @var{value}. More precisely, it returns the position of the first such
3025 character. Otherwise, it returns @code{nil}.
3026
3027 The optional fifth argument, @var{object}, specifies the string or
3028 buffer to scan. Positions are relative to @var{object}. The default
3029 for @var{object} is the current buffer.
3030 @end defun
3031
3032 @defun text-property-not-all start end prop value &optional object
3033 This function returns non-@code{nil} if at least one character between
3034 @var{start} and @var{end} does not have a property @var{prop} with value
3035 @var{value}. More precisely, it returns the position of the first such
3036 character. Otherwise, it returns @code{nil}.
3037
3038 The optional fifth argument, @var{object}, specifies the string or
3039 buffer to scan. Positions are relative to @var{object}. The default
3040 for @var{object} is the current buffer.
3041 @end defun
3042
3043 @node Special Properties
3044 @subsection Properties with Special Meanings
3045
3046 Here is a table of text property names that have special built-in
3047 meanings. The following sections list a few additional special property
3048 names that control filling and property inheritance. All other names
3049 have no standard meaning, and you can use them as you like.
3050
3051 Note: the properties @code{composition}, @code{display},
3052 @code{invisible} and @code{intangible} can also cause point to move to
3053 an acceptable place, after each Emacs command. @xref{Adjusting
3054 Point}.
3055
3056 @table @code
3057 @cindex property category of text character
3058 @c FIXME: Isn't @kindex for keyboard commands?
3059 @kindex category @r{(text property)}
3060 @item category
3061 If a character has a @code{category} property, we call it the
3062 @dfn{property category} of the character. It should be a symbol. The
3063 properties of this symbol serve as defaults for the properties of the
3064 character.
3065
3066 @item face
3067 @cindex face codes of text
3068 @kindex face @r{(text property)}
3069 The @code{face} property controls the appearance of the character
3070 (@pxref{Faces}). The value of the property can be the following:
3071
3072 @itemize @bullet
3073 @item
3074 A face name (a symbol or string).
3075
3076 @item
3077 An anonymous face: a property list of the form @code{(@var{keyword}
3078 @var{value} @dots{})}, where each @var{keyword} is a face attribute
3079 name and @var{value} is a value for that attribute.
3080
3081 @item
3082 A list of faces. Each list element should be either a face name or an
3083 anonymous face. This specifies a face which is an aggregate of the
3084 attributes of each of the listed faces. Faces occurring earlier in
3085 the list have higher priority.
3086
3087 @item
3088 A cons cell of the form @code{(foreground-color . @var{color-name})}
3089 or @code{(background-color . @var{color-name})}. This specifies the
3090 foreground or background color, similar to @code{(:foreground
3091 @var{color-name})} or @code{(:background @var{color-name})}. This
3092 form is supported for backward compatibility only, and should be
3093 avoided.
3094 @end itemize
3095
3096 Font Lock mode (@pxref{Font Lock Mode}) works in most buffers by
3097 dynamically updating the @code{face} property of characters based on
3098 the context.
3099
3100 The @code{add-face-text-property} function provides a convenient way
3101 to set this text property. @xref{Changing Properties}.
3102
3103 @item font-lock-face
3104 @kindex font-lock-face @r{(text property)}
3105 This property specifies a value for the @code{face} property that Font
3106 Lock mode should apply to the underlying text. It is one of the
3107 fontification methods used by Font Lock mode, and is useful for
3108 special modes that implement their own highlighting.
3109 @xref{Precalculated Fontification}. When Font Lock mode is disabled,
3110 @code{font-lock-face} has no effect.
3111
3112 @item mouse-face
3113 @kindex mouse-face @r{(text property)}
3114 This property is used instead of @code{face} when the mouse is on or
3115 near the character. For this purpose, ``near'' means that all text
3116 between the character and where the mouse is have the same
3117 @code{mouse-face} property value.
3118
3119 Emacs ignores all face attributes from the @code{mouse-face} property
3120 that alter the text size (e.g., @code{:height}, @code{:weight}, and
3121 @code{:slant}). Those attributes are always the same as for the
3122 unhighlighted text.
3123
3124 @item fontified
3125 @kindex fontified @r{(text property)}
3126 This property says whether the text is ready for display. If
3127 @code{nil}, Emacs's redisplay routine calls the functions in
3128 @code{fontification-functions} (@pxref{Auto Faces}) to prepare this
3129 part of the buffer before it is displayed. It is used internally by
3130 the ``just in time'' font locking code.
3131
3132 @item display
3133 This property activates various features that change the
3134 way text is displayed. For example, it can make text appear taller
3135 or shorter, higher or lower, wider or narrow, or replaced with an image.
3136 @xref{Display Property}.
3137
3138 @item help-echo
3139 @kindex help-echo @r{(text property)}
3140 @cindex tooltip
3141 @anchor{Text help-echo}
3142 If text has a string as its @code{help-echo} property, then when you
3143 move the mouse onto that text, Emacs displays that string in the echo
3144 area, or in the tooltip window (@pxref{Tooltips,,, emacs, The GNU Emacs
3145 Manual}).
3146
3147 If the value of the @code{help-echo} property is a function, that
3148 function is called with three arguments, @var{window}, @var{object} and
3149 @var{pos} and should return a help string or @code{nil} for
3150 none. The first argument, @var{window} is the window in which
3151 the help was found. The second, @var{object}, is the buffer, overlay or
3152 string which had the @code{help-echo} property. The @var{pos}
3153 argument is as follows:
3154
3155 @itemize @bullet{}
3156 @item
3157 If @var{object} is a buffer, @var{pos} is the position in the buffer.
3158 @item
3159 If @var{object} is an overlay, that overlay has a @code{help-echo}
3160 property, and @var{pos} is the position in the overlay's buffer.
3161 @item
3162 If @var{object} is a string (an overlay string or a string displayed
3163 with the @code{display} property), @var{pos} is the position in that
3164 string.
3165 @end itemize
3166
3167 If the value of the @code{help-echo} property is neither a function nor
3168 a string, it is evaluated to obtain a help string.
3169
3170 You can alter the way help text is displayed by setting the variable
3171 @code{show-help-function} (@pxref{Help display}).
3172
3173 This feature is used in the mode line and for other active text.
3174
3175 @item keymap
3176 @cindex keymap of character
3177 @kindex keymap @r{(text property)}
3178 The @code{keymap} property specifies an additional keymap for
3179 commands. When this keymap applies, it is used for key lookup before
3180 the minor mode keymaps and before the buffer's local map.
3181 @xref{Active Keymaps}. If the property value is a symbol, the
3182 symbol's function definition is used as the keymap.
3183
3184 The property's value for the character before point applies if it is
3185 non-@code{nil} and rear-sticky, and the property's value for the
3186 character after point applies if it is non-@code{nil} and
3187 front-sticky. (For mouse clicks, the position of the click is used
3188 instead of the position of point.)
3189
3190 @item local-map
3191 @kindex local-map @r{(text property)}
3192 This property works like @code{keymap} except that it specifies a
3193 keymap to use @emph{instead of} the buffer's local map. For most
3194 purposes (perhaps all purposes), it is better to use the @code{keymap}
3195 property.
3196
3197 @item syntax-table
3198 The @code{syntax-table} property overrides what the syntax table says
3199 about this particular character. @xref{Syntax Properties}.
3200
3201 @item read-only
3202 @cindex read-only character
3203 @kindex read-only @r{(text property)}
3204 If a character has the property @code{read-only}, then modifying that
3205 character is not allowed. Any command that would do so gets an error,
3206 @code{text-read-only}. If the property value is a string, that string
3207 is used as the error message.
3208
3209 Insertion next to a read-only character is an error if inserting
3210 ordinary text there would inherit the @code{read-only} property due to
3211 stickiness. Thus, you can control permission to insert next to
3212 read-only text by controlling the stickiness. @xref{Sticky Properties}.
3213
3214 Since changing properties counts as modifying the buffer, it is not
3215 possible to remove a @code{read-only} property unless you know the
3216 special trick: bind @code{inhibit-read-only} to a non-@code{nil} value
3217 and then remove the property. @xref{Read Only Buffers}.
3218
3219 @item invisible
3220 @kindex invisible @r{(text property)}
3221 A non-@code{nil} @code{invisible} property can make a character invisible
3222 on the screen. @xref{Invisible Text}, for details.
3223
3224 @item intangible
3225 @kindex intangible @r{(text property)}
3226 If a group of consecutive characters have equal and non-@code{nil}
3227 @code{intangible} properties, then you cannot place point between them.
3228 If you try to move point forward into the group, point actually moves to
3229 the end of the group. If you try to move point backward into the group,
3230 point actually moves to the start of the group.
3231
3232 If consecutive characters have unequal non-@code{nil}
3233 @code{intangible} properties, they belong to separate groups; each
3234 group is separately treated as described above.
3235
3236 When the variable @code{inhibit-point-motion-hooks} is non-@code{nil},
3237 the @code{intangible} property is ignored.
3238
3239 Beware: this property operates at a very low level, and affects a lot of code
3240 in unexpected ways. So use it with extreme caution. A common misuse is to put
3241 an intangible property on invisible text, which is actually unnecessary since
3242 the command loop will move point outside of the invisible text at the end of
3243 each command anyway. @xref{Adjusting Point}.
3244
3245 @item field
3246 @kindex field @r{(text property)}
3247 Consecutive characters with the same @code{field} property constitute a
3248 @dfn{field}. Some motion functions including @code{forward-word} and
3249 @code{beginning-of-line} stop moving at a field boundary.
3250 @xref{Fields}.
3251
3252 @item cursor
3253 @kindex cursor @r{(text property)}
3254 Normally, the cursor is displayed at the beginning or the end of any
3255 overlay and text property strings present at the current buffer
3256 position. You can place the cursor on any desired character of these
3257 strings by giving that character a non-@code{nil} @code{cursor} text
3258 property. In addition, if the value of the @code{cursor} property is
3259 an integer number, it specifies the number of buffer's character
3260 positions, starting with the position where the overlay or the
3261 @code{display} property begins, for which the cursor should be
3262 displayed on that character. Specifically, if the value of the
3263 @code{cursor} property of a character is the number @var{n}, the
3264 cursor will be displayed on this character for any buffer position in
3265 the range @code{[@var{ovpos}..@var{ovpos}+@var{n})}, where @var{ovpos}
3266 is the overlay's starting position given by @code{overlay-start}
3267 (@pxref{Managing Overlays}), or the position where the @code{display}
3268 text property begins in the buffer.
3269
3270 In other words, the string character with the @code{cursor} property
3271 of any non-@code{nil} value is the character where to display the
3272 cursor. The value of the property says for which buffer positions to
3273 display the cursor there. If the value is an integer number @var{n},
3274 the cursor is displayed there when point is anywhere between the
3275 beginning of the overlay or @code{display} property and @var{n}
3276 positions after that. If the value is anything else and
3277 non-@code{nil}, the cursor is displayed there only when point is at
3278 the beginning of the @code{display} property or at
3279 @code{overlay-start}.
3280
3281 @cindex cursor position for @code{display} properties and overlays
3282 When the buffer has many overlay strings (e.g., @pxref{Overlay
3283 Properties, before-string}) or @code{display} properties that are
3284 strings, it is a good idea to use the @code{cursor} property on these
3285 strings to cue the Emacs display about the places where to put the
3286 cursor while traversing these strings. This directly communicates to
3287 the display engine where the Lisp program wants to put the cursor, or
3288 where the user would expect the cursor.
3289
3290 @item pointer
3291 @kindex pointer @r{(text property)}
3292 This specifies a specific pointer shape when the mouse pointer is over
3293 this text or image. @xref{Pointer Shape}, for possible pointer
3294 shapes.
3295
3296 @item line-spacing
3297 @kindex line-spacing @r{(text property)}
3298 A newline can have a @code{line-spacing} text or overlay property that
3299 controls the height of the display line ending with that newline. The
3300 property value overrides the default frame line spacing and the buffer
3301 local @code{line-spacing} variable. @xref{Line Height}.
3302
3303 @item line-height
3304 @kindex line-height @r{(text property)}
3305 A newline can have a @code{line-height} text or overlay property that
3306 controls the total height of the display line ending in that newline.
3307 @xref{Line Height}.
3308
3309 @item wrap-prefix
3310 If text has a @code{wrap-prefix} property, the prefix it defines will
3311 be added at display time to the beginning of every continuation line
3312 due to text wrapping (so if lines are truncated, the wrap-prefix is
3313 never used). It may be a string or an image (@pxref{Other Display
3314 Specs}), or a stretch of whitespace such as specified by the
3315 @code{:width} or @code{:align-to} display properties (@pxref{Specified
3316 Space}).
3317
3318 A wrap-prefix may also be specified for an entire buffer using the
3319 @code{wrap-prefix} buffer-local variable (however, a
3320 @code{wrap-prefix} text-property takes precedence over the value of
3321 the @code{wrap-prefix} variable). @xref{Truncation}.
3322
3323 @item line-prefix
3324 If text has a @code{line-prefix} property, the prefix it defines will
3325 be added at display time to the beginning of every non-continuation
3326 line. It may be a string or an image (@pxref{Other Display
3327 Specs}), or a stretch of whitespace such as specified by the
3328 @code{:width} or @code{:align-to} display properties (@pxref{Specified
3329 Space}).
3330
3331 A line-prefix may also be specified for an entire buffer using the
3332 @code{line-prefix} buffer-local variable (however, a
3333 @code{line-prefix} text-property takes precedence over the value of
3334 the @code{line-prefix} variable). @xref{Truncation}.
3335
3336 @item modification-hooks
3337 @cindex change hooks for a character
3338 @cindex hooks for changing a character
3339 @kindex modification-hooks @r{(text property)}
3340 If a character has the property @code{modification-hooks}, then its
3341 value should be a list of functions; modifying that character calls
3342 all of those functions before the actual modification. Each function
3343 receives two arguments: the beginning and end of the part of the
3344 buffer being modified. Note that if a particular modification hook
3345 function appears on several characters being modified by a single
3346 primitive, you can't predict how many times the function will
3347 be called.
3348 Furthermore, insertion will not modify any existing character, so this
3349 hook will only be run when removing some characters, replacing them
3350 with others, or changing their text-properties.
3351
3352 If these functions modify the buffer, they should bind
3353 @code{inhibit-modification-hooks} to @code{t} around doing so, to
3354 avoid confusing the internal mechanism that calls these hooks.
3355
3356 Overlays also support the @code{modification-hooks} property, but the
3357 details are somewhat different (@pxref{Overlay Properties}).
3358
3359 @item insert-in-front-hooks
3360 @itemx insert-behind-hooks
3361 @kindex insert-in-front-hooks @r{(text property)}
3362 @kindex insert-behind-hooks @r{(text property)}
3363 The operation of inserting text in a buffer also calls the functions
3364 listed in the @code{insert-in-front-hooks} property of the following
3365 character and in the @code{insert-behind-hooks} property of the
3366 preceding character. These functions receive two arguments, the
3367 beginning and end of the inserted text. The functions are called
3368 @emph{after} the actual insertion takes place.
3369
3370 See also @ref{Change Hooks}, for other hooks that are called
3371 when you change text in a buffer.
3372
3373 @item point-entered
3374 @itemx point-left
3375 @cindex hooks for motion of point
3376 @kindex point-entered @r{(text property)}
3377 @kindex point-left @r{(text property)}
3378 The special properties @code{point-entered} and @code{point-left}
3379 record hook functions that report motion of point. Each time point
3380 moves, Emacs compares these two property values:
3381
3382 @itemize @bullet
3383 @item
3384 the @code{point-left} property of the character after the old location,
3385 and
3386 @item
3387 the @code{point-entered} property of the character after the new
3388 location.
3389 @end itemize
3390
3391 @noindent
3392 If these two values differ, each of them is called (if not @code{nil})
3393 with two arguments: the old value of point, and the new one.
3394
3395 The same comparison is made for the characters before the old and new
3396 locations. The result may be to execute two @code{point-left} functions
3397 (which may be the same function) and/or two @code{point-entered}
3398 functions (which may be the same function). In any case, all the
3399 @code{point-left} functions are called first, followed by all the
3400 @code{point-entered} functions.
3401
3402 It is possible to use @code{char-after} to examine characters at various
3403 buffer positions without moving point to those positions. Only an
3404 actual change in the value of point runs these hook functions.
3405
3406 The variable @code{inhibit-point-motion-hooks} can inhibit running the
3407 @code{point-left} and @code{point-entered} hooks, see @ref{Inhibit
3408 point motion hooks}.
3409
3410 @item composition
3411 @kindex composition @r{(text property)}
3412 This text property is used to display a sequence of characters as a
3413 single glyph composed from components. But the value of the property
3414 itself is completely internal to Emacs and should not be manipulated
3415 directly by, for instance, @code{put-text-property}.
3416
3417 @end table
3418
3419 @defvar inhibit-point-motion-hooks
3420 @anchor{Inhibit point motion hooks} When this variable is
3421 non-@code{nil}, @code{point-left} and @code{point-entered} hooks are
3422 not run, and the @code{intangible} property has no effect. Do not set
3423 this variable globally; bind it with @code{let}.
3424 @end defvar
3425
3426 @defvar show-help-function
3427 @anchor{Help display} If this variable is non-@code{nil}, it specifies a
3428 function called to display help strings. These may be @code{help-echo}
3429 properties, menu help strings (@pxref{Simple Menu Items},
3430 @pxref{Extended Menu Items}), or tool bar help strings (@pxref{Tool
3431 Bar}). The specified function is called with one argument, the help
3432 string to display. Tooltip mode (@pxref{Tooltips,,, emacs, The GNU Emacs
3433 Manual}) provides an example.
3434 @end defvar
3435
3436 @node Format Properties
3437 @subsection Formatted Text Properties
3438
3439 These text properties affect the behavior of the fill commands. They
3440 are used for representing formatted text. @xref{Filling}, and
3441 @ref{Margins}.
3442
3443 @table @code
3444 @item hard
3445 If a newline character has this property, it is a ``hard'' newline.
3446 The fill commands do not alter hard newlines and do not move words
3447 across them. However, this property takes effect only if the
3448 @code{use-hard-newlines} minor mode is enabled. @xref{Hard and Soft
3449 Newlines,, Hard and Soft Newlines, emacs, The GNU Emacs Manual}.
3450
3451 @item right-margin
3452 This property specifies an extra right margin for filling this part of the
3453 text.
3454
3455 @item left-margin
3456 This property specifies an extra left margin for filling this part of the
3457 text.
3458
3459 @item justification
3460 This property specifies the style of justification for filling this part
3461 of the text.
3462 @end table
3463
3464 @node Sticky Properties
3465 @subsection Stickiness of Text Properties
3466 @cindex sticky text properties
3467 @cindex inheritance, text property
3468
3469 Self-inserting characters normally take on the same properties as the
3470 preceding character. This is called @dfn{inheritance} of properties.
3471
3472 A Lisp program can do insertion with inheritance or without,
3473 depending on the choice of insertion primitive. The ordinary text
3474 insertion functions, such as @code{insert}, do not inherit any
3475 properties. They insert text with precisely the properties of the
3476 string being inserted, and no others. This is correct for programs
3477 that copy text from one context to another---for example, into or out
3478 of the kill ring. To insert with inheritance, use the special
3479 primitives described in this section. Self-inserting characters
3480 inherit properties because they work using these primitives.
3481
3482 When you do insertion with inheritance, @emph{which} properties are
3483 inherited, and from where, depends on which properties are @dfn{sticky}.
3484 Insertion after a character inherits those of its properties that are
3485 @dfn{rear-sticky}. Insertion before a character inherits those of its
3486 properties that are @dfn{front-sticky}. When both sides offer different
3487 sticky values for the same property, the previous character's value
3488 takes precedence.
3489
3490 By default, a text property is rear-sticky but not front-sticky; thus,
3491 the default is to inherit all the properties of the preceding character,
3492 and nothing from the following character.
3493
3494 You can control the stickiness of various text properties with two
3495 specific text properties, @code{front-sticky} and @code{rear-nonsticky},
3496 and with the variable @code{text-property-default-nonsticky}. You can
3497 use the variable to specify a different default for a given property.
3498 You can use those two text properties to make any specific properties
3499 sticky or nonsticky in any particular part of the text.
3500
3501 If a character's @code{front-sticky} property is @code{t}, then all
3502 its properties are front-sticky. If the @code{front-sticky} property is
3503 a list, then the sticky properties of the character are those whose
3504 names are in the list. For example, if a character has a
3505 @code{front-sticky} property whose value is @code{(face read-only)},
3506 then insertion before the character can inherit its @code{face} property
3507 and its @code{read-only} property, but no others.
3508
3509 The @code{rear-nonsticky} property works the opposite way. Most
3510 properties are rear-sticky by default, so the @code{rear-nonsticky}
3511 property says which properties are @emph{not} rear-sticky. If a
3512 character's @code{rear-nonsticky} property is @code{t}, then none of its
3513 properties are rear-sticky. If the @code{rear-nonsticky} property is a
3514 list, properties are rear-sticky @emph{unless} their names are in the
3515 list.
3516
3517 @defvar text-property-default-nonsticky
3518 This variable holds an alist which defines the default rear-stickiness
3519 of various text properties. Each element has the form
3520 @code{(@var{property} . @var{nonstickiness})}, and it defines the
3521 stickiness of a particular text property, @var{property}.
3522
3523 If @var{nonstickiness} is non-@code{nil}, this means that the property
3524 @var{property} is rear-nonsticky by default. Since all properties are
3525 front-nonsticky by default, this makes @var{property} nonsticky in both
3526 directions by default.
3527
3528 The text properties @code{front-sticky} and @code{rear-nonsticky}, when
3529 used, take precedence over the default @var{nonstickiness} specified in
3530 @code{text-property-default-nonsticky}.
3531 @end defvar
3532
3533 Here are the functions that insert text with inheritance of properties:
3534
3535 @defun insert-and-inherit &rest strings
3536 Insert the strings @var{strings}, just like the function @code{insert},
3537 but inherit any sticky properties from the adjoining text.
3538 @end defun
3539
3540 @defun insert-before-markers-and-inherit &rest strings
3541 Insert the strings @var{strings}, just like the function
3542 @code{insert-before-markers}, but inherit any sticky properties from the
3543 adjoining text.
3544 @end defun
3545
3546 @xref{Insertion}, for the ordinary insertion functions which do not
3547 inherit.
3548
3549 @node Lazy Properties
3550 @subsection Lazy Computation of Text Properties
3551
3552 Instead of computing text properties for all the text in the buffer,
3553 you can arrange to compute the text properties for parts of the text
3554 when and if something depends on them.
3555
3556 The primitive that extracts text from the buffer along with its
3557 properties is @code{buffer-substring}. Before examining the properties,
3558 this function runs the abnormal hook @code{buffer-access-fontify-functions}.
3559
3560 @defvar buffer-access-fontify-functions
3561 This variable holds a list of functions for computing text properties.
3562 Before @code{buffer-substring} copies the text and text properties for a
3563 portion of the buffer, it calls all the functions in this list. Each of
3564 the functions receives two arguments that specify the range of the
3565 buffer being accessed. (The buffer itself is always the current
3566 buffer.)
3567 @end defvar
3568
3569 The function @code{buffer-substring-no-properties} does not call these
3570 functions, since it ignores text properties anyway.
3571
3572 In order to prevent the hook functions from being called more than
3573 once for the same part of the buffer, you can use the variable
3574 @code{buffer-access-fontified-property}.
3575
3576 @defvar buffer-access-fontified-property
3577 If this variable's value is non-@code{nil}, it is a symbol which is used
3578 as a text property name. A non-@code{nil} value for that text property
3579 means, ``the other text properties for this character have already been
3580 computed''.
3581
3582 If all the characters in the range specified for @code{buffer-substring}
3583 have a non-@code{nil} value for this property, @code{buffer-substring}
3584 does not call the @code{buffer-access-fontify-functions} functions. It
3585 assumes these characters already have the right text properties, and
3586 just copies the properties they already have.
3587
3588 The normal way to use this feature is that the
3589 @code{buffer-access-fontify-functions} functions add this property, as
3590 well as others, to the characters they operate on. That way, they avoid
3591 being called over and over for the same text.
3592 @end defvar
3593
3594 @node Clickable Text
3595 @subsection Defining Clickable Text
3596 @cindex clickable text
3597 @cindex follow links
3598 @cindex mouse-1
3599
3600 @dfn{Clickable text} is text that can be clicked, with either the
3601 mouse or via a keyboard command, to produce some result. Many major
3602 modes use clickable text to implement textual hyper-links, or
3603 @dfn{links} for short.
3604
3605 The easiest way to insert and manipulate links is to use the
3606 @code{button} package. @xref{Buttons}. In this section, we will
3607 explain how to manually set up clickable text in a buffer, using text
3608 properties. For simplicity, we will refer to the clickable text as a
3609 @dfn{link}.
3610
3611 Implementing a link involves three separate steps: (1) indicating
3612 clickability when the mouse moves over the link; (2) making @key{RET}
3613 or @kbd{Mouse-2} on that link do something; and (3) setting up a
3614 @code{follow-link} condition so that the link obeys
3615 @code{mouse-1-click-follows-link}.
3616
3617 To indicate clickability, add the @code{mouse-face} text property to
3618 the text of the link; then Emacs will highlight the link when the
3619 mouse moves over it. In addition, you should define a tooltip or echo
3620 area message, using the @code{help-echo} text property. @xref{Special
3621 Properties}. For instance, here is how Dired indicates that file
3622 names are clickable:
3623
3624 @smallexample
3625 (if (dired-move-to-filename)
3626 (add-text-properties
3627 (point)
3628 (save-excursion
3629 (dired-move-to-end-of-filename)
3630 (point))
3631 '(mouse-face highlight
3632 help-echo "mouse-2: visit this file in other window")))
3633 @end smallexample
3634
3635 To make the link clickable, bind @key{RET} and @kbd{Mouse-2} to
3636 commands that perform the desired action. Each command should check
3637 to see whether it was called on a link, and act accordingly. For
3638 instance, Dired's major mode keymap binds @kbd{Mouse-2} to the
3639 following command:
3640
3641 @smallexample
3642 (defun dired-mouse-find-file-other-window (event)
3643 "In Dired, visit the file or directory name you click on."
3644 (interactive "e")
3645 (let ((window (posn-window (event-end event)))
3646 (pos (posn-point (event-end event)))
3647 file)
3648 (if (not (windowp window))
3649 (error "No file chosen"))
3650 (with-current-buffer (window-buffer window)
3651 (goto-char pos)
3652 (setq file (dired-get-file-for-visit)))
3653 (if (file-directory-p file)
3654 (or (and (cdr dired-subdir-alist)
3655 (dired-goto-subdir file))
3656 (progn
3657 (select-window window)
3658 (dired-other-window file)))
3659 (select-window window)
3660 (find-file-other-window (file-name-sans-versions file t)))))
3661 @end smallexample
3662
3663 @noindent
3664 This command uses the functions @code{posn-window} and
3665 @code{posn-point} to determine where the click occurred, and
3666 @code{dired-get-file-for-visit} to determine which file to visit.
3667
3668 Instead of binding the mouse command in a major mode keymap, you can
3669 bind it within the link text, using the @code{keymap} text property
3670 (@pxref{Special Properties}). For instance:
3671
3672 @example
3673 (let ((map (make-sparse-keymap)))
3674 (define-key map [mouse-2] 'operate-this-button)
3675 (put-text-property link-start link-end 'keymap map))
3676 @end example
3677
3678 @noindent
3679 With this method, you can easily define different commands for
3680 different links. Furthermore, the global definition of @key{RET} and
3681 @kbd{Mouse-2} remain available for the rest of the text in the buffer.
3682
3683 @vindex mouse-1-click-follows-link
3684 The basic Emacs command for clicking on links is @kbd{Mouse-2}.
3685 However, for compatibility with other graphical applications, Emacs
3686 also recognizes @kbd{Mouse-1} clicks on links, provided the user
3687 clicks on the link quickly without moving the mouse. This behavior is
3688 controlled by the user option @code{mouse-1-click-follows-link}.
3689 @xref{Mouse References,,, emacs, The GNU Emacs Manual}.
3690
3691 To set up the link so that it obeys
3692 @code{mouse-1-click-follows-link}, you must either (1) apply a
3693 @code{follow-link} text or overlay property to the link text, or (2)
3694 bind the @code{follow-link} event to a keymap (which can be a major
3695 mode keymap or a local keymap specified via the @code{keymap} text
3696 property). The value of the @code{follow-link} property, or the
3697 binding for the @code{follow-link} event, acts as a ``condition'' for
3698 the link action. This condition tells Emacs two things: the
3699 circumstances under which a @kbd{Mouse-1} click should be regarded as
3700 occurring ``inside'' the link, and how to compute an ``action code''
3701 that says what to translate the @kbd{Mouse-1} click into. The link
3702 action condition can be one of the following:
3703
3704 @table @asis
3705 @item @code{mouse-face}
3706 If the condition is the symbol @code{mouse-face}, a position is inside
3707 a link if there is a non-@code{nil} @code{mouse-face} property at that
3708 position. The action code is always @code{t}.
3709
3710 For example, here is how Info mode handles @key{Mouse-1}:
3711
3712 @smallexample
3713 (define-key Info-mode-map [follow-link] 'mouse-face)
3714 @end smallexample
3715
3716 @item a function
3717 If the condition is a function, @var{func}, then a position @var{pos}
3718 is inside a link if @code{(@var{func} @var{pos})} evaluates to
3719 non-@code{nil}. The value returned by @var{func} serves as the action
3720 code.
3721
3722 For example, here is how pcvs enables @kbd{Mouse-1} to follow links on
3723 file names only:
3724
3725 @smallexample
3726 (define-key map [follow-link]
3727 (lambda (pos)
3728 (eq (get-char-property pos 'face) 'cvs-filename-face)))
3729 @end smallexample
3730
3731 @item anything else
3732 If the condition value is anything else, then the position is inside a
3733 link and the condition itself is the action code. Clearly, you should
3734 specify this kind of condition only when applying the condition via a
3735 text or property overlay on the link text (so that it does not apply
3736 to the entire buffer).
3737 @end table
3738
3739 @noindent
3740 The action code tells @kbd{Mouse-1} how to follow the link:
3741
3742 @table @asis
3743 @item a string or vector
3744 If the action code is a string or vector, the @kbd{Mouse-1} event is
3745 translated into the first element of the string or vector; i.e., the
3746 action of the @kbd{Mouse-1} click is the local or global binding of
3747 that character or symbol. Thus, if the action code is @code{"foo"},
3748 @kbd{Mouse-1} translates into @kbd{f}. If it is @code{[foo]},
3749 @kbd{Mouse-1} translates into @key{foo}.
3750
3751 @item anything else
3752 For any other non-@code{nil} action code, the @kbd{Mouse-1} event is
3753 translated into a @kbd{Mouse-2} event at the same position.
3754 @end table
3755
3756 To define @kbd{Mouse-1} to activate a button defined with
3757 @code{define-button-type}, give the button a @code{follow-link}
3758 property. The property value should be a link action condition, as
3759 described above. @xref{Buttons}. For example, here is how Help mode
3760 handles @kbd{Mouse-1}:
3761
3762 @smallexample
3763 (define-button-type 'help-xref
3764 'follow-link t
3765 'action #'help-button-action)
3766 @end smallexample
3767
3768 To define @kbd{Mouse-1} on a widget defined with
3769 @code{define-widget}, give the widget a @code{:follow-link} property.
3770 The property value should be a link action condition, as described
3771 above. For example, here is how the @code{link} widget specifies that
3772 a @key{Mouse-1} click shall be translated to @key{RET}:
3773
3774 @smallexample
3775 (define-widget 'link 'item
3776 "An embedded link."
3777 :button-prefix 'widget-link-prefix
3778 :button-suffix 'widget-link-suffix
3779 :follow-link "\C-m"
3780 :help-echo "Follow the link."
3781 :format "%[%t%]")
3782 @end smallexample
3783
3784 @defun mouse-on-link-p pos
3785 This function returns non-@code{nil} if position @var{pos} in the
3786 current buffer is on a link. @var{pos} can also be a mouse event
3787 location, as returned by @code{event-start} (@pxref{Accessing Mouse}).
3788 @end defun
3789
3790 @node Fields
3791 @subsection Defining and Using Fields
3792 @cindex fields
3793
3794 A field is a range of consecutive characters in the buffer that are
3795 identified by having the same value (comparing with @code{eq}) of the
3796 @code{field} property (either a text-property or an overlay property).
3797 This section describes special functions that are available for
3798 operating on fields.
3799
3800 You specify a field with a buffer position, @var{pos}. We think of
3801 each field as containing a range of buffer positions, so the position
3802 you specify stands for the field containing that position.
3803
3804 When the characters before and after @var{pos} are part of the same
3805 field, there is no doubt which field contains @var{pos}: the one those
3806 characters both belong to. When @var{pos} is at a boundary between
3807 fields, which field it belongs to depends on the stickiness of the
3808 @code{field} properties of the two surrounding characters (@pxref{Sticky
3809 Properties}). The field whose property would be inherited by text
3810 inserted at @var{pos} is the field that contains @var{pos}.
3811
3812 There is an anomalous case where newly inserted text at @var{pos}
3813 would not inherit the @code{field} property from either side. This
3814 happens if the previous character's @code{field} property is not
3815 rear-sticky, and the following character's @code{field} property is not
3816 front-sticky. In this case, @var{pos} belongs to neither the preceding
3817 field nor the following field; the field functions treat it as belonging
3818 to an empty field whose beginning and end are both at @var{pos}.
3819
3820 In all of these functions, if @var{pos} is omitted or @code{nil}, the
3821 value of point is used by default. If narrowing is in effect, then
3822 @var{pos} should fall within the accessible portion. @xref{Narrowing}.
3823
3824 @defun field-beginning &optional pos escape-from-edge limit
3825 This function returns the beginning of the field specified by @var{pos}.
3826
3827 If @var{pos} is at the beginning of its field, and
3828 @var{escape-from-edge} is non-@code{nil}, then the return value is
3829 always the beginning of the preceding field that @emph{ends} at @var{pos},
3830 regardless of the stickiness of the @code{field} properties around
3831 @var{pos}.
3832
3833 If @var{limit} is non-@code{nil}, it is a buffer position; if the
3834 beginning of the field is before @var{limit}, then @var{limit} will be
3835 returned instead.
3836 @end defun
3837
3838 @defun field-end &optional pos escape-from-edge limit
3839 This function returns the end of the field specified by @var{pos}.
3840
3841 If @var{pos} is at the end of its field, and @var{escape-from-edge} is
3842 non-@code{nil}, then the return value is always the end of the following
3843 field that @emph{begins} at @var{pos}, regardless of the stickiness of
3844 the @code{field} properties around @var{pos}.
3845
3846 If @var{limit} is non-@code{nil}, it is a buffer position; if the end
3847 of the field is after @var{limit}, then @var{limit} will be returned
3848 instead.
3849 @end defun
3850
3851 @defun field-string &optional pos
3852 This function returns the contents of the field specified by @var{pos},
3853 as a string.
3854 @end defun
3855
3856 @defun field-string-no-properties &optional pos
3857 This function returns the contents of the field specified by @var{pos},
3858 as a string, discarding text properties.
3859 @end defun
3860
3861 @defun delete-field &optional pos
3862 This function deletes the text of the field specified by @var{pos}.
3863 @end defun
3864
3865 @defun constrain-to-field new-pos old-pos &optional escape-from-edge only-in-line inhibit-capture-property
3866 This function ``constrains'' @var{new-pos} to the field that
3867 @var{old-pos} belongs to---in other words, it returns the position
3868 closest to @var{new-pos} that is in the same field as @var{old-pos}.
3869
3870 If @var{new-pos} is @code{nil}, then @code{constrain-to-field} uses
3871 the value of point instead, and moves point to the resulting position
3872 in addition to returning that position.
3873
3874 If @var{old-pos} is at the boundary of two fields, then the acceptable
3875 final positions depend on the argument @var{escape-from-edge}. If
3876 @var{escape-from-edge} is @code{nil}, then @var{new-pos} must be in
3877 the field whose @code{field} property equals what new characters
3878 inserted at @var{old-pos} would inherit. (This depends on the
3879 stickiness of the @code{field} property for the characters before and
3880 after @var{old-pos}.) If @var{escape-from-edge} is non-@code{nil},
3881 @var{new-pos} can be anywhere in the two adjacent fields.
3882 Additionally, if two fields are separated by another field with the
3883 special value @code{boundary}, then any point within this special
3884 field is also considered to be ``on the boundary''.
3885
3886 Commands like @kbd{C-a} with no argument, that normally move backward
3887 to a specific kind of location and stay there once there, probably
3888 should specify @code{nil} for @var{escape-from-edge}. Other motion
3889 commands that check fields should probably pass @code{t}.
3890
3891 If the optional argument @var{only-in-line} is non-@code{nil}, and
3892 constraining @var{new-pos} in the usual way would move it to a different
3893 line, @var{new-pos} is returned unconstrained. This used in commands
3894 that move by line, such as @code{next-line} and
3895 @code{beginning-of-line}, so that they respect field boundaries only in
3896 the case where they can still move to the right line.
3897
3898 If the optional argument @var{inhibit-capture-property} is
3899 non-@code{nil}, and @var{old-pos} has a non-@code{nil} property of that
3900 name, then any field boundaries are ignored.
3901
3902 You can cause @code{constrain-to-field} to ignore all field boundaries
3903 (and so never constrain anything) by binding the variable
3904 @code{inhibit-field-text-motion} to a non-@code{nil} value.
3905 @end defun
3906
3907 @node Not Intervals
3908 @subsection Why Text Properties are not Intervals
3909 @cindex intervals
3910
3911 Some editors that support adding attributes to text in the buffer do
3912 so by letting the user specify ``intervals'' within the text, and adding
3913 the properties to the intervals. Those editors permit the user or the
3914 programmer to determine where individual intervals start and end. We
3915 deliberately provided a different sort of interface in Emacs Lisp to
3916 avoid certain paradoxical behavior associated with text modification.
3917
3918 If the actual subdivision into intervals is meaningful, that means you
3919 can distinguish between a buffer that is just one interval with a
3920 certain property, and a buffer containing the same text subdivided into
3921 two intervals, both of which have that property.
3922
3923 Suppose you take the buffer with just one interval and kill part of
3924 the text. The text remaining in the buffer is one interval, and the
3925 copy in the kill ring (and the undo list) becomes a separate interval.
3926 Then if you yank back the killed text, you get two intervals with the
3927 same properties. Thus, editing does not preserve the distinction
3928 between one interval and two.
3929
3930 Suppose we ``fix'' this problem by coalescing the two intervals when
3931 the text is inserted. That works fine if the buffer originally was a
3932 single interval. But suppose instead that we have two adjacent
3933 intervals with the same properties, and we kill the text of one interval
3934 and yank it back. The same interval-coalescence feature that rescues
3935 the other case causes trouble in this one: after yanking, we have just
3936 one interval. One again, editing does not preserve the distinction
3937 between one interval and two.
3938
3939 Insertion of text at the border between intervals also raises
3940 questions that have no satisfactory answer.
3941
3942 However, it is easy to arrange for editing to behave consistently
3943 for questions of the form, ``What are the properties of text at this
3944 buffer or string position?'' So we have decided these are the only
3945 questions that make sense; we have not implemented asking questions
3946 about where intervals start or end.
3947
3948 In practice, you can usually use the text property search functions in
3949 place of explicit interval boundaries. You can think of them as finding
3950 the boundaries of intervals, assuming that intervals are always
3951 coalesced whenever possible. @xref{Property Search}.
3952
3953 Emacs also provides explicit intervals as a presentation feature; see
3954 @ref{Overlays}.
3955
3956 @node Substitution
3957 @section Substituting for a Character Code
3958
3959 The following functions replace characters within a specified region
3960 based on their character codes.
3961
3962 @defun subst-char-in-region start end old-char new-char &optional noundo
3963 @cindex replace characters
3964 This function replaces all occurrences of the character @var{old-char}
3965 with the character @var{new-char} in the region of the current buffer
3966 defined by @var{start} and @var{end}.
3967
3968 @cindex undo avoidance
3969 If @var{noundo} is non-@code{nil}, then @code{subst-char-in-region} does
3970 not record the change for undo and does not mark the buffer as modified.
3971 This was useful for controlling the old selective display feature
3972 (@pxref{Selective Display}).
3973
3974 @code{subst-char-in-region} does not move point and returns
3975 @code{nil}.
3976
3977 @example
3978 @group
3979 ---------- Buffer: foo ----------
3980 This is the contents of the buffer before.
3981 ---------- Buffer: foo ----------
3982 @end group
3983
3984 @group
3985 (subst-char-in-region 1 20 ?i ?X)
3986 @result{} nil
3987
3988 ---------- Buffer: foo ----------
3989 ThXs Xs the contents of the buffer before.
3990 ---------- Buffer: foo ----------
3991 @end group
3992 @end example
3993 @end defun
3994
3995 @deffn Command translate-region start end table
3996 This function applies a translation table to the characters in the
3997 buffer between positions @var{start} and @var{end}.
3998
3999 The translation table @var{table} is a string or a char-table;
4000 @code{(aref @var{table} @var{ochar})} gives the translated character
4001 corresponding to @var{ochar}. If @var{table} is a string, any
4002 characters with codes larger than the length of @var{table} are not
4003 altered by the translation.
4004
4005 The return value of @code{translate-region} is the number of
4006 characters that were actually changed by the translation. This does
4007 not count characters that were mapped into themselves in the
4008 translation table.
4009 @end deffn
4010
4011 @node Registers
4012 @section Registers
4013 @cindex registers
4014
4015 A register is a sort of variable used in Emacs editing that can hold a
4016 variety of different kinds of values. Each register is named by a
4017 single character. All @acronym{ASCII} characters and their meta variants
4018 (but with the exception of @kbd{C-g}) can be used to name registers.
4019 Thus, there are 255 possible registers. A register is designated in
4020 Emacs Lisp by the character that is its name.
4021
4022 @defvar register-alist
4023 This variable is an alist of elements of the form @code{(@var{name} .
4024 @var{contents})}. Normally, there is one element for each Emacs
4025 register that has been used.
4026
4027 The object @var{name} is a character (an integer) identifying the
4028 register.
4029 @end defvar
4030
4031 The @var{contents} of a register can have several possible types:
4032
4033 @table @asis
4034 @item a number
4035 A number stands for itself. If @code{insert-register} finds a number
4036 in the register, it converts the number to decimal.
4037
4038 @item a marker
4039 A marker represents a buffer position to jump to.
4040
4041 @item a string
4042 A string is text saved in the register.
4043
4044 @item a rectangle
4045 A rectangle is represented by a list of strings.
4046
4047 @item @code{(@var{window-configuration} @var{position})}
4048 This represents a window configuration to restore in one frame, and a
4049 position to jump to in the current buffer.
4050
4051 @c FIXME: Mention frameset here.
4052 @item @code{(@var{frame-configuration} @var{position})}
4053 This represents a frame configuration to restore, and a position
4054 to jump to in the current buffer.
4055
4056 @item (file @var{filename})
4057 This represents a file to visit; jumping to this value visits file
4058 @var{filename}.
4059
4060 @item (file-query @var{filename} @var{position})
4061 This represents a file to visit and a position in it; jumping to this
4062 value visits file @var{filename} and goes to buffer position
4063 @var{position}. Restoring this type of position asks the user for
4064 confirmation first.
4065 @end table
4066
4067 The functions in this section return unpredictable values unless
4068 otherwise stated.
4069
4070 @defun get-register reg
4071 This function returns the contents of the register
4072 @var{reg}, or @code{nil} if it has no contents.
4073 @end defun
4074
4075 @defun set-register reg value
4076 This function sets the contents of register @var{reg} to @var{value}.
4077 A register can be set to any value, but the other register functions
4078 expect only certain data types. The return value is @var{value}.
4079 @end defun
4080
4081 @deffn Command view-register reg
4082 This command displays what is contained in register @var{reg}.
4083 @end deffn
4084
4085 @deffn Command insert-register reg &optional beforep
4086 This command inserts contents of register @var{reg} into the current
4087 buffer.
4088
4089 Normally, this command puts point before the inserted text, and the
4090 mark after it. However, if the optional second argument @var{beforep}
4091 is non-@code{nil}, it puts the mark before and point after.
4092 You can pass a non-@code{nil} second argument @var{beforep} to this
4093 function interactively by supplying any prefix argument.
4094
4095 If the register contains a rectangle, then the rectangle is inserted
4096 with its upper left corner at point. This means that text is inserted
4097 in the current line and underneath it on successive lines.
4098
4099 If the register contains something other than saved text (a string) or
4100 a rectangle (a list), currently useless things happen. This may be
4101 changed in the future.
4102 @end deffn
4103
4104 @node Transposition
4105 @section Transposition of Text
4106
4107 This function can be used to transpose stretches of text:
4108
4109 @defun transpose-regions start1 end1 start2 end2 &optional leave-markers
4110 This function exchanges two nonoverlapping portions of the buffer.
4111 Arguments @var{start1} and @var{end1} specify the bounds of one portion
4112 and arguments @var{start2} and @var{end2} specify the bounds of the
4113 other portion.
4114
4115 Normally, @code{transpose-regions} relocates markers with the transposed
4116 text; a marker previously positioned within one of the two transposed
4117 portions moves along with that portion, thus remaining between the same
4118 two characters in their new position. However, if @var{leave-markers}
4119 is non-@code{nil}, @code{transpose-regions} does not do this---it leaves
4120 all markers unrelocated.
4121 @end defun
4122
4123 @node Base 64
4124 @section Base 64 Encoding
4125 @cindex base 64 encoding
4126
4127 Base 64 code is used in email to encode a sequence of 8-bit bytes as
4128 a longer sequence of @acronym{ASCII} graphic characters. It is defined in
4129 Internet RFC@footnote{
4130 An RFC, an acronym for @dfn{Request for Comments}, is a numbered
4131 Internet informational document describing a standard. RFCs are
4132 usually written by technical experts acting on their own initiative,
4133 and are traditionally written in a pragmatic, experience-driven
4134 manner.
4135 }2045. This section describes the functions for
4136 converting to and from this code.
4137
4138 @deffn Command base64-encode-region beg end &optional no-line-break
4139 This function converts the region from @var{beg} to @var{end} into base
4140 64 code. It returns the length of the encoded text. An error is
4141 signaled if a character in the region is multibyte, i.e., in a
4142 multibyte buffer the region must contain only characters from the
4143 charsets @code{ascii}, @code{eight-bit-control} and
4144 @code{eight-bit-graphic}.
4145
4146 Normally, this function inserts newline characters into the encoded
4147 text, to avoid overlong lines. However, if the optional argument
4148 @var{no-line-break} is non-@code{nil}, these newlines are not added, so
4149 the output is just one long line.
4150 @end deffn
4151
4152 @defun base64-encode-string string &optional no-line-break
4153 This function converts the string @var{string} into base 64 code. It
4154 returns a string containing the encoded text. As for
4155 @code{base64-encode-region}, an error is signaled if a character in the
4156 string is multibyte.
4157
4158 Normally, this function inserts newline characters into the encoded
4159 text, to avoid overlong lines. However, if the optional argument
4160 @var{no-line-break} is non-@code{nil}, these newlines are not added, so
4161 the result string is just one long line.
4162 @end defun
4163
4164 @deffn Command base64-decode-region beg end
4165 This function converts the region from @var{beg} to @var{end} from base
4166 64 code into the corresponding decoded text. It returns the length of
4167 the decoded text.
4168
4169 The decoding functions ignore newline characters in the encoded text.
4170 @end deffn
4171
4172 @defun base64-decode-string string
4173 This function converts the string @var{string} from base 64 code into
4174 the corresponding decoded text. It returns a unibyte string containing the
4175 decoded text.
4176
4177 The decoding functions ignore newline characters in the encoded text.
4178 @end defun
4179
4180 @node Checksum/Hash
4181 @section Checksum/Hash
4182 @cindex MD5 checksum
4183 @cindex SHA hash
4184 @cindex hash, cryptographic
4185 @cindex cryptographic hash
4186
4187 Emacs has built-in support for computing @dfn{cryptographic hashes}.
4188 A cryptographic hash, or @dfn{checksum}, is a digital ``fingerprint''
4189 of a piece of data (e.g., a block of text) which can be used to check
4190 that you have an unaltered copy of that data.
4191
4192 @cindex message digest
4193 Emacs supports several common cryptographic hash algorithms: MD5,
4194 SHA-1, SHA-2, SHA-224, SHA-256, SHA-384 and SHA-512. MD5 is the
4195 oldest of these algorithms, and is commonly used in @dfn{message
4196 digests} to check the integrity of messages transmitted over a
4197 network. MD5 is not ``collision resistant'' (i.e., it is possible to
4198 deliberately design different pieces of data which have the same MD5
4199 hash), so you should not used it for anything security-related. A
4200 similar theoretical weakness also exists in SHA-1. Therefore, for
4201 security-related applications you should use the other hash types,
4202 such as SHA-2.
4203
4204 @defun secure-hash algorithm object &optional start end binary
4205 This function returns a hash for @var{object}. The argument
4206 @var{algorithm} is a symbol stating which hash to compute: one of
4207 @code{md5}, @code{sha1}, @code{sha224}, @code{sha256}, @code{sha384}
4208 or @code{sha512}. The argument @var{object} should be a buffer or a
4209 string.
4210
4211 The optional arguments @var{start} and @var{end} are character
4212 positions specifying the portion of @var{object} to compute the
4213 message digest for. If they are @code{nil} or omitted, the hash is
4214 computed for the whole of @var{object}.
4215
4216 If the argument @var{binary} is omitted or @code{nil}, the function
4217 returns the @dfn{text form} of the hash, as an ordinary Lisp string.
4218 If @var{binary} is non-@code{nil}, it returns the hash in @dfn{binary
4219 form}, as a sequence of bytes stored in a unibyte string.
4220
4221 This function does not compute the hash directly from the internal
4222 representation of @var{object}'s text (@pxref{Text Representations}).
4223 Instead, it encodes the text using a coding system (@pxref{Coding
4224 Systems}), and computes the hash from that encoded text. If
4225 @var{object} is a buffer, the coding system used is the one which
4226 would be chosen by default for writing the text into a file. If
4227 @var{object} is a string, the user's preferred coding system is used
4228 (@pxref{Recognize Coding,,, emacs, GNU Emacs Manual}).
4229 @end defun
4230
4231 @defun md5 object &optional start end coding-system noerror
4232 This function returns an MD5 hash. It is semi-obsolete, since for
4233 most purposes it is equivalent to calling @code{secure-hash} with
4234 @code{md5} as the @var{algorithm} argument. The @var{object},
4235 @var{start} and @var{end} arguments have the same meanings as in
4236 @code{secure-hash}.
4237
4238 If @var{coding-system} is non-@code{nil}, it specifies a coding system
4239 to use to encode the text; if omitted or @code{nil}, the default
4240 coding system is used, like in @code{secure-hash}.
4241
4242 Normally, @code{md5} signals an error if the text can't be encoded
4243 using the specified or chosen coding system. However, if
4244 @var{noerror} is non-@code{nil}, it silently uses @code{raw-text}
4245 coding instead.
4246 @end defun
4247
4248 @node Parsing HTML/XML
4249 @section Parsing HTML and XML
4250 @cindex parsing html
4251
4252 When Emacs is compiled with libxml2 support, the following functions
4253 are available to parse HTML or XML text into Lisp object trees.
4254
4255 @defun libxml-parse-html-region start end &optional base-url
4256 This function parses the text between @var{start} and @var{end} as
4257 HTML, and returns a list representing the HTML @dfn{parse tree}. It
4258 attempts to handle ``real world'' HTML by robustly coping with syntax
4259 mistakes.
4260
4261 The optional argument @var{base-url}, if non-@code{nil}, should be a
4262 string specifying the base URL for relative URLs occurring in links.
4263
4264 In the parse tree, each HTML node is represented by a list in which
4265 the first element is a symbol representing the node name, the second
4266 element is an alist of node attributes, and the remaining elements are
4267 the subnodes.
4268
4269 The following example demonstrates this. Given this (malformed) HTML
4270 document:
4271
4272 @example
4273 <html><head></head><body width=101><div class=thing>Foo<div>Yes
4274 @end example
4275
4276 @noindent
4277 A call to @code{libxml-parse-html-region} returns this:
4278
4279 @example
4280 (html ()
4281 (head ())
4282 (body ((width . "101"))
4283 (div ((class . "thing"))
4284 "Foo"
4285 (div ()
4286 "Yes"))))
4287 @end example
4288 @end defun
4289
4290 @cindex rendering html
4291 @defun shr-insert-document dom
4292 This function renders the parsed HTML in @var{dom} into the current
4293 buffer. The argument @var{dom} should be a list as generated by
4294 @code{libxml-parse-html-region}. This function is, e.g., used by
4295 @ref{Top, EWW,, eww, The Emacs Web Wowser Manual}.
4296 @end defun
4297
4298 @cindex parsing xml
4299 @defun libxml-parse-xml-region start end &optional base-url
4300 This function is the same as @code{libxml-parse-html-region}, except
4301 that it parses the text as XML rather than HTML (so it is stricter
4302 about syntax).
4303 @end defun
4304
4305 @node Atomic Changes
4306 @section Atomic Change Groups
4307 @cindex atomic changes
4308
4309 In database terminology, an @dfn{atomic} change is an indivisible
4310 change---it can succeed entirely or it can fail entirely, but it
4311 cannot partly succeed. A Lisp program can make a series of changes to
4312 one or several buffers as an @dfn{atomic change group}, meaning that
4313 either the entire series of changes will be installed in their buffers
4314 or, in case of an error, none of them will be.
4315
4316 To do this for one buffer, the one already current, simply write a
4317 call to @code{atomic-change-group} around the code that makes the
4318 changes, like this:
4319
4320 @example
4321 (atomic-change-group
4322 (insert foo)
4323 (delete-region x y))
4324 @end example
4325
4326 @noindent
4327 If an error (or other nonlocal exit) occurs inside the body of
4328 @code{atomic-change-group}, it unmakes all the changes in that buffer
4329 that were during the execution of the body. This kind of change group
4330 has no effect on any other buffers---any such changes remain.
4331
4332 If you need something more sophisticated, such as to make changes in
4333 various buffers constitute one atomic group, you must directly call
4334 lower-level functions that @code{atomic-change-group} uses.
4335
4336 @defun prepare-change-group &optional buffer
4337 This function sets up a change group for buffer @var{buffer}, which
4338 defaults to the current buffer. It returns a ``handle'' that
4339 represents the change group. You must use this handle to activate the
4340 change group and subsequently to finish it.
4341 @end defun
4342
4343 To use the change group, you must @dfn{activate} it. You must do
4344 this before making any changes in the text of @var{buffer}.
4345
4346 @defun activate-change-group handle
4347 This function activates the change group that @var{handle} designates.
4348 @end defun
4349
4350 After you activate the change group, any changes you make in that
4351 buffer become part of it. Once you have made all the desired changes
4352 in the buffer, you must @dfn{finish} the change group. There are two
4353 ways to do this: you can either accept (and finalize) all the changes,
4354 or cancel them all.
4355
4356 @defun accept-change-group handle
4357 This function accepts all the changes in the change group specified by
4358 @var{handle}, making them final.
4359 @end defun
4360
4361 @defun cancel-change-group handle
4362 This function cancels and undoes all the changes in the change group
4363 specified by @var{handle}.
4364 @end defun
4365
4366 Your code should use @code{unwind-protect} to make sure the group is
4367 always finished. The call to @code{activate-change-group} should be
4368 inside the @code{unwind-protect}, in case the user types @kbd{C-g}
4369 just after it runs. (This is one reason why
4370 @code{prepare-change-group} and @code{activate-change-group} are
4371 separate functions, because normally you would call
4372 @code{prepare-change-group} before the start of that
4373 @code{unwind-protect}.) Once you finish the group, don't use the
4374 handle again---in particular, don't try to finish the same group
4375 twice.
4376
4377 To make a multibuffer change group, call @code{prepare-change-group}
4378 once for each buffer you want to cover, then use @code{nconc} to
4379 combine the returned values, like this:
4380
4381 @example
4382 (nconc (prepare-change-group buffer-1)
4383 (prepare-change-group buffer-2))
4384 @end example
4385
4386 You can then activate the multibuffer change group with a single call
4387 to @code{activate-change-group}, and finish it with a single call to
4388 @code{accept-change-group} or @code{cancel-change-group}.
4389
4390 Nested use of several change groups for the same buffer works as you
4391 would expect. Non-nested use of change groups for the same buffer
4392 will get Emacs confused, so don't let it happen; the first change
4393 group you start for any given buffer should be the last one finished.
4394
4395 @node Change Hooks
4396 @section Change Hooks
4397 @cindex change hooks
4398 @cindex hooks for text changes
4399
4400 These hook variables let you arrange to take notice of all changes in
4401 all buffers (or in a particular buffer, if you make them buffer-local).
4402 See also @ref{Special Properties}, for how to detect changes to specific
4403 parts of the text.
4404
4405 The functions you use in these hooks should save and restore the match
4406 data if they do anything that uses regular expressions; otherwise, they
4407 will interfere in bizarre ways with the editing operations that call
4408 them.
4409
4410 @defvar before-change-functions
4411 This variable holds a list of functions to call before any buffer
4412 modification. Each function gets two arguments, the beginning and end
4413 of the region that is about to change, represented as integers. The
4414 buffer that is about to change is always the current buffer.
4415 @end defvar
4416
4417 @defvar after-change-functions
4418 This variable holds a list of functions to call after any buffer
4419 modification. Each function receives three arguments: the beginning
4420 and end of the region just changed, and the length of the text that
4421 existed before the change. All three arguments are integers. The
4422 buffer that has been changed is always the current buffer.
4423
4424 The length of the old text is the difference between the buffer
4425 positions before and after that text as it was before the change. As
4426 for the changed text, its length is simply the difference between the
4427 first two arguments.
4428 @end defvar
4429
4430 Output of messages into the @file{*Messages*} buffer does not
4431 call these functions.
4432
4433 @defmac combine-after-change-calls body@dots{}
4434 The macro executes @var{body} normally, but arranges to call the
4435 after-change functions just once for a series of several changes---if
4436 that seems safe.
4437
4438 If a program makes several text changes in the same area of the buffer,
4439 using the macro @code{combine-after-change-calls} around that part of
4440 the program can make it run considerably faster when after-change hooks
4441 are in use. When the after-change hooks are ultimately called, the
4442 arguments specify a portion of the buffer including all of the changes
4443 made within the @code{combine-after-change-calls} body.
4444
4445 @strong{Warning:} You must not alter the values of
4446 @code{after-change-functions} within
4447 the body of a @code{combine-after-change-calls} form.
4448
4449 @strong{Warning:} if the changes you combine occur in widely scattered
4450 parts of the buffer, this will still work, but it is not advisable,
4451 because it may lead to inefficient behavior for some change hook
4452 functions.
4453 @end defmac
4454
4455 @defvar first-change-hook
4456 This variable is a normal hook that is run whenever a buffer is changed
4457 that was previously in the unmodified state.
4458 @end defvar
4459
4460 @defvar inhibit-modification-hooks
4461 If this variable is non-@code{nil}, all of the change hooks are
4462 disabled; none of them run. This affects all the hook variables
4463 described above in this section, as well as the hooks attached to
4464 certain special text properties (@pxref{Special Properties}) and overlay
4465 properties (@pxref{Overlay Properties}).
4466
4467 Also, this variable is bound to non-@code{nil} while running those
4468 same hook variables, so that by default modifying the buffer from
4469 a modification hook does not cause other modification hooks to be run.
4470 If you do want modification hooks to be run in a particular piece of
4471 code that is itself run from a modification hook, then rebind locally
4472 @code{inhibit-modification-hooks} to @code{nil}.
4473 @end defvar