Merge from trunk.
[bpt/emacs.git] / doc / lispref / text.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990-1995, 1998-2011 Free Software Foundation, Inc.
4 @c See the file elisp.texi for copying conditions.
5 @setfilename ../../info/text
6 @node Text, Non-ASCII Characters, Markers, Top
7 @chapter Text
8 @cindex text
9
10 This chapter describes the functions that deal with the text in a
11 buffer. Most examine, insert, or delete text in the current buffer,
12 often operating at point or on text adjacent to point. Many are
13 interactive. All the functions that change the text provide for undoing
14 the changes (@pxref{Undo}).
15
16 Many text-related functions operate on a region of text defined by two
17 buffer positions passed in arguments named @var{start} and @var{end}.
18 These arguments should be either markers (@pxref{Markers}) or numeric
19 character positions (@pxref{Positions}). The order of these arguments
20 does not matter; it is all right for @var{start} to be the end of the
21 region and @var{end} the beginning. For example, @code{(delete-region 1
22 10)} and @code{(delete-region 10 1)} are equivalent. An
23 @code{args-out-of-range} error is signaled if either @var{start} or
24 @var{end} is outside the accessible portion of the buffer. In an
25 interactive call, point and the mark are used for these arguments.
26
27 @cindex buffer contents
28 Throughout this chapter, ``text'' refers to the characters in the
29 buffer, together with their properties (when relevant). Keep in mind
30 that point is always between two characters, and the cursor appears on
31 the character after point.
32
33 @menu
34 * Near Point:: Examining text in the vicinity of point.
35 * Buffer Contents:: Examining text in a general fashion.
36 * Comparing Text:: Comparing substrings of buffers.
37 * Insertion:: Adding new text to a buffer.
38 * Commands for Insertion:: User-level commands to insert text.
39 * Deletion:: Removing text from a buffer.
40 * User-Level Deletion:: User-level commands to delete text.
41 * The Kill Ring:: Where removed text sometimes is saved for later use.
42 * Undo:: Undoing changes to the text of a buffer.
43 * Maintaining Undo:: How to enable and disable undo information.
44 How to control how much information is kept.
45 * Filling:: Functions for explicit filling.
46 * Margins:: How to specify margins for filling commands.
47 * Adaptive Fill:: Adaptive Fill mode chooses a fill prefix from context.
48 * Auto Filling:: How auto-fill mode is implemented to break lines.
49 * Sorting:: Functions for sorting parts of the buffer.
50 * Columns:: Computing horizontal positions, and using them.
51 * Indentation:: Functions to insert or adjust indentation.
52 * Case Changes:: Case conversion of parts of the buffer.
53 * Text Properties:: Assigning Lisp property lists to text characters.
54 * Substitution:: Replacing a given character wherever it appears.
55 * Transposition:: Swapping two portions of a buffer.
56 * Registers:: How registers are implemented. Accessing the text or
57 position stored in a register.
58 * Base 64:: Conversion to or from base 64 encoding.
59 * MD5 Checksum:: Compute the MD5 "message digest"/"checksum".
60 * Parsing HTML:: Parsing HTML and XML.
61 * Atomic Changes:: Installing several buffer changes "atomically".
62 * Change Hooks:: Supplying functions to be run when text is changed.
63 @end menu
64
65 @node Near Point
66 @section Examining Text Near Point
67 @cindex text near point
68
69 Many functions are provided to look at the characters around point.
70 Several simple functions are described here. See also @code{looking-at}
71 in @ref{Regexp Search}.
72
73 In the following four functions, ``beginning'' or ``end'' of buffer
74 refers to the beginning or end of the accessible portion.
75
76 @defun char-after &optional position
77 This function returns the character in the current buffer at (i.e.,
78 immediately after) position @var{position}. If @var{position} is out of
79 range for this purpose, either before the beginning of the buffer, or at
80 or beyond the end, then the value is @code{nil}. The default for
81 @var{position} is point.
82
83 In the following example, assume that the first character in the
84 buffer is @samp{@@}:
85
86 @example
87 @group
88 (string (char-after 1))
89 @result{} "@@"
90 @end group
91 @end example
92 @end defun
93
94 @defun char-before &optional position
95 This function returns the character in the current buffer immediately
96 before position @var{position}. If @var{position} is out of range for
97 this purpose, either at or before the beginning of the buffer, or beyond
98 the end, then the value is @code{nil}. The default for
99 @var{position} is point.
100 @end defun
101
102 @defun following-char
103 This function returns the character following point in the current
104 buffer. This is similar to @code{(char-after (point))}. However, if
105 point is at the end of the buffer, then @code{following-char} returns 0.
106
107 Remember that point is always between characters, and the cursor
108 normally appears over the character following point. Therefore, the
109 character returned by @code{following-char} is the character the
110 cursor is over.
111
112 In this example, point is between the @samp{a} and the @samp{c}.
113
114 @example
115 @group
116 ---------- Buffer: foo ----------
117 Gentlemen may cry ``Pea@point{}ce! Peace!,''
118 but there is no peace.
119 ---------- Buffer: foo ----------
120 @end group
121
122 @group
123 (string (preceding-char))
124 @result{} "a"
125 (string (following-char))
126 @result{} "c"
127 @end group
128 @end example
129 @end defun
130
131 @defun preceding-char
132 This function returns the character preceding point in the current
133 buffer. See above, under @code{following-char}, for an example. If
134 point is at the beginning of the buffer, @code{preceding-char} returns
135 0.
136 @end defun
137
138 @defun bobp
139 This function returns @code{t} if point is at the beginning of the
140 buffer. If narrowing is in effect, this means the beginning of the
141 accessible portion of the text. See also @code{point-min} in
142 @ref{Point}.
143 @end defun
144
145 @defun eobp
146 This function returns @code{t} if point is at the end of the buffer.
147 If narrowing is in effect, this means the end of accessible portion of
148 the text. See also @code{point-max} in @xref{Point}.
149 @end defun
150
151 @defun bolp
152 This function returns @code{t} if point is at the beginning of a line.
153 @xref{Text Lines}. The beginning of the buffer (or of its accessible
154 portion) always counts as the beginning of a line.
155 @end defun
156
157 @defun eolp
158 This function returns @code{t} if point is at the end of a line. The
159 end of the buffer (or of its accessible portion) is always considered
160 the end of a line.
161 @end defun
162
163 @node Buffer Contents
164 @section Examining Buffer Contents
165
166 This section describes functions that allow a Lisp program to
167 convert any portion of the text in the buffer into a string.
168
169 @defun buffer-substring start end
170 This function returns a string containing a copy of the text of the
171 region defined by positions @var{start} and @var{end} in the current
172 buffer. If the arguments are not positions in the accessible portion of
173 the buffer, @code{buffer-substring} signals an @code{args-out-of-range}
174 error.
175
176 It is not necessary for @var{start} to be less than @var{end}; the
177 arguments can be given in either order. But most often the smaller
178 argument is written first.
179
180 Here's an example which assumes Font-Lock mode is not enabled:
181
182 @example
183 @group
184 ---------- Buffer: foo ----------
185 This is the contents of buffer foo
186
187 ---------- Buffer: foo ----------
188 @end group
189
190 @group
191 (buffer-substring 1 10)
192 @result{} "This is t"
193 @end group
194 @group
195 (buffer-substring (point-max) 10)
196 @result{} "he contents of buffer foo\n"
197 @end group
198 @end example
199
200 If the text being copied has any text properties, these are copied into
201 the string along with the characters they belong to. @xref{Text
202 Properties}. However, overlays (@pxref{Overlays}) in the buffer and
203 their properties are ignored, not copied.
204
205 For example, if Font-Lock mode is enabled, you might get results like
206 these:
207
208 @example
209 @group
210 (buffer-substring 1 10)
211 @result{} #("This is t" 0 1 (fontified t) 1 9 (fontified t))
212 @end group
213 @end example
214 @end defun
215
216 @defun buffer-substring-no-properties start end
217 This is like @code{buffer-substring}, except that it does not copy text
218 properties, just the characters themselves. @xref{Text Properties}.
219 @end defun
220
221 @defun filter-buffer-substring start end &optional delete noprops
222 This function passes the buffer text between @var{start} and @var{end}
223 through the filter functions specified by the variable
224 @code{buffer-substring-filters}, and returns the value from the last
225 filter function. If @code{buffer-substring-filters} is @code{nil},
226 the value is the unaltered text from the buffer, what
227 @code{buffer-substring} would return.
228
229 If @var{delete} is non-@code{nil}, this function deletes the text
230 between @var{start} and @var{end} after copying it, like
231 @code{delete-and-extract-region}.
232
233 If @var{noprops} is non-@code{nil}, the final string returned does not
234 include text properties, while the string passed through the filters
235 still includes text properties from the buffer text.
236
237 Lisp code should use this function instead of @code{buffer-substring},
238 @code{buffer-substring-no-properties},
239 or @code{delete-and-extract-region} when copying into user-accessible
240 data structures such as the kill-ring, X clipboard, and registers.
241 Major and minor modes can add functions to
242 @code{buffer-substring-filters} to alter such text as it is copied out
243 of the buffer.
244 @end defun
245
246 @defvar buffer-substring-filters
247 This variable should be a list of functions that accept a single
248 argument, a string, and return a string.
249 @code{filter-buffer-substring} passes the buffer substring to the
250 first function in this list, and the return value of each function is
251 passed to the next function. The return value of the last function is
252 used as the return value of @code{filter-buffer-substring}.
253
254 As a special convention, point is set to the start of the buffer text
255 being operated on (i.e., the @var{start} argument for
256 @code{filter-buffer-substring}) before these functions are called.
257
258 If this variable is @code{nil}, no filtering is performed.
259 @end defvar
260
261 @defun buffer-string
262 This function returns the contents of the entire accessible portion of
263 the current buffer as a string. It is equivalent to
264
265 @example
266 (buffer-substring (point-min) (point-max))
267 @end example
268
269 @example
270 @group
271 ---------- Buffer: foo ----------
272 This is the contents of buffer foo
273
274 ---------- Buffer: foo ----------
275
276 (buffer-string)
277 @result{} "This is the contents of buffer foo\n"
278 @end group
279 @end example
280 @end defun
281
282 @defun current-word &optional strict really-word
283 This function returns the symbol (or word) at or near point, as a string.
284 The return value includes no text properties.
285
286 If the optional argument @var{really-word} is non-@code{nil}, it finds a
287 word; otherwise, it finds a symbol (which includes both word
288 characters and symbol constituent characters).
289
290 If the optional argument @var{strict} is non-@code{nil}, then point
291 must be in or next to the symbol or word---if no symbol or word is
292 there, the function returns @code{nil}. Otherwise, a nearby symbol or
293 word on the same line is acceptable.
294 @end defun
295
296 @defun thing-at-point thing
297 Return the @var{thing} around or next to point, as a string.
298
299 The argument @var{thing} is a symbol which specifies a kind of syntactic
300 entity. Possibilities include @code{symbol}, @code{list}, @code{sexp},
301 @code{defun}, @code{filename}, @code{url}, @code{word}, @code{sentence},
302 @code{whitespace}, @code{line}, @code{page}, and others.
303
304 @example
305 ---------- Buffer: foo ----------
306 Gentlemen may cry ``Pea@point{}ce! Peace!,''
307 but there is no peace.
308 ---------- Buffer: foo ----------
309
310 (thing-at-point 'word)
311 @result{} "Peace"
312 (thing-at-point 'line)
313 @result{} "Gentlemen may cry ``Peace! Peace!,''\n"
314 (thing-at-point 'whitespace)
315 @result{} nil
316 @end example
317 @end defun
318
319 @node Comparing Text
320 @section Comparing Text
321 @cindex comparing buffer text
322
323 This function lets you compare portions of the text in a buffer, without
324 copying them into strings first.
325
326 @defun compare-buffer-substrings buffer1 start1 end1 buffer2 start2 end2
327 This function lets you compare two substrings of the same buffer or two
328 different buffers. The first three arguments specify one substring,
329 giving a buffer (or a buffer name) and two positions within the
330 buffer. The last three arguments specify the other substring in the
331 same way. You can use @code{nil} for @var{buffer1}, @var{buffer2}, or
332 both to stand for the current buffer.
333
334 The value is negative if the first substring is less, positive if the
335 first is greater, and zero if they are equal. The absolute value of
336 the result is one plus the index of the first differing characters
337 within the substrings.
338
339 This function ignores case when comparing characters
340 if @code{case-fold-search} is non-@code{nil}. It always ignores
341 text properties.
342
343 Suppose the current buffer contains the text @samp{foobarbar
344 haha!rara!}; then in this example the two substrings are @samp{rbar }
345 and @samp{rara!}. The value is 2 because the first substring is greater
346 at the second character.
347
348 @example
349 (compare-buffer-substrings nil 6 11 nil 16 21)
350 @result{} 2
351 @end example
352 @end defun
353
354 @node Insertion
355 @section Inserting Text
356 @cindex insertion of text
357 @cindex text insertion
358
359 @cindex insertion before point
360 @cindex before point, insertion
361 @dfn{Insertion} means adding new text to a buffer. The inserted text
362 goes at point---between the character before point and the character
363 after point. Some insertion functions leave point before the inserted
364 text, while other functions leave it after. We call the former
365 insertion @dfn{after point} and the latter insertion @dfn{before point}.
366
367 Insertion relocates markers that point at positions after the
368 insertion point, so that they stay with the surrounding text
369 (@pxref{Markers}). When a marker points at the place of insertion,
370 insertion may or may not relocate the marker, depending on the marker's
371 insertion type (@pxref{Marker Insertion Types}). Certain special
372 functions such as @code{insert-before-markers} relocate all such markers
373 to point after the inserted text, regardless of the markers' insertion
374 type.
375
376 Insertion functions signal an error if the current buffer is
377 read-only or if they insert within read-only text.
378
379 These functions copy text characters from strings and buffers along
380 with their properties. The inserted characters have exactly the same
381 properties as the characters they were copied from. By contrast,
382 characters specified as separate arguments, not part of a string or
383 buffer, inherit their text properties from the neighboring text.
384
385 The insertion functions convert text from unibyte to multibyte in
386 order to insert in a multibyte buffer, and vice versa---if the text
387 comes from a string or from a buffer. However, they do not convert
388 unibyte character codes 128 through 255 to multibyte characters, not
389 even if the current buffer is a multibyte buffer. @xref{Converting
390 Representations}.
391
392 @defun insert &rest args
393 This function inserts the strings and/or characters @var{args} into the
394 current buffer, at point, moving point forward. In other words, it
395 inserts the text before point. An error is signaled unless all
396 @var{args} are either strings or characters. The value is @code{nil}.
397 @end defun
398
399 @defun insert-before-markers &rest args
400 This function inserts the strings and/or characters @var{args} into the
401 current buffer, at point, moving point forward. An error is signaled
402 unless all @var{args} are either strings or characters. The value is
403 @code{nil}.
404
405 This function is unlike the other insertion functions in that it
406 relocates markers initially pointing at the insertion point, to point
407 after the inserted text. If an overlay begins at the insertion point,
408 the inserted text falls outside the overlay; if a nonempty overlay
409 ends at the insertion point, the inserted text falls inside that
410 overlay.
411 @end defun
412
413 @defun insert-char character count &optional inherit
414 This function inserts @var{count} instances of @var{character} into the
415 current buffer before point. The argument @var{count} should be an
416 integer, and @var{character} must be a character. The value is @code{nil}.
417
418 This function does not convert unibyte character codes 128 through 255
419 to multibyte characters, not even if the current buffer is a multibyte
420 buffer. @xref{Converting Representations}.
421
422 If @var{inherit} is non-@code{nil}, then the inserted characters inherit
423 sticky text properties from the two characters before and after the
424 insertion point. @xref{Sticky Properties}.
425 @end defun
426
427 @defun insert-buffer-substring from-buffer-or-name &optional start end
428 This function inserts a portion of buffer @var{from-buffer-or-name}
429 (which must already exist) into the current buffer before point. The
430 text inserted is the region between @var{start} and @var{end}. (These
431 arguments default to the beginning and end of the accessible portion of
432 that buffer.) This function returns @code{nil}.
433
434 In this example, the form is executed with buffer @samp{bar} as the
435 current buffer. We assume that buffer @samp{bar} is initially empty.
436
437 @example
438 @group
439 ---------- Buffer: foo ----------
440 We hold these truths to be self-evident, that all
441 ---------- Buffer: foo ----------
442 @end group
443
444 @group
445 (insert-buffer-substring "foo" 1 20)
446 @result{} nil
447
448 ---------- Buffer: bar ----------
449 We hold these truth@point{}
450 ---------- Buffer: bar ----------
451 @end group
452 @end example
453 @end defun
454
455 @defun insert-buffer-substring-no-properties from-buffer-or-name &optional start end
456 This is like @code{insert-buffer-substring} except that it does not
457 copy any text properties.
458 @end defun
459
460 @xref{Sticky Properties}, for other insertion functions that inherit
461 text properties from the nearby text in addition to inserting it.
462 Whitespace inserted by indentation functions also inherits text
463 properties.
464
465 @node Commands for Insertion
466 @section User-Level Insertion Commands
467
468 This section describes higher-level commands for inserting text,
469 commands intended primarily for the user but useful also in Lisp
470 programs.
471
472 @deffn Command insert-buffer from-buffer-or-name
473 This command inserts the entire accessible contents of
474 @var{from-buffer-or-name} (which must exist) into the current buffer
475 after point. It leaves the mark after the inserted text. The value
476 is @code{nil}.
477 @end deffn
478
479 @deffn Command self-insert-command count
480 @cindex character insertion
481 @cindex self-insertion
482 This command inserts the last character typed; it does so @var{count}
483 times, before point, and returns @code{nil}. Most printing characters
484 are bound to this command. In routine use, @code{self-insert-command}
485 is the most frequently called function in Emacs, but programs rarely use
486 it except to install it on a keymap.
487
488 In an interactive call, @var{count} is the numeric prefix argument.
489
490 Self-insertion translates the input character through
491 @code{translation-table-for-input}. @xref{Translation of Characters}.
492
493 This command calls @code{auto-fill-function} whenever that is
494 non-@code{nil} and the character inserted is in the table
495 @code{auto-fill-chars} (@pxref{Auto Filling}).
496
497 @c Cross refs reworded to prevent overfull hbox. --rjc 15mar92
498 This command performs abbrev expansion if Abbrev mode is enabled and
499 the inserted character does not have word-constituent
500 syntax. (@xref{Abbrevs}, and @ref{Syntax Class Table}.) It is also
501 responsible for calling @code{blink-paren-function} when the inserted
502 character has close parenthesis syntax (@pxref{Blinking}).
503
504 Do not try substituting your own definition of
505 @code{self-insert-command} for the standard one. The editor command
506 loop handles this function specially.
507 @end deffn
508
509 @deffn Command newline &optional number-of-newlines
510 This command inserts newlines into the current buffer before point.
511 If @var{number-of-newlines} is supplied, that many newline characters
512 are inserted.
513
514 @cindex newline and Auto Fill mode
515 This function calls @code{auto-fill-function} if the current column
516 number is greater than the value of @code{fill-column} and
517 @var{number-of-newlines} is @code{nil}. Typically what
518 @code{auto-fill-function} does is insert a newline; thus, the overall
519 result in this case is to insert two newlines at different places: one
520 at point, and another earlier in the line. @code{newline} does not
521 auto-fill if @var{number-of-newlines} is non-@code{nil}.
522
523 This command indents to the left margin if that is not zero.
524 @xref{Margins}.
525
526 The value returned is @code{nil}. In an interactive call, @var{count}
527 is the numeric prefix argument.
528 @end deffn
529
530 @defvar overwrite-mode
531 This variable controls whether overwrite mode is in effect. The value
532 should be @code{overwrite-mode-textual}, @code{overwrite-mode-binary},
533 or @code{nil}. @code{overwrite-mode-textual} specifies textual
534 overwrite mode (treats newlines and tabs specially), and
535 @code{overwrite-mode-binary} specifies binary overwrite mode (treats
536 newlines and tabs like any other characters).
537 @end defvar
538
539 @node Deletion
540 @section Deleting Text
541 @cindex text deletion
542
543 @cindex deleting text vs killing
544 Deletion means removing part of the text in a buffer, without saving
545 it in the kill ring (@pxref{The Kill Ring}). Deleted text can't be
546 yanked, but can be reinserted using the undo mechanism (@pxref{Undo}).
547 Some deletion functions do save text in the kill ring in some special
548 cases.
549
550 All of the deletion functions operate on the current buffer.
551
552 @deffn Command erase-buffer
553 This function deletes the entire text of the current buffer
554 (@emph{not} just the accessible portion), leaving it
555 empty. If the buffer is read-only, it signals a @code{buffer-read-only}
556 error; if some of the text in it is read-only, it signals a
557 @code{text-read-only} error. Otherwise, it deletes the text without
558 asking for any confirmation. It returns @code{nil}.
559
560 Normally, deleting a large amount of text from a buffer inhibits further
561 auto-saving of that buffer ``because it has shrunk.'' However,
562 @code{erase-buffer} does not do this, the idea being that the future
563 text is not really related to the former text, and its size should not
564 be compared with that of the former text.
565 @end deffn
566
567 @deffn Command delete-region start end
568 This command deletes the text between positions @var{start} and
569 @var{end} in the current buffer, and returns @code{nil}. If point was
570 inside the deleted region, its value afterward is @var{start}.
571 Otherwise, point relocates with the surrounding text, as markers do.
572 @end deffn
573
574 @defun delete-and-extract-region start end
575 This function deletes the text between positions @var{start} and
576 @var{end} in the current buffer, and returns a string containing the
577 text just deleted.
578
579 If point was inside the deleted region, its value afterward is
580 @var{start}. Otherwise, point relocates with the surrounding text, as
581 markers do.
582 @end defun
583
584 @deffn Command delete-char count &optional killp
585 This command deletes @var{count} characters directly after point, or
586 before point if @var{count} is negative. If @var{killp} is
587 non-@code{nil}, then it saves the deleted characters in the kill ring.
588
589 In an interactive call, @var{count} is the numeric prefix argument, and
590 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
591 argument is supplied, the text is saved in the kill ring. If no prefix
592 argument is supplied, then one character is deleted, but not saved in
593 the kill ring.
594
595 The value returned is always @code{nil}.
596 @end deffn
597
598 @deffn Command delete-backward-char count &optional killp
599 @cindex deleting previous char
600 This command deletes @var{count} characters directly before point, or
601 after point if @var{count} is negative. If @var{killp} is
602 non-@code{nil}, then it saves the deleted characters in the kill ring.
603
604 In an interactive call, @var{count} is the numeric prefix argument, and
605 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
606 argument is supplied, the text is saved in the kill ring. If no prefix
607 argument is supplied, then one character is deleted, but not saved in
608 the kill ring.
609
610 The value returned is always @code{nil}.
611 @end deffn
612
613 @deffn Command backward-delete-char-untabify count &optional killp
614 @cindex tab deletion
615 This command deletes @var{count} characters backward, changing tabs
616 into spaces. When the next character to be deleted is a tab, it is
617 first replaced with the proper number of spaces to preserve alignment
618 and then one of those spaces is deleted instead of the tab. If
619 @var{killp} is non-@code{nil}, then the command saves the deleted
620 characters in the kill ring.
621
622 Conversion of tabs to spaces happens only if @var{count} is positive.
623 If it is negative, exactly @minus{}@var{count} characters after point
624 are deleted.
625
626 In an interactive call, @var{count} is the numeric prefix argument, and
627 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
628 argument is supplied, the text is saved in the kill ring. If no prefix
629 argument is supplied, then one character is deleted, but not saved in
630 the kill ring.
631
632 The value returned is always @code{nil}.
633 @end deffn
634
635 @defopt backward-delete-char-untabify-method
636 This option specifies how @code{backward-delete-char-untabify} should
637 deal with whitespace. Possible values include @code{untabify}, the
638 default, meaning convert a tab to many spaces and delete one;
639 @code{hungry}, meaning delete all tabs and spaces before point with
640 one command; @code{all} meaning delete all tabs, spaces and newlines
641 before point, and @code{nil}, meaning do nothing special for
642 whitespace characters.
643 @end defopt
644
645 @node User-Level Deletion
646 @section User-Level Deletion Commands
647
648 This section describes higher-level commands for deleting text,
649 commands intended primarily for the user but useful also in Lisp
650 programs.
651
652 @deffn Command delete-horizontal-space &optional backward-only
653 @cindex deleting whitespace
654 This function deletes all spaces and tabs around point. It returns
655 @code{nil}.
656
657 If @var{backward-only} is non-@code{nil}, the function deletes
658 spaces and tabs before point, but not after point.
659
660 In the following examples, we call @code{delete-horizontal-space} four
661 times, once on each line, with point between the second and third
662 characters on the line each time.
663
664 @example
665 @group
666 ---------- Buffer: foo ----------
667 I @point{}thought
668 I @point{} thought
669 We@point{} thought
670 Yo@point{}u thought
671 ---------- Buffer: foo ----------
672 @end group
673
674 @group
675 (delete-horizontal-space) ; @r{Four times.}
676 @result{} nil
677
678 ---------- Buffer: foo ----------
679 Ithought
680 Ithought
681 Wethought
682 You thought
683 ---------- Buffer: foo ----------
684 @end group
685 @end example
686 @end deffn
687
688 @deffn Command delete-indentation &optional join-following-p
689 This function joins the line point is on to the previous line, deleting
690 any whitespace at the join and in some cases replacing it with one
691 space. If @var{join-following-p} is non-@code{nil},
692 @code{delete-indentation} joins this line to the following line
693 instead. The function returns @code{nil}.
694
695 If there is a fill prefix, and the second of the lines being joined
696 starts with the prefix, then @code{delete-indentation} deletes the
697 fill prefix before joining the lines. @xref{Margins}.
698
699 In the example below, point is located on the line starting
700 @samp{events}, and it makes no difference if there are trailing spaces
701 in the preceding line.
702
703 @smallexample
704 @group
705 ---------- Buffer: foo ----------
706 When in the course of human
707 @point{} events, it becomes necessary
708 ---------- Buffer: foo ----------
709 @end group
710
711 (delete-indentation)
712 @result{} nil
713
714 @group
715 ---------- Buffer: foo ----------
716 When in the course of human@point{} events, it becomes necessary
717 ---------- Buffer: foo ----------
718 @end group
719 @end smallexample
720
721 After the lines are joined, the function @code{fixup-whitespace} is
722 responsible for deciding whether to leave a space at the junction.
723 @end deffn
724
725 @deffn Command fixup-whitespace
726 This function replaces all the horizontal whitespace surrounding point
727 with either one space or no space, according to the context. It
728 returns @code{nil}.
729
730 At the beginning or end of a line, the appropriate amount of space is
731 none. Before a character with close parenthesis syntax, or after a
732 character with open parenthesis or expression-prefix syntax, no space is
733 also appropriate. Otherwise, one space is appropriate. @xref{Syntax
734 Class Table}.
735
736 In the example below, @code{fixup-whitespace} is called the first time
737 with point before the word @samp{spaces} in the first line. For the
738 second invocation, point is directly after the @samp{(}.
739
740 @smallexample
741 @group
742 ---------- Buffer: foo ----------
743 This has too many @point{}spaces
744 This has too many spaces at the start of (@point{} this list)
745 ---------- Buffer: foo ----------
746 @end group
747
748 @group
749 (fixup-whitespace)
750 @result{} nil
751 (fixup-whitespace)
752 @result{} nil
753 @end group
754
755 @group
756 ---------- Buffer: foo ----------
757 This has too many spaces
758 This has too many spaces at the start of (this list)
759 ---------- Buffer: foo ----------
760 @end group
761 @end smallexample
762 @end deffn
763
764 @deffn Command just-one-space &optional n
765 @comment !!SourceFile simple.el
766 This command replaces any spaces and tabs around point with a single
767 space, or @var{n} spaces if @var{n} is specified. It returns
768 @code{nil}.
769 @end deffn
770
771 @deffn Command delete-blank-lines
772 This function deletes blank lines surrounding point. If point is on a
773 blank line with one or more blank lines before or after it, then all but
774 one of them are deleted. If point is on an isolated blank line, then it
775 is deleted. If point is on a nonblank line, the command deletes all
776 blank lines immediately following it.
777
778 A blank line is defined as a line containing only tabs and spaces.
779
780 @code{delete-blank-lines} returns @code{nil}.
781 @end deffn
782
783 @node The Kill Ring
784 @section The Kill Ring
785 @cindex kill ring
786
787 @dfn{Kill functions} delete text like the deletion functions, but save
788 it so that the user can reinsert it by @dfn{yanking}. Most of these
789 functions have @samp{kill-} in their name. By contrast, the functions
790 whose names start with @samp{delete-} normally do not save text for
791 yanking (though they can still be undone); these are ``deletion''
792 functions.
793
794 Most of the kill commands are primarily for interactive use, and are
795 not described here. What we do describe are the functions provided for
796 use in writing such commands. You can use these functions to write
797 commands for killing text. When you need to delete text for internal
798 purposes within a Lisp function, you should normally use deletion
799 functions, so as not to disturb the kill ring contents.
800 @xref{Deletion}.
801
802 Killed text is saved for later yanking in the @dfn{kill ring}. This
803 is a list that holds a number of recent kills, not just the last text
804 kill. We call this a ``ring'' because yanking treats it as having
805 elements in a cyclic order. The list is kept in the variable
806 @code{kill-ring}, and can be operated on with the usual functions for
807 lists; there are also specialized functions, described in this section,
808 that treat it as a ring.
809
810 Some people think this use of the word ``kill'' is unfortunate, since
811 it refers to operations that specifically @emph{do not} destroy the
812 entities ``killed.'' This is in sharp contrast to ordinary life, in
813 which death is permanent and ``killed'' entities do not come back to
814 life. Therefore, other metaphors have been proposed. For example, the
815 term ``cut ring'' makes sense to people who, in pre-computer days, used
816 scissors and paste to cut up and rearrange manuscripts. However, it
817 would be difficult to change the terminology now.
818
819 @menu
820 * Kill Ring Concepts:: What text looks like in the kill ring.
821 * Kill Functions:: Functions that kill text.
822 * Yanking:: How yanking is done.
823 * Yank Commands:: Commands that access the kill ring.
824 * Low-Level Kill Ring:: Functions and variables for kill ring access.
825 * Internals of Kill Ring:: Variables that hold kill ring data.
826 @end menu
827
828 @node Kill Ring Concepts
829 @comment node-name, next, previous, up
830 @subsection Kill Ring Concepts
831
832 The kill ring records killed text as strings in a list, most recent
833 first. A short kill ring, for example, might look like this:
834
835 @example
836 ("some text" "a different piece of text" "even older text")
837 @end example
838
839 @noindent
840 When the list reaches @code{kill-ring-max} entries in length, adding a
841 new entry automatically deletes the last entry.
842
843 When kill commands are interwoven with other commands, each kill
844 command makes a new entry in the kill ring. Multiple kill commands in
845 succession build up a single kill ring entry, which would be yanked as a
846 unit; the second and subsequent consecutive kill commands add text to
847 the entry made by the first one.
848
849 For yanking, one entry in the kill ring is designated the ``front'' of
850 the ring. Some yank commands ``rotate'' the ring by designating a
851 different element as the ``front.'' But this virtual rotation doesn't
852 change the list itself---the most recent entry always comes first in the
853 list.
854
855 @node Kill Functions
856 @comment node-name, next, previous, up
857 @subsection Functions for Killing
858
859 @code{kill-region} is the usual subroutine for killing text. Any
860 command that calls this function is a ``kill command'' (and should
861 probably have @samp{kill} in its name). @code{kill-region} puts the
862 newly killed text in a new element at the beginning of the kill ring or
863 adds it to the most recent element. It determines automatically (using
864 @code{last-command}) whether the previous command was a kill command,
865 and if so appends the killed text to the most recent entry.
866
867 @deffn Command kill-region start end
868 This function kills the text in the region defined by @var{start} and
869 @var{end}. The text is deleted but saved in the kill ring, along with
870 its text properties. The value is always @code{nil}.
871
872 In an interactive call, @var{start} and @var{end} are point and
873 the mark.
874
875 If the buffer or text is read-only, @code{kill-region} modifies the kill
876 ring just the same, then signals an error without modifying the buffer.
877 This is convenient because it lets the user use a series of kill
878 commands to copy text from a read-only buffer into the kill ring.
879 @end deffn
880
881 @defopt kill-read-only-ok
882 If this option is non-@code{nil}, @code{kill-region} does not signal an
883 error if the buffer or text is read-only. Instead, it simply returns,
884 updating the kill ring but not changing the buffer.
885 @end defopt
886
887 @deffn Command copy-region-as-kill start end
888 This command saves the region defined by @var{start} and @var{end} on
889 the kill ring (including text properties), but does not delete the text
890 from the buffer. It returns @code{nil}.
891
892 The command does not set @code{this-command} to @code{kill-region}, so a
893 subsequent kill command does not append to the same kill ring entry.
894
895 @c FIXME Why is it better? Why isn't copy-region-as-kill obsolete then?
896 @c Why is it used in many places in Emacs?
897 In Lisp programs, it is better to use @code{kill-new} or
898 @code{kill-append} instead of this command. @xref{Low-Level Kill Ring}.
899 @end deffn
900
901 @node Yanking
902 @subsection Yanking
903
904 Yanking means inserting text from the kill ring, but it does
905 not insert the text blindly. Yank commands and some other commands
906 use @code{insert-for-yank} to perform special processing on the
907 text that they copy into the buffer.
908
909 @defun insert-for-yank string
910 This function normally works like @code{insert} except that it doesn't
911 insert the text properties in the @code{yank-excluded-properties}
912 list. However, if any part of @var{string} has a non-@code{nil}
913 @code{yank-handler} text property, that property can do various
914 special processing on that part of the text being inserted.
915 @end defun
916
917 @defun insert-buffer-substring-as-yank buf &optional start end
918 This function resembles @code{insert-buffer-substring} except that it
919 doesn't insert the text properties in the
920 @code{yank-excluded-properties} list.
921 @end defun
922
923 You can put a @code{yank-handler} text property on all or part of
924 the text to control how it will be inserted if it is yanked. The
925 @code{insert-for-yank} function looks for that property. The property
926 value must be a list of one to four elements, with the following
927 format (where elements after the first may be omitted):
928
929 @example
930 (@var{function} @var{param} @var{noexclude} @var{undo})
931 @end example
932
933 Here is what the elements do:
934
935 @table @var
936 @item function
937 When @var{function} is present and non-@code{nil}, it is called instead of
938 @code{insert} to insert the string. @var{function} takes one
939 argument---the string to insert.
940
941 @item param
942 If @var{param} is present and non-@code{nil}, it replaces @var{string}
943 (or the part of @var{string} being processed) as the object passed to
944 @var{function} (or @code{insert}); for example, if @var{function} is
945 @code{yank-rectangle}, @var{param} should be a list of strings to
946 insert as a rectangle.
947
948 @item noexclude
949 If @var{noexclude} is present and non-@code{nil}, the normal removal of the
950 yank-excluded-properties is not performed; instead @var{function} is
951 responsible for removing those properties. This may be necessary
952 if @var{function} adjusts point before or after inserting the object.
953
954 @item undo
955 If @var{undo} is present and non-@code{nil}, it is a function that will be
956 called by @code{yank-pop} to undo the insertion of the current object.
957 It is called with two arguments, the start and end of the current
958 region. @var{function} can set @code{yank-undo-function} to override
959 the @var{undo} value.
960 @end table
961
962 @node Yank Commands
963 @comment node-name, next, previous, up
964 @subsection Functions for Yanking
965
966 This section describes higher-level commands for yanking, which are
967 intended primarily for the user but useful also in Lisp programs.
968 Both @code{yank} and @code{yank-pop} honor the
969 @code{yank-excluded-properties} variable and @code{yank-handler} text
970 property (@pxref{Yanking}).
971
972 @deffn Command yank &optional arg
973 @cindex inserting killed text
974 This command inserts before point the text at the front of the kill
975 ring. It sets the mark at the beginning of that text, using
976 @code{push-mark} (@pxref{The Mark}), and puts point at the end.
977
978 If @var{arg} is a non-@code{nil} list (which occurs interactively when
979 the user types @kbd{C-u} with no digits), then @code{yank} inserts the
980 text as described above, but puts point before the yanked text and
981 sets the mark after it.
982
983 If @var{arg} is a number, then @code{yank} inserts the @var{arg}th
984 most recently killed text---the @var{arg}th element of the kill ring
985 list, counted cyclically from the front, which is considered the
986 first element for this purpose.
987
988 @code{yank} does not alter the contents of the kill ring, unless it
989 used text provided by another program, in which case it pushes that text
990 onto the kill ring. However if @var{arg} is an integer different from
991 one, it rotates the kill ring to place the yanked string at the front.
992
993 @code{yank} returns @code{nil}.
994 @end deffn
995
996 @deffn Command yank-pop &optional arg
997 This command replaces the just-yanked entry from the kill ring with a
998 different entry from the kill ring.
999
1000 This is allowed only immediately after a @code{yank} or another
1001 @code{yank-pop}. At such a time, the region contains text that was just
1002 inserted by yanking. @code{yank-pop} deletes that text and inserts in
1003 its place a different piece of killed text. It does not add the deleted
1004 text to the kill ring, since it is already in the kill ring somewhere.
1005 It does however rotate the kill ring to place the newly yanked string at
1006 the front.
1007
1008 If @var{arg} is @code{nil}, then the replacement text is the previous
1009 element of the kill ring. If @var{arg} is numeric, the replacement is
1010 the @var{arg}th previous kill. If @var{arg} is negative, a more recent
1011 kill is the replacement.
1012
1013 The sequence of kills in the kill ring wraps around, so that after the
1014 oldest one comes the newest one, and before the newest one goes the
1015 oldest.
1016
1017 The return value is always @code{nil}.
1018 @end deffn
1019
1020 @defvar yank-undo-function
1021 If this variable is non-@code{nil}, the function @code{yank-pop} uses
1022 its value instead of @code{delete-region} to delete the text
1023 inserted by the previous @code{yank} or
1024 @code{yank-pop} command. The value must be a function of two
1025 arguments, the start and end of the current region.
1026
1027 The function @code{insert-for-yank} automatically sets this variable
1028 according to the @var{undo} element of the @code{yank-handler}
1029 text property, if there is one.
1030 @end defvar
1031
1032 @node Low-Level Kill Ring
1033 @subsection Low-Level Kill Ring
1034
1035 These functions and variables provide access to the kill ring at a
1036 lower level, but are still convenient for use in Lisp programs,
1037 because they take care of interaction with window system selections
1038 (@pxref{Window System Selections}).
1039
1040 @defun current-kill n &optional do-not-move
1041 The function @code{current-kill} rotates the yanking pointer, which
1042 designates the ``front'' of the kill ring, by @var{n} places (from newer
1043 kills to older ones), and returns the text at that place in the ring.
1044
1045 If the optional second argument @var{do-not-move} is non-@code{nil},
1046 then @code{current-kill} doesn't alter the yanking pointer; it just
1047 returns the @var{n}th kill, counting from the current yanking pointer.
1048
1049 If @var{n} is zero, indicating a request for the latest kill,
1050 @code{current-kill} calls the value of
1051 @code{interprogram-paste-function} (documented below) before
1052 consulting the kill ring. If that value is a function and calling it
1053 returns a string or a list of several string, @code{current-kill}
1054 pushes the strings onto the kill ring and returns the first string.
1055 It also sets the yanking pointer to point to the kill-ring entry of
1056 the first string returned by @code{interprogram-paste-function},
1057 regardless of the value of @var{do-not-move}. Otherwise,
1058 @code{current-kill} does not treat a zero value for @var{n} specially:
1059 it returns the entry pointed at by the yanking pointer and does not
1060 move the yanking pointer.
1061 @end defun
1062
1063 @defun kill-new string &optional replace
1064 This function pushes the text @var{string} onto the kill ring and
1065 makes the yanking pointer point to it. It discards the oldest entry
1066 if appropriate. It also invokes the value of
1067 @code{interprogram-cut-function} (see below).
1068
1069 If @var{replace} is non-@code{nil}, then @code{kill-new} replaces the
1070 first element of the kill ring with @var{string}, rather than pushing
1071 @var{string} onto the kill ring.
1072 @end defun
1073
1074 @defun kill-append string before-p
1075 This function appends the text @var{string} to the first entry in the
1076 kill ring and makes the yanking pointer point to the combined entry.
1077 Normally @var{string} goes at the end of the entry, but if
1078 @var{before-p} is non-@code{nil}, it goes at the beginning. This
1079 function also invokes the value of @code{interprogram-cut-function}
1080 (see below).
1081 @end defun
1082
1083 @defvar interprogram-paste-function
1084 This variable provides a way of transferring killed text from other
1085 programs, when you are using a window system. Its value should be
1086 @code{nil} or a function of no arguments.
1087
1088 If the value is a function, @code{current-kill} calls it to get the
1089 ``most recent kill.'' If the function returns a non-@code{nil} value,
1090 then that value is used as the ``most recent kill.'' If it returns
1091 @code{nil}, then the front of the kill ring is used.
1092
1093 To facilitate support for window systems that support multiple
1094 selections, this function may also return a list of strings. In that
1095 case, the first string is used as the ``most recent kill'', and all
1096 the other strings are pushed onto the kill ring, for easy access by
1097 @code{yank-pop}.
1098
1099 The normal use of this function is to get the window system's primary
1100 selection as the most recent kill, even if the selection belongs to
1101 another application. @xref{Window System Selections}. However, if
1102 the selection was provided by the current Emacs session, this function
1103 should return @code{nil}. (If it is hard to tell whether Emacs or
1104 some other program provided the selection, it should be good enough to
1105 use @code{string=} to compare it with the last text Emacs provided.)
1106 @end defvar
1107
1108 @defvar interprogram-cut-function
1109 This variable provides a way of communicating killed text to other
1110 programs, when you are using a window system. Its value should be
1111 @code{nil} or a function of one required argument.
1112
1113 If the value is a function, @code{kill-new} and @code{kill-append} call
1114 it with the new first element of the kill ring as the argument.
1115
1116 The normal use of this function is to set the window system's primary
1117 selection from the newly killed text.
1118 @xref{Window System Selections}.
1119 @end defvar
1120
1121 @node Internals of Kill Ring
1122 @comment node-name, next, previous, up
1123 @subsection Internals of the Kill Ring
1124
1125 The variable @code{kill-ring} holds the kill ring contents, in the
1126 form of a list of strings. The most recent kill is always at the front
1127 of the list.
1128
1129 The @code{kill-ring-yank-pointer} variable points to a link in the
1130 kill ring list, whose @sc{car} is the text to yank next. We say it
1131 identifies the ``front'' of the ring. Moving
1132 @code{kill-ring-yank-pointer} to a different link is called
1133 @dfn{rotating the kill ring}. We call the kill ring a ``ring'' because
1134 the functions that move the yank pointer wrap around from the end of the
1135 list to the beginning, or vice-versa. Rotation of the kill ring is
1136 virtual; it does not change the value of @code{kill-ring}.
1137
1138 Both @code{kill-ring} and @code{kill-ring-yank-pointer} are Lisp
1139 variables whose values are normally lists. The word ``pointer'' in the
1140 name of the @code{kill-ring-yank-pointer} indicates that the variable's
1141 purpose is to identify one element of the list for use by the next yank
1142 command.
1143
1144 The value of @code{kill-ring-yank-pointer} is always @code{eq} to one
1145 of the links in the kill ring list. The element it identifies is the
1146 @sc{car} of that link. Kill commands, which change the kill ring, also
1147 set this variable to the value of @code{kill-ring}. The effect is to
1148 rotate the ring so that the newly killed text is at the front.
1149
1150 Here is a diagram that shows the variable @code{kill-ring-yank-pointer}
1151 pointing to the second entry in the kill ring @code{("some text" "a
1152 different piece of text" "yet older text")}.
1153
1154 @example
1155 @group
1156 kill-ring ---- kill-ring-yank-pointer
1157 | |
1158 | v
1159 | --- --- --- --- --- ---
1160 --> | | |------> | | |--> | | |--> nil
1161 --- --- --- --- --- ---
1162 | | |
1163 | | |
1164 | | -->"yet older text"
1165 | |
1166 | --> "a different piece of text"
1167 |
1168 --> "some text"
1169 @end group
1170 @end example
1171
1172 @noindent
1173 This state of affairs might occur after @kbd{C-y} (@code{yank})
1174 immediately followed by @kbd{M-y} (@code{yank-pop}).
1175
1176 @defvar kill-ring
1177 This variable holds the list of killed text sequences, most recently
1178 killed first.
1179 @end defvar
1180
1181 @defvar kill-ring-yank-pointer
1182 This variable's value indicates which element of the kill ring is at the
1183 ``front'' of the ring for yanking. More precisely, the value is a tail
1184 of the value of @code{kill-ring}, and its @sc{car} is the kill string
1185 that @kbd{C-y} should yank.
1186 @end defvar
1187
1188 @defopt kill-ring-max
1189 The value of this variable is the maximum length to which the kill
1190 ring can grow, before elements are thrown away at the end. The default
1191 value for @code{kill-ring-max} is 60.
1192 @end defopt
1193
1194 @node Undo
1195 @comment node-name, next, previous, up
1196 @section Undo
1197 @cindex redo
1198
1199 Most buffers have an @dfn{undo list}, which records all changes made
1200 to the buffer's text so that they can be undone. (The buffers that
1201 don't have one are usually special-purpose buffers for which Emacs
1202 assumes that undoing is not useful. In particular, any buffer whose
1203 name begins with a space has its undo recording off by default;
1204 see @ref{Buffer Names}.) All the primitives that modify the
1205 text in the buffer automatically add elements to the front of the undo
1206 list, which is in the variable @code{buffer-undo-list}.
1207
1208 @defvar buffer-undo-list
1209 This buffer-local variable's value is the undo list of the current
1210 buffer. A value of @code{t} disables the recording of undo information.
1211 @end defvar
1212
1213 Here are the kinds of elements an undo list can have:
1214
1215 @table @code
1216 @item @var{position}
1217 This kind of element records a previous value of point; undoing this
1218 element moves point to @var{position}. Ordinary cursor motion does not
1219 make any sort of undo record, but deletion operations use these entries
1220 to record where point was before the command.
1221
1222 @item (@var{beg} . @var{end})
1223 This kind of element indicates how to delete text that was inserted.
1224 Upon insertion, the text occupied the range @var{beg}--@var{end} in the
1225 buffer.
1226
1227 @item (@var{text} . @var{position})
1228 This kind of element indicates how to reinsert text that was deleted.
1229 The deleted text itself is the string @var{text}. The place to
1230 reinsert it is @code{(abs @var{position})}. If @var{position} is
1231 positive, point was at the beginning of the deleted text, otherwise it
1232 was at the end.
1233
1234 @item (t @var{high} . @var{low})
1235 This kind of element indicates that an unmodified buffer became
1236 modified. The elements @var{high} and @var{low} are two integers, each
1237 recording 16 bits of the visited file's modification time as of when it
1238 was previously visited or saved. @code{primitive-undo} uses those
1239 values to determine whether to mark the buffer as unmodified once again;
1240 it does so only if the file's modification time matches those numbers.
1241
1242 @item (nil @var{property} @var{value} @var{beg} . @var{end})
1243 This kind of element records a change in a text property.
1244 Here's how you might undo the change:
1245
1246 @example
1247 (put-text-property @var{beg} @var{end} @var{property} @var{value})
1248 @end example
1249
1250 @item (@var{marker} . @var{adjustment})
1251 This kind of element records the fact that the marker @var{marker} was
1252 relocated due to deletion of surrounding text, and that it moved
1253 @var{adjustment} character positions. Undoing this element moves
1254 @var{marker} @minus{} @var{adjustment} characters.
1255
1256 @item (apply @var{funname} . @var{args})
1257 This is an extensible undo item, which is undone by calling
1258 @var{funname} with arguments @var{args}.
1259
1260 @item (apply @var{delta} @var{beg} @var{end} @var{funname} . @var{args})
1261 This is an extensible undo item, which records a change limited to the
1262 range @var{beg} to @var{end}, which increased the size of the buffer
1263 by @var{delta}. It is undone by calling @var{funname} with arguments
1264 @var{args}.
1265
1266 This kind of element enables undo limited to a region to determine
1267 whether the element pertains to that region.
1268
1269 @item nil
1270 This element is a boundary. The elements between two boundaries are
1271 called a @dfn{change group}; normally, each change group corresponds to
1272 one keyboard command, and undo commands normally undo an entire group as
1273 a unit.
1274 @end table
1275
1276 @defun undo-boundary
1277 This function places a boundary element in the undo list. The undo
1278 command stops at such a boundary, and successive undo commands undo
1279 to earlier and earlier boundaries. This function returns @code{nil}.
1280
1281 The editor command loop automatically calls @code{undo-boundary} just
1282 before executing each key sequence, so that each undo normally undoes
1283 the effects of one command. As an exception, the command
1284 @code{self-insert-command}, which produces self-inserting input
1285 characters (@pxref{Commands for Insertion}), may remove the boundary
1286 inserted by the command loop: a boundary is accepted for the first
1287 such character, the next 19 consecutive self-inserting input
1288 characters do not have boundaries, and then the 20th does; and so on
1289 as long as the self-inserting characters continue. Hence, sequences
1290 of consecutive character insertions can be undone as a group.
1291
1292 All buffer modifications add a boundary whenever the previous undoable
1293 change was made in some other buffer. This is to ensure that
1294 each command makes a boundary in each buffer where it makes changes.
1295
1296 Calling this function explicitly is useful for splitting the effects of
1297 a command into more than one unit. For example, @code{query-replace}
1298 calls @code{undo-boundary} after each replacement, so that the user can
1299 undo individual replacements one by one.
1300 @end defun
1301
1302 @defvar undo-in-progress
1303 This variable is normally @code{nil}, but the undo commands bind it to
1304 @code{t}. This is so that various kinds of change hooks can tell when
1305 they're being called for the sake of undoing.
1306 @end defvar
1307
1308 @defun primitive-undo count list
1309 This is the basic function for undoing elements of an undo list.
1310 It undoes the first @var{count} elements of @var{list}, returning
1311 the rest of @var{list}.
1312
1313 @code{primitive-undo} adds elements to the buffer's undo list when it
1314 changes the buffer. Undo commands avoid confusion by saving the undo
1315 list value at the beginning of a sequence of undo operations. Then the
1316 undo operations use and update the saved value. The new elements added
1317 by undoing are not part of this saved value, so they don't interfere with
1318 continuing to undo.
1319
1320 This function does not bind @code{undo-in-progress}.
1321 @end defun
1322
1323 @node Maintaining Undo
1324 @section Maintaining Undo Lists
1325
1326 This section describes how to enable and disable undo information for
1327 a given buffer. It also explains how the undo list is truncated
1328 automatically so it doesn't get too big.
1329
1330 Recording of undo information in a newly created buffer is normally
1331 enabled to start with; but if the buffer name starts with a space, the
1332 undo recording is initially disabled. You can explicitly enable or
1333 disable undo recording with the following two functions, or by setting
1334 @code{buffer-undo-list} yourself.
1335
1336 @deffn Command buffer-enable-undo &optional buffer-or-name
1337 This command enables recording undo information for buffer
1338 @var{buffer-or-name}, so that subsequent changes can be undone. If no
1339 argument is supplied, then the current buffer is used. This function
1340 does nothing if undo recording is already enabled in the buffer. It
1341 returns @code{nil}.
1342
1343 In an interactive call, @var{buffer-or-name} is the current buffer.
1344 You cannot specify any other buffer.
1345 @end deffn
1346
1347 @deffn Command buffer-disable-undo &optional buffer-or-name
1348 @cindex disabling undo
1349 This function discards the undo list of @var{buffer-or-name}, and disables
1350 further recording of undo information. As a result, it is no longer
1351 possible to undo either previous changes or any subsequent changes. If
1352 the undo list of @var{buffer-or-name} is already disabled, this function
1353 has no effect.
1354
1355 This function returns @code{nil}.
1356 @end deffn
1357
1358 As editing continues, undo lists get longer and longer. To prevent
1359 them from using up all available memory space, garbage collection trims
1360 them back to size limits you can set. (For this purpose, the ``size''
1361 of an undo list measures the cons cells that make up the list, plus the
1362 strings of deleted text.) Three variables control the range of acceptable
1363 sizes: @code{undo-limit}, @code{undo-strong-limit} and
1364 @code{undo-outer-limit}. In these variables, size is counted as the
1365 number of bytes occupied, which includes both saved text and other
1366 data.
1367
1368 @defopt undo-limit
1369 This is the soft limit for the acceptable size of an undo list. The
1370 change group at which this size is exceeded is the last one kept.
1371 @end defopt
1372
1373 @defopt undo-strong-limit
1374 This is the upper limit for the acceptable size of an undo list. The
1375 change group at which this size is exceeded is discarded itself (along
1376 with all older change groups). There is one exception: the very latest
1377 change group is only discarded if it exceeds @code{undo-outer-limit}.
1378 @end defopt
1379
1380 @defopt undo-outer-limit
1381 If at garbage collection time the undo info for the current command
1382 exceeds this limit, Emacs discards the info and displays a warning.
1383 This is a last ditch limit to prevent memory overflow.
1384 @end defopt
1385
1386 @defopt undo-ask-before-discard
1387 If this variable is non-@code{nil}, when the undo info exceeds
1388 @code{undo-outer-limit}, Emacs asks in the echo area whether to
1389 discard the info. The default value is @code{nil}, which means to
1390 discard it automatically.
1391
1392 This option is mainly intended for debugging. Garbage collection is
1393 inhibited while the question is asked, which means that Emacs might
1394 leak memory if the user waits too long before answering the question.
1395 @end defopt
1396
1397 @node Filling
1398 @comment node-name, next, previous, up
1399 @section Filling
1400 @cindex filling text
1401
1402 @dfn{Filling} means adjusting the lengths of lines (by moving the line
1403 breaks) so that they are nearly (but no greater than) a specified
1404 maximum width. Additionally, lines can be @dfn{justified}, which means
1405 inserting spaces to make the left and/or right margins line up
1406 precisely. The width is controlled by the variable @code{fill-column}.
1407 For ease of reading, lines should be no longer than 70 or so columns.
1408
1409 You can use Auto Fill mode (@pxref{Auto Filling}) to fill text
1410 automatically as you insert it, but changes to existing text may leave
1411 it improperly filled. Then you must fill the text explicitly.
1412
1413 Most of the commands in this section return values that are not
1414 meaningful. All the functions that do filling take note of the current
1415 left margin, current right margin, and current justification style
1416 (@pxref{Margins}). If the current justification style is
1417 @code{none}, the filling functions don't actually do anything.
1418
1419 Several of the filling functions have an argument @var{justify}.
1420 If it is non-@code{nil}, that requests some kind of justification. It
1421 can be @code{left}, @code{right}, @code{full}, or @code{center}, to
1422 request a specific style of justification. If it is @code{t}, that
1423 means to use the current justification style for this part of the text
1424 (see @code{current-justification}, below). Any other value is treated
1425 as @code{full}.
1426
1427 When you call the filling functions interactively, using a prefix
1428 argument implies the value @code{full} for @var{justify}.
1429
1430 @deffn Command fill-paragraph &optional justify region
1431 This command fills the paragraph at or after point. If
1432 @var{justify} is non-@code{nil}, each line is justified as well.
1433 It uses the ordinary paragraph motion commands to find paragraph
1434 boundaries. @xref{Paragraphs,,, emacs, The GNU Emacs Manual}.
1435
1436 When @var{region} is non-@code{nil}, then if Transient Mark mode is
1437 enabled and the mark is active, this command calls @code{fill-region}
1438 to fill all the paragraphs in the region, instead of filling only the
1439 current paragraph. When this command is called interactively,
1440 @var{region} is @code{t}.
1441 @end deffn
1442
1443 @deffn Command fill-region start end &optional justify nosqueeze to-eop
1444 This command fills each of the paragraphs in the region from @var{start}
1445 to @var{end}. It justifies as well if @var{justify} is
1446 non-@code{nil}.
1447
1448 If @var{nosqueeze} is non-@code{nil}, that means to leave whitespace
1449 other than line breaks untouched. If @var{to-eop} is non-@code{nil},
1450 that means to keep filling to the end of the paragraph---or the next hard
1451 newline, if @code{use-hard-newlines} is enabled (see below).
1452
1453 The variable @code{paragraph-separate} controls how to distinguish
1454 paragraphs. @xref{Standard Regexps}.
1455 @end deffn
1456
1457 @deffn Command fill-individual-paragraphs start end &optional justify citation-regexp
1458 This command fills each paragraph in the region according to its
1459 individual fill prefix. Thus, if the lines of a paragraph were indented
1460 with spaces, the filled paragraph will remain indented in the same
1461 fashion.
1462
1463 The first two arguments, @var{start} and @var{end}, are the beginning
1464 and end of the region to be filled. The third and fourth arguments,
1465 @var{justify} and @var{citation-regexp}, are optional. If
1466 @var{justify} is non-@code{nil}, the paragraphs are justified as
1467 well as filled. If @var{citation-regexp} is non-@code{nil}, it means the
1468 function is operating on a mail message and therefore should not fill
1469 the header lines. If @var{citation-regexp} is a string, it is used as
1470 a regular expression; if it matches the beginning of a line, that line
1471 is treated as a citation marker.
1472
1473 Ordinarily, @code{fill-individual-paragraphs} regards each change in
1474 indentation as starting a new paragraph. If
1475 @code{fill-individual-varying-indent} is non-@code{nil}, then only
1476 separator lines separate paragraphs. That mode can handle indented
1477 paragraphs with additional indentation on the first line.
1478 @end deffn
1479
1480 @defopt fill-individual-varying-indent
1481 This variable alters the action of @code{fill-individual-paragraphs} as
1482 described above.
1483 @end defopt
1484
1485 @deffn Command fill-region-as-paragraph start end &optional justify nosqueeze squeeze-after
1486 This command considers a region of text as a single paragraph and fills
1487 it. If the region was made up of many paragraphs, the blank lines
1488 between paragraphs are removed. This function justifies as well as
1489 filling when @var{justify} is non-@code{nil}.
1490
1491 If @var{nosqueeze} is non-@code{nil}, that means to leave whitespace
1492 other than line breaks untouched. If @var{squeeze-after} is
1493 non-@code{nil}, it specifies a position in the region, and means don't
1494 canonicalize spaces before that position.
1495
1496 In Adaptive Fill mode, this command calls @code{fill-context-prefix} to
1497 choose a fill prefix by default. @xref{Adaptive Fill}.
1498 @end deffn
1499
1500 @deffn Command justify-current-line &optional how eop nosqueeze
1501 This command inserts spaces between the words of the current line so
1502 that the line ends exactly at @code{fill-column}. It returns
1503 @code{nil}.
1504
1505 The argument @var{how}, if non-@code{nil} specifies explicitly the style
1506 of justification. It can be @code{left}, @code{right}, @code{full},
1507 @code{center}, or @code{none}. If it is @code{t}, that means to do
1508 follow specified justification style (see @code{current-justification},
1509 below). @code{nil} means to do full justification.
1510
1511 If @var{eop} is non-@code{nil}, that means do only left-justification
1512 if @code{current-justification} specifies full justification. This is
1513 used for the last line of a paragraph; even if the paragraph as a
1514 whole is fully justified, the last line should not be.
1515
1516 If @var{nosqueeze} is non-@code{nil}, that means do not change interior
1517 whitespace.
1518 @end deffn
1519
1520 @defopt default-justification
1521 This variable's value specifies the style of justification to use for
1522 text that doesn't specify a style with a text property. The possible
1523 values are @code{left}, @code{right}, @code{full}, @code{center}, or
1524 @code{none}. The default value is @code{left}.
1525 @end defopt
1526
1527 @defun current-justification
1528 This function returns the proper justification style to use for filling
1529 the text around point.
1530
1531 This returns the value of the @code{justification} text property at
1532 point, or the variable @var{default-justification} if there is no such
1533 text property. However, it returns @code{nil} rather than @code{none}
1534 to mean ``don't justify''.
1535 @end defun
1536
1537 @defopt sentence-end-double-space
1538 @anchor{Definition of sentence-end-double-space}
1539 If this variable is non-@code{nil}, a period followed by just one space
1540 does not count as the end of a sentence, and the filling functions
1541 avoid breaking the line at such a place.
1542 @end defopt
1543
1544 @defopt sentence-end-without-period
1545 If this variable is non-@code{nil}, a sentence can end without a
1546 period. This is used for languages like Thai, where sentences end
1547 with a double space but without a period.
1548 @end defopt
1549
1550 @defopt sentence-end-without-space
1551 If this variable is non-@code{nil}, it should be a string of
1552 characters that can end a sentence without following spaces.
1553 @end defopt
1554
1555 @defvar fill-paragraph-function
1556 This variable provides a way to override the filling of paragraphs.
1557 If its value is non-@code{nil}, @code{fill-paragraph} calls this
1558 function to do the work. If the function returns a non-@code{nil}
1559 value, @code{fill-paragraph} assumes the job is done, and immediately
1560 returns that value.
1561
1562 The usual use of this feature is to fill comments in programming
1563 language modes. If the function needs to fill a paragraph in the usual
1564 way, it can do so as follows:
1565
1566 @example
1567 (let ((fill-paragraph-function nil))
1568 (fill-paragraph arg))
1569 @end example
1570 @end defvar
1571
1572 @defvar fill-forward-paragraph-function
1573 This variable provides a way to override how the filling functions,
1574 such as @code{fill-region} and @code{fill-paragraph}, move forward to
1575 the next paragraph. Its value should be a function, which is called
1576 with a single argument @var{n}, the number of paragraphs to move, and
1577 should return the difference between @var{n} and the number of
1578 paragraphs actually moved. The default value of this variable is
1579 @code{forward-paragraph}. @xref{Paragraphs,,, emacs, The GNU Emacs
1580 Manual}.
1581 @end defvar
1582
1583 @defvar use-hard-newlines
1584 If this variable is non-@code{nil}, the filling functions do not delete
1585 newlines that have the @code{hard} text property. These ``hard
1586 newlines'' act as paragraph separators.
1587 @end defvar
1588
1589 @node Margins
1590 @section Margins for Filling
1591
1592 @defopt fill-prefix
1593 This buffer-local variable, if non-@code{nil}, specifies a string of
1594 text that appears at the beginning of normal text lines and should be
1595 disregarded when filling them. Any line that fails to start with the
1596 fill prefix is considered the start of a paragraph; so is any line
1597 that starts with the fill prefix followed by additional whitespace.
1598 Lines that start with the fill prefix but no additional whitespace are
1599 ordinary text lines that can be filled together. The resulting filled
1600 lines also start with the fill prefix.
1601
1602 The fill prefix follows the left margin whitespace, if any.
1603 @end defopt
1604
1605 @defopt fill-column
1606 This buffer-local variable specifies the maximum width of filled lines.
1607 Its value should be an integer, which is a number of columns. All the
1608 filling, justification, and centering commands are affected by this
1609 variable, including Auto Fill mode (@pxref{Auto Filling}).
1610
1611 As a practical matter, if you are writing text for other people to
1612 read, you should set @code{fill-column} to no more than 70. Otherwise
1613 the line will be too long for people to read comfortably, and this can
1614 make the text seem clumsy.
1615
1616 The default value for @code{fill-column} is 70.
1617 @end defopt
1618
1619 @deffn Command set-left-margin from to margin
1620 This sets the @code{left-margin} property on the text from @var{from} to
1621 @var{to} to the value @var{margin}. If Auto Fill mode is enabled, this
1622 command also refills the region to fit the new margin.
1623 @end deffn
1624
1625 @deffn Command set-right-margin from to margin
1626 This sets the @code{right-margin} property on the text from @var{from}
1627 to @var{to} to the value @var{margin}. If Auto Fill mode is enabled,
1628 this command also refills the region to fit the new margin.
1629 @end deffn
1630
1631 @defun current-left-margin
1632 This function returns the proper left margin value to use for filling
1633 the text around point. The value is the sum of the @code{left-margin}
1634 property of the character at the start of the current line (or zero if
1635 none), and the value of the variable @code{left-margin}.
1636 @end defun
1637
1638 @defun current-fill-column
1639 This function returns the proper fill column value to use for filling
1640 the text around point. The value is the value of the @code{fill-column}
1641 variable, minus the value of the @code{right-margin} property of the
1642 character after point.
1643 @end defun
1644
1645 @deffn Command move-to-left-margin &optional n force
1646 This function moves point to the left margin of the current line. The
1647 column moved to is determined by calling the function
1648 @code{current-left-margin}. If the argument @var{n} is non-@code{nil},
1649 @code{move-to-left-margin} moves forward @var{n}@minus{}1 lines first.
1650
1651 If @var{force} is non-@code{nil}, that says to fix the line's
1652 indentation if that doesn't match the left margin value.
1653 @end deffn
1654
1655 @defun delete-to-left-margin &optional from to
1656 This function removes left margin indentation from the text between
1657 @var{from} and @var{to}. The amount of indentation to delete is
1658 determined by calling @code{current-left-margin}. In no case does this
1659 function delete non-whitespace. If @var{from} and @var{to} are omitted,
1660 they default to the whole buffer.
1661 @end defun
1662
1663 @defun indent-to-left-margin
1664 This function adjusts the indentation at the beginning of the current
1665 line to the value specified by the variable @code{left-margin}. (That
1666 may involve either inserting or deleting whitespace.) This function
1667 is value of @code{indent-line-function} in Paragraph-Indent Text mode.
1668 @end defun
1669
1670 @defopt left-margin
1671 This variable specifies the base left margin column. In Fundamental
1672 mode, @kbd{C-j} indents to this column. This variable automatically
1673 becomes buffer-local when set in any fashion.
1674 @end defopt
1675
1676 @defopt fill-nobreak-predicate
1677 This variable gives major modes a way to specify not to break a line
1678 at certain places. Its value should be a list of functions. Whenever
1679 filling considers breaking the line at a certain place in the buffer,
1680 it calls each of these functions with no arguments and with point
1681 located at that place. If any of the functions returns
1682 non-@code{nil}, then the line won't be broken there.
1683 @end defopt
1684
1685 @node Adaptive Fill
1686 @section Adaptive Fill Mode
1687 @c @cindex Adaptive Fill mode "adaptive-fill-mode" is adjacent.
1688
1689 When @dfn{Adaptive Fill Mode} is enabled, Emacs determines the fill
1690 prefix automatically from the text in each paragraph being filled
1691 rather than using a predetermined value. During filling, this fill
1692 prefix gets inserted at the start of the second and subsequent lines
1693 of the paragraph as described in @ref{Filling}, and in @ref{Auto
1694 Filling}.
1695
1696 @defopt adaptive-fill-mode
1697 Adaptive Fill mode is enabled when this variable is non-@code{nil}.
1698 It is @code{t} by default.
1699 @end defopt
1700
1701 @defun fill-context-prefix from to
1702 This function implements the heart of Adaptive Fill mode; it chooses a
1703 fill prefix based on the text between @var{from} and @var{to},
1704 typically the start and end of a paragraph. It does this by looking
1705 at the first two lines of the paragraph, based on the variables
1706 described below.
1707 @c The optional argument first-line-regexp is not documented
1708 @c because it exists for internal purposes and might be eliminated
1709 @c in the future.
1710
1711 Usually, this function returns the fill prefix, a string. However,
1712 before doing this, the function makes a final check (not specially
1713 mentioned in the following) that a line starting with this prefix
1714 wouldn't look like the start of a paragraph. Should this happen, the
1715 function signals the anomaly by returning @code{nil} instead.
1716
1717 In detail, @code{fill-context-prefix} does this:
1718
1719 @enumerate
1720 @item
1721 It takes a candidate for the fill prefix from the first line---it
1722 tries first the function in @code{adaptive-fill-function} (if any),
1723 then the regular expression @code{adaptive-fill-regexp} (see below).
1724 The first non-@code{nil} result of these, or the empty string if
1725 they're both @code{nil}, becomes the first line's candidate.
1726 @item
1727 If the paragraph has as yet only one line, the function tests the
1728 validity of the prefix candidate just found. The function then
1729 returns the candidate if it's valid, or a string of spaces otherwise.
1730 (see the description of @code{adaptive-fill-first-line-regexp} below).
1731 @item
1732 When the paragraph already has two lines, the function next looks for
1733 a prefix candidate on the second line, in just the same way it did for
1734 the first line. If it doesn't find one, it returns @code{nil}.
1735 @item
1736 The function now compares the two candidate prefixes heuristically: if
1737 the non-whitespace characters in the line 2 candidate occur in the
1738 same order in the line 1 candidate, the function returns the line 2
1739 candidate. Otherwise, it returns the largest initial substring which
1740 is common to both candidates (which might be the empty string).
1741 @end enumerate
1742 @end defun
1743
1744 @defopt adaptive-fill-regexp
1745 Adaptive Fill mode matches this regular expression against the text
1746 starting after the left margin whitespace (if any) on a line; the
1747 characters it matches are that line's candidate for the fill prefix.
1748
1749 The default value matches whitespace with certain punctuation
1750 characters intermingled.
1751 @end defopt
1752
1753 @defopt adaptive-fill-first-line-regexp
1754 Used only in one-line paragraphs, this regular expression acts as an
1755 additional check of the validity of the one available candidate fill
1756 prefix: the candidate must match this regular expression, or match
1757 @code{comment-start-skip}. If it doesn't, @code{fill-context-prefix}
1758 replaces the candidate with a string of spaces ``of the same width''
1759 as it.
1760
1761 The default value of this variable is @w{@code{"\\`[ \t]*\\'"}}, which
1762 matches only a string of whitespace. The effect of this default is to
1763 force the fill prefixes found in one-line paragraphs always to be pure
1764 whitespace.
1765 @end defopt
1766
1767 @defopt adaptive-fill-function
1768 You can specify more complex ways of choosing a fill prefix
1769 automatically by setting this variable to a function. The function is
1770 called with point after the left margin (if any) of a line, and it
1771 must preserve point. It should return either ``that line's'' fill
1772 prefix or @code{nil}, meaning it has failed to determine a prefix.
1773 @end defopt
1774
1775 @node Auto Filling
1776 @comment node-name, next, previous, up
1777 @section Auto Filling
1778 @cindex filling, automatic
1779 @cindex Auto Fill mode
1780
1781 Auto Fill mode is a minor mode that fills lines automatically as text
1782 is inserted. This section describes the hook used by Auto Fill mode.
1783 For a description of functions that you can call explicitly to fill and
1784 justify existing text, see @ref{Filling}.
1785
1786 Auto Fill mode also enables the functions that change the margins and
1787 justification style to refill portions of the text. @xref{Margins}.
1788
1789 @defvar auto-fill-function
1790 The value of this buffer-local variable should be a function (of no
1791 arguments) to be called after self-inserting a character from the table
1792 @code{auto-fill-chars}. It may be @code{nil}, in which case nothing
1793 special is done in that case.
1794
1795 The value of @code{auto-fill-function} is @code{do-auto-fill} when
1796 Auto-Fill mode is enabled. That is a function whose sole purpose is to
1797 implement the usual strategy for breaking a line.
1798
1799 @quotation
1800 In older Emacs versions, this variable was named @code{auto-fill-hook},
1801 but since it is not called with the standard convention for hooks, it
1802 was renamed to @code{auto-fill-function} in version 19.
1803 @end quotation
1804 @end defvar
1805
1806 @defvar normal-auto-fill-function
1807 This variable specifies the function to use for
1808 @code{auto-fill-function}, if and when Auto Fill is turned on. Major
1809 modes can set buffer-local values for this variable to alter how Auto
1810 Fill works.
1811 @end defvar
1812
1813 @defvar auto-fill-chars
1814 A char table of characters which invoke @code{auto-fill-function} when
1815 self-inserted---space and newline in most language environments. They
1816 have an entry @code{t} in the table.
1817 @end defvar
1818
1819 @node Sorting
1820 @section Sorting Text
1821 @cindex sorting text
1822
1823 The sorting functions described in this section all rearrange text in
1824 a buffer. This is in contrast to the function @code{sort}, which
1825 rearranges the order of the elements of a list (@pxref{Rearrangement}).
1826 The values returned by these functions are not meaningful.
1827
1828 @defun sort-subr reverse nextrecfun endrecfun &optional startkeyfun endkeyfun predicate
1829 This function is the general text-sorting routine that subdivides a
1830 buffer into records and then sorts them. Most of the commands in this
1831 section use this function.
1832
1833 To understand how @code{sort-subr} works, consider the whole accessible
1834 portion of the buffer as being divided into disjoint pieces called
1835 @dfn{sort records}. The records may or may not be contiguous, but they
1836 must not overlap. A portion of each sort record (perhaps all of it) is
1837 designated as the sort key. Sorting rearranges the records in order by
1838 their sort keys.
1839
1840 Usually, the records are rearranged in order of ascending sort key.
1841 If the first argument to the @code{sort-subr} function, @var{reverse},
1842 is non-@code{nil}, the sort records are rearranged in order of
1843 descending sort key.
1844
1845 The next four arguments to @code{sort-subr} are functions that are
1846 called to move point across a sort record. They are called many times
1847 from within @code{sort-subr}.
1848
1849 @enumerate
1850 @item
1851 @var{nextrecfun} is called with point at the end of a record. This
1852 function moves point to the start of the next record. The first record
1853 is assumed to start at the position of point when @code{sort-subr} is
1854 called. Therefore, you should usually move point to the beginning of
1855 the buffer before calling @code{sort-subr}.
1856
1857 This function can indicate there are no more sort records by leaving
1858 point at the end of the buffer.
1859
1860 @item
1861 @var{endrecfun} is called with point within a record. It moves point to
1862 the end of the record.
1863
1864 @item
1865 @var{startkeyfun} is called to move point from the start of a record to
1866 the start of the sort key. This argument is optional; if it is omitted,
1867 the whole record is the sort key. If supplied, the function should
1868 either return a non-@code{nil} value to be used as the sort key, or
1869 return @code{nil} to indicate that the sort key is in the buffer
1870 starting at point. In the latter case, @var{endkeyfun} is called to
1871 find the end of the sort key.
1872
1873 @item
1874 @var{endkeyfun} is called to move point from the start of the sort key
1875 to the end of the sort key. This argument is optional. If
1876 @var{startkeyfun} returns @code{nil} and this argument is omitted (or
1877 @code{nil}), then the sort key extends to the end of the record. There
1878 is no need for @var{endkeyfun} if @var{startkeyfun} returns a
1879 non-@code{nil} value.
1880 @end enumerate
1881
1882 The argument @var{predicate} is the function to use to compare keys.
1883 If keys are numbers, it defaults to @code{<}; otherwise it defaults to
1884 @code{string<}.
1885
1886 As an example of @code{sort-subr}, here is the complete function
1887 definition for @code{sort-lines}:
1888
1889 @example
1890 @group
1891 ;; @r{Note that the first two lines of doc string}
1892 ;; @r{are effectively one line when viewed by a user.}
1893 (defun sort-lines (reverse beg end)
1894 "Sort lines in region alphabetically;\
1895 argument means descending order.
1896 Called from a program, there are three arguments:
1897 @end group
1898 @group
1899 REVERSE (non-nil means reverse order),\
1900 BEG and END (region to sort).
1901 The variable `sort-fold-case' determines\
1902 whether alphabetic case affects
1903 the sort order."
1904 @end group
1905 @group
1906 (interactive "P\nr")
1907 (save-excursion
1908 (save-restriction
1909 (narrow-to-region beg end)
1910 (goto-char (point-min))
1911 (let ((inhibit-field-text-motion t))
1912 (sort-subr reverse 'forward-line 'end-of-line)))))
1913 @end group
1914 @end example
1915
1916 Here @code{forward-line} moves point to the start of the next record,
1917 and @code{end-of-line} moves point to the end of record. We do not pass
1918 the arguments @var{startkeyfun} and @var{endkeyfun}, because the entire
1919 record is used as the sort key.
1920
1921 The @code{sort-paragraphs} function is very much the same, except that
1922 its @code{sort-subr} call looks like this:
1923
1924 @example
1925 @group
1926 (sort-subr reverse
1927 (function
1928 (lambda ()
1929 (while (and (not (eobp))
1930 (looking-at paragraph-separate))
1931 (forward-line 1))))
1932 'forward-paragraph)
1933 @end group
1934 @end example
1935
1936 Markers pointing into any sort records are left with no useful
1937 position after @code{sort-subr} returns.
1938 @end defun
1939
1940 @defopt sort-fold-case
1941 If this variable is non-@code{nil}, @code{sort-subr} and the other
1942 buffer sorting functions ignore case when comparing strings.
1943 @end defopt
1944
1945 @deffn Command sort-regexp-fields reverse record-regexp key-regexp start end
1946 This command sorts the region between @var{start} and @var{end}
1947 alphabetically as specified by @var{record-regexp} and @var{key-regexp}.
1948 If @var{reverse} is a negative integer, then sorting is in reverse
1949 order.
1950
1951 Alphabetical sorting means that two sort keys are compared by
1952 comparing the first characters of each, the second characters of each,
1953 and so on. If a mismatch is found, it means that the sort keys are
1954 unequal; the sort key whose character is less at the point of first
1955 mismatch is the lesser sort key. The individual characters are compared
1956 according to their numerical character codes in the Emacs character set.
1957
1958 The value of the @var{record-regexp} argument specifies how to divide
1959 the buffer into sort records. At the end of each record, a search is
1960 done for this regular expression, and the text that matches it is taken
1961 as the next record. For example, the regular expression @samp{^.+$},
1962 which matches lines with at least one character besides a newline, would
1963 make each such line into a sort record. @xref{Regular Expressions}, for
1964 a description of the syntax and meaning of regular expressions.
1965
1966 The value of the @var{key-regexp} argument specifies what part of each
1967 record is the sort key. The @var{key-regexp} could match the whole
1968 record, or only a part. In the latter case, the rest of the record has
1969 no effect on the sorted order of records, but it is carried along when
1970 the record moves to its new position.
1971
1972 The @var{key-regexp} argument can refer to the text matched by a
1973 subexpression of @var{record-regexp}, or it can be a regular expression
1974 on its own.
1975
1976 If @var{key-regexp} is:
1977
1978 @table @asis
1979 @item @samp{\@var{digit}}
1980 then the text matched by the @var{digit}th @samp{\(...\)} parenthesis
1981 grouping in @var{record-regexp} is the sort key.
1982
1983 @item @samp{\&}
1984 then the whole record is the sort key.
1985
1986 @item a regular expression
1987 then @code{sort-regexp-fields} searches for a match for the regular
1988 expression within the record. If such a match is found, it is the sort
1989 key. If there is no match for @var{key-regexp} within a record then
1990 that record is ignored, which means its position in the buffer is not
1991 changed. (The other records may move around it.)
1992 @end table
1993
1994 For example, if you plan to sort all the lines in the region by the
1995 first word on each line starting with the letter @samp{f}, you should
1996 set @var{record-regexp} to @samp{^.*$} and set @var{key-regexp} to
1997 @samp{\<f\w*\>}. The resulting expression looks like this:
1998
1999 @example
2000 @group
2001 (sort-regexp-fields nil "^.*$" "\\<f\\w*\\>"
2002 (region-beginning)
2003 (region-end))
2004 @end group
2005 @end example
2006
2007 If you call @code{sort-regexp-fields} interactively, it prompts for
2008 @var{record-regexp} and @var{key-regexp} in the minibuffer.
2009 @end deffn
2010
2011 @deffn Command sort-lines reverse start end
2012 This command alphabetically sorts lines in the region between
2013 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2014 is in reverse order.
2015 @end deffn
2016
2017 @deffn Command sort-paragraphs reverse start end
2018 This command alphabetically sorts paragraphs in the region between
2019 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2020 is in reverse order.
2021 @end deffn
2022
2023 @deffn Command sort-pages reverse start end
2024 This command alphabetically sorts pages in the region between
2025 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2026 is in reverse order.
2027 @end deffn
2028
2029 @deffn Command sort-fields field start end
2030 This command sorts lines in the region between @var{start} and
2031 @var{end}, comparing them alphabetically by the @var{field}th field
2032 of each line. Fields are separated by whitespace and numbered starting
2033 from 1. If @var{field} is negative, sorting is by the
2034 @w{@minus{}@var{field}th} field from the end of the line. This command
2035 is useful for sorting tables.
2036 @end deffn
2037
2038 @deffn Command sort-numeric-fields field start end
2039 This command sorts lines in the region between @var{start} and
2040 @var{end}, comparing them numerically by the @var{field}th field of
2041 each line. Fields are separated by whitespace and numbered starting
2042 from 1. The specified field must contain a number in each line of the
2043 region. Numbers starting with 0 are treated as octal, and numbers
2044 starting with @samp{0x} are treated as hexadecimal.
2045
2046 If @var{field} is negative, sorting is by the
2047 @w{@minus{}@var{field}th} field from the end of the line. This
2048 command is useful for sorting tables.
2049 @end deffn
2050
2051 @defopt sort-numeric-base
2052 This variable specifies the default radix for
2053 @code{sort-numeric-fields} to parse numbers.
2054 @end defopt
2055
2056 @deffn Command sort-columns reverse &optional beg end
2057 This command sorts the lines in the region between @var{beg} and
2058 @var{end}, comparing them alphabetically by a certain range of
2059 columns. The column positions of @var{beg} and @var{end} bound the
2060 range of columns to sort on.
2061
2062 If @var{reverse} is non-@code{nil}, the sort is in reverse order.
2063
2064 One unusual thing about this command is that the entire line
2065 containing position @var{beg}, and the entire line containing position
2066 @var{end}, are included in the region sorted.
2067
2068 Note that @code{sort-columns} rejects text that contains tabs, because
2069 tabs could be split across the specified columns. Use @kbd{M-x
2070 untabify} to convert tabs to spaces before sorting.
2071
2072 When possible, this command actually works by calling the @code{sort}
2073 utility program.
2074 @end deffn
2075
2076 @node Columns
2077 @comment node-name, next, previous, up
2078 @section Counting Columns
2079 @cindex columns
2080 @cindex counting columns
2081 @cindex horizontal position
2082
2083 The column functions convert between a character position (counting
2084 characters from the beginning of the buffer) and a column position
2085 (counting screen characters from the beginning of a line).
2086
2087 These functions count each character according to the number of
2088 columns it occupies on the screen. This means control characters count
2089 as occupying 2 or 4 columns, depending upon the value of
2090 @code{ctl-arrow}, and tabs count as occupying a number of columns that
2091 depends on the value of @code{tab-width} and on the column where the tab
2092 begins. @xref{Usual Display}.
2093
2094 Column number computations ignore the width of the window and the
2095 amount of horizontal scrolling. Consequently, a column value can be
2096 arbitrarily high. The first (or leftmost) column is numbered 0. They
2097 also ignore overlays and text properties, aside from invisibility.
2098
2099 @defun current-column
2100 This function returns the horizontal position of point, measured in
2101 columns, counting from 0 at the left margin. The column position is the
2102 sum of the widths of all the displayed representations of the characters
2103 between the start of the current line and point.
2104
2105 For an example of using @code{current-column}, see the description of
2106 @code{count-lines} in @ref{Text Lines}.
2107 @end defun
2108
2109 @deffn Command move-to-column column &optional force
2110 This function moves point to @var{column} in the current line. The
2111 calculation of @var{column} takes into account the widths of the
2112 displayed representations of the characters between the start of the
2113 line and point.
2114
2115 When called interactively, @var{column} is the value of prefix numeric
2116 argument. If @var{column} is not an integer, an error is signaled.
2117
2118 If column @var{column} is beyond the end of the line, point moves to
2119 the end of the line. If @var{column} is negative, point moves to the
2120 beginning of the line.
2121
2122 If it is impossible to move to column @var{column} because that is in
2123 the middle of a multicolumn character such as a tab, point moves to the
2124 end of that character. However, if @var{force} is non-@code{nil}, and
2125 @var{column} is in the middle of a tab, then @code{move-to-column}
2126 converts the tab into spaces so that it can move precisely to column
2127 @var{column}. Other multicolumn characters can cause anomalies despite
2128 @var{force}, since there is no way to split them.
2129
2130 The argument @var{force} also has an effect if the line isn't long
2131 enough to reach column @var{column}; if it is @code{t}, that means to
2132 add whitespace at the end of the line to reach that column.
2133
2134 The return value is the column number actually moved to.
2135 @end deffn
2136
2137 @node Indentation
2138 @section Indentation
2139 @cindex indentation
2140
2141 The indentation functions are used to examine, move to, and change
2142 whitespace that is at the beginning of a line. Some of the functions
2143 can also change whitespace elsewhere on a line. Columns and indentation
2144 count from zero at the left margin.
2145
2146 @menu
2147 * Primitive Indent:: Functions used to count and insert indentation.
2148 * Mode-Specific Indent:: Customize indentation for different modes.
2149 * Region Indent:: Indent all the lines in a region.
2150 * Relative Indent:: Indent the current line based on previous lines.
2151 * Indent Tabs:: Adjustable, typewriter-like tab stops.
2152 * Motion by Indent:: Move to first non-blank character.
2153 @end menu
2154
2155 @node Primitive Indent
2156 @subsection Indentation Primitives
2157
2158 This section describes the primitive functions used to count and
2159 insert indentation. The functions in the following sections use these
2160 primitives. @xref{Width}, for related functions.
2161
2162 @defun current-indentation
2163 @comment !!Type Primitive Function
2164 @comment !!SourceFile indent.c
2165 This function returns the indentation of the current line, which is
2166 the horizontal position of the first nonblank character. If the
2167 contents are entirely blank, then this is the horizontal position of the
2168 end of the line.
2169 @end defun
2170
2171 @deffn Command indent-to column &optional minimum
2172 @comment !!Type Primitive Function
2173 @comment !!SourceFile indent.c
2174 This function indents from point with tabs and spaces until @var{column}
2175 is reached. If @var{minimum} is specified and non-@code{nil}, then at
2176 least that many spaces are inserted even if this requires going beyond
2177 @var{column}. Otherwise the function does nothing if point is already
2178 beyond @var{column}. The value is the column at which the inserted
2179 indentation ends.
2180
2181 The inserted whitespace characters inherit text properties from the
2182 surrounding text (usually, from the preceding text only). @xref{Sticky
2183 Properties}.
2184 @end deffn
2185
2186 @defopt indent-tabs-mode
2187 @comment !!SourceFile indent.c
2188 If this variable is non-@code{nil}, indentation functions can insert
2189 tabs as well as spaces. Otherwise, they insert only spaces. Setting
2190 this variable automatically makes it buffer-local in the current buffer.
2191 @end defopt
2192
2193 @node Mode-Specific Indent
2194 @subsection Indentation Controlled by Major Mode
2195
2196 An important function of each major mode is to customize the @key{TAB}
2197 key to indent properly for the language being edited. This section
2198 describes the mechanism of the @key{TAB} key and how to control it.
2199 The functions in this section return unpredictable values.
2200
2201 @defvar indent-line-function
2202 This variable's value is the function to be used by @key{TAB} (and
2203 various commands) to indent the current line. The command
2204 @code{indent-according-to-mode} does little more than call this function.
2205
2206 In Lisp mode, the value is the symbol @code{lisp-indent-line}; in C
2207 mode, @code{c-indent-line}; in Fortran mode, @code{fortran-indent-line}.
2208 The default value is @code{indent-relative}. @xref{Auto-Indentation}.
2209 @end defvar
2210
2211 @deffn Command indent-according-to-mode
2212 This command calls the function in @code{indent-line-function} to
2213 indent the current line in a way appropriate for the current major mode.
2214 @end deffn
2215
2216 @deffn Command indent-for-tab-command &optional rigid
2217 This command calls the function in @code{indent-line-function} to
2218 indent the current line; however, if that function is
2219 @code{indent-to-left-margin}, @code{insert-tab} is called instead.
2220 (That is a trivial command that inserts a tab character.) If
2221 @var{rigid} is non-@code{nil}, this function also rigidly indents the
2222 entire balanced expression that starts at the beginning of the current
2223 line, to reflect change in indentation of the current line.
2224 @end deffn
2225
2226 @deffn Command newline-and-indent
2227 This function inserts a newline, then indents the new line (the one
2228 following the newline just inserted) according to the major mode.
2229
2230 It does indentation by calling the current @code{indent-line-function}.
2231 In programming language modes, this is the same thing @key{TAB} does,
2232 but in some text modes, where @key{TAB} inserts a tab,
2233 @code{newline-and-indent} indents to the column specified by
2234 @code{left-margin}.
2235 @end deffn
2236
2237 @deffn Command reindent-then-newline-and-indent
2238 @comment !!SourceFile simple.el
2239 This command reindents the current line, inserts a newline at point,
2240 and then indents the new line (the one following the newline just
2241 inserted).
2242
2243 This command does indentation on both lines according to the current
2244 major mode, by calling the current value of @code{indent-line-function}.
2245 In programming language modes, this is the same thing @key{TAB} does,
2246 but in some text modes, where @key{TAB} inserts a tab,
2247 @code{reindent-then-newline-and-indent} indents to the column specified
2248 by @code{left-margin}.
2249 @end deffn
2250
2251 @node Region Indent
2252 @subsection Indenting an Entire Region
2253
2254 This section describes commands that indent all the lines in the
2255 region. They return unpredictable values.
2256
2257 @deffn Command indent-region start end &optional to-column
2258 This command indents each nonblank line starting between @var{start}
2259 (inclusive) and @var{end} (exclusive). If @var{to-column} is
2260 @code{nil}, @code{indent-region} indents each nonblank line by calling
2261 the current mode's indentation function, the value of
2262 @code{indent-line-function}.
2263
2264 If @var{to-column} is non-@code{nil}, it should be an integer
2265 specifying the number of columns of indentation; then this function
2266 gives each line exactly that much indentation, by either adding or
2267 deleting whitespace.
2268
2269 If there is a fill prefix, @code{indent-region} indents each line
2270 by making it start with the fill prefix.
2271 @end deffn
2272
2273 @defvar indent-region-function
2274 The value of this variable is a function that can be used by
2275 @code{indent-region} as a short cut. It should take two arguments, the
2276 start and end of the region. You should design the function so
2277 that it will produce the same results as indenting the lines of the
2278 region one by one, but presumably faster.
2279
2280 If the value is @code{nil}, there is no short cut, and
2281 @code{indent-region} actually works line by line.
2282
2283 A short-cut function is useful in modes such as C mode and Lisp mode,
2284 where the @code{indent-line-function} must scan from the beginning of
2285 the function definition: applying it to each line would be quadratic in
2286 time. The short cut can update the scan information as it moves through
2287 the lines indenting them; this takes linear time. In a mode where
2288 indenting a line individually is fast, there is no need for a short cut.
2289
2290 @code{indent-region} with a non-@code{nil} argument @var{to-column} has
2291 a different meaning and does not use this variable.
2292 @end defvar
2293
2294 @deffn Command indent-rigidly start end count
2295 This command indents all lines starting between @var{start}
2296 (inclusive) and @var{end} (exclusive) sideways by @var{count} columns.
2297 This ``preserves the shape'' of the affected region, moving it as a
2298 rigid unit. Consequently, this command is useful not only for indenting
2299 regions of unindented text, but also for indenting regions of formatted
2300 code.
2301
2302 For example, if @var{count} is 3, this command adds 3 columns of
2303 indentation to each of the lines beginning in the region specified.
2304
2305 In Mail mode, @kbd{C-c C-y} (@code{mail-yank-original}) uses
2306 @code{indent-rigidly} to indent the text copied from the message being
2307 replied to.
2308 @end deffn
2309
2310 @deffn Command indent-code-rigidly start end columns &optional nochange-regexp
2311 This is like @code{indent-rigidly}, except that it doesn't alter lines
2312 that start within strings or comments.
2313
2314 In addition, it doesn't alter a line if @var{nochange-regexp} matches at
2315 the beginning of the line (if @var{nochange-regexp} is non-@code{nil}).
2316 @end deffn
2317
2318 @node Relative Indent
2319 @subsection Indentation Relative to Previous Lines
2320
2321 This section describes two commands that indent the current line
2322 based on the contents of previous lines.
2323
2324 @deffn Command indent-relative &optional unindented-ok
2325 This command inserts whitespace at point, extending to the same
2326 column as the next @dfn{indent point} of the previous nonblank line. An
2327 indent point is a non-whitespace character following whitespace. The
2328 next indent point is the first one at a column greater than the current
2329 column of point. For example, if point is underneath and to the left of
2330 the first non-blank character of a line of text, it moves to that column
2331 by inserting whitespace.
2332
2333 If the previous nonblank line has no next indent point (i.e., none at a
2334 great enough column position), @code{indent-relative} either does
2335 nothing (if @var{unindented-ok} is non-@code{nil}) or calls
2336 @code{tab-to-tab-stop}. Thus, if point is underneath and to the right
2337 of the last column of a short line of text, this command ordinarily
2338 moves point to the next tab stop by inserting whitespace.
2339
2340 The return value of @code{indent-relative} is unpredictable.
2341
2342 In the following example, point is at the beginning of the second
2343 line:
2344
2345 @example
2346 @group
2347 This line is indented twelve spaces.
2348 @point{}The quick brown fox jumped.
2349 @end group
2350 @end example
2351
2352 @noindent
2353 Evaluation of the expression @code{(indent-relative nil)} produces the
2354 following:
2355
2356 @example
2357 @group
2358 This line is indented twelve spaces.
2359 @point{}The quick brown fox jumped.
2360 @end group
2361 @end example
2362
2363 In this next example, point is between the @samp{m} and @samp{p} of
2364 @samp{jumped}:
2365
2366 @example
2367 @group
2368 This line is indented twelve spaces.
2369 The quick brown fox jum@point{}ped.
2370 @end group
2371 @end example
2372
2373 @noindent
2374 Evaluation of the expression @code{(indent-relative nil)} produces the
2375 following:
2376
2377 @example
2378 @group
2379 This line is indented twelve spaces.
2380 The quick brown fox jum @point{}ped.
2381 @end group
2382 @end example
2383 @end deffn
2384
2385 @deffn Command indent-relative-maybe
2386 @comment !!SourceFile indent.el
2387 This command indents the current line like the previous nonblank line,
2388 by calling @code{indent-relative} with @code{t} as the
2389 @var{unindented-ok} argument. The return value is unpredictable.
2390
2391 If the previous nonblank line has no indent points beyond the current
2392 column, this command does nothing.
2393 @end deffn
2394
2395 @node Indent Tabs
2396 @comment node-name, next, previous, up
2397 @subsection Adjustable ``Tab Stops''
2398 @cindex tabs stops for indentation
2399
2400 This section explains the mechanism for user-specified ``tab stops''
2401 and the mechanisms that use and set them. The name ``tab stops'' is
2402 used because the feature is similar to that of the tab stops on a
2403 typewriter. The feature works by inserting an appropriate number of
2404 spaces and tab characters to reach the next tab stop column; it does not
2405 affect the display of tab characters in the buffer (@pxref{Usual
2406 Display}). Note that the @key{TAB} character as input uses this tab
2407 stop feature only in a few major modes, such as Text mode.
2408 @xref{Tab Stops,,, emacs, The GNU Emacs Manual}.
2409
2410 @deffn Command tab-to-tab-stop
2411 This command inserts spaces or tabs before point, up to the next tab
2412 stop column defined by @code{tab-stop-list}. It searches the list for
2413 an element greater than the current column number, and uses that element
2414 as the column to indent to. It does nothing if no such element is
2415 found.
2416 @end deffn
2417
2418 @defopt tab-stop-list
2419 This variable is the list of tab stop columns used by
2420 @code{tab-to-tab-stops}. The elements should be integers in increasing
2421 order. The tab stop columns need not be evenly spaced.
2422
2423 Use @kbd{M-x edit-tab-stops} to edit the location of tab stops
2424 interactively.
2425 @end defopt
2426
2427 @node Motion by Indent
2428 @subsection Indentation-Based Motion Commands
2429
2430 These commands, primarily for interactive use, act based on the
2431 indentation in the text.
2432
2433 @deffn Command back-to-indentation
2434 @comment !!SourceFile simple.el
2435 This command moves point to the first non-whitespace character in the
2436 current line (which is the line in which point is located). It returns
2437 @code{nil}.
2438 @end deffn
2439
2440 @deffn Command backward-to-indentation &optional arg
2441 @comment !!SourceFile simple.el
2442 This command moves point backward @var{arg} lines and then to the
2443 first nonblank character on that line. It returns @code{nil}.
2444 If @var{arg} is omitted or @code{nil}, it defaults to 1.
2445 @end deffn
2446
2447 @deffn Command forward-to-indentation &optional arg
2448 @comment !!SourceFile simple.el
2449 This command moves point forward @var{arg} lines and then to the first
2450 nonblank character on that line. It returns @code{nil}.
2451 If @var{arg} is omitted or @code{nil}, it defaults to 1.
2452 @end deffn
2453
2454 @node Case Changes
2455 @comment node-name, next, previous, up
2456 @section Case Changes
2457 @cindex case conversion in buffers
2458
2459 The case change commands described here work on text in the current
2460 buffer. @xref{Case Conversion}, for case conversion functions that work
2461 on strings and characters. @xref{Case Tables}, for how to customize
2462 which characters are upper or lower case and how to convert them.
2463
2464 @deffn Command capitalize-region start end
2465 This function capitalizes all words in the region defined by
2466 @var{start} and @var{end}. To capitalize means to convert each word's
2467 first character to upper case and convert the rest of each word to lower
2468 case. The function returns @code{nil}.
2469
2470 If one end of the region is in the middle of a word, the part of the
2471 word within the region is treated as an entire word.
2472
2473 When @code{capitalize-region} is called interactively, @var{start} and
2474 @var{end} are point and the mark, with the smallest first.
2475
2476 @example
2477 @group
2478 ---------- Buffer: foo ----------
2479 This is the contents of the 5th foo.
2480 ---------- Buffer: foo ----------
2481 @end group
2482
2483 @group
2484 (capitalize-region 1 44)
2485 @result{} nil
2486
2487 ---------- Buffer: foo ----------
2488 This Is The Contents Of The 5th Foo.
2489 ---------- Buffer: foo ----------
2490 @end group
2491 @end example
2492 @end deffn
2493
2494 @deffn Command downcase-region start end
2495 This function converts all of the letters in the region defined by
2496 @var{start} and @var{end} to lower case. The function returns
2497 @code{nil}.
2498
2499 When @code{downcase-region} is called interactively, @var{start} and
2500 @var{end} are point and the mark, with the smallest first.
2501 @end deffn
2502
2503 @deffn Command upcase-region start end
2504 This function converts all of the letters in the region defined by
2505 @var{start} and @var{end} to upper case. The function returns
2506 @code{nil}.
2507
2508 When @code{upcase-region} is called interactively, @var{start} and
2509 @var{end} are point and the mark, with the smallest first.
2510 @end deffn
2511
2512 @deffn Command capitalize-word count
2513 This function capitalizes @var{count} words after point, moving point
2514 over as it does. To capitalize means to convert each word's first
2515 character to upper case and convert the rest of each word to lower case.
2516 If @var{count} is negative, the function capitalizes the
2517 @minus{}@var{count} previous words but does not move point. The value
2518 is @code{nil}.
2519
2520 If point is in the middle of a word, the part of the word before point
2521 is ignored when moving forward. The rest is treated as an entire word.
2522
2523 When @code{capitalize-word} is called interactively, @var{count} is
2524 set to the numeric prefix argument.
2525 @end deffn
2526
2527 @deffn Command downcase-word count
2528 This function converts the @var{count} words after point to all lower
2529 case, moving point over as it does. If @var{count} is negative, it
2530 converts the @minus{}@var{count} previous words but does not move point.
2531 The value is @code{nil}.
2532
2533 When @code{downcase-word} is called interactively, @var{count} is set
2534 to the numeric prefix argument.
2535 @end deffn
2536
2537 @deffn Command upcase-word count
2538 This function converts the @var{count} words after point to all upper
2539 case, moving point over as it does. If @var{count} is negative, it
2540 converts the @minus{}@var{count} previous words but does not move point.
2541 The value is @code{nil}.
2542
2543 When @code{upcase-word} is called interactively, @var{count} is set to
2544 the numeric prefix argument.
2545 @end deffn
2546
2547 @node Text Properties
2548 @section Text Properties
2549 @cindex text properties
2550 @cindex attributes of text
2551 @cindex properties of text
2552
2553 Each character position in a buffer or a string can have a @dfn{text
2554 property list}, much like the property list of a symbol (@pxref{Property
2555 Lists}). The properties belong to a particular character at a
2556 particular place, such as, the letter @samp{T} at the beginning of this
2557 sentence or the first @samp{o} in @samp{foo}---if the same character
2558 occurs in two different places, the two occurrences in general have
2559 different properties.
2560
2561 Each property has a name and a value. Both of these can be any Lisp
2562 object, but the name is normally a symbol. Typically each property
2563 name symbol is used for a particular purpose; for instance, the text
2564 property @code{face} specifies the faces for displaying the character
2565 (@pxref{Special Properties}). The usual way to access the property
2566 list is to specify a name and ask what value corresponds to it.
2567
2568 If a character has a @code{category} property, we call it the
2569 @dfn{property category} of the character. It should be a symbol. The
2570 properties of the symbol serve as defaults for the properties of the
2571 character.
2572
2573 Copying text between strings and buffers preserves the properties
2574 along with the characters; this includes such diverse functions as
2575 @code{substring}, @code{insert}, and @code{buffer-substring}.
2576
2577 @menu
2578 * Examining Properties:: Looking at the properties of one character.
2579 * Changing Properties:: Setting the properties of a range of text.
2580 * Property Search:: Searching for where a property changes value.
2581 * Special Properties:: Particular properties with special meanings.
2582 * Format Properties:: Properties for representing formatting of text.
2583 * Sticky Properties:: How inserted text gets properties from
2584 neighboring text.
2585 * Lazy Properties:: Computing text properties in a lazy fashion
2586 only when text is examined.
2587 * Clickable Text:: Using text properties to make regions of text
2588 do something when you click on them.
2589 * Fields:: The @code{field} property defines
2590 fields within the buffer.
2591 * Not Intervals:: Why text properties do not use
2592 Lisp-visible text intervals.
2593 @end menu
2594
2595 @node Examining Properties
2596 @subsection Examining Text Properties
2597
2598 The simplest way to examine text properties is to ask for the value of
2599 a particular property of a particular character. For that, use
2600 @code{get-text-property}. Use @code{text-properties-at} to get the
2601 entire property list of a character. @xref{Property Search}, for
2602 functions to examine the properties of a number of characters at once.
2603
2604 These functions handle both strings and buffers. Keep in mind that
2605 positions in a string start from 0, whereas positions in a buffer start
2606 from 1.
2607
2608 @defun get-text-property pos prop &optional object
2609 This function returns the value of the @var{prop} property of the
2610 character after position @var{pos} in @var{object} (a buffer or
2611 string). The argument @var{object} is optional and defaults to the
2612 current buffer.
2613
2614 If there is no @var{prop} property strictly speaking, but the character
2615 has a property category that is a symbol, then @code{get-text-property} returns
2616 the @var{prop} property of that symbol.
2617 @end defun
2618
2619 @defun get-char-property position prop &optional object
2620 This function is like @code{get-text-property}, except that it checks
2621 overlays first and then text properties. @xref{Overlays}.
2622
2623 The argument @var{object} may be a string, a buffer, or a window. If
2624 it is a window, then the buffer displayed in that window is used for
2625 text properties and overlays, but only the overlays active for that
2626 window are considered. If @var{object} is a buffer, then overlays in
2627 that buffer are considered first, in order of decreasing priority,
2628 followed by the text properties. If @var{object} is a string, only
2629 text properties are considered, since strings never have overlays.
2630 @end defun
2631
2632 @defun get-char-property-and-overlay position prop &optional object
2633 This is like @code{get-char-property}, but gives extra information
2634 about the overlay that the property value comes from.
2635
2636 Its value is a cons cell whose @sc{car} is the property value, the
2637 same value @code{get-char-property} would return with the same
2638 arguments. Its @sc{cdr} is the overlay in which the property was
2639 found, or @code{nil}, if it was found as a text property or not found
2640 at all.
2641
2642 If @var{position} is at the end of @var{object}, both the @sc{car} and
2643 the @sc{cdr} of the value are @code{nil}.
2644 @end defun
2645
2646 @defvar char-property-alias-alist
2647 This variable holds an alist which maps property names to a list of
2648 alternative property names. If a character does not specify a direct
2649 value for a property, the alternative property names are consulted in
2650 order; the first non-@code{nil} value is used. This variable takes
2651 precedence over @code{default-text-properties}, and @code{category}
2652 properties take precedence over this variable.
2653 @end defvar
2654
2655 @defun text-properties-at position &optional object
2656 This function returns the entire property list of the character at
2657 @var{position} in the string or buffer @var{object}. If @var{object} is
2658 @code{nil}, it defaults to the current buffer.
2659 @end defun
2660
2661 @defvar default-text-properties
2662 This variable holds a property list giving default values for text
2663 properties. Whenever a character does not specify a value for a
2664 property, neither directly, through a category symbol, or through
2665 @code{char-property-alias-alist}, the value stored in this list is
2666 used instead. Here is an example:
2667
2668 @example
2669 (setq default-text-properties '(foo 69)
2670 char-property-alias-alist nil)
2671 ;; @r{Make sure character 1 has no properties of its own.}
2672 (set-text-properties 1 2 nil)
2673 ;; @r{What we get, when we ask, is the default value.}
2674 (get-text-property 1 'foo)
2675 @result{} 69
2676 @end example
2677 @end defvar
2678
2679 @node Changing Properties
2680 @subsection Changing Text Properties
2681
2682 The primitives for changing properties apply to a specified range of
2683 text in a buffer or string. The function @code{set-text-properties}
2684 (see end of section) sets the entire property list of the text in that
2685 range; more often, it is useful to add, change, or delete just certain
2686 properties specified by name.
2687
2688 Since text properties are considered part of the contents of the
2689 buffer (or string), and can affect how a buffer looks on the screen,
2690 any change in buffer text properties marks the buffer as modified.
2691 Buffer text property changes are undoable also (@pxref{Undo}).
2692 Positions in a string start from 0, whereas positions in a buffer
2693 start from 1.
2694
2695 @defun put-text-property start end prop value &optional object
2696 This function sets the @var{prop} property to @var{value} for the text
2697 between @var{start} and @var{end} in the string or buffer @var{object}.
2698 If @var{object} is @code{nil}, it defaults to the current buffer.
2699 @end defun
2700
2701 @defun add-text-properties start end props &optional object
2702 This function adds or overrides text properties for the text between
2703 @var{start} and @var{end} in the string or buffer @var{object}. If
2704 @var{object} is @code{nil}, it defaults to the current buffer.
2705
2706 The argument @var{props} specifies which properties to add. It should
2707 have the form of a property list (@pxref{Property Lists}): a list whose
2708 elements include the property names followed alternately by the
2709 corresponding values.
2710
2711 The return value is @code{t} if the function actually changed some
2712 property's value; @code{nil} otherwise (if @var{props} is @code{nil} or
2713 its values agree with those in the text).
2714
2715 For example, here is how to set the @code{comment} and @code{face}
2716 properties of a range of text:
2717
2718 @example
2719 (add-text-properties @var{start} @var{end}
2720 '(comment t face highlight))
2721 @end example
2722 @end defun
2723
2724 @defun remove-text-properties start end props &optional object
2725 This function deletes specified text properties from the text between
2726 @var{start} and @var{end} in the string or buffer @var{object}. If
2727 @var{object} is @code{nil}, it defaults to the current buffer.
2728
2729 The argument @var{props} specifies which properties to delete. It
2730 should have the form of a property list (@pxref{Property Lists}): a list
2731 whose elements are property names alternating with corresponding values.
2732 But only the names matter---the values that accompany them are ignored.
2733 For example, here's how to remove the @code{face} property.
2734
2735 @example
2736 (remove-text-properties @var{start} @var{end} '(face nil))
2737 @end example
2738
2739 The return value is @code{t} if the function actually changed some
2740 property's value; @code{nil} otherwise (if @var{props} is @code{nil} or
2741 if no character in the specified text had any of those properties).
2742
2743 To remove all text properties from certain text, use
2744 @code{set-text-properties} and specify @code{nil} for the new property
2745 list.
2746 @end defun
2747
2748 @defun remove-list-of-text-properties start end list-of-properties &optional object
2749 Like @code{remove-text-properties} except that
2750 @var{list-of-properties} is a list of property names only, not an
2751 alternating list of property names and values.
2752 @end defun
2753
2754 @defun set-text-properties start end props &optional object
2755 This function completely replaces the text property list for the text
2756 between @var{start} and @var{end} in the string or buffer @var{object}.
2757 If @var{object} is @code{nil}, it defaults to the current buffer.
2758
2759 The argument @var{props} is the new property list. It should be a list
2760 whose elements are property names alternating with corresponding values.
2761
2762 After @code{set-text-properties} returns, all the characters in the
2763 specified range have identical properties.
2764
2765 If @var{props} is @code{nil}, the effect is to get rid of all properties
2766 from the specified range of text. Here's an example:
2767
2768 @example
2769 (set-text-properties @var{start} @var{end} nil)
2770 @end example
2771
2772 Do not rely on the return value of this function.
2773 @end defun
2774
2775 The easiest way to make a string with text properties
2776 is with @code{propertize}:
2777
2778 @defun propertize string &rest properties
2779 This function returns a copy of @var{string} which has the text
2780 properties @var{properties}. These properties apply to all the
2781 characters in the string that is returned. Here is an example that
2782 constructs a string with a @code{face} property and a @code{mouse-face}
2783 property:
2784
2785 @smallexample
2786 (propertize "foo" 'face 'italic
2787 'mouse-face 'bold-italic)
2788 @result{} #("foo" 0 3 (mouse-face bold-italic face italic))
2789 @end smallexample
2790
2791 To put different properties on various parts of a string, you can
2792 construct each part with @code{propertize} and then combine them with
2793 @code{concat}:
2794
2795 @smallexample
2796 (concat
2797 (propertize "foo" 'face 'italic
2798 'mouse-face 'bold-italic)
2799 " and "
2800 (propertize "bar" 'face 'italic
2801 'mouse-face 'bold-italic))
2802 @result{} #("foo and bar"
2803 0 3 (face italic mouse-face bold-italic)
2804 3 8 nil
2805 8 11 (face italic mouse-face bold-italic))
2806 @end smallexample
2807 @end defun
2808
2809 @xref{Buffer Contents}, for the function
2810 @code{buffer-substring-no-properties}, which copies text from the
2811 buffer but does not copy its properties.
2812
2813 @node Property Search
2814 @subsection Text Property Search Functions
2815
2816 In typical use of text properties, most of the time several or many
2817 consecutive characters have the same value for a property. Rather than
2818 writing your programs to examine characters one by one, it is much
2819 faster to process chunks of text that have the same property value.
2820
2821 Here are functions you can use to do this. They use @code{eq} for
2822 comparing property values. In all cases, @var{object} defaults to the
2823 current buffer.
2824
2825 For high performance, it's very important to use the @var{limit}
2826 argument to these functions, especially the ones that search for a
2827 single property---otherwise, they may spend a long time scanning to the
2828 end of the buffer, if the property you are interested in does not change.
2829
2830 These functions do not move point; instead, they return a position (or
2831 @code{nil}). Remember that a position is always between two characters;
2832 the position returned by these functions is between two characters with
2833 different properties.
2834
2835 @defun next-property-change pos &optional object limit
2836 The function scans the text forward from position @var{pos} in the
2837 string or buffer @var{object} till it finds a change in some text
2838 property, then returns the position of the change. In other words, it
2839 returns the position of the first character beyond @var{pos} whose
2840 properties are not identical to those of the character just after
2841 @var{pos}.
2842
2843 If @var{limit} is non-@code{nil}, then the scan ends at position
2844 @var{limit}. If there is no property change before that point,
2845 @code{next-property-change} returns @var{limit}.
2846
2847 The value is @code{nil} if the properties remain unchanged all the way
2848 to the end of @var{object} and @var{limit} is @code{nil}. If the value
2849 is non-@code{nil}, it is a position greater than or equal to @var{pos}.
2850 The value equals @var{pos} only when @var{limit} equals @var{pos}.
2851
2852 Here is an example of how to scan the buffer by chunks of text within
2853 which all properties are constant:
2854
2855 @smallexample
2856 (while (not (eobp))
2857 (let ((plist (text-properties-at (point)))
2858 (next-change
2859 (or (next-property-change (point) (current-buffer))
2860 (point-max))))
2861 @r{Process text from point to @var{next-change}@dots{}}
2862 (goto-char next-change)))
2863 @end smallexample
2864 @end defun
2865
2866 @defun previous-property-change pos &optional object limit
2867 This is like @code{next-property-change}, but scans back from @var{pos}
2868 instead of forward. If the value is non-@code{nil}, it is a position
2869 less than or equal to @var{pos}; it equals @var{pos} only if @var{limit}
2870 equals @var{pos}.
2871 @end defun
2872
2873 @defun next-single-property-change pos prop &optional object limit
2874 The function scans text for a change in the @var{prop} property, then
2875 returns the position of the change. The scan goes forward from
2876 position @var{pos} in the string or buffer @var{object}. In other
2877 words, this function returns the position of the first character
2878 beyond @var{pos} whose @var{prop} property differs from that of the
2879 character just after @var{pos}.
2880
2881 If @var{limit} is non-@code{nil}, then the scan ends at position
2882 @var{limit}. If there is no property change before that point,
2883 @code{next-single-property-change} returns @var{limit}.
2884
2885 The value is @code{nil} if the property remains unchanged all the way to
2886 the end of @var{object} and @var{limit} is @code{nil}. If the value is
2887 non-@code{nil}, it is a position greater than or equal to @var{pos}; it
2888 equals @var{pos} only if @var{limit} equals @var{pos}.
2889 @end defun
2890
2891 @defun previous-single-property-change pos prop &optional object limit
2892 This is like @code{next-single-property-change}, but scans back from
2893 @var{pos} instead of forward. If the value is non-@code{nil}, it is a
2894 position less than or equal to @var{pos}; it equals @var{pos} only if
2895 @var{limit} equals @var{pos}.
2896 @end defun
2897
2898 @defun next-char-property-change pos &optional limit
2899 This is like @code{next-property-change} except that it considers
2900 overlay properties as well as text properties, and if no change is
2901 found before the end of the buffer, it returns the maximum buffer
2902 position rather than @code{nil} (in this sense, it resembles the
2903 corresponding overlay function @code{next-overlay-change}, rather than
2904 @code{next-property-change}). There is no @var{object} operand
2905 because this function operates only on the current buffer. It returns
2906 the next address at which either kind of property changes.
2907 @end defun
2908
2909 @defun previous-char-property-change pos &optional limit
2910 This is like @code{next-char-property-change}, but scans back from
2911 @var{pos} instead of forward, and returns the minimum buffer
2912 position if no change is found.
2913 @end defun
2914
2915 @defun next-single-char-property-change pos prop &optional object limit
2916 This is like @code{next-single-property-change} except that it
2917 considers overlay properties as well as text properties, and if no
2918 change is found before the end of the @var{object}, it returns the
2919 maximum valid position in @var{object} rather than @code{nil}. Unlike
2920 @code{next-char-property-change}, this function @emph{does} have an
2921 @var{object} operand; if @var{object} is not a buffer, only
2922 text-properties are considered.
2923 @end defun
2924
2925 @defun previous-single-char-property-change pos prop &optional object limit
2926 This is like @code{next-single-char-property-change}, but scans back
2927 from @var{pos} instead of forward, and returns the minimum valid
2928 position in @var{object} if no change is found.
2929 @end defun
2930
2931 @defun text-property-any start end prop value &optional object
2932 This function returns non-@code{nil} if at least one character between
2933 @var{start} and @var{end} has a property @var{prop} whose value is
2934 @var{value}. More precisely, it returns the position of the first such
2935 character. Otherwise, it returns @code{nil}.
2936
2937 The optional fifth argument, @var{object}, specifies the string or
2938 buffer to scan. Positions are relative to @var{object}. The default
2939 for @var{object} is the current buffer.
2940 @end defun
2941
2942 @defun text-property-not-all start end prop value &optional object
2943 This function returns non-@code{nil} if at least one character between
2944 @var{start} and @var{end} does not have a property @var{prop} with value
2945 @var{value}. More precisely, it returns the position of the first such
2946 character. Otherwise, it returns @code{nil}.
2947
2948 The optional fifth argument, @var{object}, specifies the string or
2949 buffer to scan. Positions are relative to @var{object}. The default
2950 for @var{object} is the current buffer.
2951 @end defun
2952
2953 @node Special Properties
2954 @subsection Properties with Special Meanings
2955
2956 Here is a table of text property names that have special built-in
2957 meanings. The following sections list a few additional special property
2958 names that control filling and property inheritance. All other names
2959 have no standard meaning, and you can use them as you like.
2960
2961 Note: the properties @code{composition}, @code{display},
2962 @code{invisible} and @code{intangible} can also cause point to move to
2963 an acceptable place, after each Emacs command. @xref{Adjusting
2964 Point}.
2965
2966 @table @code
2967 @cindex property category of text character
2968 @kindex category @r{(text property)}
2969 @item category
2970 If a character has a @code{category} property, we call it the
2971 @dfn{property category} of the character. It should be a symbol. The
2972 properties of this symbol serve as defaults for the properties of the
2973 character.
2974
2975 @item face
2976 @cindex face codes of text
2977 @kindex face @r{(text property)}
2978 You can use the property @code{face} to control the font and color of
2979 text. @xref{Faces}, for more information.
2980
2981 In the simplest case, the value is a face name. It can also be a list;
2982 then each element can be any of these possibilities;
2983
2984 @itemize @bullet
2985 @item
2986 A face name (a symbol or string).
2987
2988 @item
2989 A property list of face attributes. This has the
2990 form (@var{keyword} @var{value} @dots{}), where each @var{keyword} is a
2991 face attribute name and @var{value} is a meaningful value for that
2992 attribute. With this feature, you do not need to create a face each
2993 time you want to specify a particular attribute for certain text.
2994 @xref{Face Attributes}.
2995 @end itemize
2996
2997 It works to use the latter two forms directly as the value
2998 of the @code{face} property.
2999
3000 Font Lock mode (@pxref{Font Lock Mode}) works in most buffers by
3001 dynamically updating the @code{face} property of characters based on
3002 the context.
3003
3004 @item font-lock-face
3005 @kindex font-lock-face @r{(text property)}
3006 This property specifies a value for the @code{face} property that Font
3007 Lock mode should apply to the underlying text. It is one of the
3008 fontification methods used by Font Lock mode, and is useful for
3009 special modes that implement their own highlighting.
3010 @xref{Precalculated Fontification}. When Font Lock mode is disabled,
3011 @code{font-lock-face} has no effect.
3012
3013 @item mouse-face
3014 @kindex mouse-face @r{(text property)}
3015 This property is used instead of @code{face} when the mouse is on or
3016 near the character. For this purpose, ``near'' means that all text
3017 between the character and where the mouse is have the same
3018 @code{mouse-face} property value.
3019
3020 @item fontified
3021 @kindex fontified @r{(text property)}
3022 This property says whether the text is ready for display. If
3023 @code{nil}, Emacs's redisplay routine calls the functions in
3024 @code{fontification-functions} (@pxref{Auto Faces}) to prepare this
3025 part of the buffer before it is displayed. It is used internally by
3026 the ``just in time'' font locking code.
3027
3028 @item display
3029 This property activates various features that change the
3030 way text is displayed. For example, it can make text appear taller
3031 or shorter, higher or lower, wider or narrow, or replaced with an image.
3032 @xref{Display Property}.
3033
3034 @item help-echo
3035 @kindex help-echo @r{(text property)}
3036 @cindex tooltip
3037 @anchor{Text help-echo}
3038 If text has a string as its @code{help-echo} property, then when you
3039 move the mouse onto that text, Emacs displays that string in the echo
3040 area, or in the tooltip window (@pxref{Tooltips,,, emacs, The GNU Emacs
3041 Manual}).
3042
3043 If the value of the @code{help-echo} property is a function, that
3044 function is called with three arguments, @var{window}, @var{object} and
3045 @var{pos} and should return a help string or @code{nil} for
3046 none. The first argument, @var{window} is the window in which
3047 the help was found. The second, @var{object}, is the buffer, overlay or
3048 string which had the @code{help-echo} property. The @var{pos}
3049 argument is as follows:
3050
3051 @itemize @bullet{}
3052 @item
3053 If @var{object} is a buffer, @var{pos} is the position in the buffer.
3054 @item
3055 If @var{object} is an overlay, that overlay has a @code{help-echo}
3056 property, and @var{pos} is the position in the overlay's buffer.
3057 @item
3058 If @var{object} is a string (an overlay string or a string displayed
3059 with the @code{display} property), @var{pos} is the position in that
3060 string.
3061 @end itemize
3062
3063 If the value of the @code{help-echo} property is neither a function nor
3064 a string, it is evaluated to obtain a help string.
3065
3066 You can alter the way help text is displayed by setting the variable
3067 @code{show-help-function} (@pxref{Help display}).
3068
3069 This feature is used in the mode line and for other active text.
3070
3071 @item keymap
3072 @cindex keymap of character
3073 @kindex keymap @r{(text property)}
3074 The @code{keymap} property specifies an additional keymap for
3075 commands. When this keymap applies, it is used for key lookup before
3076 the minor mode keymaps and before the buffer's local map.
3077 @xref{Active Keymaps}. If the property value is a symbol, the
3078 symbol's function definition is used as the keymap.
3079
3080 The property's value for the character before point applies if it is
3081 non-@code{nil} and rear-sticky, and the property's value for the
3082 character after point applies if it is non-@code{nil} and
3083 front-sticky. (For mouse clicks, the position of the click is used
3084 instead of the position of point.)
3085
3086 @item local-map
3087 @kindex local-map @r{(text property)}
3088 This property works like @code{keymap} except that it specifies a
3089 keymap to use @emph{instead of} the buffer's local map. For most
3090 purposes (perhaps all purposes), it is better to use the @code{keymap}
3091 property.
3092
3093 @item syntax-table
3094 The @code{syntax-table} property overrides what the syntax table says
3095 about this particular character. @xref{Syntax Properties}.
3096
3097 @item read-only
3098 @cindex read-only character
3099 @kindex read-only @r{(text property)}
3100 If a character has the property @code{read-only}, then modifying that
3101 character is not allowed. Any command that would do so gets an error,
3102 @code{text-read-only}. If the property value is a string, that string
3103 is used as the error message.
3104
3105 Insertion next to a read-only character is an error if inserting
3106 ordinary text there would inherit the @code{read-only} property due to
3107 stickiness. Thus, you can control permission to insert next to
3108 read-only text by controlling the stickiness. @xref{Sticky Properties}.
3109
3110 Since changing properties counts as modifying the buffer, it is not
3111 possible to remove a @code{read-only} property unless you know the
3112 special trick: bind @code{inhibit-read-only} to a non-@code{nil} value
3113 and then remove the property. @xref{Read Only Buffers}.
3114
3115 @item invisible
3116 @kindex invisible @r{(text property)}
3117 A non-@code{nil} @code{invisible} property can make a character invisible
3118 on the screen. @xref{Invisible Text}, for details.
3119
3120 @item intangible
3121 @kindex intangible @r{(text property)}
3122 If a group of consecutive characters have equal and non-@code{nil}
3123 @code{intangible} properties, then you cannot place point between them.
3124 If you try to move point forward into the group, point actually moves to
3125 the end of the group. If you try to move point backward into the group,
3126 point actually moves to the start of the group.
3127
3128 If consecutive characters have unequal non-@code{nil}
3129 @code{intangible} properties, they belong to separate groups; each
3130 group is separately treated as described above.
3131
3132 When the variable @code{inhibit-point-motion-hooks} is non-@code{nil},
3133 the @code{intangible} property is ignored.
3134
3135 @item field
3136 @kindex field @r{(text property)}
3137 Consecutive characters with the same @code{field} property constitute a
3138 @dfn{field}. Some motion functions including @code{forward-word} and
3139 @code{beginning-of-line} stop moving at a field boundary.
3140 @xref{Fields}.
3141
3142 @item cursor
3143 @kindex cursor @r{(text property)}
3144 Normally, the cursor is displayed at the end of any overlay and text
3145 property strings present at the current buffer position. You can
3146 place the cursor on any desired character of these strings by giving
3147 that character a non-@code{nil} @code{cursor} text property. In
3148 addition, if the value of the @code{cursor} property of an overlay
3149 string is an integer number, it specifies the number of buffer's
3150 character positions associated with the overlay string; this way,
3151 Emacs will display the cursor on the character with that property
3152 regardless of whether the current buffer position is actually covered
3153 by the overlay. Specifically, if the value of the @code{cursor}
3154 property of a character is the number @var{n}, the cursor will be
3155 displayed on this character for any buffer position in the range
3156 @code{[@var{ovpos}..@var{ovpos}+@var{n}]}, where @var{ovpos} is the
3157 starting buffer position covered by the overlay (@pxref{Managing
3158 Overlays}).
3159
3160 @item pointer
3161 @kindex pointer @r{(text property)}
3162 This specifies a specific pointer shape when the mouse pointer is over
3163 this text or image. @xref{Pointer Shape}, for possible pointer
3164 shapes.
3165
3166 @item line-spacing
3167 @kindex line-spacing @r{(text property)}
3168 A newline can have a @code{line-spacing} text or overlay property that
3169 controls the height of the display line ending with that newline. The
3170 property value overrides the default frame line spacing and the buffer
3171 local @code{line-spacing} variable. @xref{Line Height}.
3172
3173 @item line-height
3174 @kindex line-height @r{(text property)}
3175 A newline can have a @code{line-height} text or overlay property that
3176 controls the total height of the display line ending in that newline.
3177 @xref{Line Height}.
3178
3179 @item wrap-prefix
3180 If text has a @code{wrap-prefix} property, the prefix it defines will
3181 be added at display-time to the beginning of every continuation line
3182 due to text wrapping (so if lines are truncated, the wrap-prefix is
3183 never used). It may be a string, an image, or a stretch-glyph such as
3184 used by the @code{display} text-property. @xref{Display Property}.
3185
3186 A wrap-prefix may also be specified for an entire buffer using the
3187 @code{wrap-prefix} buffer-local variable (however, a
3188 @code{wrap-prefix} text-property takes precedence over the value of
3189 the @code{wrap-prefix} variable). @xref{Truncation}.
3190
3191 @item line-prefix
3192 If text has a @code{line-prefix} property, the prefix it defines will
3193 be added at display-time to the beginning of every non-continuation
3194 line. It may be a string, an image, or a stretch-glyph such as used
3195 by the @code{display} text-property. @xref{Display Property}.
3196
3197 A line-prefix may also be specified for an entire buffer using the
3198 @code{line-prefix} buffer-local variable (however, a
3199 @code{line-prefix} text-property takes precedence over the value of
3200 the @code{line-prefix} variable). @xref{Truncation}.
3201
3202 @item modification-hooks
3203 @cindex change hooks for a character
3204 @cindex hooks for changing a character
3205 @kindex modification-hooks @r{(text property)}
3206 If a character has the property @code{modification-hooks}, then its
3207 value should be a list of functions; modifying that character calls
3208 all of those functions before the actual modification. Each function
3209 receives two arguments: the beginning and end of the part of the
3210 buffer being modified. Note that if a particular modification hook
3211 function appears on several characters being modified by a single
3212 primitive, you can't predict how many times the function will
3213 be called.
3214 Furthermore, insertion will not modify any existing character, so this
3215 hook will only be run when removing some characters, replacing them
3216 with others, or changing their text-properties.
3217
3218 If these functions modify the buffer, they should bind
3219 @code{inhibit-modification-hooks} to @code{t} around doing so, to
3220 avoid confusing the internal mechanism that calls these hooks.
3221
3222 Overlays also support the @code{modification-hooks} property, but the
3223 details are somewhat different (@pxref{Overlay Properties}).
3224
3225 @item insert-in-front-hooks
3226 @itemx insert-behind-hooks
3227 @kindex insert-in-front-hooks @r{(text property)}
3228 @kindex insert-behind-hooks @r{(text property)}
3229 The operation of inserting text in a buffer also calls the functions
3230 listed in the @code{insert-in-front-hooks} property of the following
3231 character and in the @code{insert-behind-hooks} property of the
3232 preceding character. These functions receive two arguments, the
3233 beginning and end of the inserted text. The functions are called
3234 @emph{after} the actual insertion takes place.
3235
3236 See also @ref{Change Hooks}, for other hooks that are called
3237 when you change text in a buffer.
3238
3239 @item point-entered
3240 @itemx point-left
3241 @cindex hooks for motion of point
3242 @kindex point-entered @r{(text property)}
3243 @kindex point-left @r{(text property)}
3244 The special properties @code{point-entered} and @code{point-left}
3245 record hook functions that report motion of point. Each time point
3246 moves, Emacs compares these two property values:
3247
3248 @itemize @bullet
3249 @item
3250 the @code{point-left} property of the character after the old location,
3251 and
3252 @item
3253 the @code{point-entered} property of the character after the new
3254 location.
3255 @end itemize
3256
3257 @noindent
3258 If these two values differ, each of them is called (if not @code{nil})
3259 with two arguments: the old value of point, and the new one.
3260
3261 The same comparison is made for the characters before the old and new
3262 locations. The result may be to execute two @code{point-left} functions
3263 (which may be the same function) and/or two @code{point-entered}
3264 functions (which may be the same function). In any case, all the
3265 @code{point-left} functions are called first, followed by all the
3266 @code{point-entered} functions.
3267
3268 It is possible to use @code{char-after} to examine characters at various
3269 buffer positions without moving point to those positions. Only an
3270 actual change in the value of point runs these hook functions.
3271
3272 The variable @code{inhibit-point-motion-hooks} can inhibit running the
3273 @code{point-left} and @code{point-entered} hooks, see @ref{Inhibit
3274 point motion hooks}.
3275
3276 @item composition
3277 @kindex composition @r{(text property)}
3278 This text property is used to display a sequence of characters as a
3279 single glyph composed from components. But the value of the property
3280 itself is completely internal to Emacs and should not be manipulated
3281 directly by, for instance, @code{put-text-property}.
3282
3283 @end table
3284
3285 @defvar inhibit-point-motion-hooks
3286 @anchor{Inhibit point motion hooks} When this variable is
3287 non-@code{nil}, @code{point-left} and @code{point-entered} hooks are
3288 not run, and the @code{intangible} property has no effect. Do not set
3289 this variable globally; bind it with @code{let}.
3290 @end defvar
3291
3292 @defvar show-help-function
3293 @anchor{Help display} If this variable is non-@code{nil}, it specifies a
3294 function called to display help strings. These may be @code{help-echo}
3295 properties, menu help strings (@pxref{Simple Menu Items},
3296 @pxref{Extended Menu Items}), or tool bar help strings (@pxref{Tool
3297 Bar}). The specified function is called with one argument, the help
3298 string to display. Tooltip mode (@pxref{Tooltips,,, emacs, The GNU Emacs
3299 Manual}) provides an example.
3300 @end defvar
3301
3302 @node Format Properties
3303 @subsection Formatted Text Properties
3304
3305 These text properties affect the behavior of the fill commands. They
3306 are used for representing formatted text. @xref{Filling}, and
3307 @ref{Margins}.
3308
3309 @table @code
3310 @item hard
3311 If a newline character has this property, it is a ``hard'' newline.
3312 The fill commands do not alter hard newlines and do not move words
3313 across them. However, this property takes effect only if the
3314 @code{use-hard-newlines} minor mode is enabled. @xref{Hard and Soft
3315 Newlines,, Hard and Soft Newlines, emacs, The GNU Emacs Manual}.
3316
3317 @item right-margin
3318 This property specifies an extra right margin for filling this part of the
3319 text.
3320
3321 @item left-margin
3322 This property specifies an extra left margin for filling this part of the
3323 text.
3324
3325 @item justification
3326 This property specifies the style of justification for filling this part
3327 of the text.
3328 @end table
3329
3330 @node Sticky Properties
3331 @subsection Stickiness of Text Properties
3332 @cindex sticky text properties
3333 @cindex inheritance of text properties
3334
3335 Self-inserting characters normally take on the same properties as the
3336 preceding character. This is called @dfn{inheritance} of properties.
3337
3338 In a Lisp program, you can do insertion with inheritance or without,
3339 depending on your choice of insertion primitive. The ordinary text
3340 insertion functions such as @code{insert} do not inherit any properties.
3341 They insert text with precisely the properties of the string being
3342 inserted, and no others. This is correct for programs that copy text
3343 from one context to another---for example, into or out of the kill ring.
3344 To insert with inheritance, use the special primitives described in this
3345 section. Self-inserting characters inherit properties because they work
3346 using these primitives.
3347
3348 When you do insertion with inheritance, @emph{which} properties are
3349 inherited, and from where, depends on which properties are @dfn{sticky}.
3350 Insertion after a character inherits those of its properties that are
3351 @dfn{rear-sticky}. Insertion before a character inherits those of its
3352 properties that are @dfn{front-sticky}. When both sides offer different
3353 sticky values for the same property, the previous character's value
3354 takes precedence.
3355
3356 By default, a text property is rear-sticky but not front-sticky; thus,
3357 the default is to inherit all the properties of the preceding character,
3358 and nothing from the following character.
3359
3360 You can control the stickiness of various text properties with two
3361 specific text properties, @code{front-sticky} and @code{rear-nonsticky},
3362 and with the variable @code{text-property-default-nonsticky}. You can
3363 use the variable to specify a different default for a given property.
3364 You can use those two text properties to make any specific properties
3365 sticky or nonsticky in any particular part of the text.
3366
3367 If a character's @code{front-sticky} property is @code{t}, then all
3368 its properties are front-sticky. If the @code{front-sticky} property is
3369 a list, then the sticky properties of the character are those whose
3370 names are in the list. For example, if a character has a
3371 @code{front-sticky} property whose value is @code{(face read-only)},
3372 then insertion before the character can inherit its @code{face} property
3373 and its @code{read-only} property, but no others.
3374
3375 The @code{rear-nonsticky} property works the opposite way. Most
3376 properties are rear-sticky by default, so the @code{rear-nonsticky}
3377 property says which properties are @emph{not} rear-sticky. If a
3378 character's @code{rear-nonsticky} property is @code{t}, then none of its
3379 properties are rear-sticky. If the @code{rear-nonsticky} property is a
3380 list, properties are rear-sticky @emph{unless} their names are in the
3381 list.
3382
3383 @defvar text-property-default-nonsticky
3384 This variable holds an alist which defines the default rear-stickiness
3385 of various text properties. Each element has the form
3386 @code{(@var{property} . @var{nonstickiness})}, and it defines the
3387 stickiness of a particular text property, @var{property}.
3388
3389 If @var{nonstickiness} is non-@code{nil}, this means that the property
3390 @var{property} is rear-nonsticky by default. Since all properties are
3391 front-nonsticky by default, this makes @var{property} nonsticky in both
3392 directions by default.
3393
3394 The text properties @code{front-sticky} and @code{rear-nonsticky}, when
3395 used, take precedence over the default @var{nonstickiness} specified in
3396 @code{text-property-default-nonsticky}.
3397 @end defvar
3398
3399 Here are the functions that insert text with inheritance of properties:
3400
3401 @defun insert-and-inherit &rest strings
3402 Insert the strings @var{strings}, just like the function @code{insert},
3403 but inherit any sticky properties from the adjoining text.
3404 @end defun
3405
3406 @defun insert-before-markers-and-inherit &rest strings
3407 Insert the strings @var{strings}, just like the function
3408 @code{insert-before-markers}, but inherit any sticky properties from the
3409 adjoining text.
3410 @end defun
3411
3412 @xref{Insertion}, for the ordinary insertion functions which do not
3413 inherit.
3414
3415 @node Lazy Properties
3416 @subsection Lazy Computation of Text Properties
3417
3418 Instead of computing text properties for all the text in the buffer,
3419 you can arrange to compute the text properties for parts of the text
3420 when and if something depends on them.
3421
3422 The primitive that extracts text from the buffer along with its
3423 properties is @code{buffer-substring}. Before examining the properties,
3424 this function runs the abnormal hook @code{buffer-access-fontify-functions}.
3425
3426 @defvar buffer-access-fontify-functions
3427 This variable holds a list of functions for computing text properties.
3428 Before @code{buffer-substring} copies the text and text properties for a
3429 portion of the buffer, it calls all the functions in this list. Each of
3430 the functions receives two arguments that specify the range of the
3431 buffer being accessed. (The buffer itself is always the current
3432 buffer.)
3433 @end defvar
3434
3435 The function @code{buffer-substring-no-properties} does not call these
3436 functions, since it ignores text properties anyway.
3437
3438 In order to prevent the hook functions from being called more than
3439 once for the same part of the buffer, you can use the variable
3440 @code{buffer-access-fontified-property}.
3441
3442 @defvar buffer-access-fontified-property
3443 If this variable's value is non-@code{nil}, it is a symbol which is used
3444 as a text property name. A non-@code{nil} value for that text property
3445 means, ``the other text properties for this character have already been
3446 computed.''
3447
3448 If all the characters in the range specified for @code{buffer-substring}
3449 have a non-@code{nil} value for this property, @code{buffer-substring}
3450 does not call the @code{buffer-access-fontify-functions} functions. It
3451 assumes these characters already have the right text properties, and
3452 just copies the properties they already have.
3453
3454 The normal way to use this feature is that the
3455 @code{buffer-access-fontify-functions} functions add this property, as
3456 well as others, to the characters they operate on. That way, they avoid
3457 being called over and over for the same text.
3458 @end defvar
3459
3460 @node Clickable Text
3461 @subsection Defining Clickable Text
3462 @cindex clickable text
3463 @cindex follow links
3464 @cindex mouse-1
3465
3466 @dfn{Clickable text} is text that can be clicked, with either the
3467 mouse or via a keyboard command, to produce some result. Many major
3468 modes use clickable text to implement textual hyper-links, or
3469 @dfn{links} for short.
3470
3471 The easiest way to insert and manipulate links is to use the
3472 @code{button} package. @xref{Buttons}. In this section, we will
3473 explain how to manually set up clickable text in a buffer, using text
3474 properties. For simplicity, we will refer to the clickable text as a
3475 @dfn{link}.
3476
3477 Implementing a link involves three separate steps: (1) indicating
3478 clickability when the mouse moves over the link; (2) making @kbd{RET}
3479 or @kbd{Mouse-2} on that link do something; and (3) setting up a
3480 @code{follow-link} condition so that the link obeys
3481 @code{mouse-1-click-follows-link}.
3482
3483 To indicate clickability, add the @code{mouse-face} text property to
3484 the text of the link; then Emacs will highlight the link when the
3485 mouse moves over it. In addition, you should define a tooltip or echo
3486 area message, using the @code{help-echo} text property. @xref{Special
3487 Properties}. For instance, here is how Dired indicates that file
3488 names are clickable:
3489
3490 @smallexample
3491 (if (dired-move-to-filename)
3492 (add-text-properties
3493 (point)
3494 (save-excursion
3495 (dired-move-to-end-of-filename)
3496 (point))
3497 '(mouse-face highlight
3498 help-echo "mouse-2: visit this file in other window")))
3499 @end smallexample
3500
3501 To make the link clickable, bind @key{RET} and @kbd{Mouse-2} to
3502 commands that perform the desired action. Each command should check
3503 to see whether it was called on a link, and act accordingly. For
3504 instance, Dired's major mode keymap binds @kbd{Mouse-2} to the
3505 following command:
3506
3507 @smallexample
3508 (defun dired-mouse-find-file-other-window (event)
3509 "In Dired, visit the file or directory name you click on."
3510 (interactive "e")
3511 (let ((window (posn-window (event-end event)))
3512 (pos (posn-point (event-end event)))
3513 file)
3514 (if (not (windowp window))
3515 (error "No file chosen"))
3516 (with-current-buffer (window-buffer window)
3517 (goto-char pos)
3518 (setq file (dired-get-file-for-visit)))
3519 (if (file-directory-p file)
3520 (or (and (cdr dired-subdir-alist)
3521 (dired-goto-subdir file))
3522 (progn
3523 (select-window window)
3524 (dired-other-window file)))
3525 (select-window window)
3526 (find-file-other-window (file-name-sans-versions file t)))))
3527 @end smallexample
3528
3529 @noindent
3530 This command uses the functions @code{posn-window} and
3531 @code{posn-point} to determine where the click occurred, and
3532 @code{dired-get-file-for-visit} to determine which file to visit.
3533
3534 Instead of binding the mouse command in a major mode keymap, you can
3535 bind it within the link text, using the @code{keymap} text property
3536 (@pxref{Special Properties}). For instance:
3537
3538 @example
3539 (let ((map (make-sparse-keymap)))
3540 (define-key map [mouse-2] 'operate-this-button)
3541 (put-text-property link-start link-end 'keymap map))
3542 @end example
3543
3544 @noindent
3545 With this method, you can easily define different commands for
3546 different links. Furthermore, the global definition of @key{RET} and
3547 @kbd{Mouse-2} remain available for the rest of the text in the buffer.
3548
3549 @vindex mouse-1-click-follows-link
3550 The basic Emacs command for clicking on links is @kbd{Mouse-2}.
3551 However, for compatibility with other graphical applications, Emacs
3552 also recognizes @kbd{Mouse-1} clicks on links, provided the user
3553 clicks on the link quickly without moving the mouse. This behavior is
3554 controlled by the user option @code{mouse-1-click-follows-link}.
3555 @xref{Mouse References,,, emacs, The GNU Emacs Manual}.
3556
3557 To set up the link so that it obeys
3558 @code{mouse-1-click-follows-link}, you must either (1) apply a
3559 @code{follow-link} text or overlay property to the link text, or (2)
3560 bind the @code{follow-link} event to a keymap (which can be a major
3561 mode keymap or a local keymap specified via the @code{keymap} text
3562 property). The value of the @code{follow-link} property, or the
3563 binding for the @code{follow-link} event, acts as a ``condition'' for
3564 the link action. This condition tells Emacs two things: the
3565 circumstances under which a @kbd{Mouse-1} click should be regarded as
3566 occurring ``inside'' the link, and how to compute an ``action code''
3567 that says what to translate the @kbd{Mouse-1} click into. The link
3568 action condition can be one of the following:
3569
3570 @table @asis
3571 @item @code{mouse-face}
3572 If the condition is the symbol @code{mouse-face}, a position is inside
3573 a link if there is a non-@code{nil} @code{mouse-face} property at that
3574 position. The action code is always @code{t}.
3575
3576 For example, here is how Info mode handles @key{Mouse-1}:
3577
3578 @smallexample
3579 (define-key Info-mode-map [follow-link] 'mouse-face)
3580 @end smallexample
3581
3582 @item a function
3583 If the condition is a function, @var{func}, then a position @var{pos}
3584 is inside a link if @code{(@var{func} @var{pos})} evaluates to
3585 non-@code{nil}. The value returned by @var{func} serves as the action
3586 code.
3587
3588 For example, here is how pcvs enables @kbd{Mouse-1} to follow links on
3589 file names only:
3590
3591 @smallexample
3592 (define-key map [follow-link]
3593 (lambda (pos)
3594 (eq (get-char-property pos 'face) 'cvs-filename-face)))
3595 @end smallexample
3596
3597 @item anything else
3598 If the condition value is anything else, then the position is inside a
3599 link and the condition itself is the action code. Clearly, you should
3600 specify this kind of condition only when applying the condition via a
3601 text or property overlay on the link text (so that it does not apply
3602 to the entire buffer).
3603 @end table
3604
3605 @noindent
3606 The action code tells @kbd{Mouse-1} how to follow the link:
3607
3608 @table @asis
3609 @item a string or vector
3610 If the action code is a string or vector, the @kbd{Mouse-1} event is
3611 translated into the first element of the string or vector; i.e., the
3612 action of the @kbd{Mouse-1} click is the local or global binding of
3613 that character or symbol. Thus, if the action code is @code{"foo"},
3614 @kbd{Mouse-1} translates into @kbd{f}. If it is @code{[foo]},
3615 @kbd{Mouse-1} translates into @key{foo}.
3616
3617 @item anything else
3618 For any other non-@code{nil} action code, the @kbd{Mouse-1} event is
3619 translated into a @kbd{Mouse-2} event at the same position.
3620 @end table
3621
3622 To define @kbd{Mouse-1} to activate a button defined with
3623 @code{define-button-type}, give the button a @code{follow-link}
3624 property. The property value should be a link action condition, as
3625 described above. @xref{Buttons}. For example, here is how Help mode
3626 handles @kbd{Mouse-1}:
3627
3628 @smallexample
3629 (define-button-type 'help-xref
3630 'follow-link t
3631 'action #'help-button-action)
3632 @end smallexample
3633
3634 To define @kbd{Mouse-1} on a widget defined with
3635 @code{define-widget}, give the widget a @code{:follow-link} property.
3636 The property value should be a link action condition, as described
3637 above. For example, here is how the @code{link} widget specifies that
3638 a @key{Mouse-1} click shall be translated to @key{RET}:
3639
3640 @smallexample
3641 (define-widget 'link 'item
3642 "An embedded link."
3643 :button-prefix 'widget-link-prefix
3644 :button-suffix 'widget-link-suffix
3645 :follow-link "\C-m"
3646 :help-echo "Follow the link."
3647 :format "%[%t%]")
3648 @end smallexample
3649
3650 @defun mouse-on-link-p pos
3651 This function returns non-@code{nil} if position @var{pos} in the
3652 current buffer is on a link. @var{pos} can also be a mouse event
3653 location, as returned by @code{event-start} (@pxref{Accessing Mouse}).
3654 @end defun
3655
3656 @node Fields
3657 @subsection Defining and Using Fields
3658 @cindex fields
3659
3660 A field is a range of consecutive characters in the buffer that are
3661 identified by having the same value (comparing with @code{eq}) of the
3662 @code{field} property (either a text-property or an overlay property).
3663 This section describes special functions that are available for
3664 operating on fields.
3665
3666 You specify a field with a buffer position, @var{pos}. We think of
3667 each field as containing a range of buffer positions, so the position
3668 you specify stands for the field containing that position.
3669
3670 When the characters before and after @var{pos} are part of the same
3671 field, there is no doubt which field contains @var{pos}: the one those
3672 characters both belong to. When @var{pos} is at a boundary between
3673 fields, which field it belongs to depends on the stickiness of the
3674 @code{field} properties of the two surrounding characters (@pxref{Sticky
3675 Properties}). The field whose property would be inherited by text
3676 inserted at @var{pos} is the field that contains @var{pos}.
3677
3678 There is an anomalous case where newly inserted text at @var{pos}
3679 would not inherit the @code{field} property from either side. This
3680 happens if the previous character's @code{field} property is not
3681 rear-sticky, and the following character's @code{field} property is not
3682 front-sticky. In this case, @var{pos} belongs to neither the preceding
3683 field nor the following field; the field functions treat it as belonging
3684 to an empty field whose beginning and end are both at @var{pos}.
3685
3686 In all of these functions, if @var{pos} is omitted or @code{nil}, the
3687 value of point is used by default. If narrowing is in effect, then
3688 @var{pos} should fall within the accessible portion. @xref{Narrowing}.
3689
3690 @defun field-beginning &optional pos escape-from-edge limit
3691 This function returns the beginning of the field specified by @var{pos}.
3692
3693 If @var{pos} is at the beginning of its field, and
3694 @var{escape-from-edge} is non-@code{nil}, then the return value is
3695 always the beginning of the preceding field that @emph{ends} at @var{pos},
3696 regardless of the stickiness of the @code{field} properties around
3697 @var{pos}.
3698
3699 If @var{limit} is non-@code{nil}, it is a buffer position; if the
3700 beginning of the field is before @var{limit}, then @var{limit} will be
3701 returned instead.
3702 @end defun
3703
3704 @defun field-end &optional pos escape-from-edge limit
3705 This function returns the end of the field specified by @var{pos}.
3706
3707 If @var{pos} is at the end of its field, and @var{escape-from-edge} is
3708 non-@code{nil}, then the return value is always the end of the following
3709 field that @emph{begins} at @var{pos}, regardless of the stickiness of
3710 the @code{field} properties around @var{pos}.
3711
3712 If @var{limit} is non-@code{nil}, it is a buffer position; if the end
3713 of the field is after @var{limit}, then @var{limit} will be returned
3714 instead.
3715 @end defun
3716
3717 @defun field-string &optional pos
3718 This function returns the contents of the field specified by @var{pos},
3719 as a string.
3720 @end defun
3721
3722 @defun field-string-no-properties &optional pos
3723 This function returns the contents of the field specified by @var{pos},
3724 as a string, discarding text properties.
3725 @end defun
3726
3727 @defun delete-field &optional pos
3728 This function deletes the text of the field specified by @var{pos}.
3729 @end defun
3730
3731 @defun constrain-to-field new-pos old-pos &optional escape-from-edge only-in-line inhibit-capture-property
3732 This function ``constrains'' @var{new-pos} to the field that
3733 @var{old-pos} belongs to---in other words, it returns the position
3734 closest to @var{new-pos} that is in the same field as @var{old-pos}.
3735
3736 If @var{new-pos} is @code{nil}, then @code{constrain-to-field} uses
3737 the value of point instead, and moves point to the resulting position
3738 as well as returning it.
3739
3740 If @var{old-pos} is at the boundary of two fields, then the acceptable
3741 final positions depend on the argument @var{escape-from-edge}. If
3742 @var{escape-from-edge} is @code{nil}, then @var{new-pos} must be in
3743 the field whose @code{field} property equals what new characters
3744 inserted at @var{old-pos} would inherit. (This depends on the
3745 stickiness of the @code{field} property for the characters before and
3746 after @var{old-pos}.) If @var{escape-from-edge} is non-@code{nil},
3747 @var{new-pos} can be anywhere in the two adjacent fields.
3748 Additionally, if two fields are separated by another field with the
3749 special value @code{boundary}, then any point within this special
3750 field is also considered to be ``on the boundary.''
3751
3752 Commands like @kbd{C-a} with no argumemt, that normally move backward
3753 to a specific kind of location and stay there once there, probably
3754 should specify @code{nil} for @var{escape-from-edge}. Other motion
3755 commands that check fields should probably pass @code{t}.
3756
3757 If the optional argument @var{only-in-line} is non-@code{nil}, and
3758 constraining @var{new-pos} in the usual way would move it to a different
3759 line, @var{new-pos} is returned unconstrained. This used in commands
3760 that move by line, such as @code{next-line} and
3761 @code{beginning-of-line}, so that they respect field boundaries only in
3762 the case where they can still move to the right line.
3763
3764 If the optional argument @var{inhibit-capture-property} is
3765 non-@code{nil}, and @var{old-pos} has a non-@code{nil} property of that
3766 name, then any field boundaries are ignored.
3767
3768 You can cause @code{constrain-to-field} to ignore all field boundaries
3769 (and so never constrain anything) by binding the variable
3770 @code{inhibit-field-text-motion} to a non-@code{nil} value.
3771 @end defun
3772
3773 @node Not Intervals
3774 @subsection Why Text Properties are not Intervals
3775 @cindex intervals
3776
3777 Some editors that support adding attributes to text in the buffer do
3778 so by letting the user specify ``intervals'' within the text, and adding
3779 the properties to the intervals. Those editors permit the user or the
3780 programmer to determine where individual intervals start and end. We
3781 deliberately provided a different sort of interface in Emacs Lisp to
3782 avoid certain paradoxical behavior associated with text modification.
3783
3784 If the actual subdivision into intervals is meaningful, that means you
3785 can distinguish between a buffer that is just one interval with a
3786 certain property, and a buffer containing the same text subdivided into
3787 two intervals, both of which have that property.
3788
3789 Suppose you take the buffer with just one interval and kill part of
3790 the text. The text remaining in the buffer is one interval, and the
3791 copy in the kill ring (and the undo list) becomes a separate interval.
3792 Then if you yank back the killed text, you get two intervals with the
3793 same properties. Thus, editing does not preserve the distinction
3794 between one interval and two.
3795
3796 Suppose we ``fix'' this problem by coalescing the two intervals when
3797 the text is inserted. That works fine if the buffer originally was a
3798 single interval. But suppose instead that we have two adjacent
3799 intervals with the same properties, and we kill the text of one interval
3800 and yank it back. The same interval-coalescence feature that rescues
3801 the other case causes trouble in this one: after yanking, we have just
3802 one interval. One again, editing does not preserve the distinction
3803 between one interval and two.
3804
3805 Insertion of text at the border between intervals also raises
3806 questions that have no satisfactory answer.
3807
3808 However, it is easy to arrange for editing to behave consistently for
3809 questions of the form, ``What are the properties of this character?''
3810 So we have decided these are the only questions that make sense; we have
3811 not implemented asking questions about where intervals start or end.
3812
3813 In practice, you can usually use the text property search functions in
3814 place of explicit interval boundaries. You can think of them as finding
3815 the boundaries of intervals, assuming that intervals are always
3816 coalesced whenever possible. @xref{Property Search}.
3817
3818 Emacs also provides explicit intervals as a presentation feature; see
3819 @ref{Overlays}.
3820
3821 @node Substitution
3822 @section Substituting for a Character Code
3823
3824 The following functions replace characters within a specified region
3825 based on their character codes.
3826
3827 @defun subst-char-in-region start end old-char new-char &optional noundo
3828 @cindex replace characters
3829 This function replaces all occurrences of the character @var{old-char}
3830 with the character @var{new-char} in the region of the current buffer
3831 defined by @var{start} and @var{end}.
3832
3833 @cindex undo avoidance
3834 If @var{noundo} is non-@code{nil}, then @code{subst-char-in-region} does
3835 not record the change for undo and does not mark the buffer as modified.
3836 This was useful for controlling the old selective display feature
3837 (@pxref{Selective Display}).
3838
3839 @code{subst-char-in-region} does not move point and returns
3840 @code{nil}.
3841
3842 @example
3843 @group
3844 ---------- Buffer: foo ----------
3845 This is the contents of the buffer before.
3846 ---------- Buffer: foo ----------
3847 @end group
3848
3849 @group
3850 (subst-char-in-region 1 20 ?i ?X)
3851 @result{} nil
3852
3853 ---------- Buffer: foo ----------
3854 ThXs Xs the contents of the buffer before.
3855 ---------- Buffer: foo ----------
3856 @end group
3857 @end example
3858 @end defun
3859
3860 @deffn Command translate-region start end table
3861 This function applies a translation table to the characters in the
3862 buffer between positions @var{start} and @var{end}.
3863
3864 The translation table @var{table} is a string or a char-table;
3865 @code{(aref @var{table} @var{ochar})} gives the translated character
3866 corresponding to @var{ochar}. If @var{table} is a string, any
3867 characters with codes larger than the length of @var{table} are not
3868 altered by the translation.
3869
3870 The return value of @code{translate-region} is the number of
3871 characters that were actually changed by the translation. This does
3872 not count characters that were mapped into themselves in the
3873 translation table.
3874 @end deffn
3875
3876 @node Registers
3877 @section Registers
3878 @cindex registers
3879
3880 A register is a sort of variable used in Emacs editing that can hold a
3881 variety of different kinds of values. Each register is named by a
3882 single character. All @acronym{ASCII} characters and their meta variants
3883 (but with the exception of @kbd{C-g}) can be used to name registers.
3884 Thus, there are 255 possible registers. A register is designated in
3885 Emacs Lisp by the character that is its name.
3886
3887 @defvar register-alist
3888 This variable is an alist of elements of the form @code{(@var{name} .
3889 @var{contents})}. Normally, there is one element for each Emacs
3890 register that has been used.
3891
3892 The object @var{name} is a character (an integer) identifying the
3893 register.
3894 @end defvar
3895
3896 The @var{contents} of a register can have several possible types:
3897
3898 @table @asis
3899 @item a number
3900 A number stands for itself. If @code{insert-register} finds a number
3901 in the register, it converts the number to decimal.
3902
3903 @item a marker
3904 A marker represents a buffer position to jump to.
3905
3906 @item a string
3907 A string is text saved in the register.
3908
3909 @item a rectangle
3910 A rectangle is represented by a list of strings.
3911
3912 @item @code{(@var{window-configuration} @var{position})}
3913 This represents a window configuration to restore in one frame, and a
3914 position to jump to in the current buffer.
3915
3916 @item @code{(@var{frame-configuration} @var{position})}
3917 This represents a frame configuration to restore, and a position
3918 to jump to in the current buffer.
3919
3920 @item (file @var{filename})
3921 This represents a file to visit; jumping to this value visits file
3922 @var{filename}.
3923
3924 @item (file-query @var{filename} @var{position})
3925 This represents a file to visit and a position in it; jumping to this
3926 value visits file @var{filename} and goes to buffer position
3927 @var{position}. Restoring this type of position asks the user for
3928 confirmation first.
3929 @end table
3930
3931 The functions in this section return unpredictable values unless
3932 otherwise stated.
3933
3934 @defun get-register reg
3935 This function returns the contents of the register
3936 @var{reg}, or @code{nil} if it has no contents.
3937 @end defun
3938
3939 @defun set-register reg value
3940 This function sets the contents of register @var{reg} to @var{value}.
3941 A register can be set to any value, but the other register functions
3942 expect only certain data types. The return value is @var{value}.
3943 @end defun
3944
3945 @deffn Command view-register reg
3946 This command displays what is contained in register @var{reg}.
3947 @end deffn
3948
3949 @deffn Command insert-register reg &optional beforep
3950 This command inserts contents of register @var{reg} into the current
3951 buffer.
3952
3953 Normally, this command puts point before the inserted text, and the
3954 mark after it. However, if the optional second argument @var{beforep}
3955 is non-@code{nil}, it puts the mark before and point after.
3956 You can pass a non-@code{nil} second argument @var{beforep} to this
3957 function interactively by supplying any prefix argument.
3958
3959 If the register contains a rectangle, then the rectangle is inserted
3960 with its upper left corner at point. This means that text is inserted
3961 in the current line and underneath it on successive lines.
3962
3963 If the register contains something other than saved text (a string) or
3964 a rectangle (a list), currently useless things happen. This may be
3965 changed in the future.
3966 @end deffn
3967
3968 @node Transposition
3969 @section Transposition of Text
3970
3971 This subroutine is used by the transposition commands.
3972
3973 @defun transpose-regions start1 end1 start2 end2 &optional leave-markers
3974 This function exchanges two nonoverlapping portions of the buffer.
3975 Arguments @var{start1} and @var{end1} specify the bounds of one portion
3976 and arguments @var{start2} and @var{end2} specify the bounds of the
3977 other portion.
3978
3979 Normally, @code{transpose-regions} relocates markers with the transposed
3980 text; a marker previously positioned within one of the two transposed
3981 portions moves along with that portion, thus remaining between the same
3982 two characters in their new position. However, if @var{leave-markers}
3983 is non-@code{nil}, @code{transpose-regions} does not do this---it leaves
3984 all markers unrelocated.
3985 @end defun
3986
3987 @node Base 64
3988 @section Base 64 Encoding
3989 @cindex base 64 encoding
3990
3991 Base 64 code is used in email to encode a sequence of 8-bit bytes as
3992 a longer sequence of @acronym{ASCII} graphic characters. It is defined in
3993 Internet RFC@footnote{
3994 An RFC, an acronym for @dfn{Request for Comments}, is a numbered
3995 Internet informational document describing a standard. RFCs are
3996 usually written by technical experts acting on their own initiative,
3997 and are traditionally written in a pragmatic, experience-driven
3998 manner.
3999 }2045. This section describes the functions for
4000 converting to and from this code.
4001
4002 @deffn Command base64-encode-region beg end &optional no-line-break
4003 This function converts the region from @var{beg} to @var{end} into base
4004 64 code. It returns the length of the encoded text. An error is
4005 signaled if a character in the region is multibyte, i.e.@: in a
4006 multibyte buffer the region must contain only characters from the
4007 charsets @code{ascii}, @code{eight-bit-control} and
4008 @code{eight-bit-graphic}.
4009
4010 Normally, this function inserts newline characters into the encoded
4011 text, to avoid overlong lines. However, if the optional argument
4012 @var{no-line-break} is non-@code{nil}, these newlines are not added, so
4013 the output is just one long line.
4014 @end deffn
4015
4016 @deffn Command base64-encode-string string &optional no-line-break
4017 This function converts the string @var{string} into base 64 code. It
4018 returns a string containing the encoded text. As for
4019 @code{base64-encode-region}, an error is signaled if a character in the
4020 string is multibyte.
4021
4022 Normally, this function inserts newline characters into the encoded
4023 text, to avoid overlong lines. However, if the optional argument
4024 @var{no-line-break} is non-@code{nil}, these newlines are not added, so
4025 the result string is just one long line.
4026 @end deffn
4027
4028 @defun base64-decode-region beg end
4029 This function converts the region from @var{beg} to @var{end} from base
4030 64 code into the corresponding decoded text. It returns the length of
4031 the decoded text.
4032
4033 The decoding functions ignore newline characters in the encoded text.
4034 @end defun
4035
4036 @defun base64-decode-string string
4037 This function converts the string @var{string} from base 64 code into
4038 the corresponding decoded text. It returns a unibyte string containing the
4039 decoded text.
4040
4041 The decoding functions ignore newline characters in the encoded text.
4042 @end defun
4043
4044 @node MD5 Checksum
4045 @section MD5 Checksum
4046 @cindex MD5 checksum
4047 @cindex message digest computation
4048
4049 MD5 cryptographic checksums, or @dfn{message digests}, are 128-bit
4050 ``fingerprints'' of a document or program. They are used to verify
4051 that you have an exact and unaltered copy of the data. The algorithm
4052 to calculate the MD5 message digest is defined in Internet
4053 RFC@footnote{
4054 For an explanation of what is an RFC, see the footnote in @ref{Base
4055 64}.
4056 }1321. This section describes the Emacs facilities for computing
4057 message digests.
4058
4059 @defun md5 object &optional start end coding-system noerror
4060 This function returns the MD5 message digest of @var{object}, which
4061 should be a buffer or a string.
4062
4063 The two optional arguments @var{start} and @var{end} are character
4064 positions specifying the portion of @var{object} to compute the
4065 message digest for. If they are @code{nil} or omitted, the digest is
4066 computed for the whole of @var{object}.
4067
4068 The function @code{md5} does not compute the message digest directly
4069 from the internal Emacs representation of the text (@pxref{Text
4070 Representations}). Instead, it encodes the text using a coding
4071 system, and computes the message digest from the encoded text. The
4072 optional fourth argument @var{coding-system} specifies which coding
4073 system to use for encoding the text. It should be the same coding
4074 system that you used to read the text, or that you used or will use
4075 when saving or sending the text. @xref{Coding Systems}, for more
4076 information about coding systems.
4077
4078 If @var{coding-system} is @code{nil} or omitted, the default depends
4079 on @var{object}. If @var{object} is a buffer, the default for
4080 @var{coding-system} is whatever coding system would be chosen by
4081 default for writing this text into a file. If @var{object} is a
4082 string, the user's most preferred coding system (@pxref{Recognize
4083 Coding, prefer-coding-system, the description of
4084 @code{prefer-coding-system}, emacs, GNU Emacs Manual}) is used.
4085
4086 Normally, @code{md5} signals an error if the text can't be encoded
4087 using the specified or chosen coding system. However, if
4088 @var{noerror} is non-@code{nil}, it silently uses @code{raw-text}
4089 coding instead.
4090 @end defun
4091
4092 @node Parsing HTML
4093 @section Parsing HTML
4094 @cindex parsing html
4095
4096 @defun libxml-parse-html-region start end &optional base-url
4097 This function provides HTML parsing via the @code{libxml2} library.
4098 It parses ``real world'' HTML and tries to return a sensible parse tree
4099 regardless.
4100
4101 In addition to @var{start} and @var{end} (specifying the start and end
4102 of the region to act on), it takes an optional parameter,
4103 @var{base-url}, which is used to expand relative URLs in the document,
4104 if any.
4105
4106 Here's an example demonstrating the structure of the parsed data you
4107 get out. Given this HTML document:
4108
4109 @example
4110 <html><hEad></head><body width=101><div class=thing>Foo<div>Yes
4111 @end example
4112
4113 You get this parse tree:
4114
4115 @example
4116 (html
4117 (head)
4118 (body
4119 (:width . "101")
4120 (div
4121 (:class . "thing")
4122 (text . "Foo")
4123 (div
4124 (text . "Yes\n")))))
4125 @end example
4126
4127 It's a simple tree structure, where the @code{car} for each node is
4128 the name of the node, and the @code{cdr} is the value, or the list of
4129 values.
4130
4131 Attributes are coded the same way as child nodes, but with @samp{:} as
4132 the first character.
4133 @end defun
4134
4135 @cindex parsing xml
4136 @defun libxml-parse-xml-region start end &optional base-url
4137
4138 This is much the same as @code{libxml-parse-html-region} above, but
4139 operates on XML instead of HTML, and is correspondingly stricter about
4140 syntax.
4141 @end defun
4142
4143 @node Atomic Changes
4144 @section Atomic Change Groups
4145 @cindex atomic changes
4146
4147 In database terminology, an @dfn{atomic} change is an indivisible
4148 change---it can succeed entirely or it can fail entirely, but it
4149 cannot partly succeed. A Lisp program can make a series of changes to
4150 one or several buffers as an @dfn{atomic change group}, meaning that
4151 either the entire series of changes will be installed in their buffers
4152 or, in case of an error, none of them will be.
4153
4154 To do this for one buffer, the one already current, simply write a
4155 call to @code{atomic-change-group} around the code that makes the
4156 changes, like this:
4157
4158 @example
4159 (atomic-change-group
4160 (insert foo)
4161 (delete-region x y))
4162 @end example
4163
4164 @noindent
4165 If an error (or other nonlocal exit) occurs inside the body of
4166 @code{atomic-change-group}, it unmakes all the changes in that buffer
4167 that were during the execution of the body. This kind of change group
4168 has no effect on any other buffers---any such changes remain.
4169
4170 If you need something more sophisticated, such as to make changes in
4171 various buffers constitute one atomic group, you must directly call
4172 lower-level functions that @code{atomic-change-group} uses.
4173
4174 @defun prepare-change-group &optional buffer
4175 This function sets up a change group for buffer @var{buffer}, which
4176 defaults to the current buffer. It returns a ``handle'' that
4177 represents the change group. You must use this handle to activate the
4178 change group and subsequently to finish it.
4179 @end defun
4180
4181 To use the change group, you must @dfn{activate} it. You must do
4182 this before making any changes in the text of @var{buffer}.
4183
4184 @defun activate-change-group handle
4185 This function activates the change group that @var{handle} designates.
4186 @end defun
4187
4188 After you activate the change group, any changes you make in that
4189 buffer become part of it. Once you have made all the desired changes
4190 in the buffer, you must @dfn{finish} the change group. There are two
4191 ways to do this: you can either accept (and finalize) all the changes,
4192 or cancel them all.
4193
4194 @defun accept-change-group handle
4195 This function accepts all the changes in the change group specified by
4196 @var{handle}, making them final.
4197 @end defun
4198
4199 @defun cancel-change-group handle
4200 This function cancels and undoes all the changes in the change group
4201 specified by @var{handle}.
4202 @end defun
4203
4204 Your code should use @code{unwind-protect} to make sure the group is
4205 always finished. The call to @code{activate-change-group} should be
4206 inside the @code{unwind-protect}, in case the user types @kbd{C-g}
4207 just after it runs. (This is one reason why
4208 @code{prepare-change-group} and @code{activate-change-group} are
4209 separate functions, because normally you would call
4210 @code{prepare-change-group} before the start of that
4211 @code{unwind-protect}.) Once you finish the group, don't use the
4212 handle again---in particular, don't try to finish the same group
4213 twice.
4214
4215 To make a multibuffer change group, call @code{prepare-change-group}
4216 once for each buffer you want to cover, then use @code{nconc} to
4217 combine the returned values, like this:
4218
4219 @example
4220 (nconc (prepare-change-group buffer-1)
4221 (prepare-change-group buffer-2))
4222 @end example
4223
4224 You can then activate the multibuffer change group with a single call
4225 to @code{activate-change-group}, and finish it with a single call to
4226 @code{accept-change-group} or @code{cancel-change-group}.
4227
4228 Nested use of several change groups for the same buffer works as you
4229 would expect. Non-nested use of change groups for the same buffer
4230 will get Emacs confused, so don't let it happen; the first change
4231 group you start for any given buffer should be the last one finished.
4232
4233 @node Change Hooks
4234 @section Change Hooks
4235 @cindex change hooks
4236 @cindex hooks for text changes
4237
4238 These hook variables let you arrange to take notice of all changes in
4239 all buffers (or in a particular buffer, if you make them buffer-local).
4240 See also @ref{Special Properties}, for how to detect changes to specific
4241 parts of the text.
4242
4243 The functions you use in these hooks should save and restore the match
4244 data if they do anything that uses regular expressions; otherwise, they
4245 will interfere in bizarre ways with the editing operations that call
4246 them.
4247
4248 @defvar before-change-functions
4249 This variable holds a list of functions to call before any buffer
4250 modification. Each function gets two arguments, the beginning and end
4251 of the region that is about to change, represented as integers. The
4252 buffer that is about to change is always the current buffer.
4253 @end defvar
4254
4255 @defvar after-change-functions
4256 This variable holds a list of functions to call after any buffer
4257 modification. Each function receives three arguments: the beginning and
4258 end of the region just changed, and the length of the text that existed
4259 before the change. All three arguments are integers. The buffer that's
4260 about to change is always the current buffer.
4261
4262 The length of the old text is the difference between the buffer positions
4263 before and after that text as it was before the change. As for the
4264 changed text, its length is simply the difference between the first two
4265 arguments.
4266 @end defvar
4267
4268 Output of messages into the @samp{*Messages*} buffer does not
4269 call these functions.
4270
4271 @defmac combine-after-change-calls body@dots{}
4272 The macro executes @var{body} normally, but arranges to call the
4273 after-change functions just once for a series of several changes---if
4274 that seems safe.
4275
4276 If a program makes several text changes in the same area of the buffer,
4277 using the macro @code{combine-after-change-calls} around that part of
4278 the program can make it run considerably faster when after-change hooks
4279 are in use. When the after-change hooks are ultimately called, the
4280 arguments specify a portion of the buffer including all of the changes
4281 made within the @code{combine-after-change-calls} body.
4282
4283 @strong{Warning:} You must not alter the values of
4284 @code{after-change-functions} within
4285 the body of a @code{combine-after-change-calls} form.
4286
4287 @strong{Warning:} if the changes you combine occur in widely scattered
4288 parts of the buffer, this will still work, but it is not advisable,
4289 because it may lead to inefficient behavior for some change hook
4290 functions.
4291 @end defmac
4292
4293 @defvar first-change-hook
4294 This variable is a normal hook that is run whenever a buffer is changed
4295 that was previously in the unmodified state.
4296 @end defvar
4297
4298 @defvar inhibit-modification-hooks
4299 If this variable is non-@code{nil}, all of the change hooks are
4300 disabled; none of them run. This affects all the hook variables
4301 described above in this section, as well as the hooks attached to
4302 certain special text properties (@pxref{Special Properties}) and overlay
4303 properties (@pxref{Overlay Properties}).
4304
4305 Also, this variable is bound to non-@code{nil} while running those
4306 same hook variables, so that by default modifying the buffer from
4307 a modification hook does not cause other modification hooks to be run.
4308 If you do want modification hooks to be run in a particular piece of
4309 code that is itself run from a modification hook, then rebind locally
4310 @code{inhibit-modification-hooks} to @code{nil}.
4311 @end defvar
4312
4313