Support bidi reordering of overlay and display strings.
[bpt/emacs.git] / doc / lispref / text.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990-1995, 1998-2011 Free Software Foundation, Inc.
4 @c See the file elisp.texi for copying conditions.
5 @setfilename ../../info/text
6 @node Text, Non-ASCII Characters, Markers, Top
7 @chapter Text
8 @cindex text
9
10 This chapter describes the functions that deal with the text in a
11 buffer. Most examine, insert, or delete text in the current buffer,
12 often operating at point or on text adjacent to point. Many are
13 interactive. All the functions that change the text provide for undoing
14 the changes (@pxref{Undo}).
15
16 Many text-related functions operate on a region of text defined by two
17 buffer positions passed in arguments named @var{start} and @var{end}.
18 These arguments should be either markers (@pxref{Markers}) or numeric
19 character positions (@pxref{Positions}). The order of these arguments
20 does not matter; it is all right for @var{start} to be the end of the
21 region and @var{end} the beginning. For example, @code{(delete-region 1
22 10)} and @code{(delete-region 10 1)} are equivalent. An
23 @code{args-out-of-range} error is signaled if either @var{start} or
24 @var{end} is outside the accessible portion of the buffer. In an
25 interactive call, point and the mark are used for these arguments.
26
27 @cindex buffer contents
28 Throughout this chapter, ``text'' refers to the characters in the
29 buffer, together with their properties (when relevant). Keep in mind
30 that point is always between two characters, and the cursor appears on
31 the character after point.
32
33 @menu
34 * Near Point:: Examining text in the vicinity of point.
35 * Buffer Contents:: Examining text in a general fashion.
36 * Comparing Text:: Comparing substrings of buffers.
37 * Insertion:: Adding new text to a buffer.
38 * Commands for Insertion:: User-level commands to insert text.
39 * Deletion:: Removing text from a buffer.
40 * User-Level Deletion:: User-level commands to delete text.
41 * The Kill Ring:: Where removed text sometimes is saved for later use.
42 * Undo:: Undoing changes to the text of a buffer.
43 * Maintaining Undo:: How to enable and disable undo information.
44 How to control how much information is kept.
45 * Filling:: Functions for explicit filling.
46 * Margins:: How to specify margins for filling commands.
47 * Adaptive Fill:: Adaptive Fill mode chooses a fill prefix from context.
48 * Auto Filling:: How auto-fill mode is implemented to break lines.
49 * Sorting:: Functions for sorting parts of the buffer.
50 * Columns:: Computing horizontal positions, and using them.
51 * Indentation:: Functions to insert or adjust indentation.
52 * Case Changes:: Case conversion of parts of the buffer.
53 * Text Properties:: Assigning Lisp property lists to text characters.
54 * Substitution:: Replacing a given character wherever it appears.
55 * Transposition:: Swapping two portions of a buffer.
56 * Registers:: How registers are implemented. Accessing the text or
57 position stored in a register.
58 * Base 64:: Conversion to or from base 64 encoding.
59 * MD5 Checksum:: Compute the MD5 "message digest"/"checksum".
60 * Parsing HTML:: Parsing HTML and XML.
61 * Atomic Changes:: Installing several buffer changes "atomically".
62 * Change Hooks:: Supplying functions to be run when text is changed.
63 @end menu
64
65 @node Near Point
66 @section Examining Text Near Point
67 @cindex text near point
68
69 Many functions are provided to look at the characters around point.
70 Several simple functions are described here. See also @code{looking-at}
71 in @ref{Regexp Search}.
72
73 In the following four functions, ``beginning'' or ``end'' of buffer
74 refers to the beginning or end of the accessible portion.
75
76 @defun char-after &optional position
77 This function returns the character in the current buffer at (i.e.,
78 immediately after) position @var{position}. If @var{position} is out of
79 range for this purpose, either before the beginning of the buffer, or at
80 or beyond the end, then the value is @code{nil}. The default for
81 @var{position} is point.
82
83 In the following example, assume that the first character in the
84 buffer is @samp{@@}:
85
86 @example
87 @group
88 (string (char-after 1))
89 @result{} "@@"
90 @end group
91 @end example
92 @end defun
93
94 @defun char-before &optional position
95 This function returns the character in the current buffer immediately
96 before position @var{position}. If @var{position} is out of range for
97 this purpose, either at or before the beginning of the buffer, or beyond
98 the end, then the value is @code{nil}. The default for
99 @var{position} is point.
100 @end defun
101
102 @defun following-char
103 This function returns the character following point in the current
104 buffer. This is similar to @code{(char-after (point))}. However, if
105 point is at the end of the buffer, then @code{following-char} returns 0.
106
107 Remember that point is always between characters, and the cursor
108 normally appears over the character following point. Therefore, the
109 character returned by @code{following-char} is the character the
110 cursor is over.
111
112 In this example, point is between the @samp{a} and the @samp{c}.
113
114 @example
115 @group
116 ---------- Buffer: foo ----------
117 Gentlemen may cry ``Pea@point{}ce! Peace!,''
118 but there is no peace.
119 ---------- Buffer: foo ----------
120 @end group
121
122 @group
123 (string (preceding-char))
124 @result{} "a"
125 (string (following-char))
126 @result{} "c"
127 @end group
128 @end example
129 @end defun
130
131 @defun preceding-char
132 This function returns the character preceding point in the current
133 buffer. See above, under @code{following-char}, for an example. If
134 point is at the beginning of the buffer, @code{preceding-char} returns
135 0.
136 @end defun
137
138 @defun bobp
139 This function returns @code{t} if point is at the beginning of the
140 buffer. If narrowing is in effect, this means the beginning of the
141 accessible portion of the text. See also @code{point-min} in
142 @ref{Point}.
143 @end defun
144
145 @defun eobp
146 This function returns @code{t} if point is at the end of the buffer.
147 If narrowing is in effect, this means the end of accessible portion of
148 the text. See also @code{point-max} in @xref{Point}.
149 @end defun
150
151 @defun bolp
152 This function returns @code{t} if point is at the beginning of a line.
153 @xref{Text Lines}. The beginning of the buffer (or of its accessible
154 portion) always counts as the beginning of a line.
155 @end defun
156
157 @defun eolp
158 This function returns @code{t} if point is at the end of a line. The
159 end of the buffer (or of its accessible portion) is always considered
160 the end of a line.
161 @end defun
162
163 @node Buffer Contents
164 @section Examining Buffer Contents
165
166 This section describes functions that allow a Lisp program to
167 convert any portion of the text in the buffer into a string.
168
169 @defun buffer-substring start end
170 This function returns a string containing a copy of the text of the
171 region defined by positions @var{start} and @var{end} in the current
172 buffer. If the arguments are not positions in the accessible portion of
173 the buffer, @code{buffer-substring} signals an @code{args-out-of-range}
174 error.
175
176 It is not necessary for @var{start} to be less than @var{end}; the
177 arguments can be given in either order. But most often the smaller
178 argument is written first.
179
180 Here's an example which assumes Font-Lock mode is not enabled:
181
182 @example
183 @group
184 ---------- Buffer: foo ----------
185 This is the contents of buffer foo
186
187 ---------- Buffer: foo ----------
188 @end group
189
190 @group
191 (buffer-substring 1 10)
192 @result{} "This is t"
193 @end group
194 @group
195 (buffer-substring (point-max) 10)
196 @result{} "he contents of buffer foo\n"
197 @end group
198 @end example
199
200 If the text being copied has any text properties, these are copied into
201 the string along with the characters they belong to. @xref{Text
202 Properties}. However, overlays (@pxref{Overlays}) in the buffer and
203 their properties are ignored, not copied.
204
205 For example, if Font-Lock mode is enabled, you might get results like
206 these:
207
208 @example
209 @group
210 (buffer-substring 1 10)
211 @result{} #("This is t" 0 1 (fontified t) 1 9 (fontified t))
212 @end group
213 @end example
214 @end defun
215
216 @defun buffer-substring-no-properties start end
217 This is like @code{buffer-substring}, except that it does not copy text
218 properties, just the characters themselves. @xref{Text Properties}.
219 @end defun
220
221 @defun filter-buffer-substring start end &optional delete noprops
222 This function passes the buffer text between @var{start} and @var{end}
223 through the filter functions specified by the variable
224 @code{buffer-substring-filters}, and returns the value from the last
225 filter function. If @code{buffer-substring-filters} is @code{nil},
226 the value is the unaltered text from the buffer, what
227 @code{buffer-substring} would return.
228
229 If @var{delete} is non-@code{nil}, this function deletes the text
230 between @var{start} and @var{end} after copying it, like
231 @code{delete-and-extract-region}.
232
233 If @var{noprops} is non-@code{nil}, the final string returned does not
234 include text properties, while the string passed through the filters
235 still includes text properties from the buffer text.
236
237 Lisp code should use this function instead of @code{buffer-substring},
238 @code{buffer-substring-no-properties},
239 or @code{delete-and-extract-region} when copying into user-accessible
240 data structures such as the kill-ring, X clipboard, and registers.
241 Major and minor modes can add functions to
242 @code{buffer-substring-filters} to alter such text as it is copied out
243 of the buffer.
244 @end defun
245
246 @defvar buffer-substring-filters
247 This variable should be a list of functions that accept a single
248 argument, a string, and return a string.
249 @code{filter-buffer-substring} passes the buffer substring to the
250 first function in this list, and the return value of each function is
251 passed to the next function. The return value of the last function is
252 used as the return value of @code{filter-buffer-substring}.
253
254 As a special convention, point is set to the start of the buffer text
255 being operated on (i.e., the @var{start} argument for
256 @code{filter-buffer-substring}) before these functions are called.
257
258 If this variable is @code{nil}, no filtering is performed.
259 @end defvar
260
261 @defun buffer-string
262 This function returns the contents of the entire accessible portion of
263 the current buffer as a string. It is equivalent to
264
265 @example
266 (buffer-substring (point-min) (point-max))
267 @end example
268
269 @example
270 @group
271 ---------- Buffer: foo ----------
272 This is the contents of buffer foo
273
274 ---------- Buffer: foo ----------
275
276 (buffer-string)
277 @result{} "This is the contents of buffer foo\n"
278 @end group
279 @end example
280 @end defun
281
282 @defun current-word &optional strict really-word
283 This function returns the symbol (or word) at or near point, as a string.
284 The return value includes no text properties.
285
286 If the optional argument @var{really-word} is non-@code{nil}, it finds a
287 word; otherwise, it finds a symbol (which includes both word
288 characters and symbol constituent characters).
289
290 If the optional argument @var{strict} is non-@code{nil}, then point
291 must be in or next to the symbol or word---if no symbol or word is
292 there, the function returns @code{nil}. Otherwise, a nearby symbol or
293 word on the same line is acceptable.
294 @end defun
295
296 @defun thing-at-point thing
297 Return the @var{thing} around or next to point, as a string.
298
299 The argument @var{thing} is a symbol which specifies a kind of syntactic
300 entity. Possibilities include @code{symbol}, @code{list}, @code{sexp},
301 @code{defun}, @code{filename}, @code{url}, @code{word}, @code{sentence},
302 @code{whitespace}, @code{line}, @code{page}, and others.
303
304 @example
305 ---------- Buffer: foo ----------
306 Gentlemen may cry ``Pea@point{}ce! Peace!,''
307 but there is no peace.
308 ---------- Buffer: foo ----------
309
310 (thing-at-point 'word)
311 @result{} "Peace"
312 (thing-at-point 'line)
313 @result{} "Gentlemen may cry ``Peace! Peace!,''\n"
314 (thing-at-point 'whitespace)
315 @result{} nil
316 @end example
317 @end defun
318
319 @node Comparing Text
320 @section Comparing Text
321 @cindex comparing buffer text
322
323 This function lets you compare portions of the text in a buffer, without
324 copying them into strings first.
325
326 @defun compare-buffer-substrings buffer1 start1 end1 buffer2 start2 end2
327 This function lets you compare two substrings of the same buffer or two
328 different buffers. The first three arguments specify one substring,
329 giving a buffer (or a buffer name) and two positions within the
330 buffer. The last three arguments specify the other substring in the
331 same way. You can use @code{nil} for @var{buffer1}, @var{buffer2}, or
332 both to stand for the current buffer.
333
334 The value is negative if the first substring is less, positive if the
335 first is greater, and zero if they are equal. The absolute value of
336 the result is one plus the index of the first differing characters
337 within the substrings.
338
339 This function ignores case when comparing characters
340 if @code{case-fold-search} is non-@code{nil}. It always ignores
341 text properties.
342
343 Suppose the current buffer contains the text @samp{foobarbar
344 haha!rara!}; then in this example the two substrings are @samp{rbar }
345 and @samp{rara!}. The value is 2 because the first substring is greater
346 at the second character.
347
348 @example
349 (compare-buffer-substrings nil 6 11 nil 16 21)
350 @result{} 2
351 @end example
352 @end defun
353
354 @node Insertion
355 @section Inserting Text
356 @cindex insertion of text
357 @cindex text insertion
358
359 @cindex insertion before point
360 @cindex before point, insertion
361 @dfn{Insertion} means adding new text to a buffer. The inserted text
362 goes at point---between the character before point and the character
363 after point. Some insertion functions leave point before the inserted
364 text, while other functions leave it after. We call the former
365 insertion @dfn{after point} and the latter insertion @dfn{before point}.
366
367 Insertion moves markers located at positions after the insertion
368 point, so that they stay with the surrounding text (@pxref{Markers}).
369 When a marker points at the place of insertion, insertion may or may
370 not relocate the marker, depending on the marker's insertion type
371 (@pxref{Marker Insertion Types}). Certain special functions such as
372 @code{insert-before-markers} relocate all such markers to point after
373 the inserted text, regardless of the markers' insertion type.
374
375 Insertion functions signal an error if the current buffer is
376 read-only or if they insert within read-only text.
377
378 These functions copy text characters from strings and buffers along
379 with their properties. The inserted characters have exactly the same
380 properties as the characters they were copied from. By contrast,
381 characters specified as separate arguments, not part of a string or
382 buffer, inherit their text properties from the neighboring text.
383
384 The insertion functions convert text from unibyte to multibyte in
385 order to insert in a multibyte buffer, and vice versa---if the text
386 comes from a string or from a buffer. However, they do not convert
387 unibyte character codes 128 through 255 to multibyte characters, not
388 even if the current buffer is a multibyte buffer. @xref{Converting
389 Representations}.
390
391 @defun insert &rest args
392 This function inserts the strings and/or characters @var{args} into the
393 current buffer, at point, moving point forward. In other words, it
394 inserts the text before point. An error is signaled unless all
395 @var{args} are either strings or characters. The value is @code{nil}.
396 @end defun
397
398 @defun insert-before-markers &rest args
399 This function inserts the strings and/or characters @var{args} into the
400 current buffer, at point, moving point forward. An error is signaled
401 unless all @var{args} are either strings or characters. The value is
402 @code{nil}.
403
404 This function is unlike the other insertion functions in that it
405 relocates markers initially pointing at the insertion point, to point
406 after the inserted text. If an overlay begins at the insertion point,
407 the inserted text falls outside the overlay; if a nonempty overlay
408 ends at the insertion point, the inserted text falls inside that
409 overlay.
410 @end defun
411
412 @defun insert-char character count &optional inherit
413 This function inserts @var{count} instances of @var{character} into the
414 current buffer before point. The argument @var{count} should be an
415 integer, and @var{character} must be a character. The value is @code{nil}.
416
417 This function does not convert unibyte character codes 128 through 255
418 to multibyte characters, not even if the current buffer is a multibyte
419 buffer. @xref{Converting Representations}.
420
421 If @var{inherit} is non-@code{nil}, then the inserted characters inherit
422 sticky text properties from the two characters before and after the
423 insertion point. @xref{Sticky Properties}.
424 @end defun
425
426 @defun insert-buffer-substring from-buffer-or-name &optional start end
427 This function inserts a portion of buffer @var{from-buffer-or-name}
428 (which must already exist) into the current buffer before point. The
429 text inserted is the region between @var{start} and @var{end}. (These
430 arguments default to the beginning and end of the accessible portion of
431 that buffer.) This function returns @code{nil}.
432
433 In this example, the form is executed with buffer @samp{bar} as the
434 current buffer. We assume that buffer @samp{bar} is initially empty.
435
436 @example
437 @group
438 ---------- Buffer: foo ----------
439 We hold these truths to be self-evident, that all
440 ---------- Buffer: foo ----------
441 @end group
442
443 @group
444 (insert-buffer-substring "foo" 1 20)
445 @result{} nil
446
447 ---------- Buffer: bar ----------
448 We hold these truth@point{}
449 ---------- Buffer: bar ----------
450 @end group
451 @end example
452 @end defun
453
454 @defun insert-buffer-substring-no-properties from-buffer-or-name &optional start end
455 This is like @code{insert-buffer-substring} except that it does not
456 copy any text properties.
457 @end defun
458
459 @xref{Sticky Properties}, for other insertion functions that inherit
460 text properties from the nearby text in addition to inserting it.
461 Whitespace inserted by indentation functions also inherits text
462 properties.
463
464 @node Commands for Insertion
465 @section User-Level Insertion Commands
466
467 This section describes higher-level commands for inserting text,
468 commands intended primarily for the user but useful also in Lisp
469 programs.
470
471 @deffn Command insert-buffer from-buffer-or-name
472 This command inserts the entire accessible contents of
473 @var{from-buffer-or-name} (which must exist) into the current buffer
474 after point. It leaves the mark after the inserted text. The value
475 is @code{nil}.
476 @end deffn
477
478 @deffn Command self-insert-command count
479 @cindex character insertion
480 @cindex self-insertion
481 This command inserts the last character typed; it does so @var{count}
482 times, before point, and returns @code{nil}. Most printing characters
483 are bound to this command. In routine use, @code{self-insert-command}
484 is the most frequently called function in Emacs, but programs rarely use
485 it except to install it on a keymap.
486
487 In an interactive call, @var{count} is the numeric prefix argument.
488
489 Self-insertion translates the input character through
490 @code{translation-table-for-input}. @xref{Translation of Characters}.
491
492 This command calls @code{auto-fill-function} whenever that is
493 non-@code{nil} and the character inserted is in the table
494 @code{auto-fill-chars} (@pxref{Auto Filling}).
495
496 @c Cross refs reworded to prevent overfull hbox. --rjc 15mar92
497 This command performs abbrev expansion if Abbrev mode is enabled and
498 the inserted character does not have word-constituent
499 syntax. (@xref{Abbrevs}, and @ref{Syntax Class Table}.) It is also
500 responsible for calling @code{blink-paren-function} when the inserted
501 character has close parenthesis syntax (@pxref{Blinking}).
502
503 Do not try substituting your own definition of
504 @code{self-insert-command} for the standard one. The editor command
505 loop handles this function specially.
506 @end deffn
507
508 @deffn Command newline &optional number-of-newlines
509 This command inserts newlines into the current buffer before point.
510 If @var{number-of-newlines} is supplied, that many newline characters
511 are inserted.
512
513 @cindex newline and Auto Fill mode
514 This function calls @code{auto-fill-function} if the current column
515 number is greater than the value of @code{fill-column} and
516 @var{number-of-newlines} is @code{nil}. Typically what
517 @code{auto-fill-function} does is insert a newline; thus, the overall
518 result in this case is to insert two newlines at different places: one
519 at point, and another earlier in the line. @code{newline} does not
520 auto-fill if @var{number-of-newlines} is non-@code{nil}.
521
522 This command indents to the left margin if that is not zero.
523 @xref{Margins}.
524
525 The value returned is @code{nil}. In an interactive call, @var{count}
526 is the numeric prefix argument.
527 @end deffn
528
529 @defvar overwrite-mode
530 This variable controls whether overwrite mode is in effect. The value
531 should be @code{overwrite-mode-textual}, @code{overwrite-mode-binary},
532 or @code{nil}. @code{overwrite-mode-textual} specifies textual
533 overwrite mode (treats newlines and tabs specially), and
534 @code{overwrite-mode-binary} specifies binary overwrite mode (treats
535 newlines and tabs like any other characters).
536 @end defvar
537
538 @node Deletion
539 @section Deleting Text
540 @cindex text deletion
541
542 @cindex deleting text vs killing
543 Deletion means removing part of the text in a buffer, without saving
544 it in the kill ring (@pxref{The Kill Ring}). Deleted text can't be
545 yanked, but can be reinserted using the undo mechanism (@pxref{Undo}).
546 Some deletion functions do save text in the kill ring in some special
547 cases.
548
549 All of the deletion functions operate on the current buffer.
550
551 @deffn Command erase-buffer
552 This function deletes the entire text of the current buffer
553 (@emph{not} just the accessible portion), leaving it
554 empty. If the buffer is read-only, it signals a @code{buffer-read-only}
555 error; if some of the text in it is read-only, it signals a
556 @code{text-read-only} error. Otherwise, it deletes the text without
557 asking for any confirmation. It returns @code{nil}.
558
559 Normally, deleting a large amount of text from a buffer inhibits further
560 auto-saving of that buffer ``because it has shrunk.'' However,
561 @code{erase-buffer} does not do this, the idea being that the future
562 text is not really related to the former text, and its size should not
563 be compared with that of the former text.
564 @end deffn
565
566 @deffn Command delete-region start end
567 This command deletes the text between positions @var{start} and
568 @var{end} in the current buffer, and returns @code{nil}. If point was
569 inside the deleted region, its value afterward is @var{start}.
570 Otherwise, point relocates with the surrounding text, as markers do.
571 @end deffn
572
573 @defun delete-and-extract-region start end
574 This function deletes the text between positions @var{start} and
575 @var{end} in the current buffer, and returns a string containing the
576 text just deleted.
577
578 If point was inside the deleted region, its value afterward is
579 @var{start}. Otherwise, point relocates with the surrounding text, as
580 markers do.
581 @end defun
582
583 @deffn Command delete-char count &optional killp
584 This command deletes @var{count} characters directly after point, or
585 before point if @var{count} is negative. If @var{killp} is
586 non-@code{nil}, then it saves the deleted characters in the kill ring.
587
588 In an interactive call, @var{count} is the numeric prefix argument, and
589 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
590 argument is supplied, the text is saved in the kill ring. If no prefix
591 argument is supplied, then one character is deleted, but not saved in
592 the kill ring.
593
594 The value returned is always @code{nil}.
595 @end deffn
596
597 @deffn Command delete-backward-char count &optional killp
598 @cindex deleting previous char
599 This command deletes @var{count} characters directly before point, or
600 after point if @var{count} is negative. If @var{killp} is
601 non-@code{nil}, then it saves the deleted characters in the kill ring.
602
603 In an interactive call, @var{count} is the numeric prefix argument, and
604 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
605 argument is supplied, the text is saved in the kill ring. If no prefix
606 argument is supplied, then one character is deleted, but not saved in
607 the kill ring.
608
609 The value returned is always @code{nil}.
610 @end deffn
611
612 @deffn Command backward-delete-char-untabify count &optional killp
613 @cindex tab deletion
614 This command deletes @var{count} characters backward, changing tabs
615 into spaces. When the next character to be deleted is a tab, it is
616 first replaced with the proper number of spaces to preserve alignment
617 and then one of those spaces is deleted instead of the tab. If
618 @var{killp} is non-@code{nil}, then the command saves the deleted
619 characters in the kill ring.
620
621 Conversion of tabs to spaces happens only if @var{count} is positive.
622 If it is negative, exactly @minus{}@var{count} characters after point
623 are deleted.
624
625 In an interactive call, @var{count} is the numeric prefix argument, and
626 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
627 argument is supplied, the text is saved in the kill ring. If no prefix
628 argument is supplied, then one character is deleted, but not saved in
629 the kill ring.
630
631 The value returned is always @code{nil}.
632 @end deffn
633
634 @defopt backward-delete-char-untabify-method
635 This option specifies how @code{backward-delete-char-untabify} should
636 deal with whitespace. Possible values include @code{untabify}, the
637 default, meaning convert a tab to many spaces and delete one;
638 @code{hungry}, meaning delete all tabs and spaces before point with
639 one command; @code{all} meaning delete all tabs, spaces and newlines
640 before point, and @code{nil}, meaning do nothing special for
641 whitespace characters.
642 @end defopt
643
644 @node User-Level Deletion
645 @section User-Level Deletion Commands
646
647 This section describes higher-level commands for deleting text,
648 commands intended primarily for the user but useful also in Lisp
649 programs.
650
651 @deffn Command delete-horizontal-space &optional backward-only
652 @cindex deleting whitespace
653 This function deletes all spaces and tabs around point. It returns
654 @code{nil}.
655
656 If @var{backward-only} is non-@code{nil}, the function deletes
657 spaces and tabs before point, but not after point.
658
659 In the following examples, we call @code{delete-horizontal-space} four
660 times, once on each line, with point between the second and third
661 characters on the line each time.
662
663 @example
664 @group
665 ---------- Buffer: foo ----------
666 I @point{}thought
667 I @point{} thought
668 We@point{} thought
669 Yo@point{}u thought
670 ---------- Buffer: foo ----------
671 @end group
672
673 @group
674 (delete-horizontal-space) ; @r{Four times.}
675 @result{} nil
676
677 ---------- Buffer: foo ----------
678 Ithought
679 Ithought
680 Wethought
681 You thought
682 ---------- Buffer: foo ----------
683 @end group
684 @end example
685 @end deffn
686
687 @deffn Command delete-indentation &optional join-following-p
688 This function joins the line point is on to the previous line, deleting
689 any whitespace at the join and in some cases replacing it with one
690 space. If @var{join-following-p} is non-@code{nil},
691 @code{delete-indentation} joins this line to the following line
692 instead. The function returns @code{nil}.
693
694 If there is a fill prefix, and the second of the lines being joined
695 starts with the prefix, then @code{delete-indentation} deletes the
696 fill prefix before joining the lines. @xref{Margins}.
697
698 In the example below, point is located on the line starting
699 @samp{events}, and it makes no difference if there are trailing spaces
700 in the preceding line.
701
702 @smallexample
703 @group
704 ---------- Buffer: foo ----------
705 When in the course of human
706 @point{} events, it becomes necessary
707 ---------- Buffer: foo ----------
708 @end group
709
710 (delete-indentation)
711 @result{} nil
712
713 @group
714 ---------- Buffer: foo ----------
715 When in the course of human@point{} events, it becomes necessary
716 ---------- Buffer: foo ----------
717 @end group
718 @end smallexample
719
720 After the lines are joined, the function @code{fixup-whitespace} is
721 responsible for deciding whether to leave a space at the junction.
722 @end deffn
723
724 @deffn Command fixup-whitespace
725 This function replaces all the horizontal whitespace surrounding point
726 with either one space or no space, according to the context. It
727 returns @code{nil}.
728
729 At the beginning or end of a line, the appropriate amount of space is
730 none. Before a character with close parenthesis syntax, or after a
731 character with open parenthesis or expression-prefix syntax, no space is
732 also appropriate. Otherwise, one space is appropriate. @xref{Syntax
733 Class Table}.
734
735 In the example below, @code{fixup-whitespace} is called the first time
736 with point before the word @samp{spaces} in the first line. For the
737 second invocation, point is directly after the @samp{(}.
738
739 @smallexample
740 @group
741 ---------- Buffer: foo ----------
742 This has too many @point{}spaces
743 This has too many spaces at the start of (@point{} this list)
744 ---------- Buffer: foo ----------
745 @end group
746
747 @group
748 (fixup-whitespace)
749 @result{} nil
750 (fixup-whitespace)
751 @result{} nil
752 @end group
753
754 @group
755 ---------- Buffer: foo ----------
756 This has too many spaces
757 This has too many spaces at the start of (this list)
758 ---------- Buffer: foo ----------
759 @end group
760 @end smallexample
761 @end deffn
762
763 @deffn Command just-one-space &optional n
764 @comment !!SourceFile simple.el
765 This command replaces any spaces and tabs around point with a single
766 space, or @var{n} spaces if @var{n} is specified. It returns
767 @code{nil}.
768 @end deffn
769
770 @deffn Command delete-blank-lines
771 This function deletes blank lines surrounding point. If point is on a
772 blank line with one or more blank lines before or after it, then all but
773 one of them are deleted. If point is on an isolated blank line, then it
774 is deleted. If point is on a nonblank line, the command deletes all
775 blank lines immediately following it.
776
777 A blank line is defined as a line containing only tabs and spaces.
778
779 @code{delete-blank-lines} returns @code{nil}.
780 @end deffn
781
782 @node The Kill Ring
783 @section The Kill Ring
784 @cindex kill ring
785
786 @dfn{Kill functions} delete text like the deletion functions, but save
787 it so that the user can reinsert it by @dfn{yanking}. Most of these
788 functions have @samp{kill-} in their name. By contrast, the functions
789 whose names start with @samp{delete-} normally do not save text for
790 yanking (though they can still be undone); these are ``deletion''
791 functions.
792
793 Most of the kill commands are primarily for interactive use, and are
794 not described here. What we do describe are the functions provided for
795 use in writing such commands. You can use these functions to write
796 commands for killing text. When you need to delete text for internal
797 purposes within a Lisp function, you should normally use deletion
798 functions, so as not to disturb the kill ring contents.
799 @xref{Deletion}.
800
801 Killed text is saved for later yanking in the @dfn{kill ring}. This
802 is a list that holds a number of recent kills, not just the last text
803 kill. We call this a ``ring'' because yanking treats it as having
804 elements in a cyclic order. The list is kept in the variable
805 @code{kill-ring}, and can be operated on with the usual functions for
806 lists; there are also specialized functions, described in this section,
807 that treat it as a ring.
808
809 Some people think this use of the word ``kill'' is unfortunate, since
810 it refers to operations that specifically @emph{do not} destroy the
811 entities ``killed.'' This is in sharp contrast to ordinary life, in
812 which death is permanent and ``killed'' entities do not come back to
813 life. Therefore, other metaphors have been proposed. For example, the
814 term ``cut ring'' makes sense to people who, in pre-computer days, used
815 scissors and paste to cut up and rearrange manuscripts. However, it
816 would be difficult to change the terminology now.
817
818 @menu
819 * Kill Ring Concepts:: What text looks like in the kill ring.
820 * Kill Functions:: Functions that kill text.
821 * Yanking:: How yanking is done.
822 * Yank Commands:: Commands that access the kill ring.
823 * Low-Level Kill Ring:: Functions and variables for kill ring access.
824 * Internals of Kill Ring:: Variables that hold kill ring data.
825 @end menu
826
827 @node Kill Ring Concepts
828 @comment node-name, next, previous, up
829 @subsection Kill Ring Concepts
830
831 The kill ring records killed text as strings in a list, most recent
832 first. A short kill ring, for example, might look like this:
833
834 @example
835 ("some text" "a different piece of text" "even older text")
836 @end example
837
838 @noindent
839 When the list reaches @code{kill-ring-max} entries in length, adding a
840 new entry automatically deletes the last entry.
841
842 When kill commands are interwoven with other commands, each kill
843 command makes a new entry in the kill ring. Multiple kill commands in
844 succession build up a single kill ring entry, which would be yanked as a
845 unit; the second and subsequent consecutive kill commands add text to
846 the entry made by the first one.
847
848 For yanking, one entry in the kill ring is designated the ``front'' of
849 the ring. Some yank commands ``rotate'' the ring by designating a
850 different element as the ``front.'' But this virtual rotation doesn't
851 change the list itself---the most recent entry always comes first in the
852 list.
853
854 @node Kill Functions
855 @comment node-name, next, previous, up
856 @subsection Functions for Killing
857
858 @code{kill-region} is the usual subroutine for killing text. Any
859 command that calls this function is a ``kill command'' (and should
860 probably have @samp{kill} in its name). @code{kill-region} puts the
861 newly killed text in a new element at the beginning of the kill ring or
862 adds it to the most recent element. It determines automatically (using
863 @code{last-command}) whether the previous command was a kill command,
864 and if so appends the killed text to the most recent entry.
865
866 @deffn Command kill-region start end
867 This function kills the text in the region defined by @var{start} and
868 @var{end}. The text is deleted but saved in the kill ring, along with
869 its text properties. The value is always @code{nil}.
870
871 In an interactive call, @var{start} and @var{end} are point and
872 the mark.
873
874 If the buffer or text is read-only, @code{kill-region} modifies the kill
875 ring just the same, then signals an error without modifying the buffer.
876 This is convenient because it lets the user use a series of kill
877 commands to copy text from a read-only buffer into the kill ring.
878 @end deffn
879
880 @defopt kill-read-only-ok
881 If this option is non-@code{nil}, @code{kill-region} does not signal an
882 error if the buffer or text is read-only. Instead, it simply returns,
883 updating the kill ring but not changing the buffer.
884 @end defopt
885
886 @deffn Command copy-region-as-kill start end
887 This command saves the region defined by @var{start} and @var{end} on
888 the kill ring (including text properties), but does not delete the text
889 from the buffer. It returns @code{nil}.
890
891 The command does not set @code{this-command} to @code{kill-region}, so a
892 subsequent kill command does not append to the same kill ring entry.
893
894 @c FIXME Why is it better? Why isn't copy-region-as-kill obsolete then?
895 @c Why is it used in many places in Emacs?
896 In Lisp programs, it is better to use @code{kill-new} or
897 @code{kill-append} instead of this command. @xref{Low-Level Kill Ring}.
898 @end deffn
899
900 @node Yanking
901 @subsection Yanking
902
903 Yanking means inserting text from the kill ring, but it does
904 not insert the text blindly. Yank commands and some other commands
905 use @code{insert-for-yank} to perform special processing on the
906 text that they copy into the buffer.
907
908 @defun insert-for-yank string
909 This function normally works like @code{insert} except that it doesn't
910 insert the text properties in the @code{yank-excluded-properties}
911 list. However, if any part of @var{string} has a non-@code{nil}
912 @code{yank-handler} text property, that property can do various
913 special processing on that part of the text being inserted.
914 @end defun
915
916 @defun insert-buffer-substring-as-yank buf &optional start end
917 This function resembles @code{insert-buffer-substring} except that it
918 doesn't insert the text properties in the
919 @code{yank-excluded-properties} list.
920 @end defun
921
922 You can put a @code{yank-handler} text property on all or part of
923 the text to control how it will be inserted if it is yanked. The
924 @code{insert-for-yank} function looks for that property. The property
925 value must be a list of one to four elements, with the following
926 format (where elements after the first may be omitted):
927
928 @example
929 (@var{function} @var{param} @var{noexclude} @var{undo})
930 @end example
931
932 Here is what the elements do:
933
934 @table @var
935 @item function
936 When @var{function} is present and non-@code{nil}, it is called instead of
937 @code{insert} to insert the string. @var{function} takes one
938 argument---the string to insert.
939
940 @item param
941 If @var{param} is present and non-@code{nil}, it replaces @var{string}
942 (or the part of @var{string} being processed) as the object passed to
943 @var{function} (or @code{insert}); for example, if @var{function} is
944 @code{yank-rectangle}, @var{param} should be a list of strings to
945 insert as a rectangle.
946
947 @item noexclude
948 If @var{noexclude} is present and non-@code{nil}, the normal removal of the
949 yank-excluded-properties is not performed; instead @var{function} is
950 responsible for removing those properties. This may be necessary
951 if @var{function} adjusts point before or after inserting the object.
952
953 @item undo
954 If @var{undo} is present and non-@code{nil}, it is a function that will be
955 called by @code{yank-pop} to undo the insertion of the current object.
956 It is called with two arguments, the start and end of the current
957 region. @var{function} can set @code{yank-undo-function} to override
958 the @var{undo} value.
959 @end table
960
961 @node Yank Commands
962 @comment node-name, next, previous, up
963 @subsection Functions for Yanking
964
965 This section describes higher-level commands for yanking, which are
966 intended primarily for the user but useful also in Lisp programs.
967 Both @code{yank} and @code{yank-pop} honor the
968 @code{yank-excluded-properties} variable and @code{yank-handler} text
969 property (@pxref{Yanking}).
970
971 @deffn Command yank &optional arg
972 @cindex inserting killed text
973 This command inserts before point the text at the front of the kill
974 ring. It sets the mark at the beginning of that text, using
975 @code{push-mark} (@pxref{The Mark}), and puts point at the end.
976
977 If @var{arg} is a non-@code{nil} list (which occurs interactively when
978 the user types @kbd{C-u} with no digits), then @code{yank} inserts the
979 text as described above, but puts point before the yanked text and
980 sets the mark after it.
981
982 If @var{arg} is a number, then @code{yank} inserts the @var{arg}th
983 most recently killed text---the @var{arg}th element of the kill ring
984 list, counted cyclically from the front, which is considered the
985 first element for this purpose.
986
987 @code{yank} does not alter the contents of the kill ring, unless it
988 used text provided by another program, in which case it pushes that text
989 onto the kill ring. However if @var{arg} is an integer different from
990 one, it rotates the kill ring to place the yanked string at the front.
991
992 @code{yank} returns @code{nil}.
993 @end deffn
994
995 @deffn Command yank-pop &optional arg
996 This command replaces the just-yanked entry from the kill ring with a
997 different entry from the kill ring.
998
999 This is allowed only immediately after a @code{yank} or another
1000 @code{yank-pop}. At such a time, the region contains text that was just
1001 inserted by yanking. @code{yank-pop} deletes that text and inserts in
1002 its place a different piece of killed text. It does not add the deleted
1003 text to the kill ring, since it is already in the kill ring somewhere.
1004 It does however rotate the kill ring to place the newly yanked string at
1005 the front.
1006
1007 If @var{arg} is @code{nil}, then the replacement text is the previous
1008 element of the kill ring. If @var{arg} is numeric, the replacement is
1009 the @var{arg}th previous kill. If @var{arg} is negative, a more recent
1010 kill is the replacement.
1011
1012 The sequence of kills in the kill ring wraps around, so that after the
1013 oldest one comes the newest one, and before the newest one goes the
1014 oldest.
1015
1016 The return value is always @code{nil}.
1017 @end deffn
1018
1019 @defvar yank-undo-function
1020 If this variable is non-@code{nil}, the function @code{yank-pop} uses
1021 its value instead of @code{delete-region} to delete the text
1022 inserted by the previous @code{yank} or
1023 @code{yank-pop} command. The value must be a function of two
1024 arguments, the start and end of the current region.
1025
1026 The function @code{insert-for-yank} automatically sets this variable
1027 according to the @var{undo} element of the @code{yank-handler}
1028 text property, if there is one.
1029 @end defvar
1030
1031 @node Low-Level Kill Ring
1032 @subsection Low-Level Kill Ring
1033
1034 These functions and variables provide access to the kill ring at a
1035 lower level, but are still convenient for use in Lisp programs,
1036 because they take care of interaction with window system selections
1037 (@pxref{Window System Selections}).
1038
1039 @defun current-kill n &optional do-not-move
1040 The function @code{current-kill} rotates the yanking pointer, which
1041 designates the ``front'' of the kill ring, by @var{n} places (from newer
1042 kills to older ones), and returns the text at that place in the ring.
1043
1044 If the optional second argument @var{do-not-move} is non-@code{nil},
1045 then @code{current-kill} doesn't alter the yanking pointer; it just
1046 returns the @var{n}th kill, counting from the current yanking pointer.
1047
1048 If @var{n} is zero, indicating a request for the latest kill,
1049 @code{current-kill} calls the value of
1050 @code{interprogram-paste-function} (documented below) before
1051 consulting the kill ring. If that value is a function and calling it
1052 returns a string or a list of several string, @code{current-kill}
1053 pushes the strings onto the kill ring and returns the first string.
1054 It also sets the yanking pointer to point to the kill-ring entry of
1055 the first string returned by @code{interprogram-paste-function},
1056 regardless of the value of @var{do-not-move}. Otherwise,
1057 @code{current-kill} does not treat a zero value for @var{n} specially:
1058 it returns the entry pointed at by the yanking pointer and does not
1059 move the yanking pointer.
1060 @end defun
1061
1062 @defun kill-new string &optional replace
1063 This function pushes the text @var{string} onto the kill ring and
1064 makes the yanking pointer point to it. It discards the oldest entry
1065 if appropriate. It also invokes the value of
1066 @code{interprogram-cut-function} (see below).
1067
1068 If @var{replace} is non-@code{nil}, then @code{kill-new} replaces the
1069 first element of the kill ring with @var{string}, rather than pushing
1070 @var{string} onto the kill ring.
1071 @end defun
1072
1073 @defun kill-append string before-p
1074 This function appends the text @var{string} to the first entry in the
1075 kill ring and makes the yanking pointer point to the combined entry.
1076 Normally @var{string} goes at the end of the entry, but if
1077 @var{before-p} is non-@code{nil}, it goes at the beginning. This
1078 function also invokes the value of @code{interprogram-cut-function}
1079 (see below).
1080 @end defun
1081
1082 @defvar interprogram-paste-function
1083 This variable provides a way of transferring killed text from other
1084 programs, when you are using a window system. Its value should be
1085 @code{nil} or a function of no arguments.
1086
1087 If the value is a function, @code{current-kill} calls it to get the
1088 ``most recent kill.'' If the function returns a non-@code{nil} value,
1089 then that value is used as the ``most recent kill.'' If it returns
1090 @code{nil}, then the front of the kill ring is used.
1091
1092 To facilitate support for window systems that support multiple
1093 selections, this function may also return a list of strings. In that
1094 case, the first string is used as the ``most recent kill'', and all
1095 the other strings are pushed onto the kill ring, for easy access by
1096 @code{yank-pop}.
1097
1098 The normal use of this function is to get the window system's primary
1099 selection as the most recent kill, even if the selection belongs to
1100 another application. @xref{Window System Selections}. However, if
1101 the selection was provided by the current Emacs session, this function
1102 should return @code{nil}. (If it is hard to tell whether Emacs or
1103 some other program provided the selection, it should be good enough to
1104 use @code{string=} to compare it with the last text Emacs provided.)
1105 @end defvar
1106
1107 @defvar interprogram-cut-function
1108 This variable provides a way of communicating killed text to other
1109 programs, when you are using a window system. Its value should be
1110 @code{nil} or a function of one required argument.
1111
1112 If the value is a function, @code{kill-new} and @code{kill-append} call
1113 it with the new first element of the kill ring as the argument.
1114
1115 The normal use of this function is to set the window system's primary
1116 selection from the newly killed text.
1117 @xref{Window System Selections}.
1118 @end defvar
1119
1120 @node Internals of Kill Ring
1121 @comment node-name, next, previous, up
1122 @subsection Internals of the Kill Ring
1123
1124 The variable @code{kill-ring} holds the kill ring contents, in the
1125 form of a list of strings. The most recent kill is always at the front
1126 of the list.
1127
1128 The @code{kill-ring-yank-pointer} variable points to a link in the
1129 kill ring list, whose @sc{car} is the text to yank next. We say it
1130 identifies the ``front'' of the ring. Moving
1131 @code{kill-ring-yank-pointer} to a different link is called
1132 @dfn{rotating the kill ring}. We call the kill ring a ``ring'' because
1133 the functions that move the yank pointer wrap around from the end of the
1134 list to the beginning, or vice-versa. Rotation of the kill ring is
1135 virtual; it does not change the value of @code{kill-ring}.
1136
1137 Both @code{kill-ring} and @code{kill-ring-yank-pointer} are Lisp
1138 variables whose values are normally lists. The word ``pointer'' in the
1139 name of the @code{kill-ring-yank-pointer} indicates that the variable's
1140 purpose is to identify one element of the list for use by the next yank
1141 command.
1142
1143 The value of @code{kill-ring-yank-pointer} is always @code{eq} to one
1144 of the links in the kill ring list. The element it identifies is the
1145 @sc{car} of that link. Kill commands, which change the kill ring, also
1146 set this variable to the value of @code{kill-ring}. The effect is to
1147 rotate the ring so that the newly killed text is at the front.
1148
1149 Here is a diagram that shows the variable @code{kill-ring-yank-pointer}
1150 pointing to the second entry in the kill ring @code{("some text" "a
1151 different piece of text" "yet older text")}.
1152
1153 @example
1154 @group
1155 kill-ring ---- kill-ring-yank-pointer
1156 | |
1157 | v
1158 | --- --- --- --- --- ---
1159 --> | | |------> | | |--> | | |--> nil
1160 --- --- --- --- --- ---
1161 | | |
1162 | | |
1163 | | -->"yet older text"
1164 | |
1165 | --> "a different piece of text"
1166 |
1167 --> "some text"
1168 @end group
1169 @end example
1170
1171 @noindent
1172 This state of affairs might occur after @kbd{C-y} (@code{yank})
1173 immediately followed by @kbd{M-y} (@code{yank-pop}).
1174
1175 @defvar kill-ring
1176 This variable holds the list of killed text sequences, most recently
1177 killed first.
1178 @end defvar
1179
1180 @defvar kill-ring-yank-pointer
1181 This variable's value indicates which element of the kill ring is at the
1182 ``front'' of the ring for yanking. More precisely, the value is a tail
1183 of the value of @code{kill-ring}, and its @sc{car} is the kill string
1184 that @kbd{C-y} should yank.
1185 @end defvar
1186
1187 @defopt kill-ring-max
1188 The value of this variable is the maximum length to which the kill
1189 ring can grow, before elements are thrown away at the end. The default
1190 value for @code{kill-ring-max} is 60.
1191 @end defopt
1192
1193 @node Undo
1194 @comment node-name, next, previous, up
1195 @section Undo
1196 @cindex redo
1197
1198 Most buffers have an @dfn{undo list}, which records all changes made
1199 to the buffer's text so that they can be undone. (The buffers that
1200 don't have one are usually special-purpose buffers for which Emacs
1201 assumes that undoing is not useful. In particular, any buffer whose
1202 name begins with a space has its undo recording off by default;
1203 see @ref{Buffer Names}.) All the primitives that modify the
1204 text in the buffer automatically add elements to the front of the undo
1205 list, which is in the variable @code{buffer-undo-list}.
1206
1207 @defvar buffer-undo-list
1208 This buffer-local variable's value is the undo list of the current
1209 buffer. A value of @code{t} disables the recording of undo information.
1210 @end defvar
1211
1212 Here are the kinds of elements an undo list can have:
1213
1214 @table @code
1215 @item @var{position}
1216 This kind of element records a previous value of point; undoing this
1217 element moves point to @var{position}. Ordinary cursor motion does not
1218 make any sort of undo record, but deletion operations use these entries
1219 to record where point was before the command.
1220
1221 @item (@var{beg} . @var{end})
1222 This kind of element indicates how to delete text that was inserted.
1223 Upon insertion, the text occupied the range @var{beg}--@var{end} in the
1224 buffer.
1225
1226 @item (@var{text} . @var{position})
1227 This kind of element indicates how to reinsert text that was deleted.
1228 The deleted text itself is the string @var{text}. The place to
1229 reinsert it is @code{(abs @var{position})}. If @var{position} is
1230 positive, point was at the beginning of the deleted text, otherwise it
1231 was at the end.
1232
1233 @item (t @var{high} . @var{low})
1234 This kind of element indicates that an unmodified buffer became
1235 modified. The elements @var{high} and @var{low} are two integers, each
1236 recording 16 bits of the visited file's modification time as of when it
1237 was previously visited or saved. @code{primitive-undo} uses those
1238 values to determine whether to mark the buffer as unmodified once again;
1239 it does so only if the file's modification time matches those numbers.
1240
1241 @item (nil @var{property} @var{value} @var{beg} . @var{end})
1242 This kind of element records a change in a text property.
1243 Here's how you might undo the change:
1244
1245 @example
1246 (put-text-property @var{beg} @var{end} @var{property} @var{value})
1247 @end example
1248
1249 @item (@var{marker} . @var{adjustment})
1250 This kind of element records the fact that the marker @var{marker} was
1251 relocated due to deletion of surrounding text, and that it moved
1252 @var{adjustment} character positions. Undoing this element moves
1253 @var{marker} @minus{} @var{adjustment} characters.
1254
1255 @item (apply @var{funname} . @var{args})
1256 This is an extensible undo item, which is undone by calling
1257 @var{funname} with arguments @var{args}.
1258
1259 @item (apply @var{delta} @var{beg} @var{end} @var{funname} . @var{args})
1260 This is an extensible undo item, which records a change limited to the
1261 range @var{beg} to @var{end}, which increased the size of the buffer
1262 by @var{delta}. It is undone by calling @var{funname} with arguments
1263 @var{args}.
1264
1265 This kind of element enables undo limited to a region to determine
1266 whether the element pertains to that region.
1267
1268 @item nil
1269 This element is a boundary. The elements between two boundaries are
1270 called a @dfn{change group}; normally, each change group corresponds to
1271 one keyboard command, and undo commands normally undo an entire group as
1272 a unit.
1273 @end table
1274
1275 @defun undo-boundary
1276 This function places a boundary element in the undo list. The undo
1277 command stops at such a boundary, and successive undo commands undo
1278 to earlier and earlier boundaries. This function returns @code{nil}.
1279
1280 The editor command loop automatically calls @code{undo-boundary} just
1281 before executing each key sequence, so that each undo normally undoes
1282 the effects of one command. As an exception, the command
1283 @code{self-insert-command}, which produces self-inserting input
1284 characters (@pxref{Commands for Insertion}), may remove the boundary
1285 inserted by the command loop: a boundary is accepted for the first
1286 such character, the next 19 consecutive self-inserting input
1287 characters do not have boundaries, and then the 20th does; and so on
1288 as long as the self-inserting characters continue. Hence, sequences
1289 of consecutive character insertions can be undone as a group.
1290
1291 All buffer modifications add a boundary whenever the previous undoable
1292 change was made in some other buffer. This is to ensure that
1293 each command makes a boundary in each buffer where it makes changes.
1294
1295 Calling this function explicitly is useful for splitting the effects of
1296 a command into more than one unit. For example, @code{query-replace}
1297 calls @code{undo-boundary} after each replacement, so that the user can
1298 undo individual replacements one by one.
1299 @end defun
1300
1301 @defvar undo-in-progress
1302 This variable is normally @code{nil}, but the undo commands bind it to
1303 @code{t}. This is so that various kinds of change hooks can tell when
1304 they're being called for the sake of undoing.
1305 @end defvar
1306
1307 @defun primitive-undo count list
1308 This is the basic function for undoing elements of an undo list.
1309 It undoes the first @var{count} elements of @var{list}, returning
1310 the rest of @var{list}.
1311
1312 @code{primitive-undo} adds elements to the buffer's undo list when it
1313 changes the buffer. Undo commands avoid confusion by saving the undo
1314 list value at the beginning of a sequence of undo operations. Then the
1315 undo operations use and update the saved value. The new elements added
1316 by undoing are not part of this saved value, so they don't interfere with
1317 continuing to undo.
1318
1319 This function does not bind @code{undo-in-progress}.
1320 @end defun
1321
1322 @node Maintaining Undo
1323 @section Maintaining Undo Lists
1324
1325 This section describes how to enable and disable undo information for
1326 a given buffer. It also explains how the undo list is truncated
1327 automatically so it doesn't get too big.
1328
1329 Recording of undo information in a newly created buffer is normally
1330 enabled to start with; but if the buffer name starts with a space, the
1331 undo recording is initially disabled. You can explicitly enable or
1332 disable undo recording with the following two functions, or by setting
1333 @code{buffer-undo-list} yourself.
1334
1335 @deffn Command buffer-enable-undo &optional buffer-or-name
1336 This command enables recording undo information for buffer
1337 @var{buffer-or-name}, so that subsequent changes can be undone. If no
1338 argument is supplied, then the current buffer is used. This function
1339 does nothing if undo recording is already enabled in the buffer. It
1340 returns @code{nil}.
1341
1342 In an interactive call, @var{buffer-or-name} is the current buffer.
1343 You cannot specify any other buffer.
1344 @end deffn
1345
1346 @deffn Command buffer-disable-undo &optional buffer-or-name
1347 @cindex disabling undo
1348 This function discards the undo list of @var{buffer-or-name}, and disables
1349 further recording of undo information. As a result, it is no longer
1350 possible to undo either previous changes or any subsequent changes. If
1351 the undo list of @var{buffer-or-name} is already disabled, this function
1352 has no effect.
1353
1354 This function returns @code{nil}.
1355 @end deffn
1356
1357 As editing continues, undo lists get longer and longer. To prevent
1358 them from using up all available memory space, garbage collection trims
1359 them back to size limits you can set. (For this purpose, the ``size''
1360 of an undo list measures the cons cells that make up the list, plus the
1361 strings of deleted text.) Three variables control the range of acceptable
1362 sizes: @code{undo-limit}, @code{undo-strong-limit} and
1363 @code{undo-outer-limit}. In these variables, size is counted as the
1364 number of bytes occupied, which includes both saved text and other
1365 data.
1366
1367 @defopt undo-limit
1368 This is the soft limit for the acceptable size of an undo list. The
1369 change group at which this size is exceeded is the last one kept.
1370 @end defopt
1371
1372 @defopt undo-strong-limit
1373 This is the upper limit for the acceptable size of an undo list. The
1374 change group at which this size is exceeded is discarded itself (along
1375 with all older change groups). There is one exception: the very latest
1376 change group is only discarded if it exceeds @code{undo-outer-limit}.
1377 @end defopt
1378
1379 @defopt undo-outer-limit
1380 If at garbage collection time the undo info for the current command
1381 exceeds this limit, Emacs discards the info and displays a warning.
1382 This is a last ditch limit to prevent memory overflow.
1383 @end defopt
1384
1385 @defopt undo-ask-before-discard
1386 If this variable is non-@code{nil}, when the undo info exceeds
1387 @code{undo-outer-limit}, Emacs asks in the echo area whether to
1388 discard the info. The default value is @code{nil}, which means to
1389 discard it automatically.
1390
1391 This option is mainly intended for debugging. Garbage collection is
1392 inhibited while the question is asked, which means that Emacs might
1393 leak memory if the user waits too long before answering the question.
1394 @end defopt
1395
1396 @node Filling
1397 @comment node-name, next, previous, up
1398 @section Filling
1399 @cindex filling text
1400
1401 @dfn{Filling} means adjusting the lengths of lines (by moving the line
1402 breaks) so that they are nearly (but no greater than) a specified
1403 maximum width. Additionally, lines can be @dfn{justified}, which means
1404 inserting spaces to make the left and/or right margins line up
1405 precisely. The width is controlled by the variable @code{fill-column}.
1406 For ease of reading, lines should be no longer than 70 or so columns.
1407
1408 You can use Auto Fill mode (@pxref{Auto Filling}) to fill text
1409 automatically as you insert it, but changes to existing text may leave
1410 it improperly filled. Then you must fill the text explicitly.
1411
1412 Most of the commands in this section return values that are not
1413 meaningful. All the functions that do filling take note of the current
1414 left margin, current right margin, and current justification style
1415 (@pxref{Margins}). If the current justification style is
1416 @code{none}, the filling functions don't actually do anything.
1417
1418 Several of the filling functions have an argument @var{justify}.
1419 If it is non-@code{nil}, that requests some kind of justification. It
1420 can be @code{left}, @code{right}, @code{full}, or @code{center}, to
1421 request a specific style of justification. If it is @code{t}, that
1422 means to use the current justification style for this part of the text
1423 (see @code{current-justification}, below). Any other value is treated
1424 as @code{full}.
1425
1426 When you call the filling functions interactively, using a prefix
1427 argument implies the value @code{full} for @var{justify}.
1428
1429 @deffn Command fill-paragraph &optional justify region
1430 This command fills the paragraph at or after point. If
1431 @var{justify} is non-@code{nil}, each line is justified as well.
1432 It uses the ordinary paragraph motion commands to find paragraph
1433 boundaries. @xref{Paragraphs,,, emacs, The GNU Emacs Manual}.
1434
1435 When @var{region} is non-@code{nil}, then if Transient Mark mode is
1436 enabled and the mark is active, this command calls @code{fill-region}
1437 to fill all the paragraphs in the region, instead of filling only the
1438 current paragraph. When this command is called interactively,
1439 @var{region} is @code{t}.
1440 @end deffn
1441
1442 @deffn Command fill-region start end &optional justify nosqueeze to-eop
1443 This command fills each of the paragraphs in the region from @var{start}
1444 to @var{end}. It justifies as well if @var{justify} is
1445 non-@code{nil}.
1446
1447 If @var{nosqueeze} is non-@code{nil}, that means to leave whitespace
1448 other than line breaks untouched. If @var{to-eop} is non-@code{nil},
1449 that means to keep filling to the end of the paragraph---or the next hard
1450 newline, if @code{use-hard-newlines} is enabled (see below).
1451
1452 The variable @code{paragraph-separate} controls how to distinguish
1453 paragraphs. @xref{Standard Regexps}.
1454 @end deffn
1455
1456 @deffn Command fill-individual-paragraphs start end &optional justify citation-regexp
1457 This command fills each paragraph in the region according to its
1458 individual fill prefix. Thus, if the lines of a paragraph were indented
1459 with spaces, the filled paragraph will remain indented in the same
1460 fashion.
1461
1462 The first two arguments, @var{start} and @var{end}, are the beginning
1463 and end of the region to be filled. The third and fourth arguments,
1464 @var{justify} and @var{citation-regexp}, are optional. If
1465 @var{justify} is non-@code{nil}, the paragraphs are justified as
1466 well as filled. If @var{citation-regexp} is non-@code{nil}, it means the
1467 function is operating on a mail message and therefore should not fill
1468 the header lines. If @var{citation-regexp} is a string, it is used as
1469 a regular expression; if it matches the beginning of a line, that line
1470 is treated as a citation marker.
1471
1472 Ordinarily, @code{fill-individual-paragraphs} regards each change in
1473 indentation as starting a new paragraph. If
1474 @code{fill-individual-varying-indent} is non-@code{nil}, then only
1475 separator lines separate paragraphs. That mode can handle indented
1476 paragraphs with additional indentation on the first line.
1477 @end deffn
1478
1479 @defopt fill-individual-varying-indent
1480 This variable alters the action of @code{fill-individual-paragraphs} as
1481 described above.
1482 @end defopt
1483
1484 @deffn Command fill-region-as-paragraph start end &optional justify nosqueeze squeeze-after
1485 This command considers a region of text as a single paragraph and fills
1486 it. If the region was made up of many paragraphs, the blank lines
1487 between paragraphs are removed. This function justifies as well as
1488 filling when @var{justify} is non-@code{nil}.
1489
1490 If @var{nosqueeze} is non-@code{nil}, that means to leave whitespace
1491 other than line breaks untouched. If @var{squeeze-after} is
1492 non-@code{nil}, it specifies a position in the region, and means don't
1493 canonicalize spaces before that position.
1494
1495 In Adaptive Fill mode, this command calls @code{fill-context-prefix} to
1496 choose a fill prefix by default. @xref{Adaptive Fill}.
1497 @end deffn
1498
1499 @deffn Command justify-current-line &optional how eop nosqueeze
1500 This command inserts spaces between the words of the current line so
1501 that the line ends exactly at @code{fill-column}. It returns
1502 @code{nil}.
1503
1504 The argument @var{how}, if non-@code{nil} specifies explicitly the style
1505 of justification. It can be @code{left}, @code{right}, @code{full},
1506 @code{center}, or @code{none}. If it is @code{t}, that means to do
1507 follow specified justification style (see @code{current-justification},
1508 below). @code{nil} means to do full justification.
1509
1510 If @var{eop} is non-@code{nil}, that means do only left-justification
1511 if @code{current-justification} specifies full justification. This is
1512 used for the last line of a paragraph; even if the paragraph as a
1513 whole is fully justified, the last line should not be.
1514
1515 If @var{nosqueeze} is non-@code{nil}, that means do not change interior
1516 whitespace.
1517 @end deffn
1518
1519 @defopt default-justification
1520 This variable's value specifies the style of justification to use for
1521 text that doesn't specify a style with a text property. The possible
1522 values are @code{left}, @code{right}, @code{full}, @code{center}, or
1523 @code{none}. The default value is @code{left}.
1524 @end defopt
1525
1526 @defun current-justification
1527 This function returns the proper justification style to use for filling
1528 the text around point.
1529
1530 This returns the value of the @code{justification} text property at
1531 point, or the variable @var{default-justification} if there is no such
1532 text property. However, it returns @code{nil} rather than @code{none}
1533 to mean ``don't justify''.
1534 @end defun
1535
1536 @defopt sentence-end-double-space
1537 @anchor{Definition of sentence-end-double-space}
1538 If this variable is non-@code{nil}, a period followed by just one space
1539 does not count as the end of a sentence, and the filling functions
1540 avoid breaking the line at such a place.
1541 @end defopt
1542
1543 @defopt sentence-end-without-period
1544 If this variable is non-@code{nil}, a sentence can end without a
1545 period. This is used for languages like Thai, where sentences end
1546 with a double space but without a period.
1547 @end defopt
1548
1549 @defopt sentence-end-without-space
1550 If this variable is non-@code{nil}, it should be a string of
1551 characters that can end a sentence without following spaces.
1552 @end defopt
1553
1554 @defvar fill-paragraph-function
1555 This variable provides a way to override the filling of paragraphs.
1556 If its value is non-@code{nil}, @code{fill-paragraph} calls this
1557 function to do the work. If the function returns a non-@code{nil}
1558 value, @code{fill-paragraph} assumes the job is done, and immediately
1559 returns that value.
1560
1561 The usual use of this feature is to fill comments in programming
1562 language modes. If the function needs to fill a paragraph in the usual
1563 way, it can do so as follows:
1564
1565 @example
1566 (let ((fill-paragraph-function nil))
1567 (fill-paragraph arg))
1568 @end example
1569 @end defvar
1570
1571 @defvar fill-forward-paragraph-function
1572 This variable provides a way to override how the filling functions,
1573 such as @code{fill-region} and @code{fill-paragraph}, move forward to
1574 the next paragraph. Its value should be a function, which is called
1575 with a single argument @var{n}, the number of paragraphs to move, and
1576 should return the difference between @var{n} and the number of
1577 paragraphs actually moved. The default value of this variable is
1578 @code{forward-paragraph}. @xref{Paragraphs,,, emacs, The GNU Emacs
1579 Manual}.
1580 @end defvar
1581
1582 @defvar use-hard-newlines
1583 If this variable is non-@code{nil}, the filling functions do not delete
1584 newlines that have the @code{hard} text property. These ``hard
1585 newlines'' act as paragraph separators.
1586 @end defvar
1587
1588 @node Margins
1589 @section Margins for Filling
1590
1591 @defopt fill-prefix
1592 This buffer-local variable, if non-@code{nil}, specifies a string of
1593 text that appears at the beginning of normal text lines and should be
1594 disregarded when filling them. Any line that fails to start with the
1595 fill prefix is considered the start of a paragraph; so is any line
1596 that starts with the fill prefix followed by additional whitespace.
1597 Lines that start with the fill prefix but no additional whitespace are
1598 ordinary text lines that can be filled together. The resulting filled
1599 lines also start with the fill prefix.
1600
1601 The fill prefix follows the left margin whitespace, if any.
1602 @end defopt
1603
1604 @defopt fill-column
1605 This buffer-local variable specifies the maximum width of filled lines.
1606 Its value should be an integer, which is a number of columns. All the
1607 filling, justification, and centering commands are affected by this
1608 variable, including Auto Fill mode (@pxref{Auto Filling}).
1609
1610 As a practical matter, if you are writing text for other people to
1611 read, you should set @code{fill-column} to no more than 70. Otherwise
1612 the line will be too long for people to read comfortably, and this can
1613 make the text seem clumsy.
1614
1615 The default value for @code{fill-column} is 70.
1616 @end defopt
1617
1618 @deffn Command set-left-margin from to margin
1619 This sets the @code{left-margin} property on the text from @var{from} to
1620 @var{to} to the value @var{margin}. If Auto Fill mode is enabled, this
1621 command also refills the region to fit the new margin.
1622 @end deffn
1623
1624 @deffn Command set-right-margin from to margin
1625 This sets the @code{right-margin} property on the text from @var{from}
1626 to @var{to} to the value @var{margin}. If Auto Fill mode is enabled,
1627 this command also refills the region to fit the new margin.
1628 @end deffn
1629
1630 @defun current-left-margin
1631 This function returns the proper left margin value to use for filling
1632 the text around point. The value is the sum of the @code{left-margin}
1633 property of the character at the start of the current line (or zero if
1634 none), and the value of the variable @code{left-margin}.
1635 @end defun
1636
1637 @defun current-fill-column
1638 This function returns the proper fill column value to use for filling
1639 the text around point. The value is the value of the @code{fill-column}
1640 variable, minus the value of the @code{right-margin} property of the
1641 character after point.
1642 @end defun
1643
1644 @deffn Command move-to-left-margin &optional n force
1645 This function moves point to the left margin of the current line. The
1646 column moved to is determined by calling the function
1647 @code{current-left-margin}. If the argument @var{n} is non-@code{nil},
1648 @code{move-to-left-margin} moves forward @var{n}@minus{}1 lines first.
1649
1650 If @var{force} is non-@code{nil}, that says to fix the line's
1651 indentation if that doesn't match the left margin value.
1652 @end deffn
1653
1654 @defun delete-to-left-margin &optional from to
1655 This function removes left margin indentation from the text between
1656 @var{from} and @var{to}. The amount of indentation to delete is
1657 determined by calling @code{current-left-margin}. In no case does this
1658 function delete non-whitespace. If @var{from} and @var{to} are omitted,
1659 they default to the whole buffer.
1660 @end defun
1661
1662 @defun indent-to-left-margin
1663 This function adjusts the indentation at the beginning of the current
1664 line to the value specified by the variable @code{left-margin}. (That
1665 may involve either inserting or deleting whitespace.) This function
1666 is value of @code{indent-line-function} in Paragraph-Indent Text mode.
1667 @end defun
1668
1669 @defopt left-margin
1670 This variable specifies the base left margin column. In Fundamental
1671 mode, @kbd{C-j} indents to this column. This variable automatically
1672 becomes buffer-local when set in any fashion.
1673 @end defopt
1674
1675 @defopt fill-nobreak-predicate
1676 This variable gives major modes a way to specify not to break a line
1677 at certain places. Its value should be a list of functions. Whenever
1678 filling considers breaking the line at a certain place in the buffer,
1679 it calls each of these functions with no arguments and with point
1680 located at that place. If any of the functions returns
1681 non-@code{nil}, then the line won't be broken there.
1682 @end defopt
1683
1684 @node Adaptive Fill
1685 @section Adaptive Fill Mode
1686 @c @cindex Adaptive Fill mode "adaptive-fill-mode" is adjacent.
1687
1688 When @dfn{Adaptive Fill Mode} is enabled, Emacs determines the fill
1689 prefix automatically from the text in each paragraph being filled
1690 rather than using a predetermined value. During filling, this fill
1691 prefix gets inserted at the start of the second and subsequent lines
1692 of the paragraph as described in @ref{Filling}, and in @ref{Auto
1693 Filling}.
1694
1695 @defopt adaptive-fill-mode
1696 Adaptive Fill mode is enabled when this variable is non-@code{nil}.
1697 It is @code{t} by default.
1698 @end defopt
1699
1700 @defun fill-context-prefix from to
1701 This function implements the heart of Adaptive Fill mode; it chooses a
1702 fill prefix based on the text between @var{from} and @var{to},
1703 typically the start and end of a paragraph. It does this by looking
1704 at the first two lines of the paragraph, based on the variables
1705 described below.
1706 @c The optional argument first-line-regexp is not documented
1707 @c because it exists for internal purposes and might be eliminated
1708 @c in the future.
1709
1710 Usually, this function returns the fill prefix, a string. However,
1711 before doing this, the function makes a final check (not specially
1712 mentioned in the following) that a line starting with this prefix
1713 wouldn't look like the start of a paragraph. Should this happen, the
1714 function signals the anomaly by returning @code{nil} instead.
1715
1716 In detail, @code{fill-context-prefix} does this:
1717
1718 @enumerate
1719 @item
1720 It takes a candidate for the fill prefix from the first line---it
1721 tries first the function in @code{adaptive-fill-function} (if any),
1722 then the regular expression @code{adaptive-fill-regexp} (see below).
1723 The first non-@code{nil} result of these, or the empty string if
1724 they're both @code{nil}, becomes the first line's candidate.
1725 @item
1726 If the paragraph has as yet only one line, the function tests the
1727 validity of the prefix candidate just found. The function then
1728 returns the candidate if it's valid, or a string of spaces otherwise.
1729 (see the description of @code{adaptive-fill-first-line-regexp} below).
1730 @item
1731 When the paragraph already has two lines, the function next looks for
1732 a prefix candidate on the second line, in just the same way it did for
1733 the first line. If it doesn't find one, it returns @code{nil}.
1734 @item
1735 The function now compares the two candidate prefixes heuristically: if
1736 the non-whitespace characters in the line 2 candidate occur in the
1737 same order in the line 1 candidate, the function returns the line 2
1738 candidate. Otherwise, it returns the largest initial substring which
1739 is common to both candidates (which might be the empty string).
1740 @end enumerate
1741 @end defun
1742
1743 @defopt adaptive-fill-regexp
1744 Adaptive Fill mode matches this regular expression against the text
1745 starting after the left margin whitespace (if any) on a line; the
1746 characters it matches are that line's candidate for the fill prefix.
1747
1748 The default value matches whitespace with certain punctuation
1749 characters intermingled.
1750 @end defopt
1751
1752 @defopt adaptive-fill-first-line-regexp
1753 Used only in one-line paragraphs, this regular expression acts as an
1754 additional check of the validity of the one available candidate fill
1755 prefix: the candidate must match this regular expression, or match
1756 @code{comment-start-skip}. If it doesn't, @code{fill-context-prefix}
1757 replaces the candidate with a string of spaces ``of the same width''
1758 as it.
1759
1760 The default value of this variable is @w{@code{"\\`[ \t]*\\'"}}, which
1761 matches only a string of whitespace. The effect of this default is to
1762 force the fill prefixes found in one-line paragraphs always to be pure
1763 whitespace.
1764 @end defopt
1765
1766 @defopt adaptive-fill-function
1767 You can specify more complex ways of choosing a fill prefix
1768 automatically by setting this variable to a function. The function is
1769 called with point after the left margin (if any) of a line, and it
1770 must preserve point. It should return either ``that line's'' fill
1771 prefix or @code{nil}, meaning it has failed to determine a prefix.
1772 @end defopt
1773
1774 @node Auto Filling
1775 @comment node-name, next, previous, up
1776 @section Auto Filling
1777 @cindex filling, automatic
1778 @cindex Auto Fill mode
1779
1780 Auto Fill mode is a minor mode that fills lines automatically as text
1781 is inserted. This section describes the hook used by Auto Fill mode.
1782 For a description of functions that you can call explicitly to fill and
1783 justify existing text, see @ref{Filling}.
1784
1785 Auto Fill mode also enables the functions that change the margins and
1786 justification style to refill portions of the text. @xref{Margins}.
1787
1788 @defvar auto-fill-function
1789 The value of this buffer-local variable should be a function (of no
1790 arguments) to be called after self-inserting a character from the table
1791 @code{auto-fill-chars}. It may be @code{nil}, in which case nothing
1792 special is done in that case.
1793
1794 The value of @code{auto-fill-function} is @code{do-auto-fill} when
1795 Auto-Fill mode is enabled. That is a function whose sole purpose is to
1796 implement the usual strategy for breaking a line.
1797
1798 @quotation
1799 In older Emacs versions, this variable was named @code{auto-fill-hook},
1800 but since it is not called with the standard convention for hooks, it
1801 was renamed to @code{auto-fill-function} in version 19.
1802 @end quotation
1803 @end defvar
1804
1805 @defvar normal-auto-fill-function
1806 This variable specifies the function to use for
1807 @code{auto-fill-function}, if and when Auto Fill is turned on. Major
1808 modes can set buffer-local values for this variable to alter how Auto
1809 Fill works.
1810 @end defvar
1811
1812 @defvar auto-fill-chars
1813 A char table of characters which invoke @code{auto-fill-function} when
1814 self-inserted---space and newline in most language environments. They
1815 have an entry @code{t} in the table.
1816 @end defvar
1817
1818 @node Sorting
1819 @section Sorting Text
1820 @cindex sorting text
1821
1822 The sorting functions described in this section all rearrange text in
1823 a buffer. This is in contrast to the function @code{sort}, which
1824 rearranges the order of the elements of a list (@pxref{Rearrangement}).
1825 The values returned by these functions are not meaningful.
1826
1827 @defun sort-subr reverse nextrecfun endrecfun &optional startkeyfun endkeyfun predicate
1828 This function is the general text-sorting routine that subdivides a
1829 buffer into records and then sorts them. Most of the commands in this
1830 section use this function.
1831
1832 To understand how @code{sort-subr} works, consider the whole accessible
1833 portion of the buffer as being divided into disjoint pieces called
1834 @dfn{sort records}. The records may or may not be contiguous, but they
1835 must not overlap. A portion of each sort record (perhaps all of it) is
1836 designated as the sort key. Sorting rearranges the records in order by
1837 their sort keys.
1838
1839 Usually, the records are rearranged in order of ascending sort key.
1840 If the first argument to the @code{sort-subr} function, @var{reverse},
1841 is non-@code{nil}, the sort records are rearranged in order of
1842 descending sort key.
1843
1844 The next four arguments to @code{sort-subr} are functions that are
1845 called to move point across a sort record. They are called many times
1846 from within @code{sort-subr}.
1847
1848 @enumerate
1849 @item
1850 @var{nextrecfun} is called with point at the end of a record. This
1851 function moves point to the start of the next record. The first record
1852 is assumed to start at the position of point when @code{sort-subr} is
1853 called. Therefore, you should usually move point to the beginning of
1854 the buffer before calling @code{sort-subr}.
1855
1856 This function can indicate there are no more sort records by leaving
1857 point at the end of the buffer.
1858
1859 @item
1860 @var{endrecfun} is called with point within a record. It moves point to
1861 the end of the record.
1862
1863 @item
1864 @var{startkeyfun} is called to move point from the start of a record to
1865 the start of the sort key. This argument is optional; if it is omitted,
1866 the whole record is the sort key. If supplied, the function should
1867 either return a non-@code{nil} value to be used as the sort key, or
1868 return @code{nil} to indicate that the sort key is in the buffer
1869 starting at point. In the latter case, @var{endkeyfun} is called to
1870 find the end of the sort key.
1871
1872 @item
1873 @var{endkeyfun} is called to move point from the start of the sort key
1874 to the end of the sort key. This argument is optional. If
1875 @var{startkeyfun} returns @code{nil} and this argument is omitted (or
1876 @code{nil}), then the sort key extends to the end of the record. There
1877 is no need for @var{endkeyfun} if @var{startkeyfun} returns a
1878 non-@code{nil} value.
1879 @end enumerate
1880
1881 The argument @var{predicate} is the function to use to compare keys.
1882 If keys are numbers, it defaults to @code{<}; otherwise it defaults to
1883 @code{string<}.
1884
1885 As an example of @code{sort-subr}, here is the complete function
1886 definition for @code{sort-lines}:
1887
1888 @example
1889 @group
1890 ;; @r{Note that the first two lines of doc string}
1891 ;; @r{are effectively one line when viewed by a user.}
1892 (defun sort-lines (reverse beg end)
1893 "Sort lines in region alphabetically;\
1894 argument means descending order.
1895 Called from a program, there are three arguments:
1896 @end group
1897 @group
1898 REVERSE (non-nil means reverse order),\
1899 BEG and END (region to sort).
1900 The variable `sort-fold-case' determines\
1901 whether alphabetic case affects
1902 the sort order."
1903 @end group
1904 @group
1905 (interactive "P\nr")
1906 (save-excursion
1907 (save-restriction
1908 (narrow-to-region beg end)
1909 (goto-char (point-min))
1910 (let ((inhibit-field-text-motion t))
1911 (sort-subr reverse 'forward-line 'end-of-line)))))
1912 @end group
1913 @end example
1914
1915 Here @code{forward-line} moves point to the start of the next record,
1916 and @code{end-of-line} moves point to the end of record. We do not pass
1917 the arguments @var{startkeyfun} and @var{endkeyfun}, because the entire
1918 record is used as the sort key.
1919
1920 The @code{sort-paragraphs} function is very much the same, except that
1921 its @code{sort-subr} call looks like this:
1922
1923 @example
1924 @group
1925 (sort-subr reverse
1926 (function
1927 (lambda ()
1928 (while (and (not (eobp))
1929 (looking-at paragraph-separate))
1930 (forward-line 1))))
1931 'forward-paragraph)
1932 @end group
1933 @end example
1934
1935 Markers pointing into any sort records are left with no useful
1936 position after @code{sort-subr} returns.
1937 @end defun
1938
1939 @defopt sort-fold-case
1940 If this variable is non-@code{nil}, @code{sort-subr} and the other
1941 buffer sorting functions ignore case when comparing strings.
1942 @end defopt
1943
1944 @deffn Command sort-regexp-fields reverse record-regexp key-regexp start end
1945 This command sorts the region between @var{start} and @var{end}
1946 alphabetically as specified by @var{record-regexp} and @var{key-regexp}.
1947 If @var{reverse} is a negative integer, then sorting is in reverse
1948 order.
1949
1950 Alphabetical sorting means that two sort keys are compared by
1951 comparing the first characters of each, the second characters of each,
1952 and so on. If a mismatch is found, it means that the sort keys are
1953 unequal; the sort key whose character is less at the point of first
1954 mismatch is the lesser sort key. The individual characters are compared
1955 according to their numerical character codes in the Emacs character set.
1956
1957 The value of the @var{record-regexp} argument specifies how to divide
1958 the buffer into sort records. At the end of each record, a search is
1959 done for this regular expression, and the text that matches it is taken
1960 as the next record. For example, the regular expression @samp{^.+$},
1961 which matches lines with at least one character besides a newline, would
1962 make each such line into a sort record. @xref{Regular Expressions}, for
1963 a description of the syntax and meaning of regular expressions.
1964
1965 The value of the @var{key-regexp} argument specifies what part of each
1966 record is the sort key. The @var{key-regexp} could match the whole
1967 record, or only a part. In the latter case, the rest of the record has
1968 no effect on the sorted order of records, but it is carried along when
1969 the record moves to its new position.
1970
1971 The @var{key-regexp} argument can refer to the text matched by a
1972 subexpression of @var{record-regexp}, or it can be a regular expression
1973 on its own.
1974
1975 If @var{key-regexp} is:
1976
1977 @table @asis
1978 @item @samp{\@var{digit}}
1979 then the text matched by the @var{digit}th @samp{\(...\)} parenthesis
1980 grouping in @var{record-regexp} is the sort key.
1981
1982 @item @samp{\&}
1983 then the whole record is the sort key.
1984
1985 @item a regular expression
1986 then @code{sort-regexp-fields} searches for a match for the regular
1987 expression within the record. If such a match is found, it is the sort
1988 key. If there is no match for @var{key-regexp} within a record then
1989 that record is ignored, which means its position in the buffer is not
1990 changed. (The other records may move around it.)
1991 @end table
1992
1993 For example, if you plan to sort all the lines in the region by the
1994 first word on each line starting with the letter @samp{f}, you should
1995 set @var{record-regexp} to @samp{^.*$} and set @var{key-regexp} to
1996 @samp{\<f\w*\>}. The resulting expression looks like this:
1997
1998 @example
1999 @group
2000 (sort-regexp-fields nil "^.*$" "\\<f\\w*\\>"
2001 (region-beginning)
2002 (region-end))
2003 @end group
2004 @end example
2005
2006 If you call @code{sort-regexp-fields} interactively, it prompts for
2007 @var{record-regexp} and @var{key-regexp} in the minibuffer.
2008 @end deffn
2009
2010 @deffn Command sort-lines reverse start end
2011 This command alphabetically sorts lines in the region between
2012 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2013 is in reverse order.
2014 @end deffn
2015
2016 @deffn Command sort-paragraphs reverse start end
2017 This command alphabetically sorts paragraphs in the region between
2018 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2019 is in reverse order.
2020 @end deffn
2021
2022 @deffn Command sort-pages reverse start end
2023 This command alphabetically sorts pages in the region between
2024 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2025 is in reverse order.
2026 @end deffn
2027
2028 @deffn Command sort-fields field start end
2029 This command sorts lines in the region between @var{start} and
2030 @var{end}, comparing them alphabetically by the @var{field}th field
2031 of each line. Fields are separated by whitespace and numbered starting
2032 from 1. If @var{field} is negative, sorting is by the
2033 @w{@minus{}@var{field}th} field from the end of the line. This command
2034 is useful for sorting tables.
2035 @end deffn
2036
2037 @deffn Command sort-numeric-fields field start end
2038 This command sorts lines in the region between @var{start} and
2039 @var{end}, comparing them numerically by the @var{field}th field of
2040 each line. Fields are separated by whitespace and numbered starting
2041 from 1. The specified field must contain a number in each line of the
2042 region. Numbers starting with 0 are treated as octal, and numbers
2043 starting with @samp{0x} are treated as hexadecimal.
2044
2045 If @var{field} is negative, sorting is by the
2046 @w{@minus{}@var{field}th} field from the end of the line. This
2047 command is useful for sorting tables.
2048 @end deffn
2049
2050 @defopt sort-numeric-base
2051 This variable specifies the default radix for
2052 @code{sort-numeric-fields} to parse numbers.
2053 @end defopt
2054
2055 @deffn Command sort-columns reverse &optional beg end
2056 This command sorts the lines in the region between @var{beg} and
2057 @var{end}, comparing them alphabetically by a certain range of
2058 columns. The column positions of @var{beg} and @var{end} bound the
2059 range of columns to sort on.
2060
2061 If @var{reverse} is non-@code{nil}, the sort is in reverse order.
2062
2063 One unusual thing about this command is that the entire line
2064 containing position @var{beg}, and the entire line containing position
2065 @var{end}, are included in the region sorted.
2066
2067 Note that @code{sort-columns} rejects text that contains tabs, because
2068 tabs could be split across the specified columns. Use @kbd{M-x
2069 untabify} to convert tabs to spaces before sorting.
2070
2071 When possible, this command actually works by calling the @code{sort}
2072 utility program.
2073 @end deffn
2074
2075 @node Columns
2076 @comment node-name, next, previous, up
2077 @section Counting Columns
2078 @cindex columns
2079 @cindex counting columns
2080 @cindex horizontal position
2081
2082 The column functions convert between a character position (counting
2083 characters from the beginning of the buffer) and a column position
2084 (counting screen characters from the beginning of a line).
2085
2086 These functions count each character according to the number of
2087 columns it occupies on the screen. This means control characters count
2088 as occupying 2 or 4 columns, depending upon the value of
2089 @code{ctl-arrow}, and tabs count as occupying a number of columns that
2090 depends on the value of @code{tab-width} and on the column where the tab
2091 begins. @xref{Usual Display}.
2092
2093 Column number computations ignore the width of the window and the
2094 amount of horizontal scrolling. Consequently, a column value can be
2095 arbitrarily high. The first (or leftmost) column is numbered 0. They
2096 also ignore overlays and text properties, aside from invisibility.
2097
2098 @defun current-column
2099 This function returns the horizontal position of point, measured in
2100 columns, counting from 0 at the left margin. The column position is the
2101 sum of the widths of all the displayed representations of the characters
2102 between the start of the current line and point.
2103
2104 For an example of using @code{current-column}, see the description of
2105 @code{count-lines} in @ref{Text Lines}.
2106 @end defun
2107
2108 @deffn Command move-to-column column &optional force
2109 This function moves point to @var{column} in the current line. The
2110 calculation of @var{column} takes into account the widths of the
2111 displayed representations of the characters between the start of the
2112 line and point.
2113
2114 When called interactively, @var{column} is the value of prefix numeric
2115 argument. If @var{column} is not an integer, an error is signaled.
2116
2117 If column @var{column} is beyond the end of the line, point moves to
2118 the end of the line. If @var{column} is negative, point moves to the
2119 beginning of the line.
2120
2121 If it is impossible to move to column @var{column} because that is in
2122 the middle of a multicolumn character such as a tab, point moves to the
2123 end of that character. However, if @var{force} is non-@code{nil}, and
2124 @var{column} is in the middle of a tab, then @code{move-to-column}
2125 converts the tab into spaces so that it can move precisely to column
2126 @var{column}. Other multicolumn characters can cause anomalies despite
2127 @var{force}, since there is no way to split them.
2128
2129 The argument @var{force} also has an effect if the line isn't long
2130 enough to reach column @var{column}; if it is @code{t}, that means to
2131 add whitespace at the end of the line to reach that column.
2132
2133 The return value is the column number actually moved to.
2134 @end deffn
2135
2136 @node Indentation
2137 @section Indentation
2138 @cindex indentation
2139
2140 The indentation functions are used to examine, move to, and change
2141 whitespace that is at the beginning of a line. Some of the functions
2142 can also change whitespace elsewhere on a line. Columns and indentation
2143 count from zero at the left margin.
2144
2145 @menu
2146 * Primitive Indent:: Functions used to count and insert indentation.
2147 * Mode-Specific Indent:: Customize indentation for different modes.
2148 * Region Indent:: Indent all the lines in a region.
2149 * Relative Indent:: Indent the current line based on previous lines.
2150 * Indent Tabs:: Adjustable, typewriter-like tab stops.
2151 * Motion by Indent:: Move to first non-blank character.
2152 @end menu
2153
2154 @node Primitive Indent
2155 @subsection Indentation Primitives
2156
2157 This section describes the primitive functions used to count and
2158 insert indentation. The functions in the following sections use these
2159 primitives. @xref{Width}, for related functions.
2160
2161 @defun current-indentation
2162 @comment !!Type Primitive Function
2163 @comment !!SourceFile indent.c
2164 This function returns the indentation of the current line, which is
2165 the horizontal position of the first nonblank character. If the
2166 contents are entirely blank, then this is the horizontal position of the
2167 end of the line.
2168 @end defun
2169
2170 @deffn Command indent-to column &optional minimum
2171 @comment !!Type Primitive Function
2172 @comment !!SourceFile indent.c
2173 This function indents from point with tabs and spaces until @var{column}
2174 is reached. If @var{minimum} is specified and non-@code{nil}, then at
2175 least that many spaces are inserted even if this requires going beyond
2176 @var{column}. Otherwise the function does nothing if point is already
2177 beyond @var{column}. The value is the column at which the inserted
2178 indentation ends.
2179
2180 The inserted whitespace characters inherit text properties from the
2181 surrounding text (usually, from the preceding text only). @xref{Sticky
2182 Properties}.
2183 @end deffn
2184
2185 @defopt indent-tabs-mode
2186 @comment !!SourceFile indent.c
2187 If this variable is non-@code{nil}, indentation functions can insert
2188 tabs as well as spaces. Otherwise, they insert only spaces. Setting
2189 this variable automatically makes it buffer-local in the current buffer.
2190 @end defopt
2191
2192 @node Mode-Specific Indent
2193 @subsection Indentation Controlled by Major Mode
2194
2195 An important function of each major mode is to customize the @key{TAB}
2196 key to indent properly for the language being edited. This section
2197 describes the mechanism of the @key{TAB} key and how to control it.
2198 The functions in this section return unpredictable values.
2199
2200 @defvar indent-line-function
2201 This variable's value is the function to be used by @key{TAB} (and
2202 various commands) to indent the current line. The command
2203 @code{indent-according-to-mode} does little more than call this function.
2204
2205 In Lisp mode, the value is the symbol @code{lisp-indent-line}; in C
2206 mode, @code{c-indent-line}; in Fortran mode, @code{fortran-indent-line}.
2207 The default value is @code{indent-relative}. @xref{Auto-Indentation}.
2208 @end defvar
2209
2210 @deffn Command indent-according-to-mode
2211 This command calls the function in @code{indent-line-function} to
2212 indent the current line in a way appropriate for the current major mode.
2213 @end deffn
2214
2215 @deffn Command indent-for-tab-command &optional rigid
2216 This command calls the function in @code{indent-line-function} to
2217 indent the current line; however, if that function is
2218 @code{indent-to-left-margin}, @code{insert-tab} is called instead.
2219 (That is a trivial command that inserts a tab character.) If
2220 @var{rigid} is non-@code{nil}, this function also rigidly indents the
2221 entire balanced expression that starts at the beginning of the current
2222 line, to reflect change in indentation of the current line.
2223 @end deffn
2224
2225 @deffn Command newline-and-indent
2226 This function inserts a newline, then indents the new line (the one
2227 following the newline just inserted) according to the major mode.
2228
2229 It does indentation by calling the current @code{indent-line-function}.
2230 In programming language modes, this is the same thing @key{TAB} does,
2231 but in some text modes, where @key{TAB} inserts a tab,
2232 @code{newline-and-indent} indents to the column specified by
2233 @code{left-margin}.
2234 @end deffn
2235
2236 @deffn Command reindent-then-newline-and-indent
2237 @comment !!SourceFile simple.el
2238 This command reindents the current line, inserts a newline at point,
2239 and then indents the new line (the one following the newline just
2240 inserted).
2241
2242 This command does indentation on both lines according to the current
2243 major mode, by calling the current value of @code{indent-line-function}.
2244 In programming language modes, this is the same thing @key{TAB} does,
2245 but in some text modes, where @key{TAB} inserts a tab,
2246 @code{reindent-then-newline-and-indent} indents to the column specified
2247 by @code{left-margin}.
2248 @end deffn
2249
2250 @node Region Indent
2251 @subsection Indenting an Entire Region
2252
2253 This section describes commands that indent all the lines in the
2254 region. They return unpredictable values.
2255
2256 @deffn Command indent-region start end &optional to-column
2257 This command indents each nonblank line starting between @var{start}
2258 (inclusive) and @var{end} (exclusive). If @var{to-column} is
2259 @code{nil}, @code{indent-region} indents each nonblank line by calling
2260 the current mode's indentation function, the value of
2261 @code{indent-line-function}.
2262
2263 If @var{to-column} is non-@code{nil}, it should be an integer
2264 specifying the number of columns of indentation; then this function
2265 gives each line exactly that much indentation, by either adding or
2266 deleting whitespace.
2267
2268 If there is a fill prefix, @code{indent-region} indents each line
2269 by making it start with the fill prefix.
2270 @end deffn
2271
2272 @defvar indent-region-function
2273 The value of this variable is a function that can be used by
2274 @code{indent-region} as a short cut. It should take two arguments, the
2275 start and end of the region. You should design the function so
2276 that it will produce the same results as indenting the lines of the
2277 region one by one, but presumably faster.
2278
2279 If the value is @code{nil}, there is no short cut, and
2280 @code{indent-region} actually works line by line.
2281
2282 A short-cut function is useful in modes such as C mode and Lisp mode,
2283 where the @code{indent-line-function} must scan from the beginning of
2284 the function definition: applying it to each line would be quadratic in
2285 time. The short cut can update the scan information as it moves through
2286 the lines indenting them; this takes linear time. In a mode where
2287 indenting a line individually is fast, there is no need for a short cut.
2288
2289 @code{indent-region} with a non-@code{nil} argument @var{to-column} has
2290 a different meaning and does not use this variable.
2291 @end defvar
2292
2293 @deffn Command indent-rigidly start end count
2294 This command indents all lines starting between @var{start}
2295 (inclusive) and @var{end} (exclusive) sideways by @var{count} columns.
2296 This ``preserves the shape'' of the affected region, moving it as a
2297 rigid unit. Consequently, this command is useful not only for indenting
2298 regions of unindented text, but also for indenting regions of formatted
2299 code.
2300
2301 For example, if @var{count} is 3, this command adds 3 columns of
2302 indentation to each of the lines beginning in the region specified.
2303
2304 In Mail mode, @kbd{C-c C-y} (@code{mail-yank-original}) uses
2305 @code{indent-rigidly} to indent the text copied from the message being
2306 replied to.
2307 @end deffn
2308
2309 @deffn Command indent-code-rigidly start end columns &optional nochange-regexp
2310 This is like @code{indent-rigidly}, except that it doesn't alter lines
2311 that start within strings or comments.
2312
2313 In addition, it doesn't alter a line if @var{nochange-regexp} matches at
2314 the beginning of the line (if @var{nochange-regexp} is non-@code{nil}).
2315 @end deffn
2316
2317 @node Relative Indent
2318 @subsection Indentation Relative to Previous Lines
2319
2320 This section describes two commands that indent the current line
2321 based on the contents of previous lines.
2322
2323 @deffn Command indent-relative &optional unindented-ok
2324 This command inserts whitespace at point, extending to the same
2325 column as the next @dfn{indent point} of the previous nonblank line. An
2326 indent point is a non-whitespace character following whitespace. The
2327 next indent point is the first one at a column greater than the current
2328 column of point. For example, if point is underneath and to the left of
2329 the first non-blank character of a line of text, it moves to that column
2330 by inserting whitespace.
2331
2332 If the previous nonblank line has no next indent point (i.e., none at a
2333 great enough column position), @code{indent-relative} either does
2334 nothing (if @var{unindented-ok} is non-@code{nil}) or calls
2335 @code{tab-to-tab-stop}. Thus, if point is underneath and to the right
2336 of the last column of a short line of text, this command ordinarily
2337 moves point to the next tab stop by inserting whitespace.
2338
2339 The return value of @code{indent-relative} is unpredictable.
2340
2341 In the following example, point is at the beginning of the second
2342 line:
2343
2344 @example
2345 @group
2346 This line is indented twelve spaces.
2347 @point{}The quick brown fox jumped.
2348 @end group
2349 @end example
2350
2351 @noindent
2352 Evaluation of the expression @code{(indent-relative nil)} produces the
2353 following:
2354
2355 @example
2356 @group
2357 This line is indented twelve spaces.
2358 @point{}The quick brown fox jumped.
2359 @end group
2360 @end example
2361
2362 In this next example, point is between the @samp{m} and @samp{p} of
2363 @samp{jumped}:
2364
2365 @example
2366 @group
2367 This line is indented twelve spaces.
2368 The quick brown fox jum@point{}ped.
2369 @end group
2370 @end example
2371
2372 @noindent
2373 Evaluation of the expression @code{(indent-relative nil)} produces the
2374 following:
2375
2376 @example
2377 @group
2378 This line is indented twelve spaces.
2379 The quick brown fox jum @point{}ped.
2380 @end group
2381 @end example
2382 @end deffn
2383
2384 @deffn Command indent-relative-maybe
2385 @comment !!SourceFile indent.el
2386 This command indents the current line like the previous nonblank line,
2387 by calling @code{indent-relative} with @code{t} as the
2388 @var{unindented-ok} argument. The return value is unpredictable.
2389
2390 If the previous nonblank line has no indent points beyond the current
2391 column, this command does nothing.
2392 @end deffn
2393
2394 @node Indent Tabs
2395 @comment node-name, next, previous, up
2396 @subsection Adjustable ``Tab Stops''
2397 @cindex tabs stops for indentation
2398
2399 This section explains the mechanism for user-specified ``tab stops''
2400 and the mechanisms that use and set them. The name ``tab stops'' is
2401 used because the feature is similar to that of the tab stops on a
2402 typewriter. The feature works by inserting an appropriate number of
2403 spaces and tab characters to reach the next tab stop column; it does not
2404 affect the display of tab characters in the buffer (@pxref{Usual
2405 Display}). Note that the @key{TAB} character as input uses this tab
2406 stop feature only in a few major modes, such as Text mode.
2407 @xref{Tab Stops,,, emacs, The GNU Emacs Manual}.
2408
2409 @deffn Command tab-to-tab-stop
2410 This command inserts spaces or tabs before point, up to the next tab
2411 stop column defined by @code{tab-stop-list}. It searches the list for
2412 an element greater than the current column number, and uses that element
2413 as the column to indent to. It does nothing if no such element is
2414 found.
2415 @end deffn
2416
2417 @defopt tab-stop-list
2418 This variable is the list of tab stop columns used by
2419 @code{tab-to-tab-stops}. The elements should be integers in increasing
2420 order. The tab stop columns need not be evenly spaced.
2421
2422 Use @kbd{M-x edit-tab-stops} to edit the location of tab stops
2423 interactively.
2424 @end defopt
2425
2426 @node Motion by Indent
2427 @subsection Indentation-Based Motion Commands
2428
2429 These commands, primarily for interactive use, act based on the
2430 indentation in the text.
2431
2432 @deffn Command back-to-indentation
2433 @comment !!SourceFile simple.el
2434 This command moves point to the first non-whitespace character in the
2435 current line (which is the line in which point is located). It returns
2436 @code{nil}.
2437 @end deffn
2438
2439 @deffn Command backward-to-indentation &optional arg
2440 @comment !!SourceFile simple.el
2441 This command moves point backward @var{arg} lines and then to the
2442 first nonblank character on that line. It returns @code{nil}.
2443 If @var{arg} is omitted or @code{nil}, it defaults to 1.
2444 @end deffn
2445
2446 @deffn Command forward-to-indentation &optional arg
2447 @comment !!SourceFile simple.el
2448 This command moves point forward @var{arg} lines and then to the first
2449 nonblank character on that line. It returns @code{nil}.
2450 If @var{arg} is omitted or @code{nil}, it defaults to 1.
2451 @end deffn
2452
2453 @node Case Changes
2454 @comment node-name, next, previous, up
2455 @section Case Changes
2456 @cindex case conversion in buffers
2457
2458 The case change commands described here work on text in the current
2459 buffer. @xref{Case Conversion}, for case conversion functions that work
2460 on strings and characters. @xref{Case Tables}, for how to customize
2461 which characters are upper or lower case and how to convert them.
2462
2463 @deffn Command capitalize-region start end
2464 This function capitalizes all words in the region defined by
2465 @var{start} and @var{end}. To capitalize means to convert each word's
2466 first character to upper case and convert the rest of each word to lower
2467 case. The function returns @code{nil}.
2468
2469 If one end of the region is in the middle of a word, the part of the
2470 word within the region is treated as an entire word.
2471
2472 When @code{capitalize-region} is called interactively, @var{start} and
2473 @var{end} are point and the mark, with the smallest first.
2474
2475 @example
2476 @group
2477 ---------- Buffer: foo ----------
2478 This is the contents of the 5th foo.
2479 ---------- Buffer: foo ----------
2480 @end group
2481
2482 @group
2483 (capitalize-region 1 44)
2484 @result{} nil
2485
2486 ---------- Buffer: foo ----------
2487 This Is The Contents Of The 5th Foo.
2488 ---------- Buffer: foo ----------
2489 @end group
2490 @end example
2491 @end deffn
2492
2493 @deffn Command downcase-region start end
2494 This function converts all of the letters in the region defined by
2495 @var{start} and @var{end} to lower case. The function returns
2496 @code{nil}.
2497
2498 When @code{downcase-region} is called interactively, @var{start} and
2499 @var{end} are point and the mark, with the smallest first.
2500 @end deffn
2501
2502 @deffn Command upcase-region start end
2503 This function converts all of the letters in the region defined by
2504 @var{start} and @var{end} to upper case. The function returns
2505 @code{nil}.
2506
2507 When @code{upcase-region} is called interactively, @var{start} and
2508 @var{end} are point and the mark, with the smallest first.
2509 @end deffn
2510
2511 @deffn Command capitalize-word count
2512 This function capitalizes @var{count} words after point, moving point
2513 over as it does. To capitalize means to convert each word's first
2514 character to upper case and convert the rest of each word to lower case.
2515 If @var{count} is negative, the function capitalizes the
2516 @minus{}@var{count} previous words but does not move point. The value
2517 is @code{nil}.
2518
2519 If point is in the middle of a word, the part of the word before point
2520 is ignored when moving forward. The rest is treated as an entire word.
2521
2522 When @code{capitalize-word} is called interactively, @var{count} is
2523 set to the numeric prefix argument.
2524 @end deffn
2525
2526 @deffn Command downcase-word count
2527 This function converts the @var{count} words after point to all lower
2528 case, moving point over as it does. If @var{count} is negative, it
2529 converts the @minus{}@var{count} previous words but does not move point.
2530 The value is @code{nil}.
2531
2532 When @code{downcase-word} is called interactively, @var{count} is set
2533 to the numeric prefix argument.
2534 @end deffn
2535
2536 @deffn Command upcase-word count
2537 This function converts the @var{count} words after point to all upper
2538 case, moving point over as it does. If @var{count} is negative, it
2539 converts the @minus{}@var{count} previous words but does not move point.
2540 The value is @code{nil}.
2541
2542 When @code{upcase-word} is called interactively, @var{count} is set to
2543 the numeric prefix argument.
2544 @end deffn
2545
2546 @node Text Properties
2547 @section Text Properties
2548 @cindex text properties
2549 @cindex attributes of text
2550 @cindex properties of text
2551
2552 Each character position in a buffer or a string can have a @dfn{text
2553 property list}, much like the property list of a symbol (@pxref{Property
2554 Lists}). The properties belong to a particular character at a
2555 particular place, such as, the letter @samp{T} at the beginning of this
2556 sentence or the first @samp{o} in @samp{foo}---if the same character
2557 occurs in two different places, the two occurrences in general have
2558 different properties.
2559
2560 Each property has a name and a value. Both of these can be any Lisp
2561 object, but the name is normally a symbol. Typically each property
2562 name symbol is used for a particular purpose; for instance, the text
2563 property @code{face} specifies the faces for displaying the character
2564 (@pxref{Special Properties}). The usual way to access the property
2565 list is to specify a name and ask what value corresponds to it.
2566
2567 If a character has a @code{category} property, we call it the
2568 @dfn{property category} of the character. It should be a symbol. The
2569 properties of the symbol serve as defaults for the properties of the
2570 character.
2571
2572 Copying text between strings and buffers preserves the properties
2573 along with the characters; this includes such diverse functions as
2574 @code{substring}, @code{insert}, and @code{buffer-substring}.
2575
2576 @menu
2577 * Examining Properties:: Looking at the properties of one character.
2578 * Changing Properties:: Setting the properties of a range of text.
2579 * Property Search:: Searching for where a property changes value.
2580 * Special Properties:: Particular properties with special meanings.
2581 * Format Properties:: Properties for representing formatting of text.
2582 * Sticky Properties:: How inserted text gets properties from
2583 neighboring text.
2584 * Lazy Properties:: Computing text properties in a lazy fashion
2585 only when text is examined.
2586 * Clickable Text:: Using text properties to make regions of text
2587 do something when you click on them.
2588 * Fields:: The @code{field} property defines
2589 fields within the buffer.
2590 * Not Intervals:: Why text properties do not use
2591 Lisp-visible text intervals.
2592 @end menu
2593
2594 @node Examining Properties
2595 @subsection Examining Text Properties
2596
2597 The simplest way to examine text properties is to ask for the value of
2598 a particular property of a particular character. For that, use
2599 @code{get-text-property}. Use @code{text-properties-at} to get the
2600 entire property list of a character. @xref{Property Search}, for
2601 functions to examine the properties of a number of characters at once.
2602
2603 These functions handle both strings and buffers. Keep in mind that
2604 positions in a string start from 0, whereas positions in a buffer start
2605 from 1.
2606
2607 @defun get-text-property pos prop &optional object
2608 This function returns the value of the @var{prop} property of the
2609 character after position @var{pos} in @var{object} (a buffer or
2610 string). The argument @var{object} is optional and defaults to the
2611 current buffer.
2612
2613 If there is no @var{prop} property strictly speaking, but the character
2614 has a property category that is a symbol, then @code{get-text-property} returns
2615 the @var{prop} property of that symbol.
2616 @end defun
2617
2618 @defun get-char-property position prop &optional object
2619 This function is like @code{get-text-property}, except that it checks
2620 overlays first and then text properties. @xref{Overlays}.
2621
2622 The argument @var{object} may be a string, a buffer, or a window. If
2623 it is a window, then the buffer displayed in that window is used for
2624 text properties and overlays, but only the overlays active for that
2625 window are considered. If @var{object} is a buffer, then overlays in
2626 that buffer are considered first, in order of decreasing priority,
2627 followed by the text properties. If @var{object} is a string, only
2628 text properties are considered, since strings never have overlays.
2629 @end defun
2630
2631 @defun get-char-property-and-overlay position prop &optional object
2632 This is like @code{get-char-property}, but gives extra information
2633 about the overlay that the property value comes from.
2634
2635 Its value is a cons cell whose @sc{car} is the property value, the
2636 same value @code{get-char-property} would return with the same
2637 arguments. Its @sc{cdr} is the overlay in which the property was
2638 found, or @code{nil}, if it was found as a text property or not found
2639 at all.
2640
2641 If @var{position} is at the end of @var{object}, both the @sc{car} and
2642 the @sc{cdr} of the value are @code{nil}.
2643 @end defun
2644
2645 @defvar char-property-alias-alist
2646 This variable holds an alist which maps property names to a list of
2647 alternative property names. If a character does not specify a direct
2648 value for a property, the alternative property names are consulted in
2649 order; the first non-@code{nil} value is used. This variable takes
2650 precedence over @code{default-text-properties}, and @code{category}
2651 properties take precedence over this variable.
2652 @end defvar
2653
2654 @defun text-properties-at position &optional object
2655 This function returns the entire property list of the character at
2656 @var{position} in the string or buffer @var{object}. If @var{object} is
2657 @code{nil}, it defaults to the current buffer.
2658 @end defun
2659
2660 @defvar default-text-properties
2661 This variable holds a property list giving default values for text
2662 properties. Whenever a character does not specify a value for a
2663 property, neither directly, through a category symbol, or through
2664 @code{char-property-alias-alist}, the value stored in this list is
2665 used instead. Here is an example:
2666
2667 @example
2668 (setq default-text-properties '(foo 69)
2669 char-property-alias-alist nil)
2670 ;; @r{Make sure character 1 has no properties of its own.}
2671 (set-text-properties 1 2 nil)
2672 ;; @r{What we get, when we ask, is the default value.}
2673 (get-text-property 1 'foo)
2674 @result{} 69
2675 @end example
2676 @end defvar
2677
2678 @node Changing Properties
2679 @subsection Changing Text Properties
2680
2681 The primitives for changing properties apply to a specified range of
2682 text in a buffer or string. The function @code{set-text-properties}
2683 (see end of section) sets the entire property list of the text in that
2684 range; more often, it is useful to add, change, or delete just certain
2685 properties specified by name.
2686
2687 Since text properties are considered part of the contents of the
2688 buffer (or string), and can affect how a buffer looks on the screen,
2689 any change in buffer text properties marks the buffer as modified.
2690 Buffer text property changes are undoable also (@pxref{Undo}).
2691 Positions in a string start from 0, whereas positions in a buffer
2692 start from 1.
2693
2694 @defun put-text-property start end prop value &optional object
2695 This function sets the @var{prop} property to @var{value} for the text
2696 between @var{start} and @var{end} in the string or buffer @var{object}.
2697 If @var{object} is @code{nil}, it defaults to the current buffer.
2698 @end defun
2699
2700 @defun add-text-properties start end props &optional object
2701 This function adds or overrides text properties for the text between
2702 @var{start} and @var{end} in the string or buffer @var{object}. If
2703 @var{object} is @code{nil}, it defaults to the current buffer.
2704
2705 The argument @var{props} specifies which properties to add. It should
2706 have the form of a property list (@pxref{Property Lists}): a list whose
2707 elements include the property names followed alternately by the
2708 corresponding values.
2709
2710 The return value is @code{t} if the function actually changed some
2711 property's value; @code{nil} otherwise (if @var{props} is @code{nil} or
2712 its values agree with those in the text).
2713
2714 For example, here is how to set the @code{comment} and @code{face}
2715 properties of a range of text:
2716
2717 @example
2718 (add-text-properties @var{start} @var{end}
2719 '(comment t face highlight))
2720 @end example
2721 @end defun
2722
2723 @defun remove-text-properties start end props &optional object
2724 This function deletes specified text properties from the text between
2725 @var{start} and @var{end} in the string or buffer @var{object}. If
2726 @var{object} is @code{nil}, it defaults to the current buffer.
2727
2728 The argument @var{props} specifies which properties to delete. It
2729 should have the form of a property list (@pxref{Property Lists}): a list
2730 whose elements are property names alternating with corresponding values.
2731 But only the names matter---the values that accompany them are ignored.
2732 For example, here's how to remove the @code{face} property.
2733
2734 @example
2735 (remove-text-properties @var{start} @var{end} '(face nil))
2736 @end example
2737
2738 The return value is @code{t} if the function actually changed some
2739 property's value; @code{nil} otherwise (if @var{props} is @code{nil} or
2740 if no character in the specified text had any of those properties).
2741
2742 To remove all text properties from certain text, use
2743 @code{set-text-properties} and specify @code{nil} for the new property
2744 list.
2745 @end defun
2746
2747 @defun remove-list-of-text-properties start end list-of-properties &optional object
2748 Like @code{remove-text-properties} except that
2749 @var{list-of-properties} is a list of property names only, not an
2750 alternating list of property names and values.
2751 @end defun
2752
2753 @defun set-text-properties start end props &optional object
2754 This function completely replaces the text property list for the text
2755 between @var{start} and @var{end} in the string or buffer @var{object}.
2756 If @var{object} is @code{nil}, it defaults to the current buffer.
2757
2758 The argument @var{props} is the new property list. It should be a list
2759 whose elements are property names alternating with corresponding values.
2760
2761 After @code{set-text-properties} returns, all the characters in the
2762 specified range have identical properties.
2763
2764 If @var{props} is @code{nil}, the effect is to get rid of all properties
2765 from the specified range of text. Here's an example:
2766
2767 @example
2768 (set-text-properties @var{start} @var{end} nil)
2769 @end example
2770
2771 Do not rely on the return value of this function.
2772 @end defun
2773
2774 The easiest way to make a string with text properties
2775 is with @code{propertize}:
2776
2777 @defun propertize string &rest properties
2778 This function returns a copy of @var{string} which has the text
2779 properties @var{properties}. These properties apply to all the
2780 characters in the string that is returned. Here is an example that
2781 constructs a string with a @code{face} property and a @code{mouse-face}
2782 property:
2783
2784 @smallexample
2785 (propertize "foo" 'face 'italic
2786 'mouse-face 'bold-italic)
2787 @result{} #("foo" 0 3 (mouse-face bold-italic face italic))
2788 @end smallexample
2789
2790 To put different properties on various parts of a string, you can
2791 construct each part with @code{propertize} and then combine them with
2792 @code{concat}:
2793
2794 @smallexample
2795 (concat
2796 (propertize "foo" 'face 'italic
2797 'mouse-face 'bold-italic)
2798 " and "
2799 (propertize "bar" 'face 'italic
2800 'mouse-face 'bold-italic))
2801 @result{} #("foo and bar"
2802 0 3 (face italic mouse-face bold-italic)
2803 3 8 nil
2804 8 11 (face italic mouse-face bold-italic))
2805 @end smallexample
2806 @end defun
2807
2808 @xref{Buffer Contents}, for the function
2809 @code{buffer-substring-no-properties}, which copies text from the
2810 buffer but does not copy its properties.
2811
2812 @node Property Search
2813 @subsection Text Property Search Functions
2814
2815 In typical use of text properties, most of the time several or many
2816 consecutive characters have the same value for a property. Rather than
2817 writing your programs to examine characters one by one, it is much
2818 faster to process chunks of text that have the same property value.
2819
2820 Here are functions you can use to do this. They use @code{eq} for
2821 comparing property values. In all cases, @var{object} defaults to the
2822 current buffer.
2823
2824 For high performance, it's very important to use the @var{limit}
2825 argument to these functions, especially the ones that search for a
2826 single property---otherwise, they may spend a long time scanning to the
2827 end of the buffer, if the property you are interested in does not change.
2828
2829 These functions do not move point; instead, they return a position (or
2830 @code{nil}). Remember that a position is always between two characters;
2831 the position returned by these functions is between two characters with
2832 different properties.
2833
2834 @defun next-property-change pos &optional object limit
2835 The function scans the text forward from position @var{pos} in the
2836 string or buffer @var{object} till it finds a change in some text
2837 property, then returns the position of the change. In other words, it
2838 returns the position of the first character beyond @var{pos} whose
2839 properties are not identical to those of the character just after
2840 @var{pos}.
2841
2842 If @var{limit} is non-@code{nil}, then the scan ends at position
2843 @var{limit}. If there is no property change before that point,
2844 @code{next-property-change} returns @var{limit}.
2845
2846 The value is @code{nil} if the properties remain unchanged all the way
2847 to the end of @var{object} and @var{limit} is @code{nil}. If the value
2848 is non-@code{nil}, it is a position greater than or equal to @var{pos}.
2849 The value equals @var{pos} only when @var{limit} equals @var{pos}.
2850
2851 Here is an example of how to scan the buffer by chunks of text within
2852 which all properties are constant:
2853
2854 @smallexample
2855 (while (not (eobp))
2856 (let ((plist (text-properties-at (point)))
2857 (next-change
2858 (or (next-property-change (point) (current-buffer))
2859 (point-max))))
2860 @r{Process text from point to @var{next-change}@dots{}}
2861 (goto-char next-change)))
2862 @end smallexample
2863 @end defun
2864
2865 @defun previous-property-change pos &optional object limit
2866 This is like @code{next-property-change}, but scans back from @var{pos}
2867 instead of forward. If the value is non-@code{nil}, it is a position
2868 less than or equal to @var{pos}; it equals @var{pos} only if @var{limit}
2869 equals @var{pos}.
2870 @end defun
2871
2872 @defun next-single-property-change pos prop &optional object limit
2873 The function scans text for a change in the @var{prop} property, then
2874 returns the position of the change. The scan goes forward from
2875 position @var{pos} in the string or buffer @var{object}. In other
2876 words, this function returns the position of the first character
2877 beyond @var{pos} whose @var{prop} property differs from that of the
2878 character just after @var{pos}.
2879
2880 If @var{limit} is non-@code{nil}, then the scan ends at position
2881 @var{limit}. If there is no property change before that point,
2882 @code{next-single-property-change} returns @var{limit}.
2883
2884 The value is @code{nil} if the property remains unchanged all the way to
2885 the end of @var{object} and @var{limit} is @code{nil}. If the value is
2886 non-@code{nil}, it is a position greater than or equal to @var{pos}; it
2887 equals @var{pos} only if @var{limit} equals @var{pos}.
2888 @end defun
2889
2890 @defun previous-single-property-change pos prop &optional object limit
2891 This is like @code{next-single-property-change}, but scans back from
2892 @var{pos} instead of forward. If the value is non-@code{nil}, it is a
2893 position less than or equal to @var{pos}; it equals @var{pos} only if
2894 @var{limit} equals @var{pos}.
2895 @end defun
2896
2897 @defun next-char-property-change pos &optional limit
2898 This is like @code{next-property-change} except that it considers
2899 overlay properties as well as text properties, and if no change is
2900 found before the end of the buffer, it returns the maximum buffer
2901 position rather than @code{nil} (in this sense, it resembles the
2902 corresponding overlay function @code{next-overlay-change}, rather than
2903 @code{next-property-change}). There is no @var{object} operand
2904 because this function operates only on the current buffer. It returns
2905 the next address at which either kind of property changes.
2906 @end defun
2907
2908 @defun previous-char-property-change pos &optional limit
2909 This is like @code{next-char-property-change}, but scans back from
2910 @var{pos} instead of forward, and returns the minimum buffer
2911 position if no change is found.
2912 @end defun
2913
2914 @defun next-single-char-property-change pos prop &optional object limit
2915 This is like @code{next-single-property-change} except that it
2916 considers overlay properties as well as text properties, and if no
2917 change is found before the end of the @var{object}, it returns the
2918 maximum valid position in @var{object} rather than @code{nil}. Unlike
2919 @code{next-char-property-change}, this function @emph{does} have an
2920 @var{object} operand; if @var{object} is not a buffer, only
2921 text-properties are considered.
2922 @end defun
2923
2924 @defun previous-single-char-property-change pos prop &optional object limit
2925 This is like @code{next-single-char-property-change}, but scans back
2926 from @var{pos} instead of forward, and returns the minimum valid
2927 position in @var{object} if no change is found.
2928 @end defun
2929
2930 @defun text-property-any start end prop value &optional object
2931 This function returns non-@code{nil} if at least one character between
2932 @var{start} and @var{end} has a property @var{prop} whose value is
2933 @var{value}. More precisely, it returns the position of the first such
2934 character. Otherwise, it returns @code{nil}.
2935
2936 The optional fifth argument, @var{object}, specifies the string or
2937 buffer to scan. Positions are relative to @var{object}. The default
2938 for @var{object} is the current buffer.
2939 @end defun
2940
2941 @defun text-property-not-all start end prop value &optional object
2942 This function returns non-@code{nil} if at least one character between
2943 @var{start} and @var{end} does not have a property @var{prop} with value
2944 @var{value}. More precisely, it returns the position of the first such
2945 character. Otherwise, it returns @code{nil}.
2946
2947 The optional fifth argument, @var{object}, specifies the string or
2948 buffer to scan. Positions are relative to @var{object}. The default
2949 for @var{object} is the current buffer.
2950 @end defun
2951
2952 @node Special Properties
2953 @subsection Properties with Special Meanings
2954
2955 Here is a table of text property names that have special built-in
2956 meanings. The following sections list a few additional special property
2957 names that control filling and property inheritance. All other names
2958 have no standard meaning, and you can use them as you like.
2959
2960 Note: the properties @code{composition}, @code{display},
2961 @code{invisible} and @code{intangible} can also cause point to move to
2962 an acceptable place, after each Emacs command. @xref{Adjusting
2963 Point}.
2964
2965 @table @code
2966 @cindex property category of text character
2967 @kindex category @r{(text property)}
2968 @item category
2969 If a character has a @code{category} property, we call it the
2970 @dfn{property category} of the character. It should be a symbol. The
2971 properties of this symbol serve as defaults for the properties of the
2972 character.
2973
2974 @item face
2975 @cindex face codes of text
2976 @kindex face @r{(text property)}
2977 You can use the property @code{face} to control the font and color of
2978 text. @xref{Faces}, for more information.
2979
2980 @code{face} can be the following:
2981
2982 @itemize @bullet
2983 @item
2984 A face name (a symbol or string).
2985
2986 @item
2987 A property list of face attributes. This has the
2988 form (@var{keyword} @var{value} @dots{}), where each @var{keyword} is a
2989 face attribute name and @var{value} is a meaningful value for that
2990 attribute. With this feature, you do not need to create a face each
2991 time you want to specify a particular attribute for certain text.
2992 @xref{Face Attributes}.
2993 @end itemize
2994
2995 @code{face} can also be a list, where each element uses one of the
2996 forms listed above.
2997
2998 Font Lock mode (@pxref{Font Lock Mode}) works in most buffers by
2999 dynamically updating the @code{face} property of characters based on
3000 the context.
3001
3002 @item font-lock-face
3003 @kindex font-lock-face @r{(text property)}
3004 This property specifies a value for the @code{face} property that Font
3005 Lock mode should apply to the underlying text. It is one of the
3006 fontification methods used by Font Lock mode, and is useful for
3007 special modes that implement their own highlighting.
3008 @xref{Precalculated Fontification}. When Font Lock mode is disabled,
3009 @code{font-lock-face} has no effect.
3010
3011 @item mouse-face
3012 @kindex mouse-face @r{(text property)}
3013 This property is used instead of @code{face} when the mouse is on or
3014 near the character. For this purpose, ``near'' means that all text
3015 between the character and where the mouse is have the same
3016 @code{mouse-face} property value.
3017
3018 @item fontified
3019 @kindex fontified @r{(text property)}
3020 This property says whether the text is ready for display. If
3021 @code{nil}, Emacs's redisplay routine calls the functions in
3022 @code{fontification-functions} (@pxref{Auto Faces}) to prepare this
3023 part of the buffer before it is displayed. It is used internally by
3024 the ``just in time'' font locking code.
3025
3026 @item display
3027 This property activates various features that change the
3028 way text is displayed. For example, it can make text appear taller
3029 or shorter, higher or lower, wider or narrow, or replaced with an image.
3030 @xref{Display Property}.
3031
3032 @item help-echo
3033 @kindex help-echo @r{(text property)}
3034 @cindex tooltip
3035 @anchor{Text help-echo}
3036 If text has a string as its @code{help-echo} property, then when you
3037 move the mouse onto that text, Emacs displays that string in the echo
3038 area, or in the tooltip window (@pxref{Tooltips,,, emacs, The GNU Emacs
3039 Manual}).
3040
3041 If the value of the @code{help-echo} property is a function, that
3042 function is called with three arguments, @var{window}, @var{object} and
3043 @var{pos} and should return a help string or @code{nil} for
3044 none. The first argument, @var{window} is the window in which
3045 the help was found. The second, @var{object}, is the buffer, overlay or
3046 string which had the @code{help-echo} property. The @var{pos}
3047 argument is as follows:
3048
3049 @itemize @bullet{}
3050 @item
3051 If @var{object} is a buffer, @var{pos} is the position in the buffer.
3052 @item
3053 If @var{object} is an overlay, that overlay has a @code{help-echo}
3054 property, and @var{pos} is the position in the overlay's buffer.
3055 @item
3056 If @var{object} is a string (an overlay string or a string displayed
3057 with the @code{display} property), @var{pos} is the position in that
3058 string.
3059 @end itemize
3060
3061 If the value of the @code{help-echo} property is neither a function nor
3062 a string, it is evaluated to obtain a help string.
3063
3064 You can alter the way help text is displayed by setting the variable
3065 @code{show-help-function} (@pxref{Help display}).
3066
3067 This feature is used in the mode line and for other active text.
3068
3069 @item keymap
3070 @cindex keymap of character
3071 @kindex keymap @r{(text property)}
3072 The @code{keymap} property specifies an additional keymap for
3073 commands. When this keymap applies, it is used for key lookup before
3074 the minor mode keymaps and before the buffer's local map.
3075 @xref{Active Keymaps}. If the property value is a symbol, the
3076 symbol's function definition is used as the keymap.
3077
3078 The property's value for the character before point applies if it is
3079 non-@code{nil} and rear-sticky, and the property's value for the
3080 character after point applies if it is non-@code{nil} and
3081 front-sticky. (For mouse clicks, the position of the click is used
3082 instead of the position of point.)
3083
3084 @item local-map
3085 @kindex local-map @r{(text property)}
3086 This property works like @code{keymap} except that it specifies a
3087 keymap to use @emph{instead of} the buffer's local map. For most
3088 purposes (perhaps all purposes), it is better to use the @code{keymap}
3089 property.
3090
3091 @item syntax-table
3092 The @code{syntax-table} property overrides what the syntax table says
3093 about this particular character. @xref{Syntax Properties}.
3094
3095 @item read-only
3096 @cindex read-only character
3097 @kindex read-only @r{(text property)}
3098 If a character has the property @code{read-only}, then modifying that
3099 character is not allowed. Any command that would do so gets an error,
3100 @code{text-read-only}. If the property value is a string, that string
3101 is used as the error message.
3102
3103 Insertion next to a read-only character is an error if inserting
3104 ordinary text there would inherit the @code{read-only} property due to
3105 stickiness. Thus, you can control permission to insert next to
3106 read-only text by controlling the stickiness. @xref{Sticky Properties}.
3107
3108 Since changing properties counts as modifying the buffer, it is not
3109 possible to remove a @code{read-only} property unless you know the
3110 special trick: bind @code{inhibit-read-only} to a non-@code{nil} value
3111 and then remove the property. @xref{Read Only Buffers}.
3112
3113 @item invisible
3114 @kindex invisible @r{(text property)}
3115 A non-@code{nil} @code{invisible} property can make a character invisible
3116 on the screen. @xref{Invisible Text}, for details.
3117
3118 @item intangible
3119 @kindex intangible @r{(text property)}
3120 If a group of consecutive characters have equal and non-@code{nil}
3121 @code{intangible} properties, then you cannot place point between them.
3122 If you try to move point forward into the group, point actually moves to
3123 the end of the group. If you try to move point backward into the group,
3124 point actually moves to the start of the group.
3125
3126 If consecutive characters have unequal non-@code{nil}
3127 @code{intangible} properties, they belong to separate groups; each
3128 group is separately treated as described above.
3129
3130 When the variable @code{inhibit-point-motion-hooks} is non-@code{nil},
3131 the @code{intangible} property is ignored.
3132
3133 @item field
3134 @kindex field @r{(text property)}
3135 Consecutive characters with the same @code{field} property constitute a
3136 @dfn{field}. Some motion functions including @code{forward-word} and
3137 @code{beginning-of-line} stop moving at a field boundary.
3138 @xref{Fields}.
3139
3140 @item cursor
3141 @kindex cursor @r{(text property)}
3142 Normally, the cursor is displayed at the end of any overlay and text
3143 property strings present at the current buffer position. You can
3144 place the cursor on any desired character of these strings by giving
3145 that character a non-@code{nil} @code{cursor} text property. In
3146 addition, if the value of the @code{cursor} property of an overlay
3147 string is an integer number, it specifies the number of buffer's
3148 character positions associated with the overlay string; this way,
3149 Emacs will display the cursor on the character with that property
3150 regardless of whether the current buffer position is actually covered
3151 by the overlay. Specifically, if the value of the @code{cursor}
3152 property of a character is the number @var{n}, the cursor will be
3153 displayed on this character for any buffer position in the range
3154 @code{[@var{ovpos}..@var{ovpos}+@var{n}]}, where @var{ovpos} is the
3155 starting buffer position covered by the overlay (@pxref{Managing
3156 Overlays}).
3157
3158 @item pointer
3159 @kindex pointer @r{(text property)}
3160 This specifies a specific pointer shape when the mouse pointer is over
3161 this text or image. @xref{Pointer Shape}, for possible pointer
3162 shapes.
3163
3164 @item line-spacing
3165 @kindex line-spacing @r{(text property)}
3166 A newline can have a @code{line-spacing} text or overlay property that
3167 controls the height of the display line ending with that newline. The
3168 property value overrides the default frame line spacing and the buffer
3169 local @code{line-spacing} variable. @xref{Line Height}.
3170
3171 @item line-height
3172 @kindex line-height @r{(text property)}
3173 A newline can have a @code{line-height} text or overlay property that
3174 controls the total height of the display line ending in that newline.
3175 @xref{Line Height}.
3176
3177 @item wrap-prefix
3178 If text has a @code{wrap-prefix} property, the prefix it defines will
3179 be added at display-time to the beginning of every continuation line
3180 due to text wrapping (so if lines are truncated, the wrap-prefix is
3181 never used). It may be a string, an image, or a stretch-glyph such as
3182 used by the @code{display} text-property. @xref{Display Property}.
3183
3184 A wrap-prefix may also be specified for an entire buffer using the
3185 @code{wrap-prefix} buffer-local variable (however, a
3186 @code{wrap-prefix} text-property takes precedence over the value of
3187 the @code{wrap-prefix} variable). @xref{Truncation}.
3188
3189 @item line-prefix
3190 If text has a @code{line-prefix} property, the prefix it defines will
3191 be added at display-time to the beginning of every non-continuation
3192 line. It may be a string, an image, or a stretch-glyph such as used
3193 by the @code{display} text-property. @xref{Display Property}.
3194
3195 A line-prefix may also be specified for an entire buffer using the
3196 @code{line-prefix} buffer-local variable (however, a
3197 @code{line-prefix} text-property takes precedence over the value of
3198 the @code{line-prefix} variable). @xref{Truncation}.
3199
3200 @item modification-hooks
3201 @cindex change hooks for a character
3202 @cindex hooks for changing a character
3203 @kindex modification-hooks @r{(text property)}
3204 If a character has the property @code{modification-hooks}, then its
3205 value should be a list of functions; modifying that character calls
3206 all of those functions before the actual modification. Each function
3207 receives two arguments: the beginning and end of the part of the
3208 buffer being modified. Note that if a particular modification hook
3209 function appears on several characters being modified by a single
3210 primitive, you can't predict how many times the function will
3211 be called.
3212 Furthermore, insertion will not modify any existing character, so this
3213 hook will only be run when removing some characters, replacing them
3214 with others, or changing their text-properties.
3215
3216 If these functions modify the buffer, they should bind
3217 @code{inhibit-modification-hooks} to @code{t} around doing so, to
3218 avoid confusing the internal mechanism that calls these hooks.
3219
3220 Overlays also support the @code{modification-hooks} property, but the
3221 details are somewhat different (@pxref{Overlay Properties}).
3222
3223 @item insert-in-front-hooks
3224 @itemx insert-behind-hooks
3225 @kindex insert-in-front-hooks @r{(text property)}
3226 @kindex insert-behind-hooks @r{(text property)}
3227 The operation of inserting text in a buffer also calls the functions
3228 listed in the @code{insert-in-front-hooks} property of the following
3229 character and in the @code{insert-behind-hooks} property of the
3230 preceding character. These functions receive two arguments, the
3231 beginning and end of the inserted text. The functions are called
3232 @emph{after} the actual insertion takes place.
3233
3234 See also @ref{Change Hooks}, for other hooks that are called
3235 when you change text in a buffer.
3236
3237 @item point-entered
3238 @itemx point-left
3239 @cindex hooks for motion of point
3240 @kindex point-entered @r{(text property)}
3241 @kindex point-left @r{(text property)}
3242 The special properties @code{point-entered} and @code{point-left}
3243 record hook functions that report motion of point. Each time point
3244 moves, Emacs compares these two property values:
3245
3246 @itemize @bullet
3247 @item
3248 the @code{point-left} property of the character after the old location,
3249 and
3250 @item
3251 the @code{point-entered} property of the character after the new
3252 location.
3253 @end itemize
3254
3255 @noindent
3256 If these two values differ, each of them is called (if not @code{nil})
3257 with two arguments: the old value of point, and the new one.
3258
3259 The same comparison is made for the characters before the old and new
3260 locations. The result may be to execute two @code{point-left} functions
3261 (which may be the same function) and/or two @code{point-entered}
3262 functions (which may be the same function). In any case, all the
3263 @code{point-left} functions are called first, followed by all the
3264 @code{point-entered} functions.
3265
3266 It is possible to use @code{char-after} to examine characters at various
3267 buffer positions without moving point to those positions. Only an
3268 actual change in the value of point runs these hook functions.
3269
3270 The variable @code{inhibit-point-motion-hooks} can inhibit running the
3271 @code{point-left} and @code{point-entered} hooks, see @ref{Inhibit
3272 point motion hooks}.
3273
3274 @item composition
3275 @kindex composition @r{(text property)}
3276 This text property is used to display a sequence of characters as a
3277 single glyph composed from components. But the value of the property
3278 itself is completely internal to Emacs and should not be manipulated
3279 directly by, for instance, @code{put-text-property}.
3280
3281 @end table
3282
3283 @defvar inhibit-point-motion-hooks
3284 @anchor{Inhibit point motion hooks} When this variable is
3285 non-@code{nil}, @code{point-left} and @code{point-entered} hooks are
3286 not run, and the @code{intangible} property has no effect. Do not set
3287 this variable globally; bind it with @code{let}.
3288 @end defvar
3289
3290 @defvar show-help-function
3291 @anchor{Help display} If this variable is non-@code{nil}, it specifies a
3292 function called to display help strings. These may be @code{help-echo}
3293 properties, menu help strings (@pxref{Simple Menu Items},
3294 @pxref{Extended Menu Items}), or tool bar help strings (@pxref{Tool
3295 Bar}). The specified function is called with one argument, the help
3296 string to display. Tooltip mode (@pxref{Tooltips,,, emacs, The GNU Emacs
3297 Manual}) provides an example.
3298 @end defvar
3299
3300 @node Format Properties
3301 @subsection Formatted Text Properties
3302
3303 These text properties affect the behavior of the fill commands. They
3304 are used for representing formatted text. @xref{Filling}, and
3305 @ref{Margins}.
3306
3307 @table @code
3308 @item hard
3309 If a newline character has this property, it is a ``hard'' newline.
3310 The fill commands do not alter hard newlines and do not move words
3311 across them. However, this property takes effect only if the
3312 @code{use-hard-newlines} minor mode is enabled. @xref{Hard and Soft
3313 Newlines,, Hard and Soft Newlines, emacs, The GNU Emacs Manual}.
3314
3315 @item right-margin
3316 This property specifies an extra right margin for filling this part of the
3317 text.
3318
3319 @item left-margin
3320 This property specifies an extra left margin for filling this part of the
3321 text.
3322
3323 @item justification
3324 This property specifies the style of justification for filling this part
3325 of the text.
3326 @end table
3327
3328 @node Sticky Properties
3329 @subsection Stickiness of Text Properties
3330 @cindex sticky text properties
3331 @cindex inheritance of text properties
3332
3333 Self-inserting characters normally take on the same properties as the
3334 preceding character. This is called @dfn{inheritance} of properties.
3335
3336 In a Lisp program, you can do insertion with inheritance or without,
3337 depending on your choice of insertion primitive. The ordinary text
3338 insertion functions such as @code{insert} do not inherit any properties.
3339 They insert text with precisely the properties of the string being
3340 inserted, and no others. This is correct for programs that copy text
3341 from one context to another---for example, into or out of the kill ring.
3342 To insert with inheritance, use the special primitives described in this
3343 section. Self-inserting characters inherit properties because they work
3344 using these primitives.
3345
3346 When you do insertion with inheritance, @emph{which} properties are
3347 inherited, and from where, depends on which properties are @dfn{sticky}.
3348 Insertion after a character inherits those of its properties that are
3349 @dfn{rear-sticky}. Insertion before a character inherits those of its
3350 properties that are @dfn{front-sticky}. When both sides offer different
3351 sticky values for the same property, the previous character's value
3352 takes precedence.
3353
3354 By default, a text property is rear-sticky but not front-sticky; thus,
3355 the default is to inherit all the properties of the preceding character,
3356 and nothing from the following character.
3357
3358 You can control the stickiness of various text properties with two
3359 specific text properties, @code{front-sticky} and @code{rear-nonsticky},
3360 and with the variable @code{text-property-default-nonsticky}. You can
3361 use the variable to specify a different default for a given property.
3362 You can use those two text properties to make any specific properties
3363 sticky or nonsticky in any particular part of the text.
3364
3365 If a character's @code{front-sticky} property is @code{t}, then all
3366 its properties are front-sticky. If the @code{front-sticky} property is
3367 a list, then the sticky properties of the character are those whose
3368 names are in the list. For example, if a character has a
3369 @code{front-sticky} property whose value is @code{(face read-only)},
3370 then insertion before the character can inherit its @code{face} property
3371 and its @code{read-only} property, but no others.
3372
3373 The @code{rear-nonsticky} property works the opposite way. Most
3374 properties are rear-sticky by default, so the @code{rear-nonsticky}
3375 property says which properties are @emph{not} rear-sticky. If a
3376 character's @code{rear-nonsticky} property is @code{t}, then none of its
3377 properties are rear-sticky. If the @code{rear-nonsticky} property is a
3378 list, properties are rear-sticky @emph{unless} their names are in the
3379 list.
3380
3381 @defvar text-property-default-nonsticky
3382 This variable holds an alist which defines the default rear-stickiness
3383 of various text properties. Each element has the form
3384 @code{(@var{property} . @var{nonstickiness})}, and it defines the
3385 stickiness of a particular text property, @var{property}.
3386
3387 If @var{nonstickiness} is non-@code{nil}, this means that the property
3388 @var{property} is rear-nonsticky by default. Since all properties are
3389 front-nonsticky by default, this makes @var{property} nonsticky in both
3390 directions by default.
3391
3392 The text properties @code{front-sticky} and @code{rear-nonsticky}, when
3393 used, take precedence over the default @var{nonstickiness} specified in
3394 @code{text-property-default-nonsticky}.
3395 @end defvar
3396
3397 Here are the functions that insert text with inheritance of properties:
3398
3399 @defun insert-and-inherit &rest strings
3400 Insert the strings @var{strings}, just like the function @code{insert},
3401 but inherit any sticky properties from the adjoining text.
3402 @end defun
3403
3404 @defun insert-before-markers-and-inherit &rest strings
3405 Insert the strings @var{strings}, just like the function
3406 @code{insert-before-markers}, but inherit any sticky properties from the
3407 adjoining text.
3408 @end defun
3409
3410 @xref{Insertion}, for the ordinary insertion functions which do not
3411 inherit.
3412
3413 @node Lazy Properties
3414 @subsection Lazy Computation of Text Properties
3415
3416 Instead of computing text properties for all the text in the buffer,
3417 you can arrange to compute the text properties for parts of the text
3418 when and if something depends on them.
3419
3420 The primitive that extracts text from the buffer along with its
3421 properties is @code{buffer-substring}. Before examining the properties,
3422 this function runs the abnormal hook @code{buffer-access-fontify-functions}.
3423
3424 @defvar buffer-access-fontify-functions
3425 This variable holds a list of functions for computing text properties.
3426 Before @code{buffer-substring} copies the text and text properties for a
3427 portion of the buffer, it calls all the functions in this list. Each of
3428 the functions receives two arguments that specify the range of the
3429 buffer being accessed. (The buffer itself is always the current
3430 buffer.)
3431 @end defvar
3432
3433 The function @code{buffer-substring-no-properties} does not call these
3434 functions, since it ignores text properties anyway.
3435
3436 In order to prevent the hook functions from being called more than
3437 once for the same part of the buffer, you can use the variable
3438 @code{buffer-access-fontified-property}.
3439
3440 @defvar buffer-access-fontified-property
3441 If this variable's value is non-@code{nil}, it is a symbol which is used
3442 as a text property name. A non-@code{nil} value for that text property
3443 means, ``the other text properties for this character have already been
3444 computed.''
3445
3446 If all the characters in the range specified for @code{buffer-substring}
3447 have a non-@code{nil} value for this property, @code{buffer-substring}
3448 does not call the @code{buffer-access-fontify-functions} functions. It
3449 assumes these characters already have the right text properties, and
3450 just copies the properties they already have.
3451
3452 The normal way to use this feature is that the
3453 @code{buffer-access-fontify-functions} functions add this property, as
3454 well as others, to the characters they operate on. That way, they avoid
3455 being called over and over for the same text.
3456 @end defvar
3457
3458 @node Clickable Text
3459 @subsection Defining Clickable Text
3460 @cindex clickable text
3461 @cindex follow links
3462 @cindex mouse-1
3463
3464 @dfn{Clickable text} is text that can be clicked, with either the
3465 mouse or via a keyboard command, to produce some result. Many major
3466 modes use clickable text to implement textual hyper-links, or
3467 @dfn{links} for short.
3468
3469 The easiest way to insert and manipulate links is to use the
3470 @code{button} package. @xref{Buttons}. In this section, we will
3471 explain how to manually set up clickable text in a buffer, using text
3472 properties. For simplicity, we will refer to the clickable text as a
3473 @dfn{link}.
3474
3475 Implementing a link involves three separate steps: (1) indicating
3476 clickability when the mouse moves over the link; (2) making @kbd{RET}
3477 or @kbd{Mouse-2} on that link do something; and (3) setting up a
3478 @code{follow-link} condition so that the link obeys
3479 @code{mouse-1-click-follows-link}.
3480
3481 To indicate clickability, add the @code{mouse-face} text property to
3482 the text of the link; then Emacs will highlight the link when the
3483 mouse moves over it. In addition, you should define a tooltip or echo
3484 area message, using the @code{help-echo} text property. @xref{Special
3485 Properties}. For instance, here is how Dired indicates that file
3486 names are clickable:
3487
3488 @smallexample
3489 (if (dired-move-to-filename)
3490 (add-text-properties
3491 (point)
3492 (save-excursion
3493 (dired-move-to-end-of-filename)
3494 (point))
3495 '(mouse-face highlight
3496 help-echo "mouse-2: visit this file in other window")))
3497 @end smallexample
3498
3499 To make the link clickable, bind @key{RET} and @kbd{Mouse-2} to
3500 commands that perform the desired action. Each command should check
3501 to see whether it was called on a link, and act accordingly. For
3502 instance, Dired's major mode keymap binds @kbd{Mouse-2} to the
3503 following command:
3504
3505 @smallexample
3506 (defun dired-mouse-find-file-other-window (event)
3507 "In Dired, visit the file or directory name you click on."
3508 (interactive "e")
3509 (let ((window (posn-window (event-end event)))
3510 (pos (posn-point (event-end event)))
3511 file)
3512 (if (not (windowp window))
3513 (error "No file chosen"))
3514 (with-current-buffer (window-buffer window)
3515 (goto-char pos)
3516 (setq file (dired-get-file-for-visit)))
3517 (if (file-directory-p file)
3518 (or (and (cdr dired-subdir-alist)
3519 (dired-goto-subdir file))
3520 (progn
3521 (select-window window)
3522 (dired-other-window file)))
3523 (select-window window)
3524 (find-file-other-window (file-name-sans-versions file t)))))
3525 @end smallexample
3526
3527 @noindent
3528 This command uses the functions @code{posn-window} and
3529 @code{posn-point} to determine where the click occurred, and
3530 @code{dired-get-file-for-visit} to determine which file to visit.
3531
3532 Instead of binding the mouse command in a major mode keymap, you can
3533 bind it within the link text, using the @code{keymap} text property
3534 (@pxref{Special Properties}). For instance:
3535
3536 @example
3537 (let ((map (make-sparse-keymap)))
3538 (define-key map [mouse-2] 'operate-this-button)
3539 (put-text-property link-start link-end 'keymap map))
3540 @end example
3541
3542 @noindent
3543 With this method, you can easily define different commands for
3544 different links. Furthermore, the global definition of @key{RET} and
3545 @kbd{Mouse-2} remain available for the rest of the text in the buffer.
3546
3547 @vindex mouse-1-click-follows-link
3548 The basic Emacs command for clicking on links is @kbd{Mouse-2}.
3549 However, for compatibility with other graphical applications, Emacs
3550 also recognizes @kbd{Mouse-1} clicks on links, provided the user
3551 clicks on the link quickly without moving the mouse. This behavior is
3552 controlled by the user option @code{mouse-1-click-follows-link}.
3553 @xref{Mouse References,,, emacs, The GNU Emacs Manual}.
3554
3555 To set up the link so that it obeys
3556 @code{mouse-1-click-follows-link}, you must either (1) apply a
3557 @code{follow-link} text or overlay property to the link text, or (2)
3558 bind the @code{follow-link} event to a keymap (which can be a major
3559 mode keymap or a local keymap specified via the @code{keymap} text
3560 property). The value of the @code{follow-link} property, or the
3561 binding for the @code{follow-link} event, acts as a ``condition'' for
3562 the link action. This condition tells Emacs two things: the
3563 circumstances under which a @kbd{Mouse-1} click should be regarded as
3564 occurring ``inside'' the link, and how to compute an ``action code''
3565 that says what to translate the @kbd{Mouse-1} click into. The link
3566 action condition can be one of the following:
3567
3568 @table @asis
3569 @item @code{mouse-face}
3570 If the condition is the symbol @code{mouse-face}, a position is inside
3571 a link if there is a non-@code{nil} @code{mouse-face} property at that
3572 position. The action code is always @code{t}.
3573
3574 For example, here is how Info mode handles @key{Mouse-1}:
3575
3576 @smallexample
3577 (define-key Info-mode-map [follow-link] 'mouse-face)
3578 @end smallexample
3579
3580 @item a function
3581 If the condition is a function, @var{func}, then a position @var{pos}
3582 is inside a link if @code{(@var{func} @var{pos})} evaluates to
3583 non-@code{nil}. The value returned by @var{func} serves as the action
3584 code.
3585
3586 For example, here is how pcvs enables @kbd{Mouse-1} to follow links on
3587 file names only:
3588
3589 @smallexample
3590 (define-key map [follow-link]
3591 (lambda (pos)
3592 (eq (get-char-property pos 'face) 'cvs-filename-face)))
3593 @end smallexample
3594
3595 @item anything else
3596 If the condition value is anything else, then the position is inside a
3597 link and the condition itself is the action code. Clearly, you should
3598 specify this kind of condition only when applying the condition via a
3599 text or property overlay on the link text (so that it does not apply
3600 to the entire buffer).
3601 @end table
3602
3603 @noindent
3604 The action code tells @kbd{Mouse-1} how to follow the link:
3605
3606 @table @asis
3607 @item a string or vector
3608 If the action code is a string or vector, the @kbd{Mouse-1} event is
3609 translated into the first element of the string or vector; i.e., the
3610 action of the @kbd{Mouse-1} click is the local or global binding of
3611 that character or symbol. Thus, if the action code is @code{"foo"},
3612 @kbd{Mouse-1} translates into @kbd{f}. If it is @code{[foo]},
3613 @kbd{Mouse-1} translates into @key{foo}.
3614
3615 @item anything else
3616 For any other non-@code{nil} action code, the @kbd{Mouse-1} event is
3617 translated into a @kbd{Mouse-2} event at the same position.
3618 @end table
3619
3620 To define @kbd{Mouse-1} to activate a button defined with
3621 @code{define-button-type}, give the button a @code{follow-link}
3622 property. The property value should be a link action condition, as
3623 described above. @xref{Buttons}. For example, here is how Help mode
3624 handles @kbd{Mouse-1}:
3625
3626 @smallexample
3627 (define-button-type 'help-xref
3628 'follow-link t
3629 'action #'help-button-action)
3630 @end smallexample
3631
3632 To define @kbd{Mouse-1} on a widget defined with
3633 @code{define-widget}, give the widget a @code{:follow-link} property.
3634 The property value should be a link action condition, as described
3635 above. For example, here is how the @code{link} widget specifies that
3636 a @key{Mouse-1} click shall be translated to @key{RET}:
3637
3638 @smallexample
3639 (define-widget 'link 'item
3640 "An embedded link."
3641 :button-prefix 'widget-link-prefix
3642 :button-suffix 'widget-link-suffix
3643 :follow-link "\C-m"
3644 :help-echo "Follow the link."
3645 :format "%[%t%]")
3646 @end smallexample
3647
3648 @defun mouse-on-link-p pos
3649 This function returns non-@code{nil} if position @var{pos} in the
3650 current buffer is on a link. @var{pos} can also be a mouse event
3651 location, as returned by @code{event-start} (@pxref{Accessing Mouse}).
3652 @end defun
3653
3654 @node Fields
3655 @subsection Defining and Using Fields
3656 @cindex fields
3657
3658 A field is a range of consecutive characters in the buffer that are
3659 identified by having the same value (comparing with @code{eq}) of the
3660 @code{field} property (either a text-property or an overlay property).
3661 This section describes special functions that are available for
3662 operating on fields.
3663
3664 You specify a field with a buffer position, @var{pos}. We think of
3665 each field as containing a range of buffer positions, so the position
3666 you specify stands for the field containing that position.
3667
3668 When the characters before and after @var{pos} are part of the same
3669 field, there is no doubt which field contains @var{pos}: the one those
3670 characters both belong to. When @var{pos} is at a boundary between
3671 fields, which field it belongs to depends on the stickiness of the
3672 @code{field} properties of the two surrounding characters (@pxref{Sticky
3673 Properties}). The field whose property would be inherited by text
3674 inserted at @var{pos} is the field that contains @var{pos}.
3675
3676 There is an anomalous case where newly inserted text at @var{pos}
3677 would not inherit the @code{field} property from either side. This
3678 happens if the previous character's @code{field} property is not
3679 rear-sticky, and the following character's @code{field} property is not
3680 front-sticky. In this case, @var{pos} belongs to neither the preceding
3681 field nor the following field; the field functions treat it as belonging
3682 to an empty field whose beginning and end are both at @var{pos}.
3683
3684 In all of these functions, if @var{pos} is omitted or @code{nil}, the
3685 value of point is used by default. If narrowing is in effect, then
3686 @var{pos} should fall within the accessible portion. @xref{Narrowing}.
3687
3688 @defun field-beginning &optional pos escape-from-edge limit
3689 This function returns the beginning of the field specified by @var{pos}.
3690
3691 If @var{pos} is at the beginning of its field, and
3692 @var{escape-from-edge} is non-@code{nil}, then the return value is
3693 always the beginning of the preceding field that @emph{ends} at @var{pos},
3694 regardless of the stickiness of the @code{field} properties around
3695 @var{pos}.
3696
3697 If @var{limit} is non-@code{nil}, it is a buffer position; if the
3698 beginning of the field is before @var{limit}, then @var{limit} will be
3699 returned instead.
3700 @end defun
3701
3702 @defun field-end &optional pos escape-from-edge limit
3703 This function returns the end of the field specified by @var{pos}.
3704
3705 If @var{pos} is at the end of its field, and @var{escape-from-edge} is
3706 non-@code{nil}, then the return value is always the end of the following
3707 field that @emph{begins} at @var{pos}, regardless of the stickiness of
3708 the @code{field} properties around @var{pos}.
3709
3710 If @var{limit} is non-@code{nil}, it is a buffer position; if the end
3711 of the field is after @var{limit}, then @var{limit} will be returned
3712 instead.
3713 @end defun
3714
3715 @defun field-string &optional pos
3716 This function returns the contents of the field specified by @var{pos},
3717 as a string.
3718 @end defun
3719
3720 @defun field-string-no-properties &optional pos
3721 This function returns the contents of the field specified by @var{pos},
3722 as a string, discarding text properties.
3723 @end defun
3724
3725 @defun delete-field &optional pos
3726 This function deletes the text of the field specified by @var{pos}.
3727 @end defun
3728
3729 @defun constrain-to-field new-pos old-pos &optional escape-from-edge only-in-line inhibit-capture-property
3730 This function ``constrains'' @var{new-pos} to the field that
3731 @var{old-pos} belongs to---in other words, it returns the position
3732 closest to @var{new-pos} that is in the same field as @var{old-pos}.
3733
3734 If @var{new-pos} is @code{nil}, then @code{constrain-to-field} uses
3735 the value of point instead, and moves point to the resulting position
3736 as well as returning it.
3737
3738 If @var{old-pos} is at the boundary of two fields, then the acceptable
3739 final positions depend on the argument @var{escape-from-edge}. If
3740 @var{escape-from-edge} is @code{nil}, then @var{new-pos} must be in
3741 the field whose @code{field} property equals what new characters
3742 inserted at @var{old-pos} would inherit. (This depends on the
3743 stickiness of the @code{field} property for the characters before and
3744 after @var{old-pos}.) If @var{escape-from-edge} is non-@code{nil},
3745 @var{new-pos} can be anywhere in the two adjacent fields.
3746 Additionally, if two fields are separated by another field with the
3747 special value @code{boundary}, then any point within this special
3748 field is also considered to be ``on the boundary.''
3749
3750 Commands like @kbd{C-a} with no argumemt, that normally move backward
3751 to a specific kind of location and stay there once there, probably
3752 should specify @code{nil} for @var{escape-from-edge}. Other motion
3753 commands that check fields should probably pass @code{t}.
3754
3755 If the optional argument @var{only-in-line} is non-@code{nil}, and
3756 constraining @var{new-pos} in the usual way would move it to a different
3757 line, @var{new-pos} is returned unconstrained. This used in commands
3758 that move by line, such as @code{next-line} and
3759 @code{beginning-of-line}, so that they respect field boundaries only in
3760 the case where they can still move to the right line.
3761
3762 If the optional argument @var{inhibit-capture-property} is
3763 non-@code{nil}, and @var{old-pos} has a non-@code{nil} property of that
3764 name, then any field boundaries are ignored.
3765
3766 You can cause @code{constrain-to-field} to ignore all field boundaries
3767 (and so never constrain anything) by binding the variable
3768 @code{inhibit-field-text-motion} to a non-@code{nil} value.
3769 @end defun
3770
3771 @node Not Intervals
3772 @subsection Why Text Properties are not Intervals
3773 @cindex intervals
3774
3775 Some editors that support adding attributes to text in the buffer do
3776 so by letting the user specify ``intervals'' within the text, and adding
3777 the properties to the intervals. Those editors permit the user or the
3778 programmer to determine where individual intervals start and end. We
3779 deliberately provided a different sort of interface in Emacs Lisp to
3780 avoid certain paradoxical behavior associated with text modification.
3781
3782 If the actual subdivision into intervals is meaningful, that means you
3783 can distinguish between a buffer that is just one interval with a
3784 certain property, and a buffer containing the same text subdivided into
3785 two intervals, both of which have that property.
3786
3787 Suppose you take the buffer with just one interval and kill part of
3788 the text. The text remaining in the buffer is one interval, and the
3789 copy in the kill ring (and the undo list) becomes a separate interval.
3790 Then if you yank back the killed text, you get two intervals with the
3791 same properties. Thus, editing does not preserve the distinction
3792 between one interval and two.
3793
3794 Suppose we ``fix'' this problem by coalescing the two intervals when
3795 the text is inserted. That works fine if the buffer originally was a
3796 single interval. But suppose instead that we have two adjacent
3797 intervals with the same properties, and we kill the text of one interval
3798 and yank it back. The same interval-coalescence feature that rescues
3799 the other case causes trouble in this one: after yanking, we have just
3800 one interval. One again, editing does not preserve the distinction
3801 between one interval and two.
3802
3803 Insertion of text at the border between intervals also raises
3804 questions that have no satisfactory answer.
3805
3806 However, it is easy to arrange for editing to behave consistently for
3807 questions of the form, ``What are the properties of this character?''
3808 So we have decided these are the only questions that make sense; we have
3809 not implemented asking questions about where intervals start or end.
3810
3811 In practice, you can usually use the text property search functions in
3812 place of explicit interval boundaries. You can think of them as finding
3813 the boundaries of intervals, assuming that intervals are always
3814 coalesced whenever possible. @xref{Property Search}.
3815
3816 Emacs also provides explicit intervals as a presentation feature; see
3817 @ref{Overlays}.
3818
3819 @node Substitution
3820 @section Substituting for a Character Code
3821
3822 The following functions replace characters within a specified region
3823 based on their character codes.
3824
3825 @defun subst-char-in-region start end old-char new-char &optional noundo
3826 @cindex replace characters
3827 This function replaces all occurrences of the character @var{old-char}
3828 with the character @var{new-char} in the region of the current buffer
3829 defined by @var{start} and @var{end}.
3830
3831 @cindex undo avoidance
3832 If @var{noundo} is non-@code{nil}, then @code{subst-char-in-region} does
3833 not record the change for undo and does not mark the buffer as modified.
3834 This was useful for controlling the old selective display feature
3835 (@pxref{Selective Display}).
3836
3837 @code{subst-char-in-region} does not move point and returns
3838 @code{nil}.
3839
3840 @example
3841 @group
3842 ---------- Buffer: foo ----------
3843 This is the contents of the buffer before.
3844 ---------- Buffer: foo ----------
3845 @end group
3846
3847 @group
3848 (subst-char-in-region 1 20 ?i ?X)
3849 @result{} nil
3850
3851 ---------- Buffer: foo ----------
3852 ThXs Xs the contents of the buffer before.
3853 ---------- Buffer: foo ----------
3854 @end group
3855 @end example
3856 @end defun
3857
3858 @deffn Command translate-region start end table
3859 This function applies a translation table to the characters in the
3860 buffer between positions @var{start} and @var{end}.
3861
3862 The translation table @var{table} is a string or a char-table;
3863 @code{(aref @var{table} @var{ochar})} gives the translated character
3864 corresponding to @var{ochar}. If @var{table} is a string, any
3865 characters with codes larger than the length of @var{table} are not
3866 altered by the translation.
3867
3868 The return value of @code{translate-region} is the number of
3869 characters that were actually changed by the translation. This does
3870 not count characters that were mapped into themselves in the
3871 translation table.
3872 @end deffn
3873
3874 @node Registers
3875 @section Registers
3876 @cindex registers
3877
3878 A register is a sort of variable used in Emacs editing that can hold a
3879 variety of different kinds of values. Each register is named by a
3880 single character. All @acronym{ASCII} characters and their meta variants
3881 (but with the exception of @kbd{C-g}) can be used to name registers.
3882 Thus, there are 255 possible registers. A register is designated in
3883 Emacs Lisp by the character that is its name.
3884
3885 @defvar register-alist
3886 This variable is an alist of elements of the form @code{(@var{name} .
3887 @var{contents})}. Normally, there is one element for each Emacs
3888 register that has been used.
3889
3890 The object @var{name} is a character (an integer) identifying the
3891 register.
3892 @end defvar
3893
3894 The @var{contents} of a register can have several possible types:
3895
3896 @table @asis
3897 @item a number
3898 A number stands for itself. If @code{insert-register} finds a number
3899 in the register, it converts the number to decimal.
3900
3901 @item a marker
3902 A marker represents a buffer position to jump to.
3903
3904 @item a string
3905 A string is text saved in the register.
3906
3907 @item a rectangle
3908 A rectangle is represented by a list of strings.
3909
3910 @item @code{(@var{window-configuration} @var{position})}
3911 This represents a window configuration to restore in one frame, and a
3912 position to jump to in the current buffer.
3913
3914 @item @code{(@var{frame-configuration} @var{position})}
3915 This represents a frame configuration to restore, and a position
3916 to jump to in the current buffer.
3917
3918 @item (file @var{filename})
3919 This represents a file to visit; jumping to this value visits file
3920 @var{filename}.
3921
3922 @item (file-query @var{filename} @var{position})
3923 This represents a file to visit and a position in it; jumping to this
3924 value visits file @var{filename} and goes to buffer position
3925 @var{position}. Restoring this type of position asks the user for
3926 confirmation first.
3927 @end table
3928
3929 The functions in this section return unpredictable values unless
3930 otherwise stated.
3931
3932 @defun get-register reg
3933 This function returns the contents of the register
3934 @var{reg}, or @code{nil} if it has no contents.
3935 @end defun
3936
3937 @defun set-register reg value
3938 This function sets the contents of register @var{reg} to @var{value}.
3939 A register can be set to any value, but the other register functions
3940 expect only certain data types. The return value is @var{value}.
3941 @end defun
3942
3943 @deffn Command view-register reg
3944 This command displays what is contained in register @var{reg}.
3945 @end deffn
3946
3947 @deffn Command insert-register reg &optional beforep
3948 This command inserts contents of register @var{reg} into the current
3949 buffer.
3950
3951 Normally, this command puts point before the inserted text, and the
3952 mark after it. However, if the optional second argument @var{beforep}
3953 is non-@code{nil}, it puts the mark before and point after.
3954 You can pass a non-@code{nil} second argument @var{beforep} to this
3955 function interactively by supplying any prefix argument.
3956
3957 If the register contains a rectangle, then the rectangle is inserted
3958 with its upper left corner at point. This means that text is inserted
3959 in the current line and underneath it on successive lines.
3960
3961 If the register contains something other than saved text (a string) or
3962 a rectangle (a list), currently useless things happen. This may be
3963 changed in the future.
3964 @end deffn
3965
3966 @node Transposition
3967 @section Transposition of Text
3968
3969 This subroutine is used by the transposition commands.
3970
3971 @defun transpose-regions start1 end1 start2 end2 &optional leave-markers
3972 This function exchanges two nonoverlapping portions of the buffer.
3973 Arguments @var{start1} and @var{end1} specify the bounds of one portion
3974 and arguments @var{start2} and @var{end2} specify the bounds of the
3975 other portion.
3976
3977 Normally, @code{transpose-regions} relocates markers with the transposed
3978 text; a marker previously positioned within one of the two transposed
3979 portions moves along with that portion, thus remaining between the same
3980 two characters in their new position. However, if @var{leave-markers}
3981 is non-@code{nil}, @code{transpose-regions} does not do this---it leaves
3982 all markers unrelocated.
3983 @end defun
3984
3985 @node Base 64
3986 @section Base 64 Encoding
3987 @cindex base 64 encoding
3988
3989 Base 64 code is used in email to encode a sequence of 8-bit bytes as
3990 a longer sequence of @acronym{ASCII} graphic characters. It is defined in
3991 Internet RFC@footnote{
3992 An RFC, an acronym for @dfn{Request for Comments}, is a numbered
3993 Internet informational document describing a standard. RFCs are
3994 usually written by technical experts acting on their own initiative,
3995 and are traditionally written in a pragmatic, experience-driven
3996 manner.
3997 }2045. This section describes the functions for
3998 converting to and from this code.
3999
4000 @deffn Command base64-encode-region beg end &optional no-line-break
4001 This function converts the region from @var{beg} to @var{end} into base
4002 64 code. It returns the length of the encoded text. An error is
4003 signaled if a character in the region is multibyte, i.e.@: in a
4004 multibyte buffer the region must contain only characters from the
4005 charsets @code{ascii}, @code{eight-bit-control} and
4006 @code{eight-bit-graphic}.
4007
4008 Normally, this function inserts newline characters into the encoded
4009 text, to avoid overlong lines. However, if the optional argument
4010 @var{no-line-break} is non-@code{nil}, these newlines are not added, so
4011 the output is just one long line.
4012 @end deffn
4013
4014 @deffn Command base64-encode-string string &optional no-line-break
4015 This function converts the string @var{string} into base 64 code. It
4016 returns a string containing the encoded text. As for
4017 @code{base64-encode-region}, an error is signaled if a character in the
4018 string is multibyte.
4019
4020 Normally, this function inserts newline characters into the encoded
4021 text, to avoid overlong lines. However, if the optional argument
4022 @var{no-line-break} is non-@code{nil}, these newlines are not added, so
4023 the result string is just one long line.
4024 @end deffn
4025
4026 @defun base64-decode-region beg end
4027 This function converts the region from @var{beg} to @var{end} from base
4028 64 code into the corresponding decoded text. It returns the length of
4029 the decoded text.
4030
4031 The decoding functions ignore newline characters in the encoded text.
4032 @end defun
4033
4034 @defun base64-decode-string string
4035 This function converts the string @var{string} from base 64 code into
4036 the corresponding decoded text. It returns a unibyte string containing the
4037 decoded text.
4038
4039 The decoding functions ignore newline characters in the encoded text.
4040 @end defun
4041
4042 @node MD5 Checksum
4043 @section MD5 Checksum
4044 @cindex MD5 checksum
4045 @cindex message digest computation
4046
4047 MD5 cryptographic checksums, or @dfn{message digests}, are 128-bit
4048 ``fingerprints'' of a document or program. They are used to verify
4049 that you have an exact and unaltered copy of the data. The algorithm
4050 to calculate the MD5 message digest is defined in Internet
4051 RFC@footnote{
4052 For an explanation of what is an RFC, see the footnote in @ref{Base
4053 64}.
4054 }1321. This section describes the Emacs facilities for computing
4055 message digests.
4056
4057 @defun md5 object &optional start end coding-system noerror
4058 This function returns the MD5 message digest of @var{object}, which
4059 should be a buffer or a string.
4060
4061 The two optional arguments @var{start} and @var{end} are character
4062 positions specifying the portion of @var{object} to compute the
4063 message digest for. If they are @code{nil} or omitted, the digest is
4064 computed for the whole of @var{object}.
4065
4066 The function @code{md5} does not compute the message digest directly
4067 from the internal Emacs representation of the text (@pxref{Text
4068 Representations}). Instead, it encodes the text using a coding
4069 system, and computes the message digest from the encoded text. The
4070 optional fourth argument @var{coding-system} specifies which coding
4071 system to use for encoding the text. It should be the same coding
4072 system that you used to read the text, or that you used or will use
4073 when saving or sending the text. @xref{Coding Systems}, for more
4074 information about coding systems.
4075
4076 If @var{coding-system} is @code{nil} or omitted, the default depends
4077 on @var{object}. If @var{object} is a buffer, the default for
4078 @var{coding-system} is whatever coding system would be chosen by
4079 default for writing this text into a file. If @var{object} is a
4080 string, the user's most preferred coding system (@pxref{Recognize
4081 Coding, prefer-coding-system, the description of
4082 @code{prefer-coding-system}, emacs, GNU Emacs Manual}) is used.
4083
4084 Normally, @code{md5} signals an error if the text can't be encoded
4085 using the specified or chosen coding system. However, if
4086 @var{noerror} is non-@code{nil}, it silently uses @code{raw-text}
4087 coding instead.
4088 @end defun
4089
4090 @node Parsing HTML
4091 @section Parsing HTML
4092 @cindex parsing html
4093
4094 @defun libxml-parse-html-region start end &optional base-url
4095 This function provides HTML parsing via the @code{libxml2} library.
4096 It parses ``real world'' HTML and tries to return a sensible parse tree
4097 regardless.
4098
4099 In addition to @var{start} and @var{end} (specifying the start and end
4100 of the region to act on), it takes an optional parameter,
4101 @var{base-url}, which is used to expand relative URLs in the document,
4102 if any.
4103
4104 Here's an example demonstrating the structure of the parsed data you
4105 get out. Given this HTML document:
4106
4107 @example
4108 <html><hEad></head><body width=101><div class=thing>Foo<div>Yes
4109 @end example
4110
4111 You get this parse tree:
4112
4113 @example
4114 (html
4115 (head)
4116 (body
4117 (:width . "101")
4118 (div
4119 (:class . "thing")
4120 (text . "Foo")
4121 (div
4122 (text . "Yes\n")))))
4123 @end example
4124
4125 It's a simple tree structure, where the @code{car} for each node is
4126 the name of the node, and the @code{cdr} is the value, or the list of
4127 values.
4128
4129 Attributes are coded the same way as child nodes, but with @samp{:} as
4130 the first character.
4131 @end defun
4132
4133 @cindex parsing xml
4134 @defun libxml-parse-xml-region start end &optional base-url
4135
4136 This is much the same as @code{libxml-parse-html-region} above, but
4137 operates on XML instead of HTML, and is correspondingly stricter about
4138 syntax.
4139 @end defun
4140
4141 @node Atomic Changes
4142 @section Atomic Change Groups
4143 @cindex atomic changes
4144
4145 In database terminology, an @dfn{atomic} change is an indivisible
4146 change---it can succeed entirely or it can fail entirely, but it
4147 cannot partly succeed. A Lisp program can make a series of changes to
4148 one or several buffers as an @dfn{atomic change group}, meaning that
4149 either the entire series of changes will be installed in their buffers
4150 or, in case of an error, none of them will be.
4151
4152 To do this for one buffer, the one already current, simply write a
4153 call to @code{atomic-change-group} around the code that makes the
4154 changes, like this:
4155
4156 @example
4157 (atomic-change-group
4158 (insert foo)
4159 (delete-region x y))
4160 @end example
4161
4162 @noindent
4163 If an error (or other nonlocal exit) occurs inside the body of
4164 @code{atomic-change-group}, it unmakes all the changes in that buffer
4165 that were during the execution of the body. This kind of change group
4166 has no effect on any other buffers---any such changes remain.
4167
4168 If you need something more sophisticated, such as to make changes in
4169 various buffers constitute one atomic group, you must directly call
4170 lower-level functions that @code{atomic-change-group} uses.
4171
4172 @defun prepare-change-group &optional buffer
4173 This function sets up a change group for buffer @var{buffer}, which
4174 defaults to the current buffer. It returns a ``handle'' that
4175 represents the change group. You must use this handle to activate the
4176 change group and subsequently to finish it.
4177 @end defun
4178
4179 To use the change group, you must @dfn{activate} it. You must do
4180 this before making any changes in the text of @var{buffer}.
4181
4182 @defun activate-change-group handle
4183 This function activates the change group that @var{handle} designates.
4184 @end defun
4185
4186 After you activate the change group, any changes you make in that
4187 buffer become part of it. Once you have made all the desired changes
4188 in the buffer, you must @dfn{finish} the change group. There are two
4189 ways to do this: you can either accept (and finalize) all the changes,
4190 or cancel them all.
4191
4192 @defun accept-change-group handle
4193 This function accepts all the changes in the change group specified by
4194 @var{handle}, making them final.
4195 @end defun
4196
4197 @defun cancel-change-group handle
4198 This function cancels and undoes all the changes in the change group
4199 specified by @var{handle}.
4200 @end defun
4201
4202 Your code should use @code{unwind-protect} to make sure the group is
4203 always finished. The call to @code{activate-change-group} should be
4204 inside the @code{unwind-protect}, in case the user types @kbd{C-g}
4205 just after it runs. (This is one reason why
4206 @code{prepare-change-group} and @code{activate-change-group} are
4207 separate functions, because normally you would call
4208 @code{prepare-change-group} before the start of that
4209 @code{unwind-protect}.) Once you finish the group, don't use the
4210 handle again---in particular, don't try to finish the same group
4211 twice.
4212
4213 To make a multibuffer change group, call @code{prepare-change-group}
4214 once for each buffer you want to cover, then use @code{nconc} to
4215 combine the returned values, like this:
4216
4217 @example
4218 (nconc (prepare-change-group buffer-1)
4219 (prepare-change-group buffer-2))
4220 @end example
4221
4222 You can then activate the multibuffer change group with a single call
4223 to @code{activate-change-group}, and finish it with a single call to
4224 @code{accept-change-group} or @code{cancel-change-group}.
4225
4226 Nested use of several change groups for the same buffer works as you
4227 would expect. Non-nested use of change groups for the same buffer
4228 will get Emacs confused, so don't let it happen; the first change
4229 group you start for any given buffer should be the last one finished.
4230
4231 @node Change Hooks
4232 @section Change Hooks
4233 @cindex change hooks
4234 @cindex hooks for text changes
4235
4236 These hook variables let you arrange to take notice of all changes in
4237 all buffers (or in a particular buffer, if you make them buffer-local).
4238 See also @ref{Special Properties}, for how to detect changes to specific
4239 parts of the text.
4240
4241 The functions you use in these hooks should save and restore the match
4242 data if they do anything that uses regular expressions; otherwise, they
4243 will interfere in bizarre ways with the editing operations that call
4244 them.
4245
4246 @defvar before-change-functions
4247 This variable holds a list of functions to call before any buffer
4248 modification. Each function gets two arguments, the beginning and end
4249 of the region that is about to change, represented as integers. The
4250 buffer that is about to change is always the current buffer.
4251 @end defvar
4252
4253 @defvar after-change-functions
4254 This variable holds a list of functions to call after any buffer
4255 modification. Each function receives three arguments: the beginning and
4256 end of the region just changed, and the length of the text that existed
4257 before the change. All three arguments are integers. The buffer that's
4258 about to change is always the current buffer.
4259
4260 The length of the old text is the difference between the buffer positions
4261 before and after that text as it was before the change. As for the
4262 changed text, its length is simply the difference between the first two
4263 arguments.
4264 @end defvar
4265
4266 Output of messages into the @samp{*Messages*} buffer does not
4267 call these functions.
4268
4269 @defmac combine-after-change-calls body@dots{}
4270 The macro executes @var{body} normally, but arranges to call the
4271 after-change functions just once for a series of several changes---if
4272 that seems safe.
4273
4274 If a program makes several text changes in the same area of the buffer,
4275 using the macro @code{combine-after-change-calls} around that part of
4276 the program can make it run considerably faster when after-change hooks
4277 are in use. When the after-change hooks are ultimately called, the
4278 arguments specify a portion of the buffer including all of the changes
4279 made within the @code{combine-after-change-calls} body.
4280
4281 @strong{Warning:} You must not alter the values of
4282 @code{after-change-functions} within
4283 the body of a @code{combine-after-change-calls} form.
4284
4285 @strong{Warning:} if the changes you combine occur in widely scattered
4286 parts of the buffer, this will still work, but it is not advisable,
4287 because it may lead to inefficient behavior for some change hook
4288 functions.
4289 @end defmac
4290
4291 @defvar first-change-hook
4292 This variable is a normal hook that is run whenever a buffer is changed
4293 that was previously in the unmodified state.
4294 @end defvar
4295
4296 @defvar inhibit-modification-hooks
4297 If this variable is non-@code{nil}, all of the change hooks are
4298 disabled; none of them run. This affects all the hook variables
4299 described above in this section, as well as the hooks attached to
4300 certain special text properties (@pxref{Special Properties}) and overlay
4301 properties (@pxref{Overlay Properties}).
4302
4303 Also, this variable is bound to non-@code{nil} while running those
4304 same hook variables, so that by default modifying the buffer from
4305 a modification hook does not cause other modification hooks to be run.
4306 If you do want modification hooks to be run in a particular piece of
4307 code that is itself run from a modification hook, then rebind locally
4308 @code{inhibit-modification-hooks} to @code{nil}.
4309 @end defvar
4310
4311