Document new Emacs 24 faces in Lisp manual.
[bpt/emacs.git] / doc / lispref / modes.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990-1995, 1998-1999, 2001-2011 Free Software Foundation, Inc.
4 @c See the file elisp.texi for copying conditions.
5 @setfilename ../../info/modes
6 @node Modes, Documentation, Keymaps, Top
7 @chapter Major and Minor Modes
8 @cindex mode
9
10 A @dfn{mode} is a set of definitions that customize Emacs and can be
11 turned on and off while you edit. There are two varieties of modes:
12 @dfn{major modes}, which are mutually exclusive and used for editing
13 particular kinds of text, and @dfn{minor modes}, which provide features
14 that users can enable individually.
15
16 This chapter describes how to write both major and minor modes, how to
17 indicate them in the mode line, and how they run hooks supplied by the
18 user. For related topics such as keymaps and syntax tables, see
19 @ref{Keymaps}, and @ref{Syntax Tables}.
20
21 @menu
22 * Hooks:: How to use hooks; how to write code that provides hooks.
23 * Major Modes:: Defining major modes.
24 * Minor Modes:: Defining minor modes.
25 * Mode Line Format:: Customizing the text that appears in the mode line.
26 * Imenu:: Providing a menu of definitions made in a buffer.
27 * Font Lock Mode:: How modes can highlight text according to syntax.
28 * Auto-Indentation:: How to teach Emacs to indent for a major mode.
29 * Desktop Save Mode:: How modes can have buffer state saved between
30 Emacs sessions.
31 @end menu
32
33 @node Hooks
34 @section Hooks
35 @cindex hooks
36
37 A @dfn{hook} is a variable where you can store a function or functions
38 to be called on a particular occasion by an existing program. Emacs
39 provides hooks for the sake of customization. Most often, hooks are set
40 up in the init file (@pxref{Init File}), but Lisp programs can set them also.
41 @xref{Standard Hooks}, for a list of standard hook variables.
42
43 @cindex normal hook
44 Most of the hooks in Emacs are @dfn{normal hooks}. These variables
45 contain lists of functions to be called with no arguments. By
46 convention, whenever the hook name ends in @samp{-hook}, that tells
47 you it is normal. We try to make all hooks normal, as much as
48 possible, so that you can use them in a uniform way.
49
50 Every major mode command is supposed to run a normal hook called the
51 @dfn{mode hook} as the one of the last steps of initialization. This
52 makes it easy for a user to customize the behavior of the mode, by
53 overriding the buffer-local variable assignments already made by the
54 mode. Most minor mode functions also run a mode hook at the end. But
55 hooks are used in other contexts too. For example, the hook
56 @code{suspend-hook} runs just before Emacs suspends itself
57 (@pxref{Suspending Emacs}).
58
59 The recommended way to add a hook function to a normal hook is by
60 calling @code{add-hook} (see below). The hook functions may be any of
61 the valid kinds of functions that @code{funcall} accepts (@pxref{What
62 Is a Function}). Most normal hook variables are initially void;
63 @code{add-hook} knows how to deal with this. You can add hooks either
64 globally or buffer-locally with @code{add-hook}.
65
66 @cindex abnormal hook
67 If the hook variable's name does not end with @samp{-hook}, that
68 indicates it is probably an @dfn{abnormal hook}. That means the hook
69 functions are called with arguments, or their return values are used
70 in some way. The hook's documentation says how the functions are
71 called. You can use @code{add-hook} to add a function to an abnormal
72 hook, but you must write the function to follow the hook's calling
73 convention.
74
75 By convention, abnormal hook names end in @samp{-functions} or
76 @samp{-hooks}. If the variable's name ends in @samp{-function}, then
77 its value is just a single function, not a list of functions.
78
79 @menu
80 * Running Hooks:: How to run a hook.
81 * Setting Hooks:: How to put functions on a hook, or remove them.
82 @end menu
83
84 @node Running Hooks
85 @subsection Running Hooks
86
87 At the appropriate times, Emacs uses the @code{run-hooks} function
88 and the other functions below to run particular hooks.
89
90 @defun run-hooks &rest hookvars
91 This function takes one or more normal hook variable names as
92 arguments, and runs each hook in turn. Each argument should be a
93 symbol that is a normal hook variable. These arguments are processed
94 in the order specified.
95
96 If a hook variable has a non-@code{nil} value, that value should be a
97 list of functions. @code{run-hooks} calls all the functions, one by
98 one, with no arguments.
99
100 The hook variable's value can also be a single function---either a
101 lambda expression or a symbol with a function definition---which
102 @code{run-hooks} calls. But this usage is obsolete.
103
104 If the hook variable is buffer-local, the buffer-local variable will
105 be used instead of the global variable. However, if the buffer-local
106 variable contains the element @code{t}, the global hook variable will
107 be run as well.
108 @end defun
109
110 @defun run-hook-with-args hook &rest args
111 This function is the way to run an abnormal hook and always call all
112 of the hook functions. It calls each of the hook functions one by
113 one, passing each of them the arguments @var{args}.
114 @end defun
115
116 @defun run-hook-with-args-until-failure hook &rest args
117 This function is the way to run an abnormal hook until one of the hook
118 functions fails. It calls each of the hook functions, passing each of
119 them the arguments @var{args}, until some hook function returns
120 @code{nil}. It then stops and returns @code{nil}. If none of the
121 hook functions return @code{nil}, it returns a non-@code{nil} value.
122 @end defun
123
124 @defun run-hook-with-args-until-success hook &rest args
125 This function is the way to run an abnormal hook until a hook function
126 succeeds. It calls each of the hook functions, passing each of them
127 the arguments @var{args}, until some hook function returns
128 non-@code{nil}. Then it stops, and returns whatever was returned by
129 the last hook function that was called. If all hook functions return
130 @code{nil}, it returns @code{nil} as well.
131 @end defun
132
133 @node Setting Hooks
134 @subsection Setting Hooks
135
136 Here's an example that uses a mode hook to turn on Auto Fill mode when
137 in Lisp Interaction mode:
138
139 @example
140 (add-hook 'lisp-interaction-mode-hook 'turn-on-auto-fill)
141 @end example
142
143 @defun add-hook hook function &optional append local
144 This function is the handy way to add function @var{function} to hook
145 variable @var{hook}. You can use it for abnormal hooks as well as for
146 normal hooks. @var{function} can be any Lisp function that can accept
147 the proper number of arguments for @var{hook}. For example,
148
149 @example
150 (add-hook 'text-mode-hook 'my-text-hook-function)
151 @end example
152
153 @noindent
154 adds @code{my-text-hook-function} to the hook called @code{text-mode-hook}.
155
156 If @var{function} is already present in @var{hook} (comparing using
157 @code{equal}), then @code{add-hook} does not add it a second time.
158
159 If @var{function} has a non-@code{nil} property
160 @code{permanent-local-hook}, then @code{kill-all-local-variables} (or
161 changing major modes) won't delete it from the hook variable's local
162 value.
163
164 It is best to design your hook functions so that the order in which
165 they are executed does not matter. Any dependence on the order is
166 asking for trouble. However, the order is predictable: normally,
167 @var{function} goes at the front of the hook list, so it will be
168 executed first (barring another @code{add-hook} call). If the
169 optional argument @var{append} is non-@code{nil}, the new hook
170 function goes at the end of the hook list and will be executed last.
171
172 @code{add-hook} can handle the cases where @var{hook} is void or its
173 value is a single function; it sets or changes the value to a list of
174 functions.
175
176 If @var{local} is non-@code{nil}, that says to add @var{function} to the
177 buffer-local hook list instead of to the global hook list. This makes
178 the hook buffer-local and adds @code{t} to the buffer-local value. The
179 latter acts as a flag to run the hook functions in the default value as
180 well as in the local value.
181 @end defun
182
183 @defun remove-hook hook function &optional local
184 This function removes @var{function} from the hook variable
185 @var{hook}. It compares @var{function} with elements of @var{hook}
186 using @code{equal}, so it works for both symbols and lambda
187 expressions.
188
189 If @var{local} is non-@code{nil}, that says to remove @var{function}
190 from the buffer-local hook list instead of from the global hook list.
191 @end defun
192
193 @node Major Modes
194 @section Major Modes
195 @cindex major mode
196
197 @cindex major mode command
198 Major modes specialize Emacs for editing particular kinds of text.
199 Each buffer has one major mode at a time. Every major mode is
200 associated with a @dfn{major mode command}, whose name should end in
201 @samp{-mode}. This command takes care of switching to that mode in the
202 current buffer, by setting various buffer-local variables such as a
203 local keymap. @xref{Major Mode Conventions}.
204
205 The least specialized major mode is called @dfn{Fundamental mode},
206 which has no mode-specific definitions or variable settings.
207
208 @deffn Command fundamental-mode
209 This is the major mode command for Fundamental mode. Unlike other mode
210 commands, it does @emph{not} run any mode hooks (@pxref{Major Mode
211 Conventions}), since you are not supposed to customize this mode.
212 @end deffn
213
214 The easiest way to write a major mode is to use the macro
215 @code{define-derived-mode}, which sets up the new mode as a variant of
216 an existing major mode. @xref{Derived Modes}. We recommend using
217 @code{define-derived-mode} even if the new mode is not an obvious
218 derivative of another mode, as it automatically enforces many coding
219 conventions for you. @xref{Basic Major Modes}, for common modes to
220 derive from.
221
222 The standard GNU Emacs Lisp directory tree contains the code for
223 several major modes, in files such as @file{text-mode.el},
224 @file{texinfo.el}, @file{lisp-mode.el}, and @file{rmail.el}. You can
225 study these libraries to see how modes are written.
226
227 @defopt major-mode
228 The buffer-local value of this variable holds the symbol for the current
229 major mode. Its default value holds the default major mode for new
230 buffers. The standard default value is @code{fundamental-mode}.
231
232 If the default value is @code{nil}, then whenever Emacs creates a new
233 buffer via a command such as @kbd{C-x b} (@code{switch-to-buffer}), the
234 new buffer is put in the major mode of the previously current buffer.
235 As an exception, if the major mode of the previous buffer has a
236 @code{mode-class} symbol property with value @code{special}, the new
237 buffer is put in Fundamental mode (@pxref{Major Mode Conventions}).
238 @end defopt
239
240 @menu
241 * Major Mode Conventions:: Coding conventions for keymaps, etc.
242 * Auto Major Mode:: How Emacs chooses the major mode automatically.
243 * Mode Help:: Finding out how to use a mode.
244 * Derived Modes:: Defining a new major mode based on another major
245 mode.
246 * Basic Major Modes:: Modes that other modes are often derived from.
247 * Generic Modes:: Defining a simple major mode that supports
248 comment syntax and Font Lock mode.
249 * Mode Hooks:: Hooks run at the end of major mode commands.
250 * Example Major Modes:: Text mode and Lisp modes.
251 @end menu
252
253 @node Major Mode Conventions
254 @subsection Major Mode Conventions
255 @cindex major mode conventions
256 @cindex conventions for writing major modes
257
258 The code for every major mode should follow various coding
259 conventions, including conventions for local keymap and syntax table
260 initialization, function and variable names, and hooks.
261
262 If you use the @code{define-derived-mode} macro, it will take care of
263 many of these conventions automatically. @xref{Derived Modes}. Note
264 also that fundamental mode is an exception to many of these conventions,
265 because its definition is to present the global state of Emacs.
266
267 The following list of conventions is only partial. Each major mode
268 should aim for consistency in general with other Emacs major modes, as
269 this makes Emacs as a whole more coherent. It is impossible to list
270 here all the possible points where this issue might come up; if the
271 Emacs developers point out an area where your major mode deviates from
272 the usual conventions, please make it compatible.
273
274 @itemize @bullet
275 @item
276 Define a major mode command whose name ends in @samp{-mode}. When
277 called with no arguments, this command should switch to the new mode in
278 the current buffer by setting up the keymap, syntax table, and
279 buffer-local variables in an existing buffer. It should not change the
280 buffer's contents.
281
282 @item
283 Write a documentation string for this command that describes the special
284 commands available in this mode. @xref{Mode Help}.
285
286 The documentation string may include the special documentation
287 substrings, @samp{\[@var{command}]}, @samp{\@{@var{keymap}@}}, and
288 @samp{\<@var{keymap}>}, which allow the help display to adapt
289 automatically to the user's own key bindings. @xref{Keys in
290 Documentation}.
291
292 @item
293 The major mode command should start by calling
294 @code{kill-all-local-variables}. This runs the normal hook
295 @code{change-major-mode-hook}, then gets rid of the buffer-local
296 variables of the major mode previously in effect. @xref{Creating
297 Buffer-Local}.
298
299 @item
300 The major mode command should set the variable @code{major-mode} to the
301 major mode command symbol. This is how @code{describe-mode} discovers
302 which documentation to print.
303
304 @item
305 The major mode command should set the variable @code{mode-name} to the
306 ``pretty'' name of the mode, usually a string (but see @ref{Mode Line
307 Data}, for other possible forms). The name of the mode appears
308 in the mode line.
309
310 @item
311 @cindex functions in modes
312 Since all global names are in the same name space, all the global
313 variables, constants, and functions that are part of the mode should
314 have names that start with the major mode name (or with an abbreviation
315 of it if the name is long). @xref{Coding Conventions}.
316
317 @item
318 In a major mode for editing some kind of structured text, such as a
319 programming language, indentation of text according to structure is
320 probably useful. So the mode should set @code{indent-line-function}
321 to a suitable function, and probably customize other variables
322 for indentation. @xref{Auto-Indentation}.
323
324 @item
325 @cindex keymaps in modes
326 The major mode should usually have its own keymap, which is used as the
327 local keymap in all buffers in that mode. The major mode command should
328 call @code{use-local-map} to install this local map. @xref{Active
329 Keymaps}, for more information.
330
331 This keymap should be stored permanently in a global variable named
332 @code{@var{modename}-mode-map}. Normally the library that defines the
333 mode sets this variable.
334
335 @xref{Tips for Defining}, for advice about how to write the code to set
336 up the mode's keymap variable.
337
338 @item
339 The key sequences bound in a major mode keymap should usually start with
340 @kbd{C-c}, followed by a control character, a digit, or @kbd{@{},
341 @kbd{@}}, @kbd{<}, @kbd{>}, @kbd{:} or @kbd{;}. The other punctuation
342 characters are reserved for minor modes, and ordinary letters are
343 reserved for users.
344
345 A major mode can also rebind the keys @kbd{M-n}, @kbd{M-p} and
346 @kbd{M-s}. The bindings for @kbd{M-n} and @kbd{M-p} should normally
347 be some kind of ``moving forward and backward,'' but this does not
348 necessarily mean cursor motion.
349
350 It is legitimate for a major mode to rebind a standard key sequence if
351 it provides a command that does ``the same job'' in a way better
352 suited to the text this mode is used for. For example, a major mode
353 for editing a programming language might redefine @kbd{C-M-a} to
354 ``move to the beginning of a function'' in a way that works better for
355 that language.
356
357 It is also legitimate for a major mode to rebind a standard key
358 sequence whose standard meaning is rarely useful in that mode. For
359 instance, minibuffer modes rebind @kbd{M-r}, whose standard meaning is
360 rarely of any use in the minibuffer. Major modes such as Dired or
361 Rmail that do not allow self-insertion of text can reasonably redefine
362 letters and other printing characters as special commands.
363
364 @item
365 Major modes for editing text should not define @key{RET} to do
366 anything other than insert a newline. However, it is ok for
367 specialized modes for text that users don't directly edit, such as
368 Dired and Info modes, to redefine @key{RET} to do something entirely
369 different.
370
371 @item
372 Major modes should not alter options that are primarily a matter of user
373 preference, such as whether Auto-Fill mode is enabled. Leave this to
374 each user to decide. However, a major mode should customize other
375 variables so that Auto-Fill mode will work usefully @emph{if} the user
376 decides to use it.
377
378 @item
379 @cindex syntax tables in modes
380 The mode may have its own syntax table or may share one with other
381 related modes. If it has its own syntax table, it should store this in
382 a variable named @code{@var{modename}-mode-syntax-table}. @xref{Syntax
383 Tables}.
384
385 @item
386 If the mode handles a language that has a syntax for comments, it should
387 set the variables that define the comment syntax. @xref{Options for
388 Comments,, Options Controlling Comments, emacs, The GNU Emacs Manual}.
389
390 @item
391 @cindex abbrev tables in modes
392 The mode may have its own abbrev table or may share one with other
393 related modes. If it has its own abbrev table, it should store this
394 in a variable named @code{@var{modename}-mode-abbrev-table}. If the
395 major mode command defines any abbrevs itself, it should pass @code{t}
396 for the @var{system-flag} argument to @code{define-abbrev}.
397 @xref{Defining Abbrevs}.
398
399 @item
400 The mode should specify how to do highlighting for Font Lock mode, by
401 setting up a buffer-local value for the variable
402 @code{font-lock-defaults} (@pxref{Font Lock Mode}).
403
404 @item
405 Each face that the mode defines should, if possible, inherit from an
406 existing Emacs face. @xref{Basic Faces}, and @ref{Faces for Font Lock}.
407
408 @item
409 The mode should specify how Imenu should find the definitions or
410 sections of a buffer, by setting up a buffer-local value for the
411 variable @code{imenu-generic-expression}, for the two variables
412 @code{imenu-prev-index-position-function} and
413 @code{imenu-extract-index-name-function}, or for the variable
414 @code{imenu-create-index-function} (@pxref{Imenu}).
415
416 @item
417 The mode can specify a local value for
418 @code{eldoc-documentation-function} to tell ElDoc mode how to handle
419 this mode.
420
421 @item
422 The mode can specify how to complete various keywords by adding
423 to the special hook @code{completion-at-point-functions}.
424
425 @item
426 Use @code{defvar} or @code{defcustom} to set mode-related variables, so
427 that they are not reinitialized if they already have a value. (Such
428 reinitialization could discard customizations made by the user.)
429
430 @item
431 @cindex buffer-local variables in modes
432 To make a buffer-local binding for an Emacs customization variable, use
433 @code{make-local-variable} in the major mode command, not
434 @code{make-variable-buffer-local}. The latter function would make the
435 variable local to every buffer in which it is subsequently set, which
436 would affect buffers that do not use this mode. It is undesirable for a
437 mode to have such global effects. @xref{Buffer-Local Variables}.
438
439 With rare exceptions, the only reasonable way to use
440 @code{make-variable-buffer-local} in a Lisp package is for a variable
441 which is used only within that package. Using it on a variable used by
442 other packages would interfere with them.
443
444 @item
445 @cindex mode hook
446 @cindex major mode hook
447 Each major mode should have a normal @dfn{mode hook} named
448 @code{@var{modename}-mode-hook}. The very last thing the major mode command
449 should do is to call @code{run-mode-hooks}. This runs the mode hook,
450 and then runs the normal hook @code{after-change-major-mode-hook}.
451 @xref{Mode Hooks}.
452
453 @item
454 The major mode command may start by calling some other major mode
455 command (called the @dfn{parent mode}) and then alter some of its
456 settings. A mode that does this is called a @dfn{derived mode}. The
457 recommended way to define one is to use the @code{define-derived-mode}
458 macro, but this is not required. Such a mode should call the parent
459 mode command inside a @code{delay-mode-hooks} form. (Using
460 @code{define-derived-mode} does this automatically.) @xref{Derived
461 Modes}, and @ref{Mode Hooks}.
462
463 @item
464 If something special should be done if the user switches a buffer from
465 this mode to any other major mode, this mode can set up a buffer-local
466 value for @code{change-major-mode-hook} (@pxref{Creating Buffer-Local}).
467
468 @item
469 If this mode is appropriate only for specially-prepared text produced by
470 the mode itself (rather than by the user typing at the keyboard or by an
471 external file), then the major mode command symbol should have a
472 property named @code{mode-class} with value @code{special}, put on as
473 follows:
474
475 @kindex mode-class @r{(property)}
476 @cindex @code{special} modes
477 @example
478 (put 'funny-mode 'mode-class 'special)
479 @end example
480
481 @noindent
482 This tells Emacs that new buffers created while the current buffer is in
483 Funny mode should not be put in Funny mode, even though the default
484 value of @code{major-mode} is @code{nil}. By default, the value of
485 @code{nil} for @code{major-mode} means to use the current buffer's major
486 mode when creating new buffers (@pxref{Auto Major Mode}), but with such
487 @code{special} modes, Fundamental mode is used instead. Modes such as
488 Dired, Rmail, and Buffer List use this feature.
489
490 The @code{define-derived-mode} macro automatically marks the derived
491 mode as special if the parent mode is special. Special mode is a
492 convenient parent for such modes to inherit from; @xref{Basic Major
493 Modes}.
494
495 @item
496 If you want to make the new mode the default for files with certain
497 recognizable names, add an element to @code{auto-mode-alist} to select
498 the mode for those file names (@pxref{Auto Major Mode}). If you
499 define the mode command to autoload, you should add this element in
500 the same file that calls @code{autoload}. If you use an autoload
501 cookie for the mode command, you can also use an autoload cookie for
502 the form that adds the element (@pxref{autoload cookie}). If you do
503 not autoload the mode command, it is sufficient to add the element in
504 the file that contains the mode definition.
505
506 @item
507 In the comments that document the file, you should provide a sample
508 @code{autoload} form and an example of how to add to
509 @code{auto-mode-alist}, that users can include in their init files
510 (@pxref{Init File}).
511
512 @item
513 @cindex mode loading
514 The top-level forms in the file defining the mode should be written so
515 that they may be evaluated more than once without adverse consequences.
516 Even if you never load the file more than once, someone else will.
517 @end itemize
518
519 @node Auto Major Mode
520 @subsection How Emacs Chooses a Major Mode
521 @cindex major mode, automatic selection
522
523 Based on information in the file name or in the file itself, Emacs
524 automatically selects a major mode for the new buffer when a file is
525 visited. It also processes local variables specified in the file text.
526
527 @deffn Command normal-mode &optional find-file
528 This function establishes the proper major mode and buffer-local variable
529 bindings for the current buffer. First it calls @code{set-auto-mode}
530 (see below), then it runs @code{hack-local-variables} to parse, and
531 bind or evaluate as appropriate, the file's local variables
532 (@pxref{File Local Variables}).
533
534 If the @var{find-file} argument to @code{normal-mode} is non-@code{nil},
535 @code{normal-mode} assumes that the @code{find-file} function is calling
536 it. In this case, it may process local variables in the @samp{-*-}
537 line or at the end of the file. The variable
538 @code{enable-local-variables} controls whether to do so. @xref{File
539 Variables, , Local Variables in Files, emacs, The GNU Emacs Manual},
540 for the syntax of the local variables section of a file.
541
542 If you run @code{normal-mode} interactively, the argument
543 @var{find-file} is normally @code{nil}. In this case,
544 @code{normal-mode} unconditionally processes any file local variables.
545
546 The function calls @code{set-auto-mode} to choose a major mode. If this
547 does not specify a mode, the buffer stays in the major mode determined
548 by the default value of @code{major-mode} (see below).
549
550 @cindex file mode specification error
551 @code{normal-mode} uses @code{condition-case} around the call to the
552 major mode command, so errors are caught and reported as a @samp{File
553 mode specification error}, followed by the original error message.
554 @end deffn
555
556 @defun set-auto-mode &optional keep-mode-if-same
557 @cindex visited file mode
558 This function selects the major mode that is appropriate for the
559 current buffer. It bases its decision (in order of precedence) on the
560 @w{@samp{-*-}} line, on any @samp{mode:} local variable near the end of
561 a file, on the @w{@samp{#!}} line (using @code{interpreter-mode-alist}),
562 on the text at the beginning of the buffer (using
563 @code{magic-mode-alist}), and finally on the visited file name (using
564 @code{auto-mode-alist}). @xref{Choosing Modes, , How Major Modes are
565 Chosen, emacs, The GNU Emacs Manual}. If @code{enable-local-variables}
566 is @code{nil}, @code{set-auto-mode} does not check the @w{@samp{-*-}}
567 line, or near the end of the file, for any mode tag.
568
569 If @var{keep-mode-if-same} is non-@code{nil}, this function does not
570 call the mode command if the buffer is already in the proper major
571 mode. For instance, @code{set-visited-file-name} sets this to
572 @code{t} to avoid killing buffer local variables that the user may
573 have set.
574 @end defun
575
576 @defun set-buffer-major-mode buffer
577 This function sets the major mode of @var{buffer} to the default value of
578 @code{major-mode}; if that is @code{nil}, it uses the
579 current buffer's major mode (if that is suitable). As an exception,
580 if @var{buffer}'s name is @samp{*scratch*}, it sets the mode to
581 @code{initial-major-mode}.
582
583 The low-level primitives for creating buffers do not use this function,
584 but medium-level commands such as @code{switch-to-buffer} and
585 @code{find-file-noselect} use it whenever they create buffers.
586 @end defun
587
588 @defopt initial-major-mode
589 @cindex @samp{*scratch*}
590 The value of this variable determines the major mode of the initial
591 @samp{*scratch*} buffer. The value should be a symbol that is a major
592 mode command. The default value is @code{lisp-interaction-mode}.
593 @end defopt
594
595 @defvar interpreter-mode-alist
596 This variable specifies major modes to use for scripts that specify a
597 command interpreter in a @samp{#!} line. Its value is an alist with
598 elements of the form @code{(@var{interpreter} . @var{mode})}; for
599 example, @code{("perl" . perl-mode)} is one element present by
600 default. The element says to use mode @var{mode} if the file
601 specifies an interpreter which matches @var{interpreter}.
602 @end defvar
603
604 @defvar magic-mode-alist
605 This variable's value is an alist with elements of the form
606 @code{(@var{regexp} . @var{function})}, where @var{regexp} is a
607 regular expression and @var{function} is a function or @code{nil}.
608 After visiting a file, @code{set-auto-mode} calls @var{function} if
609 the text at the beginning of the buffer matches @var{regexp} and
610 @var{function} is non-@code{nil}; if @var{function} is @code{nil},
611 @code{auto-mode-alist} gets to decide the mode.
612 @end defvar
613
614 @defvar magic-fallback-mode-alist
615 This works like @code{magic-mode-alist}, except that it is handled
616 only if @code{auto-mode-alist} does not specify a mode for this file.
617 @end defvar
618
619 @defvar auto-mode-alist
620 This variable contains an association list of file name patterns
621 (regular expressions) and corresponding major mode commands. Usually,
622 the file name patterns test for suffixes, such as @samp{.el} and
623 @samp{.c}, but this need not be the case. An ordinary element of the
624 alist looks like @code{(@var{regexp} . @var{mode-function})}.
625
626 For example,
627
628 @smallexample
629 @group
630 (("\\`/tmp/fol/" . text-mode)
631 ("\\.texinfo\\'" . texinfo-mode)
632 ("\\.texi\\'" . texinfo-mode)
633 @end group
634 @group
635 ("\\.el\\'" . emacs-lisp-mode)
636 ("\\.c\\'" . c-mode)
637 ("\\.h\\'" . c-mode)
638 @dots{})
639 @end group
640 @end smallexample
641
642 When you visit a file whose expanded file name (@pxref{File Name
643 Expansion}), with version numbers and backup suffixes removed using
644 @code{file-name-sans-versions} (@pxref{File Name Components}), matches
645 a @var{regexp}, @code{set-auto-mode} calls the corresponding
646 @var{mode-function}. This feature enables Emacs to select the proper
647 major mode for most files.
648
649 If an element of @code{auto-mode-alist} has the form @code{(@var{regexp}
650 @var{function} t)}, then after calling @var{function}, Emacs searches
651 @code{auto-mode-alist} again for a match against the portion of the file
652 name that did not match before. This feature is useful for
653 uncompression packages: an entry of the form @code{("\\.gz\\'"
654 @var{function} t)} can uncompress the file and then put the uncompressed
655 file in the proper mode according to the name sans @samp{.gz}.
656
657 Here is an example of how to prepend several pattern pairs to
658 @code{auto-mode-alist}. (You might use this sort of expression in your
659 init file.)
660
661 @smallexample
662 @group
663 (setq auto-mode-alist
664 (append
665 ;; @r{File name (within directory) starts with a dot.}
666 '(("/\\.[^/]*\\'" . fundamental-mode)
667 ;; @r{File name has no dot.}
668 ("/[^\\./]*\\'" . fundamental-mode)
669 ;; @r{File name ends in @samp{.C}.}
670 ("\\.C\\'" . c++-mode))
671 auto-mode-alist))
672 @end group
673 @end smallexample
674 @end defvar
675
676 @node Mode Help
677 @subsection Getting Help about a Major Mode
678 @cindex mode help
679 @cindex help for major mode
680 @cindex documentation for major mode
681
682 The @code{describe-mode} function is provides information about major
683 modes. It is normally bound to @kbd{C-h m}. It uses the value of the
684 variable @code{major-mode} (@pxref{Major Modes}), which is why every
685 major mode command needs to set that variable.
686
687 @deffn Command describe-mode
688 This function displays the documentation of the current major mode.
689
690 The @code{describe-mode} function calls the @code{documentation}
691 function using the value of @code{major-mode} as an argument. Thus, it
692 displays the documentation string of the major mode command.
693 (@xref{Accessing Documentation}.)
694 @end deffn
695
696 @node Derived Modes
697 @subsection Defining Derived Modes
698 @cindex derived mode
699
700 The recommended way to define a new major mode is to derive it from an
701 existing one using @code{define-derived-mode}. If there is no closely
702 related mode, you should inherit from either @code{text-mode},
703 @code{special-mode}, or @code{prog-mode}. @xref{Basic Major Modes}. If
704 none of these are suitable, you can inherit from @code{fundamental-mode}
705 (@pxref{Major Modes}).
706
707 @defmac define-derived-mode variant parent name docstring keyword-args@dots{} body@dots{}
708 This macro defines @var{variant} as a major mode command, using
709 @var{name} as the string form of the mode name. @var{variant} and
710 @var{parent} should be unquoted symbols.
711
712 The new command @var{variant} is defined to call the function
713 @var{parent}, then override certain aspects of that parent mode:
714
715 @itemize @bullet
716 @item
717 The new mode has its own sparse keymap, named
718 @code{@var{variant}-map}. @code{define-derived-mode}
719 makes the parent mode's keymap the parent of the new map, unless
720 @code{@var{variant}-map} is already set and already has a parent.
721
722 @item
723 The new mode has its own syntax table, kept in the variable
724 @code{@var{variant}-syntax-table}, unless you override this using the
725 @code{:syntax-table} keyword (see below). @code{define-derived-mode}
726 makes the parent mode's syntax-table the parent of
727 @code{@var{variant}-syntax-table}, unless the latter is already set
728 and already has a parent different from the standard syntax table.
729
730 @item
731 The new mode has its own abbrev table, kept in the variable
732 @code{@var{variant}-abbrev-table}, unless you override this using the
733 @code{:abbrev-table} keyword (see below).
734
735 @item
736 The new mode has its own mode hook, @code{@var{variant}-hook}. It
737 runs this hook, after running the hooks of its ancestor modes, with
738 @code{run-mode-hooks}, as the last thing it does. @xref{Mode Hooks}.
739 @end itemize
740
741 In addition, you can specify how to override other aspects of
742 @var{parent} with @var{body}. The command @var{variant}
743 evaluates the forms in @var{body} after setting up all its usual
744 overrides, just before running the mode hooks.
745
746 If @var{parent} has a non-@code{nil} @code{mode-class} symbol
747 property, then @code{define-derived-mode} sets the @code{mode-class}
748 property of @var{variant} to the same value. This ensures, for
749 example, that if @var{parent} is a special mode, then @var{variant} is
750 also a special mode (@pxref{Major Mode Conventions}).
751
752 You can also specify @code{nil} for @var{parent}. This gives the new
753 mode no parent. Then @code{define-derived-mode} behaves as described
754 above, but, of course, omits all actions connected with @var{parent}.
755
756 The argument @var{docstring} specifies the documentation string for
757 the new mode. @code{define-derived-mode} adds some general
758 information about the mode's hook, followed by the mode's keymap, at
759 the end of this docstring. If you omit @var{docstring},
760 @code{define-derived-mode} generates a documentation string.
761
762 The @var{keyword-args} are pairs of keywords and values. The values
763 are evaluated. The following keywords are currently supported:
764
765 @table @code
766 @item :syntax-table
767 You can use this to explicitly specify a syntax table for the new
768 mode. If you specify a @code{nil} value, the new mode uses the same
769 syntax table as @var{parent}, or the standard syntax table if
770 @var{parent} is @code{nil}. (Note that this does @emph{not} follow
771 the convention used for non-keyword arguments that a @code{nil} value
772 is equivalent with not specifying the argument.)
773
774 @item :abbrev-table
775 You can use this to explicitly specify an abbrev table for the new
776 mode. If you specify a @code{nil} value, the new mode uses the same
777 abbrev table as @var{parent}, or @code{fundamental-mode-abbrev-table}
778 if @var{parent} is @code{nil}. (Again, a @code{nil} value is
779 @emph{not} equivalent to not specifying this keyword.)
780
781 @item :group
782 If this is specified, the value should be the customization group for
783 this mode. (Not all major modes have one.) Only the (still
784 experimental and unadvertised) command @code{customize-mode} currently
785 uses this. @code{define-derived-mode} does @emph{not} automatically
786 define the specified customization group.
787 @end table
788
789 Here is a hypothetical example:
790
791 @example
792 (define-derived-mode hypertext-mode
793 text-mode "Hypertext"
794 "Major mode for hypertext.
795 \\@{hypertext-mode-map@}"
796 (setq case-fold-search nil))
797
798 (define-key hypertext-mode-map
799 [down-mouse-3] 'do-hyper-link)
800 @end example
801
802 Do not write an @code{interactive} spec in the definition;
803 @code{define-derived-mode} does that automatically.
804 @end defmac
805
806 @defun derived-mode-p &rest modes
807 This function returns non-@code{nil} if the current major mode is
808 derived from any of the major modes given by the symbols @var{modes}.
809 @end defun
810
811 @node Basic Major Modes
812 @subsection Basic Major Modes
813
814 Apart from Fundamental mode, there are three major modes that other
815 major modes commonly derive from: Text mode, Prog mode, and Special
816 mode. While Text mode is useful in its own right (e.g. for editing
817 files ending in @file{.txt}), Prog mode and Special mode exist mainly to
818 let other modes derive from them.
819
820 @vindex prog-mode-hook
821 As far as possible, new major modes should be derived, either directly
822 or indirectly, from one of these three modes. One reason is that this
823 allows users to customize a single mode hook
824 (e.g. @code{prog-mode-hook}) for an entire family of relevant modes
825 (e.g. all programming language modes).
826
827 @deffn Command text-mode
828 Text mode is a major mode for editing human languages. It defines the
829 @samp{"} and @samp{\} characters as having punctuation syntax
830 (@pxref{Syntax Class Table}), and binds @kbd{M-@key{TAB}} to
831 @code{ispell-complete-word} (@pxref{Spelling,,, emacs, The GNU Emacs
832 Manual}).
833
834 An example of a major mode derived from Text mode is HTML mode.
835 @xref{HTML Mode,,SGML and HTML Modes, emacs, The GNU Emacs Manual}.
836 @end deffn
837
838 @deffn Command prog-mode
839 Prog mode is a basic major mode for buffers containing programming
840 language source code. Most of the programming language major modes
841 built into Emacs are derived from it.
842
843 Prog mode binds @code{parse-sexp-ignore-comments} to @code{t}
844 (@pxref{Motion via Parsing}) and @code{bidi-paragraph-direction} to
845 @code{left-to-right} (@pxref{Bidirectional Display}).
846 @end deffn
847
848 @deffn Command special-mode
849 Special mode is a basic major mode for buffers containing text that is
850 produced specially by Emacs, rather than from a file. Major modes
851 derived from Special mode are given a @code{mode-class} property of
852 @code{special} (@pxref{Major Mode Conventions}).
853
854 Special mode sets the buffer to read-only. Its keymap defines several
855 common bindings, including @kbd{q} for @code{quit-window}, @kbd{z} for
856 @code{kill-this-buffer}, and @kbd{g} for @code{revert-buffer}
857 (@pxref{Reverting}).
858
859 An example of a major mode derived from Special mode is Buffer Menu
860 mode, which is used by the @samp{*Buffer List*} buffer. @xref{List
861 Buffers,,Listing Existing Buffers, emacs, The GNU Emacs Manual}.
862 @end deffn
863
864 @node Generic Modes
865 @subsection Generic Modes
866 @cindex generic mode
867
868 @dfn{Generic modes} are simple major modes with basic support for
869 comment syntax and Font Lock mode. To define a generic mode, use the
870 macro @code{define-generic-mode}. See the file @file{generic-x.el}
871 for some examples of the use of @code{define-generic-mode}.
872
873 @defmac define-generic-mode mode comment-list keyword-list font-lock-list auto-mode-list function-list &optional docstring
874 This macro defines a generic mode command named @var{mode} (a symbol,
875 not quoted). The optional argument @var{docstring} is the
876 documentation for the mode command. If you do not supply it,
877 @code{define-generic-mode} generates one by default.
878
879 The argument @var{comment-list} is a list in which each element is
880 either a character, a string of one or two characters, or a cons cell.
881 A character or a string is set up in the mode's syntax table as a
882 ``comment starter.'' If the entry is a cons cell, the @sc{car} is set
883 up as a ``comment starter'' and the @sc{cdr} as a ``comment ender.''
884 (Use @code{nil} for the latter if you want comments to end at the end
885 of the line.) Note that the syntax table mechanism has limitations
886 about what comment starters and enders are actually possible.
887 @xref{Syntax Tables}.
888
889 The argument @var{keyword-list} is a list of keywords to highlight
890 with @code{font-lock-keyword-face}. Each keyword should be a string.
891 Meanwhile, @var{font-lock-list} is a list of additional expressions to
892 highlight. Each element of this list should have the same form as an
893 element of @code{font-lock-keywords}. @xref{Search-based
894 Fontification}.
895
896 The argument @var{auto-mode-list} is a list of regular expressions to
897 add to the variable @code{auto-mode-alist}. They are added by the execution
898 of the @code{define-generic-mode} form, not by expanding the macro call.
899
900 Finally, @var{function-list} is a list of functions for the mode
901 command to call for additional setup. It calls these functions just
902 before it runs the mode hook variable @code{@var{mode}-hook}.
903 @end defmac
904
905 @node Mode Hooks
906 @subsection Mode Hooks
907
908 Every major mode command should finish by running its mode hook and
909 the mode-independent normal hook @code{after-change-major-mode-hook}.
910 It does this by calling @code{run-mode-hooks}. If the major mode is a
911 derived mode, that is if it calls another major mode (the parent mode)
912 in its body, it should do this inside @code{delay-mode-hooks} so that
913 the parent won't run these hooks itself. Instead, the derived mode's
914 call to @code{run-mode-hooks} runs the parent's mode hook too.
915 @xref{Major Mode Conventions}.
916
917 Emacs versions before Emacs 22 did not have @code{delay-mode-hooks}.
918 When user-implemented major modes have not been updated to use it,
919 they won't entirely follow these conventions: they may run the
920 parent's mode hook too early, or fail to run
921 @code{after-change-major-mode-hook}. If you encounter such a major
922 mode, please correct it to follow these conventions.
923
924 When you defined a major mode using @code{define-derived-mode}, it
925 automatically makes sure these conventions are followed. If you
926 define a major mode ``by hand,'' not using @code{define-derived-mode},
927 use the following functions to handle these conventions automatically.
928
929 @defun run-mode-hooks &rest hookvars
930 Major modes should run their mode hook using this function. It is
931 similar to @code{run-hooks} (@pxref{Hooks}), but it also runs
932 @code{after-change-major-mode-hook}.
933
934 When this function is called during the execution of a
935 @code{delay-mode-hooks} form, it does not run the hooks immediately.
936 Instead, it arranges for the next call to @code{run-mode-hooks} to run
937 them.
938 @end defun
939
940 @defmac delay-mode-hooks body@dots{}
941 When one major mode command calls another, it should do so inside of
942 @code{delay-mode-hooks}.
943
944 This macro executes @var{body}, but tells all @code{run-mode-hooks}
945 calls during the execution of @var{body} to delay running their hooks.
946 The hooks will actually run during the next call to
947 @code{run-mode-hooks} after the end of the @code{delay-mode-hooks}
948 construct.
949 @end defmac
950
951 @defvar after-change-major-mode-hook
952 This is a normal hook run by @code{run-mode-hooks}. It is run at the
953 very end of every properly-written major mode command.
954 @end defvar
955
956 @node Example Major Modes
957 @subsection Major Mode Examples
958
959 Text mode is perhaps the simplest mode besides Fundamental mode.
960 Here are excerpts from @file{text-mode.el} that illustrate many of
961 the conventions listed above:
962
963 @smallexample
964 @group
965 ;; @r{Create the syntax table for this mode.}
966 (defvar text-mode-syntax-table
967 (let ((st (make-syntax-table)))
968 (modify-syntax-entry ?\" ". " st)
969 (modify-syntax-entry ?\\ ". " st)
970 ;; Add `p' so M-c on `hello' leads to `Hello', not `hello'.
971 (modify-syntax-entry ?' "w p" st)
972 st)
973 "Syntax table used while in `text-mode'.")
974 @end group
975
976 ;; @r{Create the keymap for this mode.}
977 @group
978 (defvar text-mode-map
979 (let ((map (make-sparse-keymap)))
980 (define-key map "\e\t" 'ispell-complete-word)
981 (define-key map "\es" 'center-line)
982 (define-key map "\eS" 'center-paragraph)
983 map)
984 "Keymap for `text-mode'.
985 Many other modes, such as Mail mode, Outline mode
986 and Indented Text mode, inherit all the commands
987 defined in this map.")
988 @end group
989 @end smallexample
990
991 Here is how the actual mode command is defined now:
992
993 @smallexample
994 @group
995 (define-derived-mode text-mode nil "Text"
996 "Major mode for editing text written for humans to read.
997 In this mode, paragraphs are delimited only by blank or white lines.
998 You can thus get the full benefit of adaptive filling
999 (see the variable `adaptive-fill-mode').
1000 \\@{text-mode-map@}
1001 Turning on Text mode runs the normal hook `text-mode-hook'."
1002 @end group
1003 @group
1004 (set (make-local-variable 'text-mode-variant) t)
1005 ;; @r{These two lines are a feature added recently.}
1006 (set (make-local-variable 'require-final-newline)
1007 mode-require-final-newline)
1008 (set (make-local-variable 'indent-line-function) 'indent-relative))
1009 @end group
1010 @end smallexample
1011
1012 @noindent
1013 (The last line is redundant nowadays, since @code{indent-relative} is
1014 the default value, and we'll delete it in a future version.)
1015
1016 Here is how it was defined formerly, before
1017 @code{define-derived-mode} existed:
1018
1019 @smallexample
1020 @group
1021 ;; @r{This isn't needed nowadays, since @code{define-derived-mode} does it.}
1022 (define-abbrev-table 'text-mode-abbrev-table ()
1023 "Abbrev table used while in text mode.")
1024 @end group
1025
1026 @group
1027 (defun text-mode ()
1028 "Major mode for editing text intended for humans to read...
1029 Special commands: \\@{text-mode-map@}
1030 @end group
1031 @group
1032 Turning on text-mode runs the hook `text-mode-hook'."
1033 (interactive)
1034 (kill-all-local-variables)
1035 (use-local-map text-mode-map)
1036 @end group
1037 @group
1038 (setq local-abbrev-table text-mode-abbrev-table)
1039 (set-syntax-table text-mode-syntax-table)
1040 @end group
1041 @group
1042 ;; @r{These four lines are absent from the current version}
1043 ;; @r{not because this is done some other way, but because}
1044 ;; @r{nowadays Text mode uses the normal definition of paragraphs.}
1045 (set (make-local-variable 'paragraph-start)
1046 (concat "[ \t]*$\\|" page-delimiter))
1047 (set (make-local-variable 'paragraph-separate) paragraph-start)
1048 (set (make-local-variable 'indent-line-function) 'indent-relative-maybe)
1049 @end group
1050 @group
1051 (setq mode-name "Text")
1052 (setq major-mode 'text-mode)
1053 (run-mode-hooks 'text-mode-hook)) ; @r{Finally, this permits the user to}
1054 ; @r{customize the mode with a hook.}
1055 @end group
1056 @end smallexample
1057
1058 @cindex @file{lisp-mode.el}
1059 The three Lisp modes (Lisp mode, Emacs Lisp mode, and Lisp
1060 Interaction mode) have more features than Text mode and the code is
1061 correspondingly more complicated. Here are excerpts from
1062 @file{lisp-mode.el} that illustrate how these modes are written.
1063
1064 @cindex syntax table example
1065 @smallexample
1066 @group
1067 ;; @r{Create mode-specific table variables.}
1068 (defvar lisp-mode-syntax-table nil "")
1069 (defvar lisp-mode-abbrev-table nil "")
1070 @end group
1071
1072 @group
1073 (defvar emacs-lisp-mode-syntax-table
1074 (let ((table (make-syntax-table)))
1075 (let ((i 0))
1076 @end group
1077
1078 @group
1079 ;; @r{Set syntax of chars up to @samp{0} to say they are}
1080 ;; @r{part of symbol names but not words.}
1081 ;; @r{(The digit @samp{0} is @code{48} in the @acronym{ASCII} character set.)}
1082 (while (< i ?0)
1083 (modify-syntax-entry i "_ " table)
1084 (setq i (1+ i)))
1085 ;; @r{@dots{} similar code follows for other character ranges.}
1086 @end group
1087 @group
1088 ;; @r{Then set the syntax codes for characters that are special in Lisp.}
1089 (modify-syntax-entry ? " " table)
1090 (modify-syntax-entry ?\t " " table)
1091 (modify-syntax-entry ?\f " " table)
1092 (modify-syntax-entry ?\n "> " table)
1093 @end group
1094 @group
1095 ;; @r{Give CR the same syntax as newline, for selective-display.}
1096 (modify-syntax-entry ?\^m "> " table)
1097 (modify-syntax-entry ?\; "< " table)
1098 (modify-syntax-entry ?` "' " table)
1099 (modify-syntax-entry ?' "' " table)
1100 (modify-syntax-entry ?, "' " table)
1101 @end group
1102 @group
1103 ;; @r{@dots{}likewise for many other characters@dots{}}
1104 (modify-syntax-entry ?\( "() " table)
1105 (modify-syntax-entry ?\) ")( " table)
1106 (modify-syntax-entry ?\[ "(] " table)
1107 (modify-syntax-entry ?\] ")[ " table))
1108 table))
1109 @end group
1110 @group
1111 ;; @r{Create an abbrev table for lisp-mode.}
1112 (define-abbrev-table 'lisp-mode-abbrev-table ())
1113 @end group
1114 @end smallexample
1115
1116 The three modes for Lisp share much of their code. For instance,
1117 each calls the following function to set various variables:
1118
1119 @smallexample
1120 @group
1121 (defun lisp-mode-variables (lisp-syntax)
1122 (when lisp-syntax
1123 (set-syntax-table lisp-mode-syntax-table))
1124 (setq local-abbrev-table lisp-mode-abbrev-table)
1125 @dots{}
1126 @end group
1127 @end smallexample
1128
1129 In Lisp and most programming languages, we want the paragraph
1130 commands to treat only blank lines as paragraph separators. And the
1131 modes should understand the Lisp conventions for comments. The rest of
1132 @code{lisp-mode-variables} sets this up:
1133
1134 @smallexample
1135 @group
1136 (set (make-local-variable 'paragraph-start)
1137 (concat page-delimiter "\\|$" ))
1138 (set (make-local-variable 'paragraph-separate)
1139 paragraph-start)
1140 @dots{}
1141 @end group
1142 @group
1143 (set (make-local-variable 'comment-indent-function)
1144 'lisp-comment-indent))
1145 @dots{}
1146 @end group
1147 @end smallexample
1148
1149 Each of the different Lisp modes has a slightly different keymap. For
1150 example, Lisp mode binds @kbd{C-c C-z} to @code{run-lisp}, but the other
1151 Lisp modes do not. However, all Lisp modes have some commands in
1152 common. The following code sets up the common commands:
1153
1154 @smallexample
1155 @group
1156 (defvar shared-lisp-mode-map
1157 (let ((map (make-sparse-keymap)))
1158 (define-key shared-lisp-mode-map "\e\C-q" 'indent-sexp)
1159 (define-key shared-lisp-mode-map "\177"
1160 'backward-delete-char-untabify)
1161 map)
1162 "Keymap for commands shared by all sorts of Lisp modes.")
1163 @end group
1164 @end smallexample
1165
1166 @noindent
1167 And here is the code to set up the keymap for Lisp mode:
1168
1169 @smallexample
1170 @group
1171 (defvar lisp-mode-map
1172 (let ((map (make-sparse-keymap)))
1173 (set-keymap-parent map shared-lisp-mode-map)
1174 (define-key map "\e\C-x" 'lisp-eval-defun)
1175 (define-key map "\C-c\C-z" 'run-lisp)
1176 map)
1177 "Keymap for ordinary Lisp mode...")
1178 @end group
1179 @end smallexample
1180
1181 Finally, here is the complete major mode command definition for Lisp
1182 mode.
1183
1184 @smallexample
1185 @group
1186 (defun lisp-mode ()
1187 "Major mode for editing Lisp code for Lisps other than GNU Emacs Lisp.
1188 Commands:
1189 Delete converts tabs to spaces as it moves back.
1190 Blank lines separate paragraphs. Semicolons start comments.
1191 \\@{lisp-mode-map@}
1192 Note that `run-lisp' may be used either to start an inferior Lisp job
1193 or to switch back to an existing one.
1194 @end group
1195
1196 @group
1197 Entry to this mode calls the value of `lisp-mode-hook'
1198 if that value is non-nil."
1199 (interactive)
1200 (kill-all-local-variables)
1201 @end group
1202 @group
1203 (use-local-map lisp-mode-map) ; @r{Select the mode's keymap.}
1204 (setq major-mode 'lisp-mode) ; @r{This is how @code{describe-mode}}
1205 ; @r{finds out what to describe.}
1206 (setq mode-name "Lisp") ; @r{This goes into the mode line.}
1207 (lisp-mode-variables t) ; @r{This defines various variables.}
1208 (set (make-local-variable 'comment-start-skip)
1209 "\\(\\(^\\|[^\\\\\n]\\)\\(\\\\\\\\\\)*\\)\\(;+\\|#|\\) *")
1210 (set (make-local-variable 'font-lock-keywords-case-fold-search) t)
1211 @end group
1212 @group
1213 (setq imenu-case-fold-search t)
1214 (set-syntax-table lisp-mode-syntax-table)
1215 (run-mode-hooks 'lisp-mode-hook)) ; @r{This permits the user to use a}
1216 ; @r{hook to customize the mode.}
1217 @end group
1218 @end smallexample
1219
1220 @node Minor Modes
1221 @section Minor Modes
1222 @cindex minor mode
1223
1224 A @dfn{minor mode} provides features that users may enable or disable
1225 independently of the choice of major mode. Minor modes can be enabled
1226 individually or in combination. Minor modes would be better named
1227 ``generally available, optional feature modes,'' except that such a name
1228 would be unwieldy.
1229
1230 A minor mode is not usually meant as a variation of a single major mode.
1231 Usually they are general and can apply to many major modes. For
1232 example, Auto Fill mode works with any major mode that permits text
1233 insertion. To be general, a minor mode must be effectively independent
1234 of the things major modes do.
1235
1236 A minor mode is often much more difficult to implement than a major
1237 mode. One reason is that you should be able to activate and deactivate
1238 minor modes in any order. A minor mode should be able to have its
1239 desired effect regardless of the major mode and regardless of the other
1240 minor modes in effect.
1241
1242 Often the biggest problem in implementing a minor mode is finding a
1243 way to insert the necessary hook into the rest of Emacs. Minor mode
1244 keymaps make this easier than it used to be.
1245
1246 @defvar minor-mode-list
1247 The value of this variable is a list of all minor mode commands.
1248 @end defvar
1249
1250 @menu
1251 * Minor Mode Conventions:: Tips for writing a minor mode.
1252 * Keymaps and Minor Modes:: How a minor mode can have its own keymap.
1253 * Defining Minor Modes:: A convenient facility for defining minor modes.
1254 @end menu
1255
1256 @node Minor Mode Conventions
1257 @subsection Conventions for Writing Minor Modes
1258 @cindex minor mode conventions
1259 @cindex conventions for writing minor modes
1260
1261 There are conventions for writing minor modes just as there are for
1262 major modes. Several of the major mode conventions apply to minor
1263 modes as well: those regarding the name of the mode initialization
1264 function, the names of global symbols, the use of a hook at the end of
1265 the initialization function, and the use of keymaps and other tables.
1266
1267 In addition, there are several conventions that are specific to
1268 minor modes. (The easiest way to follow all the conventions is to use
1269 the macro @code{define-minor-mode}; @ref{Defining Minor Modes}.)
1270
1271 @itemize @bullet
1272 @item
1273 @cindex mode variable
1274 Make a variable whose name ends in @samp{-mode} to control the minor
1275 mode. We call this the @dfn{mode variable}. The minor mode command
1276 should set this variable (@code{nil} to disable; anything else to
1277 enable).
1278
1279 If possible, implement the mode so that setting the variable
1280 automatically enables or disables the mode. Then the minor mode command
1281 does not need to do anything except set the variable.
1282
1283 This variable is used in conjunction with the @code{minor-mode-alist} to
1284 display the minor mode name in the mode line. It can also enable
1285 or disable a minor mode keymap. Individual commands or hooks can also
1286 check the variable's value.
1287
1288 If you want the minor mode to be enabled separately in each buffer,
1289 make the variable buffer-local.
1290
1291 @item
1292 Define a command whose name is the same as the mode variable.
1293 Its job is to enable and disable the mode by setting the variable.
1294
1295 The command should accept one optional argument. If the argument is
1296 @code{nil}, it should toggle the mode (turn it on if it is off, and
1297 off if it is on). It should turn the mode on if the argument is a
1298 positive integer, the symbol @code{t}, or a list whose @sc{car} is one
1299 of those. It should turn the mode off if the argument is a negative
1300 integer or zero, the symbol @code{-}, or a list whose @sc{car} is a
1301 negative integer or zero. The meaning of other arguments is not
1302 specified.
1303
1304 Here is an example taken from the definition of @code{transient-mark-mode}.
1305 It shows the use of @code{transient-mark-mode} as a variable that enables or
1306 disables the mode's behavior, and also shows the proper way to toggle,
1307 enable or disable the minor mode based on the raw prefix argument value.
1308
1309 @smallexample
1310 @group
1311 (setq transient-mark-mode
1312 (if (null arg) (not transient-mark-mode)
1313 (> (prefix-numeric-value arg) 0)))
1314 @end group
1315 @end smallexample
1316
1317 @item
1318 Add an element to @code{minor-mode-alist} for each minor mode
1319 (@pxref{Definition of minor-mode-alist}), if you want to indicate the
1320 minor mode in the mode line. This element should be a list of the
1321 following form:
1322
1323 @smallexample
1324 (@var{mode-variable} @var{string})
1325 @end smallexample
1326
1327 Here @var{mode-variable} is the variable that controls enabling of the
1328 minor mode, and @var{string} is a short string, starting with a space,
1329 to represent the mode in the mode line. These strings must be short so
1330 that there is room for several of them at once.
1331
1332 When you add an element to @code{minor-mode-alist}, use @code{assq} to
1333 check for an existing element, to avoid duplication. For example:
1334
1335 @smallexample
1336 @group
1337 (unless (assq 'leif-mode minor-mode-alist)
1338 (setq minor-mode-alist
1339 (cons '(leif-mode " Leif") minor-mode-alist)))
1340 @end group
1341 @end smallexample
1342
1343 @noindent
1344 or like this, using @code{add-to-list} (@pxref{List Variables}):
1345
1346 @smallexample
1347 @group
1348 (add-to-list 'minor-mode-alist '(leif-mode " Leif"))
1349 @end group
1350 @end smallexample
1351 @end itemize
1352
1353 Global minor modes distributed with Emacs should if possible support
1354 enabling and disabling via Custom (@pxref{Customization}). To do this,
1355 the first step is to define the mode variable with @code{defcustom}, and
1356 specify @code{:type 'boolean}.
1357
1358 If just setting the variable is not sufficient to enable the mode, you
1359 should also specify a @code{:set} method which enables the mode by
1360 invoking the mode command. Note in the variable's documentation string that
1361 setting the variable other than via Custom may not take effect.
1362
1363 Also mark the definition with an autoload cookie (@pxref{autoload cookie}),
1364 and specify a @code{:require} so that customizing the variable will load
1365 the library that defines the mode. This will copy suitable definitions
1366 into @file{loaddefs.el} so that users can use @code{customize-option} to
1367 enable the mode. For example:
1368
1369 @smallexample
1370 @group
1371
1372 ;;;###autoload
1373 (defcustom msb-mode nil
1374 "Toggle msb-mode.
1375 Setting this variable directly does not take effect;
1376 use either \\[customize] or the function `msb-mode'."
1377 :set 'custom-set-minor-mode
1378 :initialize 'custom-initialize-default
1379 :version "20.4"
1380 :type 'boolean
1381 :group 'msb
1382 :require 'msb)
1383 @end group
1384 @end smallexample
1385
1386 @node Keymaps and Minor Modes
1387 @subsection Keymaps and Minor Modes
1388
1389 Each minor mode can have its own keymap, which is active when the mode
1390 is enabled. To set up a keymap for a minor mode, add an element to the
1391 alist @code{minor-mode-map-alist}. @xref{Definition of minor-mode-map-alist}.
1392
1393 @cindex @code{self-insert-command}, minor modes
1394 One use of minor mode keymaps is to modify the behavior of certain
1395 self-inserting characters so that they do something else as well as
1396 self-insert. In general, this is the only way to do that, since the
1397 facilities for customizing @code{self-insert-command} are limited to
1398 special cases (designed for abbrevs and Auto Fill mode). (Do not try
1399 substituting your own definition of @code{self-insert-command} for the
1400 standard one. The editor command loop handles this function specially.)
1401
1402 The key sequences bound in a minor mode should consist of @kbd{C-c}
1403 followed by one of @kbd{.,/?`'"[]\|~!#$%^&*()-_+=}. (The other
1404 punctuation characters are reserved for major modes.)
1405
1406 @node Defining Minor Modes
1407 @subsection Defining Minor Modes
1408
1409 The macro @code{define-minor-mode} offers a convenient way of
1410 implementing a mode in one self-contained definition.
1411
1412 @defmac define-minor-mode mode doc [init-value [lighter [keymap]]] keyword-args@dots{} body@dots{}
1413 This macro defines a new minor mode whose name is @var{mode} (a
1414 symbol). It defines a command named @var{mode} to toggle the minor
1415 mode, with @var{doc} as its documentation string. It also defines a
1416 variable named @var{mode}, which is set to @code{t} or @code{nil} by
1417 enabling or disabling the mode. The variable is initialized to
1418 @var{init-value}. Except in unusual circumstances (see below), this
1419 value must be @code{nil}.
1420
1421 The string @var{lighter} says what to display in the mode line
1422 when the mode is enabled; if it is @code{nil}, the mode is not displayed
1423 in the mode line.
1424
1425 The optional argument @var{keymap} specifies the keymap for the minor
1426 mode. If non-@code{nil}, it should be a variable name (whose value is
1427 a keymap), a keymap, or an alist of the form
1428
1429 @example
1430 (@var{key-sequence} . @var{definition})
1431 @end example
1432
1433 @noindent
1434 where each @var{key-sequence} and @var{definition} are arguments
1435 suitable for passing to @code{define-key} (@pxref{Changing Key
1436 Bindings}). If @var{keymap} is a keymap or an alist, this also
1437 defines the variable @code{@var{mode}-map}.
1438
1439 The above three arguments @var{init-value}, @var{lighter}, and
1440 @var{keymap} can be (partially) omitted when @var{keyword-args} are
1441 used. The @var{keyword-args} consist of keywords followed by
1442 corresponding values. A few keywords have special meanings:
1443
1444 @table @code
1445 @item :group @var{group}
1446 Custom group name to use in all generated @code{defcustom} forms.
1447 Defaults to @var{mode} without the possible trailing @samp{-mode}.
1448 @strong{Warning:} don't use this default group name unless you have
1449 written a @code{defgroup} to define that group properly. @xref{Group
1450 Definitions}.
1451
1452 @item :global @var{global}
1453 If non-@code{nil}, this specifies that the minor mode should be global
1454 rather than buffer-local. It defaults to @code{nil}.
1455
1456 One of the effects of making a minor mode global is that the
1457 @var{mode} variable becomes a customization variable. Toggling it
1458 through the Custom interface turns the mode on and off, and its value
1459 can be saved for future Emacs sessions (@pxref{Saving
1460 Customizations,,, emacs, The GNU Emacs Manual}. For the saved
1461 variable to work, you should ensure that the @code{define-minor-mode}
1462 form is evaluated each time Emacs starts; for packages that are not
1463 part of Emacs, the easiest way to do this is to specify a
1464 @code{:require} keyword.
1465
1466 @item :init-value @var{init-value}
1467 This is equivalent to specifying @var{init-value} positionally.
1468
1469 @item :lighter @var{lighter}
1470 This is equivalent to specifying @var{lighter} positionally.
1471
1472 @item :keymap @var{keymap}
1473 This is equivalent to specifying @var{keymap} positionally.
1474 @end table
1475
1476 Any other keyword arguments are passed directly to the
1477 @code{defcustom} generated for the variable @var{mode}.
1478
1479 The command named @var{mode} first performs the standard actions such
1480 as setting the variable named @var{mode} and then executes the
1481 @var{body} forms, if any. It finishes by running the mode hook
1482 variable @code{@var{mode}-hook}.
1483 @end defmac
1484
1485 The initial value must be @code{nil} except in cases where (1) the
1486 mode is preloaded in Emacs, or (2) it is painless for loading to
1487 enable the mode even though the user did not request it. For
1488 instance, if the mode has no effect unless something else is enabled,
1489 and will always be loaded by that time, enabling it by default is
1490 harmless. But these are unusual circumstances. Normally, the
1491 initial value must be @code{nil}.
1492
1493 @findex easy-mmode-define-minor-mode
1494 The name @code{easy-mmode-define-minor-mode} is an alias
1495 for this macro.
1496
1497 Here is an example of using @code{define-minor-mode}:
1498
1499 @smallexample
1500 (define-minor-mode hungry-mode
1501 "Toggle Hungry mode.
1502 With no argument, this command toggles the mode.
1503 Non-null prefix argument turns on the mode.
1504 Null prefix argument turns off the mode.
1505
1506 When Hungry mode is enabled, the control delete key
1507 gobbles all preceding whitespace except the last.
1508 See the command \\[hungry-electric-delete]."
1509 ;; The initial value.
1510 nil
1511 ;; The indicator for the mode line.
1512 " Hungry"
1513 ;; The minor mode bindings.
1514 '(([C-backspace] . hungry-electric-delete))
1515 :group 'hunger)
1516 @end smallexample
1517
1518 @noindent
1519 This defines a minor mode named ``Hungry mode,'' a command named
1520 @code{hungry-mode} to toggle it, a variable named @code{hungry-mode}
1521 which indicates whether the mode is enabled, and a variable named
1522 @code{hungry-mode-map} which holds the keymap that is active when the
1523 mode is enabled. It initializes the keymap with a key binding for
1524 @kbd{C-@key{DEL}}. It puts the variable @code{hungry-mode} into
1525 custom group @code{hunger}. There are no @var{body} forms---many
1526 minor modes don't need any.
1527
1528 Here's an equivalent way to write it:
1529
1530 @smallexample
1531 (define-minor-mode hungry-mode
1532 "Toggle Hungry mode.
1533 With no argument, this command toggles the mode.
1534 Non-null prefix argument turns on the mode.
1535 Null prefix argument turns off the mode.
1536
1537 When Hungry mode is enabled, the control delete key
1538 gobbles all preceding whitespace except the last.
1539 See the command \\[hungry-electric-delete]."
1540 ;; The initial value.
1541 :init-value nil
1542 ;; The indicator for the mode line.
1543 :lighter " Hungry"
1544 ;; The minor mode bindings.
1545 :keymap
1546 '(([C-backspace] . hungry-electric-delete)
1547 ([C-M-backspace]
1548 . (lambda ()
1549 (interactive)
1550 (hungry-electric-delete t))))
1551 :group 'hunger)
1552 @end smallexample
1553
1554 @defmac define-globalized-minor-mode global-mode mode turn-on keyword-args@dots{}
1555 This defines a global toggle named @var{global-mode} whose meaning is
1556 to enable or disable the buffer-local minor mode @var{mode} in all
1557 buffers. To turn on the minor mode in a buffer, it uses the function
1558 @var{turn-on}; to turn off the minor mode, it calls @code{mode} with
1559 @minus{}1 as argument.
1560
1561 Globally enabling the mode also affects buffers subsequently created
1562 by visiting files, and buffers that use a major mode other than
1563 Fundamental mode; but it does not detect the creation of a new buffer
1564 in Fundamental mode.
1565
1566 This defines the customization option @var{global-mode} (@pxref{Customization}),
1567 which can be toggled in the Custom interface to turn the minor mode on
1568 and off. As with @code{define-minor-mode}, you should ensure that the
1569 @code{define-globalized-minor-mode} form is evaluated each time Emacs
1570 starts, for example by providing a @code{:require} keyword.
1571
1572 Use @code{:group @var{group}} in @var{keyword-args} to specify the
1573 custom group for the mode variable of the global minor mode.
1574 @end defmac
1575
1576 @node Mode Line Format
1577 @section Mode-Line Format
1578 @cindex mode line
1579
1580 Each Emacs window (aside from minibuffer windows) typically has a mode
1581 line at the bottom, which displays status information about the buffer
1582 displayed in the window. The mode line contains information about the
1583 buffer, such as its name, associated file, depth of recursive editing,
1584 and major and minor modes. A window can also have a @dfn{header
1585 line}, which is much like the mode line but appears at the top of the
1586 window.
1587
1588 This section describes how to control the contents of the mode line
1589 and header line. We include it in this chapter because much of the
1590 information displayed in the mode line relates to the enabled major and
1591 minor modes.
1592
1593 @menu
1594 * Base: Mode Line Basics. Basic ideas of mode line control.
1595 * Data: Mode Line Data. The data structure that controls the mode line.
1596 * Top: Mode Line Top. The top level variable, mode-line-format.
1597 * Mode Line Variables:: Variables used in that data structure.
1598 * %-Constructs:: Putting information into a mode line.
1599 * Properties in Mode:: Using text properties in the mode line.
1600 * Header Lines:: Like a mode line, but at the top.
1601 * Emulating Mode Line:: Formatting text as the mode line would.
1602 @end menu
1603
1604 @node Mode Line Basics
1605 @subsection Mode Line Basics
1606
1607 @code{mode-line-format} is a buffer-local variable that holds a
1608 @dfn{mode line construct}, a kind of template, which controls what is
1609 displayed on the mode line of the current buffer. The value of
1610 @code{header-line-format} specifies the buffer's header line in the
1611 same way. All windows for the same buffer use the same
1612 @code{mode-line-format} and @code{header-line-format}.
1613
1614 For efficiency, Emacs does not continuously recompute the mode
1615 line and header line of a window. It does so when circumstances
1616 appear to call for it---for instance, if you change the window
1617 configuration, switch buffers, narrow or widen the buffer, scroll, or
1618 change the buffer's modification status. If you modify any of the
1619 variables referenced by @code{mode-line-format} (@pxref{Mode Line
1620 Variables}), or any other variables and data structures that affect
1621 how text is displayed (@pxref{Display}), you may want to force an
1622 update of the mode line so as to display the new information or
1623 display it in the new way.
1624
1625 @defun force-mode-line-update &optional all
1626 Force redisplay of the current buffer's mode line and header line.
1627 The next redisplay will update the mode line and header line based on
1628 the latest values of all relevant variables. With optional
1629 non-@code{nil} @var{all}, force redisplay of all mode lines and header
1630 lines.
1631
1632 This function also forces recomputation of the menu bar menus
1633 and the frame title.
1634 @end defun
1635
1636 The selected window's mode line is usually displayed in a different
1637 color using the face @code{mode-line}. Other windows' mode lines
1638 appear in the face @code{mode-line-inactive} instead. @xref{Faces}.
1639
1640 @node Mode Line Data
1641 @subsection The Data Structure of the Mode Line
1642 @cindex mode-line construct
1643
1644 The mode-line contents are controlled by a data structure called a
1645 @dfn{mode-line construct}, made up of lists, strings, symbols, and
1646 numbers kept in buffer-local variables. Each data type has a specific
1647 meaning for the mode-line appearance, as described below. The same
1648 data structure is used for constructing frame titles (@pxref{Frame
1649 Titles}) and header lines (@pxref{Header Lines}).
1650
1651 A mode-line construct may be as simple as a fixed string of text,
1652 but it usually specifies how to combine fixed strings with variables'
1653 values to construct the text. Many of these variables are themselves
1654 defined to have mode-line constructs as their values.
1655
1656 Here are the meanings of various data types as mode-line constructs:
1657
1658 @table @code
1659 @cindex percent symbol in mode line
1660 @item @var{string}
1661 A string as a mode-line construct appears verbatim except for
1662 @dfn{@code{%}-constructs} in it. These stand for substitution of
1663 other data; see @ref{%-Constructs}.
1664
1665 If parts of the string have @code{face} properties, they control
1666 display of the text just as they would text in the buffer. Any
1667 characters which have no @code{face} properties are displayed, by
1668 default, in the face @code{mode-line} or @code{mode-line-inactive}
1669 (@pxref{Standard Faces,,, emacs, The GNU Emacs Manual}). The
1670 @code{help-echo} and @code{local-map} properties in @var{string} have
1671 special meanings. @xref{Properties in Mode}.
1672
1673 @item @var{symbol}
1674 A symbol as a mode-line construct stands for its value. The value of
1675 @var{symbol} is used as a mode-line construct, in place of @var{symbol}.
1676 However, the symbols @code{t} and @code{nil} are ignored, as is any
1677 symbol whose value is void.
1678
1679 There is one exception: if the value of @var{symbol} is a string, it is
1680 displayed verbatim: the @code{%}-constructs are not recognized.
1681
1682 Unless @var{symbol} is marked as ``risky'' (i.e., it has a
1683 non-@code{nil} @code{risky-local-variable} property), all text
1684 properties specified in @var{symbol}'s value are ignored. This
1685 includes the text properties of strings in @var{symbol}'s value, as
1686 well as all @code{:eval} and @code{:propertize} forms in it. (The
1687 reason for this is security: non-risky variables could be set
1688 automatically from file variables without prompting the user.)
1689
1690 @item (@var{string} @var{rest}@dots{})
1691 @itemx (@var{list} @var{rest}@dots{})
1692 A list whose first element is a string or list means to process all the
1693 elements recursively and concatenate the results. This is the most
1694 common form of mode-line construct.
1695
1696 @item (:eval @var{form})
1697 A list whose first element is the symbol @code{:eval} says to evaluate
1698 @var{form}, and use the result as a string to display. Make sure this
1699 evaluation cannot load any files, as doing so could cause infinite
1700 recursion.
1701
1702 @item (:propertize @var{elt} @var{props}@dots{})
1703 A list whose first element is the symbol @code{:propertize} says to
1704 process the mode-line construct @var{elt} recursively, then add the text
1705 properties specified by @var{props} to the result. The argument
1706 @var{props} should consist of zero or more pairs @var{text-property}
1707 @var{value}. (This feature is new as of Emacs 22.1.)
1708
1709 @item (@var{symbol} @var{then} @var{else})
1710 A list whose first element is a symbol that is not a keyword specifies
1711 a conditional. Its meaning depends on the value of @var{symbol}. If
1712 @var{symbol} has a non-@code{nil} value, the second element,
1713 @var{then}, is processed recursively as a mode-line element.
1714 Otherwise, the third element, @var{else}, is processed recursively.
1715 You may omit @var{else}; then the mode-line element displays nothing
1716 if the value of @var{symbol} is @code{nil} or void.
1717
1718 @item (@var{width} @var{rest}@dots{})
1719 A list whose first element is an integer specifies truncation or
1720 padding of the results of @var{rest}. The remaining elements
1721 @var{rest} are processed recursively as mode-line constructs and
1722 concatenated together. When @var{width} is positive, the result is
1723 space filled on the right if its width is less than @var{width}. When
1724 @var{width} is negative, the result is truncated on the right to
1725 @minus{}@var{width} columns if its width exceeds @minus{}@var{width}.
1726
1727 For example, the usual way to show what percentage of a buffer is above
1728 the top of the window is to use a list like this: @code{(-3 "%p")}.
1729 @end table
1730
1731 @node Mode Line Top
1732 @subsection The Top Level of Mode Line Control
1733
1734 The variable in overall control of the mode line is
1735 @code{mode-line-format}.
1736
1737 @defopt mode-line-format
1738 The value of this variable is a mode-line construct that controls the
1739 contents of the mode-line. It is always buffer-local in all buffers.
1740
1741 If you set this variable to @code{nil} in a buffer, that buffer does
1742 not have a mode line. (A window that is just one line tall never
1743 displays a mode line.)
1744 @end defopt
1745
1746 The default value of @code{mode-line-format} is designed to use the
1747 values of other variables such as @code{mode-line-position} and
1748 @code{mode-line-modes} (which in turn incorporates the values of the
1749 variables @code{mode-name} and @code{minor-mode-alist}). Very few
1750 modes need to alter @code{mode-line-format} itself. For most
1751 purposes, it is sufficient to alter some of the variables that
1752 @code{mode-line-format} either directly or indirectly refers to.
1753
1754 If you do alter @code{mode-line-format} itself, the new value should
1755 use the same variables that appear in the default value (@pxref{Mode
1756 Line Variables}), rather than duplicating their contents or displaying
1757 the information in another fashion. This way, customizations made by
1758 the user or by Lisp programs (such as @code{display-time} and major
1759 modes) via changes to those variables remain effective.
1760
1761 Here is an example of a @code{mode-line-format} that might be
1762 useful for @code{shell-mode}, since it contains the host name and default
1763 directory.
1764
1765 @example
1766 @group
1767 (setq mode-line-format
1768 (list "-"
1769 'mode-line-mule-info
1770 'mode-line-modified
1771 'mode-line-frame-identification
1772 "%b--"
1773 @end group
1774 @group
1775 ;; @r{Note that this is evaluated while making the list.}
1776 ;; @r{It makes a mode-line construct which is just a string.}
1777 (getenv "HOST")
1778 @end group
1779 ":"
1780 'default-directory
1781 " "
1782 'global-mode-string
1783 " %[("
1784 '(:eval (mode-line-mode-name))
1785 'mode-line-process
1786 'minor-mode-alist
1787 "%n"
1788 ")%]--"
1789 @group
1790 '(which-func-mode ("" which-func-format "--"))
1791 '(line-number-mode "L%l--")
1792 '(column-number-mode "C%c--")
1793 '(-3 "%p")
1794 "-%-"))
1795 @end group
1796 @end example
1797
1798 @noindent
1799 (The variables @code{line-number-mode}, @code{column-number-mode}
1800 and @code{which-func-mode} enable particular minor modes; as usual,
1801 these variable names are also the minor mode command names.)
1802
1803 @node Mode Line Variables
1804 @subsection Variables Used in the Mode Line
1805
1806 This section describes variables incorporated by the standard value
1807 of @code{mode-line-format} into the text of the mode line. There is
1808 nothing inherently special about these variables; any other variables
1809 could have the same effects on the mode line if
1810 @code{mode-line-format}'s value were changed to use them. However,
1811 various parts of Emacs set these variables on the understanding that
1812 they will control parts of the mode line; therefore, practically
1813 speaking, it is essential for the mode line to use them.
1814
1815 @defvar mode-line-mule-info
1816 This variable holds the value of the mode-line construct that displays
1817 information about the language environment, buffer coding system, and
1818 current input method. @xref{Non-ASCII Characters}.
1819 @end defvar
1820
1821 @defvar mode-line-modified
1822 This variable holds the value of the mode-line construct that displays
1823 whether the current buffer is modified. Its default value displays
1824 @samp{**} if the buffer is modified, @samp{--} if the buffer is not
1825 modified, @samp{%%} if the buffer is read only, and @samp{%*} if the
1826 buffer is read only and modified.
1827
1828 Changing this variable does not force an update of the mode line.
1829 @end defvar
1830
1831 @defvar mode-line-frame-identification
1832 This variable identifies the current frame. Its default value
1833 displays @code{" "} if you are using a window system which can show
1834 multiple frames, or @code{"-%F "} on an ordinary terminal which shows
1835 only one frame at a time.
1836 @end defvar
1837
1838 @defvar mode-line-buffer-identification
1839 This variable identifies the buffer being displayed in the window.
1840 Its default value displays the buffer name, padded with spaces to at
1841 least 12 columns.
1842 @end defvar
1843
1844 @defopt mode-line-position
1845 This variable indicates the position in the buffer. Its default value
1846 displays the buffer percentage and, optionally, the buffer size, the
1847 line number and the column number.
1848 @end defopt
1849
1850 @defvar vc-mode
1851 The variable @code{vc-mode}, buffer-local in each buffer, records
1852 whether the buffer's visited file is maintained with version control,
1853 and, if so, which kind. Its value is a string that appears in the mode
1854 line, or @code{nil} for no version control.
1855 @end defvar
1856
1857 @defopt mode-line-modes
1858 This variable displays the buffer's major and minor modes. Its
1859 default value also displays the recursive editing level, information
1860 on the process status, and whether narrowing is in effect.
1861 @end defopt
1862
1863 @defopt mode-line-remote
1864 This variable is used to show whether @code{default-directory} for the
1865 current buffer is remote.
1866 @end defopt
1867
1868 @defopt mode-line-client
1869 This variable is used to identify @code{emacsclient} frames.
1870 @end defopt
1871
1872 The following three variables are used in @code{mode-line-modes}:
1873
1874 @defvar mode-name
1875 This buffer-local variable holds the ``pretty'' name of the current
1876 buffer's major mode. Each major mode should set this variable so that
1877 the mode name will appear in the mode line. The value does not have
1878 to be a string, but can use any of the data types valid in a mode-line
1879 construct (@pxref{Mode Line Data}). To compute the string that will
1880 identify the mode name in the mode line, use @code{format-mode-line}
1881 (@pxref{Emulating Mode Line}).
1882 @end defvar
1883
1884 @defvar mode-line-process
1885 This buffer-local variable contains the mode-line information on process
1886 status in modes used for communicating with subprocesses. It is
1887 displayed immediately following the major mode name, with no intervening
1888 space. For example, its value in the @samp{*shell*} buffer is
1889 @code{(":%s")}, which allows the shell to display its status along
1890 with the major mode as: @samp{(Shell:run)}. Normally this variable
1891 is @code{nil}.
1892 @end defvar
1893
1894 @defvar minor-mode-alist
1895 @anchor{Definition of minor-mode-alist}
1896 This variable holds an association list whose elements specify how the
1897 mode line should indicate that a minor mode is active. Each element of
1898 the @code{minor-mode-alist} should be a two-element list:
1899
1900 @example
1901 (@var{minor-mode-variable} @var{mode-line-string})
1902 @end example
1903
1904 More generally, @var{mode-line-string} can be any mode-line spec. It
1905 appears in the mode line when the value of @var{minor-mode-variable}
1906 is non-@code{nil}, and not otherwise. These strings should begin with
1907 spaces so that they don't run together. Conventionally, the
1908 @var{minor-mode-variable} for a specific mode is set to a
1909 non-@code{nil} value when that minor mode is activated.
1910
1911 @code{minor-mode-alist} itself is not buffer-local. Each variable
1912 mentioned in the alist should be buffer-local if its minor mode can be
1913 enabled separately in each buffer.
1914 @end defvar
1915
1916 @defvar global-mode-string
1917 This variable holds a mode-line spec that, by default, appears in the
1918 mode line just after the @code{which-func-mode} minor mode if set,
1919 else after @code{mode-line-modes}. The command @code{display-time}
1920 sets @code{global-mode-string} to refer to the variable
1921 @code{display-time-string}, which holds a string containing the time
1922 and load information.
1923
1924 The @samp{%M} construct substitutes the value of
1925 @code{global-mode-string}, but that is obsolete, since the variable is
1926 included in the mode line from @code{mode-line-format}.
1927 @end defvar
1928
1929 Here is a simplified version of the default value of
1930 @code{mode-line-format}. The real default value also
1931 specifies addition of text properties.
1932
1933 @example
1934 @group
1935 ("-"
1936 mode-line-mule-info
1937 mode-line-modified
1938 mode-line-frame-identification
1939 mode-line-buffer-identification
1940 @end group
1941 " "
1942 mode-line-position
1943 (vc-mode vc-mode)
1944 " "
1945 @group
1946 mode-line-modes
1947 (which-func-mode ("" which-func-format "--"))
1948 (global-mode-string ("--" global-mode-string))
1949 "-%-")
1950 @end group
1951 @end example
1952
1953 @node %-Constructs
1954 @subsection @code{%}-Constructs in the Mode Line
1955
1956 Strings used as mode-line constructs can use certain
1957 @code{%}-constructs to substitute various kinds of data. Here is a
1958 list of the defined @code{%}-constructs, and what they mean. In any
1959 construct except @samp{%%}, you can add a decimal integer after the
1960 @samp{%} to specify a minimum field width. If the width is less, the
1961 field is padded with spaces to the right.
1962
1963 @table @code
1964 @item %b
1965 The current buffer name, obtained with the @code{buffer-name} function.
1966 @xref{Buffer Names}.
1967
1968 @item %c
1969 The current column number of point.
1970
1971 @item %e
1972 When Emacs is nearly out of memory for Lisp objects, a brief message
1973 saying so. Otherwise, this is empty.
1974
1975 @item %f
1976 The visited file name, obtained with the @code{buffer-file-name}
1977 function. @xref{Buffer File Name}.
1978
1979 @item %F
1980 The title (only on a window system) or the name of the selected frame.
1981 @xref{Basic Parameters}.
1982
1983 @item %i
1984 The size of the accessible part of the current buffer; basically
1985 @code{(- (point-max) (point-min))}.
1986
1987 @item %I
1988 Like @samp{%i}, but the size is printed in a more readable way by using
1989 @samp{k} for 10^3, @samp{M} for 10^6, @samp{G} for 10^9, etc., to
1990 abbreviate.
1991
1992 @item %l
1993 The current line number of point, counting within the accessible portion
1994 of the buffer.
1995
1996 @item %n
1997 @samp{Narrow} when narrowing is in effect; nothing otherwise (see
1998 @code{narrow-to-region} in @ref{Narrowing}).
1999
2000 @item %p
2001 The percentage of the buffer text above the @strong{top} of window, or
2002 @samp{Top}, @samp{Bottom} or @samp{All}. Note that the default
2003 mode-line specification truncates this to three characters.
2004
2005 @item %P
2006 The percentage of the buffer text that is above the @strong{bottom} of
2007 the window (which includes the text visible in the window, as well as
2008 the text above the top), plus @samp{Top} if the top of the buffer is
2009 visible on screen; or @samp{Bottom} or @samp{All}.
2010
2011 @item %s
2012 The status of the subprocess belonging to the current buffer, obtained with
2013 @code{process-status}. @xref{Process Information}.
2014
2015 @item %t
2016 Whether the visited file is a text file or a binary file. This is a
2017 meaningful distinction only on certain operating systems (@pxref{MS-DOS
2018 File Types}).
2019
2020 @item %z
2021 The mnemonics of keyboard, terminal, and buffer coding systems.
2022
2023 @item %Z
2024 Like @samp{%z}, but including the end-of-line format.
2025
2026 @item %*
2027 @samp{%} if the buffer is read only (see @code{buffer-read-only}); @*
2028 @samp{*} if the buffer is modified (see @code{buffer-modified-p}); @*
2029 @samp{-} otherwise. @xref{Buffer Modification}.
2030
2031 @item %+
2032 @samp{*} if the buffer is modified (see @code{buffer-modified-p}); @*
2033 @samp{%} if the buffer is read only (see @code{buffer-read-only}); @*
2034 @samp{-} otherwise. This differs from @samp{%*} only for a modified
2035 read-only buffer. @xref{Buffer Modification}.
2036
2037 @item %&
2038 @samp{*} if the buffer is modified, and @samp{-} otherwise.
2039
2040 @item %[
2041 An indication of the depth of recursive editing levels (not counting
2042 minibuffer levels): one @samp{[} for each editing level.
2043 @xref{Recursive Editing}.
2044
2045 @item %]
2046 One @samp{]} for each recursive editing level (not counting minibuffer
2047 levels).
2048
2049 @item %-
2050 Dashes sufficient to fill the remainder of the mode line.
2051
2052 @item %%
2053 The character @samp{%}---this is how to include a literal @samp{%} in a
2054 string in which @code{%}-constructs are allowed.
2055 @end table
2056
2057 The following two @code{%}-constructs are still supported, but they are
2058 obsolete, since you can get the same results with the variables
2059 @code{mode-name} and @code{global-mode-string}.
2060
2061 @table @code
2062 @item %m
2063 The value of @code{mode-name}.
2064
2065 @item %M
2066 The value of @code{global-mode-string}.
2067 @end table
2068
2069 @node Properties in Mode
2070 @subsection Properties in the Mode Line
2071 @cindex text properties in the mode line
2072
2073 Certain text properties are meaningful in the
2074 mode line. The @code{face} property affects the appearance of text; the
2075 @code{help-echo} property associates help strings with the text, and
2076 @code{local-map} can make the text mouse-sensitive.
2077
2078 There are four ways to specify text properties for text in the mode
2079 line:
2080
2081 @enumerate
2082 @item
2083 Put a string with a text property directly into the mode-line data
2084 structure.
2085
2086 @item
2087 Put a text property on a mode-line %-construct such as @samp{%12b}; then
2088 the expansion of the %-construct will have that same text property.
2089
2090 @item
2091 Use a @code{(:propertize @var{elt} @var{props}@dots{})} construct to
2092 give @var{elt} a text property specified by @var{props}.
2093
2094 @item
2095 Use a list containing @code{:eval @var{form}} in the mode-line data
2096 structure, and make @var{form} evaluate to a string that has a text
2097 property.
2098 @end enumerate
2099
2100 You can use the @code{local-map} property to specify a keymap. This
2101 keymap only takes real effect for mouse clicks; binding character keys
2102 and function keys to it has no effect, since it is impossible to move
2103 point into the mode line.
2104
2105 When the mode line refers to a variable which does not have a
2106 non-@code{nil} @code{risky-local-variable} property, any text
2107 properties given or specified within that variable's values are
2108 ignored. This is because such properties could otherwise specify
2109 functions to be called, and those functions could come from file
2110 local variables.
2111
2112 @node Header Lines
2113 @subsection Window Header Lines
2114 @cindex header line (of a window)
2115 @cindex window header line
2116
2117 A window can have a @dfn{header line} at the
2118 top, just as it can have a mode line at the bottom. The header line
2119 feature works just like the mode-line feature, except that it's
2120 controlled by different variables.
2121
2122 @defvar header-line-format
2123 This variable, local in every buffer, specifies how to display the
2124 header line, for windows displaying the buffer. The format of the value
2125 is the same as for @code{mode-line-format} (@pxref{Mode Line Data}).
2126 It is normally @code{nil}, so that ordinary buffers have no header line.
2127 @end defvar
2128
2129 A window that is just one line tall never displays a header line. A
2130 window that is two lines tall cannot display both a mode line and a
2131 header line at once; if it has a mode line, then it does not display a
2132 header line.
2133
2134 @node Emulating Mode Line
2135 @subsection Emulating Mode-Line Formatting
2136
2137 You can use the function @code{format-mode-line} to compute
2138 the text that would appear in a mode line or header line
2139 based on a certain mode-line specification.
2140
2141 @defun format-mode-line format &optional face window buffer
2142 This function formats a line of text according to @var{format} as if it
2143 were generating the mode line for @var{window}, but it also returns the
2144 text as a string. The argument @var{window} defaults to the selected
2145 window. If @var{buffer} is non-@code{nil}, all the information used is
2146 taken from @var{buffer}; by default, it comes from @var{window}'s
2147 buffer.
2148
2149 The value string normally has text properties that correspond to the
2150 faces, keymaps, etc., that the mode line would have. Any character for
2151 which no @code{face} property is specified by @var{format} gets a
2152 default value determined by @var{face}. If @var{face} is @code{t}, that
2153 stands for either @code{mode-line} if @var{window} is selected,
2154 otherwise @code{mode-line-inactive}. If @var{face} is @code{nil} or
2155 omitted, that stands for the default face. If @var{face} is an integer,
2156 the value returned by this function will have no text properties.
2157
2158 You can also specify other valid faces as the value of @var{face}.
2159 If specified, that face provides the @code{face} property for characters
2160 whose face is not specified by @var{format}.
2161
2162 Note that using @code{mode-line}, @code{mode-line-inactive}, or
2163 @code{header-line} as @var{face} will actually redisplay the mode line
2164 or the header line, respectively, using the current definitions of the
2165 corresponding face, in addition to returning the formatted string.
2166 (Other faces do not cause redisplay.)
2167
2168 For example, @code{(format-mode-line header-line-format)} returns the
2169 text that would appear in the selected window's header line (@code{""}
2170 if it has no header line). @code{(format-mode-line header-line-format
2171 'header-line)} returns the same text, with each character
2172 carrying the face that it will have in the header line itself, and also
2173 redraws the header line.
2174 @end defun
2175
2176 @node Imenu
2177 @section Imenu
2178
2179 @cindex Imenu
2180 @dfn{Imenu} is a feature that lets users select a definition or
2181 section in the buffer, from a menu which lists all of them, to go
2182 directly to that location in the buffer. Imenu works by constructing
2183 a buffer index which lists the names and buffer positions of the
2184 definitions, or other named portions of the buffer; then the user can
2185 choose one of them and move point to it. Major modes can add a menu
2186 bar item to use Imenu using @code{imenu-add-to-menubar}.
2187
2188 @defun imenu-add-to-menubar name
2189 This function defines a local menu bar item named @var{name}
2190 to run Imenu.
2191 @end defun
2192
2193 The user-level commands for using Imenu are described in the Emacs
2194 Manual (@pxref{Imenu,, Imenu, emacs, the Emacs Manual}). This section
2195 explains how to customize Imenu's method of finding definitions or
2196 buffer portions for a particular major mode.
2197
2198 The usual and simplest way is to set the variable
2199 @code{imenu-generic-expression}:
2200
2201 @defvar imenu-generic-expression
2202 This variable, if non-@code{nil}, is a list that specifies regular
2203 expressions for finding definitions for Imenu. Simple elements of
2204 @code{imenu-generic-expression} look like this:
2205
2206 @example
2207 (@var{menu-title} @var{regexp} @var{index})
2208 @end example
2209
2210 Here, if @var{menu-title} is non-@code{nil}, it says that the matches
2211 for this element should go in a submenu of the buffer index;
2212 @var{menu-title} itself specifies the name for the submenu. If
2213 @var{menu-title} is @code{nil}, the matches for this element go directly
2214 in the top level of the buffer index.
2215
2216 The second item in the list, @var{regexp}, is a regular expression
2217 (@pxref{Regular Expressions}); anything in the buffer that it matches
2218 is considered a definition, something to mention in the buffer index.
2219 The third item, @var{index}, is a non-negative integer that indicates
2220 which subexpression in @var{regexp} matches the definition's name.
2221
2222 An element can also look like this:
2223
2224 @example
2225 (@var{menu-title} @var{regexp} @var{index} @var{function} @var{arguments}@dots{})
2226 @end example
2227
2228 Each match for this element creates an index item, and when the index
2229 item is selected by the user, it calls @var{function} with arguments
2230 consisting of the item name, the buffer position, and @var{arguments}.
2231
2232 For Emacs Lisp mode, @code{imenu-generic-expression} could look like
2233 this:
2234
2235 @c should probably use imenu-syntax-alist and \\sw rather than [-A-Za-z0-9+]
2236 @example
2237 @group
2238 ((nil "^\\s-*(def\\(un\\|subst\\|macro\\|advice\\)\
2239 \\s-+\\([-A-Za-z0-9+]+\\)" 2)
2240 @end group
2241 @group
2242 ("*Vars*" "^\\s-*(def\\(var\\|const\\)\
2243 \\s-+\\([-A-Za-z0-9+]+\\)" 2)
2244 @end group
2245 @group
2246 ("*Types*"
2247 "^\\s-*\
2248 (def\\(type\\|struct\\|class\\|ine-condition\\)\
2249 \\s-+\\([-A-Za-z0-9+]+\\)" 2))
2250 @end group
2251 @end example
2252
2253 Setting this variable makes it buffer-local in the current buffer.
2254 @end defvar
2255
2256 @defvar imenu-case-fold-search
2257 This variable controls whether matching against the regular
2258 expressions in the value of @code{imenu-generic-expression} is
2259 case-sensitive: @code{t}, the default, means matching should ignore
2260 case.
2261
2262 Setting this variable makes it buffer-local in the current buffer.
2263 @end defvar
2264
2265 @defvar imenu-syntax-alist
2266 This variable is an alist of syntax table modifiers to use while
2267 processing @code{imenu-generic-expression}, to override the syntax table
2268 of the current buffer. Each element should have this form:
2269
2270 @example
2271 (@var{characters} . @var{syntax-description})
2272 @end example
2273
2274 The @sc{car}, @var{characters}, can be either a character or a string.
2275 The element says to give that character or characters the syntax
2276 specified by @var{syntax-description}, which is passed to
2277 @code{modify-syntax-entry} (@pxref{Syntax Table Functions}).
2278
2279 This feature is typically used to give word syntax to characters which
2280 normally have symbol syntax, and thus to simplify
2281 @code{imenu-generic-expression} and speed up matching.
2282 For example, Fortran mode uses it this way:
2283
2284 @example
2285 (setq imenu-syntax-alist '(("_$" . "w")))
2286 @end example
2287
2288 The @code{imenu-generic-expression} regular expressions can then use
2289 @samp{\\sw+} instead of @samp{\\(\\sw\\|\\s_\\)+}. Note that this
2290 technique may be inconvenient when the mode needs to limit the initial
2291 character of a name to a smaller set of characters than are allowed in
2292 the rest of a name.
2293
2294 Setting this variable makes it buffer-local in the current buffer.
2295 @end defvar
2296
2297 Another way to customize Imenu for a major mode is to set the
2298 variables @code{imenu-prev-index-position-function} and
2299 @code{imenu-extract-index-name-function}:
2300
2301 @defvar imenu-prev-index-position-function
2302 If this variable is non-@code{nil}, its value should be a function that
2303 finds the next ``definition'' to put in the buffer index, scanning
2304 backward in the buffer from point. It should return @code{nil} if it
2305 doesn't find another ``definition'' before point. Otherwise it should
2306 leave point at the place it finds a ``definition'' and return any
2307 non-@code{nil} value.
2308
2309 Setting this variable makes it buffer-local in the current buffer.
2310 @end defvar
2311
2312 @defvar imenu-extract-index-name-function
2313 If this variable is non-@code{nil}, its value should be a function to
2314 return the name for a definition, assuming point is in that definition
2315 as the @code{imenu-prev-index-position-function} function would leave
2316 it.
2317
2318 Setting this variable makes it buffer-local in the current buffer.
2319 @end defvar
2320
2321 The last way to customize Imenu for a major mode is to set the
2322 variable @code{imenu-create-index-function}:
2323
2324 @defvar imenu-create-index-function
2325 This variable specifies the function to use for creating a buffer
2326 index. The function should take no arguments, and return an index
2327 alist for the current buffer. It is called within
2328 @code{save-excursion}, so where it leaves point makes no difference.
2329
2330 The index alist can have three types of elements. Simple elements
2331 look like this:
2332
2333 @example
2334 (@var{index-name} . @var{index-position})
2335 @end example
2336
2337 Selecting a simple element has the effect of moving to position
2338 @var{index-position} in the buffer. Special elements look like this:
2339
2340 @example
2341 (@var{index-name} @var{index-position} @var{function} @var{arguments}@dots{})
2342 @end example
2343
2344 Selecting a special element performs:
2345
2346 @example
2347 (funcall @var{function}
2348 @var{index-name} @var{index-position} @var{arguments}@dots{})
2349 @end example
2350
2351 A nested sub-alist element looks like this:
2352
2353 @example
2354 (@var{menu-title} @var{sub-alist})
2355 @end example
2356
2357 It creates the submenu @var{menu-title} specified by @var{sub-alist}.
2358
2359 The default value of @code{imenu-create-index-function} is
2360 @code{imenu-default-create-index-function}. This function calls the
2361 value of @code{imenu-prev-index-position-function} and the value of
2362 @code{imenu-extract-index-name-function} to produce the index alist.
2363 However, if either of these two variables is @code{nil}, the default
2364 function uses @code{imenu-generic-expression} instead.
2365
2366 Setting this variable makes it buffer-local in the current buffer.
2367 @end defvar
2368
2369 @node Font Lock Mode
2370 @section Font Lock Mode
2371 @cindex Font Lock mode
2372
2373 @dfn{Font Lock mode} is a feature that automatically attaches
2374 @code{face} properties to certain parts of the buffer based on their
2375 syntactic role. How it parses the buffer depends on the major mode;
2376 most major modes define syntactic criteria for which faces to use in
2377 which contexts. This section explains how to customize Font Lock for a
2378 particular major mode.
2379
2380 Font Lock mode finds text to highlight in two ways: through
2381 syntactic parsing based on the syntax table, and through searching
2382 (usually for regular expressions). Syntactic fontification happens
2383 first; it finds comments and string constants and highlights them.
2384 Search-based fontification happens second.
2385
2386 @menu
2387 * Font Lock Basics:: Overview of customizing Font Lock.
2388 * Search-based Fontification:: Fontification based on regexps.
2389 * Customizing Keywords:: Customizing search-based fontification.
2390 * Other Font Lock Variables:: Additional customization facilities.
2391 * Levels of Font Lock:: Each mode can define alternative levels
2392 so that the user can select more or less.
2393 * Precalculated Fontification:: How Lisp programs that produce the buffer
2394 contents can also specify how to fontify it.
2395 * Faces for Font Lock:: Special faces specifically for Font Lock.
2396 * Syntactic Font Lock:: Fontification based on syntax tables.
2397 * Setting Syntax Properties:: Defining character syntax based on context
2398 using the Font Lock mechanism.
2399 * Multiline Font Lock:: How to coerce Font Lock into properly
2400 highlighting multiline constructs.
2401 @end menu
2402
2403 @node Font Lock Basics
2404 @subsection Font Lock Basics
2405
2406 There are several variables that control how Font Lock mode highlights
2407 text. But major modes should not set any of these variables directly.
2408 Instead, they should set @code{font-lock-defaults} as a buffer-local
2409 variable. The value assigned to this variable is used, if and when Font
2410 Lock mode is enabled, to set all the other variables.
2411
2412 @defvar font-lock-defaults
2413 This variable is set by major modes, as a buffer-local variable, to
2414 specify how to fontify text in that mode. It automatically becomes
2415 buffer-local when you set it. If its value is @code{nil}, Font-Lock
2416 mode does no highlighting, and you can use the @samp{Faces} menu
2417 (under @samp{Edit} and then @samp{Text Properties} in the menu bar) to
2418 assign faces explicitly to text in the buffer.
2419
2420 If non-@code{nil}, the value should look like this:
2421
2422 @example
2423 (@var{keywords} [@var{keywords-only} [@var{case-fold}
2424 [@var{syntax-alist} [@var{syntax-begin} @var{other-vars}@dots{}]]]])
2425 @end example
2426
2427 The first element, @var{keywords}, indirectly specifies the value of
2428 @code{font-lock-keywords} which directs search-based fontification.
2429 It can be a symbol, a variable or a function whose value is the list
2430 to use for @code{font-lock-keywords}. It can also be a list of
2431 several such symbols, one for each possible level of fontification.
2432 The first symbol specifies the @samp{mode default} level of
2433 fontification, the next symbol level 1 fontification, the next level 2,
2434 and so on. The @samp{mode default} level is normally the same as level
2435 1. It is used when @code{font-lock-maximum-decoration} has a @code{nil}
2436 value. @xref{Levels of Font Lock}.
2437
2438 The second element, @var{keywords-only}, specifies the value of the
2439 variable @code{font-lock-keywords-only}. If this is omitted or
2440 @code{nil}, syntactic fontification (of strings and comments) is also
2441 performed. If this is non-@code{nil}, such fontification is not
2442 performed. @xref{Syntactic Font Lock}.
2443
2444 The third element, @var{case-fold}, specifies the value of
2445 @code{font-lock-keywords-case-fold-search}. If it is non-@code{nil},
2446 Font Lock mode ignores case when searching as directed by
2447 @code{font-lock-keywords}.
2448
2449 If the fourth element, @var{syntax-alist}, is non-@code{nil}, it
2450 should be a list of cons cells of the form @code{(@var{char-or-string}
2451 . @var{string})}. These are used to set up a syntax table for
2452 syntactic fontification (@pxref{Syntax Table Functions}). The
2453 resulting syntax table is stored in @code{font-lock-syntax-table}.
2454
2455 The fifth element, @var{syntax-begin}, specifies the value of
2456 @code{font-lock-beginning-of-syntax-function}. We recommend setting
2457 this variable to @code{nil} and using @code{syntax-begin-function}
2458 instead.
2459
2460 All the remaining elements (if any) are collectively called
2461 @var{other-vars}. Each of these elements should have the form
2462 @code{(@var{variable} . @var{value})}---which means, make
2463 @var{variable} buffer-local and then set it to @var{value}. You can
2464 use these @var{other-vars} to set other variables that affect
2465 fontification, aside from those you can control with the first five
2466 elements. @xref{Other Font Lock Variables}.
2467 @end defvar
2468
2469 If your mode fontifies text explicitly by adding
2470 @code{font-lock-face} properties, it can specify @code{(nil t)} for
2471 @code{font-lock-defaults} to turn off all automatic fontification.
2472 However, this is not required; it is possible to fontify some things
2473 using @code{font-lock-face} properties and set up automatic
2474 fontification for other parts of the text.
2475
2476 @node Search-based Fontification
2477 @subsection Search-based Fontification
2478
2479 The most important variable for customizing Font Lock mode is
2480 @code{font-lock-keywords}. It specifies the search criteria for
2481 search-based fontification. You should specify the value of this
2482 variable with @var{keywords} in @code{font-lock-defaults}.
2483
2484 @defvar font-lock-keywords
2485 This variable's value is a list of the keywords to highlight. Be
2486 careful when composing regular expressions for this list; a poorly
2487 written pattern can dramatically slow things down!
2488 @end defvar
2489
2490 Each element of @code{font-lock-keywords} specifies how to find
2491 certain cases of text, and how to highlight those cases. Font Lock mode
2492 processes the elements of @code{font-lock-keywords} one by one, and for
2493 each element, it finds and handles all matches. Ordinarily, once
2494 part of the text has been fontified already, this cannot be overridden
2495 by a subsequent match in the same text; but you can specify different
2496 behavior using the @var{override} element of a @var{subexp-highlighter}.
2497
2498 Each element of @code{font-lock-keywords} should have one of these
2499 forms:
2500
2501 @table @code
2502 @item @var{regexp}
2503 Highlight all matches for @var{regexp} using
2504 @code{font-lock-keyword-face}. For example,
2505
2506 @example
2507 ;; @r{Highlight occurrences of the word @samp{foo}}
2508 ;; @r{using @code{font-lock-keyword-face}.}
2509 "\\<foo\\>"
2510 @end example
2511
2512 The function @code{regexp-opt} (@pxref{Regexp Functions}) is useful
2513 for calculating optimal regular expressions to match a number of
2514 different keywords.
2515
2516 @item @var{function}
2517 Find text by calling @var{function}, and highlight the matches
2518 it finds using @code{font-lock-keyword-face}.
2519
2520 When @var{function} is called, it receives one argument, the limit of
2521 the search; it should begin searching at point, and not search beyond the
2522 limit. It should return non-@code{nil} if it succeeds, and set the
2523 match data to describe the match that was found. Returning @code{nil}
2524 indicates failure of the search.
2525
2526 Fontification will call @var{function} repeatedly with the same limit,
2527 and with point where the previous invocation left it, until
2528 @var{function} fails. On failure, @var{function} need not reset point
2529 in any particular way.
2530
2531 @item (@var{matcher} . @var{subexp})
2532 In this kind of element, @var{matcher} is either a regular
2533 expression or a function, as described above. The @sc{cdr},
2534 @var{subexp}, specifies which subexpression of @var{matcher} should be
2535 highlighted (instead of the entire text that @var{matcher} matched).
2536
2537 @example
2538 ;; @r{Highlight the @samp{bar} in each occurrence of @samp{fubar},}
2539 ;; @r{using @code{font-lock-keyword-face}.}
2540 ("fu\\(bar\\)" . 1)
2541 @end example
2542
2543 If you use @code{regexp-opt} to produce the regular expression
2544 @var{matcher}, you can use @code{regexp-opt-depth} (@pxref{Regexp
2545 Functions}) to calculate the value for @var{subexp}.
2546
2547 @item (@var{matcher} . @var{facespec})
2548 In this kind of element, @var{facespec} is an expression whose value
2549 specifies the face to use for highlighting. In the simplest case,
2550 @var{facespec} is a Lisp variable (a symbol) whose value is a face
2551 name.
2552
2553 @example
2554 ;; @r{Highlight occurrences of @samp{fubar},}
2555 ;; @r{using the face which is the value of @code{fubar-face}.}
2556 ("fubar" . fubar-face)
2557 @end example
2558
2559 However, @var{facespec} can also evaluate to a list of this form:
2560
2561 @example
2562 (face @var{face} @var{prop1} @var{val1} @var{prop2} @var{val2}@dots{})
2563 @end example
2564
2565 @noindent
2566 to specify the face @var{face} and various additional text properties
2567 to put on the text that matches. If you do this, be sure to add the
2568 other text property names that you set in this way to the value of
2569 @code{font-lock-extra-managed-props} so that the properties will also
2570 be cleared out when they are no longer appropriate. Alternatively,
2571 you can set the variable @code{font-lock-unfontify-region-function} to
2572 a function that clears these properties. @xref{Other Font Lock
2573 Variables}.
2574
2575 @item (@var{matcher} . @var{subexp-highlighter})
2576 In this kind of element, @var{subexp-highlighter} is a list
2577 which specifies how to highlight matches found by @var{matcher}.
2578 It has the form:
2579
2580 @example
2581 (@var{subexp} @var{facespec} [@var{override} [@var{laxmatch}]])
2582 @end example
2583
2584 The @sc{car}, @var{subexp}, is an integer specifying which subexpression
2585 of the match to fontify (0 means the entire matching text). The second
2586 subelement, @var{facespec}, is an expression whose value specifies the
2587 face, as described above.
2588
2589 The last two values in @var{subexp-highlighter}, @var{override} and
2590 @var{laxmatch}, are optional flags. If @var{override} is @code{t},
2591 this element can override existing fontification made by previous
2592 elements of @code{font-lock-keywords}. If it is @code{keep}, then
2593 each character is fontified if it has not been fontified already by
2594 some other element. If it is @code{prepend}, the face specified by
2595 @var{facespec} is added to the beginning of the @code{font-lock-face}
2596 property. If it is @code{append}, the face is added to the end of the
2597 @code{font-lock-face} property.
2598
2599 If @var{laxmatch} is non-@code{nil}, it means there should be no error
2600 if there is no subexpression numbered @var{subexp} in @var{matcher}.
2601 Obviously, fontification of the subexpression numbered @var{subexp} will
2602 not occur. However, fontification of other subexpressions (and other
2603 regexps) will continue. If @var{laxmatch} is @code{nil}, and the
2604 specified subexpression is missing, then an error is signaled which
2605 terminates search-based fontification.
2606
2607 Here are some examples of elements of this kind, and what they do:
2608
2609 @smallexample
2610 ;; @r{Highlight occurrences of either @samp{foo} or @samp{bar}, using}
2611 ;; @r{@code{foo-bar-face}, even if they have already been highlighted.}
2612 ;; @r{@code{foo-bar-face} should be a variable whose value is a face.}
2613 ("foo\\|bar" 0 foo-bar-face t)
2614
2615 ;; @r{Highlight the first subexpression within each occurrence}
2616 ;; @r{that the function @code{fubar-match} finds,}
2617 ;; @r{using the face which is the value of @code{fubar-face}.}
2618 (fubar-match 1 fubar-face)
2619 @end smallexample
2620
2621 @item (@var{matcher} . @var{anchored-highlighter})
2622 In this kind of element, @var{anchored-highlighter} specifies how to
2623 highlight text that follows a match found by @var{matcher}. So a
2624 match found by @var{matcher} acts as the anchor for further searches
2625 specified by @var{anchored-highlighter}. @var{anchored-highlighter}
2626 is a list of the following form:
2627
2628 @example
2629 (@var{anchored-matcher} @var{pre-form} @var{post-form}
2630 @var{subexp-highlighters}@dots{})
2631 @end example
2632
2633 Here, @var{anchored-matcher}, like @var{matcher}, is either a regular
2634 expression or a function. After a match of @var{matcher} is found,
2635 point is at the end of the match. Now, Font Lock evaluates the form
2636 @var{pre-form}. Then it searches for matches of
2637 @var{anchored-matcher} and uses @var{subexp-highlighters} to highlight
2638 these. A @var{subexp-highlighter} is as described above. Finally,
2639 Font Lock evaluates @var{post-form}.
2640
2641 The forms @var{pre-form} and @var{post-form} can be used to initialize
2642 before, and cleanup after, @var{anchored-matcher} is used. Typically,
2643 @var{pre-form} is used to move point to some position relative to the
2644 match of @var{matcher}, before starting with @var{anchored-matcher}.
2645 @var{post-form} might be used to move back, before resuming with
2646 @var{matcher}.
2647
2648 After Font Lock evaluates @var{pre-form}, it does not search for
2649 @var{anchored-matcher} beyond the end of the line. However, if
2650 @var{pre-form} returns a buffer position that is greater than the
2651 position of point after @var{pre-form} is evaluated, then the position
2652 returned by @var{pre-form} is used as the limit of the search instead.
2653 It is generally a bad idea to return a position greater than the end
2654 of the line; in other words, the @var{anchored-matcher} search should
2655 not span lines.
2656
2657 For example,
2658
2659 @smallexample
2660 ;; @r{Highlight occurrences of the word @samp{item} following}
2661 ;; @r{an occurrence of the word @samp{anchor} (on the same line)}
2662 ;; @r{in the value of @code{item-face}.}
2663 ("\\<anchor\\>" "\\<item\\>" nil nil (0 item-face))
2664 @end smallexample
2665
2666 Here, @var{pre-form} and @var{post-form} are @code{nil}. Therefore
2667 searching for @samp{item} starts at the end of the match of
2668 @samp{anchor}, and searching for subsequent instances of @samp{anchor}
2669 resumes from where searching for @samp{item} concluded.
2670
2671 @item (@var{matcher} @var{highlighters}@dots{})
2672 This sort of element specifies several @var{highlighter} lists for a
2673 single @var{matcher}. A @var{highlighter} list can be of the type
2674 @var{subexp-highlighter} or @var{anchored-highlighter} as described
2675 above.
2676
2677 For example,
2678
2679 @smallexample
2680 ;; @r{Highlight occurrences of the word @samp{anchor} in the value}
2681 ;; @r{of @code{anchor-face}, and subsequent occurrences of the word}
2682 ;; @r{@samp{item} (on the same line) in the value of @code{item-face}.}
2683 ("\\<anchor\\>" (0 anchor-face)
2684 ("\\<item\\>" nil nil (0 item-face)))
2685 @end smallexample
2686
2687 @item (eval . @var{form})
2688 Here @var{form} is an expression to be evaluated the first time
2689 this value of @code{font-lock-keywords} is used in a buffer.
2690 Its value should have one of the forms described in this table.
2691 @end table
2692
2693 @strong{Warning:} Do not design an element of @code{font-lock-keywords}
2694 to match text which spans lines; this does not work reliably.
2695 For details, see @xref{Multiline Font Lock}.
2696
2697 You can use @var{case-fold} in @code{font-lock-defaults} to specify
2698 the value of @code{font-lock-keywords-case-fold-search} which says
2699 whether search-based fontification should be case-insensitive.
2700
2701 @defvar font-lock-keywords-case-fold-search
2702 Non-@code{nil} means that regular expression matching for the sake of
2703 @code{font-lock-keywords} should be case-insensitive.
2704 @end defvar
2705
2706 @node Customizing Keywords
2707 @subsection Customizing Search-Based Fontification
2708
2709 You can use @code{font-lock-add-keywords} to add additional
2710 search-based fontification rules to a major mode, and
2711 @code{font-lock-remove-keywords} to remove rules.
2712
2713 @defun font-lock-add-keywords mode keywords &optional how
2714 This function adds highlighting @var{keywords}, for the current buffer
2715 or for major mode @var{mode}. The argument @var{keywords} should be a
2716 list with the same format as the variable @code{font-lock-keywords}.
2717
2718 If @var{mode} is a symbol which is a major mode command name, such as
2719 @code{c-mode}, the effect is that enabling Font Lock mode in
2720 @var{mode} will add @var{keywords} to @code{font-lock-keywords}.
2721 Calling with a non-@code{nil} value of @var{mode} is correct only in
2722 your @file{~/.emacs} file.
2723
2724 If @var{mode} is @code{nil}, this function adds @var{keywords} to
2725 @code{font-lock-keywords} in the current buffer. This way of calling
2726 @code{font-lock-add-keywords} is usually used in mode hook functions.
2727
2728 By default, @var{keywords} are added at the beginning of
2729 @code{font-lock-keywords}. If the optional argument @var{how} is
2730 @code{set}, they are used to replace the value of
2731 @code{font-lock-keywords}. If @var{how} is any other non-@code{nil}
2732 value, they are added at the end of @code{font-lock-keywords}.
2733
2734 Some modes provide specialized support you can use in additional
2735 highlighting patterns. See the variables
2736 @code{c-font-lock-extra-types}, @code{c++-font-lock-extra-types},
2737 and @code{java-font-lock-extra-types}, for example.
2738
2739 @strong{Warning:} major mode commands must not call
2740 @code{font-lock-add-keywords} under any circumstances, either directly
2741 or indirectly, except through their mode hooks. (Doing so would lead to
2742 incorrect behavior for some minor modes.) They should set up their
2743 rules for search-based fontification by setting
2744 @code{font-lock-keywords}.
2745 @end defun
2746
2747 @defun font-lock-remove-keywords mode keywords
2748 This function removes @var{keywords} from @code{font-lock-keywords}
2749 for the current buffer or for major mode @var{mode}. As in
2750 @code{font-lock-add-keywords}, @var{mode} should be a major mode
2751 command name or @code{nil}. All the caveats and requirements for
2752 @code{font-lock-add-keywords} apply here too.
2753 @end defun
2754
2755 For example, this code
2756
2757 @smallexample
2758 (font-lock-add-keywords 'c-mode
2759 '(("\\<\\(FIXME\\):" 1 font-lock-warning-face prepend)
2760 ("\\<\\(and\\|or\\|not\\)\\>" . font-lock-keyword-face)))
2761 @end smallexample
2762
2763 @noindent
2764 adds two fontification patterns for C mode: one to fontify the word
2765 @samp{FIXME}, even in comments, and another to fontify the words
2766 @samp{and}, @samp{or} and @samp{not} as keywords.
2767
2768 @noindent
2769 That example affects only C mode proper. To add the same patterns to
2770 C mode @emph{and} all modes derived from it, do this instead:
2771
2772 @smallexample
2773 (add-hook 'c-mode-hook
2774 (lambda ()
2775 (font-lock-add-keywords nil
2776 '(("\\<\\(FIXME\\):" 1 font-lock-warning-face prepend)
2777 ("\\<\\(and\\|or\\|not\\)\\>" .
2778 font-lock-keyword-face)))))
2779 @end smallexample
2780
2781 @node Other Font Lock Variables
2782 @subsection Other Font Lock Variables
2783
2784 This section describes additional variables that a major mode can
2785 set by means of @var{other-vars} in @code{font-lock-defaults}
2786 (@pxref{Font Lock Basics}).
2787
2788 @defvar font-lock-mark-block-function
2789 If this variable is non-@code{nil}, it should be a function that is
2790 called with no arguments, to choose an enclosing range of text for
2791 refontification for the command @kbd{M-o M-o}
2792 (@code{font-lock-fontify-block}).
2793
2794 The function should report its choice by placing the region around it.
2795 A good choice is a range of text large enough to give proper results,
2796 but not too large so that refontification becomes slow. Typical values
2797 are @code{mark-defun} for programming modes or @code{mark-paragraph} for
2798 textual modes.
2799 @end defvar
2800
2801 @defvar font-lock-extra-managed-props
2802 This variable specifies additional properties (other than
2803 @code{font-lock-face}) that are being managed by Font Lock mode. It
2804 is used by @code{font-lock-default-unfontify-region}, which normally
2805 only manages the @code{font-lock-face} property. If you want Font
2806 Lock to manage other properties as well, you must specify them in a
2807 @var{facespec} in @code{font-lock-keywords} as well as add them to
2808 this list. @xref{Search-based Fontification}.
2809 @end defvar
2810
2811 @defvar font-lock-fontify-buffer-function
2812 Function to use for fontifying the buffer. The default value is
2813 @code{font-lock-default-fontify-buffer}.
2814 @end defvar
2815
2816 @defvar font-lock-unfontify-buffer-function
2817 Function to use for unfontifying the buffer. This is used when
2818 turning off Font Lock mode. The default value is
2819 @code{font-lock-default-unfontify-buffer}.
2820 @end defvar
2821
2822 @defvar font-lock-fontify-region-function
2823 Function to use for fontifying a region. It should take two
2824 arguments, the beginning and end of the region, and an optional third
2825 argument @var{verbose}. If @var{verbose} is non-@code{nil}, the
2826 function should print status messages. The default value is
2827 @code{font-lock-default-fontify-region}.
2828 @end defvar
2829
2830 @defvar font-lock-unfontify-region-function
2831 Function to use for unfontifying a region. It should take two
2832 arguments, the beginning and end of the region. The default value is
2833 @code{font-lock-default-unfontify-region}.
2834 @end defvar
2835
2836 @defun jit-lock-register function &optional contextual
2837 This function tells Font Lock mode to run the Lisp function
2838 @var{function} any time it has to fontify or refontify part of the
2839 current buffer. It calls @var{function} before calling the default
2840 fontification functions, and gives it two arguments, @var{start} and
2841 @var{end}, which specify the region to be fontified or refontified.
2842
2843 The optional argument @var{contextual}, if non-@code{nil}, forces Font
2844 Lock mode to always refontify a syntactically relevant part of the
2845 buffer, and not just the modified lines. This argument can usually be
2846 omitted.
2847 @end defun
2848
2849 @defun jit-lock-unregister function
2850 If @var{function} was previously registered as a fontification
2851 function using @code{jit-lock-register}, this function unregisters it.
2852 @end defun
2853
2854 @node Levels of Font Lock
2855 @subsection Levels of Font Lock
2856
2857 Many major modes offer three different levels of fontification. You
2858 can define multiple levels by using a list of symbols for @var{keywords}
2859 in @code{font-lock-defaults}. Each symbol specifies one level of
2860 fontification; it is up to the user to choose one of these levels,
2861 normally by setting @code{font-lock-maximum-decoration} (@pxref{Font
2862 Lock,,, emacs, the GNU Emacs Manual}). The chosen level's symbol
2863 value is used to initialize @code{font-lock-keywords}.
2864
2865 Here are the conventions for how to define the levels of
2866 fontification:
2867
2868 @itemize @bullet
2869 @item
2870 Level 1: highlight function declarations, file directives (such as include or
2871 import directives), strings and comments. The idea is speed, so only
2872 the most important and top-level components are fontified.
2873
2874 @item
2875 Level 2: in addition to level 1, highlight all language keywords,
2876 including type names that act like keywords, as well as named constant
2877 values. The idea is that all keywords (either syntactic or semantic)
2878 should be fontified appropriately.
2879
2880 @item
2881 Level 3: in addition to level 2, highlight the symbols being defined in
2882 function and variable declarations, and all builtin function names,
2883 wherever they appear.
2884 @end itemize
2885
2886 @node Precalculated Fontification
2887 @subsection Precalculated Fontification
2888
2889 Some major modes such as @code{list-buffers} and @code{occur}
2890 construct the buffer text programmatically. The easiest way for them
2891 to support Font Lock mode is to specify the faces of text when they
2892 insert the text in the buffer.
2893
2894 The way to do this is to specify the faces in the text with the
2895 special text property @code{font-lock-face} (@pxref{Special
2896 Properties}). When Font Lock mode is enabled, this property controls
2897 the display, just like the @code{face} property. When Font Lock mode
2898 is disabled, @code{font-lock-face} has no effect on the display.
2899
2900 It is ok for a mode to use @code{font-lock-face} for some text and
2901 also use the normal Font Lock machinery. But if the mode does not use
2902 the normal Font Lock machinery, it should not set the variable
2903 @code{font-lock-defaults}.
2904
2905 @node Faces for Font Lock
2906 @subsection Faces for Font Lock
2907 @cindex faces for font lock
2908 @cindex font lock faces
2909
2910 Font Lock mode can highlight using any face, but Emacs defines several
2911 faces specifically for syntactic highlighting. These @dfn{Font Lock
2912 faces} are listed below. They can also be used by major modes for
2913 syntactic highlighting outside of Font Lock mode (@pxref{Major Mode
2914 Conventions}).
2915
2916 Each of these symbols is both a face name, and a variable whose
2917 default value is the symbol itself. Thus, the default value of
2918 @code{font-lock-comment-face} is @code{font-lock-comment-face}.
2919
2920 The faces are listed with descriptions of their typical usage, and in
2921 order of greater to lesser ``prominence''. If a mode's syntactic
2922 categories do not fit well with the usage descriptions, the faces can be
2923 assigned using the ordering as a guide.
2924
2925 @table @code
2926 @item font-lock-warning-face
2927 @vindex font-lock-warning-face
2928 for a construct that is peculiar, or that greatly changes the meaning of
2929 other text, like @samp{;;;###autoload} in Emacs Lisp and @samp{#error}
2930 in C.
2931
2932 @item font-lock-function-name-face
2933 @vindex font-lock-function-name-face
2934 for the name of a function being defined or declared.
2935
2936 @item font-lock-variable-name-face
2937 @vindex font-lock-variable-name-face
2938 for the name of a variable being defined or declared.
2939
2940 @item font-lock-keyword-face
2941 @vindex font-lock-keyword-face
2942 for a keyword with special syntactic significance, like @samp{for} and
2943 @samp{if} in C.
2944
2945 @item font-lock-comment-face
2946 @vindex font-lock-comment-face
2947 for comments.
2948
2949 @item font-lock-comment-delimiter-face
2950 @vindex font-lock-comment-delimiter-face
2951 for comments delimiters, like @samp{/*} and @samp{*/} in C. On most
2952 terminals, this inherits from @code{font-lock-comment-face}.
2953
2954 @item font-lock-type-face
2955 @vindex font-lock-type-face
2956 for the names of user-defined data types.
2957
2958 @item font-lock-constant-face
2959 @vindex font-lock-constant-face
2960 for the names of constants, like @samp{NULL} in C.
2961
2962 @item font-lock-builtin-face
2963 @vindex font-lock-builtin-face
2964 for the names of built-in functions.
2965
2966 @item font-lock-preprocessor-face
2967 @vindex font-lock-preprocessor-face
2968 for preprocessor commands. This inherits, by default, from
2969 @code{font-lock-builtin-face}.
2970
2971 @item font-lock-string-face
2972 @vindex font-lock-string-face
2973 for string constants.
2974
2975 @item font-lock-doc-face
2976 @vindex font-lock-doc-face
2977 for documentation strings in the code. This inherits, by default, from
2978 @code{font-lock-string-face}.
2979
2980 @item font-lock-negation-char-face
2981 @vindex font-lock-negation-char-face
2982 for easily-overlooked negation characters.
2983 @end table
2984
2985 @node Syntactic Font Lock
2986 @subsection Syntactic Font Lock
2987 @cindex syntactic font lock
2988
2989 Syntactic fontification uses the syntax table to find comments and
2990 string constants (@pxref{Syntax Tables}). It highlights them using
2991 @code{font-lock-comment-face} and @code{font-lock-string-face}
2992 (@pxref{Faces for Font Lock}), or whatever
2993 @code{font-lock-syntactic-face-function} chooses. There are several
2994 variables that affect syntactic fontification; you should set them by
2995 means of @code{font-lock-defaults} (@pxref{Font Lock Basics}).
2996
2997 @defvar font-lock-keywords-only
2998 Non-@code{nil} means Font Lock should not do syntactic fontification;
2999 it should only fontify based on @code{font-lock-keywords}. The normal
3000 way for a mode to set this variable to @code{t} is with
3001 @var{keywords-only} in @code{font-lock-defaults}.
3002 @end defvar
3003
3004 @defvar font-lock-syntax-table
3005 This variable holds the syntax table to use for fontification of
3006 comments and strings. Specify it using @var{syntax-alist} in
3007 @code{font-lock-defaults}. If this is @code{nil}, fontification uses
3008 the buffer's syntax table.
3009 @end defvar
3010
3011 @defvar font-lock-beginning-of-syntax-function
3012 If this variable is non-@code{nil}, it should be a function to move
3013 point back to a position that is syntactically at ``top level'' and
3014 outside of strings or comments. Font Lock uses this when necessary
3015 to get the right results for syntactic fontification.
3016
3017 This function is called with no arguments. It should leave point at
3018 the beginning of any enclosing syntactic block. Typical values are
3019 @code{beginning-of-line} (used when the start of the line is known to
3020 be outside a syntactic block), or @code{beginning-of-defun} for
3021 programming modes, or @code{backward-paragraph} for textual modes.
3022
3023 If the value is @code{nil}, Font Lock uses
3024 @code{syntax-begin-function} to move back outside of any comment,
3025 string, or sexp. This variable is semi-obsolete; we recommend setting
3026 @code{syntax-begin-function} instead.
3027
3028 Specify this variable using @var{syntax-begin} in
3029 @code{font-lock-defaults}.
3030 @end defvar
3031
3032 @defvar font-lock-syntactic-face-function
3033 A function to determine which face to use for a given syntactic
3034 element (a string or a comment). The function is called with one
3035 argument, the parse state at point returned by
3036 @code{parse-partial-sexp}, and should return a face. The default
3037 value returns @code{font-lock-comment-face} for comments and
3038 @code{font-lock-string-face} for strings.
3039
3040 This can be used to highlighting different kinds of strings or
3041 comments differently. It is also sometimes abused together with
3042 @code{font-lock-syntactic-keywords} to highlight constructs that span
3043 multiple lines, but this is too esoteric to document here.
3044
3045 Specify this variable using @var{other-vars} in
3046 @code{font-lock-defaults}.
3047 @end defvar
3048
3049 @node Setting Syntax Properties
3050 @subsection Setting Syntax Properties
3051
3052 Font Lock mode can be used to update @code{syntax-table} properties
3053 automatically (@pxref{Syntax Properties}). This is useful in
3054 languages for which a single syntax table by itself is not sufficient.
3055
3056 @defvar font-lock-syntactic-keywords
3057 This variable enables and controls updating @code{syntax-table}
3058 properties by Font Lock. Its value should be a list of elements of
3059 this form:
3060
3061 @example
3062 (@var{matcher} @var{subexp} @var{syntax} @var{override} @var{laxmatch})
3063 @end example
3064
3065 The parts of this element have the same meanings as in the corresponding
3066 sort of element of @code{font-lock-keywords},
3067
3068 @example
3069 (@var{matcher} @var{subexp} @var{facespec} @var{override} @var{laxmatch})
3070 @end example
3071
3072 However, instead of specifying the value @var{facespec} to use for the
3073 @code{face} property, it specifies the value @var{syntax} to use for
3074 the @code{syntax-table} property. Here, @var{syntax} can be a string
3075 (as taken by @code{modify-syntax-entry}), a syntax table, a cons cell
3076 (as returned by @code{string-to-syntax}), or an expression whose value
3077 is one of those two types. @var{override} cannot be @code{prepend} or
3078 @code{append}.
3079
3080 For example, an element of the form:
3081
3082 @example
3083 ("\\$\\(#\\)" 1 ".")
3084 @end example
3085
3086 highlights syntactically a hash character when following a dollar
3087 character, with a SYNTAX of @code{"."} (meaning punctuation syntax).
3088 Assuming that the buffer syntax table specifies hash characters to
3089 have comment start syntax, the element will only highlight hash
3090 characters that do not follow dollar characters as comments
3091 syntactically.
3092
3093 An element of the form:
3094
3095 @example
3096 ("\\('\\).\\('\\)"
3097 (1 "\"")
3098 (2 "\""))
3099 @end example
3100
3101 highlights syntactically both single quotes which surround a single
3102 character, with a SYNTAX of @code{"\""} (meaning string quote syntax).
3103 Assuming that the buffer syntax table does not specify single quotes
3104 to have quote syntax, the element will only highlight single quotes of
3105 the form @samp{'@var{c}'} as strings syntactically. Other forms, such
3106 as @samp{foo'bar} or @samp{'fubar'}, will not be highlighted as
3107 strings.
3108
3109 Major modes normally set this variable with @var{other-vars} in
3110 @code{font-lock-defaults}.
3111 @end defvar
3112
3113 @node Multiline Font Lock
3114 @subsection Multiline Font Lock Constructs
3115 @cindex multiline font lock
3116
3117 Normally, elements of @code{font-lock-keywords} should not match
3118 across multiple lines; that doesn't work reliably, because Font Lock
3119 usually scans just part of the buffer, and it can miss a multi-line
3120 construct that crosses the line boundary where the scan starts. (The
3121 scan normally starts at the beginning of a line.)
3122
3123 Making elements that match multiline constructs work properly has
3124 two aspects: correct @emph{identification} and correct
3125 @emph{rehighlighting}. The first means that Font Lock finds all
3126 multiline constructs. The second means that Font Lock will correctly
3127 rehighlight all the relevant text when a multiline construct is
3128 changed---for example, if some of the text that was previously part of
3129 a multiline construct ceases to be part of it. The two aspects are
3130 closely related, and often getting one of them to work will appear to
3131 make the other also work. However, for reliable results you must
3132 attend explicitly to both aspects.
3133
3134 There are three ways to ensure correct identification of multiline
3135 constructs:
3136
3137 @itemize
3138 @item
3139 Add a function to @code{font-lock-extend-region-functions} that does
3140 the @emph{identification} and extends the scan so that the scanned
3141 text never starts or ends in the middle of a multiline construct.
3142 @item
3143 Use the @code{font-lock-fontify-region-function} hook similarly to
3144 extend the scan so that the scanned text never starts or ends in the
3145 middle of a multiline construct.
3146 @item
3147 Somehow identify the multiline construct right when it gets inserted
3148 into the buffer (or at any point after that but before font-lock
3149 tries to highlight it), and mark it with a @code{font-lock-multiline}
3150 which will instruct font-lock not to start or end the scan in the
3151 middle of the construct.
3152 @end itemize
3153
3154 There are three ways to do rehighlighting of multiline constructs:
3155
3156 @itemize
3157 @item
3158 Place a @code{font-lock-multiline} property on the construct. This
3159 will rehighlight the whole construct if any part of it is changed. In
3160 some cases you can do this automatically by setting the
3161 @code{font-lock-multiline} variable, which see.
3162 @item
3163 Make sure @code{jit-lock-contextually} is set and rely on it doing its
3164 job. This will only rehighlight the part of the construct that
3165 follows the actual change, and will do it after a short delay.
3166 This only works if the highlighting of the various parts of your
3167 multiline construct never depends on text in subsequent lines.
3168 Since @code{jit-lock-contextually} is activated by default, this can
3169 be an attractive solution.
3170 @item
3171 Place a @code{jit-lock-defer-multiline} property on the construct.
3172 This works only if @code{jit-lock-contextually} is used, and with the
3173 same delay before rehighlighting, but like @code{font-lock-multiline},
3174 it also handles the case where highlighting depends on
3175 subsequent lines.
3176 @end itemize
3177
3178 @menu
3179 * Font Lock Multiline:: Marking multiline chunks with a text property.
3180 * Region to Refontify:: Controlling which region gets refontified
3181 after a buffer change.
3182 @end menu
3183
3184 @node Font Lock Multiline
3185 @subsubsection Font Lock Multiline
3186
3187 One way to ensure reliable rehighlighting of multiline Font Lock
3188 constructs is to put on them the text property @code{font-lock-multiline}.
3189 It should be present and non-@code{nil} for text that is part of a
3190 multiline construct.
3191
3192 When Font Lock is about to highlight a range of text, it first
3193 extends the boundaries of the range as necessary so that they do not
3194 fall within text marked with the @code{font-lock-multiline} property.
3195 Then it removes any @code{font-lock-multiline} properties from the
3196 range, and highlights it. The highlighting specification (mostly
3197 @code{font-lock-keywords}) must reinstall this property each time,
3198 whenever it is appropriate.
3199
3200 @strong{Warning:} don't use the @code{font-lock-multiline} property
3201 on large ranges of text, because that will make rehighlighting slow.
3202
3203 @defvar font-lock-multiline
3204 If the @code{font-lock-multiline} variable is set to @code{t}, Font
3205 Lock will try to add the @code{font-lock-multiline} property
3206 automatically on multiline constructs. This is not a universal
3207 solution, however, since it slows down Font Lock somewhat. It can
3208 miss some multiline constructs, or make the property larger or smaller
3209 than necessary.
3210
3211 For elements whose @var{matcher} is a function, the function should
3212 ensure that submatch 0 covers the whole relevant multiline construct,
3213 even if only a small subpart will be highlighted. It is often just as
3214 easy to add the @code{font-lock-multiline} property by hand.
3215 @end defvar
3216
3217 The @code{font-lock-multiline} property is meant to ensure proper
3218 refontification; it does not automatically identify new multiline
3219 constructs. Identifying the requires that Font-Lock operate on large
3220 enough chunks at a time. This will happen by accident on many cases,
3221 which may give the impression that multiline constructs magically work.
3222 If you set the @code{font-lock-multiline} variable non-@code{nil},
3223 this impression will be even stronger, since the highlighting of those
3224 constructs which are found will be properly updated from then on.
3225 But that does not work reliably.
3226
3227 To find multiline constructs reliably, you must either manually
3228 place the @code{font-lock-multiline} property on the text before
3229 Font-Lock looks at it, or use
3230 @code{font-lock-fontify-region-function}.
3231
3232 @node Region to Refontify
3233 @subsubsection Region to Fontify after a Buffer Change
3234
3235 When a buffer is changed, the region that Font Lock refontifies is
3236 by default the smallest sequence of whole lines that spans the change.
3237 While this works well most of the time, sometimes it doesn't---for
3238 example, when a change alters the syntactic meaning of text on an
3239 earlier line.
3240
3241 You can enlarge (or even reduce) the region to refontify by setting
3242 the following variable:
3243
3244 @defvar font-lock-extend-after-change-region-function
3245 This buffer-local variable is either @code{nil} or a function for
3246 Font-Lock to call to determine the region to scan and fontify.
3247
3248 The function is given three parameters, the standard @var{beg},
3249 @var{end}, and @var{old-len} from @code{after-change-functions}
3250 (@pxref{Change Hooks}). It should return either a cons of the
3251 beginning and end buffer positions (in that order) of the region to
3252 fontify, or @code{nil} (which means choose the region in the standard
3253 way). This function needs to preserve point, the match-data, and the
3254 current restriction. The region it returns may start or end in the
3255 middle of a line.
3256
3257 Since this function is called after every buffer change, it should be
3258 reasonably fast.
3259 @end defvar
3260
3261 @node Auto-Indentation
3262 @section Auto-indentation of code
3263
3264 For programming languages, an important feature of a major mode is to
3265 provide automatic indentation. This is controlled in Emacs by
3266 @code{indent-line-function} (@pxref{Mode-Specific Indent}).
3267 Writing a good indentation function can be difficult and to a large
3268 extent it is still a black art.
3269
3270 Many major mode authors will start by writing a simple indentation
3271 function that works for simple cases, for example by comparing with the
3272 indentation of the previous text line. For most programming languages
3273 that are not really line-based, this tends to scale very poorly:
3274 improving such a function to let it handle more diverse situations tends
3275 to become more and more difficult, resulting in the end with a large,
3276 complex, unmaintainable indentation function which nobody dares to touch.
3277
3278 A good indentation function will usually need to actually parse the
3279 text, according to the syntax of the language. Luckily, it is not
3280 necessary to parse the text in as much detail as would be needed
3281 for a compiler, but on the other hand, the parser embedded in the
3282 indentation code will want to be somewhat friendly to syntactically
3283 incorrect code.
3284
3285 Good maintainable indentation functions usually fall into 2 categories:
3286 either parsing forward from some ``safe'' starting point until the
3287 position of interest, or parsing backward from the position of interest.
3288 Neither of the two is a clearly better choice than the other: parsing
3289 backward is often more difficult than parsing forward because
3290 programming languages are designed to be parsed forward, but for the
3291 purpose of indentation it has the advantage of not needing to
3292 guess a ``safe'' starting point, and it generally enjoys the property
3293 that only a minimum of text will be analyzed to decide the indentation
3294 of a line, so indentation will tend to be unaffected by syntax errors in
3295 some earlier unrelated piece of code. Parsing forward on the other hand
3296 is usually easier and has the advantage of making it possible to
3297 reindent efficiently a whole region at a time, with a single parse.
3298
3299 Rather than write your own indentation function from scratch, it is
3300 often preferable to try and reuse some existing ones or to rely
3301 on a generic indentation engine. There are sadly few such
3302 engines. The CC-mode indentation code (used with C, C++, Java, Awk
3303 and a few other such modes) has been made more generic over the years,
3304 so if your language seems somewhat similar to one of those languages,
3305 you might try to use that engine. @c FIXME: documentation?
3306 Another one is SMIE which takes an approach in the spirit
3307 of Lisp sexps and adapts it to non-Lisp languages.
3308
3309 @menu
3310 * SMIE:: A simple minded indentation engine
3311 @end menu
3312
3313 @node SMIE
3314 @subsection Simple Minded Indentation Engine
3315
3316 SMIE is a package that provides a generic navigation and indentation
3317 engine. Based on a very simple parser using an ``operator precedence
3318 grammar'', it lets major modes extend the sexp-based navigation of Lisp
3319 to non-Lisp languages as well as provide a simple to use but reliable
3320 auto-indentation.
3321
3322 Operator precedence grammar is a very primitive technology for parsing
3323 compared to some of the more common techniques used in compilers.
3324 It has the following characteristics: its parsing power is very limited,
3325 and it is largely unable to detect syntax errors, but it has the
3326 advantage of being algorithmically efficient and able to parse forward
3327 just as well as backward. In practice that means that SMIE can use it
3328 for indentation based on backward parsing, that it can provide both
3329 @code{forward-sexp} and @code{backward-sexp} functionality, and that it
3330 will naturally work on syntactically incorrect code without any extra
3331 effort. The downside is that it also means that most programming
3332 languages cannot be parsed correctly using SMIE, at least not without
3333 resorting to some special tricks (@pxref{SMIE Tricks}).
3334
3335 @menu
3336 * SMIE setup:: SMIE setup and features
3337 * Operator Precedence Grammars:: A very simple parsing technique
3338 * SMIE Grammar:: Defining the grammar of a language
3339 * SMIE Lexer:: Defining tokens
3340 * SMIE Tricks:: Working around the parser's limitations
3341 * SMIE Indentation:: Specifying indentation rules
3342 * SMIE Indentation Helpers:: Helper functions for indentation rules
3343 * SMIE Indentation Example:: Sample indentation rules
3344 @end menu
3345
3346 @node SMIE setup
3347 @subsubsection SMIE Setup and Features
3348
3349 SMIE is meant to be a one-stop shop for structural navigation and
3350 various other features which rely on the syntactic structure of code, in
3351 particular automatic indentation. The main entry point is
3352 @code{smie-setup} which is a function typically called while setting
3353 up a major mode.
3354
3355 @defun smie-setup grammar rules-function &rest keywords
3356 Setup SMIE navigation and indentation.
3357 @var{grammar} is a grammar table generated by @code{smie-prec2->grammar}.
3358 @var{rules-function} is a set of indentation rules for use on
3359 @code{smie-rules-function}.
3360 @var{keywords} are additional arguments, which can include the following
3361 keywords:
3362 @itemize
3363 @item
3364 @code{:forward-token} @var{fun}: Specify the forward lexer to use.
3365 @item
3366 @code{:backward-token} @var{fun}: Specify the backward lexer to use.
3367 @end itemize
3368 @end defun
3369
3370 Calling this function is sufficient to make commands such as
3371 @code{forward-sexp}, @code{backward-sexp}, and @code{transpose-sexps} be
3372 able to properly handle structural elements other than just the paired
3373 parentheses already handled by syntax tables. For example, if the
3374 provided grammar is precise enough, @code{transpose-sexps} can correctly
3375 transpose the two arguments of a @code{+} operator, taking into account
3376 the precedence rules of the language.
3377
3378 Calling `smie-setup' is also sufficient to make TAB indentation work in
3379 the expected way, extends @code{blink-matching-paren} to apply to
3380 elements like @code{begin...end}, and provides some commands that you
3381 can bind in the major mode keymap.
3382
3383 @deffn Command smie-close-block
3384 This command closes the most recently opened (and not yet closed) block.
3385 @end deffn
3386
3387 @deffn Command smie-down-list &optional arg
3388 This command is like @code{down-list} but it also pays attention to
3389 nesting of tokens other than parentheses, such as @code{begin...end}.
3390 @end deffn
3391
3392 @node Operator Precedence Grammars
3393 @subsubsection Operator Precedence Grammars
3394
3395 SMIE's precedence grammars simply give to each token a pair of
3396 precedences: the left-precedence and the right-precedence. We say
3397 @code{T1 < T2} if the right-precedence of token @code{T1} is less than
3398 the left-precedence of token @code{T2}. A good way to read this
3399 @code{<} is as a kind of parenthesis: if we find @code{... T1 something
3400 T2 ...} then that should be parsed as @code{... T1 (something T2 ...}
3401 rather than as @code{... T1 something) T2 ...}. The latter
3402 interpretation would be the case if we had @code{T1 > T2}. If we have
3403 @code{T1 = T2}, it means that token T2 follows token T1 in the same
3404 syntactic construction, so typically we have @code{"begin" = "end"}.
3405 Such pairs of precedences are sufficient to express left-associativity
3406 or right-associativity of infix operators, nesting of tokens like
3407 parentheses and many other cases.
3408
3409 @c Let's leave this undocumented to leave it more open for change!
3410 @c @defvar smie-grammar
3411 @c The value of this variable is an alist specifying the left and right
3412 @c precedence of each token. It is meant to be initialized by using one of
3413 @c the functions below.
3414 @c @end defvar
3415
3416 @defun smie-prec2->grammar table
3417 This function takes a @emph{prec2} grammar @var{table} and returns an
3418 alist suitable for use in @code{smie-setup}. The @emph{prec2}
3419 @var{table} is itself meant to be built by one of the functions below.
3420 @end defun
3421
3422 @defun smie-merge-prec2s &rest tables
3423 This function takes several @emph{prec2} @var{tables} and merges them
3424 into a new @emph{prec2} table.
3425 @end defun
3426
3427 @defun smie-precs->prec2 precs
3428 This function builds a @emph{prec2} table from a table of precedences
3429 @var{precs}. @var{precs} should be a list, sorted by precedence (for
3430 example @code{"+"} will come before @code{"*"}), of elements of the form
3431 @code{(@var{assoc} @var{op} ...)}, where each @var{op} is a token that
3432 acts as an operator; @var{assoc} is their associativity, which can be
3433 either @code{left}, @code{right}, @code{assoc}, or @code{nonassoc}.
3434 All operators in a given element share the same precedence level
3435 and associativity.
3436 @end defun
3437
3438 @defun smie-bnf->prec2 bnf &rest resolvers
3439 This function lets you specify the grammar using a BNF notation.
3440 It accepts a @var{bnf} description of the grammar along with a set of
3441 conflict resolution rules @var{resolvers}, and
3442 returns a @emph{prec2} table.
3443
3444 @var{bnf} is a list of nonterminal definitions of the form
3445 @code{(@var{nonterm} @var{rhs1} @var{rhs2} ...)} where each @var{rhs}
3446 is a (non-empty) list of terminals (aka tokens) or non-terminals.
3447
3448 Not all grammars are accepted:
3449 @itemize
3450 @item
3451 An @var{rhs} cannot be an empty list (an empty list is never needed,
3452 since SMIE allows all non-terminals to match the empty string anyway).
3453 @item
3454 An @var{rhs} cannot have 2 consecutive non-terminals: each pair of
3455 non-terminals needs to be separated by a terminal (aka token).
3456 This is a fundamental limitation of operator precedence grammars.
3457 @end itemize
3458
3459 Additionally, conflicts can occur:
3460 @itemize
3461 @item
3462 The returned @emph{prec2} table holds constraints between pairs of tokens, and
3463 for any given pair only one constraint can be present: T1 < T2,
3464 T1 = T2, or T1 > T2.
3465 @item
3466 A token can be an @code{opener} (something similar to an open-paren),
3467 a @code{closer} (like a close-paren), or @code{neither} of the two
3468 (e.g. an infix operator, or an inner token like @code{"else"}).
3469 @end itemize
3470
3471 Precedence conflicts can be resolved via @var{resolvers}, which
3472 is a list of @emph{precs} tables (see @code{smie-precs->prec2}): for
3473 each precedence conflict, if those @code{precs} tables
3474 specify a particular constraint, then the conflict is resolved by using
3475 this constraint instead, else a conflict is reported and one of the
3476 conflicting constraints is picked arbitrarily and the others are
3477 simply ignored.
3478 @end defun
3479
3480 @node SMIE Grammar
3481 @subsubsection Defining the Grammar of a Language
3482
3483 The usual way to define the SMIE grammar of a language is by
3484 defining a new global variable that holds the precedence table by
3485 giving a set of BNF rules.
3486 For example, the grammar definition for a small Pascal-like language
3487 could look like:
3488 @example
3489 @group
3490 (require 'smie)
3491 (defvar sample-smie-grammar
3492 (smie-prec2->grammar
3493 (smie-bnf->prec2
3494 @end group
3495 @group
3496 '((id)
3497 (inst ("begin" insts "end")
3498 ("if" exp "then" inst "else" inst)
3499 (id ":=" exp)
3500 (exp))
3501 (insts (insts ";" insts) (inst))
3502 (exp (exp "+" exp)
3503 (exp "*" exp)
3504 ("(" exps ")"))
3505 (exps (exps "," exps) (exp)))
3506 @end group
3507 @group
3508 '((assoc ";"))
3509 '((assoc ","))
3510 '((assoc "+") (assoc "*")))))
3511 @end group
3512 @end example
3513
3514 @noindent
3515 A few things to note:
3516
3517 @itemize
3518 @item
3519 The above grammar does not explicitly mention the syntax of function
3520 calls: SMIE will automatically allow any sequence of sexps, such as
3521 identifiers, balanced parentheses, or @code{begin ... end} blocks
3522 to appear anywhere anyway.
3523 @item
3524 The grammar category @code{id} has no right hand side: this does not
3525 mean that it can match only the empty string, since as mentioned any
3526 sequence of sexps can appear anywhere anyway.
3527 @item
3528 Because non terminals cannot appear consecutively in the BNF grammar, it
3529 is difficult to correctly handle tokens that act as terminators, so the
3530 above grammar treats @code{";"} as a statement @emph{separator} instead,
3531 which SMIE can handle very well.
3532 @item
3533 Separators used in sequences (such as @code{","} and @code{";"} above)
3534 are best defined with BNF rules such as @code{(foo (foo "separator" foo) ...)}
3535 which generate precedence conflicts which are then resolved by giving
3536 them an explicit @code{(assoc "separator")}.
3537 @item
3538 The @code{("(" exps ")")} rule was not needed to pair up parens, since
3539 SMIE will pair up any characters that are marked as having paren syntax
3540 in the syntax table. What this rule does instead (together with the
3541 definition of @code{exps}) is to make it clear that @code{","} should
3542 not appear outside of parentheses.
3543 @item
3544 Rather than have a single @emph{precs} table to resolve conflicts, it is
3545 preferable to have several tables, so as to let the BNF part of the
3546 grammar specify relative precedences where possible.
3547 @item
3548 Unless there is a very good reason to prefer @code{left} or
3549 @code{right}, it is usually preferable to mark operators as associative,
3550 using @code{assoc}. For that reason @code{"+"} and @code{"*"} are
3551 defined above as @code{assoc}, although the language defines them
3552 formally as left associative.
3553 @end itemize
3554
3555 @node SMIE Lexer
3556 @subsubsection Defining Tokens
3557
3558 SMIE comes with a predefined lexical analyzer which uses syntax tables
3559 in the following way: any sequence of characters that have word or
3560 symbol syntax is considered a token, and so is any sequence of
3561 characters that have punctuation syntax. This default lexer is
3562 often a good starting point but is rarely actually correct for any given
3563 language. For example, it will consider @code{"2,+3"} to be composed
3564 of 3 tokens: @code{"2"}, @code{",+"}, and @code{"3"}.
3565
3566 To describe the lexing rules of your language to SMIE, you need
3567 2 functions, one to fetch the next token, and another to fetch the
3568 previous token. Those functions will usually first skip whitespace and
3569 comments and then look at the next chunk of text to see if it
3570 is a special token. If so it should skip the token and
3571 return a description of this token. Usually this is simply the string
3572 extracted from the buffer, but it can be anything you want.
3573 For example:
3574 @example
3575 @group
3576 (defvar sample-keywords-regexp
3577 (regexp-opt '("+" "*" "," ";" ">" ">=" "<" "<=" ":=" "=")))
3578 @end group
3579 @group
3580 (defun sample-smie-forward-token ()
3581 (forward-comment (point-max))
3582 (cond
3583 ((looking-at sample-keywords-regexp)
3584 (goto-char (match-end 0))
3585 (match-string-no-properties 0))
3586 (t (buffer-substring-no-properties
3587 (point)
3588 (progn (skip-syntax-forward "w_")
3589 (point))))))
3590 @end group
3591 @group
3592 (defun sample-smie-backward-token ()
3593 (forward-comment (- (point)))
3594 (cond
3595 ((looking-back sample-keywords-regexp (- (point) 2) t)
3596 (goto-char (match-beginning 0))
3597 (match-string-no-properties 0))
3598 (t (buffer-substring-no-properties
3599 (point)
3600 (progn (skip-syntax-backward "w_")
3601 (point))))))
3602 @end group
3603 @end example
3604
3605 Notice how those lexers return the empty string when in front of
3606 parentheses. This is because SMIE automatically takes care of the
3607 parentheses defined in the syntax table. More specifically if the lexer
3608 returns nil or an empty string, SMIE tries to handle the corresponding
3609 text as a sexp according to syntax tables.
3610
3611 @node SMIE Tricks
3612 @subsubsection Living With a Weak Parser
3613
3614 The parsing technique used by SMIE does not allow tokens to behave
3615 differently in different contexts. For most programming languages, this
3616 manifests itself by precedence conflicts when converting the
3617 BNF grammar.
3618
3619 Sometimes, those conflicts can be worked around by expressing the
3620 grammar slightly differently. For example, for Modula-2 it might seem
3621 natural to have a BNF grammar that looks like this:
3622
3623 @example
3624 ...
3625 (inst ("IF" exp "THEN" insts "ELSE" insts "END")
3626 ("CASE" exp "OF" cases "END")
3627 ...)
3628 (cases (cases "|" cases)
3629 (caselabel ":" insts)
3630 ("ELSE" insts))
3631 ...
3632 @end example
3633
3634 But this will create conflicts for @code{"ELSE"}: on the one hand, the
3635 IF rule implies (among many other things) that @code{"ELSE" = "END"};
3636 but on the other hand, since @code{"ELSE"} appears within @code{cases},
3637 which appears left of @code{"END"}, we also have @code{"ELSE" > "END"}.
3638 We can solve the conflict either by using:
3639 @example
3640 ...
3641 (inst ("IF" exp "THEN" insts "ELSE" insts "END")
3642 ("CASE" exp "OF" cases "END")
3643 ("CASE" exp "OF" cases "ELSE" insts "END")
3644 ...)
3645 (cases (cases "|" cases) (caselabel ":" insts))
3646 ...
3647 @end example
3648 or
3649 @example
3650 ...
3651 (inst ("IF" exp "THEN" else "END")
3652 ("CASE" exp "OF" cases "END")
3653 ...)
3654 (else (insts "ELSE" insts))
3655 (cases (cases "|" cases) (caselabel ":" insts) (else))
3656 ...
3657 @end example
3658
3659 Reworking the grammar to try and solve conflicts has its downsides, tho,
3660 because SMIE assumes that the grammar reflects the logical structure of
3661 the code, so it is preferable to keep the BNF closer to the intended
3662 abstract syntax tree.
3663
3664 Other times, after careful consideration you may conclude that those
3665 conflicts are not serious and simply resolve them via the
3666 @var{resolvers} argument of @code{smie-bnf->prec2}. Usually this is
3667 because the grammar is simply ambiguous: the conflict does not affect
3668 the set of programs described by the grammar, but only the way those
3669 programs are parsed. This is typically the case for separators and
3670 associative infix operators, where you want to add a resolver like
3671 @code{'((assoc "|"))}. Another case where this can happen is for the
3672 classic @emph{dangling else} problem, where you will use @code{'((assoc
3673 "else" "then"))}. It can also happen for cases where the conflict is
3674 real and cannot really be resolved, but it is unlikely to pose a problem
3675 in practice.
3676
3677 Finally, in many cases some conflicts will remain despite all efforts to
3678 restructure the grammar. Do not despair: while the parser cannot be
3679 made more clever, you can make the lexer as smart as you want. So, the
3680 solution is then to look at the tokens involved in the conflict and to
3681 split one of those tokens into 2 (or more) different tokens. E.g. if
3682 the grammar needs to distinguish between two incompatible uses of the
3683 token @code{"begin"}, make the lexer return different tokens (say
3684 @code{"begin-fun"} and @code{"begin-plain"}) depending on which kind of
3685 @code{"begin"} it finds. This pushes the work of distinguishing the
3686 different cases to the lexer, which will thus have to look at the
3687 surrounding text to find ad-hoc clues.
3688
3689 @node SMIE Indentation
3690 @subsubsection Specifying Indentation Rules
3691
3692 Based on the provided grammar, SMIE will be able to provide automatic
3693 indentation without any extra effort. But in practice, this default
3694 indentation style will probably not be good enough. You will want to
3695 tweak it in many different cases.
3696
3697 SMIE indentation is based on the idea that indentation rules should be
3698 as local as possible. To this end, it relies on the idea of
3699 @emph{virtual} indentation, which is the indentation that a particular
3700 program point would have if it were at the beginning of a line.
3701 Of course, if that program point is indeed at the beginning of a line,
3702 its virtual indentation is its current indentation. But if not, then
3703 SMIE uses the indentation algorithm to compute the virtual indentation
3704 of that point. Now in practice, the virtual indentation of a program
3705 point does not have to be identical to the indentation it would have if
3706 we inserted a newline before it. To see how this works, the SMIE rule
3707 for indentation after a @code{@{} in C does not care whether the
3708 @code{@{} is standing on a line of its own or is at the end of the
3709 preceding line. Instead, these different cases are handled in the
3710 indentation rule that decides how to indent before a @code{@{}.
3711
3712 Another important concept is the notion of @emph{parent}: The
3713 @emph{parent} of a token, is the head token of the nearest enclosing
3714 syntactic construct. For example, the parent of an @code{else} is the
3715 @code{if} to which it belongs, and the parent of an @code{if}, in turn,
3716 is the lead token of the surrounding construct. The command
3717 @code{backward-sexp} jumps from a token to its parent, but there are
3718 some caveats: for @emph{openers} (tokens which start a construct, like
3719 @code{if}), you need to start with point before the token, while for
3720 others you need to start with point after the token.
3721 @code{backward-sexp} stops with point before the parent token if that is
3722 the @emph{opener} of the token of interest, and otherwise it stops with
3723 point after the parent token.
3724
3725 SMIE indentation rules are specified using a function that takes two
3726 arguments @var{method} and @var{arg} where the meaning of @var{arg} and the
3727 expected return value depend on @var{method}.
3728
3729 @var{method} can be:
3730 @itemize
3731 @item
3732 @code{:after}, in which case @var{arg} is a token and the function
3733 should return the @var{offset} to use for indentation after @var{arg}.
3734 @item
3735 @code{:before}, in which case @var{arg} is a token and the function
3736 should return the @var{offset} to use to indent @var{arg} itself.
3737 @item
3738 @code{:elem}, in which case the function should return either the offset
3739 to use to indent function arguments (if @var{arg} is the symbol
3740 @code{arg}) or the basic indentation step (if @var{arg} is the symbol
3741 @code{basic}).
3742 @item
3743 @code{:list-intro}, in which case @var{arg} is a token and the function
3744 should return non-@code{nil} if the token is followed by a list of
3745 expressions (not separated by any token) rather than an expression.
3746 @end itemize
3747
3748 When @var{arg} is a token, the function is called with point just before
3749 that token. A return value of nil always means to fallback on the
3750 default behavior, so the function should return nil for arguments it
3751 does not expect.
3752
3753 @var{offset} can be:
3754 @itemize
3755 @item
3756 @code{nil}: use the default indentation rule.
3757 @item
3758 @code{(column . @var{column})}: indent to column @var{column}.
3759 @item
3760 @var{number}: offset by @var{number}, relative to a base token which is
3761 the current token for @code{:after} and its parent for @code{:before}.
3762 @end itemize
3763
3764 @node SMIE Indentation Helpers
3765 @subsubsection Helper Functions for Indentation Rules
3766
3767 SMIE provides various functions designed specifically for use in the
3768 indentation rules function (several of those functions break if used in
3769 another context). These functions all start with the prefix
3770 @code{smie-rule-}.
3771
3772 @defun smie-rule-bolp
3773 Return non-@code{nil} if the current token is the first on the line.
3774 @end defun
3775
3776 @defun smie-rule-hanging-p
3777 Return non-@code{nil} if the current token is @emph{hanging}.
3778 A token is @emph{hanging} if it is the last token on the line
3779 and if it is preceded by other tokens: a lone token on a line is not
3780 hanging.
3781 @end defun
3782
3783 @defun smie-rule-next-p &rest tokens
3784 Return non-@code{nil} if the next token is among @var{tokens}.
3785 @end defun
3786
3787 @defun smie-rule-prev-p &rest tokens
3788 Return non-@code{nil} if the previous token is among @var{tokens}.
3789 @end defun
3790
3791 @defun smie-rule-parent-p &rest parents
3792 Return non-@code{nil} if the current token's parent is among @var{parents}.
3793 @end defun
3794
3795 @defun smie-rule-sibling-p
3796 Return non-nil if the current token's parent is actually a sibling.
3797 This is the case for example when the parent of a @code{","} is just the
3798 previous @code{","}.
3799 @end defun
3800
3801 @defun smie-rule-parent &optional offset
3802 Return the proper offset to align the current token with the parent.
3803 If non-@code{nil}, @var{offset} should be an integer giving an
3804 additional offset to apply.
3805 @end defun
3806
3807 @defun smie-rule-separator method
3808 Indent current token as a @emph{separator}.
3809
3810 By @emph{separator}, we mean here a token whose sole purpose is to
3811 separate various elements within some enclosing syntactic construct, and
3812 which does not have any semantic significance in itself (i.e. it would
3813 typically not exist as a node in an abstract syntax tree).
3814
3815 Such a token is expected to have an associative syntax and be closely
3816 tied to its syntactic parent. Typical examples are @code{","} in lists
3817 of arguments (enclosed inside parentheses), or @code{";"} in sequences
3818 of instructions (enclosed in a @code{@{...@}} or @code{begin...end}
3819 block).
3820
3821 @var{method} should be the method name that was passed to
3822 `smie-rules-function'.
3823 @end defun
3824
3825 @node SMIE Indentation Example
3826 @subsubsection Sample Indentation Rules
3827
3828 Here is an example of an indentation function:
3829
3830 @example
3831 (defun sample-smie-rules (kind token)
3832 (pcase (cons kind token)
3833 (`(:elem . basic) sample-indent-basic)
3834 (`(,_ . ",") (smie-rule-separator kind))
3835 (`(:after . ":=") sample-indent-basic)
3836 (`(:before . ,(or `"begin" `"(" `"@{")))
3837 (if (smie-rule-hanging-p) (smie-rule-parent)))
3838 (`(:before . "if")
3839 (and (not (smie-rule-bolp)) (smie-rule-prev-p "else")
3840 (smie-rule-parent)))))
3841 @end example
3842
3843 @noindent
3844 A few things to note:
3845
3846 @itemize
3847 @item
3848 The first case indicates the basic indentation increment to use.
3849 If @code{sample-indent-basic} is nil, then SMIE uses the global
3850 setting @code{smie-indent-basic}. The major mode could have set
3851 @code{smie-indent-basic} buffer-locally instead, but that
3852 is discouraged.
3853
3854 @item
3855 The rule for the token @code{","} make SMIE try to be more clever when
3856 the comma separator is placed at the beginning of lines. It tries to
3857 outdent the separator so as to align the code after the comma; for
3858 example:
3859
3860 @example
3861 x = longfunctionname (
3862 arg1
3863 , arg2
3864 );
3865 @end example
3866
3867 @item
3868 The rule for indentation after @code{":="} exists because otherwise
3869 SMIE would treat @code{":="} as an infix operator and would align the
3870 right argument with the left one.
3871
3872 @item
3873 The rule for indentation before @code{"begin"} is an example of the use
3874 of virtual indentation: This rule is used only when @code{"begin"} is
3875 hanging, which can happen only when @code{"begin"} is not at the
3876 beginning of a line. So this is not used when indenting
3877 @code{"begin"} itself but only when indenting something relative to this
3878 @code{"begin"}. Concretely, this rule changes the indentation from:
3879
3880 @example
3881 if x > 0 then begin
3882 dosomething(x);
3883 end
3884 @end example
3885 to
3886 @example
3887 if x > 0 then begin
3888 dosomething(x);
3889 end
3890 @end example
3891
3892 @item
3893 The rule for indentation before @code{"if"} is similar to the one for
3894 @code{"begin"}, but where the purpose is to treat @code{"else if"}
3895 as a single unit, so as to align a sequence of tests rather than indent
3896 each test further to the right. This function does this only in the
3897 case where the @code{"if"} is not placed on a separate line, hence the
3898 @code{smie-rule-bolp} test.
3899
3900 If we know that the @code{"else"} is always aligned with its @code{"if"}
3901 and is always at the beginning of a line, we can use a more efficient
3902 rule:
3903 @example
3904 ((equal token "if")
3905 (and (not (smie-rule-bolp))
3906 (smie-rule-prev-p "else")
3907 (save-excursion
3908 (sample-smie-backward-token)
3909 (cons 'column (current-column)))))
3910 @end example
3911
3912 The advantage of this formulation is that it reuses the indentation of
3913 the previous @code{"else"}, rather than going all the way back to the
3914 first @code{"if"} of the sequence.
3915 @end itemize
3916
3917 @node Desktop Save Mode
3918 @section Desktop Save Mode
3919 @cindex desktop save mode
3920
3921 @dfn{Desktop Save Mode} is a feature to save the state of Emacs from
3922 one session to another. The user-level commands for using Desktop
3923 Save Mode are described in the GNU Emacs Manual (@pxref{Saving Emacs
3924 Sessions,,, emacs, the GNU Emacs Manual}). Modes whose buffers visit
3925 a file, don't have to do anything to use this feature.
3926
3927 For buffers not visiting a file to have their state saved, the major
3928 mode must bind the buffer local variable @code{desktop-save-buffer} to
3929 a non-@code{nil} value.
3930
3931 @defvar desktop-save-buffer
3932 If this buffer-local variable is non-@code{nil}, the buffer will have
3933 its state saved in the desktop file at desktop save. If the value is
3934 a function, it is called at desktop save with argument
3935 @var{desktop-dirname}, and its value is saved in the desktop file along
3936 with the state of the buffer for which it was called. When file names
3937 are returned as part of the auxiliary information, they should be
3938 formatted using the call
3939
3940 @example
3941 (desktop-file-name @var{file-name} @var{desktop-dirname})
3942 @end example
3943
3944 @end defvar
3945
3946 For buffers not visiting a file to be restored, the major mode must
3947 define a function to do the job, and that function must be listed in
3948 the alist @code{desktop-buffer-mode-handlers}.
3949
3950 @defvar desktop-buffer-mode-handlers
3951 Alist with elements
3952
3953 @example
3954 (@var{major-mode} . @var{restore-buffer-function})
3955 @end example
3956
3957 The function @var{restore-buffer-function} will be called with
3958 argument list
3959
3960 @example
3961 (@var{buffer-file-name} @var{buffer-name} @var{desktop-buffer-misc})
3962 @end example
3963
3964 and it should return the restored buffer.
3965 Here @var{desktop-buffer-misc} is the value returned by the function
3966 optionally bound to @code{desktop-save-buffer}.
3967 @end defvar
3968
3969 @ignore
3970 Local Variables:
3971 fill-column: 72
3972 End:
3973 @end ignore