Doc updates for define-minor-mode argument behavior
[bpt/emacs.git] / doc / lispref / modes.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990-1995, 1998-1999, 2001-2012 Free Software Foundation, Inc.
4 @c See the file elisp.texi for copying conditions.
5 @setfilename ../../info/modes
6 @node Modes, Documentation, Keymaps, Top
7 @chapter Major and Minor Modes
8 @cindex mode
9
10 A @dfn{mode} is a set of definitions that customize Emacs and can be
11 turned on and off while you edit. There are two varieties of modes:
12 @dfn{major modes}, which are mutually exclusive and used for editing
13 particular kinds of text, and @dfn{minor modes}, which provide features
14 that users can enable individually.
15
16 This chapter describes how to write both major and minor modes, how to
17 indicate them in the mode line, and how they run hooks supplied by the
18 user. For related topics such as keymaps and syntax tables, see
19 @ref{Keymaps}, and @ref{Syntax Tables}.
20
21 @menu
22 * Hooks:: How to use hooks; how to write code that provides hooks.
23 * Major Modes:: Defining major modes.
24 * Minor Modes:: Defining minor modes.
25 * Mode Line Format:: Customizing the text that appears in the mode line.
26 * Imenu:: Providing a menu of definitions made in a buffer.
27 * Font Lock Mode:: How modes can highlight text according to syntax.
28 * Auto-Indentation:: How to teach Emacs to indent for a major mode.
29 * Desktop Save Mode:: How modes can have buffer state saved between
30 Emacs sessions.
31 @end menu
32
33 @node Hooks
34 @section Hooks
35 @cindex hooks
36
37 A @dfn{hook} is a variable where you can store a function or functions
38 to be called on a particular occasion by an existing program. Emacs
39 provides hooks for the sake of customization. Most often, hooks are set
40 up in the init file (@pxref{Init File}), but Lisp programs can set them also.
41 @xref{Standard Hooks}, for a list of standard hook variables.
42
43 @cindex normal hook
44 Most of the hooks in Emacs are @dfn{normal hooks}. These variables
45 contain lists of functions to be called with no arguments. By
46 convention, whenever the hook name ends in @samp{-hook}, that tells
47 you it is normal. We try to make all hooks normal, as much as
48 possible, so that you can use them in a uniform way.
49
50 Every major mode command is supposed to run a normal hook called the
51 @dfn{mode hook} as the one of the last steps of initialization. This
52 makes it easy for a user to customize the behavior of the mode, by
53 overriding the buffer-local variable assignments already made by the
54 mode. Most minor mode functions also run a mode hook at the end. But
55 hooks are used in other contexts too. For example, the hook
56 @code{suspend-hook} runs just before Emacs suspends itself
57 (@pxref{Suspending Emacs}).
58
59 The recommended way to add a hook function to a normal hook is by
60 calling @code{add-hook} (see below). The hook functions may be any of
61 the valid kinds of functions that @code{funcall} accepts (@pxref{What
62 Is a Function}). Most normal hook variables are initially void;
63 @code{add-hook} knows how to deal with this. You can add hooks either
64 globally or buffer-locally with @code{add-hook}.
65
66 @cindex abnormal hook
67 If the hook variable's name does not end with @samp{-hook}, that
68 indicates it is probably an @dfn{abnormal hook}. That means the hook
69 functions are called with arguments, or their return values are used
70 in some way. The hook's documentation says how the functions are
71 called. You can use @code{add-hook} to add a function to an abnormal
72 hook, but you must write the function to follow the hook's calling
73 convention.
74
75 By convention, abnormal hook names end in @samp{-functions} or
76 @samp{-hooks}. If the variable's name ends in @samp{-function}, then
77 its value is just a single function, not a list of functions.
78
79 @menu
80 * Running Hooks:: How to run a hook.
81 * Setting Hooks:: How to put functions on a hook, or remove them.
82 @end menu
83
84 @node Running Hooks
85 @subsection Running Hooks
86
87 In this section, we document the @code{run-hooks} function, which is
88 used to run a normal hook. We also document the functions for running
89 various kinds of abnormal hooks.
90
91 @defun run-hooks &rest hookvars
92 This function takes one or more normal hook variable names as
93 arguments, and runs each hook in turn. Each argument should be a
94 symbol that is a normal hook variable. These arguments are processed
95 in the order specified.
96
97 If a hook variable has a non-@code{nil} value, that value should be a
98 list of functions. @code{run-hooks} calls all the functions, one by
99 one, with no arguments.
100
101 The hook variable's value can also be a single function---either a
102 lambda expression or a symbol with a function definition---which
103 @code{run-hooks} calls. But this usage is obsolete.
104
105 If the hook variable is buffer-local, the buffer-local variable will
106 be used instead of the global variable. However, if the buffer-local
107 variable contains the element @code{t}, the global hook variable will
108 be run as well.
109 @end defun
110
111 @defun run-hook-with-args hook &rest args
112 This function runs an abnormal hook by calling all the hook functions in
113 @var{hook}, passing each one the arguments @var{args}.
114 @end defun
115
116 @defun run-hook-with-args-until-failure hook &rest args
117 This function runs an abnormal hook by calling each hook function in
118 turn, stopping if one of them ``fails'' by returning @code{nil}. Each
119 hook function is passed the arguments @var{args}. If this function
120 stops because one of the hook functions fails, it returns @code{nil};
121 otherwise it returns a non-@code{nil} value.
122 @end defun
123
124 @defun run-hook-with-args-until-success hook &rest args
125 This function runs an abnormal hook by calling each hook function,
126 stopping if one of them ``succeeds'' by returning a non-@code{nil}
127 value. Each hook function is passed the arguments @var{args}. If this
128 function stops because one of the hook functions returns a
129 non-@code{nil} value, it returns that value; otherwise it returns
130 @code{nil}.
131 @end defun
132
133 @defmac with-wrapper-hook hook args &rest body
134 This macro runs the abnormal hook @code{hook} as a series of nested
135 ``wrapper functions'' around the @var{body} forms. The effect is
136 similar to nested @code{around} advices (@pxref{Around-Advice}).
137
138 Each hook function should accept an argument list consisting of a function
139 @var{fun}, followed by the additional arguments listed in @var{args}.
140 The first hook function is passed a function @var{fun} that, if it is
141 called with arguments @var{args}, performs @var{body} (i.e., the default
142 operation). The @var{fun} passed to each successive hook function is
143 constructed from all the preceding hook functions (and @var{body}); if
144 this @var{fun} is called with arguments @var{args}, it does what the
145 @code{with-wrapper-hook} call would if the preceding hook functions were
146 the only ones in @var{hook}.
147
148 Each hook function may call its @var{fun} argument as many times as it
149 wishes, including never. In that case, such a hook function acts to
150 replace the default definition altogether, and any preceding hook
151 functions. Of course, a subsequent hook function may do the same thing.
152
153 Each hook function definition is used to construct the @var{fun} passed
154 to the next hook function in @var{hook}, if any. The last or
155 ``outermost'' @var{fun} is called once to produce the overall effect.
156
157 When might you want to use a wrapper hook? The function
158 @code{filter-buffer-substring} illustrates a common case. There is a
159 basic functionality, performed by @var{body}---in this case, to extract
160 a buffer-substring. Then any number of hook functions can act in
161 sequence to modify that string, before returning the final result.
162 A wrapper-hook also allows for a hook function to completely replace the
163 default definition (by not calling @var{fun}).
164 @end defmac
165
166 @defun run-hook-wrapped hook wrap-function &rest args
167 This function is similar to @code{run-hook-with-args-until-success}.
168 Like that function, it runs the functions on the abnormal hook
169 @code{hook}, stopping at the first one that returns non-@code{nil}.
170 Instead of calling the hook functions directly, though, it actually
171 calls @code{wrap-function} with arguments @code{fun} and @code{args}.
172 @end defun
173
174 @node Setting Hooks
175 @subsection Setting Hooks
176
177 Here's an example that uses a mode hook to turn on Auto Fill mode when
178 in Lisp Interaction mode:
179
180 @example
181 (add-hook 'lisp-interaction-mode-hook 'turn-on-auto-fill)
182 @end example
183
184 @defun add-hook hook function &optional append local
185 This function is the handy way to add function @var{function} to hook
186 variable @var{hook}. You can use it for abnormal hooks as well as for
187 normal hooks. @var{function} can be any Lisp function that can accept
188 the proper number of arguments for @var{hook}. For example,
189
190 @example
191 (add-hook 'text-mode-hook 'my-text-hook-function)
192 @end example
193
194 @noindent
195 adds @code{my-text-hook-function} to the hook called @code{text-mode-hook}.
196
197 If @var{function} is already present in @var{hook} (comparing using
198 @code{equal}), then @code{add-hook} does not add it a second time.
199
200 If @var{function} has a non-@code{nil} property
201 @code{permanent-local-hook}, then @code{kill-all-local-variables} (or
202 changing major modes) won't delete it from the hook variable's local
203 value.
204
205 It is best to design your hook functions so that the order in which
206 they are executed does not matter. Any dependence on the order is
207 asking for trouble. However, the order is predictable: normally,
208 @var{function} goes at the front of the hook list, so it will be
209 executed first (barring another @code{add-hook} call). If the
210 optional argument @var{append} is non-@code{nil}, the new hook
211 function goes at the end of the hook list and will be executed last.
212
213 @code{add-hook} can handle the cases where @var{hook} is void or its
214 value is a single function; it sets or changes the value to a list of
215 functions.
216
217 If @var{local} is non-@code{nil}, that says to add @var{function} to the
218 buffer-local hook list instead of to the global hook list. This makes
219 the hook buffer-local and adds @code{t} to the buffer-local value. The
220 latter acts as a flag to run the hook functions in the default value as
221 well as in the local value.
222 @end defun
223
224 @defun remove-hook hook function &optional local
225 This function removes @var{function} from the hook variable
226 @var{hook}. It compares @var{function} with elements of @var{hook}
227 using @code{equal}, so it works for both symbols and lambda
228 expressions.
229
230 If @var{local} is non-@code{nil}, that says to remove @var{function}
231 from the buffer-local hook list instead of from the global hook list.
232 @end defun
233
234 @node Major Modes
235 @section Major Modes
236 @cindex major mode
237
238 @cindex major mode command
239 Major modes specialize Emacs for editing particular kinds of text.
240 Each buffer has one major mode at a time. Every major mode is
241 associated with a @dfn{major mode command}, whose name should end in
242 @samp{-mode}. This command takes care of switching to that mode in the
243 current buffer, by setting various buffer-local variables such as a
244 local keymap. @xref{Major Mode Conventions}.
245
246 The least specialized major mode is called @dfn{Fundamental mode},
247 which has no mode-specific definitions or variable settings.
248
249 @deffn Command fundamental-mode
250 This is the major mode command for Fundamental mode. Unlike other mode
251 commands, it does @emph{not} run any mode hooks (@pxref{Major Mode
252 Conventions}), since you are not supposed to customize this mode.
253 @end deffn
254
255 The easiest way to write a major mode is to use the macro
256 @code{define-derived-mode}, which sets up the new mode as a variant of
257 an existing major mode. @xref{Derived Modes}. We recommend using
258 @code{define-derived-mode} even if the new mode is not an obvious
259 derivative of another mode, as it automatically enforces many coding
260 conventions for you. @xref{Basic Major Modes}, for common modes to
261 derive from.
262
263 The standard GNU Emacs Lisp directory tree contains the code for
264 several major modes, in files such as @file{text-mode.el},
265 @file{texinfo.el}, @file{lisp-mode.el}, and @file{rmail.el}. You can
266 study these libraries to see how modes are written.
267
268 @defopt major-mode
269 The buffer-local value of this variable holds the symbol for the current
270 major mode. Its default value holds the default major mode for new
271 buffers. The standard default value is @code{fundamental-mode}.
272
273 If the default value is @code{nil}, then whenever Emacs creates a new
274 buffer via a command such as @kbd{C-x b} (@code{switch-to-buffer}), the
275 new buffer is put in the major mode of the previously current buffer.
276 As an exception, if the major mode of the previous buffer has a
277 @code{mode-class} symbol property with value @code{special}, the new
278 buffer is put in Fundamental mode (@pxref{Major Mode Conventions}).
279 @end defopt
280
281 @menu
282 * Major Mode Conventions:: Coding conventions for keymaps, etc.
283 * Auto Major Mode:: How Emacs chooses the major mode automatically.
284 * Mode Help:: Finding out how to use a mode.
285 * Derived Modes:: Defining a new major mode based on another major
286 mode.
287 * Basic Major Modes:: Modes that other modes are often derived from.
288 * Generic Modes:: Defining a simple major mode that supports
289 comment syntax and Font Lock mode.
290 * Mode Hooks:: Hooks run at the end of major mode commands.
291 * Example Major Modes:: Text mode and Lisp modes.
292 @end menu
293
294 @node Major Mode Conventions
295 @subsection Major Mode Conventions
296 @cindex major mode conventions
297 @cindex conventions for writing major modes
298
299 The code for every major mode should follow various coding
300 conventions, including conventions for local keymap and syntax table
301 initialization, function and variable names, and hooks.
302
303 If you use the @code{define-derived-mode} macro, it will take care of
304 many of these conventions automatically. @xref{Derived Modes}. Note
305 also that fundamental mode is an exception to many of these conventions,
306 because its definition is to present the global state of Emacs.
307
308 The following list of conventions is only partial. Each major mode
309 should aim for consistency in general with other Emacs major modes, as
310 this makes Emacs as a whole more coherent. It is impossible to list
311 here all the possible points where this issue might come up; if the
312 Emacs developers point out an area where your major mode deviates from
313 the usual conventions, please make it compatible.
314
315 @itemize @bullet
316 @item
317 Define a major mode command whose name ends in @samp{-mode}. When
318 called with no arguments, this command should switch to the new mode in
319 the current buffer by setting up the keymap, syntax table, and
320 buffer-local variables in an existing buffer. It should not change the
321 buffer's contents.
322
323 @item
324 Write a documentation string for this command that describes the special
325 commands available in this mode. @xref{Mode Help}.
326
327 The documentation string may include the special documentation
328 substrings, @samp{\[@var{command}]}, @samp{\@{@var{keymap}@}}, and
329 @samp{\<@var{keymap}>}, which allow the help display to adapt
330 automatically to the user's own key bindings. @xref{Keys in
331 Documentation}.
332
333 @item
334 The major mode command should start by calling
335 @code{kill-all-local-variables}. This runs the normal hook
336 @code{change-major-mode-hook}, then gets rid of the buffer-local
337 variables of the major mode previously in effect. @xref{Creating
338 Buffer-Local}.
339
340 @item
341 The major mode command should set the variable @code{major-mode} to the
342 major mode command symbol. This is how @code{describe-mode} discovers
343 which documentation to print.
344
345 @item
346 The major mode command should set the variable @code{mode-name} to the
347 ``pretty'' name of the mode, usually a string (but see @ref{Mode Line
348 Data}, for other possible forms). The name of the mode appears
349 in the mode line.
350
351 @item
352 @cindex functions in modes
353 Since all global names are in the same name space, all the global
354 variables, constants, and functions that are part of the mode should
355 have names that start with the major mode name (or with an abbreviation
356 of it if the name is long). @xref{Coding Conventions}.
357
358 @item
359 In a major mode for editing some kind of structured text, such as a
360 programming language, indentation of text according to structure is
361 probably useful. So the mode should set @code{indent-line-function}
362 to a suitable function, and probably customize other variables
363 for indentation. @xref{Auto-Indentation}.
364
365 @item
366 @cindex keymaps in modes
367 The major mode should usually have its own keymap, which is used as the
368 local keymap in all buffers in that mode. The major mode command should
369 call @code{use-local-map} to install this local map. @xref{Active
370 Keymaps}, for more information.
371
372 This keymap should be stored permanently in a global variable named
373 @code{@var{modename}-mode-map}. Normally the library that defines the
374 mode sets this variable.
375
376 @xref{Tips for Defining}, for advice about how to write the code to set
377 up the mode's keymap variable.
378
379 @item
380 The key sequences bound in a major mode keymap should usually start with
381 @kbd{C-c}, followed by a control character, a digit, or @kbd{@{},
382 @kbd{@}}, @kbd{<}, @kbd{>}, @kbd{:} or @kbd{;}. The other punctuation
383 characters are reserved for minor modes, and ordinary letters are
384 reserved for users.
385
386 A major mode can also rebind the keys @kbd{M-n}, @kbd{M-p} and
387 @kbd{M-s}. The bindings for @kbd{M-n} and @kbd{M-p} should normally
388 be some kind of ``moving forward and backward,'' but this does not
389 necessarily mean cursor motion.
390
391 It is legitimate for a major mode to rebind a standard key sequence if
392 it provides a command that does ``the same job'' in a way better
393 suited to the text this mode is used for. For example, a major mode
394 for editing a programming language might redefine @kbd{C-M-a} to
395 ``move to the beginning of a function'' in a way that works better for
396 that language.
397
398 It is also legitimate for a major mode to rebind a standard key
399 sequence whose standard meaning is rarely useful in that mode. For
400 instance, minibuffer modes rebind @kbd{M-r}, whose standard meaning is
401 rarely of any use in the minibuffer. Major modes such as Dired or
402 Rmail that do not allow self-insertion of text can reasonably redefine
403 letters and other printing characters as special commands.
404
405 @item
406 Major modes for editing text should not define @key{RET} to do
407 anything other than insert a newline. However, it is ok for
408 specialized modes for text that users don't directly edit, such as
409 Dired and Info modes, to redefine @key{RET} to do something entirely
410 different.
411
412 @item
413 Major modes should not alter options that are primarily a matter of user
414 preference, such as whether Auto-Fill mode is enabled. Leave this to
415 each user to decide. However, a major mode should customize other
416 variables so that Auto-Fill mode will work usefully @emph{if} the user
417 decides to use it.
418
419 @item
420 @cindex syntax tables in modes
421 The mode may have its own syntax table or may share one with other
422 related modes. If it has its own syntax table, it should store this in
423 a variable named @code{@var{modename}-mode-syntax-table}. @xref{Syntax
424 Tables}.
425
426 @item
427 If the mode handles a language that has a syntax for comments, it should
428 set the variables that define the comment syntax. @xref{Options for
429 Comments,, Options Controlling Comments, emacs, The GNU Emacs Manual}.
430
431 @item
432 @cindex abbrev tables in modes
433 The mode may have its own abbrev table or may share one with other
434 related modes. If it has its own abbrev table, it should store this
435 in a variable named @code{@var{modename}-mode-abbrev-table}. If the
436 major mode command defines any abbrevs itself, it should pass @code{t}
437 for the @var{system-flag} argument to @code{define-abbrev}.
438 @xref{Defining Abbrevs}.
439
440 @item
441 The mode should specify how to do highlighting for Font Lock mode, by
442 setting up a buffer-local value for the variable
443 @code{font-lock-defaults} (@pxref{Font Lock Mode}).
444
445 @item
446 Each face that the mode defines should, if possible, inherit from an
447 existing Emacs face. @xref{Basic Faces}, and @ref{Faces for Font Lock}.
448
449 @item
450 The mode should specify how Imenu should find the definitions or
451 sections of a buffer, by setting up a buffer-local value for the
452 variable @code{imenu-generic-expression}, for the two variables
453 @code{imenu-prev-index-position-function} and
454 @code{imenu-extract-index-name-function}, or for the variable
455 @code{imenu-create-index-function} (@pxref{Imenu}).
456
457 @item
458 The mode can specify a local value for
459 @code{eldoc-documentation-function} to tell ElDoc mode how to handle
460 this mode.
461
462 @item
463 The mode can specify how to complete various keywords by adding
464 to the special hook @code{completion-at-point-functions}.
465
466 @item
467 Use @code{defvar} or @code{defcustom} to set mode-related variables, so
468 that they are not reinitialized if they already have a value. (Such
469 reinitialization could discard customizations made by the user.)
470
471 @item
472 @cindex buffer-local variables in modes
473 To make a buffer-local binding for an Emacs customization variable, use
474 @code{make-local-variable} in the major mode command, not
475 @code{make-variable-buffer-local}. The latter function would make the
476 variable local to every buffer in which it is subsequently set, which
477 would affect buffers that do not use this mode. It is undesirable for a
478 mode to have such global effects. @xref{Buffer-Local Variables}.
479
480 With rare exceptions, the only reasonable way to use
481 @code{make-variable-buffer-local} in a Lisp package is for a variable
482 which is used only within that package. Using it on a variable used by
483 other packages would interfere with them.
484
485 @item
486 @cindex mode hook
487 @cindex major mode hook
488 Each major mode should have a normal @dfn{mode hook} named
489 @code{@var{modename}-mode-hook}. The very last thing the major mode command
490 should do is to call @code{run-mode-hooks}. This runs the normal
491 hook @code{change-major-mode-after-body-hook}, the mode hook,
492 and then the normal hook @code{after-change-major-mode-hook}.
493 @xref{Mode Hooks}.
494
495 @item
496 The major mode command may start by calling some other major mode
497 command (called the @dfn{parent mode}) and then alter some of its
498 settings. A mode that does this is called a @dfn{derived mode}. The
499 recommended way to define one is to use the @code{define-derived-mode}
500 macro, but this is not required. Such a mode should call the parent
501 mode command inside a @code{delay-mode-hooks} form. (Using
502 @code{define-derived-mode} does this automatically.) @xref{Derived
503 Modes}, and @ref{Mode Hooks}.
504
505 @item
506 If something special should be done if the user switches a buffer from
507 this mode to any other major mode, this mode can set up a buffer-local
508 value for @code{change-major-mode-hook} (@pxref{Creating Buffer-Local}).
509
510 @item
511 If this mode is appropriate only for specially-prepared text produced by
512 the mode itself (rather than by the user typing at the keyboard or by an
513 external file), then the major mode command symbol should have a
514 property named @code{mode-class} with value @code{special}, put on as
515 follows:
516
517 @kindex mode-class @r{(property)}
518 @cindex @code{special} modes
519 @example
520 (put 'funny-mode 'mode-class 'special)
521 @end example
522
523 @noindent
524 This tells Emacs that new buffers created while the current buffer is in
525 Funny mode should not be put in Funny mode, even though the default
526 value of @code{major-mode} is @code{nil}. By default, the value of
527 @code{nil} for @code{major-mode} means to use the current buffer's major
528 mode when creating new buffers (@pxref{Auto Major Mode}), but with such
529 @code{special} modes, Fundamental mode is used instead. Modes such as
530 Dired, Rmail, and Buffer List use this feature.
531
532 The @code{define-derived-mode} macro automatically marks the derived
533 mode as special if the parent mode is special. Special mode is a
534 convenient parent for such modes to inherit from; @xref{Basic Major
535 Modes}.
536
537 @item
538 If you want to make the new mode the default for files with certain
539 recognizable names, add an element to @code{auto-mode-alist} to select
540 the mode for those file names (@pxref{Auto Major Mode}). If you
541 define the mode command to autoload, you should add this element in
542 the same file that calls @code{autoload}. If you use an autoload
543 cookie for the mode command, you can also use an autoload cookie for
544 the form that adds the element (@pxref{autoload cookie}). If you do
545 not autoload the mode command, it is sufficient to add the element in
546 the file that contains the mode definition.
547
548 @item
549 In the comments that document the file, you should provide a sample
550 @code{autoload} form and an example of how to add to
551 @code{auto-mode-alist}, that users can include in their init files
552 (@pxref{Init File}).
553
554 @item
555 @cindex mode loading
556 The top-level forms in the file defining the mode should be written so
557 that they may be evaluated more than once without adverse consequences.
558 Even if you never load the file more than once, someone else will.
559 @end itemize
560
561 @node Auto Major Mode
562 @subsection How Emacs Chooses a Major Mode
563 @cindex major mode, automatic selection
564
565 Based on information in the file name or in the file itself, Emacs
566 automatically selects a major mode for the new buffer when a file is
567 visited. It also processes local variables specified in the file text.
568
569 @deffn Command normal-mode &optional find-file
570 This function establishes the proper major mode and buffer-local variable
571 bindings for the current buffer. First it calls @code{set-auto-mode}
572 (see below), then it runs @code{hack-local-variables} to parse, and
573 bind or evaluate as appropriate, the file's local variables
574 (@pxref{File Local Variables}).
575
576 If the @var{find-file} argument to @code{normal-mode} is non-@code{nil},
577 @code{normal-mode} assumes that the @code{find-file} function is calling
578 it. In this case, it may process local variables in the @samp{-*-}
579 line or at the end of the file. The variable
580 @code{enable-local-variables} controls whether to do so. @xref{File
581 Variables, , Local Variables in Files, emacs, The GNU Emacs Manual},
582 for the syntax of the local variables section of a file.
583
584 If you run @code{normal-mode} interactively, the argument
585 @var{find-file} is normally @code{nil}. In this case,
586 @code{normal-mode} unconditionally processes any file local variables.
587
588 The function calls @code{set-auto-mode} to choose a major mode. If this
589 does not specify a mode, the buffer stays in the major mode determined
590 by the default value of @code{major-mode} (see below).
591
592 @cindex file mode specification error
593 @code{normal-mode} uses @code{condition-case} around the call to the
594 major mode command, so errors are caught and reported as a @samp{File
595 mode specification error}, followed by the original error message.
596 @end deffn
597
598 @defun set-auto-mode &optional keep-mode-if-same
599 @cindex visited file mode
600 This function selects the major mode that is appropriate for the
601 current buffer. It bases its decision (in order of precedence) on the
602 @w{@samp{-*-}} line, on any @samp{mode:} local variable near the end of
603 a file, on the @w{@samp{#!}} line (using @code{interpreter-mode-alist}),
604 on the text at the beginning of the buffer (using
605 @code{magic-mode-alist}), and finally on the visited file name (using
606 @code{auto-mode-alist}). @xref{Choosing Modes, , How Major Modes are
607 Chosen, emacs, The GNU Emacs Manual}. If @code{enable-local-variables}
608 is @code{nil}, @code{set-auto-mode} does not check the @w{@samp{-*-}}
609 line, or near the end of the file, for any mode tag.
610
611 @vindex inhibit-local-variables-regexps
612 There are some file types where it is not appropriate to scan the file
613 contents for a mode specifier. For example, a tar archive may happen to
614 contain, near the end of the file, a member file that has a local
615 variables section specifying a mode for that particular file. This
616 should not be applied to the containing tar file. Similarly, a tiff
617 image file might just happen to contain a first line that seems to
618 match the @w{@samp{-*-}} pattern. For these reasons, both these file
619 extensions are members of the list @var{inhibit-local-variables-regexps}.
620 Add patterns to this list to prevent Emacs searching them for local
621 variables of any kind (not just mode specifiers).
622
623 If @var{keep-mode-if-same} is non-@code{nil}, this function does not
624 call the mode command if the buffer is already in the proper major
625 mode. For instance, @code{set-visited-file-name} sets this to
626 @code{t} to avoid killing buffer local variables that the user may
627 have set.
628 @end defun
629
630 @defun set-buffer-major-mode buffer
631 This function sets the major mode of @var{buffer} to the default value of
632 @code{major-mode}; if that is @code{nil}, it uses the
633 current buffer's major mode (if that is suitable). As an exception,
634 if @var{buffer}'s name is @samp{*scratch*}, it sets the mode to
635 @code{initial-major-mode}.
636
637 The low-level primitives for creating buffers do not use this function,
638 but medium-level commands such as @code{switch-to-buffer} and
639 @code{find-file-noselect} use it whenever they create buffers.
640 @end defun
641
642 @defopt initial-major-mode
643 @cindex @samp{*scratch*}
644 The value of this variable determines the major mode of the initial
645 @samp{*scratch*} buffer. The value should be a symbol that is a major
646 mode command. The default value is @code{lisp-interaction-mode}.
647 @end defopt
648
649 @defvar interpreter-mode-alist
650 This variable specifies major modes to use for scripts that specify a
651 command interpreter in a @samp{#!} line. Its value is an alist with
652 elements of the form @code{(@var{interpreter} . @var{mode})}; for
653 example, @code{("perl" . perl-mode)} is one element present by
654 default. The element says to use mode @var{mode} if the file
655 specifies an interpreter which matches @var{interpreter}.
656 @end defvar
657
658 @defvar magic-mode-alist
659 This variable's value is an alist with elements of the form
660 @code{(@var{regexp} . @var{function})}, where @var{regexp} is a
661 regular expression and @var{function} is a function or @code{nil}.
662 After visiting a file, @code{set-auto-mode} calls @var{function} if
663 the text at the beginning of the buffer matches @var{regexp} and
664 @var{function} is non-@code{nil}; if @var{function} is @code{nil},
665 @code{auto-mode-alist} gets to decide the mode.
666 @end defvar
667
668 @defvar magic-fallback-mode-alist
669 This works like @code{magic-mode-alist}, except that it is handled
670 only if @code{auto-mode-alist} does not specify a mode for this file.
671 @end defvar
672
673 @defvar auto-mode-alist
674 This variable contains an association list of file name patterns
675 (regular expressions) and corresponding major mode commands. Usually,
676 the file name patterns test for suffixes, such as @samp{.el} and
677 @samp{.c}, but this need not be the case. An ordinary element of the
678 alist looks like @code{(@var{regexp} . @var{mode-function})}.
679
680 For example,
681
682 @smallexample
683 @group
684 (("\\`/tmp/fol/" . text-mode)
685 ("\\.texinfo\\'" . texinfo-mode)
686 ("\\.texi\\'" . texinfo-mode)
687 @end group
688 @group
689 ("\\.el\\'" . emacs-lisp-mode)
690 ("\\.c\\'" . c-mode)
691 ("\\.h\\'" . c-mode)
692 @dots{})
693 @end group
694 @end smallexample
695
696 When you visit a file whose expanded file name (@pxref{File Name
697 Expansion}), with version numbers and backup suffixes removed using
698 @code{file-name-sans-versions} (@pxref{File Name Components}), matches
699 a @var{regexp}, @code{set-auto-mode} calls the corresponding
700 @var{mode-function}. This feature enables Emacs to select the proper
701 major mode for most files.
702
703 If an element of @code{auto-mode-alist} has the form @code{(@var{regexp}
704 @var{function} t)}, then after calling @var{function}, Emacs searches
705 @code{auto-mode-alist} again for a match against the portion of the file
706 name that did not match before. This feature is useful for
707 uncompression packages: an entry of the form @code{("\\.gz\\'"
708 @var{function} t)} can uncompress the file and then put the uncompressed
709 file in the proper mode according to the name sans @samp{.gz}.
710
711 Here is an example of how to prepend several pattern pairs to
712 @code{auto-mode-alist}. (You might use this sort of expression in your
713 init file.)
714
715 @smallexample
716 @group
717 (setq auto-mode-alist
718 (append
719 ;; @r{File name (within directory) starts with a dot.}
720 '(("/\\.[^/]*\\'" . fundamental-mode)
721 ;; @r{File name has no dot.}
722 ("/[^\\./]*\\'" . fundamental-mode)
723 ;; @r{File name ends in @samp{.C}.}
724 ("\\.C\\'" . c++-mode))
725 auto-mode-alist))
726 @end group
727 @end smallexample
728 @end defvar
729
730 @node Mode Help
731 @subsection Getting Help about a Major Mode
732 @cindex mode help
733 @cindex help for major mode
734 @cindex documentation for major mode
735
736 The @code{describe-mode} function provides information about major
737 modes. It is normally bound to @kbd{C-h m}. It uses the value of the
738 variable @code{major-mode} (@pxref{Major Modes}), which is why every
739 major mode command needs to set that variable.
740
741 @deffn Command describe-mode
742 This function displays the documentation of the current major mode.
743
744 The @code{describe-mode} function calls the @code{documentation}
745 function using the value of @code{major-mode} as an argument. Thus, it
746 displays the documentation string of the major mode command.
747 (@xref{Accessing Documentation}.)
748 @end deffn
749
750 @node Derived Modes
751 @subsection Defining Derived Modes
752 @cindex derived mode
753
754 The recommended way to define a new major mode is to derive it from an
755 existing one using @code{define-derived-mode}. If there is no closely
756 related mode, you should inherit from either @code{text-mode},
757 @code{special-mode}, or @code{prog-mode}. @xref{Basic Major Modes}. If
758 none of these are suitable, you can inherit from @code{fundamental-mode}
759 (@pxref{Major Modes}).
760
761 @defmac define-derived-mode variant parent name docstring keyword-args@dots{} body@dots{}
762 This macro defines @var{variant} as a major mode command, using
763 @var{name} as the string form of the mode name. @var{variant} and
764 @var{parent} should be unquoted symbols.
765
766 The new command @var{variant} is defined to call the function
767 @var{parent}, then override certain aspects of that parent mode:
768
769 @itemize @bullet
770 @item
771 The new mode has its own sparse keymap, named
772 @code{@var{variant}-map}. @code{define-derived-mode}
773 makes the parent mode's keymap the parent of the new map, unless
774 @code{@var{variant}-map} is already set and already has a parent.
775
776 @item
777 The new mode has its own syntax table, kept in the variable
778 @code{@var{variant}-syntax-table}, unless you override this using the
779 @code{:syntax-table} keyword (see below). @code{define-derived-mode}
780 makes the parent mode's syntax-table the parent of
781 @code{@var{variant}-syntax-table}, unless the latter is already set
782 and already has a parent different from the standard syntax table.
783
784 @item
785 The new mode has its own abbrev table, kept in the variable
786 @code{@var{variant}-abbrev-table}, unless you override this using the
787 @code{:abbrev-table} keyword (see below).
788
789 @item
790 The new mode has its own mode hook, @code{@var{variant}-hook}. It
791 runs this hook, after running the hooks of its ancestor modes, with
792 @code{run-mode-hooks}, as the last thing it does. @xref{Mode Hooks}.
793 @end itemize
794
795 In addition, you can specify how to override other aspects of
796 @var{parent} with @var{body}. The command @var{variant}
797 evaluates the forms in @var{body} after setting up all its usual
798 overrides, just before running the mode hooks.
799
800 If @var{parent} has a non-@code{nil} @code{mode-class} symbol
801 property, then @code{define-derived-mode} sets the @code{mode-class}
802 property of @var{variant} to the same value. This ensures, for
803 example, that if @var{parent} is a special mode, then @var{variant} is
804 also a special mode (@pxref{Major Mode Conventions}).
805
806 You can also specify @code{nil} for @var{parent}. This gives the new
807 mode no parent. Then @code{define-derived-mode} behaves as described
808 above, but, of course, omits all actions connected with @var{parent}.
809
810 The argument @var{docstring} specifies the documentation string for
811 the new mode. @code{define-derived-mode} adds some general
812 information about the mode's hook, followed by the mode's keymap, at
813 the end of this docstring. If you omit @var{docstring},
814 @code{define-derived-mode} generates a documentation string.
815
816 The @var{keyword-args} are pairs of keywords and values. The values
817 are evaluated. The following keywords are currently supported:
818
819 @table @code
820 @item :syntax-table
821 You can use this to explicitly specify a syntax table for the new
822 mode. If you specify a @code{nil} value, the new mode uses the same
823 syntax table as @var{parent}, or the standard syntax table if
824 @var{parent} is @code{nil}. (Note that this does @emph{not} follow
825 the convention used for non-keyword arguments that a @code{nil} value
826 is equivalent with not specifying the argument.)
827
828 @item :abbrev-table
829 You can use this to explicitly specify an abbrev table for the new
830 mode. If you specify a @code{nil} value, the new mode uses the same
831 abbrev table as @var{parent}, or @code{fundamental-mode-abbrev-table}
832 if @var{parent} is @code{nil}. (Again, a @code{nil} value is
833 @emph{not} equivalent to not specifying this keyword.)
834
835 @item :group
836 If this is specified, the value should be the customization group for
837 this mode. (Not all major modes have one.) Only the (still
838 experimental and unadvertised) command @code{customize-mode} currently
839 uses this. @code{define-derived-mode} does @emph{not} automatically
840 define the specified customization group.
841 @end table
842
843 Here is a hypothetical example:
844
845 @example
846 (define-derived-mode hypertext-mode
847 text-mode "Hypertext"
848 "Major mode for hypertext.
849 \\@{hypertext-mode-map@}"
850 (setq case-fold-search nil))
851
852 (define-key hypertext-mode-map
853 [down-mouse-3] 'do-hyper-link)
854 @end example
855
856 Do not write an @code{interactive} spec in the definition;
857 @code{define-derived-mode} does that automatically.
858 @end defmac
859
860 @defun derived-mode-p &rest modes
861 This function returns non-@code{nil} if the current major mode is
862 derived from any of the major modes given by the symbols @var{modes}.
863 @end defun
864
865 @node Basic Major Modes
866 @subsection Basic Major Modes
867
868 Apart from Fundamental mode, there are three major modes that other
869 major modes commonly derive from: Text mode, Prog mode, and Special
870 mode. While Text mode is useful in its own right (e.g. for editing
871 files ending in @file{.txt}), Prog mode and Special mode exist mainly to
872 let other modes derive from them.
873
874 @vindex prog-mode-hook
875 As far as possible, new major modes should be derived, either directly
876 or indirectly, from one of these three modes. One reason is that this
877 allows users to customize a single mode hook
878 (e.g. @code{prog-mode-hook}) for an entire family of relevant modes
879 (e.g. all programming language modes).
880
881 @deffn Command text-mode
882 Text mode is a major mode for editing human languages. It defines the
883 @samp{"} and @samp{\} characters as having punctuation syntax
884 (@pxref{Syntax Class Table}), and binds @kbd{M-@key{TAB}} to
885 @code{ispell-complete-word} (@pxref{Spelling,,, emacs, The GNU Emacs
886 Manual}).
887
888 An example of a major mode derived from Text mode is HTML mode.
889 @xref{HTML Mode,,SGML and HTML Modes, emacs, The GNU Emacs Manual}.
890 @end deffn
891
892 @deffn Command prog-mode
893 Prog mode is a basic major mode for buffers containing programming
894 language source code. Most of the programming language major modes
895 built into Emacs are derived from it.
896
897 Prog mode binds @code{parse-sexp-ignore-comments} to @code{t}
898 (@pxref{Motion via Parsing}) and @code{bidi-paragraph-direction} to
899 @code{left-to-right} (@pxref{Bidirectional Display}).
900 @end deffn
901
902 @deffn Command special-mode
903 Special mode is a basic major mode for buffers containing text that is
904 produced specially by Emacs, rather than from a file. Major modes
905 derived from Special mode are given a @code{mode-class} property of
906 @code{special} (@pxref{Major Mode Conventions}).
907
908 Special mode sets the buffer to read-only. Its keymap defines several
909 common bindings, including @kbd{q} for @code{quit-window}, @kbd{z} for
910 @code{kill-this-buffer}, and @kbd{g} for @code{revert-buffer}
911 (@pxref{Reverting}).
912
913 An example of a major mode derived from Special mode is Buffer Menu
914 mode, which is used by the @samp{*Buffer List*} buffer. @xref{List
915 Buffers,,Listing Existing Buffers, emacs, The GNU Emacs Manual}.
916 @end deffn
917
918 @node Generic Modes
919 @subsection Generic Modes
920 @cindex generic mode
921
922 @dfn{Generic modes} are simple major modes with basic support for
923 comment syntax and Font Lock mode. To define a generic mode, use the
924 macro @code{define-generic-mode}. See the file @file{generic-x.el}
925 for some examples of the use of @code{define-generic-mode}.
926
927 @defmac define-generic-mode mode comment-list keyword-list font-lock-list auto-mode-list function-list &optional docstring
928 This macro defines a generic mode command named @var{mode} (a symbol,
929 not quoted). The optional argument @var{docstring} is the
930 documentation for the mode command. If you do not supply it,
931 @code{define-generic-mode} generates one by default.
932
933 The argument @var{comment-list} is a list in which each element is
934 either a character, a string of one or two characters, or a cons cell.
935 A character or a string is set up in the mode's syntax table as a
936 ``comment starter.'' If the entry is a cons cell, the @sc{car} is set
937 up as a ``comment starter'' and the @sc{cdr} as a ``comment ender.''
938 (Use @code{nil} for the latter if you want comments to end at the end
939 of the line.) Note that the syntax table mechanism has limitations
940 about what comment starters and enders are actually possible.
941 @xref{Syntax Tables}.
942
943 The argument @var{keyword-list} is a list of keywords to highlight
944 with @code{font-lock-keyword-face}. Each keyword should be a string.
945 Meanwhile, @var{font-lock-list} is a list of additional expressions to
946 highlight. Each element of this list should have the same form as an
947 element of @code{font-lock-keywords}. @xref{Search-based
948 Fontification}.
949
950 The argument @var{auto-mode-list} is a list of regular expressions to
951 add to the variable @code{auto-mode-alist}. They are added by the execution
952 of the @code{define-generic-mode} form, not by expanding the macro call.
953
954 Finally, @var{function-list} is a list of functions for the mode
955 command to call for additional setup. It calls these functions just
956 before it runs the mode hook variable @code{@var{mode}-hook}.
957 @end defmac
958
959 @node Mode Hooks
960 @subsection Mode Hooks
961
962 Every major mode command should finish by running the mode-independent
963 normal hook @code{change-major-mode-after-body-hook}, its mode hook,
964 and the normal hook @code{after-change-major-mode-hook}.
965 It does this by calling @code{run-mode-hooks}. If the major mode is a
966 derived mode, that is if it calls another major mode (the parent mode)
967 in its body, it should do this inside @code{delay-mode-hooks} so that
968 the parent won't run these hooks itself. Instead, the derived mode's
969 call to @code{run-mode-hooks} runs the parent's mode hook too.
970 @xref{Major Mode Conventions}.
971
972 Emacs versions before Emacs 22 did not have @code{delay-mode-hooks}.
973 Versions before 24 did not have @code{change-major-mode-after-body-hook}.
974 When user-implemented major modes do not use @code{run-mode-hooks} and
975 have not been updated to use these newer features, they won't entirely
976 follow these conventions: they may run the parent's mode hook too early,
977 or fail to run @code{after-change-major-mode-hook}. If you encounter
978 such a major mode, please correct it to follow these conventions.
979
980 When you defined a major mode using @code{define-derived-mode}, it
981 automatically makes sure these conventions are followed. If you
982 define a major mode ``by hand,'' not using @code{define-derived-mode},
983 use the following functions to handle these conventions automatically.
984
985 @defun run-mode-hooks &rest hookvars
986 Major modes should run their mode hook using this function. It is
987 similar to @code{run-hooks} (@pxref{Hooks}), but it also runs
988 @code{change-major-mode-after-body-hook} and
989 @code{after-change-major-mode-hook}.
990
991 When this function is called during the execution of a
992 @code{delay-mode-hooks} form, it does not run the hooks immediately.
993 Instead, it arranges for the next call to @code{run-mode-hooks} to run
994 them.
995 @end defun
996
997 @defmac delay-mode-hooks body@dots{}
998 When one major mode command calls another, it should do so inside of
999 @code{delay-mode-hooks}.
1000
1001 This macro executes @var{body}, but tells all @code{run-mode-hooks}
1002 calls during the execution of @var{body} to delay running their hooks.
1003 The hooks will actually run during the next call to
1004 @code{run-mode-hooks} after the end of the @code{delay-mode-hooks}
1005 construct.
1006 @end defmac
1007
1008 @defvar change-major-mode-after-body-hook
1009 This is a normal hook run by @code{run-mode-hooks}. It is run before
1010 the mode hooks.
1011 @end defvar
1012
1013 @defvar after-change-major-mode-hook
1014 This is a normal hook run by @code{run-mode-hooks}. It is run at the
1015 very end of every properly-written major mode command.
1016 @end defvar
1017
1018 @node Example Major Modes
1019 @subsection Major Mode Examples
1020
1021 Text mode is perhaps the simplest mode besides Fundamental mode.
1022 Here are excerpts from @file{text-mode.el} that illustrate many of
1023 the conventions listed above:
1024
1025 @smallexample
1026 @group
1027 ;; @r{Create the syntax table for this mode.}
1028 (defvar text-mode-syntax-table
1029 (let ((st (make-syntax-table)))
1030 (modify-syntax-entry ?\" ". " st)
1031 (modify-syntax-entry ?\\ ". " st)
1032 ;; Add `p' so M-c on `hello' leads to `Hello', not `hello'.
1033 (modify-syntax-entry ?' "w p" st)
1034 st)
1035 "Syntax table used while in `text-mode'.")
1036 @end group
1037
1038 ;; @r{Create the keymap for this mode.}
1039 @group
1040 (defvar text-mode-map
1041 (let ((map (make-sparse-keymap)))
1042 (define-key map "\e\t" 'ispell-complete-word)
1043 map)
1044 "Keymap for `text-mode'.
1045 Many other modes, such as `mail-mode', `outline-mode' and
1046 `indented-text-mode', inherit all the commands defined in this map.")
1047 @end group
1048 @end smallexample
1049
1050 Here is how the actual mode command is defined now:
1051
1052 @smallexample
1053 @group
1054 (define-derived-mode text-mode nil "Text"
1055 "Major mode for editing text written for humans to read.
1056 In this mode, paragraphs are delimited only by blank or white lines.
1057 You can thus get the full benefit of adaptive filling
1058 (see the variable `adaptive-fill-mode').
1059 \\@{text-mode-map@}
1060 Turning on Text mode runs the normal hook `text-mode-hook'."
1061 @end group
1062 @group
1063 (set (make-local-variable 'text-mode-variant) t)
1064 (set (make-local-variable 'require-final-newline)
1065 mode-require-final-newline)
1066 (set (make-local-variable 'indent-line-function) 'indent-relative))
1067 @end group
1068 @end smallexample
1069
1070 @noindent
1071 (The last line is redundant nowadays, since @code{indent-relative} is
1072 the default value, and we'll delete it in a future version.)
1073
1074 @cindex @file{lisp-mode.el}
1075 The three Lisp modes (Lisp mode, Emacs Lisp mode, and Lisp Interaction
1076 mode) have more features than Text mode and the code is correspondingly
1077 more complicated. Here are excerpts from @file{lisp-mode.el} that
1078 illustrate how these modes are written.
1079
1080 Here is how the Lisp mode syntax and abbrev tables are defined:
1081
1082 @cindex syntax table example
1083 @smallexample
1084 @group
1085 ;; @r{Create mode-specific table variables.}
1086 (defvar lisp-mode-abbrev-table nil)
1087 (define-abbrev-table 'lisp-mode-abbrev-table ())
1088
1089 (defvar lisp-mode-syntax-table
1090 (let ((table (copy-syntax-table emacs-lisp-mode-syntax-table)))
1091 (modify-syntax-entry ?\[ "_ " table)
1092 (modify-syntax-entry ?\] "_ " table)
1093 (modify-syntax-entry ?# "' 14" table)
1094 (modify-syntax-entry ?| "\" 23bn" table)
1095 table)
1096 "Syntax table used in `lisp-mode'.")
1097 @end group
1098 @end smallexample
1099
1100 The three modes for Lisp share much of their code. For instance,
1101 each calls the following function to set various variables:
1102
1103 @smallexample
1104 @group
1105 (defun lisp-mode-variables (&optional lisp-syntax keywords-case-insensitive)
1106 (when lisp-syntax
1107 (set-syntax-table lisp-mode-syntax-table))
1108 (setq local-abbrev-table lisp-mode-abbrev-table)
1109 @dots{}
1110 @end group
1111 @end smallexample
1112
1113 @noindent
1114 Amongst other things, this function sets up the @code{comment-start}
1115 variable to handle Lisp comments:
1116
1117 @smallexample
1118 @group
1119 (make-local-variable 'comment-start)
1120 (setq comment-start ";")
1121 @dots{}
1122 @end group
1123 @end smallexample
1124
1125 Each of the different Lisp modes has a slightly different keymap. For
1126 example, Lisp mode binds @kbd{C-c C-z} to @code{run-lisp}, but the other
1127 Lisp modes do not. However, all Lisp modes have some commands in
1128 common. The following code sets up the common commands:
1129
1130 @smallexample
1131 @group
1132 (defvar lisp-mode-shared-map
1133 (let ((map (make-sparse-keymap)))
1134 (define-key map "\e\C-q" 'indent-sexp)
1135 (define-key map "\177" 'backward-delete-char-untabify)
1136 map)
1137 "Keymap for commands shared by all sorts of Lisp modes.")
1138 @end group
1139 @end smallexample
1140
1141 @noindent
1142 And here is the code to set up the keymap for Lisp mode:
1143
1144 @smallexample
1145 @group
1146 (defvar lisp-mode-map
1147 (let ((map (make-sparse-keymap))
1148 (menu-map (make-sparse-keymap "Lisp")))
1149 (set-keymap-parent map lisp-mode-shared-map)
1150 (define-key map "\e\C-x" 'lisp-eval-defun)
1151 (define-key map "\C-c\C-z" 'run-lisp)
1152 @dots{}
1153 map)
1154 "Keymap for ordinary Lisp mode.
1155 All commands in `lisp-mode-shared-map' are inherited by this map.")
1156 @end group
1157 @end smallexample
1158
1159 @noindent
1160 Finally, here is the major mode command for Lisp mode:
1161
1162 @smallexample
1163 @group
1164 (define-derived-mode lisp-mode prog-mode "Lisp"
1165 "Major mode for editing Lisp code for Lisps other than GNU Emacs Lisp.
1166 Commands:
1167 Delete converts tabs to spaces as it moves back.
1168 Blank lines separate paragraphs. Semicolons start comments.
1169
1170 \\@{lisp-mode-map@}
1171 Note that `run-lisp' may be used either to start an inferior Lisp job
1172 or to switch back to an existing one.
1173 @end group
1174
1175 @group
1176 Entry to this mode calls the value of `lisp-mode-hook'
1177 if that value is non-nil."
1178 (lisp-mode-variables nil t)
1179 (set (make-local-variable 'find-tag-default-function) 'lisp-find-tag-default)
1180 (make-local-variable 'comment-start-skip)
1181 (setq comment-start-skip
1182 "\\(\\(^\\|[^\\\\\n]\\)\\(\\\\\\\\\\)*\\)\\(;+\\|#|\\) *")
1183 (setq imenu-case-fold-search t))
1184 @end group
1185 @end smallexample
1186
1187 @node Minor Modes
1188 @section Minor Modes
1189 @cindex minor mode
1190
1191 A @dfn{minor mode} provides features that users may enable or disable
1192 independently of the choice of major mode. Minor modes can be enabled
1193 individually or in combination. Minor modes would be better named
1194 ``generally available, optional feature modes,'' except that such a name
1195 would be unwieldy.
1196
1197 A minor mode is not usually meant as a variation of a single major mode.
1198 Usually they are general and can apply to many major modes. For
1199 example, Auto Fill mode works with any major mode that permits text
1200 insertion. To be general, a minor mode must be effectively independent
1201 of the things major modes do.
1202
1203 A minor mode is often much more difficult to implement than a major
1204 mode. One reason is that you should be able to activate and deactivate
1205 minor modes in any order. A minor mode should be able to have its
1206 desired effect regardless of the major mode and regardless of the other
1207 minor modes in effect.
1208
1209 Often the biggest problem in implementing a minor mode is finding a
1210 way to insert the necessary hook into the rest of Emacs. Minor mode
1211 keymaps make this easier than it used to be.
1212
1213 @defvar minor-mode-list
1214 The value of this variable is a list of all minor mode commands.
1215 @end defvar
1216
1217 @menu
1218 * Minor Mode Conventions:: Tips for writing a minor mode.
1219 * Keymaps and Minor Modes:: How a minor mode can have its own keymap.
1220 * Defining Minor Modes:: A convenient facility for defining minor modes.
1221 @end menu
1222
1223 @node Minor Mode Conventions
1224 @subsection Conventions for Writing Minor Modes
1225 @cindex minor mode conventions
1226 @cindex conventions for writing minor modes
1227
1228 There are conventions for writing minor modes just as there are for
1229 major modes. Several of the major mode conventions apply to minor
1230 modes as well: those regarding the name of the mode initialization
1231 function, the names of global symbols, the use of a hook at the end of
1232 the initialization function, and the use of keymaps and other tables.
1233
1234 In addition, there are several conventions that are specific to
1235 minor modes. (The easiest way to follow all the conventions is to use
1236 the macro @code{define-minor-mode}; @ref{Defining Minor Modes}.)
1237
1238 @itemize @bullet
1239 @item
1240 @cindex mode variable
1241 Make a variable whose name ends in @samp{-mode} to control the minor
1242 mode. We call this the @dfn{mode variable}. The minor mode command
1243 should set this variable (@code{nil} to disable; anything else to
1244 enable).
1245
1246 If possible, implement the mode so that setting the variable
1247 automatically enables or disables the mode. Then the minor mode command
1248 does not need to do anything except set the variable.
1249
1250 This variable is used in conjunction with the @code{minor-mode-alist} to
1251 display the minor mode name in the mode line. It can also enable
1252 or disable a minor mode keymap. Individual commands or hooks can also
1253 check the variable's value.
1254
1255 If you want the minor mode to be enabled separately in each buffer,
1256 make the variable buffer-local.
1257
1258 @item
1259 Define a command whose name is the same as the mode variable.
1260 Its job is to enable and disable the mode by setting the variable.
1261
1262 The command should accept one optional argument. If the argument is
1263 @code{nil}, it should toggle the mode (turn it on if it is off, and
1264 off if it is on). It should turn the mode on if the argument is a
1265 positive integer, the symbol @code{t}, or a list whose @sc{car} is one
1266 of those. It should turn the mode off if the argument is a negative
1267 integer or zero, the symbol @code{-}, or a list whose @sc{car} is a
1268 negative integer or zero. The meaning of other arguments is not
1269 specified.
1270
1271 Here is an example taken from the definition of @code{transient-mark-mode}.
1272 It shows the use of @code{transient-mark-mode} as a variable that enables or
1273 disables the mode's behavior, and also shows the proper way to toggle,
1274 enable or disable the minor mode based on the raw prefix argument value.
1275
1276 @smallexample
1277 @group
1278 (setq transient-mark-mode
1279 (if (null arg) (not transient-mark-mode)
1280 (> (prefix-numeric-value arg) 0)))
1281 @end group
1282 @end smallexample
1283
1284 @item
1285 Add an element to @code{minor-mode-alist} for each minor mode
1286 (@pxref{Definition of minor-mode-alist}), if you want to indicate the
1287 minor mode in the mode line. This element should be a list of the
1288 following form:
1289
1290 @smallexample
1291 (@var{mode-variable} @var{string})
1292 @end smallexample
1293
1294 Here @var{mode-variable} is the variable that controls enabling of the
1295 minor mode, and @var{string} is a short string, starting with a space,
1296 to represent the mode in the mode line. These strings must be short so
1297 that there is room for several of them at once.
1298
1299 When you add an element to @code{minor-mode-alist}, use @code{assq} to
1300 check for an existing element, to avoid duplication. For example:
1301
1302 @smallexample
1303 @group
1304 (unless (assq 'leif-mode minor-mode-alist)
1305 (setq minor-mode-alist
1306 (cons '(leif-mode " Leif") minor-mode-alist)))
1307 @end group
1308 @end smallexample
1309
1310 @noindent
1311 or like this, using @code{add-to-list} (@pxref{List Variables}):
1312
1313 @smallexample
1314 @group
1315 (add-to-list 'minor-mode-alist '(leif-mode " Leif"))
1316 @end group
1317 @end smallexample
1318 @end itemize
1319
1320 Global minor modes distributed with Emacs should if possible support
1321 enabling and disabling via Custom (@pxref{Customization}). To do this,
1322 the first step is to define the mode variable with @code{defcustom}, and
1323 specify @code{:type 'boolean}.
1324
1325 If just setting the variable is not sufficient to enable the mode, you
1326 should also specify a @code{:set} method which enables the mode by
1327 invoking the mode command. Note in the variable's documentation string that
1328 setting the variable other than via Custom may not take effect.
1329
1330 Also mark the definition with an autoload cookie (@pxref{autoload cookie}),
1331 and specify a @code{:require} so that customizing the variable will load
1332 the library that defines the mode. This will copy suitable definitions
1333 into @file{loaddefs.el} so that users can use @code{customize-option} to
1334 enable the mode. For example:
1335
1336 @smallexample
1337 @group
1338
1339 ;;;###autoload
1340 (defcustom msb-mode nil
1341 "Toggle msb-mode.
1342 Setting this variable directly does not take effect;
1343 use either \\[customize] or the function `msb-mode'."
1344 :set 'custom-set-minor-mode
1345 :initialize 'custom-initialize-default
1346 :version "20.4"
1347 :type 'boolean
1348 :group 'msb
1349 :require 'msb)
1350 @end group
1351 @end smallexample
1352
1353 @node Keymaps and Minor Modes
1354 @subsection Keymaps and Minor Modes
1355
1356 Each minor mode can have its own keymap, which is active when the mode
1357 is enabled. To set up a keymap for a minor mode, add an element to the
1358 alist @code{minor-mode-map-alist}. @xref{Definition of minor-mode-map-alist}.
1359
1360 @cindex @code{self-insert-command}, minor modes
1361 One use of minor mode keymaps is to modify the behavior of certain
1362 self-inserting characters so that they do something else as well as
1363 self-insert. (Another way to customize @code{self-insert-command} is
1364 through @code{post-self-insert-hook}. Apart from this, the facilities
1365 for customizing @code{self-insert-command} are limited to special cases,
1366 designed for abbrevs and Auto Fill mode. Do not try substituting your
1367 own definition of @code{self-insert-command} for the standard one. The
1368 editor command loop handles this function specially.)
1369
1370 The key sequences bound in a minor mode should consist of @kbd{C-c}
1371 followed by one of @kbd{.,/?`'"[]\|~!#$%^&*()-_+=}. (The other
1372 punctuation characters are reserved for major modes.)
1373
1374 @node Defining Minor Modes
1375 @subsection Defining Minor Modes
1376
1377 The macro @code{define-minor-mode} offers a convenient way of
1378 implementing a mode in one self-contained definition.
1379
1380 @defmac define-minor-mode mode doc [init-value [lighter [keymap]]] keyword-args@dots{} body@dots{}
1381 This macro defines a new minor mode whose name is @var{mode} (a
1382 symbol). It defines a command named @var{mode} to toggle the minor
1383 mode, with @var{doc} as its documentation string.
1384
1385 The toggle command takes one optional (prefix) argument.
1386 If called interactively with no argument it toggles the mode on or off.
1387 A positive prefix argument enables the mode, any other prefix argument
1388 disables it. From Lisp, an argument of @code{toggle} toggles the mode,
1389 whereas an omitted or @code{nil} argument enables the mode.
1390 This makes it easy to enable the minor mode in a major mode hook, for example.
1391 If @var{doc} is nil, the macro supplies a default documentation string
1392 explaining the above.
1393
1394 By default, it also defines a variable named @var{mode}, which is set to
1395 @code{t} or @code{nil} by enabling or disabling the mode. The variable
1396 is initialized to @var{init-value}. Except in unusual circumstances
1397 (see below), this value must be @code{nil}.
1398
1399 The string @var{lighter} says what to display in the mode line
1400 when the mode is enabled; if it is @code{nil}, the mode is not displayed
1401 in the mode line.
1402
1403 The optional argument @var{keymap} specifies the keymap for the minor
1404 mode. If non-@code{nil}, it should be a variable name (whose value is
1405 a keymap), a keymap, or an alist of the form
1406
1407 @example
1408 (@var{key-sequence} . @var{definition})
1409 @end example
1410
1411 @noindent
1412 where each @var{key-sequence} and @var{definition} are arguments
1413 suitable for passing to @code{define-key} (@pxref{Changing Key
1414 Bindings}). If @var{keymap} is a keymap or an alist, this also
1415 defines the variable @code{@var{mode}-map}.
1416
1417 The above three arguments @var{init-value}, @var{lighter}, and
1418 @var{keymap} can be (partially) omitted when @var{keyword-args} are
1419 used. The @var{keyword-args} consist of keywords followed by
1420 corresponding values. A few keywords have special meanings:
1421
1422 @table @code
1423 @item :group @var{group}
1424 Custom group name to use in all generated @code{defcustom} forms.
1425 Defaults to @var{mode} without the possible trailing @samp{-mode}.
1426 @strong{Warning:} don't use this default group name unless you have
1427 written a @code{defgroup} to define that group properly. @xref{Group
1428 Definitions}.
1429
1430 @item :global @var{global}
1431 If non-@code{nil}, this specifies that the minor mode should be global
1432 rather than buffer-local. It defaults to @code{nil}.
1433
1434 One of the effects of making a minor mode global is that the
1435 @var{mode} variable becomes a customization variable. Toggling it
1436 through the Custom interface turns the mode on and off, and its value
1437 can be saved for future Emacs sessions (@pxref{Saving
1438 Customizations,,, emacs, The GNU Emacs Manual}. For the saved
1439 variable to work, you should ensure that the @code{define-minor-mode}
1440 form is evaluated each time Emacs starts; for packages that are not
1441 part of Emacs, the easiest way to do this is to specify a
1442 @code{:require} keyword.
1443
1444 @item :init-value @var{init-value}
1445 This is equivalent to specifying @var{init-value} positionally.
1446
1447 @item :lighter @var{lighter}
1448 This is equivalent to specifying @var{lighter} positionally.
1449
1450 @item :keymap @var{keymap}
1451 This is equivalent to specifying @var{keymap} positionally.
1452
1453 @item :variable @var{place}
1454 This replaces the default variable @var{mode}, used to store the state
1455 of the mode. If you specify this, the @var{mode} variable is not
1456 defined, and any @var{init-value} argument is unused. @var{place}
1457 can be a different named variable (which you must define yourself), or
1458 anything that can be used with the @code{setf} function
1459 (@pxref{Generalized Variables,,, cl, Common Lisp Extensions}).
1460 @var{place} can also be a cons @code{(@var{get} . @var{set})},
1461 where @var{get} is an expression that returns the current state,
1462 and @var{set} is a function of one argument (a state) that sets it.
1463 @end table
1464
1465 Any other keyword arguments are passed directly to the
1466 @code{defcustom} generated for the variable @var{mode}.
1467
1468 The command named @var{mode} first performs the standard actions such
1469 as setting the variable named @var{mode} and then executes the
1470 @var{body} forms, if any. It finishes by running the mode hook
1471 variable @code{@var{mode}-hook}.
1472 @end defmac
1473
1474 The initial value must be @code{nil} except in cases where (1) the
1475 mode is preloaded in Emacs, or (2) it is painless for loading to
1476 enable the mode even though the user did not request it. For
1477 instance, if the mode has no effect unless something else is enabled,
1478 and will always be loaded by that time, enabling it by default is
1479 harmless. But these are unusual circumstances. Normally, the
1480 initial value must be @code{nil}.
1481
1482 @findex easy-mmode-define-minor-mode
1483 The name @code{easy-mmode-define-minor-mode} is an alias
1484 for this macro.
1485
1486 Here is an example of using @code{define-minor-mode}:
1487
1488 @smallexample
1489 (define-minor-mode hungry-mode
1490 "Toggle Hungry mode.
1491 Interactively with no argument, this command toggles the mode.
1492 A positive prefix argument enables the mode, any other prefix
1493 argument disables it. From Lisp, argument omitted or nil enables
1494 the mode, `toggle' toggles the state.
1495
1496 When Hungry mode is enabled, the control delete key
1497 gobbles all preceding whitespace except the last.
1498 See the command \\[hungry-electric-delete]."
1499 ;; The initial value.
1500 nil
1501 ;; The indicator for the mode line.
1502 " Hungry"
1503 ;; The minor mode bindings.
1504 '(([C-backspace] . hungry-electric-delete))
1505 :group 'hunger)
1506 @end smallexample
1507
1508 @noindent
1509 This defines a minor mode named ``Hungry mode,'' a command named
1510 @code{hungry-mode} to toggle it, a variable named @code{hungry-mode}
1511 which indicates whether the mode is enabled, and a variable named
1512 @code{hungry-mode-map} which holds the keymap that is active when the
1513 mode is enabled. It initializes the keymap with a key binding for
1514 @kbd{C-@key{DEL}}. It puts the variable @code{hungry-mode} into
1515 custom group @code{hunger}. There are no @var{body} forms---many
1516 minor modes don't need any.
1517
1518 Here's an equivalent way to write it:
1519
1520 @smallexample
1521 (define-minor-mode hungry-mode
1522 "Toggle Hungry mode.
1523 ...rest of documentation as before..."
1524 ;; The initial value.
1525 :init-value nil
1526 ;; The indicator for the mode line.
1527 :lighter " Hungry"
1528 ;; The minor mode bindings.
1529 :keymap
1530 '(([C-backspace] . hungry-electric-delete)
1531 ([C-M-backspace]
1532 . (lambda ()
1533 (interactive)
1534 (hungry-electric-delete t))))
1535 :group 'hunger)
1536 @end smallexample
1537
1538 @defmac define-globalized-minor-mode global-mode mode turn-on keyword-args@dots{}
1539 This defines a global toggle named @var{global-mode} whose meaning is
1540 to enable or disable the buffer-local minor mode @var{mode} in all
1541 buffers. To turn on the minor mode in a buffer, it uses the function
1542 @var{turn-on}; to turn off the minor mode, it calls @code{mode} with
1543 @minus{}1 as argument.
1544
1545 Globally enabling the mode also affects buffers subsequently created
1546 by visiting files, and buffers that use a major mode other than
1547 Fundamental mode; but it does not detect the creation of a new buffer
1548 in Fundamental mode.
1549
1550 This defines the customization option @var{global-mode} (@pxref{Customization}),
1551 which can be toggled in the Custom interface to turn the minor mode on
1552 and off. As with @code{define-minor-mode}, you should ensure that the
1553 @code{define-globalized-minor-mode} form is evaluated each time Emacs
1554 starts, for example by providing a @code{:require} keyword.
1555
1556 Use @code{:group @var{group}} in @var{keyword-args} to specify the
1557 custom group for the mode variable of the global minor mode.
1558
1559 Generally speaking, when you define a globalized minor mode, you should
1560 also define a non-globalized version, so that people can use (or
1561 disable) it in individual buffers. This also allows them to disable a
1562 globally enabled minor mode in a specific major mode, by using that
1563 mode's hook.
1564 @end defmac
1565
1566
1567 @node Mode Line Format
1568 @section Mode-Line Format
1569 @cindex mode line
1570
1571 Each Emacs window (aside from minibuffer windows) typically has a mode
1572 line at the bottom, which displays status information about the buffer
1573 displayed in the window. The mode line contains information about the
1574 buffer, such as its name, associated file, depth of recursive editing,
1575 and major and minor modes. A window can also have a @dfn{header
1576 line}, which is much like the mode line but appears at the top of the
1577 window.
1578
1579 This section describes how to control the contents of the mode line
1580 and header line. We include it in this chapter because much of the
1581 information displayed in the mode line relates to the enabled major and
1582 minor modes.
1583
1584 @menu
1585 * Base: Mode Line Basics. Basic ideas of mode line control.
1586 * Data: Mode Line Data. The data structure that controls the mode line.
1587 * Top: Mode Line Top. The top level variable, mode-line-format.
1588 * Mode Line Variables:: Variables used in that data structure.
1589 * %-Constructs:: Putting information into a mode line.
1590 * Properties in Mode:: Using text properties in the mode line.
1591 * Header Lines:: Like a mode line, but at the top.
1592 * Emulating Mode Line:: Formatting text as the mode line would.
1593 @end menu
1594
1595 @node Mode Line Basics
1596 @subsection Mode Line Basics
1597
1598 @code{mode-line-format} is a buffer-local variable that holds a
1599 @dfn{mode line construct}, a kind of template, which controls what is
1600 displayed on the mode line of the current buffer. The value of
1601 @code{header-line-format} specifies the buffer's header line in the
1602 same way. All windows for the same buffer use the same
1603 @code{mode-line-format} and @code{header-line-format}.
1604
1605 For efficiency, Emacs does not continuously recompute the mode
1606 line and header line of a window. It does so when circumstances
1607 appear to call for it---for instance, if you change the window
1608 configuration, switch buffers, narrow or widen the buffer, scroll, or
1609 change the buffer's modification status. If you modify any of the
1610 variables referenced by @code{mode-line-format} (@pxref{Mode Line
1611 Variables}), or any other variables and data structures that affect
1612 how text is displayed (@pxref{Display}), you may want to force an
1613 update of the mode line so as to display the new information or
1614 display it in the new way.
1615
1616 @defun force-mode-line-update &optional all
1617 Force redisplay of the current buffer's mode line and header line.
1618 The next redisplay will update the mode line and header line based on
1619 the latest values of all relevant variables. With optional
1620 non-@code{nil} @var{all}, force redisplay of all mode lines and header
1621 lines.
1622
1623 This function also forces recomputation of the menu bar menus
1624 and the frame title.
1625 @end defun
1626
1627 The selected window's mode line is usually displayed in a different
1628 color using the face @code{mode-line}. Other windows' mode lines
1629 appear in the face @code{mode-line-inactive} instead. @xref{Faces}.
1630
1631 @node Mode Line Data
1632 @subsection The Data Structure of the Mode Line
1633 @cindex mode-line construct
1634
1635 The mode-line contents are controlled by a data structure called a
1636 @dfn{mode-line construct}, made up of lists, strings, symbols, and
1637 numbers kept in buffer-local variables. Each data type has a specific
1638 meaning for the mode-line appearance, as described below. The same
1639 data structure is used for constructing frame titles (@pxref{Frame
1640 Titles}) and header lines (@pxref{Header Lines}).
1641
1642 A mode-line construct may be as simple as a fixed string of text,
1643 but it usually specifies how to combine fixed strings with variables'
1644 values to construct the text. Many of these variables are themselves
1645 defined to have mode-line constructs as their values.
1646
1647 Here are the meanings of various data types as mode-line constructs:
1648
1649 @table @code
1650 @cindex percent symbol in mode line
1651 @item @var{string}
1652 A string as a mode-line construct appears verbatim except for
1653 @dfn{@code{%}-constructs} in it. These stand for substitution of
1654 other data; see @ref{%-Constructs}.
1655
1656 If parts of the string have @code{face} properties, they control
1657 display of the text just as they would text in the buffer. Any
1658 characters which have no @code{face} properties are displayed, by
1659 default, in the face @code{mode-line} or @code{mode-line-inactive}
1660 (@pxref{Standard Faces,,, emacs, The GNU Emacs Manual}). The
1661 @code{help-echo} and @code{local-map} properties in @var{string} have
1662 special meanings. @xref{Properties in Mode}.
1663
1664 @item @var{symbol}
1665 A symbol as a mode-line construct stands for its value. The value of
1666 @var{symbol} is used as a mode-line construct, in place of @var{symbol}.
1667 However, the symbols @code{t} and @code{nil} are ignored, as is any
1668 symbol whose value is void.
1669
1670 There is one exception: if the value of @var{symbol} is a string, it is
1671 displayed verbatim: the @code{%}-constructs are not recognized.
1672
1673 Unless @var{symbol} is marked as ``risky'' (i.e., it has a
1674 non-@code{nil} @code{risky-local-variable} property), all text
1675 properties specified in @var{symbol}'s value are ignored. This
1676 includes the text properties of strings in @var{symbol}'s value, as
1677 well as all @code{:eval} and @code{:propertize} forms in it. (The
1678 reason for this is security: non-risky variables could be set
1679 automatically from file variables without prompting the user.)
1680
1681 @item (@var{string} @var{rest}@dots{})
1682 @itemx (@var{list} @var{rest}@dots{})
1683 A list whose first element is a string or list means to process all the
1684 elements recursively and concatenate the results. This is the most
1685 common form of mode-line construct.
1686
1687 @item (:eval @var{form})
1688 A list whose first element is the symbol @code{:eval} says to evaluate
1689 @var{form}, and use the result as a string to display. Make sure this
1690 evaluation cannot load any files, as doing so could cause infinite
1691 recursion.
1692
1693 @item (:propertize @var{elt} @var{props}@dots{})
1694 A list whose first element is the symbol @code{:propertize} says to
1695 process the mode-line construct @var{elt} recursively, then add the text
1696 properties specified by @var{props} to the result. The argument
1697 @var{props} should consist of zero or more pairs @var{text-property}
1698 @var{value}. (This feature is new as of Emacs 22.1.)
1699
1700 @item (@var{symbol} @var{then} @var{else})
1701 A list whose first element is a symbol that is not a keyword specifies
1702 a conditional. Its meaning depends on the value of @var{symbol}. If
1703 @var{symbol} has a non-@code{nil} value, the second element,
1704 @var{then}, is processed recursively as a mode-line element.
1705 Otherwise, the third element, @var{else}, is processed recursively.
1706 You may omit @var{else}; then the mode-line element displays nothing
1707 if the value of @var{symbol} is @code{nil} or void.
1708
1709 @item (@var{width} @var{rest}@dots{})
1710 A list whose first element is an integer specifies truncation or
1711 padding of the results of @var{rest}. The remaining elements
1712 @var{rest} are processed recursively as mode-line constructs and
1713 concatenated together. When @var{width} is positive, the result is
1714 space filled on the right if its width is less than @var{width}. When
1715 @var{width} is negative, the result is truncated on the right to
1716 @minus{}@var{width} columns if its width exceeds @minus{}@var{width}.
1717
1718 For example, the usual way to show what percentage of a buffer is above
1719 the top of the window is to use a list like this: @code{(-3 "%p")}.
1720 @end table
1721
1722 @node Mode Line Top
1723 @subsection The Top Level of Mode Line Control
1724
1725 The variable in overall control of the mode line is
1726 @code{mode-line-format}.
1727
1728 @defopt mode-line-format
1729 The value of this variable is a mode-line construct that controls the
1730 contents of the mode-line. It is always buffer-local in all buffers.
1731
1732 If you set this variable to @code{nil} in a buffer, that buffer does
1733 not have a mode line. (A window that is just one line tall never
1734 displays a mode line.)
1735 @end defopt
1736
1737 The default value of @code{mode-line-format} is designed to use the
1738 values of other variables such as @code{mode-line-position} and
1739 @code{mode-line-modes} (which in turn incorporates the values of the
1740 variables @code{mode-name} and @code{minor-mode-alist}). Very few
1741 modes need to alter @code{mode-line-format} itself. For most
1742 purposes, it is sufficient to alter some of the variables that
1743 @code{mode-line-format} either directly or indirectly refers to.
1744
1745 If you do alter @code{mode-line-format} itself, the new value should
1746 use the same variables that appear in the default value (@pxref{Mode
1747 Line Variables}), rather than duplicating their contents or displaying
1748 the information in another fashion. This way, customizations made by
1749 the user or by Lisp programs (such as @code{display-time} and major
1750 modes) via changes to those variables remain effective.
1751
1752 Here is an example of a @code{mode-line-format} that might be
1753 useful for @code{shell-mode}, since it contains the host name and default
1754 directory.
1755
1756 @example
1757 @group
1758 (setq mode-line-format
1759 (list "-"
1760 'mode-line-mule-info
1761 'mode-line-modified
1762 'mode-line-frame-identification
1763 "%b--"
1764 @end group
1765 @group
1766 ;; @r{Note that this is evaluated while making the list.}
1767 ;; @r{It makes a mode-line construct which is just a string.}
1768 (getenv "HOST")
1769 @end group
1770 ":"
1771 'default-directory
1772 " "
1773 'global-mode-string
1774 " %[("
1775 '(:eval (mode-line-mode-name))
1776 'mode-line-process
1777 'minor-mode-alist
1778 "%n"
1779 ")%]--"
1780 @group
1781 '(which-func-mode ("" which-func-format "--"))
1782 '(line-number-mode "L%l--")
1783 '(column-number-mode "C%c--")
1784 '(-3 "%p")
1785 "-%-"))
1786 @end group
1787 @end example
1788
1789 @noindent
1790 (The variables @code{line-number-mode}, @code{column-number-mode}
1791 and @code{which-func-mode} enable particular minor modes; as usual,
1792 these variable names are also the minor mode command names.)
1793
1794 @node Mode Line Variables
1795 @subsection Variables Used in the Mode Line
1796
1797 This section describes variables incorporated by the standard value
1798 of @code{mode-line-format} into the text of the mode line. There is
1799 nothing inherently special about these variables; any other variables
1800 could have the same effects on the mode line if
1801 @code{mode-line-format}'s value were changed to use them. However,
1802 various parts of Emacs set these variables on the understanding that
1803 they will control parts of the mode line; therefore, practically
1804 speaking, it is essential for the mode line to use them.
1805
1806 @defvar mode-line-mule-info
1807 This variable holds the value of the mode-line construct that displays
1808 information about the language environment, buffer coding system, and
1809 current input method. @xref{Non-ASCII Characters}.
1810 @end defvar
1811
1812 @defvar mode-line-modified
1813 This variable holds the value of the mode-line construct that displays
1814 whether the current buffer is modified. Its default value displays
1815 @samp{**} if the buffer is modified, @samp{--} if the buffer is not
1816 modified, @samp{%%} if the buffer is read only, and @samp{%*} if the
1817 buffer is read only and modified.
1818
1819 Changing this variable does not force an update of the mode line.
1820 @end defvar
1821
1822 @defvar mode-line-frame-identification
1823 This variable identifies the current frame. Its default value
1824 displays @code{" "} if you are using a window system which can show
1825 multiple frames, or @code{"-%F "} on an ordinary terminal which shows
1826 only one frame at a time.
1827 @end defvar
1828
1829 @defvar mode-line-buffer-identification
1830 This variable identifies the buffer being displayed in the window.
1831 Its default value displays the buffer name, padded with spaces to at
1832 least 12 columns.
1833 @end defvar
1834
1835 @defopt mode-line-position
1836 This variable indicates the position in the buffer. Its default value
1837 displays the buffer percentage and, optionally, the buffer size, the
1838 line number and the column number.
1839 @end defopt
1840
1841 @defvar vc-mode
1842 The variable @code{vc-mode}, buffer-local in each buffer, records
1843 whether the buffer's visited file is maintained with version control,
1844 and, if so, which kind. Its value is a string that appears in the mode
1845 line, or @code{nil} for no version control.
1846 @end defvar
1847
1848 @defopt mode-line-modes
1849 This variable displays the buffer's major and minor modes. Its
1850 default value also displays the recursive editing level, information
1851 on the process status, and whether narrowing is in effect.
1852 @end defopt
1853
1854 @defopt mode-line-remote
1855 This variable is used to show whether @code{default-directory} for the
1856 current buffer is remote.
1857 @end defopt
1858
1859 @defopt mode-line-client
1860 This variable is used to identify @code{emacsclient} frames.
1861 @end defopt
1862
1863 The following three variables are used in @code{mode-line-modes}:
1864
1865 @defvar mode-name
1866 This buffer-local variable holds the ``pretty'' name of the current
1867 buffer's major mode. Each major mode should set this variable so that
1868 the mode name will appear in the mode line. The value does not have
1869 to be a string, but can use any of the data types valid in a mode-line
1870 construct (@pxref{Mode Line Data}). To compute the string that will
1871 identify the mode name in the mode line, use @code{format-mode-line}
1872 (@pxref{Emulating Mode Line}).
1873 @end defvar
1874
1875 @defvar mode-line-process
1876 This buffer-local variable contains the mode-line information on process
1877 status in modes used for communicating with subprocesses. It is
1878 displayed immediately following the major mode name, with no intervening
1879 space. For example, its value in the @samp{*shell*} buffer is
1880 @code{(":%s")}, which allows the shell to display its status along
1881 with the major mode as: @samp{(Shell:run)}. Normally this variable
1882 is @code{nil}.
1883 @end defvar
1884
1885 @defvar minor-mode-alist
1886 @anchor{Definition of minor-mode-alist}
1887 This variable holds an association list whose elements specify how the
1888 mode line should indicate that a minor mode is active. Each element of
1889 the @code{minor-mode-alist} should be a two-element list:
1890
1891 @example
1892 (@var{minor-mode-variable} @var{mode-line-string})
1893 @end example
1894
1895 More generally, @var{mode-line-string} can be any mode-line spec. It
1896 appears in the mode line when the value of @var{minor-mode-variable}
1897 is non-@code{nil}, and not otherwise. These strings should begin with
1898 spaces so that they don't run together. Conventionally, the
1899 @var{minor-mode-variable} for a specific mode is set to a
1900 non-@code{nil} value when that minor mode is activated.
1901
1902 @code{minor-mode-alist} itself is not buffer-local. Each variable
1903 mentioned in the alist should be buffer-local if its minor mode can be
1904 enabled separately in each buffer.
1905 @end defvar
1906
1907 @defvar global-mode-string
1908 This variable holds a mode-line spec that, by default, appears in the
1909 mode line just after the @code{which-func-mode} minor mode if set,
1910 else after @code{mode-line-modes}. The command @code{display-time}
1911 sets @code{global-mode-string} to refer to the variable
1912 @code{display-time-string}, which holds a string containing the time
1913 and load information.
1914
1915 The @samp{%M} construct substitutes the value of
1916 @code{global-mode-string}, but that is obsolete, since the variable is
1917 included in the mode line from @code{mode-line-format}.
1918 @end defvar
1919
1920 Here is a simplified version of the default value of
1921 @code{mode-line-format}. The real default value also
1922 specifies addition of text properties.
1923
1924 @example
1925 @group
1926 ("-"
1927 mode-line-mule-info
1928 mode-line-modified
1929 mode-line-frame-identification
1930 mode-line-buffer-identification
1931 @end group
1932 " "
1933 mode-line-position
1934 (vc-mode vc-mode)
1935 " "
1936 @group
1937 mode-line-modes
1938 (which-func-mode ("" which-func-format "--"))
1939 (global-mode-string ("--" global-mode-string))
1940 "-%-")
1941 @end group
1942 @end example
1943
1944 @node %-Constructs
1945 @subsection @code{%}-Constructs in the Mode Line
1946
1947 Strings used as mode-line constructs can use certain
1948 @code{%}-constructs to substitute various kinds of data. Here is a
1949 list of the defined @code{%}-constructs, and what they mean. In any
1950 construct except @samp{%%}, you can add a decimal integer after the
1951 @samp{%} to specify a minimum field width. If the width is less, the
1952 field is padded with spaces to the right.
1953
1954 @table @code
1955 @item %b
1956 The current buffer name, obtained with the @code{buffer-name} function.
1957 @xref{Buffer Names}.
1958
1959 @item %c
1960 The current column number of point.
1961
1962 @item %e
1963 When Emacs is nearly out of memory for Lisp objects, a brief message
1964 saying so. Otherwise, this is empty.
1965
1966 @item %f
1967 The visited file name, obtained with the @code{buffer-file-name}
1968 function. @xref{Buffer File Name}.
1969
1970 @item %F
1971 The title (only on a window system) or the name of the selected frame.
1972 @xref{Basic Parameters}.
1973
1974 @item %i
1975 The size of the accessible part of the current buffer; basically
1976 @code{(- (point-max) (point-min))}.
1977
1978 @item %I
1979 Like @samp{%i}, but the size is printed in a more readable way by using
1980 @samp{k} for 10^3, @samp{M} for 10^6, @samp{G} for 10^9, etc., to
1981 abbreviate.
1982
1983 @item %l
1984 The current line number of point, counting within the accessible portion
1985 of the buffer.
1986
1987 @item %n
1988 @samp{Narrow} when narrowing is in effect; nothing otherwise (see
1989 @code{narrow-to-region} in @ref{Narrowing}).
1990
1991 @item %p
1992 The percentage of the buffer text above the @strong{top} of window, or
1993 @samp{Top}, @samp{Bottom} or @samp{All}. Note that the default
1994 mode-line specification truncates this to three characters.
1995
1996 @item %P
1997 The percentage of the buffer text that is above the @strong{bottom} of
1998 the window (which includes the text visible in the window, as well as
1999 the text above the top), plus @samp{Top} if the top of the buffer is
2000 visible on screen; or @samp{Bottom} or @samp{All}.
2001
2002 @item %s
2003 The status of the subprocess belonging to the current buffer, obtained with
2004 @code{process-status}. @xref{Process Information}.
2005
2006 @item %t
2007 Whether the visited file is a text file or a binary file. This is a
2008 meaningful distinction only on certain operating systems (@pxref{MS-DOS
2009 File Types}).
2010
2011 @item %z
2012 The mnemonics of keyboard, terminal, and buffer coding systems.
2013
2014 @item %Z
2015 Like @samp{%z}, but including the end-of-line format.
2016
2017 @item %*
2018 @samp{%} if the buffer is read only (see @code{buffer-read-only}); @*
2019 @samp{*} if the buffer is modified (see @code{buffer-modified-p}); @*
2020 @samp{-} otherwise. @xref{Buffer Modification}.
2021
2022 @item %+
2023 @samp{*} if the buffer is modified (see @code{buffer-modified-p}); @*
2024 @samp{%} if the buffer is read only (see @code{buffer-read-only}); @*
2025 @samp{-} otherwise. This differs from @samp{%*} only for a modified
2026 read-only buffer. @xref{Buffer Modification}.
2027
2028 @item %&
2029 @samp{*} if the buffer is modified, and @samp{-} otherwise.
2030
2031 @item %[
2032 An indication of the depth of recursive editing levels (not counting
2033 minibuffer levels): one @samp{[} for each editing level.
2034 @xref{Recursive Editing}.
2035
2036 @item %]
2037 One @samp{]} for each recursive editing level (not counting minibuffer
2038 levels).
2039
2040 @item %-
2041 Dashes sufficient to fill the remainder of the mode line.
2042
2043 @item %%
2044 The character @samp{%}---this is how to include a literal @samp{%} in a
2045 string in which @code{%}-constructs are allowed.
2046 @end table
2047
2048 The following two @code{%}-constructs are still supported, but they are
2049 obsolete, since you can get the same results with the variables
2050 @code{mode-name} and @code{global-mode-string}.
2051
2052 @table @code
2053 @item %m
2054 The value of @code{mode-name}.
2055
2056 @item %M
2057 The value of @code{global-mode-string}.
2058 @end table
2059
2060 @node Properties in Mode
2061 @subsection Properties in the Mode Line
2062 @cindex text properties in the mode line
2063
2064 Certain text properties are meaningful in the
2065 mode line. The @code{face} property affects the appearance of text; the
2066 @code{help-echo} property associates help strings with the text, and
2067 @code{local-map} can make the text mouse-sensitive.
2068
2069 There are four ways to specify text properties for text in the mode
2070 line:
2071
2072 @enumerate
2073 @item
2074 Put a string with a text property directly into the mode-line data
2075 structure.
2076
2077 @item
2078 Put a text property on a mode-line %-construct such as @samp{%12b}; then
2079 the expansion of the %-construct will have that same text property.
2080
2081 @item
2082 Use a @code{(:propertize @var{elt} @var{props}@dots{})} construct to
2083 give @var{elt} a text property specified by @var{props}.
2084
2085 @item
2086 Use a list containing @code{:eval @var{form}} in the mode-line data
2087 structure, and make @var{form} evaluate to a string that has a text
2088 property.
2089 @end enumerate
2090
2091 You can use the @code{local-map} property to specify a keymap. This
2092 keymap only takes real effect for mouse clicks; binding character keys
2093 and function keys to it has no effect, since it is impossible to move
2094 point into the mode line.
2095
2096 When the mode line refers to a variable which does not have a
2097 non-@code{nil} @code{risky-local-variable} property, any text
2098 properties given or specified within that variable's values are
2099 ignored. This is because such properties could otherwise specify
2100 functions to be called, and those functions could come from file
2101 local variables.
2102
2103 @node Header Lines
2104 @subsection Window Header Lines
2105 @cindex header line (of a window)
2106 @cindex window header line
2107
2108 A window can have a @dfn{header line} at the
2109 top, just as it can have a mode line at the bottom. The header line
2110 feature works just like the mode-line feature, except that it's
2111 controlled by different variables.
2112
2113 @defvar header-line-format
2114 This variable, local in every buffer, specifies how to display the
2115 header line, for windows displaying the buffer. The format of the value
2116 is the same as for @code{mode-line-format} (@pxref{Mode Line Data}).
2117 It is normally @code{nil}, so that ordinary buffers have no header line.
2118 @end defvar
2119
2120 A window that is just one line tall never displays a header line. A
2121 window that is two lines tall cannot display both a mode line and a
2122 header line at once; if it has a mode line, then it does not display a
2123 header line.
2124
2125 @node Emulating Mode Line
2126 @subsection Emulating Mode-Line Formatting
2127
2128 You can use the function @code{format-mode-line} to compute
2129 the text that would appear in a mode line or header line
2130 based on a certain mode-line specification.
2131
2132 @defun format-mode-line format &optional face window buffer
2133 This function formats a line of text according to @var{format} as if it
2134 were generating the mode line for @var{window}, but it also returns the
2135 text as a string. The argument @var{window} defaults to the selected
2136 window. If @var{buffer} is non-@code{nil}, all the information used is
2137 taken from @var{buffer}; by default, it comes from @var{window}'s
2138 buffer.
2139
2140 The value string normally has text properties that correspond to the
2141 faces, keymaps, etc., that the mode line would have. Any character for
2142 which no @code{face} property is specified by @var{format} gets a
2143 default value determined by @var{face}. If @var{face} is @code{t}, that
2144 stands for either @code{mode-line} if @var{window} is selected,
2145 otherwise @code{mode-line-inactive}. If @var{face} is @code{nil} or
2146 omitted, that stands for the default face. If @var{face} is an integer,
2147 the value returned by this function will have no text properties.
2148
2149 You can also specify other valid faces as the value of @var{face}.
2150 If specified, that face provides the @code{face} property for characters
2151 whose face is not specified by @var{format}.
2152
2153 Note that using @code{mode-line}, @code{mode-line-inactive}, or
2154 @code{header-line} as @var{face} will actually redisplay the mode line
2155 or the header line, respectively, using the current definitions of the
2156 corresponding face, in addition to returning the formatted string.
2157 (Other faces do not cause redisplay.)
2158
2159 For example, @code{(format-mode-line header-line-format)} returns the
2160 text that would appear in the selected window's header line (@code{""}
2161 if it has no header line). @code{(format-mode-line header-line-format
2162 'header-line)} returns the same text, with each character
2163 carrying the face that it will have in the header line itself, and also
2164 redraws the header line.
2165 @end defun
2166
2167 @node Imenu
2168 @section Imenu
2169
2170 @cindex Imenu
2171 @dfn{Imenu} is a feature that lets users select a definition or
2172 section in the buffer, from a menu which lists all of them, to go
2173 directly to that location in the buffer. Imenu works by constructing
2174 a buffer index which lists the names and buffer positions of the
2175 definitions, or other named portions of the buffer; then the user can
2176 choose one of them and move point to it. Major modes can add a menu
2177 bar item to use Imenu using @code{imenu-add-to-menubar}.
2178
2179 @defun imenu-add-to-menubar name
2180 This function defines a local menu bar item named @var{name}
2181 to run Imenu.
2182 @end defun
2183
2184 The user-level commands for using Imenu are described in the Emacs
2185 Manual (@pxref{Imenu,, Imenu, emacs, the Emacs Manual}). This section
2186 explains how to customize Imenu's method of finding definitions or
2187 buffer portions for a particular major mode.
2188
2189 The usual and simplest way is to set the variable
2190 @code{imenu-generic-expression}:
2191
2192 @defvar imenu-generic-expression
2193 This variable, if non-@code{nil}, is a list that specifies regular
2194 expressions for finding definitions for Imenu. Simple elements of
2195 @code{imenu-generic-expression} look like this:
2196
2197 @example
2198 (@var{menu-title} @var{regexp} @var{index})
2199 @end example
2200
2201 Here, if @var{menu-title} is non-@code{nil}, it says that the matches
2202 for this element should go in a submenu of the buffer index;
2203 @var{menu-title} itself specifies the name for the submenu. If
2204 @var{menu-title} is @code{nil}, the matches for this element go directly
2205 in the top level of the buffer index.
2206
2207 The second item in the list, @var{regexp}, is a regular expression
2208 (@pxref{Regular Expressions}); anything in the buffer that it matches
2209 is considered a definition, something to mention in the buffer index.
2210 The third item, @var{index}, is a non-negative integer that indicates
2211 which subexpression in @var{regexp} matches the definition's name.
2212
2213 An element can also look like this:
2214
2215 @example
2216 (@var{menu-title} @var{regexp} @var{index} @var{function} @var{arguments}@dots{})
2217 @end example
2218
2219 Each match for this element creates an index item, and when the index
2220 item is selected by the user, it calls @var{function} with arguments
2221 consisting of the item name, the buffer position, and @var{arguments}.
2222
2223 For Emacs Lisp mode, @code{imenu-generic-expression} could look like
2224 this:
2225
2226 @c should probably use imenu-syntax-alist and \\sw rather than [-A-Za-z0-9+]
2227 @example
2228 @group
2229 ((nil "^\\s-*(def\\(un\\|subst\\|macro\\|advice\\)\
2230 \\s-+\\([-A-Za-z0-9+]+\\)" 2)
2231 @end group
2232 @group
2233 ("*Vars*" "^\\s-*(def\\(var\\|const\\)\
2234 \\s-+\\([-A-Za-z0-9+]+\\)" 2)
2235 @end group
2236 @group
2237 ("*Types*"
2238 "^\\s-*\
2239 (def\\(type\\|struct\\|class\\|ine-condition\\)\
2240 \\s-+\\([-A-Za-z0-9+]+\\)" 2))
2241 @end group
2242 @end example
2243
2244 Setting this variable makes it buffer-local in the current buffer.
2245 @end defvar
2246
2247 @defvar imenu-case-fold-search
2248 This variable controls whether matching against the regular
2249 expressions in the value of @code{imenu-generic-expression} is
2250 case-sensitive: @code{t}, the default, means matching should ignore
2251 case.
2252
2253 Setting this variable makes it buffer-local in the current buffer.
2254 @end defvar
2255
2256 @defvar imenu-syntax-alist
2257 This variable is an alist of syntax table modifiers to use while
2258 processing @code{imenu-generic-expression}, to override the syntax table
2259 of the current buffer. Each element should have this form:
2260
2261 @example
2262 (@var{characters} . @var{syntax-description})
2263 @end example
2264
2265 The @sc{car}, @var{characters}, can be either a character or a string.
2266 The element says to give that character or characters the syntax
2267 specified by @var{syntax-description}, which is passed to
2268 @code{modify-syntax-entry} (@pxref{Syntax Table Functions}).
2269
2270 This feature is typically used to give word syntax to characters which
2271 normally have symbol syntax, and thus to simplify
2272 @code{imenu-generic-expression} and speed up matching.
2273 For example, Fortran mode uses it this way:
2274
2275 @example
2276 (setq imenu-syntax-alist '(("_$" . "w")))
2277 @end example
2278
2279 The @code{imenu-generic-expression} regular expressions can then use
2280 @samp{\\sw+} instead of @samp{\\(\\sw\\|\\s_\\)+}. Note that this
2281 technique may be inconvenient when the mode needs to limit the initial
2282 character of a name to a smaller set of characters than are allowed in
2283 the rest of a name.
2284
2285 Setting this variable makes it buffer-local in the current buffer.
2286 @end defvar
2287
2288 Another way to customize Imenu for a major mode is to set the
2289 variables @code{imenu-prev-index-position-function} and
2290 @code{imenu-extract-index-name-function}:
2291
2292 @defvar imenu-prev-index-position-function
2293 If this variable is non-@code{nil}, its value should be a function that
2294 finds the next ``definition'' to put in the buffer index, scanning
2295 backward in the buffer from point. It should return @code{nil} if it
2296 doesn't find another ``definition'' before point. Otherwise it should
2297 leave point at the place it finds a ``definition'' and return any
2298 non-@code{nil} value.
2299
2300 Setting this variable makes it buffer-local in the current buffer.
2301 @end defvar
2302
2303 @defvar imenu-extract-index-name-function
2304 If this variable is non-@code{nil}, its value should be a function to
2305 return the name for a definition, assuming point is in that definition
2306 as the @code{imenu-prev-index-position-function} function would leave
2307 it.
2308
2309 Setting this variable makes it buffer-local in the current buffer.
2310 @end defvar
2311
2312 The last way to customize Imenu for a major mode is to set the
2313 variable @code{imenu-create-index-function}:
2314
2315 @defvar imenu-create-index-function
2316 This variable specifies the function to use for creating a buffer
2317 index. The function should take no arguments, and return an index
2318 alist for the current buffer. It is called within
2319 @code{save-excursion}, so where it leaves point makes no difference.
2320
2321 The index alist can have three types of elements. Simple elements
2322 look like this:
2323
2324 @example
2325 (@var{index-name} . @var{index-position})
2326 @end example
2327
2328 Selecting a simple element has the effect of moving to position
2329 @var{index-position} in the buffer. Special elements look like this:
2330
2331 @example
2332 (@var{index-name} @var{index-position} @var{function} @var{arguments}@dots{})
2333 @end example
2334
2335 Selecting a special element performs:
2336
2337 @example
2338 (funcall @var{function}
2339 @var{index-name} @var{index-position} @var{arguments}@dots{})
2340 @end example
2341
2342 A nested sub-alist element looks like this:
2343
2344 @example
2345 (@var{menu-title} @var{sub-alist})
2346 @end example
2347
2348 It creates the submenu @var{menu-title} specified by @var{sub-alist}.
2349
2350 The default value of @code{imenu-create-index-function} is
2351 @code{imenu-default-create-index-function}. This function calls the
2352 value of @code{imenu-prev-index-position-function} and the value of
2353 @code{imenu-extract-index-name-function} to produce the index alist.
2354 However, if either of these two variables is @code{nil}, the default
2355 function uses @code{imenu-generic-expression} instead.
2356
2357 Setting this variable makes it buffer-local in the current buffer.
2358 @end defvar
2359
2360 @node Font Lock Mode
2361 @section Font Lock Mode
2362 @cindex Font Lock mode
2363
2364 @dfn{Font Lock mode} is a feature that automatically attaches
2365 @code{face} properties to certain parts of the buffer based on their
2366 syntactic role. How it parses the buffer depends on the major mode;
2367 most major modes define syntactic criteria for which faces to use in
2368 which contexts. This section explains how to customize Font Lock for a
2369 particular major mode.
2370
2371 Font Lock mode finds text to highlight in two ways: through
2372 syntactic parsing based on the syntax table, and through searching
2373 (usually for regular expressions). Syntactic fontification happens
2374 first; it finds comments and string constants and highlights them.
2375 Search-based fontification happens second.
2376
2377 @menu
2378 * Font Lock Basics:: Overview of customizing Font Lock.
2379 * Search-based Fontification:: Fontification based on regexps.
2380 * Customizing Keywords:: Customizing search-based fontification.
2381 * Other Font Lock Variables:: Additional customization facilities.
2382 * Levels of Font Lock:: Each mode can define alternative levels
2383 so that the user can select more or less.
2384 * Precalculated Fontification:: How Lisp programs that produce the buffer
2385 contents can also specify how to fontify it.
2386 * Faces for Font Lock:: Special faces specifically for Font Lock.
2387 * Syntactic Font Lock:: Fontification based on syntax tables.
2388 * Setting Syntax Properties:: Defining character syntax based on context
2389 using the Font Lock mechanism.
2390 * Multiline Font Lock:: How to coerce Font Lock into properly
2391 highlighting multiline constructs.
2392 @end menu
2393
2394 @node Font Lock Basics
2395 @subsection Font Lock Basics
2396
2397 There are several variables that control how Font Lock mode highlights
2398 text. But major modes should not set any of these variables directly.
2399 Instead, they should set @code{font-lock-defaults} as a buffer-local
2400 variable. The value assigned to this variable is used, if and when Font
2401 Lock mode is enabled, to set all the other variables.
2402
2403 @defvar font-lock-defaults
2404 This variable is set by major modes, as a buffer-local variable, to
2405 specify how to fontify text in that mode. It automatically becomes
2406 buffer-local when you set it. If its value is @code{nil}, Font-Lock
2407 mode does no highlighting, and you can use the @samp{Faces} menu
2408 (under @samp{Edit} and then @samp{Text Properties} in the menu bar) to
2409 assign faces explicitly to text in the buffer.
2410
2411 If non-@code{nil}, the value should look like this:
2412
2413 @example
2414 (@var{keywords} [@var{keywords-only} [@var{case-fold}
2415 [@var{syntax-alist} [@var{syntax-begin} @var{other-vars}@dots{}]]]])
2416 @end example
2417
2418 The first element, @var{keywords}, indirectly specifies the value of
2419 @code{font-lock-keywords} which directs search-based fontification.
2420 It can be a symbol, a variable or a function whose value is the list
2421 to use for @code{font-lock-keywords}. It can also be a list of
2422 several such symbols, one for each possible level of fontification.
2423 The first symbol specifies the @samp{mode default} level of
2424 fontification, the next symbol level 1 fontification, the next level 2,
2425 and so on. The @samp{mode default} level is normally the same as level
2426 1. It is used when @code{font-lock-maximum-decoration} has a @code{nil}
2427 value. @xref{Levels of Font Lock}.
2428
2429 The second element, @var{keywords-only}, specifies the value of the
2430 variable @code{font-lock-keywords-only}. If this is omitted or
2431 @code{nil}, syntactic fontification (of strings and comments) is also
2432 performed. If this is non-@code{nil}, such fontification is not
2433 performed. @xref{Syntactic Font Lock}.
2434
2435 The third element, @var{case-fold}, specifies the value of
2436 @code{font-lock-keywords-case-fold-search}. If it is non-@code{nil},
2437 Font Lock mode ignores case when searching as directed by
2438 @code{font-lock-keywords}.
2439
2440 If the fourth element, @var{syntax-alist}, is non-@code{nil}, it
2441 should be a list of cons cells of the form @code{(@var{char-or-string}
2442 . @var{string})}. These are used to set up a syntax table for
2443 syntactic fontification (@pxref{Syntax Table Functions}). The
2444 resulting syntax table is stored in @code{font-lock-syntax-table}.
2445
2446 The fifth element, @var{syntax-begin}, specifies the value of
2447 @code{font-lock-beginning-of-syntax-function}. We recommend setting
2448 this variable to @code{nil} and using @code{syntax-begin-function}
2449 instead.
2450
2451 All the remaining elements (if any) are collectively called
2452 @var{other-vars}. Each of these elements should have the form
2453 @code{(@var{variable} . @var{value})}---which means, make
2454 @var{variable} buffer-local and then set it to @var{value}. You can
2455 use these @var{other-vars} to set other variables that affect
2456 fontification, aside from those you can control with the first five
2457 elements. @xref{Other Font Lock Variables}.
2458 @end defvar
2459
2460 If your mode fontifies text explicitly by adding
2461 @code{font-lock-face} properties, it can specify @code{(nil t)} for
2462 @code{font-lock-defaults} to turn off all automatic fontification.
2463 However, this is not required; it is possible to fontify some things
2464 using @code{font-lock-face} properties and set up automatic
2465 fontification for other parts of the text.
2466
2467 @node Search-based Fontification
2468 @subsection Search-based Fontification
2469
2470 The most important variable for customizing Font Lock mode is
2471 @code{font-lock-keywords}. It specifies the search criteria for
2472 search-based fontification. You should specify the value of this
2473 variable with @var{keywords} in @code{font-lock-defaults}.
2474
2475 @defvar font-lock-keywords
2476 This variable's value is a list of the keywords to highlight. Be
2477 careful when composing regular expressions for this list; a poorly
2478 written pattern can dramatically slow things down!
2479 @end defvar
2480
2481 Each element of @code{font-lock-keywords} specifies how to find
2482 certain cases of text, and how to highlight those cases. Font Lock mode
2483 processes the elements of @code{font-lock-keywords} one by one, and for
2484 each element, it finds and handles all matches. Ordinarily, once
2485 part of the text has been fontified already, this cannot be overridden
2486 by a subsequent match in the same text; but you can specify different
2487 behavior using the @var{override} element of a @var{subexp-highlighter}.
2488
2489 Each element of @code{font-lock-keywords} should have one of these
2490 forms:
2491
2492 @table @code
2493 @item @var{regexp}
2494 Highlight all matches for @var{regexp} using
2495 @code{font-lock-keyword-face}. For example,
2496
2497 @example
2498 ;; @r{Highlight occurrences of the word @samp{foo}}
2499 ;; @r{using @code{font-lock-keyword-face}.}
2500 "\\<foo\\>"
2501 @end example
2502
2503 The function @code{regexp-opt} (@pxref{Regexp Functions}) is useful
2504 for calculating optimal regular expressions to match a number of
2505 different keywords.
2506
2507 @item @var{function}
2508 Find text by calling @var{function}, and highlight the matches
2509 it finds using @code{font-lock-keyword-face}.
2510
2511 When @var{function} is called, it receives one argument, the limit of
2512 the search; it should begin searching at point, and not search beyond the
2513 limit. It should return non-@code{nil} if it succeeds, and set the
2514 match data to describe the match that was found. Returning @code{nil}
2515 indicates failure of the search.
2516
2517 Fontification will call @var{function} repeatedly with the same limit,
2518 and with point where the previous invocation left it, until
2519 @var{function} fails. On failure, @var{function} need not reset point
2520 in any particular way.
2521
2522 @item (@var{matcher} . @var{subexp})
2523 In this kind of element, @var{matcher} is either a regular
2524 expression or a function, as described above. The @sc{cdr},
2525 @var{subexp}, specifies which subexpression of @var{matcher} should be
2526 highlighted (instead of the entire text that @var{matcher} matched).
2527
2528 @example
2529 ;; @r{Highlight the @samp{bar} in each occurrence of @samp{fubar},}
2530 ;; @r{using @code{font-lock-keyword-face}.}
2531 ("fu\\(bar\\)" . 1)
2532 @end example
2533
2534 If you use @code{regexp-opt} to produce the regular expression
2535 @var{matcher}, you can use @code{regexp-opt-depth} (@pxref{Regexp
2536 Functions}) to calculate the value for @var{subexp}.
2537
2538 @item (@var{matcher} . @var{facespec})
2539 In this kind of element, @var{facespec} is an expression whose value
2540 specifies the face to use for highlighting. In the simplest case,
2541 @var{facespec} is a Lisp variable (a symbol) whose value is a face
2542 name.
2543
2544 @example
2545 ;; @r{Highlight occurrences of @samp{fubar},}
2546 ;; @r{using the face which is the value of @code{fubar-face}.}
2547 ("fubar" . fubar-face)
2548 @end example
2549
2550 However, @var{facespec} can also evaluate to a list of this form:
2551
2552 @example
2553 (face @var{face} @var{prop1} @var{val1} @var{prop2} @var{val2}@dots{})
2554 @end example
2555
2556 @noindent
2557 to specify the face @var{face} and various additional text properties
2558 to put on the text that matches. If you do this, be sure to add the
2559 other text property names that you set in this way to the value of
2560 @code{font-lock-extra-managed-props} so that the properties will also
2561 be cleared out when they are no longer appropriate. Alternatively,
2562 you can set the variable @code{font-lock-unfontify-region-function} to
2563 a function that clears these properties. @xref{Other Font Lock
2564 Variables}.
2565
2566 @item (@var{matcher} . @var{subexp-highlighter})
2567 In this kind of element, @var{subexp-highlighter} is a list
2568 which specifies how to highlight matches found by @var{matcher}.
2569 It has the form:
2570
2571 @example
2572 (@var{subexp} @var{facespec} [@var{override} [@var{laxmatch}]])
2573 @end example
2574
2575 The @sc{car}, @var{subexp}, is an integer specifying which subexpression
2576 of the match to fontify (0 means the entire matching text). The second
2577 subelement, @var{facespec}, is an expression whose value specifies the
2578 face, as described above.
2579
2580 The last two values in @var{subexp-highlighter}, @var{override} and
2581 @var{laxmatch}, are optional flags. If @var{override} is @code{t},
2582 this element can override existing fontification made by previous
2583 elements of @code{font-lock-keywords}. If it is @code{keep}, then
2584 each character is fontified if it has not been fontified already by
2585 some other element. If it is @code{prepend}, the face specified by
2586 @var{facespec} is added to the beginning of the @code{font-lock-face}
2587 property. If it is @code{append}, the face is added to the end of the
2588 @code{font-lock-face} property.
2589
2590 If @var{laxmatch} is non-@code{nil}, it means there should be no error
2591 if there is no subexpression numbered @var{subexp} in @var{matcher}.
2592 Obviously, fontification of the subexpression numbered @var{subexp} will
2593 not occur. However, fontification of other subexpressions (and other
2594 regexps) will continue. If @var{laxmatch} is @code{nil}, and the
2595 specified subexpression is missing, then an error is signaled which
2596 terminates search-based fontification.
2597
2598 Here are some examples of elements of this kind, and what they do:
2599
2600 @smallexample
2601 ;; @r{Highlight occurrences of either @samp{foo} or @samp{bar}, using}
2602 ;; @r{@code{foo-bar-face}, even if they have already been highlighted.}
2603 ;; @r{@code{foo-bar-face} should be a variable whose value is a face.}
2604 ("foo\\|bar" 0 foo-bar-face t)
2605
2606 ;; @r{Highlight the first subexpression within each occurrence}
2607 ;; @r{that the function @code{fubar-match} finds,}
2608 ;; @r{using the face which is the value of @code{fubar-face}.}
2609 (fubar-match 1 fubar-face)
2610 @end smallexample
2611
2612 @item (@var{matcher} . @var{anchored-highlighter})
2613 In this kind of element, @var{anchored-highlighter} specifies how to
2614 highlight text that follows a match found by @var{matcher}. So a
2615 match found by @var{matcher} acts as the anchor for further searches
2616 specified by @var{anchored-highlighter}. @var{anchored-highlighter}
2617 is a list of the following form:
2618
2619 @example
2620 (@var{anchored-matcher} @var{pre-form} @var{post-form}
2621 @var{subexp-highlighters}@dots{})
2622 @end example
2623
2624 Here, @var{anchored-matcher}, like @var{matcher}, is either a regular
2625 expression or a function. After a match of @var{matcher} is found,
2626 point is at the end of the match. Now, Font Lock evaluates the form
2627 @var{pre-form}. Then it searches for matches of
2628 @var{anchored-matcher} and uses @var{subexp-highlighters} to highlight
2629 these. A @var{subexp-highlighter} is as described above. Finally,
2630 Font Lock evaluates @var{post-form}.
2631
2632 The forms @var{pre-form} and @var{post-form} can be used to initialize
2633 before, and cleanup after, @var{anchored-matcher} is used. Typically,
2634 @var{pre-form} is used to move point to some position relative to the
2635 match of @var{matcher}, before starting with @var{anchored-matcher}.
2636 @var{post-form} might be used to move back, before resuming with
2637 @var{matcher}.
2638
2639 After Font Lock evaluates @var{pre-form}, it does not search for
2640 @var{anchored-matcher} beyond the end of the line. However, if
2641 @var{pre-form} returns a buffer position that is greater than the
2642 position of point after @var{pre-form} is evaluated, then the position
2643 returned by @var{pre-form} is used as the limit of the search instead.
2644 It is generally a bad idea to return a position greater than the end
2645 of the line; in other words, the @var{anchored-matcher} search should
2646 not span lines.
2647
2648 For example,
2649
2650 @smallexample
2651 ;; @r{Highlight occurrences of the word @samp{item} following}
2652 ;; @r{an occurrence of the word @samp{anchor} (on the same line)}
2653 ;; @r{in the value of @code{item-face}.}
2654 ("\\<anchor\\>" "\\<item\\>" nil nil (0 item-face))
2655 @end smallexample
2656
2657 Here, @var{pre-form} and @var{post-form} are @code{nil}. Therefore
2658 searching for @samp{item} starts at the end of the match of
2659 @samp{anchor}, and searching for subsequent instances of @samp{anchor}
2660 resumes from where searching for @samp{item} concluded.
2661
2662 @item (@var{matcher} @var{highlighters}@dots{})
2663 This sort of element specifies several @var{highlighter} lists for a
2664 single @var{matcher}. A @var{highlighter} list can be of the type
2665 @var{subexp-highlighter} or @var{anchored-highlighter} as described
2666 above.
2667
2668 For example,
2669
2670 @smallexample
2671 ;; @r{Highlight occurrences of the word @samp{anchor} in the value}
2672 ;; @r{of @code{anchor-face}, and subsequent occurrences of the word}
2673 ;; @r{@samp{item} (on the same line) in the value of @code{item-face}.}
2674 ("\\<anchor\\>" (0 anchor-face)
2675 ("\\<item\\>" nil nil (0 item-face)))
2676 @end smallexample
2677
2678 @item (eval . @var{form})
2679 Here @var{form} is an expression to be evaluated the first time
2680 this value of @code{font-lock-keywords} is used in a buffer.
2681 Its value should have one of the forms described in this table.
2682 @end table
2683
2684 @strong{Warning:} Do not design an element of @code{font-lock-keywords}
2685 to match text which spans lines; this does not work reliably.
2686 For details, see @xref{Multiline Font Lock}.
2687
2688 You can use @var{case-fold} in @code{font-lock-defaults} to specify
2689 the value of @code{font-lock-keywords-case-fold-search} which says
2690 whether search-based fontification should be case-insensitive.
2691
2692 @defvar font-lock-keywords-case-fold-search
2693 Non-@code{nil} means that regular expression matching for the sake of
2694 @code{font-lock-keywords} should be case-insensitive.
2695 @end defvar
2696
2697 @node Customizing Keywords
2698 @subsection Customizing Search-Based Fontification
2699
2700 You can use @code{font-lock-add-keywords} to add additional
2701 search-based fontification rules to a major mode, and
2702 @code{font-lock-remove-keywords} to remove rules.
2703
2704 @defun font-lock-add-keywords mode keywords &optional how
2705 This function adds highlighting @var{keywords}, for the current buffer
2706 or for major mode @var{mode}. The argument @var{keywords} should be a
2707 list with the same format as the variable @code{font-lock-keywords}.
2708
2709 If @var{mode} is a symbol which is a major mode command name, such as
2710 @code{c-mode}, the effect is that enabling Font Lock mode in
2711 @var{mode} will add @var{keywords} to @code{font-lock-keywords}.
2712 Calling with a non-@code{nil} value of @var{mode} is correct only in
2713 your @file{~/.emacs} file.
2714
2715 If @var{mode} is @code{nil}, this function adds @var{keywords} to
2716 @code{font-lock-keywords} in the current buffer. This way of calling
2717 @code{font-lock-add-keywords} is usually used in mode hook functions.
2718
2719 By default, @var{keywords} are added at the beginning of
2720 @code{font-lock-keywords}. If the optional argument @var{how} is
2721 @code{set}, they are used to replace the value of
2722 @code{font-lock-keywords}. If @var{how} is any other non-@code{nil}
2723 value, they are added at the end of @code{font-lock-keywords}.
2724
2725 Some modes provide specialized support you can use in additional
2726 highlighting patterns. See the variables
2727 @code{c-font-lock-extra-types}, @code{c++-font-lock-extra-types},
2728 and @code{java-font-lock-extra-types}, for example.
2729
2730 @strong{Warning:} major mode commands must not call
2731 @code{font-lock-add-keywords} under any circumstances, either directly
2732 or indirectly, except through their mode hooks. (Doing so would lead to
2733 incorrect behavior for some minor modes.) They should set up their
2734 rules for search-based fontification by setting
2735 @code{font-lock-keywords}.
2736 @end defun
2737
2738 @defun font-lock-remove-keywords mode keywords
2739 This function removes @var{keywords} from @code{font-lock-keywords}
2740 for the current buffer or for major mode @var{mode}. As in
2741 @code{font-lock-add-keywords}, @var{mode} should be a major mode
2742 command name or @code{nil}. All the caveats and requirements for
2743 @code{font-lock-add-keywords} apply here too.
2744 @end defun
2745
2746 For example, this code
2747
2748 @smallexample
2749 (font-lock-add-keywords 'c-mode
2750 '(("\\<\\(FIXME\\):" 1 font-lock-warning-face prepend)
2751 ("\\<\\(and\\|or\\|not\\)\\>" . font-lock-keyword-face)))
2752 @end smallexample
2753
2754 @noindent
2755 adds two fontification patterns for C mode: one to fontify the word
2756 @samp{FIXME}, even in comments, and another to fontify the words
2757 @samp{and}, @samp{or} and @samp{not} as keywords.
2758
2759 @noindent
2760 That example affects only C mode proper. To add the same patterns to
2761 C mode @emph{and} all modes derived from it, do this instead:
2762
2763 @smallexample
2764 (add-hook 'c-mode-hook
2765 (lambda ()
2766 (font-lock-add-keywords nil
2767 '(("\\<\\(FIXME\\):" 1 font-lock-warning-face prepend)
2768 ("\\<\\(and\\|or\\|not\\)\\>" .
2769 font-lock-keyword-face)))))
2770 @end smallexample
2771
2772 @node Other Font Lock Variables
2773 @subsection Other Font Lock Variables
2774
2775 This section describes additional variables that a major mode can
2776 set by means of @var{other-vars} in @code{font-lock-defaults}
2777 (@pxref{Font Lock Basics}).
2778
2779 @defvar font-lock-mark-block-function
2780 If this variable is non-@code{nil}, it should be a function that is
2781 called with no arguments, to choose an enclosing range of text for
2782 refontification for the command @kbd{M-o M-o}
2783 (@code{font-lock-fontify-block}).
2784
2785 The function should report its choice by placing the region around it.
2786 A good choice is a range of text large enough to give proper results,
2787 but not too large so that refontification becomes slow. Typical values
2788 are @code{mark-defun} for programming modes or @code{mark-paragraph} for
2789 textual modes.
2790 @end defvar
2791
2792 @defvar font-lock-extra-managed-props
2793 This variable specifies additional properties (other than
2794 @code{font-lock-face}) that are being managed by Font Lock mode. It
2795 is used by @code{font-lock-default-unfontify-region}, which normally
2796 only manages the @code{font-lock-face} property. If you want Font
2797 Lock to manage other properties as well, you must specify them in a
2798 @var{facespec} in @code{font-lock-keywords} as well as add them to
2799 this list. @xref{Search-based Fontification}.
2800 @end defvar
2801
2802 @defvar font-lock-fontify-buffer-function
2803 Function to use for fontifying the buffer. The default value is
2804 @code{font-lock-default-fontify-buffer}.
2805 @end defvar
2806
2807 @defvar font-lock-unfontify-buffer-function
2808 Function to use for unfontifying the buffer. This is used when
2809 turning off Font Lock mode. The default value is
2810 @code{font-lock-default-unfontify-buffer}.
2811 @end defvar
2812
2813 @defvar font-lock-fontify-region-function
2814 Function to use for fontifying a region. It should take two
2815 arguments, the beginning and end of the region, and an optional third
2816 argument @var{verbose}. If @var{verbose} is non-@code{nil}, the
2817 function should print status messages. The default value is
2818 @code{font-lock-default-fontify-region}.
2819 @end defvar
2820
2821 @defvar font-lock-unfontify-region-function
2822 Function to use for unfontifying a region. It should take two
2823 arguments, the beginning and end of the region. The default value is
2824 @code{font-lock-default-unfontify-region}.
2825 @end defvar
2826
2827 @defun jit-lock-register function &optional contextual
2828 This function tells Font Lock mode to run the Lisp function
2829 @var{function} any time it has to fontify or refontify part of the
2830 current buffer. It calls @var{function} before calling the default
2831 fontification functions, and gives it two arguments, @var{start} and
2832 @var{end}, which specify the region to be fontified or refontified.
2833
2834 The optional argument @var{contextual}, if non-@code{nil}, forces Font
2835 Lock mode to always refontify a syntactically relevant part of the
2836 buffer, and not just the modified lines. This argument can usually be
2837 omitted.
2838 @end defun
2839
2840 @defun jit-lock-unregister function
2841 If @var{function} was previously registered as a fontification
2842 function using @code{jit-lock-register}, this function unregisters it.
2843 @end defun
2844
2845 @node Levels of Font Lock
2846 @subsection Levels of Font Lock
2847
2848 Many major modes offer three different levels of fontification. You
2849 can define multiple levels by using a list of symbols for @var{keywords}
2850 in @code{font-lock-defaults}. Each symbol specifies one level of
2851 fontification; it is up to the user to choose one of these levels,
2852 normally by setting @code{font-lock-maximum-decoration} (@pxref{Font
2853 Lock,,, emacs, the GNU Emacs Manual}). The chosen level's symbol
2854 value is used to initialize @code{font-lock-keywords}.
2855
2856 Here are the conventions for how to define the levels of
2857 fontification:
2858
2859 @itemize @bullet
2860 @item
2861 Level 1: highlight function declarations, file directives (such as include or
2862 import directives), strings and comments. The idea is speed, so only
2863 the most important and top-level components are fontified.
2864
2865 @item
2866 Level 2: in addition to level 1, highlight all language keywords,
2867 including type names that act like keywords, as well as named constant
2868 values. The idea is that all keywords (either syntactic or semantic)
2869 should be fontified appropriately.
2870
2871 @item
2872 Level 3: in addition to level 2, highlight the symbols being defined in
2873 function and variable declarations, and all builtin function names,
2874 wherever they appear.
2875 @end itemize
2876
2877 @node Precalculated Fontification
2878 @subsection Precalculated Fontification
2879
2880 Some major modes such as @code{list-buffers} and @code{occur}
2881 construct the buffer text programmatically. The easiest way for them
2882 to support Font Lock mode is to specify the faces of text when they
2883 insert the text in the buffer.
2884
2885 The way to do this is to specify the faces in the text with the
2886 special text property @code{font-lock-face} (@pxref{Special
2887 Properties}). When Font Lock mode is enabled, this property controls
2888 the display, just like the @code{face} property. When Font Lock mode
2889 is disabled, @code{font-lock-face} has no effect on the display.
2890
2891 It is ok for a mode to use @code{font-lock-face} for some text and
2892 also use the normal Font Lock machinery. But if the mode does not use
2893 the normal Font Lock machinery, it should not set the variable
2894 @code{font-lock-defaults}.
2895
2896 @node Faces for Font Lock
2897 @subsection Faces for Font Lock
2898 @cindex faces for font lock
2899 @cindex font lock faces
2900
2901 Font Lock mode can highlight using any face, but Emacs defines several
2902 faces specifically for syntactic highlighting. These @dfn{Font Lock
2903 faces} are listed below. They can also be used by major modes for
2904 syntactic highlighting outside of Font Lock mode (@pxref{Major Mode
2905 Conventions}).
2906
2907 Each of these symbols is both a face name, and a variable whose
2908 default value is the symbol itself. Thus, the default value of
2909 @code{font-lock-comment-face} is @code{font-lock-comment-face}.
2910
2911 The faces are listed with descriptions of their typical usage, and in
2912 order of greater to lesser ``prominence''. If a mode's syntactic
2913 categories do not fit well with the usage descriptions, the faces can be
2914 assigned using the ordering as a guide.
2915
2916 @table @code
2917 @item font-lock-warning-face
2918 @vindex font-lock-warning-face
2919 for a construct that is peculiar, or that greatly changes the meaning of
2920 other text, like @samp{;;;###autoload} in Emacs Lisp and @samp{#error}
2921 in C.
2922
2923 @item font-lock-function-name-face
2924 @vindex font-lock-function-name-face
2925 for the name of a function being defined or declared.
2926
2927 @item font-lock-variable-name-face
2928 @vindex font-lock-variable-name-face
2929 for the name of a variable being defined or declared.
2930
2931 @item font-lock-keyword-face
2932 @vindex font-lock-keyword-face
2933 for a keyword with special syntactic significance, like @samp{for} and
2934 @samp{if} in C.
2935
2936 @item font-lock-comment-face
2937 @vindex font-lock-comment-face
2938 for comments.
2939
2940 @item font-lock-comment-delimiter-face
2941 @vindex font-lock-comment-delimiter-face
2942 for comments delimiters, like @samp{/*} and @samp{*/} in C. On most
2943 terminals, this inherits from @code{font-lock-comment-face}.
2944
2945 @item font-lock-type-face
2946 @vindex font-lock-type-face
2947 for the names of user-defined data types.
2948
2949 @item font-lock-constant-face
2950 @vindex font-lock-constant-face
2951 for the names of constants, like @samp{NULL} in C.
2952
2953 @item font-lock-builtin-face
2954 @vindex font-lock-builtin-face
2955 for the names of built-in functions.
2956
2957 @item font-lock-preprocessor-face
2958 @vindex font-lock-preprocessor-face
2959 for preprocessor commands. This inherits, by default, from
2960 @code{font-lock-builtin-face}.
2961
2962 @item font-lock-string-face
2963 @vindex font-lock-string-face
2964 for string constants.
2965
2966 @item font-lock-doc-face
2967 @vindex font-lock-doc-face
2968 for documentation strings in the code. This inherits, by default, from
2969 @code{font-lock-string-face}.
2970
2971 @item font-lock-negation-char-face
2972 @vindex font-lock-negation-char-face
2973 for easily-overlooked negation characters.
2974 @end table
2975
2976 @node Syntactic Font Lock
2977 @subsection Syntactic Font Lock
2978 @cindex syntactic font lock
2979
2980 Syntactic fontification uses the syntax table to find comments and
2981 string constants (@pxref{Syntax Tables}). It highlights them using
2982 @code{font-lock-comment-face} and @code{font-lock-string-face}
2983 (@pxref{Faces for Font Lock}), or whatever
2984 @code{font-lock-syntactic-face-function} chooses. There are several
2985 variables that affect syntactic fontification; you should set them by
2986 means of @code{font-lock-defaults} (@pxref{Font Lock Basics}).
2987
2988 @defvar font-lock-keywords-only
2989 Non-@code{nil} means Font Lock should not do syntactic fontification;
2990 it should only fontify based on @code{font-lock-keywords}. The normal
2991 way for a mode to set this variable to @code{t} is with
2992 @var{keywords-only} in @code{font-lock-defaults}.
2993 @end defvar
2994
2995 @defvar font-lock-syntax-table
2996 This variable holds the syntax table to use for fontification of
2997 comments and strings. Specify it using @var{syntax-alist} in
2998 @code{font-lock-defaults}. If this is @code{nil}, fontification uses
2999 the buffer's syntax table.
3000 @end defvar
3001
3002 @defvar font-lock-beginning-of-syntax-function
3003 If this variable is non-@code{nil}, it should be a function to move
3004 point back to a position that is syntactically at ``top level'' and
3005 outside of strings or comments. Font Lock uses this when necessary
3006 to get the right results for syntactic fontification.
3007
3008 This function is called with no arguments. It should leave point at
3009 the beginning of any enclosing syntactic block. Typical values are
3010 @code{beginning-of-line} (used when the start of the line is known to
3011 be outside a syntactic block), or @code{beginning-of-defun} for
3012 programming modes, or @code{backward-paragraph} for textual modes.
3013
3014 If the value is @code{nil}, Font Lock uses
3015 @code{syntax-begin-function} to move back outside of any comment,
3016 string, or sexp. This variable is semi-obsolete; we recommend setting
3017 @code{syntax-begin-function} instead.
3018
3019 Specify this variable using @var{syntax-begin} in
3020 @code{font-lock-defaults}.
3021 @end defvar
3022
3023 @defvar font-lock-syntactic-face-function
3024 A function to determine which face to use for a given syntactic
3025 element (a string or a comment). The function is called with one
3026 argument, the parse state at point returned by
3027 @code{parse-partial-sexp}, and should return a face. The default
3028 value returns @code{font-lock-comment-face} for comments and
3029 @code{font-lock-string-face} for strings.
3030
3031 This can be used to highlighting different kinds of strings or
3032 comments differently. It is also sometimes abused together with
3033 @code{font-lock-syntactic-keywords} to highlight constructs that span
3034 multiple lines, but this is too esoteric to document here.
3035
3036 Specify this variable using @var{other-vars} in
3037 @code{font-lock-defaults}.
3038 @end defvar
3039
3040 @node Setting Syntax Properties
3041 @subsection Setting Syntax Properties
3042
3043 Font Lock mode can be used to update @code{syntax-table} properties
3044 automatically (@pxref{Syntax Properties}). This is useful in
3045 languages for which a single syntax table by itself is not sufficient.
3046
3047 @defvar font-lock-syntactic-keywords
3048 This variable enables and controls updating @code{syntax-table}
3049 properties by Font Lock. Its value should be a list of elements of
3050 this form:
3051
3052 @example
3053 (@var{matcher} @var{subexp} @var{syntax} @var{override} @var{laxmatch})
3054 @end example
3055
3056 The parts of this element have the same meanings as in the corresponding
3057 sort of element of @code{font-lock-keywords},
3058
3059 @example
3060 (@var{matcher} @var{subexp} @var{facespec} @var{override} @var{laxmatch})
3061 @end example
3062
3063 However, instead of specifying the value @var{facespec} to use for the
3064 @code{face} property, it specifies the value @var{syntax} to use for
3065 the @code{syntax-table} property. Here, @var{syntax} can be a string
3066 (as taken by @code{modify-syntax-entry}), a syntax table, a cons cell
3067 (as returned by @code{string-to-syntax}), or an expression whose value
3068 is one of those two types. @var{override} cannot be @code{prepend} or
3069 @code{append}.
3070
3071 For example, an element of the form:
3072
3073 @example
3074 ("\\$\\(#\\)" 1 ".")
3075 @end example
3076
3077 highlights syntactically a hash character when following a dollar
3078 character, with a SYNTAX of @code{"."} (meaning punctuation syntax).
3079 Assuming that the buffer syntax table specifies hash characters to
3080 have comment start syntax, the element will only highlight hash
3081 characters that do not follow dollar characters as comments
3082 syntactically.
3083
3084 An element of the form:
3085
3086 @example
3087 ("\\('\\).\\('\\)"
3088 (1 "\"")
3089 (2 "\""))
3090 @end example
3091
3092 highlights syntactically both single quotes which surround a single
3093 character, with a SYNTAX of @code{"\""} (meaning string quote syntax).
3094 Assuming that the buffer syntax table does not specify single quotes
3095 to have quote syntax, the element will only highlight single quotes of
3096 the form @samp{'@var{c}'} as strings syntactically. Other forms, such
3097 as @samp{foo'bar} or @samp{'fubar'}, will not be highlighted as
3098 strings.
3099
3100 Major modes normally set this variable with @var{other-vars} in
3101 @code{font-lock-defaults}.
3102 @end defvar
3103
3104 @node Multiline Font Lock
3105 @subsection Multiline Font Lock Constructs
3106 @cindex multiline font lock
3107
3108 Normally, elements of @code{font-lock-keywords} should not match
3109 across multiple lines; that doesn't work reliably, because Font Lock
3110 usually scans just part of the buffer, and it can miss a multi-line
3111 construct that crosses the line boundary where the scan starts. (The
3112 scan normally starts at the beginning of a line.)
3113
3114 Making elements that match multiline constructs work properly has
3115 two aspects: correct @emph{identification} and correct
3116 @emph{rehighlighting}. The first means that Font Lock finds all
3117 multiline constructs. The second means that Font Lock will correctly
3118 rehighlight all the relevant text when a multiline construct is
3119 changed---for example, if some of the text that was previously part of
3120 a multiline construct ceases to be part of it. The two aspects are
3121 closely related, and often getting one of them to work will appear to
3122 make the other also work. However, for reliable results you must
3123 attend explicitly to both aspects.
3124
3125 There are three ways to ensure correct identification of multiline
3126 constructs:
3127
3128 @itemize
3129 @item
3130 Add a function to @code{font-lock-extend-region-functions} that does
3131 the @emph{identification} and extends the scan so that the scanned
3132 text never starts or ends in the middle of a multiline construct.
3133 @item
3134 Use the @code{font-lock-fontify-region-function} hook similarly to
3135 extend the scan so that the scanned text never starts or ends in the
3136 middle of a multiline construct.
3137 @item
3138 Somehow identify the multiline construct right when it gets inserted
3139 into the buffer (or at any point after that but before font-lock
3140 tries to highlight it), and mark it with a @code{font-lock-multiline}
3141 which will instruct font-lock not to start or end the scan in the
3142 middle of the construct.
3143 @end itemize
3144
3145 There are three ways to do rehighlighting of multiline constructs:
3146
3147 @itemize
3148 @item
3149 Place a @code{font-lock-multiline} property on the construct. This
3150 will rehighlight the whole construct if any part of it is changed. In
3151 some cases you can do this automatically by setting the
3152 @code{font-lock-multiline} variable, which see.
3153 @item
3154 Make sure @code{jit-lock-contextually} is set and rely on it doing its
3155 job. This will only rehighlight the part of the construct that
3156 follows the actual change, and will do it after a short delay.
3157 This only works if the highlighting of the various parts of your
3158 multiline construct never depends on text in subsequent lines.
3159 Since @code{jit-lock-contextually} is activated by default, this can
3160 be an attractive solution.
3161 @item
3162 Place a @code{jit-lock-defer-multiline} property on the construct.
3163 This works only if @code{jit-lock-contextually} is used, and with the
3164 same delay before rehighlighting, but like @code{font-lock-multiline},
3165 it also handles the case where highlighting depends on
3166 subsequent lines.
3167 @end itemize
3168
3169 @menu
3170 * Font Lock Multiline:: Marking multiline chunks with a text property.
3171 * Region to Refontify:: Controlling which region gets refontified
3172 after a buffer change.
3173 @end menu
3174
3175 @node Font Lock Multiline
3176 @subsubsection Font Lock Multiline
3177
3178 One way to ensure reliable rehighlighting of multiline Font Lock
3179 constructs is to put on them the text property @code{font-lock-multiline}.
3180 It should be present and non-@code{nil} for text that is part of a
3181 multiline construct.
3182
3183 When Font Lock is about to highlight a range of text, it first
3184 extends the boundaries of the range as necessary so that they do not
3185 fall within text marked with the @code{font-lock-multiline} property.
3186 Then it removes any @code{font-lock-multiline} properties from the
3187 range, and highlights it. The highlighting specification (mostly
3188 @code{font-lock-keywords}) must reinstall this property each time,
3189 whenever it is appropriate.
3190
3191 @strong{Warning:} don't use the @code{font-lock-multiline} property
3192 on large ranges of text, because that will make rehighlighting slow.
3193
3194 @defvar font-lock-multiline
3195 If the @code{font-lock-multiline} variable is set to @code{t}, Font
3196 Lock will try to add the @code{font-lock-multiline} property
3197 automatically on multiline constructs. This is not a universal
3198 solution, however, since it slows down Font Lock somewhat. It can
3199 miss some multiline constructs, or make the property larger or smaller
3200 than necessary.
3201
3202 For elements whose @var{matcher} is a function, the function should
3203 ensure that submatch 0 covers the whole relevant multiline construct,
3204 even if only a small subpart will be highlighted. It is often just as
3205 easy to add the @code{font-lock-multiline} property by hand.
3206 @end defvar
3207
3208 The @code{font-lock-multiline} property is meant to ensure proper
3209 refontification; it does not automatically identify new multiline
3210 constructs. Identifying the requires that Font-Lock operate on large
3211 enough chunks at a time. This will happen by accident on many cases,
3212 which may give the impression that multiline constructs magically work.
3213 If you set the @code{font-lock-multiline} variable non-@code{nil},
3214 this impression will be even stronger, since the highlighting of those
3215 constructs which are found will be properly updated from then on.
3216 But that does not work reliably.
3217
3218 To find multiline constructs reliably, you must either manually
3219 place the @code{font-lock-multiline} property on the text before
3220 Font-Lock looks at it, or use
3221 @code{font-lock-fontify-region-function}.
3222
3223 @node Region to Refontify
3224 @subsubsection Region to Fontify after a Buffer Change
3225
3226 When a buffer is changed, the region that Font Lock refontifies is
3227 by default the smallest sequence of whole lines that spans the change.
3228 While this works well most of the time, sometimes it doesn't---for
3229 example, when a change alters the syntactic meaning of text on an
3230 earlier line.
3231
3232 You can enlarge (or even reduce) the region to refontify by setting
3233 the following variable:
3234
3235 @defvar font-lock-extend-after-change-region-function
3236 This buffer-local variable is either @code{nil} or a function for
3237 Font-Lock to call to determine the region to scan and fontify.
3238
3239 The function is given three parameters, the standard @var{beg},
3240 @var{end}, and @var{old-len} from @code{after-change-functions}
3241 (@pxref{Change Hooks}). It should return either a cons of the
3242 beginning and end buffer positions (in that order) of the region to
3243 fontify, or @code{nil} (which means choose the region in the standard
3244 way). This function needs to preserve point, the match-data, and the
3245 current restriction. The region it returns may start or end in the
3246 middle of a line.
3247
3248 Since this function is called after every buffer change, it should be
3249 reasonably fast.
3250 @end defvar
3251
3252 @node Auto-Indentation
3253 @section Auto-indentation of code
3254
3255 For programming languages, an important feature of a major mode is to
3256 provide automatic indentation. This is controlled in Emacs by
3257 @code{indent-line-function} (@pxref{Mode-Specific Indent}).
3258 Writing a good indentation function can be difficult and to a large
3259 extent it is still a black art.
3260
3261 Many major mode authors will start by writing a simple indentation
3262 function that works for simple cases, for example by comparing with the
3263 indentation of the previous text line. For most programming languages
3264 that are not really line-based, this tends to scale very poorly:
3265 improving such a function to let it handle more diverse situations tends
3266 to become more and more difficult, resulting in the end with a large,
3267 complex, unmaintainable indentation function which nobody dares to touch.
3268
3269 A good indentation function will usually need to actually parse the
3270 text, according to the syntax of the language. Luckily, it is not
3271 necessary to parse the text in as much detail as would be needed
3272 for a compiler, but on the other hand, the parser embedded in the
3273 indentation code will want to be somewhat friendly to syntactically
3274 incorrect code.
3275
3276 Good maintainable indentation functions usually fall into 2 categories:
3277 either parsing forward from some ``safe'' starting point until the
3278 position of interest, or parsing backward from the position of interest.
3279 Neither of the two is a clearly better choice than the other: parsing
3280 backward is often more difficult than parsing forward because
3281 programming languages are designed to be parsed forward, but for the
3282 purpose of indentation it has the advantage of not needing to
3283 guess a ``safe'' starting point, and it generally enjoys the property
3284 that only a minimum of text will be analyzed to decide the indentation
3285 of a line, so indentation will tend to be unaffected by syntax errors in
3286 some earlier unrelated piece of code. Parsing forward on the other hand
3287 is usually easier and has the advantage of making it possible to
3288 reindent efficiently a whole region at a time, with a single parse.
3289
3290 Rather than write your own indentation function from scratch, it is
3291 often preferable to try and reuse some existing ones or to rely
3292 on a generic indentation engine. There are sadly few such
3293 engines. The CC-mode indentation code (used with C, C++, Java, Awk
3294 and a few other such modes) has been made more generic over the years,
3295 so if your language seems somewhat similar to one of those languages,
3296 you might try to use that engine. @c FIXME: documentation?
3297 Another one is SMIE which takes an approach in the spirit
3298 of Lisp sexps and adapts it to non-Lisp languages.
3299
3300 @menu
3301 * SMIE:: A simple minded indentation engine
3302 @end menu
3303
3304 @node SMIE
3305 @subsection Simple Minded Indentation Engine
3306
3307 SMIE is a package that provides a generic navigation and indentation
3308 engine. Based on a very simple parser using an ``operator precedence
3309 grammar'', it lets major modes extend the sexp-based navigation of Lisp
3310 to non-Lisp languages as well as provide a simple to use but reliable
3311 auto-indentation.
3312
3313 Operator precedence grammar is a very primitive technology for parsing
3314 compared to some of the more common techniques used in compilers.
3315 It has the following characteristics: its parsing power is very limited,
3316 and it is largely unable to detect syntax errors, but it has the
3317 advantage of being algorithmically efficient and able to parse forward
3318 just as well as backward. In practice that means that SMIE can use it
3319 for indentation based on backward parsing, that it can provide both
3320 @code{forward-sexp} and @code{backward-sexp} functionality, and that it
3321 will naturally work on syntactically incorrect code without any extra
3322 effort. The downside is that it also means that most programming
3323 languages cannot be parsed correctly using SMIE, at least not without
3324 resorting to some special tricks (@pxref{SMIE Tricks}).
3325
3326 @menu
3327 * SMIE setup:: SMIE setup and features
3328 * Operator Precedence Grammars:: A very simple parsing technique
3329 * SMIE Grammar:: Defining the grammar of a language
3330 * SMIE Lexer:: Defining tokens
3331 * SMIE Tricks:: Working around the parser's limitations
3332 * SMIE Indentation:: Specifying indentation rules
3333 * SMIE Indentation Helpers:: Helper functions for indentation rules
3334 * SMIE Indentation Example:: Sample indentation rules
3335 @end menu
3336
3337 @node SMIE setup
3338 @subsubsection SMIE Setup and Features
3339
3340 SMIE is meant to be a one-stop shop for structural navigation and
3341 various other features which rely on the syntactic structure of code, in
3342 particular automatic indentation. The main entry point is
3343 @code{smie-setup} which is a function typically called while setting
3344 up a major mode.
3345
3346 @defun smie-setup grammar rules-function &rest keywords
3347 Setup SMIE navigation and indentation.
3348 @var{grammar} is a grammar table generated by @code{smie-prec2->grammar}.
3349 @var{rules-function} is a set of indentation rules for use on
3350 @code{smie-rules-function}.
3351 @var{keywords} are additional arguments, which can include the following
3352 keywords:
3353 @itemize
3354 @item
3355 @code{:forward-token} @var{fun}: Specify the forward lexer to use.
3356 @item
3357 @code{:backward-token} @var{fun}: Specify the backward lexer to use.
3358 @end itemize
3359 @end defun
3360
3361 Calling this function is sufficient to make commands such as
3362 @code{forward-sexp}, @code{backward-sexp}, and @code{transpose-sexps} be
3363 able to properly handle structural elements other than just the paired
3364 parentheses already handled by syntax tables. For example, if the
3365 provided grammar is precise enough, @code{transpose-sexps} can correctly
3366 transpose the two arguments of a @code{+} operator, taking into account
3367 the precedence rules of the language.
3368
3369 Calling `smie-setup' is also sufficient to make TAB indentation work in
3370 the expected way, extends @code{blink-matching-paren} to apply to
3371 elements like @code{begin...end}, and provides some commands that you
3372 can bind in the major mode keymap.
3373
3374 @deffn Command smie-close-block
3375 This command closes the most recently opened (and not yet closed) block.
3376 @end deffn
3377
3378 @deffn Command smie-down-list &optional arg
3379 This command is like @code{down-list} but it also pays attention to
3380 nesting of tokens other than parentheses, such as @code{begin...end}.
3381 @end deffn
3382
3383 @node Operator Precedence Grammars
3384 @subsubsection Operator Precedence Grammars
3385
3386 SMIE's precedence grammars simply give to each token a pair of
3387 precedences: the left-precedence and the right-precedence. We say
3388 @code{T1 < T2} if the right-precedence of token @code{T1} is less than
3389 the left-precedence of token @code{T2}. A good way to read this
3390 @code{<} is as a kind of parenthesis: if we find @code{... T1 something
3391 T2 ...} then that should be parsed as @code{... T1 (something T2 ...}
3392 rather than as @code{... T1 something) T2 ...}. The latter
3393 interpretation would be the case if we had @code{T1 > T2}. If we have
3394 @code{T1 = T2}, it means that token T2 follows token T1 in the same
3395 syntactic construction, so typically we have @code{"begin" = "end"}.
3396 Such pairs of precedences are sufficient to express left-associativity
3397 or right-associativity of infix operators, nesting of tokens like
3398 parentheses and many other cases.
3399
3400 @c Let's leave this undocumented to leave it more open for change!
3401 @c @defvar smie-grammar
3402 @c The value of this variable is an alist specifying the left and right
3403 @c precedence of each token. It is meant to be initialized by using one of
3404 @c the functions below.
3405 @c @end defvar
3406
3407 @defun smie-prec2->grammar table
3408 This function takes a @emph{prec2} grammar @var{table} and returns an
3409 alist suitable for use in @code{smie-setup}. The @emph{prec2}
3410 @var{table} is itself meant to be built by one of the functions below.
3411 @end defun
3412
3413 @defun smie-merge-prec2s &rest tables
3414 This function takes several @emph{prec2} @var{tables} and merges them
3415 into a new @emph{prec2} table.
3416 @end defun
3417
3418 @defun smie-precs->prec2 precs
3419 This function builds a @emph{prec2} table from a table of precedences
3420 @var{precs}. @var{precs} should be a list, sorted by precedence (for
3421 example @code{"+"} will come before @code{"*"}), of elements of the form
3422 @code{(@var{assoc} @var{op} ...)}, where each @var{op} is a token that
3423 acts as an operator; @var{assoc} is their associativity, which can be
3424 either @code{left}, @code{right}, @code{assoc}, or @code{nonassoc}.
3425 All operators in a given element share the same precedence level
3426 and associativity.
3427 @end defun
3428
3429 @defun smie-bnf->prec2 bnf &rest resolvers
3430 This function lets you specify the grammar using a BNF notation.
3431 It accepts a @var{bnf} description of the grammar along with a set of
3432 conflict resolution rules @var{resolvers}, and
3433 returns a @emph{prec2} table.
3434
3435 @var{bnf} is a list of nonterminal definitions of the form
3436 @code{(@var{nonterm} @var{rhs1} @var{rhs2} ...)} where each @var{rhs}
3437 is a (non-empty) list of terminals (aka tokens) or non-terminals.
3438
3439 Not all grammars are accepted:
3440 @itemize
3441 @item
3442 An @var{rhs} cannot be an empty list (an empty list is never needed,
3443 since SMIE allows all non-terminals to match the empty string anyway).
3444 @item
3445 An @var{rhs} cannot have 2 consecutive non-terminals: each pair of
3446 non-terminals needs to be separated by a terminal (aka token).
3447 This is a fundamental limitation of operator precedence grammars.
3448 @end itemize
3449
3450 Additionally, conflicts can occur:
3451 @itemize
3452 @item
3453 The returned @emph{prec2} table holds constraints between pairs of tokens, and
3454 for any given pair only one constraint can be present: T1 < T2,
3455 T1 = T2, or T1 > T2.
3456 @item
3457 A token can be an @code{opener} (something similar to an open-paren),
3458 a @code{closer} (like a close-paren), or @code{neither} of the two
3459 (e.g. an infix operator, or an inner token like @code{"else"}).
3460 @end itemize
3461
3462 Precedence conflicts can be resolved via @var{resolvers}, which
3463 is a list of @emph{precs} tables (see @code{smie-precs->prec2}): for
3464 each precedence conflict, if those @code{precs} tables
3465 specify a particular constraint, then the conflict is resolved by using
3466 this constraint instead, else a conflict is reported and one of the
3467 conflicting constraints is picked arbitrarily and the others are
3468 simply ignored.
3469 @end defun
3470
3471 @node SMIE Grammar
3472 @subsubsection Defining the Grammar of a Language
3473
3474 The usual way to define the SMIE grammar of a language is by
3475 defining a new global variable that holds the precedence table by
3476 giving a set of BNF rules.
3477 For example, the grammar definition for a small Pascal-like language
3478 could look like:
3479 @example
3480 @group
3481 (require 'smie)
3482 (defvar sample-smie-grammar
3483 (smie-prec2->grammar
3484 (smie-bnf->prec2
3485 @end group
3486 @group
3487 '((id)
3488 (inst ("begin" insts "end")
3489 ("if" exp "then" inst "else" inst)
3490 (id ":=" exp)
3491 (exp))
3492 (insts (insts ";" insts) (inst))
3493 (exp (exp "+" exp)
3494 (exp "*" exp)
3495 ("(" exps ")"))
3496 (exps (exps "," exps) (exp)))
3497 @end group
3498 @group
3499 '((assoc ";"))
3500 '((assoc ","))
3501 '((assoc "+") (assoc "*")))))
3502 @end group
3503 @end example
3504
3505 @noindent
3506 A few things to note:
3507
3508 @itemize
3509 @item
3510 The above grammar does not explicitly mention the syntax of function
3511 calls: SMIE will automatically allow any sequence of sexps, such as
3512 identifiers, balanced parentheses, or @code{begin ... end} blocks
3513 to appear anywhere anyway.
3514 @item
3515 The grammar category @code{id} has no right hand side: this does not
3516 mean that it can match only the empty string, since as mentioned any
3517 sequence of sexps can appear anywhere anyway.
3518 @item
3519 Because non terminals cannot appear consecutively in the BNF grammar, it
3520 is difficult to correctly handle tokens that act as terminators, so the
3521 above grammar treats @code{";"} as a statement @emph{separator} instead,
3522 which SMIE can handle very well.
3523 @item
3524 Separators used in sequences (such as @code{","} and @code{";"} above)
3525 are best defined with BNF rules such as @code{(foo (foo "separator" foo) ...)}
3526 which generate precedence conflicts which are then resolved by giving
3527 them an explicit @code{(assoc "separator")}.
3528 @item
3529 The @code{("(" exps ")")} rule was not needed to pair up parens, since
3530 SMIE will pair up any characters that are marked as having paren syntax
3531 in the syntax table. What this rule does instead (together with the
3532 definition of @code{exps}) is to make it clear that @code{","} should
3533 not appear outside of parentheses.
3534 @item
3535 Rather than have a single @emph{precs} table to resolve conflicts, it is
3536 preferable to have several tables, so as to let the BNF part of the
3537 grammar specify relative precedences where possible.
3538 @item
3539 Unless there is a very good reason to prefer @code{left} or
3540 @code{right}, it is usually preferable to mark operators as associative,
3541 using @code{assoc}. For that reason @code{"+"} and @code{"*"} are
3542 defined above as @code{assoc}, although the language defines them
3543 formally as left associative.
3544 @end itemize
3545
3546 @node SMIE Lexer
3547 @subsubsection Defining Tokens
3548
3549 SMIE comes with a predefined lexical analyzer which uses syntax tables
3550 in the following way: any sequence of characters that have word or
3551 symbol syntax is considered a token, and so is any sequence of
3552 characters that have punctuation syntax. This default lexer is
3553 often a good starting point but is rarely actually correct for any given
3554 language. For example, it will consider @code{"2,+3"} to be composed
3555 of 3 tokens: @code{"2"}, @code{",+"}, and @code{"3"}.
3556
3557 To describe the lexing rules of your language to SMIE, you need
3558 2 functions, one to fetch the next token, and another to fetch the
3559 previous token. Those functions will usually first skip whitespace and
3560 comments and then look at the next chunk of text to see if it
3561 is a special token. If so it should skip the token and
3562 return a description of this token. Usually this is simply the string
3563 extracted from the buffer, but it can be anything you want.
3564 For example:
3565 @example
3566 @group
3567 (defvar sample-keywords-regexp
3568 (regexp-opt '("+" "*" "," ";" ">" ">=" "<" "<=" ":=" "=")))
3569 @end group
3570 @group
3571 (defun sample-smie-forward-token ()
3572 (forward-comment (point-max))
3573 (cond
3574 ((looking-at sample-keywords-regexp)
3575 (goto-char (match-end 0))
3576 (match-string-no-properties 0))
3577 (t (buffer-substring-no-properties
3578 (point)
3579 (progn (skip-syntax-forward "w_")
3580 (point))))))
3581 @end group
3582 @group
3583 (defun sample-smie-backward-token ()
3584 (forward-comment (- (point)))
3585 (cond
3586 ((looking-back sample-keywords-regexp (- (point) 2) t)
3587 (goto-char (match-beginning 0))
3588 (match-string-no-properties 0))
3589 (t (buffer-substring-no-properties
3590 (point)
3591 (progn (skip-syntax-backward "w_")
3592 (point))))))
3593 @end group
3594 @end example
3595
3596 Notice how those lexers return the empty string when in front of
3597 parentheses. This is because SMIE automatically takes care of the
3598 parentheses defined in the syntax table. More specifically if the lexer
3599 returns nil or an empty string, SMIE tries to handle the corresponding
3600 text as a sexp according to syntax tables.
3601
3602 @node SMIE Tricks
3603 @subsubsection Living With a Weak Parser
3604
3605 The parsing technique used by SMIE does not allow tokens to behave
3606 differently in different contexts. For most programming languages, this
3607 manifests itself by precedence conflicts when converting the
3608 BNF grammar.
3609
3610 Sometimes, those conflicts can be worked around by expressing the
3611 grammar slightly differently. For example, for Modula-2 it might seem
3612 natural to have a BNF grammar that looks like this:
3613
3614 @example
3615 ...
3616 (inst ("IF" exp "THEN" insts "ELSE" insts "END")
3617 ("CASE" exp "OF" cases "END")
3618 ...)
3619 (cases (cases "|" cases)
3620 (caselabel ":" insts)
3621 ("ELSE" insts))
3622 ...
3623 @end example
3624
3625 But this will create conflicts for @code{"ELSE"}: on the one hand, the
3626 IF rule implies (among many other things) that @code{"ELSE" = "END"};
3627 but on the other hand, since @code{"ELSE"} appears within @code{cases},
3628 which appears left of @code{"END"}, we also have @code{"ELSE" > "END"}.
3629 We can solve the conflict either by using:
3630 @example
3631 ...
3632 (inst ("IF" exp "THEN" insts "ELSE" insts "END")
3633 ("CASE" exp "OF" cases "END")
3634 ("CASE" exp "OF" cases "ELSE" insts "END")
3635 ...)
3636 (cases (cases "|" cases) (caselabel ":" insts))
3637 ...
3638 @end example
3639 or
3640 @example
3641 ...
3642 (inst ("IF" exp "THEN" else "END")
3643 ("CASE" exp "OF" cases "END")
3644 ...)
3645 (else (insts "ELSE" insts))
3646 (cases (cases "|" cases) (caselabel ":" insts) (else))
3647 ...
3648 @end example
3649
3650 Reworking the grammar to try and solve conflicts has its downsides, tho,
3651 because SMIE assumes that the grammar reflects the logical structure of
3652 the code, so it is preferable to keep the BNF closer to the intended
3653 abstract syntax tree.
3654
3655 Other times, after careful consideration you may conclude that those
3656 conflicts are not serious and simply resolve them via the
3657 @var{resolvers} argument of @code{smie-bnf->prec2}. Usually this is
3658 because the grammar is simply ambiguous: the conflict does not affect
3659 the set of programs described by the grammar, but only the way those
3660 programs are parsed. This is typically the case for separators and
3661 associative infix operators, where you want to add a resolver like
3662 @code{'((assoc "|"))}. Another case where this can happen is for the
3663 classic @emph{dangling else} problem, where you will use @code{'((assoc
3664 "else" "then"))}. It can also happen for cases where the conflict is
3665 real and cannot really be resolved, but it is unlikely to pose a problem
3666 in practice.
3667
3668 Finally, in many cases some conflicts will remain despite all efforts to
3669 restructure the grammar. Do not despair: while the parser cannot be
3670 made more clever, you can make the lexer as smart as you want. So, the
3671 solution is then to look at the tokens involved in the conflict and to
3672 split one of those tokens into 2 (or more) different tokens. E.g. if
3673 the grammar needs to distinguish between two incompatible uses of the
3674 token @code{"begin"}, make the lexer return different tokens (say
3675 @code{"begin-fun"} and @code{"begin-plain"}) depending on which kind of
3676 @code{"begin"} it finds. This pushes the work of distinguishing the
3677 different cases to the lexer, which will thus have to look at the
3678 surrounding text to find ad-hoc clues.
3679
3680 @node SMIE Indentation
3681 @subsubsection Specifying Indentation Rules
3682
3683 Based on the provided grammar, SMIE will be able to provide automatic
3684 indentation without any extra effort. But in practice, this default
3685 indentation style will probably not be good enough. You will want to
3686 tweak it in many different cases.
3687
3688 SMIE indentation is based on the idea that indentation rules should be
3689 as local as possible. To this end, it relies on the idea of
3690 @emph{virtual} indentation, which is the indentation that a particular
3691 program point would have if it were at the beginning of a line.
3692 Of course, if that program point is indeed at the beginning of a line,
3693 its virtual indentation is its current indentation. But if not, then
3694 SMIE uses the indentation algorithm to compute the virtual indentation
3695 of that point. Now in practice, the virtual indentation of a program
3696 point does not have to be identical to the indentation it would have if
3697 we inserted a newline before it. To see how this works, the SMIE rule
3698 for indentation after a @code{@{} in C does not care whether the
3699 @code{@{} is standing on a line of its own or is at the end of the
3700 preceding line. Instead, these different cases are handled in the
3701 indentation rule that decides how to indent before a @code{@{}.
3702
3703 Another important concept is the notion of @emph{parent}: The
3704 @emph{parent} of a token, is the head token of the nearest enclosing
3705 syntactic construct. For example, the parent of an @code{else} is the
3706 @code{if} to which it belongs, and the parent of an @code{if}, in turn,
3707 is the lead token of the surrounding construct. The command
3708 @code{backward-sexp} jumps from a token to its parent, but there are
3709 some caveats: for @emph{openers} (tokens which start a construct, like
3710 @code{if}), you need to start with point before the token, while for
3711 others you need to start with point after the token.
3712 @code{backward-sexp} stops with point before the parent token if that is
3713 the @emph{opener} of the token of interest, and otherwise it stops with
3714 point after the parent token.
3715
3716 SMIE indentation rules are specified using a function that takes two
3717 arguments @var{method} and @var{arg} where the meaning of @var{arg} and the
3718 expected return value depend on @var{method}.
3719
3720 @var{method} can be:
3721 @itemize
3722 @item
3723 @code{:after}, in which case @var{arg} is a token and the function
3724 should return the @var{offset} to use for indentation after @var{arg}.
3725 @item
3726 @code{:before}, in which case @var{arg} is a token and the function
3727 should return the @var{offset} to use to indent @var{arg} itself.
3728 @item
3729 @code{:elem}, in which case the function should return either the offset
3730 to use to indent function arguments (if @var{arg} is the symbol
3731 @code{arg}) or the basic indentation step (if @var{arg} is the symbol
3732 @code{basic}).
3733 @item
3734 @code{:list-intro}, in which case @var{arg} is a token and the function
3735 should return non-@code{nil} if the token is followed by a list of
3736 expressions (not separated by any token) rather than an expression.
3737 @end itemize
3738
3739 When @var{arg} is a token, the function is called with point just before
3740 that token. A return value of nil always means to fallback on the
3741 default behavior, so the function should return nil for arguments it
3742 does not expect.
3743
3744 @var{offset} can be:
3745 @itemize
3746 @item
3747 @code{nil}: use the default indentation rule.
3748 @item
3749 @code{(column . @var{column})}: indent to column @var{column}.
3750 @item
3751 @var{number}: offset by @var{number}, relative to a base token which is
3752 the current token for @code{:after} and its parent for @code{:before}.
3753 @end itemize
3754
3755 @node SMIE Indentation Helpers
3756 @subsubsection Helper Functions for Indentation Rules
3757
3758 SMIE provides various functions designed specifically for use in the
3759 indentation rules function (several of those functions break if used in
3760 another context). These functions all start with the prefix
3761 @code{smie-rule-}.
3762
3763 @defun smie-rule-bolp
3764 Return non-@code{nil} if the current token is the first on the line.
3765 @end defun
3766
3767 @defun smie-rule-hanging-p
3768 Return non-@code{nil} if the current token is @emph{hanging}.
3769 A token is @emph{hanging} if it is the last token on the line
3770 and if it is preceded by other tokens: a lone token on a line is not
3771 hanging.
3772 @end defun
3773
3774 @defun smie-rule-next-p &rest tokens
3775 Return non-@code{nil} if the next token is among @var{tokens}.
3776 @end defun
3777
3778 @defun smie-rule-prev-p &rest tokens
3779 Return non-@code{nil} if the previous token is among @var{tokens}.
3780 @end defun
3781
3782 @defun smie-rule-parent-p &rest parents
3783 Return non-@code{nil} if the current token's parent is among @var{parents}.
3784 @end defun
3785
3786 @defun smie-rule-sibling-p
3787 Return non-nil if the current token's parent is actually a sibling.
3788 This is the case for example when the parent of a @code{","} is just the
3789 previous @code{","}.
3790 @end defun
3791
3792 @defun smie-rule-parent &optional offset
3793 Return the proper offset to align the current token with the parent.
3794 If non-@code{nil}, @var{offset} should be an integer giving an
3795 additional offset to apply.
3796 @end defun
3797
3798 @defun smie-rule-separator method
3799 Indent current token as a @emph{separator}.
3800
3801 By @emph{separator}, we mean here a token whose sole purpose is to
3802 separate various elements within some enclosing syntactic construct, and
3803 which does not have any semantic significance in itself (i.e. it would
3804 typically not exist as a node in an abstract syntax tree).
3805
3806 Such a token is expected to have an associative syntax and be closely
3807 tied to its syntactic parent. Typical examples are @code{","} in lists
3808 of arguments (enclosed inside parentheses), or @code{";"} in sequences
3809 of instructions (enclosed in a @code{@{...@}} or @code{begin...end}
3810 block).
3811
3812 @var{method} should be the method name that was passed to
3813 `smie-rules-function'.
3814 @end defun
3815
3816 @node SMIE Indentation Example
3817 @subsubsection Sample Indentation Rules
3818
3819 Here is an example of an indentation function:
3820
3821 @example
3822 (defun sample-smie-rules (kind token)
3823 (pcase (cons kind token)
3824 (`(:elem . basic) sample-indent-basic)
3825 (`(,_ . ",") (smie-rule-separator kind))
3826 (`(:after . ":=") sample-indent-basic)
3827 (`(:before . ,(or `"begin" `"(" `"@{")))
3828 (if (smie-rule-hanging-p) (smie-rule-parent)))
3829 (`(:before . "if")
3830 (and (not (smie-rule-bolp)) (smie-rule-prev-p "else")
3831 (smie-rule-parent)))))
3832 @end example
3833
3834 @noindent
3835 A few things to note:
3836
3837 @itemize
3838 @item
3839 The first case indicates the basic indentation increment to use.
3840 If @code{sample-indent-basic} is nil, then SMIE uses the global
3841 setting @code{smie-indent-basic}. The major mode could have set
3842 @code{smie-indent-basic} buffer-locally instead, but that
3843 is discouraged.
3844
3845 @item
3846 The rule for the token @code{","} make SMIE try to be more clever when
3847 the comma separator is placed at the beginning of lines. It tries to
3848 outdent the separator so as to align the code after the comma; for
3849 example:
3850
3851 @example
3852 x = longfunctionname (
3853 arg1
3854 , arg2
3855 );
3856 @end example
3857
3858 @item
3859 The rule for indentation after @code{":="} exists because otherwise
3860 SMIE would treat @code{":="} as an infix operator and would align the
3861 right argument with the left one.
3862
3863 @item
3864 The rule for indentation before @code{"begin"} is an example of the use
3865 of virtual indentation: This rule is used only when @code{"begin"} is
3866 hanging, which can happen only when @code{"begin"} is not at the
3867 beginning of a line. So this is not used when indenting
3868 @code{"begin"} itself but only when indenting something relative to this
3869 @code{"begin"}. Concretely, this rule changes the indentation from:
3870
3871 @example
3872 if x > 0 then begin
3873 dosomething(x);
3874 end
3875 @end example
3876 to
3877 @example
3878 if x > 0 then begin
3879 dosomething(x);
3880 end
3881 @end example
3882
3883 @item
3884 The rule for indentation before @code{"if"} is similar to the one for
3885 @code{"begin"}, but where the purpose is to treat @code{"else if"}
3886 as a single unit, so as to align a sequence of tests rather than indent
3887 each test further to the right. This function does this only in the
3888 case where the @code{"if"} is not placed on a separate line, hence the
3889 @code{smie-rule-bolp} test.
3890
3891 If we know that the @code{"else"} is always aligned with its @code{"if"}
3892 and is always at the beginning of a line, we can use a more efficient
3893 rule:
3894 @example
3895 ((equal token "if")
3896 (and (not (smie-rule-bolp))
3897 (smie-rule-prev-p "else")
3898 (save-excursion
3899 (sample-smie-backward-token)
3900 (cons 'column (current-column)))))
3901 @end example
3902
3903 The advantage of this formulation is that it reuses the indentation of
3904 the previous @code{"else"}, rather than going all the way back to the
3905 first @code{"if"} of the sequence.
3906 @end itemize
3907
3908 @node Desktop Save Mode
3909 @section Desktop Save Mode
3910 @cindex desktop save mode
3911
3912 @dfn{Desktop Save Mode} is a feature to save the state of Emacs from
3913 one session to another. The user-level commands for using Desktop
3914 Save Mode are described in the GNU Emacs Manual (@pxref{Saving Emacs
3915 Sessions,,, emacs, the GNU Emacs Manual}). Modes whose buffers visit
3916 a file, don't have to do anything to use this feature.
3917
3918 For buffers not visiting a file to have their state saved, the major
3919 mode must bind the buffer local variable @code{desktop-save-buffer} to
3920 a non-@code{nil} value.
3921
3922 @defvar desktop-save-buffer
3923 If this buffer-local variable is non-@code{nil}, the buffer will have
3924 its state saved in the desktop file at desktop save. If the value is
3925 a function, it is called at desktop save with argument
3926 @var{desktop-dirname}, and its value is saved in the desktop file along
3927 with the state of the buffer for which it was called. When file names
3928 are returned as part of the auxiliary information, they should be
3929 formatted using the call
3930
3931 @example
3932 (desktop-file-name @var{file-name} @var{desktop-dirname})
3933 @end example
3934
3935 @end defvar
3936
3937 For buffers not visiting a file to be restored, the major mode must
3938 define a function to do the job, and that function must be listed in
3939 the alist @code{desktop-buffer-mode-handlers}.
3940
3941 @defvar desktop-buffer-mode-handlers
3942 Alist with elements
3943
3944 @example
3945 (@var{major-mode} . @var{restore-buffer-function})
3946 @end example
3947
3948 The function @var{restore-buffer-function} will be called with
3949 argument list
3950
3951 @example
3952 (@var{buffer-file-name} @var{buffer-name} @var{desktop-buffer-misc})
3953 @end example
3954
3955 and it should return the restored buffer.
3956 Here @var{desktop-buffer-misc} is the value returned by the function
3957 optionally bound to @code{desktop-save-buffer}.
3958 @end defvar
3959
3960 @ignore
3961 Local Variables:
3962 fill-column: 72
3963 End:
3964 @end ignore