Tweaks to lisp manual to remove over/underflows in printed version.
[bpt/emacs.git] / doc / lispref / modes.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990, 1991, 1992, 1993, 1994, 1995, 1998, 1999, 2001,
4 @c 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011 Free Software Foundation, Inc.
5 @c See the file elisp.texi for copying conditions.
6 @setfilename ../../info/modes
7 @node Modes, Documentation, Keymaps, Top
8 @chapter Major and Minor Modes
9 @cindex mode
10
11 A @dfn{mode} is a set of definitions that customize Emacs and can be
12 turned on and off while you edit. There are two varieties of modes:
13 @dfn{major modes}, which are mutually exclusive and used for editing
14 particular kinds of text, and @dfn{minor modes}, which provide features
15 that users can enable individually.
16
17 This chapter describes how to write both major and minor modes, how to
18 indicate them in the mode line, and how they run hooks supplied by the
19 user. For related topics such as keymaps and syntax tables, see
20 @ref{Keymaps}, and @ref{Syntax Tables}.
21
22 @menu
23 * Hooks:: How to use hooks; how to write code that provides hooks.
24 * Major Modes:: Defining major modes.
25 * Minor Modes:: Defining minor modes.
26 * Mode Line Format:: Customizing the text that appears in the mode line.
27 * Imenu:: How a mode can provide a menu
28 of definitions in the buffer.
29 * Font Lock Mode:: How modes can highlight text according to syntax.
30 * Auto-Indentation:: How to teach Emacs to indent for a major mode.
31 * Desktop Save Mode:: How modes can have buffer state saved between
32 Emacs sessions.
33 @end menu
34
35 @node Hooks
36 @section Hooks
37 @cindex hooks
38
39 A @dfn{hook} is a variable where you can store a function or functions
40 to be called on a particular occasion by an existing program. Emacs
41 provides hooks for the sake of customization. Most often, hooks are set
42 up in the init file (@pxref{Init File}), but Lisp programs can set them also.
43 @xref{Standard Hooks}, for a list of standard hook variables.
44
45 @cindex normal hook
46 Most of the hooks in Emacs are @dfn{normal hooks}. These variables
47 contain lists of functions to be called with no arguments. By
48 convention, whenever the hook name ends in @samp{-hook}, that tells
49 you it is normal. We try to make all hooks normal, as much as
50 possible, so that you can use them in a uniform way.
51
52 Every major mode function is supposed to run a normal hook called
53 the @dfn{mode hook} as the one of the last steps of initialization.
54 This makes it easy for a user to customize the behavior of the mode,
55 by overriding the buffer-local variable assignments already made by
56 the mode. Most minor mode functions also run a mode hook at the end.
57 But hooks are used in other contexts too. For example, the hook
58 @code{suspend-hook} runs just before Emacs suspends itself
59 (@pxref{Suspending Emacs}).
60
61 The recommended way to add a hook function to a normal hook is by
62 calling @code{add-hook} (see below). The hook functions may be any of
63 the valid kinds of functions that @code{funcall} accepts (@pxref{What
64 Is a Function}). Most normal hook variables are initially void;
65 @code{add-hook} knows how to deal with this. You can add hooks either
66 globally or buffer-locally with @code{add-hook}.
67
68 @cindex abnormal hook
69 If the hook variable's name does not end with @samp{-hook}, that
70 indicates it is probably an @dfn{abnormal hook}. That means the hook
71 functions are called with arguments, or their return values are used
72 in some way. The hook's documentation says how the functions are
73 called. You can use @code{add-hook} to add a function to an abnormal
74 hook, but you must write the function to follow the hook's calling
75 convention.
76
77 By convention, abnormal hook names end in @samp{-functions} or
78 @samp{-hooks}. If the variable's name ends in @samp{-function}, then
79 its value is just a single function, not a list of functions.
80
81 @menu
82 * Running Hooks:: How to run a hook.
83 * Setting Hooks:: How to put functions on a hook, or remove them.
84 @end menu
85
86 @node Running Hooks
87 @subsection Running Hooks
88
89 At the appropriate times, Emacs uses the @code{run-hooks} function
90 and the other functions below to run particular hooks.
91
92 @defun run-hooks &rest hookvars
93 This function takes one or more normal hook variable names as
94 arguments, and runs each hook in turn. Each argument should be a
95 symbol that is a normal hook variable. These arguments are processed
96 in the order specified.
97
98 If a hook variable has a non-@code{nil} value, that value should be a
99 list of functions. @code{run-hooks} calls all the functions, one by
100 one, with no arguments.
101
102 The hook variable's value can also be a single function---either a
103 lambda expression or a symbol with a function definition---which
104 @code{run-hooks} calls. But this usage is obsolete.
105 @end defun
106
107 @defun run-hook-with-args hook &rest args
108 This function is the way to run an abnormal hook and always call all
109 of the hook functions. It calls each of the hook functions one by
110 one, passing each of them the arguments @var{args}.
111 @end defun
112
113 @defun run-hook-with-args-until-failure hook &rest args
114 This function is the way to run an abnormal hook until one of the hook
115 functions fails. It calls each of the hook functions, passing each of
116 them the arguments @var{args}, until some hook function returns
117 @code{nil}. It then stops and returns @code{nil}. If none of the
118 hook functions return @code{nil}, it returns a non-@code{nil} value.
119 @end defun
120
121 @defun run-hook-with-args-until-success hook &rest args
122 This function is the way to run an abnormal hook until a hook function
123 succeeds. It calls each of the hook functions, passing each of them
124 the arguments @var{args}, until some hook function returns
125 non-@code{nil}. Then it stops, and returns whatever was returned by
126 the last hook function that was called. If all hook functions return
127 @code{nil}, it returns @code{nil} as well.
128 @end defun
129
130 @node Setting Hooks
131 @subsection Setting Hooks
132
133 Here's an example that uses a mode hook to turn on Auto Fill mode when
134 in Lisp Interaction mode:
135
136 @example
137 (add-hook 'lisp-interaction-mode-hook 'turn-on-auto-fill)
138 @end example
139
140 @defun add-hook hook function &optional append local
141 This function is the handy way to add function @var{function} to hook
142 variable @var{hook}. You can use it for abnormal hooks as well as for
143 normal hooks. @var{function} can be any Lisp function that can accept
144 the proper number of arguments for @var{hook}. For example,
145
146 @example
147 (add-hook 'text-mode-hook 'my-text-hook-function)
148 @end example
149
150 @noindent
151 adds @code{my-text-hook-function} to the hook called @code{text-mode-hook}.
152
153 If @var{function} is already present in @var{hook} (comparing using
154 @code{equal}), then @code{add-hook} does not add it a second time.
155
156 If @var{function} has a non-@code{nil} property
157 @code{permanent-local-hook}, then @code{kill-all-local-variables} (or
158 changing major modes) won't delete it from the hook variable's local
159 value.
160
161 It is best to design your hook functions so that the order in which
162 they are executed does not matter. Any dependence on the order is
163 asking for trouble. However, the order is predictable: normally,
164 @var{function} goes at the front of the hook list, so it will be
165 executed first (barring another @code{add-hook} call). If the
166 optional argument @var{append} is non-@code{nil}, the new hook
167 function goes at the end of the hook list and will be executed last.
168
169 @code{add-hook} can handle the cases where @var{hook} is void or its
170 value is a single function; it sets or changes the value to a list of
171 functions.
172
173 If @var{local} is non-@code{nil}, that says to add @var{function} to
174 the buffer-local hook list instead of to the global hook list. If
175 needed, this makes the hook buffer-local and adds @code{t} to the
176 buffer-local value. The latter acts as a flag to run the hook
177 functions in the default value as well as in the local value.
178 @end defun
179
180 @defun remove-hook hook function &optional local
181 This function removes @var{function} from the hook variable
182 @var{hook}. It compares @var{function} with elements of @var{hook}
183 using @code{equal}, so it works for both symbols and lambda
184 expressions.
185
186 If @var{local} is non-@code{nil}, that says to remove @var{function}
187 from the buffer-local hook list instead of from the global hook list.
188 @end defun
189
190 @node Major Modes
191 @section Major Modes
192 @cindex major mode
193
194 Major modes specialize Emacs for editing particular kinds of text.
195 Each buffer has only one major mode at a time. For each major mode
196 there is a function to switch to that mode in the current buffer; its
197 name should end in @samp{-mode}. These functions work by setting
198 buffer-local variable bindings and other data associated with the
199 buffer, such as a local keymap. The effect lasts until you switch
200 to another major mode in the same buffer.
201
202 @menu
203 * Major Mode Basics::
204 * Major Mode Conventions:: Coding conventions for keymaps, etc.
205 * Auto Major Mode:: How Emacs chooses the major mode automatically.
206 * Mode Help:: Finding out how to use a mode.
207 * Derived Modes:: Defining a new major mode based on another major
208 mode.
209 * Generic Modes:: Defining a simple major mode that supports
210 comment syntax and Font Lock mode.
211 * Mode Hooks:: Hooks run at the end of major mode functions.
212 * Example Major Modes:: Text mode and Lisp modes.
213 @end menu
214
215 @node Major Mode Basics
216 @subsection Major Mode Basics
217 @cindex Fundamental mode
218
219 The least specialized major mode is called @dfn{Fundamental mode}.
220 This mode has no mode-specific definitions or variable settings, so each
221 Emacs command behaves in its default manner, and each option is in its
222 default state. All other major modes redefine various keys and options.
223 For example, Lisp Interaction mode provides special key bindings for
224 @kbd{C-j} (@code{eval-print-last-sexp}), @key{TAB}
225 (@code{lisp-indent-line}), and other keys.
226
227 When you need to write several editing commands to help you perform a
228 specialized editing task, creating a new major mode is usually a good
229 idea. In practice, writing a major mode is easy (in contrast to
230 writing a minor mode, which is often difficult).
231
232 If the new mode is similar to an old one, it is often unwise to
233 modify the old one to serve two purposes, since it may become harder
234 to use and maintain. Instead, copy and rename an existing major mode
235 definition and alter the copy---or use the @code{define-derived-mode}
236 macro to define a @dfn{derived mode} (@pxref{Derived Modes}). For
237 example, Rmail Edit mode is a major mode that is very similar to Text
238 mode except that it provides two additional commands. Its definition
239 is distinct from that of Text mode, but uses that of Text mode.
240
241 Even if the new mode is not an obvious derivative of any other mode,
242 we recommend to use @code{define-derived-mode}, since it automatically
243 enforces the most important coding conventions for you.
244
245 For a very simple programming language major mode that handles
246 comments and fontification, you can use @code{define-generic-mode}.
247 @xref{Generic Modes}.
248
249 Rmail Edit mode offers an example of changing the major mode
250 temporarily for a buffer, so it can be edited in a different way (with
251 ordinary Emacs commands rather than Rmail commands). In such cases, the
252 temporary major mode usually provides a command to switch back to the
253 buffer's usual mode (Rmail mode, in this case). You might be tempted to
254 present the temporary redefinitions inside a recursive edit and restore
255 the usual ones when the user exits; but this is a bad idea because it
256 constrains the user's options when it is done in more than one buffer:
257 recursive edits must be exited most-recently-entered first. Using an
258 alternative major mode avoids this limitation. @xref{Recursive
259 Editing}.
260
261 The standard GNU Emacs Lisp library directory tree contains the code
262 for several major modes, in files such as @file{text-mode.el},
263 @file{texinfo.el}, @file{lisp-mode.el}, @file{c-mode.el}, and
264 @file{rmail.el}. They are found in various subdirectories of the
265 @file{lisp} directory. You can study these libraries to see how modes
266 are written. Text mode is perhaps the simplest major mode aside from
267 Fundamental mode. Rmail mode is a complicated and specialized mode.
268
269 @node Major Mode Conventions
270 @subsection Major Mode Conventions
271 @cindex major mode conventions
272 @cindex conventions for writing major modes
273
274 The code for existing major modes follows various coding conventions,
275 including conventions for local keymap and syntax table initialization,
276 global names, and hooks. Please follow these conventions when you
277 define a new major mode. (Fundamental mode is an exception to many
278 of these conventions, because its definition is to present the global
279 state of Emacs.)
280
281 This list of conventions is only partial, because each major mode
282 should aim for consistency in general with other Emacs major modes.
283 This makes Emacs as a whole more coherent. It is impossible to list
284 here all the possible points where this issue might come up; if the
285 Emacs developers point out an area where your major mode deviates from
286 the usual conventions, please make it compatible.
287
288 @itemize @bullet
289 @item
290 Define a command whose name ends in @samp{-mode}, with no arguments,
291 that switches to the new mode in the current buffer. This command
292 should set up the keymap, syntax table, and buffer-local variables in an
293 existing buffer, without changing the buffer's contents.
294
295 @item
296 Write a documentation string for this command that describes the
297 special commands available in this mode. @kbd{C-h m}
298 (@code{describe-mode}) in your mode will display this string.
299
300 The documentation string may include the special documentation
301 substrings, @samp{\[@var{command}]}, @samp{\@{@var{keymap}@}}, and
302 @samp{\<@var{keymap}>}, which enable the documentation to adapt
303 automatically to the user's own key bindings. @xref{Keys in
304 Documentation}.
305
306 @item
307 The major mode command should start by calling
308 @code{kill-all-local-variables}. This runs the normal hook
309 @code{change-major-mode-hook}, then gets rid of the buffer-local
310 variables of the major mode previously in effect. @xref{Creating
311 Buffer-Local}.
312
313 @item
314 The major mode command should set the variable @code{major-mode} to the
315 major mode command symbol. This is how @code{describe-mode} discovers
316 which documentation to print.
317
318 @item
319 The major mode command should set the variable @code{mode-name} to the
320 ``pretty'' name of the mode, usually a string (but see @ref{Mode Line
321 Data}, for other possible forms). The name of the mode appears
322 in the mode line.
323
324 @item
325 @cindex functions in modes
326 Since all global names are in the same name space, all the global
327 variables, constants, and functions that are part of the mode should
328 have names that start with the major mode name (or with an abbreviation
329 of it if the name is long). @xref{Coding Conventions}.
330
331 @item
332 In a major mode for editing some kind of structured text, such as a
333 programming language, indentation of text according to structure is
334 probably useful. So the mode should set @code{indent-line-function}
335 to a suitable function, and probably customize other variables
336 for indentation. @xref{Auto-Indentation}.
337
338 @item
339 @cindex keymaps in modes
340 The major mode should usually have its own keymap, which is used as the
341 local keymap in all buffers in that mode. The major mode command should
342 call @code{use-local-map} to install this local map. @xref{Active
343 Keymaps}, for more information.
344
345 This keymap should be stored permanently in a global variable named
346 @code{@var{modename}-mode-map}. Normally the library that defines the
347 mode sets this variable.
348
349 @xref{Tips for Defining}, for advice about how to write the code to set
350 up the mode's keymap variable.
351
352 @item
353 The key sequences bound in a major mode keymap should usually start with
354 @kbd{C-c}, followed by a control character, a digit, or @kbd{@{},
355 @kbd{@}}, @kbd{<}, @kbd{>}, @kbd{:} or @kbd{;}. The other punctuation
356 characters are reserved for minor modes, and ordinary letters are
357 reserved for users.
358
359 A major mode can also rebind the keys @kbd{M-n}, @kbd{M-p} and
360 @kbd{M-s}. The bindings for @kbd{M-n} and @kbd{M-p} should normally
361 be some kind of ``moving forward and backward,'' but this does not
362 necessarily mean cursor motion.
363
364 It is legitimate for a major mode to rebind a standard key sequence if
365 it provides a command that does ``the same job'' in a way better
366 suited to the text this mode is used for. For example, a major mode
367 for editing a programming language might redefine @kbd{C-M-a} to
368 ``move to the beginning of a function'' in a way that works better for
369 that language.
370
371 It is also legitimate for a major mode to rebind a standard key
372 sequence whose standard meaning is rarely useful in that mode. For
373 instance, minibuffer modes rebind @kbd{M-r}, whose standard meaning is
374 rarely of any use in the minibuffer. Major modes such as Dired or
375 Rmail that do not allow self-insertion of text can reasonably redefine
376 letters and other printing characters as special commands.
377
378 @item
379 Major modes for editing text should not define @key{RET} to do
380 anything other than insert a newline. However, it is ok for
381 specialized modes for text that users don't directly edit, such as
382 Dired and Info modes, to redefine @key{RET} to do something entirely
383 different.
384
385 @item
386 Major modes should not alter options that are primarily a matter of user
387 preference, such as whether Auto-Fill mode is enabled. Leave this to
388 each user to decide. However, a major mode should customize other
389 variables so that Auto-Fill mode will work usefully @emph{if} the user
390 decides to use it.
391
392 @item
393 @cindex syntax tables in modes
394 The mode may have its own syntax table or may share one with other
395 related modes. If it has its own syntax table, it should store this in
396 a variable named @code{@var{modename}-mode-syntax-table}. @xref{Syntax
397 Tables}.
398
399 @item
400 If the mode handles a language that has a syntax for comments, it should
401 set the variables that define the comment syntax. @xref{Options for
402 Comments,, Options Controlling Comments, emacs, The GNU Emacs Manual}.
403
404 @item
405 @cindex abbrev tables in modes
406 The mode may have its own abbrev table or may share one with other
407 related modes. If it has its own abbrev table, it should store this
408 in a variable named @code{@var{modename}-mode-abbrev-table}. If the
409 major mode command defines any abbrevs itself, it should pass @code{t}
410 for the @var{system-flag} argument to @code{define-abbrev}.
411 @xref{Defining Abbrevs}.
412
413 @item
414 The mode should specify how to do highlighting for Font Lock mode, by
415 setting up a buffer-local value for the variable
416 @code{font-lock-defaults} (@pxref{Font Lock Mode}).
417
418 @item
419 The mode should specify how Imenu should find the definitions or
420 sections of a buffer, by setting up a buffer-local value for the
421 variable @code{imenu-generic-expression}, for the two variables
422 @code{imenu-prev-index-position-function} and
423 @code{imenu-extract-index-name-function}, or for the variable
424 @code{imenu-create-index-function} (@pxref{Imenu}).
425
426 @item
427 The mode can specify a local value for
428 @code{eldoc-documentation-function} to tell ElDoc mode how to handle
429 this mode.
430
431 @item
432 The mode can specify how to complete various keywords by adding
433 to the special hook @code{completion-at-point-functions}.
434
435 @item
436 Use @code{defvar} or @code{defcustom} to set mode-related variables, so
437 that they are not reinitialized if they already have a value. (Such
438 reinitialization could discard customizations made by the user.)
439
440 @item
441 @cindex buffer-local variables in modes
442 To make a buffer-local binding for an Emacs customization variable, use
443 @code{make-local-variable} in the major mode command, not
444 @code{make-variable-buffer-local}. The latter function would make the
445 variable local to every buffer in which it is subsequently set, which
446 would affect buffers that do not use this mode. It is undesirable for a
447 mode to have such global effects. @xref{Buffer-Local Variables}.
448
449 With rare exceptions, the only reasonable way to use
450 @code{make-variable-buffer-local} in a Lisp package is for a variable
451 which is used only within that package. Using it on a variable used by
452 other packages would interfere with them.
453
454 @item
455 @cindex mode hook
456 @cindex major mode hook
457 Each major mode should have a normal @dfn{mode hook} named
458 @code{@var{modename}-mode-hook}. The very last thing the major mode command
459 should do is to call @code{run-mode-hooks}. This runs the mode hook,
460 and then runs the normal hook @code{after-change-major-mode-hook}.
461 @xref{Mode Hooks}.
462
463 @item
464 The major mode command may start by calling some other major mode
465 command (called the @dfn{parent mode}) and then alter some of its
466 settings. A mode that does this is called a @dfn{derived mode}. The
467 recommended way to define one is to use the @code{define-derived-mode}
468 macro, but this is not required. Such a mode should call the parent
469 mode command inside a @code{delay-mode-hooks} form. (Using
470 @code{define-derived-mode} does this automatically.) @xref{Derived
471 Modes}, and @ref{Mode Hooks}.
472
473 @item
474 If something special should be done if the user switches a buffer from
475 this mode to any other major mode, this mode can set up a buffer-local
476 value for @code{change-major-mode-hook} (@pxref{Creating Buffer-Local}).
477
478 @item
479 If this mode is appropriate only for specially-prepared text, then the
480 major mode command symbol should have a property named @code{mode-class}
481 with value @code{special}, put on as follows:
482
483 @kindex mode-class @r{(property)}
484 @cindex @code{special}
485 @example
486 (put 'funny-mode 'mode-class 'special)
487 @end example
488
489 @noindent
490 This tells Emacs that new buffers created while the current buffer is
491 in Funny mode should not inherit Funny mode, in case the default value
492 of @code{major-mode} is @code{nil}. Modes such as Dired, Rmail,
493 and Buffer List use this feature.
494
495 The @code{define-derived-mode} macro automatically marks the derived
496 mode as special if the parent mode is special. The special mode
497 @code{special-mode} provides a convenient parent for other special
498 modes to inherit from; it sets @code{buffer-read-only} to @code{t},
499 and does little else.
500
501 @item
502 If you want to make the new mode the default for files with certain
503 recognizable names, add an element to @code{auto-mode-alist} to select
504 the mode for those file names (@pxref{Auto Major Mode}). If you
505 define the mode command to autoload, you should add this element in
506 the same file that calls @code{autoload}. If you use an autoload
507 cookie for the mode command, you can also use an autoload cookie for
508 the form that adds the element (@pxref{autoload cookie}). If you do
509 not autoload the mode command, it is sufficient to add the element in
510 the file that contains the mode definition.
511
512 @item
513 In the comments that document the file, you should provide a sample
514 @code{autoload} form and an example of how to add to
515 @code{auto-mode-alist}, that users can include in their init files
516 (@pxref{Init File}).
517
518 @item
519 @cindex mode loading
520 The top-level forms in the file defining the mode should be written so
521 that they may be evaluated more than once without adverse consequences.
522 Even if you never load the file more than once, someone else will.
523 @end itemize
524
525 @node Auto Major Mode
526 @subsection How Emacs Chooses a Major Mode
527 @cindex major mode, automatic selection
528
529 Based on information in the file name or in the file itself, Emacs
530 automatically selects a major mode for the new buffer when a file is
531 visited. It also processes local variables specified in the file text.
532
533 @deffn Command fundamental-mode
534 Fundamental mode is a major mode that is not specialized for anything
535 in particular. Other major modes are defined in effect by comparison
536 with this one---their definitions say what to change, starting from
537 Fundamental mode. The @code{fundamental-mode} function does @emph{not}
538 run any mode hooks; you're not supposed to customize it. (If you want Emacs
539 to behave differently in Fundamental mode, change the @emph{global}
540 state of Emacs.)
541 @end deffn
542
543 @deffn Command normal-mode &optional find-file
544 This function establishes the proper major mode and buffer-local variable
545 bindings for the current buffer. First it calls @code{set-auto-mode}
546 (see below), then it runs @code{hack-local-variables} to parse, and
547 bind or evaluate as appropriate, the file's local variables
548 (@pxref{File Local Variables}).
549
550 If the @var{find-file} argument to @code{normal-mode} is non-@code{nil},
551 @code{normal-mode} assumes that the @code{find-file} function is calling
552 it. In this case, it may process local variables in the @samp{-*-}
553 line or at the end of the file. The variable
554 @code{enable-local-variables} controls whether to do so. @xref{File
555 Variables, , Local Variables in Files, emacs, The GNU Emacs Manual},
556 for the syntax of the local variables section of a file.
557
558 If you run @code{normal-mode} interactively, the argument
559 @var{find-file} is normally @code{nil}. In this case,
560 @code{normal-mode} unconditionally processes any file local variables.
561
562 If @code{normal-mode} processes the local variables list and this list
563 specifies a major mode, that mode overrides any mode chosen by
564 @code{set-auto-mode}. If neither @code{set-auto-mode} nor
565 @code{hack-local-variables} specify a major mode, the buffer stays in
566 the major mode determined by the default value of @code{major-mode}
567 (see below).
568
569 @cindex file mode specification error
570 @code{normal-mode} uses @code{condition-case} around the call to the
571 major mode function, so errors are caught and reported as a @samp{File
572 mode specification error}, followed by the original error message.
573 @end deffn
574
575 @defun set-auto-mode &optional keep-mode-if-same
576 @cindex visited file mode
577 This function selects the major mode that is appropriate for the
578 current buffer. It bases its decision (in order of precedence) on
579 the @w{@samp{-*-}} line, on the @w{@samp{#!}} line (using
580 @code{interpreter-mode-alist}), on the text at the beginning of the
581 buffer (using @code{magic-mode-alist}), and finally on the visited
582 file name (using @code{auto-mode-alist}). @xref{Choosing Modes, , How
583 Major Modes are Chosen, emacs, The GNU Emacs Manual}. However, this
584 function does not look for the @samp{mode:} local variable near the
585 end of a file; the @code{hack-local-variables} function does that.
586 If @code{enable-local-variables} is @code{nil}, @code{set-auto-mode}
587 does not check the @w{@samp{-*-}} line for a mode tag either.
588
589 If @var{keep-mode-if-same} is non-@code{nil}, this function does not
590 call the mode command if the buffer is already in the proper major
591 mode. For instance, @code{set-visited-file-name} sets this to
592 @code{t} to avoid killing buffer local variables that the user may
593 have set.
594 @end defun
595
596 @defopt major-mode
597 The buffer-local value of this variable holds the major mode
598 currently active. The default value of this variable holds the
599 default major mode for new buffers. The standard default value is
600 @code{fundamental-mode}.
601
602 If the default value of @code{major-mode} is @code{nil}, Emacs uses
603 the (previously) current buffer's major mode as the default major mode
604 of a new buffer. However, if that major mode symbol has a @code{mode-class}
605 property with value @code{special}, then it is not used for new buffers;
606 Fundamental mode is used instead. The modes that have this property are
607 those such as Dired and Rmail that are useful only with text that has
608 been specially prepared.
609 @end defopt
610
611 @defun set-buffer-major-mode buffer
612 This function sets the major mode of @var{buffer} to the default value of
613 @code{major-mode}; if that is @code{nil}, it uses the
614 current buffer's major mode (if that is suitable). As an exception,
615 if @var{buffer}'s name is @samp{*scratch*}, it sets the mode to
616 @code{initial-major-mode}.
617
618 The low-level primitives for creating buffers do not use this function,
619 but medium-level commands such as @code{switch-to-buffer} and
620 @code{find-file-noselect} use it whenever they create buffers.
621 @end defun
622
623 @defopt initial-major-mode
624 @cindex @samp{*scratch*}
625 The value of this variable determines the major mode of the initial
626 @samp{*scratch*} buffer. The value should be a symbol that is a major
627 mode command. The default value is @code{lisp-interaction-mode}.
628 @end defopt
629
630 @defvar interpreter-mode-alist
631 This variable specifies major modes to use for scripts that specify a
632 command interpreter in a @samp{#!} line. Its value is an alist with
633 elements of the form @code{(@var{interpreter} . @var{mode})}; for
634 example, @code{("perl" . perl-mode)} is one element present by
635 default. The element says to use mode @var{mode} if the file
636 specifies an interpreter which matches @var{interpreter}.
637 @end defvar
638
639 @defvar magic-mode-alist
640 This variable's value is an alist with elements of the form
641 @code{(@var{regexp} . @var{function})}, where @var{regexp} is a
642 regular expression and @var{function} is a function or @code{nil}.
643 After visiting a file, @code{set-auto-mode} calls @var{function} if
644 the text at the beginning of the buffer matches @var{regexp} and
645 @var{function} is non-@code{nil}; if @var{function} is @code{nil},
646 @code{auto-mode-alist} gets to decide the mode.
647 @end defvar
648
649 @defvar magic-fallback-mode-alist
650 This works like @code{magic-mode-alist}, except that it is handled
651 only if @code{auto-mode-alist} does not specify a mode for this file.
652 @end defvar
653
654 @defvar auto-mode-alist
655 This variable contains an association list of file name patterns
656 (regular expressions) and corresponding major mode commands. Usually,
657 the file name patterns test for suffixes, such as @samp{.el} and
658 @samp{.c}, but this need not be the case. An ordinary element of the
659 alist looks like @code{(@var{regexp} . @var{mode-function})}.
660
661 For example,
662
663 @smallexample
664 @group
665 (("\\`/tmp/fol/" . text-mode)
666 ("\\.texinfo\\'" . texinfo-mode)
667 ("\\.texi\\'" . texinfo-mode)
668 @end group
669 @group
670 ("\\.el\\'" . emacs-lisp-mode)
671 ("\\.c\\'" . c-mode)
672 ("\\.h\\'" . c-mode)
673 @dots{})
674 @end group
675 @end smallexample
676
677 When you visit a file whose expanded file name (@pxref{File Name
678 Expansion}), with version numbers and backup suffixes removed using
679 @code{file-name-sans-versions} (@pxref{File Name Components}), matches
680 a @var{regexp}, @code{set-auto-mode} calls the corresponding
681 @var{mode-function}. This feature enables Emacs to select the proper
682 major mode for most files.
683
684 If an element of @code{auto-mode-alist} has the form @code{(@var{regexp}
685 @var{function} t)}, then after calling @var{function}, Emacs searches
686 @code{auto-mode-alist} again for a match against the portion of the file
687 name that did not match before. This feature is useful for
688 uncompression packages: an entry of the form @code{("\\.gz\\'"
689 @var{function} t)} can uncompress the file and then put the uncompressed
690 file in the proper mode according to the name sans @samp{.gz}.
691
692 Here is an example of how to prepend several pattern pairs to
693 @code{auto-mode-alist}. (You might use this sort of expression in your
694 init file.)
695
696 @smallexample
697 @group
698 (setq auto-mode-alist
699 (append
700 ;; @r{File name (within directory) starts with a dot.}
701 '(("/\\.[^/]*\\'" . fundamental-mode)
702 ;; @r{File name has no dot.}
703 ("/[^\\./]*\\'" . fundamental-mode)
704 ;; @r{File name ends in @samp{.C}.}
705 ("\\.C\\'" . c++-mode))
706 auto-mode-alist))
707 @end group
708 @end smallexample
709 @end defvar
710
711 @node Mode Help
712 @subsection Getting Help about a Major Mode
713 @cindex mode help
714 @cindex help for major mode
715 @cindex documentation for major mode
716
717 The @code{describe-mode} function is used to provide information
718 about major modes. It is normally called with @kbd{C-h m}. The
719 @code{describe-mode} function uses the value of @code{major-mode},
720 which is why every major mode function needs to set the
721 @code{major-mode} variable.
722
723 @deffn Command describe-mode
724 This function displays the documentation of the current major mode.
725
726 The @code{describe-mode} function calls the @code{documentation}
727 function using the value of @code{major-mode} as an argument. Thus, it
728 displays the documentation string of the major mode function.
729 (@xref{Accessing Documentation}.)
730 @end deffn
731
732 @defvar major-mode
733 This buffer-local variable holds the symbol for the current buffer's
734 major mode. This symbol should have a function definition that is the
735 command to switch to that major mode. The @code{describe-mode}
736 function uses the documentation string of the function as the
737 documentation of the major mode.
738 @end defvar
739
740 @node Derived Modes
741 @subsection Defining Derived Modes
742 @cindex derived mode
743
744 The recommended way to define a new major mode is to derive it
745 from an existing one using @code{define-derived-mode}. If there is no
746 closely related mode, you can inherit from @code{text-mode},
747 @code{special-mode}, or in the worst case @code{fundamental-mode}.
748
749 @defmac define-derived-mode variant parent name docstring keyword-args@dots{} body@dots{}
750 This macro defines @var{variant} as a major mode command, using
751 @var{name} as the string form of the mode name. @var{variant} and
752 @var{parent} should be unquoted symbols.
753
754 The new command @var{variant} is defined to call the function
755 @var{parent}, then override certain aspects of that parent mode:
756
757 @itemize @bullet
758 @item
759 The new mode has its own sparse keymap, named
760 @code{@var{variant}-map}. @code{define-derived-mode}
761 makes the parent mode's keymap the parent of the new map, unless
762 @code{@var{variant}-map} is already set and already has a parent.
763
764 @item
765 The new mode has its own syntax table, kept in the variable
766 @code{@var{variant}-syntax-table}, unless you override this using the
767 @code{:syntax-table} keyword (see below). @code{define-derived-mode}
768 makes the parent mode's syntax-table the parent of
769 @code{@var{variant}-syntax-table}, unless the latter is already set
770 and already has a parent different from the standard syntax table.
771
772 @item
773 The new mode has its own abbrev table, kept in the variable
774 @code{@var{variant}-abbrev-table}, unless you override this using the
775 @code{:abbrev-table} keyword (see below).
776
777 @item
778 The new mode has its own mode hook, @code{@var{variant}-hook}. It
779 runs this hook, after running the hooks of its ancestor modes, with
780 @code{run-mode-hooks}, as the last thing it does. @xref{Mode Hooks}.
781 @end itemize
782
783 In addition, you can specify how to override other aspects of
784 @var{parent} with @var{body}. The command @var{variant}
785 evaluates the forms in @var{body} after setting up all its usual
786 overrides, just before running the mode hooks.
787
788 If @var{parent} has a non-@code{nil} @code{mode-class} symbol
789 property, then @code{define-derived-mode} sets the @code{mode-class}
790 property of @var{variant} to the same value. This ensures, for
791 example, that if @var{parent} is a special mode, then @var{variant} is
792 also a special mode (@pxref{Major Mode Conventions}).
793
794 You can also specify @code{nil} for @var{parent}. This gives the new
795 mode no parent. Then @code{define-derived-mode} behaves as described
796 above, but, of course, omits all actions connected with @var{parent}.
797
798 The argument @var{docstring} specifies the documentation string for
799 the new mode. @code{define-derived-mode} adds some general
800 information about the mode's hook, followed by the mode's keymap, at
801 the end of this docstring. If you omit @var{docstring},
802 @code{define-derived-mode} generates a documentation string.
803
804 The @var{keyword-args} are pairs of keywords and values. The values
805 are evaluated. The following keywords are currently supported:
806
807 @table @code
808 @item :syntax-table
809 You can use this to explicitly specify a syntax table for the new
810 mode. If you specify a @code{nil} value, the new mode uses the same
811 syntax table as @var{parent}, or the standard syntax table if
812 @var{parent} is @code{nil}. (Note that this does @emph{not} follow
813 the convention used for non-keyword arguments that a @code{nil} value
814 is equivalent with not specifying the argument.)
815
816 @item :abbrev-table
817 You can use this to explicitly specify an abbrev table for the new
818 mode. If you specify a @code{nil} value, the new mode uses the same
819 abbrev table as @var{parent}, or @code{fundamental-mode-abbrev-table}
820 if @var{parent} is @code{nil}. (Again, a @code{nil} value is
821 @emph{not} equivalent to not specifying this keyword.)
822
823 @item :group
824 If this is specified, the value should be the customization group for
825 this mode. (Not all major modes have one.) Only the (still
826 experimental and unadvertised) command @code{customize-mode} currently
827 uses this. @code{define-derived-mode} does @emph{not} automatically
828 define the specified customization group.
829 @end table
830
831 Here is a hypothetical example:
832
833 @example
834 (define-derived-mode hypertext-mode
835 text-mode "Hypertext"
836 "Major mode for hypertext.
837 \\@{hypertext-mode-map@}"
838 (setq case-fold-search nil))
839
840 (define-key hypertext-mode-map
841 [down-mouse-3] 'do-hyper-link)
842 @end example
843
844 Do not write an @code{interactive} spec in the definition;
845 @code{define-derived-mode} does that automatically.
846 @end defmac
847
848 @node Generic Modes
849 @subsection Generic Modes
850 @cindex generic mode
851
852 @dfn{Generic modes} are simple major modes with basic support for
853 comment syntax and Font Lock mode. To define a generic mode, use the
854 macro @code{define-generic-mode}. See the file @file{generic-x.el}
855 for some examples of the use of @code{define-generic-mode}.
856
857 @defmac define-generic-mode mode comment-list keyword-list font-lock-list auto-mode-list function-list &optional docstring
858 This macro defines a generic mode command named @var{mode} (a symbol,
859 not quoted). The optional argument @var{docstring} is the
860 documentation for the mode command. If you do not supply it,
861 @code{define-generic-mode} generates one by default.
862
863 The argument @var{comment-list} is a list in which each element is
864 either a character, a string of one or two characters, or a cons cell.
865 A character or a string is set up in the mode's syntax table as a
866 ``comment starter.'' If the entry is a cons cell, the @sc{car} is set
867 up as a ``comment starter'' and the @sc{cdr} as a ``comment ender.''
868 (Use @code{nil} for the latter if you want comments to end at the end
869 of the line.) Note that the syntax table mechanism has limitations
870 about what comment starters and enders are actually possible.
871 @xref{Syntax Tables}.
872
873 The argument @var{keyword-list} is a list of keywords to highlight
874 with @code{font-lock-keyword-face}. Each keyword should be a string.
875 Meanwhile, @var{font-lock-list} is a list of additional expressions to
876 highlight. Each element of this list should have the same form as an
877 element of @code{font-lock-keywords}. @xref{Search-based
878 Fontification}.
879
880 The argument @var{auto-mode-list} is a list of regular expressions to
881 add to the variable @code{auto-mode-alist}. They are added by the execution
882 of the @code{define-generic-mode} form, not by expanding the macro call.
883
884 Finally, @var{function-list} is a list of functions for the mode
885 command to call for additional setup. It calls these functions just
886 before it runs the mode hook variable @code{@var{mode}-hook}.
887 @end defmac
888
889 @node Mode Hooks
890 @subsection Mode Hooks
891
892 Every major mode function should finish by running its mode hook and
893 the mode-independent normal hook @code{after-change-major-mode-hook}.
894 It does this by calling @code{run-mode-hooks}. If the major mode is a
895 derived mode, that is if it calls another major mode (the parent mode)
896 in its body, it should do this inside @code{delay-mode-hooks} so that
897 the parent won't run these hooks itself. Instead, the derived mode's
898 call to @code{run-mode-hooks} runs the parent's mode hook too.
899 @xref{Major Mode Conventions}.
900
901 Emacs versions before Emacs 22 did not have @code{delay-mode-hooks}.
902 When user-implemented major modes have not been updated to use it,
903 they won't entirely follow these conventions: they may run the
904 parent's mode hook too early, or fail to run
905 @code{after-change-major-mode-hook}. If you encounter such a major
906 mode, please correct it to follow these conventions.
907
908 When you defined a major mode using @code{define-derived-mode}, it
909 automatically makes sure these conventions are followed. If you
910 define a major mode ``by hand,'' not using @code{define-derived-mode},
911 use the following functions to handle these conventions automatically.
912
913 @defun run-mode-hooks &rest hookvars
914 Major modes should run their mode hook using this function. It is
915 similar to @code{run-hooks} (@pxref{Hooks}), but it also runs
916 @code{after-change-major-mode-hook}.
917
918 When this function is called during the execution of a
919 @code{delay-mode-hooks} form, it does not run the hooks immediately.
920 Instead, it arranges for the next call to @code{run-mode-hooks} to run
921 them.
922 @end defun
923
924 @defmac delay-mode-hooks body@dots{}
925 When one major mode command calls another, it should do so inside of
926 @code{delay-mode-hooks}.
927
928 This macro executes @var{body}, but tells all @code{run-mode-hooks}
929 calls during the execution of @var{body} to delay running their hooks.
930 The hooks will actually run during the next call to
931 @code{run-mode-hooks} after the end of the @code{delay-mode-hooks}
932 construct.
933 @end defmac
934
935 @defvar after-change-major-mode-hook
936 This is a normal hook run by @code{run-mode-hooks}. It is run at the
937 very end of every properly-written major mode function.
938 @end defvar
939
940 @node Example Major Modes
941 @subsection Major Mode Examples
942
943 Text mode is perhaps the simplest mode besides Fundamental mode.
944 Here are excerpts from @file{text-mode.el} that illustrate many of
945 the conventions listed above:
946
947 @smallexample
948 @group
949 ;; @r{Create the syntax table for this mode.}
950 (defvar text-mode-syntax-table
951 (let ((st (make-syntax-table)))
952 (modify-syntax-entry ?\" ". " st)
953 (modify-syntax-entry ?\\ ". " st)
954 ;; Add `p' so M-c on `hello' leads to `Hello', not `hello'.
955 (modify-syntax-entry ?' "w p" st)
956 st)
957 "Syntax table used while in `text-mode'.")
958 @end group
959
960 ;; @r{Create the keymap for this mode.}
961 @group
962 (defvar text-mode-map
963 (let ((map (make-sparse-keymap)))
964 (define-key map "\e\t" 'ispell-complete-word)
965 (define-key map "\es" 'center-line)
966 (define-key map "\eS" 'center-paragraph)
967 map)
968 "Keymap for `text-mode'.
969 Many other modes, such as Mail mode, Outline mode
970 and Indented Text mode, inherit all the commands
971 defined in this map.")
972 @end group
973 @end smallexample
974
975 Here is how the actual mode command is defined now:
976
977 @smallexample
978 @group
979 (define-derived-mode text-mode nil "Text"
980 "Major mode for editing text written for humans to read.
981 In this mode, paragraphs are delimited only by blank or white lines.
982 You can thus get the full benefit of adaptive filling
983 (see the variable `adaptive-fill-mode').
984 \\@{text-mode-map@}
985 Turning on Text mode runs the normal hook `text-mode-hook'."
986 @end group
987 @group
988 (set (make-local-variable 'text-mode-variant) t)
989 ;; @r{These two lines are a feature added recently.}
990 (set (make-local-variable 'require-final-newline)
991 mode-require-final-newline)
992 (set (make-local-variable 'indent-line-function) 'indent-relative))
993 @end group
994 @end smallexample
995
996 @noindent
997 (The last line is redundant nowadays, since @code{indent-relative} is
998 the default value, and we'll delete it in a future version.)
999
1000 Here is how it was defined formerly, before
1001 @code{define-derived-mode} existed:
1002
1003 @smallexample
1004 @group
1005 ;; @r{This isn't needed nowadays, since @code{define-derived-mode} does it.}
1006 (define-abbrev-table 'text-mode-abbrev-table ()
1007 "Abbrev table used while in text mode.")
1008 @end group
1009
1010 @group
1011 (defun text-mode ()
1012 "Major mode for editing text intended for humans to read...
1013 Special commands: \\@{text-mode-map@}
1014 @end group
1015 @group
1016 Turning on text-mode runs the hook `text-mode-hook'."
1017 (interactive)
1018 (kill-all-local-variables)
1019 (use-local-map text-mode-map)
1020 @end group
1021 @group
1022 (setq local-abbrev-table text-mode-abbrev-table)
1023 (set-syntax-table text-mode-syntax-table)
1024 @end group
1025 @group
1026 ;; @r{These four lines are absent from the current version}
1027 ;; @r{not because this is done some other way, but because}
1028 ;; @r{nowadays Text mode uses the normal definition of paragraphs.}
1029 (set (make-local-variable 'paragraph-start)
1030 (concat "[ \t]*$\\|" page-delimiter))
1031 (set (make-local-variable 'paragraph-separate) paragraph-start)
1032 (set (make-local-variable 'indent-line-function) 'indent-relative-maybe)
1033 @end group
1034 @group
1035 (setq mode-name "Text")
1036 (setq major-mode 'text-mode)
1037 (run-mode-hooks 'text-mode-hook)) ; @r{Finally, this permits the user to}
1038 ; @r{customize the mode with a hook.}
1039 @end group
1040 @end smallexample
1041
1042 @cindex @file{lisp-mode.el}
1043 The three Lisp modes (Lisp mode, Emacs Lisp mode, and Lisp
1044 Interaction mode) have more features than Text mode and the code is
1045 correspondingly more complicated. Here are excerpts from
1046 @file{lisp-mode.el} that illustrate how these modes are written.
1047
1048 @cindex syntax table example
1049 @smallexample
1050 @group
1051 ;; @r{Create mode-specific table variables.}
1052 (defvar lisp-mode-syntax-table nil "")
1053 (defvar lisp-mode-abbrev-table nil "")
1054 @end group
1055
1056 @group
1057 (defvar emacs-lisp-mode-syntax-table
1058 (let ((table (make-syntax-table)))
1059 (let ((i 0))
1060 @end group
1061
1062 @group
1063 ;; @r{Set syntax of chars up to @samp{0} to say they are}
1064 ;; @r{part of symbol names but not words.}
1065 ;; @r{(The digit @samp{0} is @code{48} in the @acronym{ASCII} character set.)}
1066 (while (< i ?0)
1067 (modify-syntax-entry i "_ " table)
1068 (setq i (1+ i)))
1069 ;; @r{@dots{} similar code follows for other character ranges.}
1070 @end group
1071 @group
1072 ;; @r{Then set the syntax codes for characters that are special in Lisp.}
1073 (modify-syntax-entry ? " " table)
1074 (modify-syntax-entry ?\t " " table)
1075 (modify-syntax-entry ?\f " " table)
1076 (modify-syntax-entry ?\n "> " table)
1077 @end group
1078 @group
1079 ;; @r{Give CR the same syntax as newline, for selective-display.}
1080 (modify-syntax-entry ?\^m "> " table)
1081 (modify-syntax-entry ?\; "< " table)
1082 (modify-syntax-entry ?` "' " table)
1083 (modify-syntax-entry ?' "' " table)
1084 (modify-syntax-entry ?, "' " table)
1085 @end group
1086 @group
1087 ;; @r{@dots{}likewise for many other characters@dots{}}
1088 (modify-syntax-entry ?\( "() " table)
1089 (modify-syntax-entry ?\) ")( " table)
1090 (modify-syntax-entry ?\[ "(] " table)
1091 (modify-syntax-entry ?\] ")[ " table))
1092 table))
1093 @end group
1094 @group
1095 ;; @r{Create an abbrev table for lisp-mode.}
1096 (define-abbrev-table 'lisp-mode-abbrev-table ())
1097 @end group
1098 @end smallexample
1099
1100 The three modes for Lisp share much of their code. For instance,
1101 each calls the following function to set various variables:
1102
1103 @smallexample
1104 @group
1105 (defun lisp-mode-variables (lisp-syntax)
1106 (when lisp-syntax
1107 (set-syntax-table lisp-mode-syntax-table))
1108 (setq local-abbrev-table lisp-mode-abbrev-table)
1109 @dots{}
1110 @end group
1111 @end smallexample
1112
1113 In Lisp and most programming languages, we want the paragraph
1114 commands to treat only blank lines as paragraph separators. And the
1115 modes should understand the Lisp conventions for comments. The rest of
1116 @code{lisp-mode-variables} sets this up:
1117
1118 @smallexample
1119 @group
1120 (set (make-local-variable 'paragraph-start)
1121 (concat page-delimiter "\\|$" ))
1122 (set (make-local-variable 'paragraph-separate)
1123 paragraph-start)
1124 @dots{}
1125 @end group
1126 @group
1127 (set (make-local-variable 'comment-indent-function)
1128 'lisp-comment-indent))
1129 @dots{}
1130 @end group
1131 @end smallexample
1132
1133 Each of the different Lisp modes has a slightly different keymap. For
1134 example, Lisp mode binds @kbd{C-c C-z} to @code{run-lisp}, but the other
1135 Lisp modes do not. However, all Lisp modes have some commands in
1136 common. The following code sets up the common commands:
1137
1138 @smallexample
1139 @group
1140 (defvar shared-lisp-mode-map
1141 (let ((map (make-sparse-keymap)))
1142 (define-key shared-lisp-mode-map "\e\C-q" 'indent-sexp)
1143 (define-key shared-lisp-mode-map "\177"
1144 'backward-delete-char-untabify)
1145 map)
1146 "Keymap for commands shared by all sorts of Lisp modes.")
1147 @end group
1148 @end smallexample
1149
1150 @noindent
1151 And here is the code to set up the keymap for Lisp mode:
1152
1153 @smallexample
1154 @group
1155 (defvar lisp-mode-map
1156 (let ((map (make-sparse-keymap)))
1157 (set-keymap-parent map shared-lisp-mode-map)
1158 (define-key map "\e\C-x" 'lisp-eval-defun)
1159 (define-key map "\C-c\C-z" 'run-lisp)
1160 map)
1161 "Keymap for ordinary Lisp mode...")
1162 @end group
1163 @end smallexample
1164
1165 Finally, here is the complete major mode function definition for
1166 Lisp mode.
1167
1168 @smallexample
1169 @group
1170 (defun lisp-mode ()
1171 "Major mode for editing Lisp code for Lisps other than GNU Emacs Lisp.
1172 Commands:
1173 Delete converts tabs to spaces as it moves back.
1174 Blank lines separate paragraphs. Semicolons start comments.
1175 \\@{lisp-mode-map@}
1176 Note that `run-lisp' may be used either to start an inferior Lisp job
1177 or to switch back to an existing one.
1178 @end group
1179
1180 @group
1181 Entry to this mode calls the value of `lisp-mode-hook'
1182 if that value is non-nil."
1183 (interactive)
1184 (kill-all-local-variables)
1185 @end group
1186 @group
1187 (use-local-map lisp-mode-map) ; @r{Select the mode's keymap.}
1188 (setq major-mode 'lisp-mode) ; @r{This is how @code{describe-mode}}
1189 ; @r{finds out what to describe.}
1190 (setq mode-name "Lisp") ; @r{This goes into the mode line.}
1191 (lisp-mode-variables t) ; @r{This defines various variables.}
1192 (set (make-local-variable 'comment-start-skip)
1193 "\\(\\(^\\|[^\\\\\n]\\)\\(\\\\\\\\\\)*\\)\\(;+\\|#|\\) *")
1194 (set (make-local-variable 'font-lock-keywords-case-fold-search) t)
1195 @end group
1196 @group
1197 (setq imenu-case-fold-search t)
1198 (set-syntax-table lisp-mode-syntax-table)
1199 (run-mode-hooks 'lisp-mode-hook)) ; @r{This permits the user to use a}
1200 ; @r{hook to customize the mode.}
1201 @end group
1202 @end smallexample
1203
1204 @node Minor Modes
1205 @section Minor Modes
1206 @cindex minor mode
1207
1208 A @dfn{minor mode} provides features that users may enable or disable
1209 independently of the choice of major mode. Minor modes can be enabled
1210 individually or in combination. Minor modes would be better named
1211 ``generally available, optional feature modes,'' except that such a name
1212 would be unwieldy.
1213
1214 A minor mode is not usually meant as a variation of a single major mode.
1215 Usually they are general and can apply to many major modes. For
1216 example, Auto Fill mode works with any major mode that permits text
1217 insertion. To be general, a minor mode must be effectively independent
1218 of the things major modes do.
1219
1220 A minor mode is often much more difficult to implement than a major
1221 mode. One reason is that you should be able to activate and deactivate
1222 minor modes in any order. A minor mode should be able to have its
1223 desired effect regardless of the major mode and regardless of the other
1224 minor modes in effect.
1225
1226 Often the biggest problem in implementing a minor mode is finding a
1227 way to insert the necessary hook into the rest of Emacs. Minor mode
1228 keymaps make this easier than it used to be.
1229
1230 @defvar minor-mode-list
1231 The value of this variable is a list of all minor mode commands.
1232 @end defvar
1233
1234 @menu
1235 * Minor Mode Conventions:: Tips for writing a minor mode.
1236 * Keymaps and Minor Modes:: How a minor mode can have its own keymap.
1237 * Defining Minor Modes:: A convenient facility for defining minor modes.
1238 @end menu
1239
1240 @node Minor Mode Conventions
1241 @subsection Conventions for Writing Minor Modes
1242 @cindex minor mode conventions
1243 @cindex conventions for writing minor modes
1244
1245 There are conventions for writing minor modes just as there are for
1246 major modes. Several of the major mode conventions apply to minor
1247 modes as well: those regarding the name of the mode initialization
1248 function, the names of global symbols, the use of a hook at the end of
1249 the initialization function, and the use of keymaps and other tables.
1250
1251 In addition, there are several conventions that are specific to
1252 minor modes. (The easiest way to follow all the conventions is to use
1253 the macro @code{define-minor-mode}; @ref{Defining Minor Modes}.)
1254
1255 @itemize @bullet
1256 @item
1257 @cindex mode variable
1258 Make a variable whose name ends in @samp{-mode} to control the minor
1259 mode. We call this the @dfn{mode variable}. The minor mode command
1260 should set this variable (@code{nil} to disable; anything else to
1261 enable).
1262
1263 If possible, implement the mode so that setting the variable
1264 automatically enables or disables the mode. Then the minor mode command
1265 does not need to do anything except set the variable.
1266
1267 This variable is used in conjunction with the @code{minor-mode-alist} to
1268 display the minor mode name in the mode line. It can also enable
1269 or disable a minor mode keymap. Individual commands or hooks can also
1270 check the variable's value.
1271
1272 If you want the minor mode to be enabled separately in each buffer,
1273 make the variable buffer-local.
1274
1275 @item
1276 Define a command whose name is the same as the mode variable.
1277 Its job is to enable and disable the mode by setting the variable.
1278
1279 The command should accept one optional argument. If the argument is
1280 @code{nil}, it should toggle the mode (turn it on if it is off, and
1281 off if it is on). It should turn the mode on if the argument is a
1282 positive integer, the symbol @code{t}, or a list whose @sc{car} is one
1283 of those. It should turn the mode off if the argument is a negative
1284 integer or zero, the symbol @code{-}, or a list whose @sc{car} is a
1285 negative integer or zero. The meaning of other arguments is not
1286 specified.
1287
1288 Here is an example taken from the definition of @code{transient-mark-mode}.
1289 It shows the use of @code{transient-mark-mode} as a variable that enables or
1290 disables the mode's behavior, and also shows the proper way to toggle,
1291 enable or disable the minor mode based on the raw prefix argument value.
1292
1293 @smallexample
1294 @group
1295 (setq transient-mark-mode
1296 (if (null arg) (not transient-mark-mode)
1297 (> (prefix-numeric-value arg) 0)))
1298 @end group
1299 @end smallexample
1300
1301 @item
1302 Add an element to @code{minor-mode-alist} for each minor mode
1303 (@pxref{Definition of minor-mode-alist}), if you want to indicate the
1304 minor mode in the mode line. This element should be a list of the
1305 following form:
1306
1307 @smallexample
1308 (@var{mode-variable} @var{string})
1309 @end smallexample
1310
1311 Here @var{mode-variable} is the variable that controls enabling of the
1312 minor mode, and @var{string} is a short string, starting with a space,
1313 to represent the mode in the mode line. These strings must be short so
1314 that there is room for several of them at once.
1315
1316 When you add an element to @code{minor-mode-alist}, use @code{assq} to
1317 check for an existing element, to avoid duplication. For example:
1318
1319 @smallexample
1320 @group
1321 (unless (assq 'leif-mode minor-mode-alist)
1322 (setq minor-mode-alist
1323 (cons '(leif-mode " Leif") minor-mode-alist)))
1324 @end group
1325 @end smallexample
1326
1327 @noindent
1328 or like this, using @code{add-to-list} (@pxref{List Variables}):
1329
1330 @smallexample
1331 @group
1332 (add-to-list 'minor-mode-alist '(leif-mode " Leif"))
1333 @end group
1334 @end smallexample
1335 @end itemize
1336
1337 Global minor modes distributed with Emacs should if possible support
1338 enabling and disabling via Custom (@pxref{Customization}). To do this,
1339 the first step is to define the mode variable with @code{defcustom}, and
1340 specify @code{:type 'boolean}.
1341
1342 If just setting the variable is not sufficient to enable the mode, you
1343 should also specify a @code{:set} method which enables the mode by
1344 invoking the mode command. Note in the variable's documentation string that
1345 setting the variable other than via Custom may not take effect.
1346
1347 Also mark the definition with an autoload cookie (@pxref{autoload cookie}),
1348 and specify a @code{:require} so that customizing the variable will load
1349 the library that defines the mode. This will copy suitable definitions
1350 into @file{loaddefs.el} so that users can use @code{customize-option} to
1351 enable the mode. For example:
1352
1353 @smallexample
1354 @group
1355
1356 ;;;###autoload
1357 (defcustom msb-mode nil
1358 "Toggle msb-mode.
1359 Setting this variable directly does not take effect;
1360 use either \\[customize] or the function `msb-mode'."
1361 :set 'custom-set-minor-mode
1362 :initialize 'custom-initialize-default
1363 :version "20.4"
1364 :type 'boolean
1365 :group 'msb
1366 :require 'msb)
1367 @end group
1368 @end smallexample
1369
1370 @node Keymaps and Minor Modes
1371 @subsection Keymaps and Minor Modes
1372
1373 Each minor mode can have its own keymap, which is active when the mode
1374 is enabled. To set up a keymap for a minor mode, add an element to the
1375 alist @code{minor-mode-map-alist}. @xref{Definition of minor-mode-map-alist}.
1376
1377 @cindex @code{self-insert-command}, minor modes
1378 One use of minor mode keymaps is to modify the behavior of certain
1379 self-inserting characters so that they do something else as well as
1380 self-insert. In general, this is the only way to do that, since the
1381 facilities for customizing @code{self-insert-command} are limited to
1382 special cases (designed for abbrevs and Auto Fill mode). (Do not try
1383 substituting your own definition of @code{self-insert-command} for the
1384 standard one. The editor command loop handles this function specially.)
1385
1386 The key sequences bound in a minor mode should consist of @kbd{C-c}
1387 followed by one of @kbd{.,/?`'"[]\|~!#$%^&*()-_+=}. (The other
1388 punctuation characters are reserved for major modes.)
1389
1390 @node Defining Minor Modes
1391 @subsection Defining Minor Modes
1392
1393 The macro @code{define-minor-mode} offers a convenient way of
1394 implementing a mode in one self-contained definition.
1395
1396 @defmac define-minor-mode mode doc [init-value [lighter [keymap]]] keyword-args@dots{} body@dots{}
1397 This macro defines a new minor mode whose name is @var{mode} (a
1398 symbol). It defines a command named @var{mode} to toggle the minor
1399 mode, with @var{doc} as its documentation string. It also defines a
1400 variable named @var{mode}, which is set to @code{t} or @code{nil} by
1401 enabling or disabling the mode. The variable is initialized to
1402 @var{init-value}. Except in unusual circumstances (see below), this
1403 value must be @code{nil}.
1404
1405 The string @var{lighter} says what to display in the mode line
1406 when the mode is enabled; if it is @code{nil}, the mode is not displayed
1407 in the mode line.
1408
1409 The optional argument @var{keymap} specifies the keymap for the minor
1410 mode. If non-@code{nil}, it should be a variable name (whose value is
1411 a keymap), a keymap, or an alist of the form
1412
1413 @example
1414 (@var{key-sequence} . @var{definition})
1415 @end example
1416
1417 @noindent
1418 where each @var{key-sequence} and @var{definition} are arguments
1419 suitable for passing to @code{define-key} (@pxref{Changing Key
1420 Bindings}). If @var{keymap} is a keymap or an alist, this also
1421 defines the variable @code{@var{mode}-map}.
1422
1423 The above three arguments @var{init-value}, @var{lighter}, and
1424 @var{keymap} can be (partially) omitted when @var{keyword-args} are
1425 used. The @var{keyword-args} consist of keywords followed by
1426 corresponding values. A few keywords have special meanings:
1427
1428 @table @code
1429 @item :group @var{group}
1430 Custom group name to use in all generated @code{defcustom} forms.
1431 Defaults to @var{mode} without the possible trailing @samp{-mode}.
1432 @strong{Warning:} don't use this default group name unless you have
1433 written a @code{defgroup} to define that group properly. @xref{Group
1434 Definitions}.
1435
1436 @item :global @var{global}
1437 If non-@code{nil}, this specifies that the minor mode should be global
1438 rather than buffer-local. It defaults to @code{nil}.
1439
1440 One of the effects of making a minor mode global is that the
1441 @var{mode} variable becomes a customization variable. Toggling it
1442 through the Custom interface turns the mode on and off, and its value
1443 can be saved for future Emacs sessions (@pxref{Saving
1444 Customizations,,, emacs, The GNU Emacs Manual}. For the saved
1445 variable to work, you should ensure that the @code{define-minor-mode}
1446 form is evaluated each time Emacs starts; for packages that are not
1447 part of Emacs, the easiest way to do this is to specify a
1448 @code{:require} keyword.
1449
1450 @item :init-value @var{init-value}
1451 This is equivalent to specifying @var{init-value} positionally.
1452
1453 @item :lighter @var{lighter}
1454 This is equivalent to specifying @var{lighter} positionally.
1455
1456 @item :keymap @var{keymap}
1457 This is equivalent to specifying @var{keymap} positionally.
1458 @end table
1459
1460 Any other keyword arguments are passed directly to the
1461 @code{defcustom} generated for the variable @var{mode}.
1462
1463 The command named @var{mode} first performs the standard actions such
1464 as setting the variable named @var{mode} and then executes the
1465 @var{body} forms, if any. It finishes by running the mode hook
1466 variable @code{@var{mode}-hook}.
1467 @end defmac
1468
1469 The initial value must be @code{nil} except in cases where (1) the
1470 mode is preloaded in Emacs, or (2) it is painless for loading to
1471 enable the mode even though the user did not request it. For
1472 instance, if the mode has no effect unless something else is enabled,
1473 and will always be loaded by that time, enabling it by default is
1474 harmless. But these are unusual circumstances. Normally, the
1475 initial value must be @code{nil}.
1476
1477 @findex easy-mmode-define-minor-mode
1478 The name @code{easy-mmode-define-minor-mode} is an alias
1479 for this macro.
1480
1481 Here is an example of using @code{define-minor-mode}:
1482
1483 @smallexample
1484 (define-minor-mode hungry-mode
1485 "Toggle Hungry mode.
1486 With no argument, this command toggles the mode.
1487 Non-null prefix argument turns on the mode.
1488 Null prefix argument turns off the mode.
1489
1490 When Hungry mode is enabled, the control delete key
1491 gobbles all preceding whitespace except the last.
1492 See the command \\[hungry-electric-delete]."
1493 ;; The initial value.
1494 nil
1495 ;; The indicator for the mode line.
1496 " Hungry"
1497 ;; The minor mode bindings.
1498 '(([C-backspace] . hungry-electric-delete))
1499 :group 'hunger)
1500 @end smallexample
1501
1502 @noindent
1503 This defines a minor mode named ``Hungry mode,'' a command named
1504 @code{hungry-mode} to toggle it, a variable named @code{hungry-mode}
1505 which indicates whether the mode is enabled, and a variable named
1506 @code{hungry-mode-map} which holds the keymap that is active when the
1507 mode is enabled. It initializes the keymap with a key binding for
1508 @kbd{C-@key{DEL}}. It puts the variable @code{hungry-mode} into
1509 custom group @code{hunger}. There are no @var{body} forms---many
1510 minor modes don't need any.
1511
1512 Here's an equivalent way to write it:
1513
1514 @smallexample
1515 (define-minor-mode hungry-mode
1516 "Toggle Hungry mode.
1517 With no argument, this command toggles the mode.
1518 Non-null prefix argument turns on the mode.
1519 Null prefix argument turns off the mode.
1520
1521 When Hungry mode is enabled, the control delete key
1522 gobbles all preceding whitespace except the last.
1523 See the command \\[hungry-electric-delete]."
1524 ;; The initial value.
1525 :init-value nil
1526 ;; The indicator for the mode line.
1527 :lighter " Hungry"
1528 ;; The minor mode bindings.
1529 :keymap
1530 '(([C-backspace] . hungry-electric-delete)
1531 ([C-M-backspace]
1532 . (lambda ()
1533 (interactive)
1534 (hungry-electric-delete t))))
1535 :group 'hunger)
1536 @end smallexample
1537
1538 @defmac define-globalized-minor-mode global-mode mode turn-on keyword-args@dots{}
1539 This defines a global toggle named @var{global-mode} whose meaning is
1540 to enable or disable the buffer-local minor mode @var{mode} in all
1541 buffers. To turn on the minor mode in a buffer, it uses the function
1542 @var{turn-on}; to turn off the minor mode, it calls @code{mode} with
1543 @minus{}1 as argument.
1544
1545 Globally enabling the mode also affects buffers subsequently created
1546 by visiting files, and buffers that use a major mode other than
1547 Fundamental mode; but it does not detect the creation of a new buffer
1548 in Fundamental mode.
1549
1550 This defines the customization option @var{global-mode} (@pxref{Customization}),
1551 which can be toggled in the Custom interface to turn the minor mode on
1552 and off. As with @code{define-minor-mode}, you should ensure that the
1553 @code{define-globalized-minor-mode} form is evaluated each time Emacs
1554 starts, for example by providing a @code{:require} keyword.
1555
1556 Use @code{:group @var{group}} in @var{keyword-args} to specify the
1557 custom group for the mode variable of the global minor mode.
1558 @end defmac
1559
1560 @node Mode Line Format
1561 @section Mode-Line Format
1562 @cindex mode line
1563
1564 Each Emacs window (aside from minibuffer windows) typically has a mode
1565 line at the bottom, which displays status information about the buffer
1566 displayed in the window. The mode line contains information about the
1567 buffer, such as its name, associated file, depth of recursive editing,
1568 and major and minor modes. A window can also have a @dfn{header
1569 line}, which is much like the mode line but appears at the top of the
1570 window.
1571
1572 This section describes how to control the contents of the mode line
1573 and header line. We include it in this chapter because much of the
1574 information displayed in the mode line relates to the enabled major and
1575 minor modes.
1576
1577 @menu
1578 * Base: Mode Line Basics. Basic ideas of mode line control.
1579 * Data: Mode Line Data. The data structure that controls the mode line.
1580 * Top: Mode Line Top. The top level variable, mode-line-format.
1581 * Mode Line Variables:: Variables used in that data structure.
1582 * %-Constructs:: Putting information into a mode line.
1583 * Properties in Mode:: Using text properties in the mode line.
1584 * Header Lines:: Like a mode line, but at the top.
1585 * Emulating Mode Line:: Formatting text as the mode line would.
1586 @end menu
1587
1588 @node Mode Line Basics
1589 @subsection Mode Line Basics
1590
1591 @code{mode-line-format} is a buffer-local variable that holds a
1592 @dfn{mode line construct}, a kind of template, which controls what is
1593 displayed on the mode line of the current buffer. The value of
1594 @code{header-line-format} specifies the buffer's header line in the
1595 same way. All windows for the same buffer use the same
1596 @code{mode-line-format} and @code{header-line-format}.
1597
1598 For efficiency, Emacs does not continuously recompute the mode
1599 line and header line of a window. It does so when circumstances
1600 appear to call for it---for instance, if you change the window
1601 configuration, switch buffers, narrow or widen the buffer, scroll, or
1602 change the buffer's modification status. If you modify any of the
1603 variables referenced by @code{mode-line-format} (@pxref{Mode Line
1604 Variables}), or any other variables and data structures that affect
1605 how text is displayed (@pxref{Display}), you may want to force an
1606 update of the mode line so as to display the new information or
1607 display it in the new way.
1608
1609 @defun force-mode-line-update &optional all
1610 Force redisplay of the current buffer's mode line and header line.
1611 The next redisplay will update the mode line and header line based on
1612 the latest values of all relevant variables. With optional
1613 non-@code{nil} @var{all}, force redisplay of all mode lines and header
1614 lines.
1615
1616 This function also forces recomputation of the menu bar menus
1617 and the frame title.
1618 @end defun
1619
1620 The selected window's mode line is usually displayed in a different
1621 color using the face @code{mode-line}. Other windows' mode lines
1622 appear in the face @code{mode-line-inactive} instead. @xref{Faces}.
1623
1624 @node Mode Line Data
1625 @subsection The Data Structure of the Mode Line
1626 @cindex mode-line construct
1627
1628 The mode-line contents are controlled by a data structure called a
1629 @dfn{mode-line construct}, made up of lists, strings, symbols, and
1630 numbers kept in buffer-local variables. Each data type has a specific
1631 meaning for the mode-line appearance, as described below. The same
1632 data structure is used for constructing frame titles (@pxref{Frame
1633 Titles}) and header lines (@pxref{Header Lines}).
1634
1635 A mode-line construct may be as simple as a fixed string of text,
1636 but it usually specifies how to combine fixed strings with variables'
1637 values to construct the text. Many of these variables are themselves
1638 defined to have mode-line constructs as their values.
1639
1640 Here are the meanings of various data types as mode-line constructs:
1641
1642 @table @code
1643 @cindex percent symbol in mode line
1644 @item @var{string}
1645 A string as a mode-line construct appears verbatim except for
1646 @dfn{@code{%}-constructs} in it. These stand for substitution of
1647 other data; see @ref{%-Constructs}.
1648
1649 If parts of the string have @code{face} properties, they control
1650 display of the text just as they would text in the buffer. Any
1651 characters which have no @code{face} properties are displayed, by
1652 default, in the face @code{mode-line} or @code{mode-line-inactive}
1653 (@pxref{Standard Faces,,, emacs, The GNU Emacs Manual}). The
1654 @code{help-echo} and @code{local-map} properties in @var{string} have
1655 special meanings. @xref{Properties in Mode}.
1656
1657 @item @var{symbol}
1658 A symbol as a mode-line construct stands for its value. The value of
1659 @var{symbol} is used as a mode-line construct, in place of @var{symbol}.
1660 However, the symbols @code{t} and @code{nil} are ignored, as is any
1661 symbol whose value is void.
1662
1663 There is one exception: if the value of @var{symbol} is a string, it is
1664 displayed verbatim: the @code{%}-constructs are not recognized.
1665
1666 Unless @var{symbol} is marked as ``risky'' (i.e., it has a
1667 non-@code{nil} @code{risky-local-variable} property), all text
1668 properties specified in @var{symbol}'s value are ignored. This
1669 includes the text properties of strings in @var{symbol}'s value, as
1670 well as all @code{:eval} and @code{:propertize} forms in it. (The
1671 reason for this is security: non-risky variables could be set
1672 automatically from file variables without prompting the user.)
1673
1674 @item (@var{string} @var{rest}@dots{})
1675 @itemx (@var{list} @var{rest}@dots{})
1676 A list whose first element is a string or list means to process all the
1677 elements recursively and concatenate the results. This is the most
1678 common form of mode-line construct.
1679
1680 @item (:eval @var{form})
1681 A list whose first element is the symbol @code{:eval} says to evaluate
1682 @var{form}, and use the result as a string to display. Make sure this
1683 evaluation cannot load any files, as doing so could cause infinite
1684 recursion.
1685
1686 @item (:propertize @var{elt} @var{props}@dots{})
1687 A list whose first element is the symbol @code{:propertize} says to
1688 process the mode-line construct @var{elt} recursively, then add the text
1689 properties specified by @var{props} to the result. The argument
1690 @var{props} should consist of zero or more pairs @var{text-property}
1691 @var{value}. (This feature is new as of Emacs 22.1.)
1692
1693 @item (@var{symbol} @var{then} @var{else})
1694 A list whose first element is a symbol that is not a keyword specifies
1695 a conditional. Its meaning depends on the value of @var{symbol}. If
1696 @var{symbol} has a non-@code{nil} value, the second element,
1697 @var{then}, is processed recursively as a mode-line element.
1698 Otherwise, the third element, @var{else}, is processed recursively.
1699 You may omit @var{else}; then the mode-line element displays nothing
1700 if the value of @var{symbol} is @code{nil} or void.
1701
1702 @item (@var{width} @var{rest}@dots{})
1703 A list whose first element is an integer specifies truncation or
1704 padding of the results of @var{rest}. The remaining elements
1705 @var{rest} are processed recursively as mode-line constructs and
1706 concatenated together. When @var{width} is positive, the result is
1707 space filled on the right if its width is less than @var{width}. When
1708 @var{width} is negative, the result is truncated on the right to
1709 @minus{}@var{width} columns if its width exceeds @minus{}@var{width}.
1710
1711 For example, the usual way to show what percentage of a buffer is above
1712 the top of the window is to use a list like this: @code{(-3 "%p")}.
1713 @end table
1714
1715 @node Mode Line Top
1716 @subsection The Top Level of Mode Line Control
1717
1718 The variable in overall control of the mode line is
1719 @code{mode-line-format}.
1720
1721 @defopt mode-line-format
1722 The value of this variable is a mode-line construct that controls the
1723 contents of the mode-line. It is always buffer-local in all buffers.
1724
1725 If you set this variable to @code{nil} in a buffer, that buffer does
1726 not have a mode line. (A window that is just one line tall never
1727 displays a mode line.)
1728 @end defopt
1729
1730 The default value of @code{mode-line-format} is designed to use the
1731 values of other variables such as @code{mode-line-position} and
1732 @code{mode-line-modes} (which in turn incorporates the values of the
1733 variables @code{mode-name} and @code{minor-mode-alist}). Very few
1734 modes need to alter @code{mode-line-format} itself. For most
1735 purposes, it is sufficient to alter some of the variables that
1736 @code{mode-line-format} either directly or indirectly refers to.
1737
1738 If you do alter @code{mode-line-format} itself, the new value should
1739 use the same variables that appear in the default value (@pxref{Mode
1740 Line Variables}), rather than duplicating their contents or displaying
1741 the information in another fashion. This way, customizations made by
1742 the user or by Lisp programs (such as @code{display-time} and major
1743 modes) via changes to those variables remain effective.
1744
1745 Here is an example of a @code{mode-line-format} that might be
1746 useful for @code{shell-mode}, since it contains the host name and default
1747 directory.
1748
1749 @example
1750 @group
1751 (setq mode-line-format
1752 (list "-"
1753 'mode-line-mule-info
1754 'mode-line-modified
1755 'mode-line-frame-identification
1756 "%b--"
1757 @end group
1758 @group
1759 ;; @r{Note that this is evaluated while making the list.}
1760 ;; @r{It makes a mode-line construct which is just a string.}
1761 (getenv "HOST")
1762 @end group
1763 ":"
1764 'default-directory
1765 " "
1766 'global-mode-string
1767 " %[("
1768 '(:eval (mode-line-mode-name))
1769 'mode-line-process
1770 'minor-mode-alist
1771 "%n"
1772 ")%]--"
1773 @group
1774 '(which-func-mode ("" which-func-format "--"))
1775 '(line-number-mode "L%l--")
1776 '(column-number-mode "C%c--")
1777 '(-3 "%p")
1778 "-%-"))
1779 @end group
1780 @end example
1781
1782 @noindent
1783 (The variables @code{line-number-mode}, @code{column-number-mode}
1784 and @code{which-func-mode} enable particular minor modes; as usual,
1785 these variable names are also the minor mode command names.)
1786
1787 @node Mode Line Variables
1788 @subsection Variables Used in the Mode Line
1789
1790 This section describes variables incorporated by the standard value
1791 of @code{mode-line-format} into the text of the mode line. There is
1792 nothing inherently special about these variables; any other variables
1793 could have the same effects on the mode line if
1794 @code{mode-line-format}'s value were changed to use them. However,
1795 various parts of Emacs set these variables on the understanding that
1796 they will control parts of the mode line; therefore, practically
1797 speaking, it is essential for the mode line to use them.
1798
1799 @defvar mode-line-mule-info
1800 This variable holds the value of the mode-line construct that displays
1801 information about the language environment, buffer coding system, and
1802 current input method. @xref{Non-ASCII Characters}.
1803 @end defvar
1804
1805 @defvar mode-line-modified
1806 This variable holds the value of the mode-line construct that displays
1807 whether the current buffer is modified. Its default value displays
1808 @samp{**} if the buffer is modified, @samp{--} if the buffer is not
1809 modified, @samp{%%} if the buffer is read only, and @samp{%*} if the
1810 buffer is read only and modified.
1811
1812 Changing this variable does not force an update of the mode line.
1813 @end defvar
1814
1815 @defvar mode-line-frame-identification
1816 This variable identifies the current frame. Its default value
1817 displays @code{" "} if you are using a window system which can show
1818 multiple frames, or @code{"-%F "} on an ordinary terminal which shows
1819 only one frame at a time.
1820 @end defvar
1821
1822 @defvar mode-line-buffer-identification
1823 This variable identifies the buffer being displayed in the window.
1824 Its default value displays the buffer name, padded with spaces to at
1825 least 12 columns.
1826 @end defvar
1827
1828 @defopt mode-line-position
1829 This variable indicates the position in the buffer. Its default value
1830 displays the buffer percentage and, optionally, the buffer size, the
1831 line number and the column number.
1832 @end defopt
1833
1834 @defvar vc-mode
1835 The variable @code{vc-mode}, buffer-local in each buffer, records
1836 whether the buffer's visited file is maintained with version control,
1837 and, if so, which kind. Its value is a string that appears in the mode
1838 line, or @code{nil} for no version control.
1839 @end defvar
1840
1841 @defopt mode-line-modes
1842 This variable displays the buffer's major and minor modes. Its
1843 default value also displays the recursive editing level, information
1844 on the process status, and whether narrowing is in effect.
1845 @end defopt
1846
1847 The following three variables are used in @code{mode-line-modes}:
1848
1849 @defvar mode-name
1850 This buffer-local variable holds the ``pretty'' name of the current
1851 buffer's major mode. Each major mode should set this variable so that
1852 the mode name will appear in the mode line. The value does not have
1853 to be a string, but can use any of the data types valid in a mode-line
1854 construct (@pxref{Mode Line Data}). To compute the string that will
1855 identify the mode name in the mode line, use @code{format-mode-line}
1856 (@pxref{Emulating Mode Line}).
1857 @end defvar
1858
1859 @defvar mode-line-process
1860 This buffer-local variable contains the mode-line information on process
1861 status in modes used for communicating with subprocesses. It is
1862 displayed immediately following the major mode name, with no intervening
1863 space. For example, its value in the @samp{*shell*} buffer is
1864 @code{(":%s")}, which allows the shell to display its status along
1865 with the major mode as: @samp{(Shell:run)}. Normally this variable
1866 is @code{nil}.
1867 @end defvar
1868
1869 @defvar minor-mode-alist
1870 @anchor{Definition of minor-mode-alist}
1871 This variable holds an association list whose elements specify how the
1872 mode line should indicate that a minor mode is active. Each element of
1873 the @code{minor-mode-alist} should be a two-element list:
1874
1875 @example
1876 (@var{minor-mode-variable} @var{mode-line-string})
1877 @end example
1878
1879 More generally, @var{mode-line-string} can be any mode-line spec. It
1880 appears in the mode line when the value of @var{minor-mode-variable}
1881 is non-@code{nil}, and not otherwise. These strings should begin with
1882 spaces so that they don't run together. Conventionally, the
1883 @var{minor-mode-variable} for a specific mode is set to a
1884 non-@code{nil} value when that minor mode is activated.
1885
1886 @code{minor-mode-alist} itself is not buffer-local. Each variable
1887 mentioned in the alist should be buffer-local if its minor mode can be
1888 enabled separately in each buffer.
1889 @end defvar
1890
1891 @defvar global-mode-string
1892 This variable holds a mode-line spec that, by default, appears in the
1893 mode line just after the @code{which-func-mode} minor mode if set,
1894 else after @code{mode-line-modes}. The command @code{display-time}
1895 sets @code{global-mode-string} to refer to the variable
1896 @code{display-time-string}, which holds a string containing the time
1897 and load information.
1898
1899 The @samp{%M} construct substitutes the value of
1900 @code{global-mode-string}, but that is obsolete, since the variable is
1901 included in the mode line from @code{mode-line-format}.
1902 @end defvar
1903
1904 Here is a simplified version of the default value of
1905 @code{mode-line-format}. The real default value also
1906 specifies addition of text properties.
1907
1908 @example
1909 @group
1910 ("-"
1911 mode-line-mule-info
1912 mode-line-modified
1913 mode-line-frame-identification
1914 mode-line-buffer-identification
1915 @end group
1916 " "
1917 mode-line-position
1918 (vc-mode vc-mode)
1919 " "
1920 @group
1921 mode-line-modes
1922 (which-func-mode ("" which-func-format "--"))
1923 (global-mode-string ("--" global-mode-string))
1924 "-%-")
1925 @end group
1926 @end example
1927
1928 @node %-Constructs
1929 @subsection @code{%}-Constructs in the Mode Line
1930
1931 Strings used as mode-line constructs can use certain
1932 @code{%}-constructs to substitute various kinds of data. Here is a
1933 list of the defined @code{%}-constructs, and what they mean. In any
1934 construct except @samp{%%}, you can add a decimal integer after the
1935 @samp{%} to specify a minimum field width. If the width is less, the
1936 field is padded with spaces to the right.
1937
1938 @table @code
1939 @item %b
1940 The current buffer name, obtained with the @code{buffer-name} function.
1941 @xref{Buffer Names}.
1942
1943 @item %c
1944 The current column number of point.
1945
1946 @item %e
1947 When Emacs is nearly out of memory for Lisp objects, a brief message
1948 saying so. Otherwise, this is empty.
1949
1950 @item %f
1951 The visited file name, obtained with the @code{buffer-file-name}
1952 function. @xref{Buffer File Name}.
1953
1954 @item %F
1955 The title (only on a window system) or the name of the selected frame.
1956 @xref{Basic Parameters}.
1957
1958 @item %i
1959 The size of the accessible part of the current buffer; basically
1960 @code{(- (point-max) (point-min))}.
1961
1962 @item %I
1963 Like @samp{%i}, but the size is printed in a more readable way by using
1964 @samp{k} for 10^3, @samp{M} for 10^6, @samp{G} for 10^9, etc., to
1965 abbreviate.
1966
1967 @item %l
1968 The current line number of point, counting within the accessible portion
1969 of the buffer.
1970
1971 @item %n
1972 @samp{Narrow} when narrowing is in effect; nothing otherwise (see
1973 @code{narrow-to-region} in @ref{Narrowing}).
1974
1975 @item %p
1976 The percentage of the buffer text above the @strong{top} of window, or
1977 @samp{Top}, @samp{Bottom} or @samp{All}. Note that the default
1978 mode-line specification truncates this to three characters.
1979
1980 @item %P
1981 The percentage of the buffer text that is above the @strong{bottom} of
1982 the window (which includes the text visible in the window, as well as
1983 the text above the top), plus @samp{Top} if the top of the buffer is
1984 visible on screen; or @samp{Bottom} or @samp{All}.
1985
1986 @item %s
1987 The status of the subprocess belonging to the current buffer, obtained with
1988 @code{process-status}. @xref{Process Information}.
1989
1990 @item %t
1991 Whether the visited file is a text file or a binary file. This is a
1992 meaningful distinction only on certain operating systems (@pxref{MS-DOS
1993 File Types}).
1994
1995 @item %z
1996 The mnemonics of keyboard, terminal, and buffer coding systems.
1997
1998 @item %Z
1999 Like @samp{%z}, but including the end-of-line format.
2000
2001 @item %*
2002 @samp{%} if the buffer is read only (see @code{buffer-read-only}); @*
2003 @samp{*} if the buffer is modified (see @code{buffer-modified-p}); @*
2004 @samp{-} otherwise. @xref{Buffer Modification}.
2005
2006 @item %+
2007 @samp{*} if the buffer is modified (see @code{buffer-modified-p}); @*
2008 @samp{%} if the buffer is read only (see @code{buffer-read-only}); @*
2009 @samp{-} otherwise. This differs from @samp{%*} only for a modified
2010 read-only buffer. @xref{Buffer Modification}.
2011
2012 @item %&
2013 @samp{*} if the buffer is modified, and @samp{-} otherwise.
2014
2015 @item %[
2016 An indication of the depth of recursive editing levels (not counting
2017 minibuffer levels): one @samp{[} for each editing level.
2018 @xref{Recursive Editing}.
2019
2020 @item %]
2021 One @samp{]} for each recursive editing level (not counting minibuffer
2022 levels).
2023
2024 @item %-
2025 Dashes sufficient to fill the remainder of the mode line.
2026
2027 @item %%
2028 The character @samp{%}---this is how to include a literal @samp{%} in a
2029 string in which @code{%}-constructs are allowed.
2030 @end table
2031
2032 The following two @code{%}-constructs are still supported, but they are
2033 obsolete, since you can get the same results with the variables
2034 @code{mode-name} and @code{global-mode-string}.
2035
2036 @table @code
2037 @item %m
2038 The value of @code{mode-name}.
2039
2040 @item %M
2041 The value of @code{global-mode-string}.
2042 @end table
2043
2044 @node Properties in Mode
2045 @subsection Properties in the Mode Line
2046 @cindex text properties in the mode line
2047
2048 Certain text properties are meaningful in the
2049 mode line. The @code{face} property affects the appearance of text; the
2050 @code{help-echo} property associates help strings with the text, and
2051 @code{local-map} can make the text mouse-sensitive.
2052
2053 There are four ways to specify text properties for text in the mode
2054 line:
2055
2056 @enumerate
2057 @item
2058 Put a string with a text property directly into the mode-line data
2059 structure.
2060
2061 @item
2062 Put a text property on a mode-line %-construct such as @samp{%12b}; then
2063 the expansion of the %-construct will have that same text property.
2064
2065 @item
2066 Use a @code{(:propertize @var{elt} @var{props}@dots{})} construct to
2067 give @var{elt} a text property specified by @var{props}.
2068
2069 @item
2070 Use a list containing @code{:eval @var{form}} in the mode-line data
2071 structure, and make @var{form} evaluate to a string that has a text
2072 property.
2073 @end enumerate
2074
2075 You can use the @code{local-map} property to specify a keymap. This
2076 keymap only takes real effect for mouse clicks; binding character keys
2077 and function keys to it has no effect, since it is impossible to move
2078 point into the mode line.
2079
2080 When the mode line refers to a variable which does not have a
2081 non-@code{nil} @code{risky-local-variable} property, any text
2082 properties given or specified within that variable's values are
2083 ignored. This is because such properties could otherwise specify
2084 functions to be called, and those functions could come from file
2085 local variables.
2086
2087 @node Header Lines
2088 @subsection Window Header Lines
2089 @cindex header line (of a window)
2090 @cindex window header line
2091
2092 A window can have a @dfn{header line} at the
2093 top, just as it can have a mode line at the bottom. The header line
2094 feature works just like the mode-line feature, except that it's
2095 controlled by different variables.
2096
2097 @defvar header-line-format
2098 This variable, local in every buffer, specifies how to display the
2099 header line, for windows displaying the buffer. The format of the value
2100 is the same as for @code{mode-line-format} (@pxref{Mode Line Data}).
2101 It is normally @code{nil}, so that ordinary buffers have no header line.
2102 @end defvar
2103
2104 A window that is just one line tall never displays a header line. A
2105 window that is two lines tall cannot display both a mode line and a
2106 header line at once; if it has a mode line, then it does not display a
2107 header line.
2108
2109 @node Emulating Mode Line
2110 @subsection Emulating Mode-Line Formatting
2111
2112 You can use the function @code{format-mode-line} to compute
2113 the text that would appear in a mode line or header line
2114 based on a certain mode-line specification.
2115
2116 @defun format-mode-line format &optional face window buffer
2117 This function formats a line of text according to @var{format} as if it
2118 were generating the mode line for @var{window}, but it also returns the
2119 text as a string. The argument @var{window} defaults to the selected
2120 window. If @var{buffer} is non-@code{nil}, all the information used is
2121 taken from @var{buffer}; by default, it comes from @var{window}'s
2122 buffer.
2123
2124 The value string normally has text properties that correspond to the
2125 faces, keymaps, etc., that the mode line would have. Any character for
2126 which no @code{face} property is specified by @var{format} gets a
2127 default value determined by @var{face}. If @var{face} is @code{t}, that
2128 stands for either @code{mode-line} if @var{window} is selected,
2129 otherwise @code{mode-line-inactive}. If @var{face} is @code{nil} or
2130 omitted, that stands for the default face. If @var{face} is an integer,
2131 the value returned by this function will have no text properties.
2132
2133 You can also specify other valid faces as the value of @var{face}.
2134 If specified, that face provides the @code{face} property for characters
2135 whose face is not specified by @var{format}.
2136
2137 Note that using @code{mode-line}, @code{mode-line-inactive}, or
2138 @code{header-line} as @var{face} will actually redisplay the mode line
2139 or the header line, respectively, using the current definitions of the
2140 corresponding face, in addition to returning the formatted string.
2141 (Other faces do not cause redisplay.)
2142
2143 For example, @code{(format-mode-line header-line-format)} returns the
2144 text that would appear in the selected window's header line (@code{""}
2145 if it has no header line). @code{(format-mode-line header-line-format
2146 'header-line)} returns the same text, with each character
2147 carrying the face that it will have in the header line itself, and also
2148 redraws the header line.
2149 @end defun
2150
2151 @node Imenu
2152 @section Imenu
2153
2154 @cindex Imenu
2155 @dfn{Imenu} is a feature that lets users select a definition or
2156 section in the buffer, from a menu which lists all of them, to go
2157 directly to that location in the buffer. Imenu works by constructing
2158 a buffer index which lists the names and buffer positions of the
2159 definitions, or other named portions of the buffer; then the user can
2160 choose one of them and move point to it. Major modes can add a menu
2161 bar item to use Imenu using @code{imenu-add-to-menubar}.
2162
2163 @defun imenu-add-to-menubar name
2164 This function defines a local menu bar item named @var{name}
2165 to run Imenu.
2166 @end defun
2167
2168 The user-level commands for using Imenu are described in the Emacs
2169 Manual (@pxref{Imenu,, Imenu, emacs, the Emacs Manual}). This section
2170 explains how to customize Imenu's method of finding definitions or
2171 buffer portions for a particular major mode.
2172
2173 The usual and simplest way is to set the variable
2174 @code{imenu-generic-expression}:
2175
2176 @defvar imenu-generic-expression
2177 This variable, if non-@code{nil}, is a list that specifies regular
2178 expressions for finding definitions for Imenu. Simple elements of
2179 @code{imenu-generic-expression} look like this:
2180
2181 @example
2182 (@var{menu-title} @var{regexp} @var{index})
2183 @end example
2184
2185 Here, if @var{menu-title} is non-@code{nil}, it says that the matches
2186 for this element should go in a submenu of the buffer index;
2187 @var{menu-title} itself specifies the name for the submenu. If
2188 @var{menu-title} is @code{nil}, the matches for this element go directly
2189 in the top level of the buffer index.
2190
2191 The second item in the list, @var{regexp}, is a regular expression
2192 (@pxref{Regular Expressions}); anything in the buffer that it matches
2193 is considered a definition, something to mention in the buffer index.
2194 The third item, @var{index}, is a non-negative integer that indicates
2195 which subexpression in @var{regexp} matches the definition's name.
2196
2197 An element can also look like this:
2198
2199 @example
2200 (@var{menu-title} @var{regexp} @var{index} @var{function} @var{arguments}@dots{})
2201 @end example
2202
2203 Each match for this element creates an index item, and when the index
2204 item is selected by the user, it calls @var{function} with arguments
2205 consisting of the item name, the buffer position, and @var{arguments}.
2206
2207 For Emacs Lisp mode, @code{imenu-generic-expression} could look like
2208 this:
2209
2210 @c should probably use imenu-syntax-alist and \\sw rather than [-A-Za-z0-9+]
2211 @example
2212 @group
2213 ((nil "^\\s-*(def\\(un\\|subst\\|macro\\|advice\\)\
2214 \\s-+\\([-A-Za-z0-9+]+\\)" 2)
2215 @end group
2216 @group
2217 ("*Vars*" "^\\s-*(def\\(var\\|const\\)\
2218 \\s-+\\([-A-Za-z0-9+]+\\)" 2)
2219 @end group
2220 @group
2221 ("*Types*"
2222 "^\\s-*\
2223 (def\\(type\\|struct\\|class\\|ine-condition\\)\
2224 \\s-+\\([-A-Za-z0-9+]+\\)" 2))
2225 @end group
2226 @end example
2227
2228 Setting this variable makes it buffer-local in the current buffer.
2229 @end defvar
2230
2231 @defvar imenu-case-fold-search
2232 This variable controls whether matching against the regular
2233 expressions in the value of @code{imenu-generic-expression} is
2234 case-sensitive: @code{t}, the default, means matching should ignore
2235 case.
2236
2237 Setting this variable makes it buffer-local in the current buffer.
2238 @end defvar
2239
2240 @defvar imenu-syntax-alist
2241 This variable is an alist of syntax table modifiers to use while
2242 processing @code{imenu-generic-expression}, to override the syntax table
2243 of the current buffer. Each element should have this form:
2244
2245 @example
2246 (@var{characters} . @var{syntax-description})
2247 @end example
2248
2249 The @sc{car}, @var{characters}, can be either a character or a string.
2250 The element says to give that character or characters the syntax
2251 specified by @var{syntax-description}, which is passed to
2252 @code{modify-syntax-entry} (@pxref{Syntax Table Functions}).
2253
2254 This feature is typically used to give word syntax to characters which
2255 normally have symbol syntax, and thus to simplify
2256 @code{imenu-generic-expression} and speed up matching.
2257 For example, Fortran mode uses it this way:
2258
2259 @example
2260 (setq imenu-syntax-alist '(("_$" . "w")))
2261 @end example
2262
2263 The @code{imenu-generic-expression} regular expressions can then use
2264 @samp{\\sw+} instead of @samp{\\(\\sw\\|\\s_\\)+}. Note that this
2265 technique may be inconvenient when the mode needs to limit the initial
2266 character of a name to a smaller set of characters than are allowed in
2267 the rest of a name.
2268
2269 Setting this variable makes it buffer-local in the current buffer.
2270 @end defvar
2271
2272 Another way to customize Imenu for a major mode is to set the
2273 variables @code{imenu-prev-index-position-function} and
2274 @code{imenu-extract-index-name-function}:
2275
2276 @defvar imenu-prev-index-position-function
2277 If this variable is non-@code{nil}, its value should be a function that
2278 finds the next ``definition'' to put in the buffer index, scanning
2279 backward in the buffer from point. It should return @code{nil} if it
2280 doesn't find another ``definition'' before point. Otherwise it should
2281 leave point at the place it finds a ``definition'' and return any
2282 non-@code{nil} value.
2283
2284 Setting this variable makes it buffer-local in the current buffer.
2285 @end defvar
2286
2287 @defvar imenu-extract-index-name-function
2288 If this variable is non-@code{nil}, its value should be a function to
2289 return the name for a definition, assuming point is in that definition
2290 as the @code{imenu-prev-index-position-function} function would leave
2291 it.
2292
2293 Setting this variable makes it buffer-local in the current buffer.
2294 @end defvar
2295
2296 The last way to customize Imenu for a major mode is to set the
2297 variable @code{imenu-create-index-function}:
2298
2299 @defvar imenu-create-index-function
2300 This variable specifies the function to use for creating a buffer
2301 index. The function should take no arguments, and return an index
2302 alist for the current buffer. It is called within
2303 @code{save-excursion}, so where it leaves point makes no difference.
2304
2305 The index alist can have three types of elements. Simple elements
2306 look like this:
2307
2308 @example
2309 (@var{index-name} . @var{index-position})
2310 @end example
2311
2312 Selecting a simple element has the effect of moving to position
2313 @var{index-position} in the buffer. Special elements look like this:
2314
2315 @example
2316 (@var{index-name} @var{index-position} @var{function} @var{arguments}@dots{})
2317 @end example
2318
2319 Selecting a special element performs:
2320
2321 @example
2322 (funcall @var{function}
2323 @var{index-name} @var{index-position} @var{arguments}@dots{})
2324 @end example
2325
2326 A nested sub-alist element looks like this:
2327
2328 @example
2329 (@var{menu-title} @var{sub-alist})
2330 @end example
2331
2332 It creates the submenu @var{menu-title} specified by @var{sub-alist}.
2333
2334 The default value of @code{imenu-create-index-function} is
2335 @code{imenu-default-create-index-function}. This function calls the
2336 value of @code{imenu-prev-index-position-function} and the value of
2337 @code{imenu-extract-index-name-function} to produce the index alist.
2338 However, if either of these two variables is @code{nil}, the default
2339 function uses @code{imenu-generic-expression} instead.
2340
2341 Setting this variable makes it buffer-local in the current buffer.
2342 @end defvar
2343
2344 @node Font Lock Mode
2345 @section Font Lock Mode
2346 @cindex Font Lock mode
2347
2348 @dfn{Font Lock mode} is a feature that automatically attaches
2349 @code{face} properties to certain parts of the buffer based on their
2350 syntactic role. How it parses the buffer depends on the major mode;
2351 most major modes define syntactic criteria for which faces to use in
2352 which contexts. This section explains how to customize Font Lock for a
2353 particular major mode.
2354
2355 Font Lock mode finds text to highlight in two ways: through
2356 syntactic parsing based on the syntax table, and through searching
2357 (usually for regular expressions). Syntactic fontification happens
2358 first; it finds comments and string constants and highlights them.
2359 Search-based fontification happens second.
2360
2361 @menu
2362 * Font Lock Basics:: Overview of customizing Font Lock.
2363 * Search-based Fontification:: Fontification based on regexps.
2364 * Customizing Keywords:: Customizing search-based fontification.
2365 * Other Font Lock Variables:: Additional customization facilities.
2366 * Levels of Font Lock:: Each mode can define alternative levels
2367 so that the user can select more or less.
2368 * Precalculated Fontification:: How Lisp programs that produce the buffer
2369 contents can also specify how to fontify it.
2370 * Faces for Font Lock:: Special faces specifically for Font Lock.
2371 * Syntactic Font Lock:: Fontification based on syntax tables.
2372 * Setting Syntax Properties:: Defining character syntax based on context
2373 using the Font Lock mechanism.
2374 * Multiline Font Lock:: How to coerce Font Lock into properly
2375 highlighting multiline constructs.
2376 @end menu
2377
2378 @node Font Lock Basics
2379 @subsection Font Lock Basics
2380
2381 There are several variables that control how Font Lock mode highlights
2382 text. But major modes should not set any of these variables directly.
2383 Instead, they should set @code{font-lock-defaults} as a buffer-local
2384 variable. The value assigned to this variable is used, if and when Font
2385 Lock mode is enabled, to set all the other variables.
2386
2387 @defvar font-lock-defaults
2388 This variable is set by major modes, as a buffer-local variable, to
2389 specify how to fontify text in that mode. It automatically becomes
2390 buffer-local when you set it. If its value is @code{nil}, Font-Lock
2391 mode does no highlighting, and you can use the @samp{Faces} menu
2392 (under @samp{Edit} and then @samp{Text Properties} in the menu bar) to
2393 assign faces explicitly to text in the buffer.
2394
2395 If non-@code{nil}, the value should look like this:
2396
2397 @example
2398 (@var{keywords} [@var{keywords-only} [@var{case-fold}
2399 [@var{syntax-alist} [@var{syntax-begin} @var{other-vars}@dots{}]]]])
2400 @end example
2401
2402 The first element, @var{keywords}, indirectly specifies the value of
2403 @code{font-lock-keywords} which directs search-based fontification.
2404 It can be a symbol, a variable or a function whose value is the list
2405 to use for @code{font-lock-keywords}. It can also be a list of
2406 several such symbols, one for each possible level of fontification.
2407 The first symbol specifies the @samp{mode default} level of
2408 fontification, the next symbol level 1 fontification, the next level 2,
2409 and so on. The @samp{mode default} level is normally the same as level
2410 1. It is used when @code{font-lock-maximum-decoration} has a @code{nil}
2411 value. @xref{Levels of Font Lock}.
2412
2413 The second element, @var{keywords-only}, specifies the value of the
2414 variable @code{font-lock-keywords-only}. If this is omitted or
2415 @code{nil}, syntactic fontification (of strings and comments) is also
2416 performed. If this is non-@code{nil}, such fontification is not
2417 performed. @xref{Syntactic Font Lock}.
2418
2419 The third element, @var{case-fold}, specifies the value of
2420 @code{font-lock-keywords-case-fold-search}. If it is non-@code{nil},
2421 Font Lock mode ignores case when searching as directed by
2422 @code{font-lock-keywords}.
2423
2424 If the fourth element, @var{syntax-alist}, is non-@code{nil}, it
2425 should be a list of cons cells of the form @code{(@var{char-or-string}
2426 . @var{string})}. These are used to set up a syntax table for
2427 syntactic fontification (@pxref{Syntax Table Functions}). The
2428 resulting syntax table is stored in @code{font-lock-syntax-table}.
2429
2430 The fifth element, @var{syntax-begin}, specifies the value of
2431 @code{font-lock-beginning-of-syntax-function}. We recommend setting
2432 this variable to @code{nil} and using @code{syntax-begin-function}
2433 instead.
2434
2435 All the remaining elements (if any) are collectively called
2436 @var{other-vars}. Each of these elements should have the form
2437 @code{(@var{variable} . @var{value})}---which means, make
2438 @var{variable} buffer-local and then set it to @var{value}. You can
2439 use these @var{other-vars} to set other variables that affect
2440 fontification, aside from those you can control with the first five
2441 elements. @xref{Other Font Lock Variables}.
2442 @end defvar
2443
2444 If your mode fontifies text explicitly by adding
2445 @code{font-lock-face} properties, it can specify @code{(nil t)} for
2446 @code{font-lock-defaults} to turn off all automatic fontification.
2447 However, this is not required; it is possible to fontify some things
2448 using @code{font-lock-face} properties and set up automatic
2449 fontification for other parts of the text.
2450
2451 @node Search-based Fontification
2452 @subsection Search-based Fontification
2453
2454 The most important variable for customizing Font Lock mode is
2455 @code{font-lock-keywords}. It specifies the search criteria for
2456 search-based fontification. You should specify the value of this
2457 variable with @var{keywords} in @code{font-lock-defaults}.
2458
2459 @defvar font-lock-keywords
2460 This variable's value is a list of the keywords to highlight. Be
2461 careful when composing regular expressions for this list; a poorly
2462 written pattern can dramatically slow things down!
2463 @end defvar
2464
2465 Each element of @code{font-lock-keywords} specifies how to find
2466 certain cases of text, and how to highlight those cases. Font Lock mode
2467 processes the elements of @code{font-lock-keywords} one by one, and for
2468 each element, it finds and handles all matches. Ordinarily, once
2469 part of the text has been fontified already, this cannot be overridden
2470 by a subsequent match in the same text; but you can specify different
2471 behavior using the @var{override} element of a @var{subexp-highlighter}.
2472
2473 Each element of @code{font-lock-keywords} should have one of these
2474 forms:
2475
2476 @table @code
2477 @item @var{regexp}
2478 Highlight all matches for @var{regexp} using
2479 @code{font-lock-keyword-face}. For example,
2480
2481 @example
2482 ;; @r{Highlight occurrences of the word @samp{foo}}
2483 ;; @r{using @code{font-lock-keyword-face}.}
2484 "\\<foo\\>"
2485 @end example
2486
2487 The function @code{regexp-opt} (@pxref{Regexp Functions}) is useful
2488 for calculating optimal regular expressions to match a number of
2489 different keywords.
2490
2491 @item @var{function}
2492 Find text by calling @var{function}, and highlight the matches
2493 it finds using @code{font-lock-keyword-face}.
2494
2495 When @var{function} is called, it receives one argument, the limit of
2496 the search; it should begin searching at point, and not search beyond the
2497 limit. It should return non-@code{nil} if it succeeds, and set the
2498 match data to describe the match that was found. Returning @code{nil}
2499 indicates failure of the search.
2500
2501 Fontification will call @var{function} repeatedly with the same limit,
2502 and with point where the previous invocation left it, until
2503 @var{function} fails. On failure, @var{function} need not reset point
2504 in any particular way.
2505
2506 @item (@var{matcher} . @var{subexp})
2507 In this kind of element, @var{matcher} is either a regular
2508 expression or a function, as described above. The @sc{cdr},
2509 @var{subexp}, specifies which subexpression of @var{matcher} should be
2510 highlighted (instead of the entire text that @var{matcher} matched).
2511
2512 @example
2513 ;; @r{Highlight the @samp{bar} in each occurrence of @samp{fubar},}
2514 ;; @r{using @code{font-lock-keyword-face}.}
2515 ("fu\\(bar\\)" . 1)
2516 @end example
2517
2518 If you use @code{regexp-opt} to produce the regular expression
2519 @var{matcher}, you can use @code{regexp-opt-depth} (@pxref{Regexp
2520 Functions}) to calculate the value for @var{subexp}.
2521
2522 @item (@var{matcher} . @var{facespec})
2523 In this kind of element, @var{facespec} is an expression whose value
2524 specifies the face to use for highlighting. In the simplest case,
2525 @var{facespec} is a Lisp variable (a symbol) whose value is a face
2526 name.
2527
2528 @example
2529 ;; @r{Highlight occurrences of @samp{fubar},}
2530 ;; @r{using the face which is the value of @code{fubar-face}.}
2531 ("fubar" . fubar-face)
2532 @end example
2533
2534 However, @var{facespec} can also evaluate to a list of this form:
2535
2536 @example
2537 (face @var{face} @var{prop1} @var{val1} @var{prop2} @var{val2}@dots{})
2538 @end example
2539
2540 @noindent
2541 to specify the face @var{face} and various additional text properties
2542 to put on the text that matches. If you do this, be sure to add the
2543 other text property names that you set in this way to the value of
2544 @code{font-lock-extra-managed-props} so that the properties will also
2545 be cleared out when they are no longer appropriate. Alternatively,
2546 you can set the variable @code{font-lock-unfontify-region-function} to
2547 a function that clears these properties. @xref{Other Font Lock
2548 Variables}.
2549
2550 @item (@var{matcher} . @var{subexp-highlighter})
2551 In this kind of element, @var{subexp-highlighter} is a list
2552 which specifies how to highlight matches found by @var{matcher}.
2553 It has the form:
2554
2555 @example
2556 (@var{subexp} @var{facespec} [@var{override} [@var{laxmatch}]])
2557 @end example
2558
2559 The @sc{car}, @var{subexp}, is an integer specifying which subexpression
2560 of the match to fontify (0 means the entire matching text). The second
2561 subelement, @var{facespec}, is an expression whose value specifies the
2562 face, as described above.
2563
2564 The last two values in @var{subexp-highlighter}, @var{override} and
2565 @var{laxmatch}, are optional flags. If @var{override} is @code{t},
2566 this element can override existing fontification made by previous
2567 elements of @code{font-lock-keywords}. If it is @code{keep}, then
2568 each character is fontified if it has not been fontified already by
2569 some other element. If it is @code{prepend}, the face specified by
2570 @var{facespec} is added to the beginning of the @code{font-lock-face}
2571 property. If it is @code{append}, the face is added to the end of the
2572 @code{font-lock-face} property.
2573
2574 If @var{laxmatch} is non-@code{nil}, it means there should be no error
2575 if there is no subexpression numbered @var{subexp} in @var{matcher}.
2576 Obviously, fontification of the subexpression numbered @var{subexp} will
2577 not occur. However, fontification of other subexpressions (and other
2578 regexps) will continue. If @var{laxmatch} is @code{nil}, and the
2579 specified subexpression is missing, then an error is signaled which
2580 terminates search-based fontification.
2581
2582 Here are some examples of elements of this kind, and what they do:
2583
2584 @smallexample
2585 ;; @r{Highlight occurrences of either @samp{foo} or @samp{bar}, using}
2586 ;; @r{@code{foo-bar-face}, even if they have already been highlighted.}
2587 ;; @r{@code{foo-bar-face} should be a variable whose value is a face.}
2588 ("foo\\|bar" 0 foo-bar-face t)
2589
2590 ;; @r{Highlight the first subexpression within each occurrence}
2591 ;; @r{that the function @code{fubar-match} finds,}
2592 ;; @r{using the face which is the value of @code{fubar-face}.}
2593 (fubar-match 1 fubar-face)
2594 @end smallexample
2595
2596 @item (@var{matcher} . @var{anchored-highlighter})
2597 In this kind of element, @var{anchored-highlighter} specifies how to
2598 highlight text that follows a match found by @var{matcher}. So a
2599 match found by @var{matcher} acts as the anchor for further searches
2600 specified by @var{anchored-highlighter}. @var{anchored-highlighter}
2601 is a list of the following form:
2602
2603 @example
2604 (@var{anchored-matcher} @var{pre-form} @var{post-form}
2605 @var{subexp-highlighters}@dots{})
2606 @end example
2607
2608 Here, @var{anchored-matcher}, like @var{matcher}, is either a regular
2609 expression or a function. After a match of @var{matcher} is found,
2610 point is at the end of the match. Now, Font Lock evaluates the form
2611 @var{pre-form}. Then it searches for matches of
2612 @var{anchored-matcher} and uses @var{subexp-highlighters} to highlight
2613 these. A @var{subexp-highlighter} is as described above. Finally,
2614 Font Lock evaluates @var{post-form}.
2615
2616 The forms @var{pre-form} and @var{post-form} can be used to initialize
2617 before, and cleanup after, @var{anchored-matcher} is used. Typically,
2618 @var{pre-form} is used to move point to some position relative to the
2619 match of @var{matcher}, before starting with @var{anchored-matcher}.
2620 @var{post-form} might be used to move back, before resuming with
2621 @var{matcher}.
2622
2623 After Font Lock evaluates @var{pre-form}, it does not search for
2624 @var{anchored-matcher} beyond the end of the line. However, if
2625 @var{pre-form} returns a buffer position that is greater than the
2626 position of point after @var{pre-form} is evaluated, then the position
2627 returned by @var{pre-form} is used as the limit of the search instead.
2628 It is generally a bad idea to return a position greater than the end
2629 of the line; in other words, the @var{anchored-matcher} search should
2630 not span lines.
2631
2632 For example,
2633
2634 @smallexample
2635 ;; @r{Highlight occurrences of the word @samp{item} following}
2636 ;; @r{an occurrence of the word @samp{anchor} (on the same line)}
2637 ;; @r{in the value of @code{item-face}.}
2638 ("\\<anchor\\>" "\\<item\\>" nil nil (0 item-face))
2639 @end smallexample
2640
2641 Here, @var{pre-form} and @var{post-form} are @code{nil}. Therefore
2642 searching for @samp{item} starts at the end of the match of
2643 @samp{anchor}, and searching for subsequent instances of @samp{anchor}
2644 resumes from where searching for @samp{item} concluded.
2645
2646 @item (@var{matcher} @var{highlighters}@dots{})
2647 This sort of element specifies several @var{highlighter} lists for a
2648 single @var{matcher}. A @var{highlighter} list can be of the type
2649 @var{subexp-highlighter} or @var{anchored-highlighter} as described
2650 above.
2651
2652 For example,
2653
2654 @smallexample
2655 ;; @r{Highlight occurrences of the word @samp{anchor} in the value}
2656 ;; @r{of @code{anchor-face}, and subsequent occurrences of the word}
2657 ;; @r{@samp{item} (on the same line) in the value of @code{item-face}.}
2658 ("\\<anchor\\>" (0 anchor-face)
2659 ("\\<item\\>" nil nil (0 item-face)))
2660 @end smallexample
2661
2662 @item (eval . @var{form})
2663 Here @var{form} is an expression to be evaluated the first time
2664 this value of @code{font-lock-keywords} is used in a buffer.
2665 Its value should have one of the forms described in this table.
2666 @end table
2667
2668 @strong{Warning:} Do not design an element of @code{font-lock-keywords}
2669 to match text which spans lines; this does not work reliably.
2670 For details, see @xref{Multiline Font Lock}.
2671
2672 You can use @var{case-fold} in @code{font-lock-defaults} to specify
2673 the value of @code{font-lock-keywords-case-fold-search} which says
2674 whether search-based fontification should be case-insensitive.
2675
2676 @defvar font-lock-keywords-case-fold-search
2677 Non-@code{nil} means that regular expression matching for the sake of
2678 @code{font-lock-keywords} should be case-insensitive.
2679 @end defvar
2680
2681 @node Customizing Keywords
2682 @subsection Customizing Search-Based Fontification
2683
2684 You can use @code{font-lock-add-keywords} to add additional
2685 search-based fontification rules to a major mode, and
2686 @code{font-lock-remove-keywords} to remove rules.
2687
2688 @defun font-lock-add-keywords mode keywords &optional how
2689 This function adds highlighting @var{keywords}, for the current buffer
2690 or for major mode @var{mode}. The argument @var{keywords} should be a
2691 list with the same format as the variable @code{font-lock-keywords}.
2692
2693 If @var{mode} is a symbol which is a major mode command name, such as
2694 @code{c-mode}, the effect is that enabling Font Lock mode in
2695 @var{mode} will add @var{keywords} to @code{font-lock-keywords}.
2696 Calling with a non-@code{nil} value of @var{mode} is correct only in
2697 your @file{~/.emacs} file.
2698
2699 If @var{mode} is @code{nil}, this function adds @var{keywords} to
2700 @code{font-lock-keywords} in the current buffer. This way of calling
2701 @code{font-lock-add-keywords} is usually used in mode hook functions.
2702
2703 By default, @var{keywords} are added at the beginning of
2704 @code{font-lock-keywords}. If the optional argument @var{how} is
2705 @code{set}, they are used to replace the value of
2706 @code{font-lock-keywords}. If @var{how} is any other non-@code{nil}
2707 value, they are added at the end of @code{font-lock-keywords}.
2708
2709 Some modes provide specialized support you can use in additional
2710 highlighting patterns. See the variables
2711 @code{c-font-lock-extra-types}, @code{c++-font-lock-extra-types},
2712 and @code{java-font-lock-extra-types}, for example.
2713
2714 @strong{Warning:} major mode functions must not call
2715 @code{font-lock-add-keywords} under any circumstances, either directly
2716 or indirectly, except through their mode hooks. (Doing so would lead
2717 to incorrect behavior for some minor modes.) They should set up their
2718 rules for search-based fontification by setting
2719 @code{font-lock-keywords}.
2720 @end defun
2721
2722 @defun font-lock-remove-keywords mode keywords
2723 This function removes @var{keywords} from @code{font-lock-keywords}
2724 for the current buffer or for major mode @var{mode}. As in
2725 @code{font-lock-add-keywords}, @var{mode} should be a major mode
2726 command name or @code{nil}. All the caveats and requirements for
2727 @code{font-lock-add-keywords} apply here too.
2728 @end defun
2729
2730 For example, this code
2731
2732 @smallexample
2733 (font-lock-add-keywords 'c-mode
2734 '(("\\<\\(FIXME\\):" 1 font-lock-warning-face prepend)
2735 ("\\<\\(and\\|or\\|not\\)\\>" . font-lock-keyword-face)))
2736 @end smallexample
2737
2738 @noindent
2739 adds two fontification patterns for C mode: one to fontify the word
2740 @samp{FIXME}, even in comments, and another to fontify the words
2741 @samp{and}, @samp{or} and @samp{not} as keywords.
2742
2743 @noindent
2744 That example affects only C mode proper. To add the same patterns to
2745 C mode @emph{and} all modes derived from it, do this instead:
2746
2747 @smallexample
2748 (add-hook 'c-mode-hook
2749 (lambda ()
2750 (font-lock-add-keywords nil
2751 '(("\\<\\(FIXME\\):" 1 font-lock-warning-face prepend)
2752 ("\\<\\(and\\|or\\|not\\)\\>" .
2753 font-lock-keyword-face)))))
2754 @end smallexample
2755
2756 @node Other Font Lock Variables
2757 @subsection Other Font Lock Variables
2758
2759 This section describes additional variables that a major mode can
2760 set by means of @var{other-vars} in @code{font-lock-defaults}
2761 (@pxref{Font Lock Basics}).
2762
2763 @defvar font-lock-mark-block-function
2764 If this variable is non-@code{nil}, it should be a function that is
2765 called with no arguments, to choose an enclosing range of text for
2766 refontification for the command @kbd{M-o M-o}
2767 (@code{font-lock-fontify-block}).
2768
2769 The function should report its choice by placing the region around it.
2770 A good choice is a range of text large enough to give proper results,
2771 but not too large so that refontification becomes slow. Typical values
2772 are @code{mark-defun} for programming modes or @code{mark-paragraph} for
2773 textual modes.
2774 @end defvar
2775
2776 @defvar font-lock-extra-managed-props
2777 This variable specifies additional properties (other than
2778 @code{font-lock-face}) that are being managed by Font Lock mode. It
2779 is used by @code{font-lock-default-unfontify-region}, which normally
2780 only manages the @code{font-lock-face} property. If you want Font
2781 Lock to manage other properties as well, you must specify them in a
2782 @var{facespec} in @code{font-lock-keywords} as well as add them to
2783 this list. @xref{Search-based Fontification}.
2784 @end defvar
2785
2786 @defvar font-lock-fontify-buffer-function
2787 Function to use for fontifying the buffer. The default value is
2788 @code{font-lock-default-fontify-buffer}.
2789 @end defvar
2790
2791 @defvar font-lock-unfontify-buffer-function
2792 Function to use for unfontifying the buffer. This is used when
2793 turning off Font Lock mode. The default value is
2794 @code{font-lock-default-unfontify-buffer}.
2795 @end defvar
2796
2797 @defvar font-lock-fontify-region-function
2798 Function to use for fontifying a region. It should take two
2799 arguments, the beginning and end of the region, and an optional third
2800 argument @var{verbose}. If @var{verbose} is non-@code{nil}, the
2801 function should print status messages. The default value is
2802 @code{font-lock-default-fontify-region}.
2803 @end defvar
2804
2805 @defvar font-lock-unfontify-region-function
2806 Function to use for unfontifying a region. It should take two
2807 arguments, the beginning and end of the region. The default value is
2808 @code{font-lock-default-unfontify-region}.
2809 @end defvar
2810
2811 @defun jit-lock-register function &optional contextual
2812 This function tells Font Lock mode to run the Lisp function
2813 @var{function} any time it has to fontify or refontify part of the
2814 current buffer. It calls @var{function} before calling the default
2815 fontification functions, and gives it two arguments, @var{start} and
2816 @var{end}, which specify the region to be fontified or refontified.
2817
2818 The optional argument @var{contextual}, if non-@code{nil}, forces Font
2819 Lock mode to always refontify a syntactically relevant part of the
2820 buffer, and not just the modified lines. This argument can usually be
2821 omitted.
2822 @end defun
2823
2824 @defun jit-lock-unregister function
2825 If @var{function} was previously registered as a fontification
2826 function using @code{jit-lock-register}, this function unregisters it.
2827 @end defun
2828
2829 @node Levels of Font Lock
2830 @subsection Levels of Font Lock
2831
2832 Many major modes offer three different levels of fontification. You
2833 can define multiple levels by using a list of symbols for @var{keywords}
2834 in @code{font-lock-defaults}. Each symbol specifies one level of
2835 fontification; it is up to the user to choose one of these levels,
2836 normally by setting @code{font-lock-maximum-decoration} (@pxref{Font
2837 Lock,,, emacs, the GNU Emacs Manual}). The chosen level's symbol
2838 value is used to initialize @code{font-lock-keywords}.
2839
2840 Here are the conventions for how to define the levels of
2841 fontification:
2842
2843 @itemize @bullet
2844 @item
2845 Level 1: highlight function declarations, file directives (such as include or
2846 import directives), strings and comments. The idea is speed, so only
2847 the most important and top-level components are fontified.
2848
2849 @item
2850 Level 2: in addition to level 1, highlight all language keywords,
2851 including type names that act like keywords, as well as named constant
2852 values. The idea is that all keywords (either syntactic or semantic)
2853 should be fontified appropriately.
2854
2855 @item
2856 Level 3: in addition to level 2, highlight the symbols being defined in
2857 function and variable declarations, and all builtin function names,
2858 wherever they appear.
2859 @end itemize
2860
2861 @node Precalculated Fontification
2862 @subsection Precalculated Fontification
2863
2864 Some major modes such as @code{list-buffers} and @code{occur}
2865 construct the buffer text programmatically. The easiest way for them
2866 to support Font Lock mode is to specify the faces of text when they
2867 insert the text in the buffer.
2868
2869 The way to do this is to specify the faces in the text with the
2870 special text property @code{font-lock-face} (@pxref{Special
2871 Properties}). When Font Lock mode is enabled, this property controls
2872 the display, just like the @code{face} property. When Font Lock mode
2873 is disabled, @code{font-lock-face} has no effect on the display.
2874
2875 It is ok for a mode to use @code{font-lock-face} for some text and
2876 also use the normal Font Lock machinery. But if the mode does not use
2877 the normal Font Lock machinery, it should not set the variable
2878 @code{font-lock-defaults}.
2879
2880 @node Faces for Font Lock
2881 @subsection Faces for Font Lock
2882 @cindex faces for font lock
2883 @cindex font lock faces
2884
2885 You can make Font Lock mode use any face, but several faces are
2886 defined specifically for Font Lock mode. Each of these symbols is both
2887 a face name, and a variable whose default value is the symbol itself.
2888 Thus, the default value of @code{font-lock-comment-face} is
2889 @code{font-lock-comment-face}. This means you can write
2890 @code{font-lock-comment-face} in a context such as
2891 @code{font-lock-keywords} where a face-name-valued expression is used.
2892
2893 @table @code
2894 @item font-lock-comment-face
2895 @vindex font-lock-comment-face
2896 Used (typically) for comments.
2897
2898 @item font-lock-comment-delimiter-face
2899 @vindex font-lock-comment-delimiter-face
2900 Used (typically) for comments delimiters.
2901
2902 @item font-lock-doc-face
2903 @vindex font-lock-doc-face
2904 Used (typically) for documentation strings in the code.
2905
2906 @item font-lock-string-face
2907 @vindex font-lock-string-face
2908 Used (typically) for string constants.
2909
2910 @item font-lock-keyword-face
2911 @vindex font-lock-keyword-face
2912 Used (typically) for keywords---names that have special syntactic
2913 significance, like @code{for} and @code{if} in C.
2914
2915 @item font-lock-builtin-face
2916 @vindex font-lock-builtin-face
2917 Used (typically) for built-in function names.
2918
2919 @item font-lock-function-name-face
2920 @vindex font-lock-function-name-face
2921 Used (typically) for the name of a function being defined or declared,
2922 in a function definition or declaration.
2923
2924 @item font-lock-variable-name-face
2925 @vindex font-lock-variable-name-face
2926 Used (typically) for the name of a variable being defined or declared,
2927 in a variable definition or declaration.
2928
2929 @item font-lock-type-face
2930 @vindex font-lock-type-face
2931 Used (typically) for names of user-defined data types,
2932 where they are defined and where they are used.
2933
2934 @item font-lock-constant-face
2935 @vindex font-lock-constant-face
2936 Used (typically) for constant names.
2937
2938 @item font-lock-preprocessor-face
2939 @vindex font-lock-preprocessor-face
2940 Used (typically) for preprocessor commands.
2941
2942 @item font-lock-negation-char-face
2943 @vindex font-lock-negation-char-face
2944 Used (typically) for easily-overlooked negation characters.
2945
2946 @item font-lock-warning-face
2947 @vindex font-lock-warning-face
2948 Used (typically) for constructs that are peculiar, or that greatly
2949 change the meaning of other text. For example, this is used for
2950 @samp{;;;###autoload} cookies in Emacs Lisp, and for @code{#error}
2951 directives in C.
2952 @end table
2953
2954 @node Syntactic Font Lock
2955 @subsection Syntactic Font Lock
2956 @cindex syntactic font lock
2957
2958 Syntactic fontification uses the syntax table to find comments and
2959 string constants (@pxref{Syntax Tables}). It highlights them using
2960 @code{font-lock-comment-face} and @code{font-lock-string-face}
2961 (@pxref{Faces for Font Lock}), or whatever
2962 @code{font-lock-syntactic-face-function} chooses. There are several
2963 variables that affect syntactic fontification; you should set them by
2964 means of @code{font-lock-defaults} (@pxref{Font Lock Basics}).
2965
2966 @defvar font-lock-keywords-only
2967 Non-@code{nil} means Font Lock should not do syntactic fontification;
2968 it should only fontify based on @code{font-lock-keywords}. The normal
2969 way for a mode to set this variable to @code{t} is with
2970 @var{keywords-only} in @code{font-lock-defaults}.
2971 @end defvar
2972
2973 @defvar font-lock-syntax-table
2974 This variable holds the syntax table to use for fontification of
2975 comments and strings. Specify it using @var{syntax-alist} in
2976 @code{font-lock-defaults}. If this is @code{nil}, fontification uses
2977 the buffer's syntax table.
2978 @end defvar
2979
2980 @defvar font-lock-beginning-of-syntax-function
2981 If this variable is non-@code{nil}, it should be a function to move
2982 point back to a position that is syntactically at ``top level'' and
2983 outside of strings or comments. Font Lock uses this when necessary
2984 to get the right results for syntactic fontification.
2985
2986 This function is called with no arguments. It should leave point at
2987 the beginning of any enclosing syntactic block. Typical values are
2988 @code{beginning-of-line} (used when the start of the line is known to
2989 be outside a syntactic block), or @code{beginning-of-defun} for
2990 programming modes, or @code{backward-paragraph} for textual modes.
2991
2992 If the value is @code{nil}, Font Lock uses
2993 @code{syntax-begin-function} to move back outside of any comment,
2994 string, or sexp. This variable is semi-obsolete; we recommend setting
2995 @code{syntax-begin-function} instead.
2996
2997 Specify this variable using @var{syntax-begin} in
2998 @code{font-lock-defaults}.
2999 @end defvar
3000
3001 @defvar font-lock-syntactic-face-function
3002 A function to determine which face to use for a given syntactic
3003 element (a string or a comment). The function is called with one
3004 argument, the parse state at point returned by
3005 @code{parse-partial-sexp}, and should return a face. The default
3006 value returns @code{font-lock-comment-face} for comments and
3007 @code{font-lock-string-face} for strings.
3008
3009 This can be used to highlighting different kinds of strings or
3010 comments differently. It is also sometimes abused together with
3011 @code{font-lock-syntactic-keywords} to highlight constructs that span
3012 multiple lines, but this is too esoteric to document here.
3013
3014 Specify this variable using @var{other-vars} in
3015 @code{font-lock-defaults}.
3016 @end defvar
3017
3018 @node Setting Syntax Properties
3019 @subsection Setting Syntax Properties
3020
3021 Font Lock mode can be used to update @code{syntax-table} properties
3022 automatically (@pxref{Syntax Properties}). This is useful in
3023 languages for which a single syntax table by itself is not sufficient.
3024
3025 @defvar font-lock-syntactic-keywords
3026 This variable enables and controls updating @code{syntax-table}
3027 properties by Font Lock. Its value should be a list of elements of
3028 this form:
3029
3030 @example
3031 (@var{matcher} @var{subexp} @var{syntax} @var{override} @var{laxmatch})
3032 @end example
3033
3034 The parts of this element have the same meanings as in the corresponding
3035 sort of element of @code{font-lock-keywords},
3036
3037 @example
3038 (@var{matcher} @var{subexp} @var{facespec} @var{override} @var{laxmatch})
3039 @end example
3040
3041 However, instead of specifying the value @var{facespec} to use for the
3042 @code{face} property, it specifies the value @var{syntax} to use for
3043 the @code{syntax-table} property. Here, @var{syntax} can be a string
3044 (as taken by @code{modify-syntax-entry}), a syntax table, a cons cell
3045 (as returned by @code{string-to-syntax}), or an expression whose value
3046 is one of those two types. @var{override} cannot be @code{prepend} or
3047 @code{append}.
3048
3049 For example, an element of the form:
3050
3051 @example
3052 ("\\$\\(#\\)" 1 ".")
3053 @end example
3054
3055 highlights syntactically a hash character when following a dollar
3056 character, with a SYNTAX of @code{"."} (meaning punctuation syntax).
3057 Assuming that the buffer syntax table specifies hash characters to
3058 have comment start syntax, the element will only highlight hash
3059 characters that do not follow dollar characters as comments
3060 syntactically.
3061
3062 An element of the form:
3063
3064 @example
3065 ("\\('\\).\\('\\)"
3066 (1 "\"")
3067 (2 "\""))
3068 @end example
3069
3070 highlights syntactically both single quotes which surround a single
3071 character, with a SYNTAX of @code{"\""} (meaning string quote syntax).
3072 Assuming that the buffer syntax table does not specify single quotes
3073 to have quote syntax, the element will only highlight single quotes of
3074 the form @samp{'@var{c}'} as strings syntactically. Other forms, such
3075 as @samp{foo'bar} or @samp{'fubar'}, will not be highlighted as
3076 strings.
3077
3078 Major modes normally set this variable with @var{other-vars} in
3079 @code{font-lock-defaults}.
3080 @end defvar
3081
3082 @node Multiline Font Lock
3083 @subsection Multiline Font Lock Constructs
3084 @cindex multiline font lock
3085
3086 Normally, elements of @code{font-lock-keywords} should not match
3087 across multiple lines; that doesn't work reliably, because Font Lock
3088 usually scans just part of the buffer, and it can miss a multi-line
3089 construct that crosses the line boundary where the scan starts. (The
3090 scan normally starts at the beginning of a line.)
3091
3092 Making elements that match multiline constructs work properly has
3093 two aspects: correct @emph{identification} and correct
3094 @emph{rehighlighting}. The first means that Font Lock finds all
3095 multiline constructs. The second means that Font Lock will correctly
3096 rehighlight all the relevant text when a multiline construct is
3097 changed---for example, if some of the text that was previously part of
3098 a multiline construct ceases to be part of it. The two aspects are
3099 closely related, and often getting one of them to work will appear to
3100 make the other also work. However, for reliable results you must
3101 attend explicitly to both aspects.
3102
3103 There are three ways to ensure correct identification of multiline
3104 constructs:
3105
3106 @itemize
3107 @item
3108 Add a function to @code{font-lock-extend-region-functions} that does
3109 the @emph{identification} and extends the scan so that the scanned
3110 text never starts or ends in the middle of a multiline construct.
3111 @item
3112 Use the @code{font-lock-fontify-region-function} hook similarly to
3113 extend the scan so that the scanned text never starts or ends in the
3114 middle of a multiline construct.
3115 @item
3116 Somehow identify the multiline construct right when it gets inserted
3117 into the buffer (or at any point after that but before font-lock
3118 tries to highlight it), and mark it with a @code{font-lock-multiline}
3119 which will instruct font-lock not to start or end the scan in the
3120 middle of the construct.
3121 @end itemize
3122
3123 There are three ways to do rehighlighting of multiline constructs:
3124
3125 @itemize
3126 @item
3127 Place a @code{font-lock-multiline} property on the construct. This
3128 will rehighlight the whole construct if any part of it is changed. In
3129 some cases you can do this automatically by setting the
3130 @code{font-lock-multiline} variable, which see.
3131 @item
3132 Make sure @code{jit-lock-contextually} is set and rely on it doing its
3133 job. This will only rehighlight the part of the construct that
3134 follows the actual change, and will do it after a short delay.
3135 This only works if the highlighting of the various parts of your
3136 multiline construct never depends on text in subsequent lines.
3137 Since @code{jit-lock-contextually} is activated by default, this can
3138 be an attractive solution.
3139 @item
3140 Place a @code{jit-lock-defer-multiline} property on the construct.
3141 This works only if @code{jit-lock-contextually} is used, and with the
3142 same delay before rehighlighting, but like @code{font-lock-multiline},
3143 it also handles the case where highlighting depends on
3144 subsequent lines.
3145 @end itemize
3146
3147 @menu
3148 * Font Lock Multiline:: Marking multiline chunks with a text property.
3149 * Region to Refontify:: Controlling which region gets refontified
3150 after a buffer change.
3151 @end menu
3152
3153 @node Font Lock Multiline
3154 @subsubsection Font Lock Multiline
3155
3156 One way to ensure reliable rehighlighting of multiline Font Lock
3157 constructs is to put on them the text property @code{font-lock-multiline}.
3158 It should be present and non-@code{nil} for text that is part of a
3159 multiline construct.
3160
3161 When Font Lock is about to highlight a range of text, it first
3162 extends the boundaries of the range as necessary so that they do not
3163 fall within text marked with the @code{font-lock-multiline} property.
3164 Then it removes any @code{font-lock-multiline} properties from the
3165 range, and highlights it. The highlighting specification (mostly
3166 @code{font-lock-keywords}) must reinstall this property each time,
3167 whenever it is appropriate.
3168
3169 @strong{Warning:} don't use the @code{font-lock-multiline} property
3170 on large ranges of text, because that will make rehighlighting slow.
3171
3172 @defvar font-lock-multiline
3173 If the @code{font-lock-multiline} variable is set to @code{t}, Font
3174 Lock will try to add the @code{font-lock-multiline} property
3175 automatically on multiline constructs. This is not a universal
3176 solution, however, since it slows down Font Lock somewhat. It can
3177 miss some multiline constructs, or make the property larger or smaller
3178 than necessary.
3179
3180 For elements whose @var{matcher} is a function, the function should
3181 ensure that submatch 0 covers the whole relevant multiline construct,
3182 even if only a small subpart will be highlighted. It is often just as
3183 easy to add the @code{font-lock-multiline} property by hand.
3184 @end defvar
3185
3186 The @code{font-lock-multiline} property is meant to ensure proper
3187 refontification; it does not automatically identify new multiline
3188 constructs. Identifying the requires that Font-Lock operate on large
3189 enough chunks at a time. This will happen by accident on many cases,
3190 which may give the impression that multiline constructs magically work.
3191 If you set the @code{font-lock-multiline} variable non-@code{nil},
3192 this impression will be even stronger, since the highlighting of those
3193 constructs which are found will be properly updated from then on.
3194 But that does not work reliably.
3195
3196 To find multiline constructs reliably, you must either manually
3197 place the @code{font-lock-multiline} property on the text before
3198 Font-Lock looks at it, or use
3199 @code{font-lock-fontify-region-function}.
3200
3201 @node Region to Refontify
3202 @subsubsection Region to Fontify after a Buffer Change
3203
3204 When a buffer is changed, the region that Font Lock refontifies is
3205 by default the smallest sequence of whole lines that spans the change.
3206 While this works well most of the time, sometimes it doesn't---for
3207 example, when a change alters the syntactic meaning of text on an
3208 earlier line.
3209
3210 You can enlarge (or even reduce) the region to refontify by setting
3211 the following variable:
3212
3213 @defvar font-lock-extend-after-change-region-function
3214 This buffer-local variable is either @code{nil} or a function for
3215 Font-Lock to call to determine the region to scan and fontify.
3216
3217 The function is given three parameters, the standard @var{beg},
3218 @var{end}, and @var{old-len} from @code{after-change-functions}
3219 (@pxref{Change Hooks}). It should return either a cons of the
3220 beginning and end buffer positions (in that order) of the region to
3221 fontify, or @code{nil} (which means choose the region in the standard
3222 way). This function needs to preserve point, the match-data, and the
3223 current restriction. The region it returns may start or end in the
3224 middle of a line.
3225
3226 Since this function is called after every buffer change, it should be
3227 reasonably fast.
3228 @end defvar
3229
3230 @node Auto-Indentation
3231 @section Auto-indention of code
3232
3233 For programming languages, an important feature of a major mode is to
3234 provide automatic indentation. This is controlled in Emacs by
3235 @code{indent-line-function} (@pxref{Mode-Specific Indent}).
3236 Writing a good indentation function can be difficult and to a large
3237 extent it is still a black art.
3238
3239 Many major mode authors will start by writing a simple indentation
3240 function that works for simple cases, for example by comparing with the
3241 indentation of the previous text line. For most programming languages
3242 that are not really line-based, this tends to scale very poorly:
3243 improving such a function to let it handle more diverse situations tends
3244 to become more and more difficult, resulting in the end with a large,
3245 complex, unmaintainable indentation function which nobody dares to touch.
3246
3247 A good indentation function will usually need to actually parse the
3248 text, according to the syntax of the language. Luckily, it is not
3249 necessary to parse the text in as much detail as would be needed
3250 for a compiler, but on the other hand, the parser embedded in the
3251 indentation code will want to be somewhat friendly to syntactically
3252 incorrect code.
3253
3254 Good maintainable indentation functions usually fall into 2 categories:
3255 either parsing forward from some ``safe'' starting point until the
3256 position of interest, or parsing backward from the position of interest.
3257 Neither of the two is a clearly better choice than the other: parsing
3258 backward is often more difficult than parsing forward because
3259 programming languages are designed to be parsed forward, but for the
3260 purpose of indentation it has the advantage of not needing to
3261 guess a ``safe'' starting point, and it generally enjoys the property
3262 that only a minimum of text will be analyzed to decide the indentation
3263 of a line, so indentation will tend to be unaffected by syntax errors in
3264 some earlier unrelated piece of code. Parsing forward on the other hand
3265 is usually easier and has the advantage of making it possible to
3266 reindent efficiently a whole region at a time, with a single parse.
3267
3268 Rather than write your own indentation function from scratch, it is
3269 often preferable to try and reuse some existing ones or to rely
3270 on a generic indentation engine. There are sadly few such
3271 engines. The CC-mode indentation code (used with C, C++, Java, Awk
3272 and a few other such modes) has been made more generic over the years,
3273 so if your language seems somewhat similar to one of those languages,
3274 you might try to use that engine. @c FIXME: documentation?
3275 Another one is SMIE which takes an approach in the spirit
3276 of Lisp sexps and adapts it to non-Lisp languages.
3277
3278 @menu
3279 * SMIE:: A simple minded indentation engine
3280 @end menu
3281
3282 @node SMIE
3283 @subsection Simple Minded Indentation Engine
3284
3285 SMIE is a package that provides a generic navigation and indentation
3286 engine. Based on a very simple parser using an ``operator precedence
3287 grammar'', it lets major modes extend the sexp-based navigation of Lisp
3288 to non-Lisp languages as well as provide a simple to use but reliable
3289 auto-indentation.
3290
3291 Operator precedence grammar is a very primitive technology for parsing
3292 compared to some of the more common techniques used in compilers.
3293 It has the following characteristics: its parsing power is very limited,
3294 and it is largely unable to detect syntax errors, but it has the
3295 advantage of being algorithmically efficient and able to parse forward
3296 just as well as backward. In practice that means that SMIE can use it
3297 for indentation based on backward parsing, that it can provide both
3298 @code{forward-sexp} and @code{backward-sexp} functionality, and that it
3299 will naturally work on syntactically incorrect code without any extra
3300 effort. The downside is that it also means that most programming
3301 languages cannot be parsed correctly using SMIE, at least not without
3302 resorting to some special tricks (@pxref{SMIE Tricks}).
3303
3304 @menu
3305 * SMIE setup:: SMIE setup and features
3306 * Operator Precedence Grammars:: A very simple parsing technique
3307 * SMIE Grammar:: Defining the grammar of a language
3308 * SMIE Lexer:: Defining tokens
3309 * SMIE Tricks:: Working around the parser's limitations
3310 * SMIE Indentation:: Specifying indentation rules
3311 * SMIE Indentation Helpers:: Helper functions for indentation rules
3312 * SMIE Indentation Example:: Sample indentation rules
3313 @end menu
3314
3315 @node SMIE setup
3316 @subsubsection SMIE Setup and Features
3317
3318 SMIE is meant to be a one-stop shop for structural navigation and
3319 various other features which rely on the syntactic structure of code, in
3320 particular automatic indentation. The main entry point is
3321 @code{smie-setup} which is a function typically called while setting
3322 up a major mode.
3323
3324 @defun smie-setup grammar rules-function &rest keywords
3325 Setup SMIE navigation and indentation.
3326 @var{grammar} is a grammar table generated by @code{smie-prec2->grammar}.
3327 @var{rules-function} is a set of indentation rules for use on
3328 @code{smie-rules-function}.
3329 @var{keywords} are additional arguments, which can include the following
3330 keywords:
3331 @itemize
3332 @item
3333 @code{:forward-token} @var{fun}: Specify the forward lexer to use.
3334 @item
3335 @code{:backward-token} @var{fun}: Specify the backward lexer to use.
3336 @end itemize
3337 @end defun
3338
3339 Calling this function is sufficient to make commands such as
3340 @code{forward-sexp}, @code{backward-sexp}, and @code{transpose-sexps} be
3341 able to properly handle structural elements other than just the paired
3342 parentheses already handled by syntax tables. For example, if the
3343 provided grammar is precise enough, @code{transpose-sexps} can correctly
3344 transpose the two arguments of a @code{+} operator, taking into account
3345 the precedence rules of the language.
3346
3347 Calling `smie-setup' is also sufficient to make TAB indentation work in
3348 the expected way, and provides some commands that you can bind in the
3349 major mode keymap.
3350
3351 @deffn Command smie-close-block
3352 This command closes the most recently opened (and not yet closed) block.
3353 @end deffn
3354
3355 @deffn Command smie-down-list &optional arg
3356 This command is like @code{down-list} but it also pays attention to
3357 nesting of tokens other than parentheses, such as @code{begin...end}.
3358 @end deffn
3359
3360 @node Operator Precedence Grammars
3361 @subsubsection Operator Precedence Grammars
3362
3363 SMIE's precedence grammars simply give to each token a pair of
3364 precedences: the left-precedence and the right-precedence. We say
3365 @code{T1 < T2} if the right-precedence of token @code{T1} is less than
3366 the left-precedence of token @code{T2}. A good way to read this
3367 @code{<} is as a kind of parenthesis: if we find @code{... T1 something
3368 T2 ...} then that should be parsed as @code{... T1 (something T2 ...}
3369 rather than as @code{... T1 something) T2 ...}. The latter
3370 interpretation would be the case if we had @code{T1 > T2}. If we have
3371 @code{T1 = T2}, it means that token T2 follows token T1 in the same
3372 syntactic construction, so typically we have @code{"begin" = "end"}.
3373 Such pairs of precedences are sufficient to express left-associativity
3374 or right-associativity of infix operators, nesting of tokens like
3375 parentheses and many other cases.
3376
3377 @c ¡Let's leave this undocumented to leave it more open for change!
3378 @c @defvar smie-grammar
3379 @c The value of this variable is an alist specifying the left and right
3380 @c precedence of each token. It is meant to be initialized by using one of
3381 @c the functions below.
3382 @c @end defvar
3383
3384 @defun smie-prec2->grammar table
3385 This function takes a @emph{prec2} grammar @var{table} and returns an
3386 alist suitable for use in @code{smie-setup}. The @emph{prec2}
3387 @var{table} is itself meant to be built by one of the functions below.
3388 @end defun
3389
3390 @defun smie-merge-prec2s &rest tables
3391 This function takes several @emph{prec2} @var{tables} and merges them
3392 into a new @emph{prec2} table.
3393 @end defun
3394
3395 @defun smie-precs->prec2 precs
3396 This function builds a @emph{prec2} table from a table of precedences
3397 @var{precs}. @var{precs} should be a list, sorted by precedence (for
3398 example @code{"+"} will come before @code{"*"}), of elements of the form
3399 @code{(@var{assoc} @var{op} ...)}, where each @var{op} is a token that
3400 acts as an operator; @var{assoc} is their associativity, which can be
3401 either @code{left}, @code{right}, @code{assoc}, or @code{nonassoc}.
3402 All operators in a given element share the same precedence level
3403 and associativity.
3404 @end defun
3405
3406 @defun smie-bnf->prec2 bnf &rest resolvers
3407 This function lets you specify the grammar using a BNF notation.
3408 It accepts a @var{bnf} description of the grammar along with a set of
3409 conflict resolution rules @var{resolvers}, and
3410 returns a @emph{prec2} table.
3411
3412 @var{bnf} is a list of nonterminal definitions of the form
3413 @code{(@var{nonterm} @var{rhs1} @var{rhs2} ...)} where each @var{rhs}
3414 is a (non-empty) list of terminals (aka tokens) or non-terminals.
3415
3416 Not all grammars are accepted:
3417 @itemize
3418 @item
3419 An @var{rhs} cannot be an empty list (an empty list is never needed,
3420 since SMIE allows all non-terminals to match the empty string anyway).
3421 @item
3422 An @var{rhs} cannot have 2 consecutive non-terminals: each pair of
3423 non-terminals needs to be separated by a terminal (aka token).
3424 This is a fundamental limitation of operator precedence grammars.
3425 @end itemize
3426
3427 Additionally, conflicts can occur:
3428 @itemize
3429 @item
3430 The returned @emph{prec2} table holds constraints between pairs of tokens, and
3431 for any given pair only one constraint can be present: T1 < T2,
3432 T1 = T2, or T1 > T2.
3433 @item
3434 A token can be an @code{opener} (something similar to an open-paren),
3435 a @code{closer} (like a close-paren), or @code{neither} of the two
3436 (e.g. an infix operator, or an inner token like @code{"else"}).
3437 @end itemize
3438
3439 Precedence conflicts can be resolved via @var{resolvers}, which
3440 is a list of @emph{precs} tables (see @code{smie-precs->prec2}): for
3441 each precedence conflict, if those @code{precs} tables
3442 specify a particular constraint, then the conflict is resolved by using
3443 this constraint instead, else a conflict is reported and one of the
3444 conflicting constraints is picked arbitrarily and the others are
3445 simply ignored.
3446 @end defun
3447
3448 @node SMIE Grammar
3449 @subsubsection Defining the Grammar of a Language
3450
3451 The usual way to define the SMIE grammar of a language is by
3452 defining a new global variable that holds the precedence table by
3453 giving a set of BNF rules.
3454 For example, the grammar definition for a small Pascal-like language
3455 could look like:
3456 @example
3457 @group
3458 (require 'smie)
3459 (defvar sample-smie-grammar
3460 (smie-prec2->grammar
3461 (smie-bnf->prec2
3462 @end group
3463 @group
3464 '((id)
3465 (inst ("begin" insts "end")
3466 ("if" exp "then" inst "else" inst)
3467 (id ":=" exp)
3468 (exp))
3469 (insts (insts ";" insts) (inst))
3470 (exp (exp "+" exp)
3471 (exp "*" exp)
3472 ("(" exps ")"))
3473 (exps (exps "," exps) (exp)))
3474 @end group
3475 @group
3476 '((assoc ";"))
3477 '((assoc ","))
3478 '((assoc "+") (assoc "*")))))
3479 @end group
3480 @end example
3481
3482 @noindent
3483 A few things to note:
3484
3485 @itemize
3486 @item
3487 The above grammar does not explicitly mention the syntax of function
3488 calls: SMIE will automatically allow any sequence of sexps, such as
3489 identifiers, balanced parentheses, or @code{begin ... end} blocks
3490 to appear anywhere anyway.
3491 @item
3492 The grammar category @code{id} has no right hand side: this does not
3493 mean that it can match only the empty string, since as mentioned any
3494 sequence of sexps can appear anywhere anyway.
3495 @item
3496 Because non terminals cannot appear consecutively in the BNF grammar, it
3497 is difficult to correctly handle tokens that act as terminators, so the
3498 above grammar treats @code{";"} as a statement @emph{separator} instead,
3499 which SMIE can handle very well.
3500 @item
3501 Separators used in sequences (such as @code{","} and @code{";"} above)
3502 are best defined with BNF rules such as @code{(foo (foo "separator" foo) ...)}
3503 which generate precedence conflicts which are then resolved by giving
3504 them an explicit @code{(assoc "separator")}.
3505 @item
3506 The @code{("(" exps ")")} rule was not needed to pair up parens, since
3507 SMIE will pair up any characters that are marked as having paren syntax
3508 in the syntax table. What this rule does instead (together with the
3509 definition of @code{exps}) is to make it clear that @code{","} should
3510 not appear outside of parentheses.
3511 @item
3512 Rather than have a single @emph{precs} table to resolve conflicts, it is
3513 preferable to have several tables, so as to let the BNF part of the
3514 grammar specify relative precedences where possible.
3515 @item
3516 Unless there is a very good reason to prefer @code{left} or
3517 @code{right}, it is usually preferable to mark operators as associative,
3518 using @code{assoc}. For that reason @code{"+"} and @code{"*"} are
3519 defined above as @code{assoc}, although the language defines them
3520 formally as left associative.
3521 @end itemize
3522
3523 @node SMIE Lexer
3524 @subsubsection Defining Tokens
3525
3526 SMIE comes with a predefined lexical analyzer which uses syntax tables
3527 in the following way: any sequence of characters that have word or
3528 symbol syntax is considered a token, and so is any sequence of
3529 characters that have punctuation syntax. This default lexer is
3530 often a good starting point but is rarely actually correct for any given
3531 language. For example, it will consider @code{"2,+3"} to be composed
3532 of 3 tokens: @code{"2"}, @code{",+"}, and @code{"3"}.
3533
3534 To describe the lexing rules of your language to SMIE, you need
3535 2 functions, one to fetch the next token, and another to fetch the
3536 previous token. Those functions will usually first skip whitespace and
3537 comments and then look at the next chunk of text to see if it
3538 is a special token. If so it should skip the token and
3539 return a description of this token. Usually this is simply the string
3540 extracted from the buffer, but it can be anything you want.
3541 For example:
3542 @example
3543 @group
3544 (defvar sample-keywords-regexp
3545 (regexp-opt '("+" "*" "," ";" ">" ">=" "<" "<=" ":=" "=")))
3546 @end group
3547 @group
3548 (defun sample-smie-forward-token ()
3549 (forward-comment (point-max))
3550 (cond
3551 ((looking-at sample-keywords-regexp)
3552 (goto-char (match-end 0))
3553 (match-string-no-properties 0))
3554 (t (buffer-substring-no-properties
3555 (point)
3556 (progn (skip-syntax-forward "w_")
3557 (point))))))
3558 @end group
3559 @group
3560 (defun sample-smie-backward-token ()
3561 (forward-comment (- (point)))
3562 (cond
3563 ((looking-back sample-keywords-regexp (- (point) 2) t)
3564 (goto-char (match-beginning 0))
3565 (match-string-no-properties 0))
3566 (t (buffer-substring-no-properties
3567 (point)
3568 (progn (skip-syntax-backward "w_")
3569 (point))))))
3570 @end group
3571 @end example
3572
3573 Notice how those lexers return the empty string when in front of
3574 parentheses. This is because SMIE automatically takes care of the
3575 parentheses defined in the syntax table. More specifically if the lexer
3576 returns nil or an empty string, SMIE tries to handle the corresponding
3577 text as a sexp according to syntax tables.
3578
3579 @node SMIE Tricks
3580 @subsubsection Living With a Weak Parser
3581
3582 The parsing technique used by SMIE does not allow tokens to behave
3583 differently in different contexts. For most programming languages, this
3584 manifests itself by precedence conflicts when converting the
3585 BNF grammar.
3586
3587 Sometimes, those conflicts can be worked around by expressing the
3588 grammar slightly differently. For example, for Modula-2 it might seem
3589 natural to have a BNF grammar that looks like this:
3590
3591 @example
3592 ...
3593 (inst ("IF" exp "THEN" insts "ELSE" insts "END")
3594 ("CASE" exp "OF" cases "END")
3595 ...)
3596 (cases (cases "|" cases)
3597 (caselabel ":" insts)
3598 ("ELSE" insts))
3599 ...
3600 @end example
3601
3602 But this will create conflicts for @code{"ELSE"}: on the one hand, the
3603 IF rule implies (among many other things) that @code{"ELSE" = "END"};
3604 but on the other hand, since @code{"ELSE"} appears within @code{cases},
3605 which appears left of @code{"END"}, we also have @code{"ELSE" > "END"}.
3606 We can solve the conflict either by using:
3607 @example
3608 ...
3609 (inst ("IF" exp "THEN" insts "ELSE" insts "END")
3610 ("CASE" exp "OF" cases "END")
3611 ("CASE" exp "OF" cases "ELSE" insts "END")
3612 ...)
3613 (cases (cases "|" cases) (caselabel ":" insts))
3614 ...
3615 @end example
3616 or
3617 @example
3618 ...
3619 (inst ("IF" exp "THEN" else "END")
3620 ("CASE" exp "OF" cases "END")
3621 ...)
3622 (else (insts "ELSE" insts))
3623 (cases (cases "|" cases) (caselabel ":" insts) (else))
3624 ...
3625 @end example
3626
3627 Reworking the grammar to try and solve conflicts has its downsides, tho,
3628 because SMIE assumes that the grammar reflects the logical structure of
3629 the code, so it is preferable to keep the BNF closer to the intended
3630 abstract syntax tree.
3631
3632 Other times, after careful consideration you may conclude that those
3633 conflicts are not serious and simply resolve them via the
3634 @var{resolvers} argument of @code{smie-bnf->prec2}. Usually this is
3635 because the grammar is simply ambiguous: the conflict does not affect
3636 the set of programs described by the grammar, but only the way those
3637 programs are parsed. This is typically the case for separators and
3638 associative infix operators, where you want to add a resolver like
3639 @code{'((assoc "|"))}. Another case where this can happen is for the
3640 classic @emph{dangling else} problem, where you will use @code{'((assoc
3641 "else" "then"))}. It can also happen for cases where the conflict is
3642 real and cannot really be resolved, but it is unlikely to pose a problem
3643 in practice.
3644
3645 Finally, in many cases some conflicts will remain despite all efforts to
3646 restructure the grammar. Do not despair: while the parser cannot be
3647 made more clever, you can make the lexer as smart as you want. So, the
3648 solution is then to look at the tokens involved in the conflict and to
3649 split one of those tokens into 2 (or more) different tokens. E.g. if
3650 the grammar needs to distinguish between two incompatible uses of the
3651 token @code{"begin"}, make the lexer return different tokens (say
3652 @code{"begin-fun"} and @code{"begin-plain"}) depending on which kind of
3653 @code{"begin"} it finds. This pushes the work of distinguishing the
3654 different cases to the lexer, which will thus have to look at the
3655 surrounding text to find ad-hoc clues.
3656
3657 @node SMIE Indentation
3658 @subsubsection Specifying Indentation Rules
3659
3660 Based on the provided grammar, SMIE will be able to provide automatic
3661 indentation without any extra effort. But in practice, this default
3662 indentation style will probably not be good enough. You will want to
3663 tweak it in many different cases.
3664
3665 SMIE indentation is based on the idea that indentation rules should be
3666 as local as possible. To this end, it relies on the idea of
3667 @emph{virtual} indentation, which is the indentation that a particular
3668 program point would have if it were at the beginning of a line.
3669 Of course, if that program point is indeed at the beginning of a line,
3670 its virtual indentation is its current indentation. But if not, then
3671 SMIE uses the indentation algorithm to compute the virtual indentation
3672 of that point. Now in practice, the virtual indentation of a program
3673 point does not have to be identical to the indentation it would have if
3674 we inserted a newline before it. To see how this works, the SMIE rule
3675 for indentation after a @code{@{} in C does not care whether the
3676 @code{@{} is standing on a line of its own or is at the end of the
3677 preceding line. Instead, these different cases are handled in the
3678 indentation rule that decides how to indent before a @code{@{}.
3679
3680 Another important concept is the notion of @emph{parent}: The
3681 @emph{parent} of a token, is the head token of the nearest enclosing
3682 syntactic construct. For example, the parent of an @code{else} is the
3683 @code{if} to which it belongs, and the parent of an @code{if}, in turn,
3684 is the lead token of the surrounding construct. The command
3685 @code{backward-sexp} jumps from a token to its parent, but there are
3686 some caveats: for @emph{openers} (tokens which start a construct, like
3687 @code{if}), you need to start with point before the token, while for
3688 others you need to start with point after the token.
3689 @code{backward-sexp} stops with point before the parent token if that is
3690 the @emph{opener} of the token of interest, and otherwise it stops with
3691 point after the parent token.
3692
3693 SMIE indentation rules are specified using a function that takes two
3694 arguments @var{method} and @var{arg} where the meaning of @var{arg} and the
3695 expected return value depend on @var{method}.
3696
3697 @var{method} can be:
3698 @itemize
3699 @item
3700 @code{:after}, in which case @var{arg} is a token and the function
3701 should return the @var{offset} to use for indentation after @var{arg}.
3702 @item
3703 @code{:before}, in which case @var{arg} is a token and the function
3704 should return the @var{offset} to use to indent @var{arg} itself.
3705 @item
3706 @code{:elem}, in which case the function should return either the offset
3707 to use to indent function arguments (if @var{arg} is the symbol
3708 @code{arg}) or the basic indentation step (if @var{arg} is the symbol
3709 @code{basic}).
3710 @item
3711 @code{:list-intro}, in which case @var{arg} is a token and the function
3712 should return non-@code{nil} if the token is followed by a list of
3713 expressions (not separated by any token) rather than an expression.
3714 @end itemize
3715
3716 When @var{arg} is a token, the function is called with point just before
3717 that token. A return value of nil always means to fallback on the
3718 default behavior, so the function should return nil for arguments it
3719 does not expect.
3720
3721 @var{offset} can be:
3722 @itemize
3723 @item
3724 @code{nil}: use the default indentation rule.
3725 @item
3726 @code{(column . @var{column})}: indent to column @var{column}.
3727 @item
3728 @var{number}: offset by @var{number}, relative to a base token which is
3729 the current token for @code{:after} and its parent for @code{:before}.
3730 @end itemize
3731
3732 @node SMIE Indentation Helpers
3733 @subsubsection Helper Functions for Indentation Rules
3734
3735 SMIE provides various functions designed specifically for use in the
3736 indentation rules function (several of those functions break if used in
3737 another context). These functions all start with the prefix
3738 @code{smie-rule-}.
3739
3740 @defun smie-rule-bolp
3741 Return non-@code{nil} if the current token is the first on the line.
3742 @end defun
3743
3744 @defun smie-rule-hanging-p
3745 Return non-@code{nil} if the current token is @emph{hanging}.
3746 A token is @emph{hanging} if it is the last token on the line
3747 and if it is preceded by other tokens: a lone token on a line is not
3748 hanging.
3749 @end defun
3750
3751 @defun smie-rule-next-p &rest tokens
3752 Return non-@code{nil} if the next token is among @var{tokens}.
3753 @end defun
3754
3755 @defun smie-rule-prev-p &rest tokens
3756 Return non-@code{nil} if the previous token is among @var{tokens}.
3757 @end defun
3758
3759 @defun smie-rule-parent-p &rest parents
3760 Return non-@code{nil} if the current token's parent is among @var{parents}.
3761 @end defun
3762
3763 @defun smie-rule-sibling-p
3764 Return non-nil if the current token's parent is actually a sibling.
3765 This is the case for example when the parent of a @code{","} is just the
3766 previous @code{","}.
3767 @end defun
3768
3769 @defun smie-rule-parent &optional offset
3770 Return the proper offset to align the current token with the parent.
3771 If non-@code{nil}, @var{offset} should be an integer giving an
3772 additional offset to apply.
3773 @end defun
3774
3775 @defun smie-rule-separator method
3776 Indent current token as a @emph{separator}.
3777
3778 By @emph{separator}, we mean here a token whose sole purpose is to
3779 separate various elements within some enclosing syntactic construct, and
3780 which does not have any semantic significance in itself (i.e. it would
3781 typically not exist as a node in an abstract syntax tree).
3782
3783 Such a token is expected to have an associative syntax and be closely
3784 tied to its syntactic parent. Typical examples are @code{","} in lists
3785 of arguments (enclosed inside parentheses), or @code{";"} in sequences
3786 of instructions (enclosed in a @code{@{...@}} or @code{begin...end}
3787 block).
3788
3789 @var{method} should be the method name that was passed to
3790 `smie-rules-function'.
3791 @end defun
3792
3793 @node SMIE Indentation Example
3794 @subsubsection Sample Indentation Rules
3795
3796 Here is an example of an indentation function:
3797
3798 @example
3799 ;; For the `case' macro.
3800 (eval-when-compile (require 'cl))
3801
3802 (defun sample-smie-rules (kind token)
3803 (case kind
3804 (:elem (case token
3805 (basic sample-indent-basic)))
3806 (:after
3807 (cond
3808 ((equal token ",") (smie-rule-separator kind))
3809 ((equal token ":=") sample-indent-basic)))
3810 (:before
3811 (cond
3812 ((equal token ",") (smie-rule-separator kind))
3813 ((member token '("begin" "(" "@{"))
3814 (if (smie-rule-hanging-p) (smie-rule-parent)))
3815 ((equal token "if")
3816 (and (not (smie-rule-bolp)) (smie-rule-prev-p "else")
3817 (smie-rule-parent)))))))
3818 @end example
3819
3820 @noindent
3821 A few things to note:
3822
3823 @itemize
3824 @item
3825 The first case indicates the basic indentation increment to use.
3826 If @code{sample-indent-basic} is nil, then SMIE uses the global
3827 setting @code{smie-indent-basic}. The major mode could have set
3828 @code{smie-indent-basic} buffer-locally instead, but that
3829 is discouraged.
3830
3831 @item
3832 The two (identical) rules for the token @code{","} make SMIE try to be
3833 more clever when the comma separator is placed at the beginning of
3834 lines. It tries to outdent the separator so as to align the code after
3835 the comma; for example:
3836
3837 @example
3838 x = longfunctionname (
3839 arg1
3840 , arg2
3841 );
3842 @end example
3843
3844 @item
3845 The rule for indentation after @code{":="} exists because otherwise
3846 SMIE would treat @code{":="} as an infix operator and would align the
3847 right argument with the left one.
3848
3849 @item
3850 The rule for indentation before @code{"begin"} is an example of the use
3851 of virtual indentation: This rule is used only when @code{"begin"} is
3852 hanging, which can happen only when @code{"begin"} is not at the
3853 beginning of a line. So this is not used when indenting
3854 @code{"begin"} itself but only when indenting something relative to this
3855 @code{"begin"}. Concretely, this rule changes the indentation from:
3856
3857 @example
3858 if x > 0 then begin
3859 dosomething(x);
3860 end
3861 @end example
3862 to
3863 @example
3864 if x > 0 then begin
3865 dosomething(x);
3866 end
3867 @end example
3868
3869 @item
3870 The rule for indentation before @code{"if"} is similar to the one for
3871 @code{"begin"}, but where the purpose is to treat @code{"else if"}
3872 as a single unit, so as to align a sequence of tests rather than indent
3873 each test further to the right. This function does this only in the
3874 case where the @code{"if"} is not placed on a separate line, hence the
3875 @code{smie-rule-bolp} test.
3876
3877 If we know that the @code{"else"} is always aligned with its @code{"if"}
3878 and is always at the beginning of a line, we can use a more efficient
3879 rule:
3880 @example
3881 ((equal token "if")
3882 (and (not (smie-rule-bolp))
3883 (smie-rule-prev-p "else")
3884 (save-excursion
3885 (sample-smie-backward-token)
3886 (cons 'column (current-column)))))
3887 @end example
3888
3889 The advantage of this formulation is that it reuses the indentation of
3890 the previous @code{"else"}, rather than going all the way back to the
3891 first @code{"if"} of the sequence.
3892 @end itemize
3893
3894 @node Desktop Save Mode
3895 @section Desktop Save Mode
3896 @cindex desktop save mode
3897
3898 @dfn{Desktop Save Mode} is a feature to save the state of Emacs from
3899 one session to another. The user-level commands for using Desktop
3900 Save Mode are described in the GNU Emacs Manual (@pxref{Saving Emacs
3901 Sessions,,, emacs, the GNU Emacs Manual}). Modes whose buffers visit
3902 a file, don't have to do anything to use this feature.
3903
3904 For buffers not visiting a file to have their state saved, the major
3905 mode must bind the buffer local variable @code{desktop-save-buffer} to
3906 a non-@code{nil} value.
3907
3908 @defvar desktop-save-buffer
3909 If this buffer-local variable is non-@code{nil}, the buffer will have
3910 its state saved in the desktop file at desktop save. If the value is
3911 a function, it is called at desktop save with argument
3912 @var{desktop-dirname}, and its value is saved in the desktop file along
3913 with the state of the buffer for which it was called. When file names
3914 are returned as part of the auxiliary information, they should be
3915 formatted using the call
3916
3917 @example
3918 (desktop-file-name @var{file-name} @var{desktop-dirname})
3919 @end example
3920
3921 @end defvar
3922
3923 For buffers not visiting a file to be restored, the major mode must
3924 define a function to do the job, and that function must be listed in
3925 the alist @code{desktop-buffer-mode-handlers}.
3926
3927 @defvar desktop-buffer-mode-handlers
3928 Alist with elements
3929
3930 @example
3931 (@var{major-mode} . @var{restore-buffer-function})
3932 @end example
3933
3934 The function @var{restore-buffer-function} will be called with
3935 argument list
3936
3937 @example
3938 (@var{buffer-file-name} @var{buffer-name} @var{desktop-buffer-misc})
3939 @end example
3940
3941 and it should return the restored buffer.
3942 Here @var{desktop-buffer-misc} is the value returned by the function
3943 optionally bound to @code{desktop-save-buffer}.
3944 @end defvar
3945
3946 @ignore
3947 Local Variables:
3948 fill-column: 72
3949 End:
3950 @end ignore