Make set-auto-mode respect mode: entries at the end of the file (bug#8586)
[bpt/emacs.git] / doc / lispref / modes.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990-1995, 1998-1999, 2001-2011 Free Software Foundation, Inc.
4 @c See the file elisp.texi for copying conditions.
5 @setfilename ../../info/modes
6 @node Modes, Documentation, Keymaps, Top
7 @chapter Major and Minor Modes
8 @cindex mode
9
10 A @dfn{mode} is a set of definitions that customize Emacs and can be
11 turned on and off while you edit. There are two varieties of modes:
12 @dfn{major modes}, which are mutually exclusive and used for editing
13 particular kinds of text, and @dfn{minor modes}, which provide features
14 that users can enable individually.
15
16 This chapter describes how to write both major and minor modes, how to
17 indicate them in the mode line, and how they run hooks supplied by the
18 user. For related topics such as keymaps and syntax tables, see
19 @ref{Keymaps}, and @ref{Syntax Tables}.
20
21 @menu
22 * Hooks:: How to use hooks; how to write code that provides hooks.
23 * Major Modes:: Defining major modes.
24 * Minor Modes:: Defining minor modes.
25 * Mode Line Format:: Customizing the text that appears in the mode line.
26 * Imenu:: How a mode can provide a menu
27 of definitions in the buffer.
28 * Font Lock Mode:: How modes can highlight text according to syntax.
29 * Auto-Indentation:: How to teach Emacs to indent for a major mode.
30 * Desktop Save Mode:: How modes can have buffer state saved between
31 Emacs sessions.
32 @end menu
33
34 @node Hooks
35 @section Hooks
36 @cindex hooks
37
38 A @dfn{hook} is a variable where you can store a function or functions
39 to be called on a particular occasion by an existing program. Emacs
40 provides hooks for the sake of customization. Most often, hooks are set
41 up in the init file (@pxref{Init File}), but Lisp programs can set them also.
42 @xref{Standard Hooks}, for a list of standard hook variables.
43
44 @cindex normal hook
45 Most of the hooks in Emacs are @dfn{normal hooks}. These variables
46 contain lists of functions to be called with no arguments. By
47 convention, whenever the hook name ends in @samp{-hook}, that tells
48 you it is normal. We try to make all hooks normal, as much as
49 possible, so that you can use them in a uniform way.
50
51 Every major mode function is supposed to run a normal hook called
52 the @dfn{mode hook} as the one of the last steps of initialization.
53 This makes it easy for a user to customize the behavior of the mode,
54 by overriding the buffer-local variable assignments already made by
55 the mode. Most minor mode functions also run a mode hook at the end.
56 But hooks are used in other contexts too. For example, the hook
57 @code{suspend-hook} runs just before Emacs suspends itself
58 (@pxref{Suspending Emacs}).
59
60 The recommended way to add a hook function to a normal hook is by
61 calling @code{add-hook} (see below). The hook functions may be any of
62 the valid kinds of functions that @code{funcall} accepts (@pxref{What
63 Is a Function}). Most normal hook variables are initially void;
64 @code{add-hook} knows how to deal with this. You can add hooks either
65 globally or buffer-locally with @code{add-hook}.
66
67 @cindex abnormal hook
68 If the hook variable's name does not end with @samp{-hook}, that
69 indicates it is probably an @dfn{abnormal hook}. That means the hook
70 functions are called with arguments, or their return values are used
71 in some way. The hook's documentation says how the functions are
72 called. You can use @code{add-hook} to add a function to an abnormal
73 hook, but you must write the function to follow the hook's calling
74 convention.
75
76 By convention, abnormal hook names end in @samp{-functions} or
77 @samp{-hooks}. If the variable's name ends in @samp{-function}, then
78 its value is just a single function, not a list of functions.
79
80 @menu
81 * Running Hooks:: How to run a hook.
82 * Setting Hooks:: How to put functions on a hook, or remove them.
83 @end menu
84
85 @node Running Hooks
86 @subsection Running Hooks
87
88 At the appropriate times, Emacs uses the @code{run-hooks} function
89 and the other functions below to run particular hooks.
90
91 @defun run-hooks &rest hookvars
92 This function takes one or more normal hook variable names as
93 arguments, and runs each hook in turn. Each argument should be a
94 symbol that is a normal hook variable. These arguments are processed
95 in the order specified.
96
97 If a hook variable has a non-@code{nil} value, that value should be a
98 list of functions. @code{run-hooks} calls all the functions, one by
99 one, with no arguments.
100
101 The hook variable's value can also be a single function---either a
102 lambda expression or a symbol with a function definition---which
103 @code{run-hooks} calls. But this usage is obsolete.
104 @end defun
105
106 @defun run-hook-with-args hook &rest args
107 This function is the way to run an abnormal hook and always call all
108 of the hook functions. It calls each of the hook functions one by
109 one, passing each of them the arguments @var{args}.
110 @end defun
111
112 @defun run-hook-with-args-until-failure hook &rest args
113 This function is the way to run an abnormal hook until one of the hook
114 functions fails. It calls each of the hook functions, passing each of
115 them the arguments @var{args}, until some hook function returns
116 @code{nil}. It then stops and returns @code{nil}. If none of the
117 hook functions return @code{nil}, it returns a non-@code{nil} value.
118 @end defun
119
120 @defun run-hook-with-args-until-success hook &rest args
121 This function is the way to run an abnormal hook until a hook function
122 succeeds. It calls each of the hook functions, passing each of them
123 the arguments @var{args}, until some hook function returns
124 non-@code{nil}. Then it stops, and returns whatever was returned by
125 the last hook function that was called. If all hook functions return
126 @code{nil}, it returns @code{nil} as well.
127 @end defun
128
129 @node Setting Hooks
130 @subsection Setting Hooks
131
132 Here's an example that uses a mode hook to turn on Auto Fill mode when
133 in Lisp Interaction mode:
134
135 @example
136 (add-hook 'lisp-interaction-mode-hook 'turn-on-auto-fill)
137 @end example
138
139 @defun add-hook hook function &optional append local
140 This function is the handy way to add function @var{function} to hook
141 variable @var{hook}. You can use it for abnormal hooks as well as for
142 normal hooks. @var{function} can be any Lisp function that can accept
143 the proper number of arguments for @var{hook}. For example,
144
145 @example
146 (add-hook 'text-mode-hook 'my-text-hook-function)
147 @end example
148
149 @noindent
150 adds @code{my-text-hook-function} to the hook called @code{text-mode-hook}.
151
152 If @var{function} is already present in @var{hook} (comparing using
153 @code{equal}), then @code{add-hook} does not add it a second time.
154
155 If @var{function} has a non-@code{nil} property
156 @code{permanent-local-hook}, then @code{kill-all-local-variables} (or
157 changing major modes) won't delete it from the hook variable's local
158 value.
159
160 It is best to design your hook functions so that the order in which
161 they are executed does not matter. Any dependence on the order is
162 asking for trouble. However, the order is predictable: normally,
163 @var{function} goes at the front of the hook list, so it will be
164 executed first (barring another @code{add-hook} call). If the
165 optional argument @var{append} is non-@code{nil}, the new hook
166 function goes at the end of the hook list and will be executed last.
167
168 @code{add-hook} can handle the cases where @var{hook} is void or its
169 value is a single function; it sets or changes the value to a list of
170 functions.
171
172 If @var{local} is non-@code{nil}, that says to add @var{function} to
173 the buffer-local hook list instead of to the global hook list. If
174 needed, this makes the hook buffer-local and adds @code{t} to the
175 buffer-local value. The latter acts as a flag to run the hook
176 functions in the default value as well as in the local value.
177 @end defun
178
179 @defun remove-hook hook function &optional local
180 This function removes @var{function} from the hook variable
181 @var{hook}. It compares @var{function} with elements of @var{hook}
182 using @code{equal}, so it works for both symbols and lambda
183 expressions.
184
185 If @var{local} is non-@code{nil}, that says to remove @var{function}
186 from the buffer-local hook list instead of from the global hook list.
187 @end defun
188
189 @node Major Modes
190 @section Major Modes
191 @cindex major mode
192
193 Major modes specialize Emacs for editing particular kinds of text.
194 Each buffer has only one major mode at a time. For each major mode
195 there is a function to switch to that mode in the current buffer; its
196 name should end in @samp{-mode}. These functions work by setting
197 buffer-local variable bindings and other data associated with the
198 buffer, such as a local keymap. The effect lasts until you switch
199 to another major mode in the same buffer.
200
201 @menu
202 * Major Mode Basics::
203 * Major Mode Conventions:: Coding conventions for keymaps, etc.
204 * Auto Major Mode:: How Emacs chooses the major mode automatically.
205 * Mode Help:: Finding out how to use a mode.
206 * Derived Modes:: Defining a new major mode based on another major
207 mode.
208 * Generic Modes:: Defining a simple major mode that supports
209 comment syntax and Font Lock mode.
210 * Mode Hooks:: Hooks run at the end of major mode functions.
211 * Example Major Modes:: Text mode and Lisp modes.
212 @end menu
213
214 @node Major Mode Basics
215 @subsection Major Mode Basics
216 @cindex Fundamental mode
217
218 The least specialized major mode is called @dfn{Fundamental mode}.
219 This mode has no mode-specific definitions or variable settings, so each
220 Emacs command behaves in its default manner, and each option is in its
221 default state. All other major modes redefine various keys and options.
222 For example, Lisp Interaction mode provides special key bindings for
223 @kbd{C-j} (@code{eval-print-last-sexp}), @key{TAB}
224 (@code{lisp-indent-line}), and other keys.
225
226 When you need to write several editing commands to help you perform a
227 specialized editing task, creating a new major mode is usually a good
228 idea. In practice, writing a major mode is easy (in contrast to
229 writing a minor mode, which is often difficult).
230
231 If the new mode is similar to an old one, it is often unwise to
232 modify the old one to serve two purposes, since it may become harder
233 to use and maintain. Instead, copy and rename an existing major mode
234 definition and alter the copy---or use the @code{define-derived-mode}
235 macro to define a @dfn{derived mode} (@pxref{Derived Modes}). For
236 example, Rmail Edit mode is a major mode that is very similar to Text
237 mode except that it provides two additional commands. Its definition
238 is distinct from that of Text mode, but uses that of Text mode.
239
240 Even if the new mode is not an obvious derivative of any other mode,
241 we recommend to use @code{define-derived-mode}, since it automatically
242 enforces the most important coding conventions for you.
243
244 For a very simple programming language major mode that handles
245 comments and fontification, you can use @code{define-generic-mode}.
246 @xref{Generic Modes}.
247
248 Rmail Edit mode offers an example of changing the major mode
249 temporarily for a buffer, so it can be edited in a different way (with
250 ordinary Emacs commands rather than Rmail commands). In such cases, the
251 temporary major mode usually provides a command to switch back to the
252 buffer's usual mode (Rmail mode, in this case). You might be tempted to
253 present the temporary redefinitions inside a recursive edit and restore
254 the usual ones when the user exits; but this is a bad idea because it
255 constrains the user's options when it is done in more than one buffer:
256 recursive edits must be exited most-recently-entered first. Using an
257 alternative major mode avoids this limitation. @xref{Recursive
258 Editing}.
259
260 The standard GNU Emacs Lisp library directory tree contains the code
261 for several major modes, in files such as @file{text-mode.el},
262 @file{texinfo.el}, @file{lisp-mode.el}, @file{c-mode.el}, and
263 @file{rmail.el}. They are found in various subdirectories of the
264 @file{lisp} directory. You can study these libraries to see how modes
265 are written. Text mode is perhaps the simplest major mode aside from
266 Fundamental mode. Rmail mode is a complicated and specialized mode.
267
268 @node Major Mode Conventions
269 @subsection Major Mode Conventions
270 @cindex major mode conventions
271 @cindex conventions for writing major modes
272
273 The code for existing major modes follows various coding conventions,
274 including conventions for local keymap and syntax table initialization,
275 global names, and hooks. Please follow these conventions when you
276 define a new major mode. (Fundamental mode is an exception to many
277 of these conventions, because its definition is to present the global
278 state of Emacs.)
279
280 This list of conventions is only partial, because each major mode
281 should aim for consistency in general with other Emacs major modes.
282 This makes Emacs as a whole more coherent. It is impossible to list
283 here all the possible points where this issue might come up; if the
284 Emacs developers point out an area where your major mode deviates from
285 the usual conventions, please make it compatible.
286
287 @itemize @bullet
288 @item
289 Define a command whose name ends in @samp{-mode}, with no arguments,
290 that switches to the new mode in the current buffer. This command
291 should set up the keymap, syntax table, and buffer-local variables in an
292 existing buffer, without changing the buffer's contents.
293
294 @item
295 Write a documentation string for this command that describes the
296 special commands available in this mode. @kbd{C-h m}
297 (@code{describe-mode}) in your mode will display this string.
298
299 The documentation string may include the special documentation
300 substrings, @samp{\[@var{command}]}, @samp{\@{@var{keymap}@}}, and
301 @samp{\<@var{keymap}>}, which enable the documentation to adapt
302 automatically to the user's own key bindings. @xref{Keys in
303 Documentation}.
304
305 @item
306 The major mode command should start by calling
307 @code{kill-all-local-variables}. This runs the normal hook
308 @code{change-major-mode-hook}, then gets rid of the buffer-local
309 variables of the major mode previously in effect. @xref{Creating
310 Buffer-Local}.
311
312 @item
313 The major mode command should set the variable @code{major-mode} to the
314 major mode command symbol. This is how @code{describe-mode} discovers
315 which documentation to print.
316
317 @item
318 The major mode command should set the variable @code{mode-name} to the
319 ``pretty'' name of the mode, usually a string (but see @ref{Mode Line
320 Data}, for other possible forms). The name of the mode appears
321 in the mode line.
322
323 @item
324 @cindex functions in modes
325 Since all global names are in the same name space, all the global
326 variables, constants, and functions that are part of the mode should
327 have names that start with the major mode name (or with an abbreviation
328 of it if the name is long). @xref{Coding Conventions}.
329
330 @item
331 In a major mode for editing some kind of structured text, such as a
332 programming language, indentation of text according to structure is
333 probably useful. So the mode should set @code{indent-line-function}
334 to a suitable function, and probably customize other variables
335 for indentation. @xref{Auto-Indentation}.
336
337 @item
338 @cindex keymaps in modes
339 The major mode should usually have its own keymap, which is used as the
340 local keymap in all buffers in that mode. The major mode command should
341 call @code{use-local-map} to install this local map. @xref{Active
342 Keymaps}, for more information.
343
344 This keymap should be stored permanently in a global variable named
345 @code{@var{modename}-mode-map}. Normally the library that defines the
346 mode sets this variable.
347
348 @xref{Tips for Defining}, for advice about how to write the code to set
349 up the mode's keymap variable.
350
351 @item
352 The key sequences bound in a major mode keymap should usually start with
353 @kbd{C-c}, followed by a control character, a digit, or @kbd{@{},
354 @kbd{@}}, @kbd{<}, @kbd{>}, @kbd{:} or @kbd{;}. The other punctuation
355 characters are reserved for minor modes, and ordinary letters are
356 reserved for users.
357
358 A major mode can also rebind the keys @kbd{M-n}, @kbd{M-p} and
359 @kbd{M-s}. The bindings for @kbd{M-n} and @kbd{M-p} should normally
360 be some kind of ``moving forward and backward,'' but this does not
361 necessarily mean cursor motion.
362
363 It is legitimate for a major mode to rebind a standard key sequence if
364 it provides a command that does ``the same job'' in a way better
365 suited to the text this mode is used for. For example, a major mode
366 for editing a programming language might redefine @kbd{C-M-a} to
367 ``move to the beginning of a function'' in a way that works better for
368 that language.
369
370 It is also legitimate for a major mode to rebind a standard key
371 sequence whose standard meaning is rarely useful in that mode. For
372 instance, minibuffer modes rebind @kbd{M-r}, whose standard meaning is
373 rarely of any use in the minibuffer. Major modes such as Dired or
374 Rmail that do not allow self-insertion of text can reasonably redefine
375 letters and other printing characters as special commands.
376
377 @item
378 Major modes for editing text should not define @key{RET} to do
379 anything other than insert a newline. However, it is ok for
380 specialized modes for text that users don't directly edit, such as
381 Dired and Info modes, to redefine @key{RET} to do something entirely
382 different.
383
384 @item
385 Major modes should not alter options that are primarily a matter of user
386 preference, such as whether Auto-Fill mode is enabled. Leave this to
387 each user to decide. However, a major mode should customize other
388 variables so that Auto-Fill mode will work usefully @emph{if} the user
389 decides to use it.
390
391 @item
392 @cindex syntax tables in modes
393 The mode may have its own syntax table or may share one with other
394 related modes. If it has its own syntax table, it should store this in
395 a variable named @code{@var{modename}-mode-syntax-table}. @xref{Syntax
396 Tables}.
397
398 @item
399 If the mode handles a language that has a syntax for comments, it should
400 set the variables that define the comment syntax. @xref{Options for
401 Comments,, Options Controlling Comments, emacs, The GNU Emacs Manual}.
402
403 @item
404 @cindex abbrev tables in modes
405 The mode may have its own abbrev table or may share one with other
406 related modes. If it has its own abbrev table, it should store this
407 in a variable named @code{@var{modename}-mode-abbrev-table}. If the
408 major mode command defines any abbrevs itself, it should pass @code{t}
409 for the @var{system-flag} argument to @code{define-abbrev}.
410 @xref{Defining Abbrevs}.
411
412 @item
413 The mode should specify how to do highlighting for Font Lock mode, by
414 setting up a buffer-local value for the variable
415 @code{font-lock-defaults} (@pxref{Font Lock Mode}).
416
417 @item
418 Each face that the mode defines should, if possible, inherit from an
419 existing Emacs face. This reduces the chance of conflicting with a
420 user's face customizations. Useful faces include:
421
422 @table @asis
423 @item @code{highlight}
424 for stretches of text that should temporarily stand out.
425
426 @item @code{match}
427 for text matching a search command.
428
429 @item @code{link} and @code{link-visited}
430 for clickable text buttons that send the user to a different buffer or
431 ``location''.
432
433 @item @code{button}
434 for clickable text buttons that perform other actions.
435
436 @item @asis{Font Lock faces}
437 for other kinds of syntactic highlighting, if highlighting is not
438 handled by Font Lock mode or some Font Lock faces are not in use.
439 @xref{Faces for Font Lock}, for how to assign Font Lock faces.
440 @end table
441
442 @item
443 The mode should specify how Imenu should find the definitions or
444 sections of a buffer, by setting up a buffer-local value for the
445 variable @code{imenu-generic-expression}, for the two variables
446 @code{imenu-prev-index-position-function} and
447 @code{imenu-extract-index-name-function}, or for the variable
448 @code{imenu-create-index-function} (@pxref{Imenu}).
449
450 @item
451 The mode can specify a local value for
452 @code{eldoc-documentation-function} to tell ElDoc mode how to handle
453 this mode.
454
455 @item
456 The mode can specify how to complete various keywords by adding
457 to the special hook @code{completion-at-point-functions}.
458
459 @item
460 Use @code{defvar} or @code{defcustom} to set mode-related variables, so
461 that they are not reinitialized if they already have a value. (Such
462 reinitialization could discard customizations made by the user.)
463
464 @item
465 @cindex buffer-local variables in modes
466 To make a buffer-local binding for an Emacs customization variable, use
467 @code{make-local-variable} in the major mode command, not
468 @code{make-variable-buffer-local}. The latter function would make the
469 variable local to every buffer in which it is subsequently set, which
470 would affect buffers that do not use this mode. It is undesirable for a
471 mode to have such global effects. @xref{Buffer-Local Variables}.
472
473 With rare exceptions, the only reasonable way to use
474 @code{make-variable-buffer-local} in a Lisp package is for a variable
475 which is used only within that package. Using it on a variable used by
476 other packages would interfere with them.
477
478 @item
479 @cindex mode hook
480 @cindex major mode hook
481 Each major mode should have a normal @dfn{mode hook} named
482 @code{@var{modename}-mode-hook}. The very last thing the major mode command
483 should do is to call @code{run-mode-hooks}. This runs the mode hook,
484 and then runs the normal hook @code{after-change-major-mode-hook}.
485 @xref{Mode Hooks}.
486
487 @item
488 The major mode command may start by calling some other major mode
489 command (called the @dfn{parent mode}) and then alter some of its
490 settings. A mode that does this is called a @dfn{derived mode}. The
491 recommended way to define one is to use the @code{define-derived-mode}
492 macro, but this is not required. Such a mode should call the parent
493 mode command inside a @code{delay-mode-hooks} form. (Using
494 @code{define-derived-mode} does this automatically.) @xref{Derived
495 Modes}, and @ref{Mode Hooks}.
496
497 @item
498 If something special should be done if the user switches a buffer from
499 this mode to any other major mode, this mode can set up a buffer-local
500 value for @code{change-major-mode-hook} (@pxref{Creating Buffer-Local}).
501
502 @item
503 If this mode is appropriate only for specially-prepared text, then the
504 major mode command symbol should have a property named @code{mode-class}
505 with value @code{special}, put on as follows:
506
507 @kindex mode-class @r{(property)}
508 @cindex @code{special}
509 @example
510 (put 'funny-mode 'mode-class 'special)
511 @end example
512
513 @noindent
514 This tells Emacs that new buffers created while the current buffer is
515 in Funny mode should not inherit Funny mode, in case the default value
516 of @code{major-mode} is @code{nil}. Modes such as Dired, Rmail,
517 and Buffer List use this feature.
518
519 The @code{define-derived-mode} macro automatically marks the derived
520 mode as special if the parent mode is special. The special mode
521 @code{special-mode} provides a convenient parent for other special
522 modes to inherit from; it sets @code{buffer-read-only} to @code{t},
523 and does little else.
524
525 @item
526 If you want to make the new mode the default for files with certain
527 recognizable names, add an element to @code{auto-mode-alist} to select
528 the mode for those file names (@pxref{Auto Major Mode}). If you
529 define the mode command to autoload, you should add this element in
530 the same file that calls @code{autoload}. If you use an autoload
531 cookie for the mode command, you can also use an autoload cookie for
532 the form that adds the element (@pxref{autoload cookie}). If you do
533 not autoload the mode command, it is sufficient to add the element in
534 the file that contains the mode definition.
535
536 @item
537 In the comments that document the file, you should provide a sample
538 @code{autoload} form and an example of how to add to
539 @code{auto-mode-alist}, that users can include in their init files
540 (@pxref{Init File}).
541
542 @item
543 @cindex mode loading
544 The top-level forms in the file defining the mode should be written so
545 that they may be evaluated more than once without adverse consequences.
546 Even if you never load the file more than once, someone else will.
547 @end itemize
548
549 @node Auto Major Mode
550 @subsection How Emacs Chooses a Major Mode
551 @cindex major mode, automatic selection
552
553 Based on information in the file name or in the file itself, Emacs
554 automatically selects a major mode for the new buffer when a file is
555 visited. It also processes local variables specified in the file text.
556
557 @deffn Command fundamental-mode
558 Fundamental mode is a major mode that is not specialized for anything
559 in particular. Other major modes are defined in effect by comparison
560 with this one---their definitions say what to change, starting from
561 Fundamental mode. The @code{fundamental-mode} function does @emph{not}
562 run any mode hooks; you're not supposed to customize it. (If you want Emacs
563 to behave differently in Fundamental mode, change the @emph{global}
564 state of Emacs.)
565 @end deffn
566
567 @deffn Command normal-mode &optional find-file
568 This function establishes the proper major mode and buffer-local variable
569 bindings for the current buffer. First it calls @code{set-auto-mode}
570 (see below), then it runs @code{hack-local-variables} to parse, and
571 bind or evaluate as appropriate, the file's local variables
572 (@pxref{File Local Variables}).
573
574 If the @var{find-file} argument to @code{normal-mode} is non-@code{nil},
575 @code{normal-mode} assumes that the @code{find-file} function is calling
576 it. In this case, it may process local variables in the @samp{-*-}
577 line or at the end of the file. The variable
578 @code{enable-local-variables} controls whether to do so. @xref{File
579 Variables, , Local Variables in Files, emacs, The GNU Emacs Manual},
580 for the syntax of the local variables section of a file.
581
582 If you run @code{normal-mode} interactively, the argument
583 @var{find-file} is normally @code{nil}. In this case,
584 @code{normal-mode} unconditionally processes any file local variables.
585
586 The function calls @code{set-auto-mode} to choose a major mode. If it
587 does not specify a mode, the buffer stays in the major mode determined
588 by the default value of @code{major-mode} (see below).
589
590 @cindex file mode specification error
591 @code{normal-mode} uses @code{condition-case} around the call to the
592 major mode function, so errors are caught and reported as a @samp{File
593 mode specification error}, followed by the original error message.
594 @end deffn
595
596 @defun set-auto-mode &optional keep-mode-if-same
597 @cindex visited file mode
598 This function selects the major mode that is appropriate for the
599 current buffer. It bases its decision (in order of precedence) on
600 the @w{@samp{-*-}} line, on any @samp{mode:} local variable near the
601 end of a file, on the @w{@samp{#!}} line (using
602 @code{interpreter-mode-alist}), on the text at the beginning of the
603 buffer (using @code{magic-mode-alist}), and finally on the visited
604 file name (using @code{auto-mode-alist}). @xref{Choosing Modes, , How
605 Major Modes are Chosen, emacs, The GNU Emacs Manual}.
606 If @code{enable-local-variables} is @code{nil}, @code{set-auto-mode}
607 does not check the @w{@samp{-*-}} line, or near the end of the file,
608 for any mode tag.
609
610 If @var{keep-mode-if-same} is non-@code{nil}, this function does not
611 call the mode command if the buffer is already in the proper major
612 mode. For instance, @code{set-visited-file-name} sets this to
613 @code{t} to avoid killing buffer local variables that the user may
614 have set.
615 @end defun
616
617 @defopt major-mode
618 The buffer-local value of this variable holds the major mode
619 currently active. The default value of this variable holds the
620 default major mode for new buffers. The standard default value is
621 @code{fundamental-mode}.
622
623 If the default value of @code{major-mode} is @code{nil}, Emacs uses
624 the (previously) current buffer's major mode as the default major mode
625 of a new buffer. However, if that major mode symbol has a @code{mode-class}
626 property with value @code{special}, then it is not used for new buffers;
627 Fundamental mode is used instead. The modes that have this property are
628 those such as Dired and Rmail that are useful only with text that has
629 been specially prepared.
630 @end defopt
631
632 @defun set-buffer-major-mode buffer
633 This function sets the major mode of @var{buffer} to the default value of
634 @code{major-mode}; if that is @code{nil}, it uses the
635 current buffer's major mode (if that is suitable). As an exception,
636 if @var{buffer}'s name is @samp{*scratch*}, it sets the mode to
637 @code{initial-major-mode}.
638
639 The low-level primitives for creating buffers do not use this function,
640 but medium-level commands such as @code{switch-to-buffer} and
641 @code{find-file-noselect} use it whenever they create buffers.
642 @end defun
643
644 @defopt initial-major-mode
645 @cindex @samp{*scratch*}
646 The value of this variable determines the major mode of the initial
647 @samp{*scratch*} buffer. The value should be a symbol that is a major
648 mode command. The default value is @code{lisp-interaction-mode}.
649 @end defopt
650
651 @defvar interpreter-mode-alist
652 This variable specifies major modes to use for scripts that specify a
653 command interpreter in a @samp{#!} line. Its value is an alist with
654 elements of the form @code{(@var{interpreter} . @var{mode})}; for
655 example, @code{("perl" . perl-mode)} is one element present by
656 default. The element says to use mode @var{mode} if the file
657 specifies an interpreter which matches @var{interpreter}.
658 @end defvar
659
660 @defvar magic-mode-alist
661 This variable's value is an alist with elements of the form
662 @code{(@var{regexp} . @var{function})}, where @var{regexp} is a
663 regular expression and @var{function} is a function or @code{nil}.
664 After visiting a file, @code{set-auto-mode} calls @var{function} if
665 the text at the beginning of the buffer matches @var{regexp} and
666 @var{function} is non-@code{nil}; if @var{function} is @code{nil},
667 @code{auto-mode-alist} gets to decide the mode.
668 @end defvar
669
670 @defvar magic-fallback-mode-alist
671 This works like @code{magic-mode-alist}, except that it is handled
672 only if @code{auto-mode-alist} does not specify a mode for this file.
673 @end defvar
674
675 @defvar auto-mode-alist
676 This variable contains an association list of file name patterns
677 (regular expressions) and corresponding major mode commands. Usually,
678 the file name patterns test for suffixes, such as @samp{.el} and
679 @samp{.c}, but this need not be the case. An ordinary element of the
680 alist looks like @code{(@var{regexp} . @var{mode-function})}.
681
682 For example,
683
684 @smallexample
685 @group
686 (("\\`/tmp/fol/" . text-mode)
687 ("\\.texinfo\\'" . texinfo-mode)
688 ("\\.texi\\'" . texinfo-mode)
689 @end group
690 @group
691 ("\\.el\\'" . emacs-lisp-mode)
692 ("\\.c\\'" . c-mode)
693 ("\\.h\\'" . c-mode)
694 @dots{})
695 @end group
696 @end smallexample
697
698 When you visit a file whose expanded file name (@pxref{File Name
699 Expansion}), with version numbers and backup suffixes removed using
700 @code{file-name-sans-versions} (@pxref{File Name Components}), matches
701 a @var{regexp}, @code{set-auto-mode} calls the corresponding
702 @var{mode-function}. This feature enables Emacs to select the proper
703 major mode for most files.
704
705 If an element of @code{auto-mode-alist} has the form @code{(@var{regexp}
706 @var{function} t)}, then after calling @var{function}, Emacs searches
707 @code{auto-mode-alist} again for a match against the portion of the file
708 name that did not match before. This feature is useful for
709 uncompression packages: an entry of the form @code{("\\.gz\\'"
710 @var{function} t)} can uncompress the file and then put the uncompressed
711 file in the proper mode according to the name sans @samp{.gz}.
712
713 Here is an example of how to prepend several pattern pairs to
714 @code{auto-mode-alist}. (You might use this sort of expression in your
715 init file.)
716
717 @smallexample
718 @group
719 (setq auto-mode-alist
720 (append
721 ;; @r{File name (within directory) starts with a dot.}
722 '(("/\\.[^/]*\\'" . fundamental-mode)
723 ;; @r{File name has no dot.}
724 ("/[^\\./]*\\'" . fundamental-mode)
725 ;; @r{File name ends in @samp{.C}.}
726 ("\\.C\\'" . c++-mode))
727 auto-mode-alist))
728 @end group
729 @end smallexample
730 @end defvar
731
732 @node Mode Help
733 @subsection Getting Help about a Major Mode
734 @cindex mode help
735 @cindex help for major mode
736 @cindex documentation for major mode
737
738 The @code{describe-mode} function is used to provide information
739 about major modes. It is normally called with @kbd{C-h m}. The
740 @code{describe-mode} function uses the value of @code{major-mode},
741 which is why every major mode function needs to set the
742 @code{major-mode} variable.
743
744 @deffn Command describe-mode
745 This function displays the documentation of the current major mode.
746
747 The @code{describe-mode} function calls the @code{documentation}
748 function using the value of @code{major-mode} as an argument. Thus, it
749 displays the documentation string of the major mode function.
750 (@xref{Accessing Documentation}.)
751 @end deffn
752
753 @defvar major-mode
754 This buffer-local variable holds the symbol for the current buffer's
755 major mode. This symbol should have a function definition that is the
756 command to switch to that major mode. The @code{describe-mode}
757 function uses the documentation string of the function as the
758 documentation of the major mode.
759 @end defvar
760
761 @node Derived Modes
762 @subsection Defining Derived Modes
763 @cindex derived mode
764
765 The recommended way to define a new major mode is to derive it
766 from an existing one using @code{define-derived-mode}. If there is no
767 closely related mode, you can inherit from @code{text-mode},
768 @code{special-mode}, @code{prog-mode}, or in the worst case
769 @code{fundamental-mode}.
770
771 @defmac define-derived-mode variant parent name docstring keyword-args@dots{} body@dots{}
772 This macro defines @var{variant} as a major mode command, using
773 @var{name} as the string form of the mode name. @var{variant} and
774 @var{parent} should be unquoted symbols.
775
776 The new command @var{variant} is defined to call the function
777 @var{parent}, then override certain aspects of that parent mode:
778
779 @itemize @bullet
780 @item
781 The new mode has its own sparse keymap, named
782 @code{@var{variant}-map}. @code{define-derived-mode}
783 makes the parent mode's keymap the parent of the new map, unless
784 @code{@var{variant}-map} is already set and already has a parent.
785
786 @item
787 The new mode has its own syntax table, kept in the variable
788 @code{@var{variant}-syntax-table}, unless you override this using the
789 @code{:syntax-table} keyword (see below). @code{define-derived-mode}
790 makes the parent mode's syntax-table the parent of
791 @code{@var{variant}-syntax-table}, unless the latter is already set
792 and already has a parent different from the standard syntax table.
793
794 @item
795 The new mode has its own abbrev table, kept in the variable
796 @code{@var{variant}-abbrev-table}, unless you override this using the
797 @code{:abbrev-table} keyword (see below).
798
799 @item
800 The new mode has its own mode hook, @code{@var{variant}-hook}. It
801 runs this hook, after running the hooks of its ancestor modes, with
802 @code{run-mode-hooks}, as the last thing it does. @xref{Mode Hooks}.
803 @end itemize
804
805 In addition, you can specify how to override other aspects of
806 @var{parent} with @var{body}. The command @var{variant}
807 evaluates the forms in @var{body} after setting up all its usual
808 overrides, just before running the mode hooks.
809
810 If @var{parent} has a non-@code{nil} @code{mode-class} symbol
811 property, then @code{define-derived-mode} sets the @code{mode-class}
812 property of @var{variant} to the same value. This ensures, for
813 example, that if @var{parent} is a special mode, then @var{variant} is
814 also a special mode (@pxref{Major Mode Conventions}).
815
816 You can also specify @code{nil} for @var{parent}. This gives the new
817 mode no parent. Then @code{define-derived-mode} behaves as described
818 above, but, of course, omits all actions connected with @var{parent}.
819
820 The argument @var{docstring} specifies the documentation string for
821 the new mode. @code{define-derived-mode} adds some general
822 information about the mode's hook, followed by the mode's keymap, at
823 the end of this docstring. If you omit @var{docstring},
824 @code{define-derived-mode} generates a documentation string.
825
826 The @var{keyword-args} are pairs of keywords and values. The values
827 are evaluated. The following keywords are currently supported:
828
829 @table @code
830 @item :syntax-table
831 You can use this to explicitly specify a syntax table for the new
832 mode. If you specify a @code{nil} value, the new mode uses the same
833 syntax table as @var{parent}, or the standard syntax table if
834 @var{parent} is @code{nil}. (Note that this does @emph{not} follow
835 the convention used for non-keyword arguments that a @code{nil} value
836 is equivalent with not specifying the argument.)
837
838 @item :abbrev-table
839 You can use this to explicitly specify an abbrev table for the new
840 mode. If you specify a @code{nil} value, the new mode uses the same
841 abbrev table as @var{parent}, or @code{fundamental-mode-abbrev-table}
842 if @var{parent} is @code{nil}. (Again, a @code{nil} value is
843 @emph{not} equivalent to not specifying this keyword.)
844
845 @item :group
846 If this is specified, the value should be the customization group for
847 this mode. (Not all major modes have one.) Only the (still
848 experimental and unadvertised) command @code{customize-mode} currently
849 uses this. @code{define-derived-mode} does @emph{not} automatically
850 define the specified customization group.
851 @end table
852
853 Here is a hypothetical example:
854
855 @example
856 (define-derived-mode hypertext-mode
857 text-mode "Hypertext"
858 "Major mode for hypertext.
859 \\@{hypertext-mode-map@}"
860 (setq case-fold-search nil))
861
862 (define-key hypertext-mode-map
863 [down-mouse-3] 'do-hyper-link)
864 @end example
865
866 Do not write an @code{interactive} spec in the definition;
867 @code{define-derived-mode} does that automatically.
868 @end defmac
869
870 @node Generic Modes
871 @subsection Generic Modes
872 @cindex generic mode
873
874 @dfn{Generic modes} are simple major modes with basic support for
875 comment syntax and Font Lock mode. To define a generic mode, use the
876 macro @code{define-generic-mode}. See the file @file{generic-x.el}
877 for some examples of the use of @code{define-generic-mode}.
878
879 @defmac define-generic-mode mode comment-list keyword-list font-lock-list auto-mode-list function-list &optional docstring
880 This macro defines a generic mode command named @var{mode} (a symbol,
881 not quoted). The optional argument @var{docstring} is the
882 documentation for the mode command. If you do not supply it,
883 @code{define-generic-mode} generates one by default.
884
885 The argument @var{comment-list} is a list in which each element is
886 either a character, a string of one or two characters, or a cons cell.
887 A character or a string is set up in the mode's syntax table as a
888 ``comment starter.'' If the entry is a cons cell, the @sc{car} is set
889 up as a ``comment starter'' and the @sc{cdr} as a ``comment ender.''
890 (Use @code{nil} for the latter if you want comments to end at the end
891 of the line.) Note that the syntax table mechanism has limitations
892 about what comment starters and enders are actually possible.
893 @xref{Syntax Tables}.
894
895 The argument @var{keyword-list} is a list of keywords to highlight
896 with @code{font-lock-keyword-face}. Each keyword should be a string.
897 Meanwhile, @var{font-lock-list} is a list of additional expressions to
898 highlight. Each element of this list should have the same form as an
899 element of @code{font-lock-keywords}. @xref{Search-based
900 Fontification}.
901
902 The argument @var{auto-mode-list} is a list of regular expressions to
903 add to the variable @code{auto-mode-alist}. They are added by the execution
904 of the @code{define-generic-mode} form, not by expanding the macro call.
905
906 Finally, @var{function-list} is a list of functions for the mode
907 command to call for additional setup. It calls these functions just
908 before it runs the mode hook variable @code{@var{mode}-hook}.
909 @end defmac
910
911 @node Mode Hooks
912 @subsection Mode Hooks
913
914 Every major mode function should finish by running its mode hook and
915 the mode-independent normal hook @code{after-change-major-mode-hook}.
916 It does this by calling @code{run-mode-hooks}. If the major mode is a
917 derived mode, that is if it calls another major mode (the parent mode)
918 in its body, it should do this inside @code{delay-mode-hooks} so that
919 the parent won't run these hooks itself. Instead, the derived mode's
920 call to @code{run-mode-hooks} runs the parent's mode hook too.
921 @xref{Major Mode Conventions}.
922
923 Emacs versions before Emacs 22 did not have @code{delay-mode-hooks}.
924 When user-implemented major modes have not been updated to use it,
925 they won't entirely follow these conventions: they may run the
926 parent's mode hook too early, or fail to run
927 @code{after-change-major-mode-hook}. If you encounter such a major
928 mode, please correct it to follow these conventions.
929
930 When you defined a major mode using @code{define-derived-mode}, it
931 automatically makes sure these conventions are followed. If you
932 define a major mode ``by hand,'' not using @code{define-derived-mode},
933 use the following functions to handle these conventions automatically.
934
935 @defun run-mode-hooks &rest hookvars
936 Major modes should run their mode hook using this function. It is
937 similar to @code{run-hooks} (@pxref{Hooks}), but it also runs
938 @code{after-change-major-mode-hook}.
939
940 When this function is called during the execution of a
941 @code{delay-mode-hooks} form, it does not run the hooks immediately.
942 Instead, it arranges for the next call to @code{run-mode-hooks} to run
943 them.
944 @end defun
945
946 @defmac delay-mode-hooks body@dots{}
947 When one major mode command calls another, it should do so inside of
948 @code{delay-mode-hooks}.
949
950 This macro executes @var{body}, but tells all @code{run-mode-hooks}
951 calls during the execution of @var{body} to delay running their hooks.
952 The hooks will actually run during the next call to
953 @code{run-mode-hooks} after the end of the @code{delay-mode-hooks}
954 construct.
955 @end defmac
956
957 @defvar after-change-major-mode-hook
958 This is a normal hook run by @code{run-mode-hooks}. It is run at the
959 very end of every properly-written major mode function.
960 @end defvar
961
962 @node Example Major Modes
963 @subsection Major Mode Examples
964
965 Text mode is perhaps the simplest mode besides Fundamental mode.
966 Here are excerpts from @file{text-mode.el} that illustrate many of
967 the conventions listed above:
968
969 @smallexample
970 @group
971 ;; @r{Create the syntax table for this mode.}
972 (defvar text-mode-syntax-table
973 (let ((st (make-syntax-table)))
974 (modify-syntax-entry ?\" ". " st)
975 (modify-syntax-entry ?\\ ". " st)
976 ;; Add `p' so M-c on `hello' leads to `Hello', not `hello'.
977 (modify-syntax-entry ?' "w p" st)
978 st)
979 "Syntax table used while in `text-mode'.")
980 @end group
981
982 ;; @r{Create the keymap for this mode.}
983 @group
984 (defvar text-mode-map
985 (let ((map (make-sparse-keymap)))
986 (define-key map "\e\t" 'ispell-complete-word)
987 (define-key map "\es" 'center-line)
988 (define-key map "\eS" 'center-paragraph)
989 map)
990 "Keymap for `text-mode'.
991 Many other modes, such as Mail mode, Outline mode
992 and Indented Text mode, inherit all the commands
993 defined in this map.")
994 @end group
995 @end smallexample
996
997 Here is how the actual mode command is defined now:
998
999 @smallexample
1000 @group
1001 (define-derived-mode text-mode nil "Text"
1002 "Major mode for editing text written for humans to read.
1003 In this mode, paragraphs are delimited only by blank or white lines.
1004 You can thus get the full benefit of adaptive filling
1005 (see the variable `adaptive-fill-mode').
1006 \\@{text-mode-map@}
1007 Turning on Text mode runs the normal hook `text-mode-hook'."
1008 @end group
1009 @group
1010 (set (make-local-variable 'text-mode-variant) t)
1011 ;; @r{These two lines are a feature added recently.}
1012 (set (make-local-variable 'require-final-newline)
1013 mode-require-final-newline)
1014 (set (make-local-variable 'indent-line-function) 'indent-relative))
1015 @end group
1016 @end smallexample
1017
1018 @noindent
1019 (The last line is redundant nowadays, since @code{indent-relative} is
1020 the default value, and we'll delete it in a future version.)
1021
1022 Here is how it was defined formerly, before
1023 @code{define-derived-mode} existed:
1024
1025 @smallexample
1026 @group
1027 ;; @r{This isn't needed nowadays, since @code{define-derived-mode} does it.}
1028 (define-abbrev-table 'text-mode-abbrev-table ()
1029 "Abbrev table used while in text mode.")
1030 @end group
1031
1032 @group
1033 (defun text-mode ()
1034 "Major mode for editing text intended for humans to read...
1035 Special commands: \\@{text-mode-map@}
1036 @end group
1037 @group
1038 Turning on text-mode runs the hook `text-mode-hook'."
1039 (interactive)
1040 (kill-all-local-variables)
1041 (use-local-map text-mode-map)
1042 @end group
1043 @group
1044 (setq local-abbrev-table text-mode-abbrev-table)
1045 (set-syntax-table text-mode-syntax-table)
1046 @end group
1047 @group
1048 ;; @r{These four lines are absent from the current version}
1049 ;; @r{not because this is done some other way, but rather}
1050 ;; @r{because nowadays Text mode uses the normal definition of paragraphs.}
1051 (set (make-local-variable 'paragraph-start)
1052 (concat "[ \t]*$\\|" page-delimiter))
1053 (set (make-local-variable 'paragraph-separate) paragraph-start)
1054 (set (make-local-variable 'indent-line-function) 'indent-relative-maybe)
1055 @end group
1056 @group
1057 (setq mode-name "Text")
1058 (setq major-mode 'text-mode)
1059 (run-mode-hooks 'text-mode-hook)) ; @r{Finally, this permits the user to}
1060 ; @r{customize the mode with a hook.}
1061 @end group
1062 @end smallexample
1063
1064 @cindex @file{lisp-mode.el}
1065 The three Lisp modes (Lisp mode, Emacs Lisp mode, and Lisp
1066 Interaction mode) have more features than Text mode and the code is
1067 correspondingly more complicated. Here are excerpts from
1068 @file{lisp-mode.el} that illustrate how these modes are written.
1069
1070 @cindex syntax table example
1071 @smallexample
1072 @group
1073 ;; @r{Create mode-specific table variables.}
1074 (defvar lisp-mode-syntax-table nil "")
1075 (defvar lisp-mode-abbrev-table nil "")
1076 @end group
1077
1078 @group
1079 (defvar emacs-lisp-mode-syntax-table
1080 (let ((table (make-syntax-table)))
1081 (let ((i 0))
1082 @end group
1083
1084 @group
1085 ;; @r{Set syntax of chars up to @samp{0} to say they are}
1086 ;; @r{part of symbol names but not words.}
1087 ;; @r{(The digit @samp{0} is @code{48} in the @acronym{ASCII} character set.)}
1088 (while (< i ?0)
1089 (modify-syntax-entry i "_ " table)
1090 (setq i (1+ i)))
1091 ;; @r{@dots{} similar code follows for other character ranges.}
1092 @end group
1093 @group
1094 ;; @r{Then set the syntax codes for characters that are special in Lisp.}
1095 (modify-syntax-entry ? " " table)
1096 (modify-syntax-entry ?\t " " table)
1097 (modify-syntax-entry ?\f " " table)
1098 (modify-syntax-entry ?\n "> " table)
1099 @end group
1100 @group
1101 ;; @r{Give CR the same syntax as newline, for selective-display.}
1102 (modify-syntax-entry ?\^m "> " table)
1103 (modify-syntax-entry ?\; "< " table)
1104 (modify-syntax-entry ?` "' " table)
1105 (modify-syntax-entry ?' "' " table)
1106 (modify-syntax-entry ?, "' " table)
1107 @end group
1108 @group
1109 ;; @r{@dots{}likewise for many other characters@dots{}}
1110 (modify-syntax-entry ?\( "() " table)
1111 (modify-syntax-entry ?\) ")( " table)
1112 (modify-syntax-entry ?\[ "(] " table)
1113 (modify-syntax-entry ?\] ")[ " table))
1114 table))
1115 @end group
1116 @group
1117 ;; @r{Create an abbrev table for lisp-mode.}
1118 (define-abbrev-table 'lisp-mode-abbrev-table ())
1119 @end group
1120 @end smallexample
1121
1122 The three modes for Lisp share much of their code. For instance,
1123 each calls the following function to set various variables:
1124
1125 @smallexample
1126 @group
1127 (defun lisp-mode-variables (lisp-syntax)
1128 (when lisp-syntax
1129 (set-syntax-table lisp-mode-syntax-table))
1130 (setq local-abbrev-table lisp-mode-abbrev-table)
1131 @dots{}
1132 @end group
1133 @end smallexample
1134
1135 In Lisp and most programming languages, we want the paragraph
1136 commands to treat only blank lines as paragraph separators. And the
1137 modes should understand the Lisp conventions for comments. The rest of
1138 @code{lisp-mode-variables} sets this up:
1139
1140 @smallexample
1141 @group
1142 (set (make-local-variable 'paragraph-start) (concat page-delimiter "\\|$" ))
1143 (set (make-local-variable 'paragraph-separate) paragraph-start)
1144 @dots{}
1145 @end group
1146 @group
1147 (set (make-local-variable 'comment-indent-function) 'lisp-comment-indent))
1148 @dots{}
1149 @end group
1150 @end smallexample
1151
1152 Each of the different Lisp modes has a slightly different keymap. For
1153 example, Lisp mode binds @kbd{C-c C-z} to @code{run-lisp}, but the other
1154 Lisp modes do not. However, all Lisp modes have some commands in
1155 common. The following code sets up the common commands:
1156
1157 @smallexample
1158 @group
1159 (defvar shared-lisp-mode-map
1160 (let ((map (make-sparse-keymap)))
1161 (define-key shared-lisp-mode-map "\e\C-q" 'indent-sexp)
1162 (define-key shared-lisp-mode-map "\177"
1163 'backward-delete-char-untabify)
1164 map)
1165 "Keymap for commands shared by all sorts of Lisp modes.")
1166 @end group
1167 @end smallexample
1168
1169 @noindent
1170 And here is the code to set up the keymap for Lisp mode:
1171
1172 @smallexample
1173 @group
1174 (defvar lisp-mode-map
1175 (let ((map (make-sparse-keymap)))
1176 (set-keymap-parent map shared-lisp-mode-map)
1177 (define-key map "\e\C-x" 'lisp-eval-defun)
1178 (define-key map "\C-c\C-z" 'run-lisp)
1179 map)
1180 "Keymap for ordinary Lisp mode...")
1181 @end group
1182 @end smallexample
1183
1184 Finally, here is the complete major mode function definition for
1185 Lisp mode.
1186
1187 @smallexample
1188 @group
1189 (defun lisp-mode ()
1190 "Major mode for editing Lisp code for Lisps other than GNU Emacs Lisp.
1191 Commands:
1192 Delete converts tabs to spaces as it moves back.
1193 Blank lines separate paragraphs. Semicolons start comments.
1194 \\@{lisp-mode-map@}
1195 Note that `run-lisp' may be used either to start an inferior Lisp job
1196 or to switch back to an existing one.
1197 @end group
1198
1199 @group
1200 Entry to this mode calls the value of `lisp-mode-hook'
1201 if that value is non-nil."
1202 (interactive)
1203 (kill-all-local-variables)
1204 @end group
1205 @group
1206 (use-local-map lisp-mode-map) ; @r{Select the mode's keymap.}
1207 (setq major-mode 'lisp-mode) ; @r{This is how @code{describe-mode}}
1208 ; @r{finds out what to describe.}
1209 (setq mode-name "Lisp") ; @r{This goes into the mode line.}
1210 (lisp-mode-variables t) ; @r{This defines various variables.}
1211 (set (make-local-variable 'comment-start-skip)
1212 "\\(\\(^\\|[^\\\\\n]\\)\\(\\\\\\\\\\)*\\)\\(;+\\|#|\\) *")
1213 (set (make-local-variable 'font-lock-keywords-case-fold-search) t)
1214 @end group
1215 @group
1216 (setq imenu-case-fold-search t)
1217 (set-syntax-table lisp-mode-syntax-table)
1218 (run-mode-hooks 'lisp-mode-hook)) ; @r{This permits the user to use a}
1219 ; @r{hook to customize the mode.}
1220 @end group
1221 @end smallexample
1222
1223 @node Minor Modes
1224 @section Minor Modes
1225 @cindex minor mode
1226
1227 A @dfn{minor mode} provides features that users may enable or disable
1228 independently of the choice of major mode. Minor modes can be enabled
1229 individually or in combination. Minor modes would be better named
1230 ``generally available, optional feature modes,'' except that such a name
1231 would be unwieldy.
1232
1233 A minor mode is not usually meant as a variation of a single major mode.
1234 Usually they are general and can apply to many major modes. For
1235 example, Auto Fill mode works with any major mode that permits text
1236 insertion. To be general, a minor mode must be effectively independent
1237 of the things major modes do.
1238
1239 A minor mode is often much more difficult to implement than a major
1240 mode. One reason is that you should be able to activate and deactivate
1241 minor modes in any order. A minor mode should be able to have its
1242 desired effect regardless of the major mode and regardless of the other
1243 minor modes in effect.
1244
1245 Often the biggest problem in implementing a minor mode is finding a
1246 way to insert the necessary hook into the rest of Emacs. Minor mode
1247 keymaps make this easier than it used to be.
1248
1249 @defvar minor-mode-list
1250 The value of this variable is a list of all minor mode commands.
1251 @end defvar
1252
1253 @menu
1254 * Minor Mode Conventions:: Tips for writing a minor mode.
1255 * Keymaps and Minor Modes:: How a minor mode can have its own keymap.
1256 * Defining Minor Modes:: A convenient facility for defining minor modes.
1257 @end menu
1258
1259 @node Minor Mode Conventions
1260 @subsection Conventions for Writing Minor Modes
1261 @cindex minor mode conventions
1262 @cindex conventions for writing minor modes
1263
1264 There are conventions for writing minor modes just as there are for
1265 major modes. Several of the major mode conventions apply to minor
1266 modes as well: those regarding the name of the mode initialization
1267 function, the names of global symbols, the use of a hook at the end of
1268 the initialization function, and the use of keymaps and other tables.
1269
1270 In addition, there are several conventions that are specific to
1271 minor modes. (The easiest way to follow all the conventions is to use
1272 the macro @code{define-minor-mode}; @ref{Defining Minor Modes}.)
1273
1274 @itemize @bullet
1275 @item
1276 @cindex mode variable
1277 Make a variable whose name ends in @samp{-mode} to control the minor
1278 mode. We call this the @dfn{mode variable}. The minor mode command
1279 should set this variable (@code{nil} to disable; anything else to
1280 enable).
1281
1282 If possible, implement the mode so that setting the variable
1283 automatically enables or disables the mode. Then the minor mode command
1284 does not need to do anything except set the variable.
1285
1286 This variable is used in conjunction with the @code{minor-mode-alist} to
1287 display the minor mode name in the mode line. It can also enable
1288 or disable a minor mode keymap. Individual commands or hooks can also
1289 check the variable's value.
1290
1291 If you want the minor mode to be enabled separately in each buffer,
1292 make the variable buffer-local.
1293
1294 @item
1295 Define a command whose name is the same as the mode variable.
1296 Its job is to enable and disable the mode by setting the variable.
1297
1298 The command should accept one optional argument. If the argument is
1299 @code{nil}, it should toggle the mode (turn it on if it is off, and
1300 off if it is on). It should turn the mode on if the argument is a
1301 positive integer, the symbol @code{t}, or a list whose @sc{car} is one
1302 of those. It should turn the mode off if the argument is a negative
1303 integer or zero, the symbol @code{-}, or a list whose @sc{car} is a
1304 negative integer or zero. The meaning of other arguments is not
1305 specified.
1306
1307 Here is an example taken from the definition of @code{transient-mark-mode}.
1308 It shows the use of @code{transient-mark-mode} as a variable that enables or
1309 disables the mode's behavior, and also shows the proper way to toggle,
1310 enable or disable the minor mode based on the raw prefix argument value.
1311
1312 @smallexample
1313 @group
1314 (setq transient-mark-mode
1315 (if (null arg) (not transient-mark-mode)
1316 (> (prefix-numeric-value arg) 0)))
1317 @end group
1318 @end smallexample
1319
1320 @item
1321 Add an element to @code{minor-mode-alist} for each minor mode
1322 (@pxref{Definition of minor-mode-alist}), if you want to indicate the
1323 minor mode in the mode line. This element should be a list of the
1324 following form:
1325
1326 @smallexample
1327 (@var{mode-variable} @var{string})
1328 @end smallexample
1329
1330 Here @var{mode-variable} is the variable that controls enabling of the
1331 minor mode, and @var{string} is a short string, starting with a space,
1332 to represent the mode in the mode line. These strings must be short so
1333 that there is room for several of them at once.
1334
1335 When you add an element to @code{minor-mode-alist}, use @code{assq} to
1336 check for an existing element, to avoid duplication. For example:
1337
1338 @smallexample
1339 @group
1340 (unless (assq 'leif-mode minor-mode-alist)
1341 (setq minor-mode-alist
1342 (cons '(leif-mode " Leif") minor-mode-alist)))
1343 @end group
1344 @end smallexample
1345
1346 @noindent
1347 or like this, using @code{add-to-list} (@pxref{List Variables}):
1348
1349 @smallexample
1350 @group
1351 (add-to-list 'minor-mode-alist '(leif-mode " Leif"))
1352 @end group
1353 @end smallexample
1354 @end itemize
1355
1356 Global minor modes distributed with Emacs should if possible support
1357 enabling and disabling via Custom (@pxref{Customization}). To do this,
1358 the first step is to define the mode variable with @code{defcustom}, and
1359 specify @code{:type 'boolean}.
1360
1361 If just setting the variable is not sufficient to enable the mode, you
1362 should also specify a @code{:set} method which enables the mode by
1363 invoking the mode command. Note in the variable's documentation string that
1364 setting the variable other than via Custom may not take effect.
1365
1366 Also mark the definition with an autoload cookie (@pxref{autoload cookie}),
1367 and specify a @code{:require} so that customizing the variable will load
1368 the library that defines the mode. This will copy suitable definitions
1369 into @file{loaddefs.el} so that users can use @code{customize-option} to
1370 enable the mode. For example:
1371
1372 @smallexample
1373 @group
1374
1375 ;;;###autoload
1376 (defcustom msb-mode nil
1377 "Toggle msb-mode.
1378 Setting this variable directly does not take effect;
1379 use either \\[customize] or the function `msb-mode'."
1380 :set 'custom-set-minor-mode
1381 :initialize 'custom-initialize-default
1382 :version "20.4"
1383 :type 'boolean
1384 :group 'msb
1385 :require 'msb)
1386 @end group
1387 @end smallexample
1388
1389 @node Keymaps and Minor Modes
1390 @subsection Keymaps and Minor Modes
1391
1392 Each minor mode can have its own keymap, which is active when the mode
1393 is enabled. To set up a keymap for a minor mode, add an element to the
1394 alist @code{minor-mode-map-alist}. @xref{Definition of minor-mode-map-alist}.
1395
1396 @cindex @code{self-insert-command}, minor modes
1397 One use of minor mode keymaps is to modify the behavior of certain
1398 self-inserting characters so that they do something else as well as
1399 self-insert. In general, this is the only way to do that, since the
1400 facilities for customizing @code{self-insert-command} are limited to
1401 special cases (designed for abbrevs and Auto Fill mode). (Do not try
1402 substituting your own definition of @code{self-insert-command} for the
1403 standard one. The editor command loop handles this function specially.)
1404
1405 The key sequences bound in a minor mode should consist of @kbd{C-c}
1406 followed by one of @kbd{.,/?`'"[]\|~!#$%^&*()-_+=}. (The other
1407 punctuation characters are reserved for major modes.)
1408
1409 @node Defining Minor Modes
1410 @subsection Defining Minor Modes
1411
1412 The macro @code{define-minor-mode} offers a convenient way of
1413 implementing a mode in one self-contained definition.
1414
1415 @defmac define-minor-mode mode doc [init-value [lighter [keymap]]] keyword-args@dots{} body@dots{}
1416 This macro defines a new minor mode whose name is @var{mode} (a
1417 symbol). It defines a command named @var{mode} to toggle the minor
1418 mode, with @var{doc} as its documentation string. It also defines a
1419 variable named @var{mode}, which is set to @code{t} or @code{nil} by
1420 enabling or disabling the mode. The variable is initialized to
1421 @var{init-value}. Except in unusual circumstances (see below), this
1422 value must be @code{nil}.
1423
1424 The string @var{lighter} says what to display in the mode line
1425 when the mode is enabled; if it is @code{nil}, the mode is not displayed
1426 in the mode line.
1427
1428 The optional argument @var{keymap} specifies the keymap for the minor
1429 mode. If non-@code{nil}, it should be a variable name (whose value is
1430 a keymap), a keymap, or an alist of the form
1431
1432 @example
1433 (@var{key-sequence} . @var{definition})
1434 @end example
1435
1436 @noindent
1437 where each @var{key-sequence} and @var{definition} are arguments
1438 suitable for passing to @code{define-key} (@pxref{Changing Key
1439 Bindings}). If @var{keymap} is a keymap or an alist, this also
1440 defines the variable @code{@var{mode}-map}.
1441
1442 The above three arguments @var{init-value}, @var{lighter}, and
1443 @var{keymap} can be (partially) omitted when @var{keyword-args} are
1444 used. The @var{keyword-args} consist of keywords followed by
1445 corresponding values. A few keywords have special meanings:
1446
1447 @table @code
1448 @item :group @var{group}
1449 Custom group name to use in all generated @code{defcustom} forms.
1450 Defaults to @var{mode} without the possible trailing @samp{-mode}.
1451 @strong{Warning:} don't use this default group name unless you have
1452 written a @code{defgroup} to define that group properly. @xref{Group
1453 Definitions}.
1454
1455 @item :global @var{global}
1456 If non-@code{nil}, this specifies that the minor mode should be global
1457 rather than buffer-local. It defaults to @code{nil}.
1458
1459 One of the effects of making a minor mode global is that the
1460 @var{mode} variable becomes a customization variable. Toggling it
1461 through the Custom interface turns the mode on and off, and its value
1462 can be saved for future Emacs sessions (@pxref{Saving
1463 Customizations,,, emacs, The GNU Emacs Manual}. For the saved
1464 variable to work, you should ensure that the @code{define-minor-mode}
1465 form is evaluated each time Emacs starts; for packages that are not
1466 part of Emacs, the easiest way to do this is to specify a
1467 @code{:require} keyword.
1468
1469 @item :init-value @var{init-value}
1470 This is equivalent to specifying @var{init-value} positionally.
1471
1472 @item :lighter @var{lighter}
1473 This is equivalent to specifying @var{lighter} positionally.
1474
1475 @item :keymap @var{keymap}
1476 This is equivalent to specifying @var{keymap} positionally.
1477 @end table
1478
1479 Any other keyword arguments are passed directly to the
1480 @code{defcustom} generated for the variable @var{mode}.
1481
1482 The command named @var{mode} first performs the standard actions such
1483 as setting the variable named @var{mode} and then executes the
1484 @var{body} forms, if any. It finishes by running the mode hook
1485 variable @code{@var{mode}-hook}.
1486 @end defmac
1487
1488 The initial value must be @code{nil} except in cases where (1) the
1489 mode is preloaded in Emacs, or (2) it is painless for loading to
1490 enable the mode even though the user did not request it. For
1491 instance, if the mode has no effect unless something else is enabled,
1492 and will always be loaded by that time, enabling it by default is
1493 harmless. But these are unusual circumstances. Normally, the
1494 initial value must be @code{nil}.
1495
1496 @findex easy-mmode-define-minor-mode
1497 The name @code{easy-mmode-define-minor-mode} is an alias
1498 for this macro.
1499
1500 Here is an example of using @code{define-minor-mode}:
1501
1502 @smallexample
1503 (define-minor-mode hungry-mode
1504 "Toggle Hungry mode.
1505 With no argument, this command toggles the mode.
1506 Non-null prefix argument turns on the mode.
1507 Null prefix argument turns off the mode.
1508
1509 When Hungry mode is enabled, the control delete key
1510 gobbles all preceding whitespace except the last.
1511 See the command \\[hungry-electric-delete]."
1512 ;; The initial value.
1513 nil
1514 ;; The indicator for the mode line.
1515 " Hungry"
1516 ;; The minor mode bindings.
1517 '(([C-backspace] . hungry-electric-delete))
1518 :group 'hunger)
1519 @end smallexample
1520
1521 @noindent
1522 This defines a minor mode named ``Hungry mode,'' a command named
1523 @code{hungry-mode} to toggle it, a variable named @code{hungry-mode}
1524 which indicates whether the mode is enabled, and a variable named
1525 @code{hungry-mode-map} which holds the keymap that is active when the
1526 mode is enabled. It initializes the keymap with a key binding for
1527 @kbd{C-@key{DEL}}. It puts the variable @code{hungry-mode} into
1528 custom group @code{hunger}. There are no @var{body} forms---many
1529 minor modes don't need any.
1530
1531 Here's an equivalent way to write it:
1532
1533 @smallexample
1534 (define-minor-mode hungry-mode
1535 "Toggle Hungry mode.
1536 With no argument, this command toggles the mode.
1537 Non-null prefix argument turns on the mode.
1538 Null prefix argument turns off the mode.
1539
1540 When Hungry mode is enabled, the control delete key
1541 gobbles all preceding whitespace except the last.
1542 See the command \\[hungry-electric-delete]."
1543 ;; The initial value.
1544 :init-value nil
1545 ;; The indicator for the mode line.
1546 :lighter " Hungry"
1547 ;; The minor mode bindings.
1548 :keymap
1549 '(([C-backspace] . hungry-electric-delete)
1550 ([C-M-backspace]
1551 . (lambda ()
1552 (interactive)
1553 (hungry-electric-delete t))))
1554 :group 'hunger)
1555 @end smallexample
1556
1557 @defmac define-globalized-minor-mode global-mode mode turn-on keyword-args@dots{}
1558 This defines a global toggle named @var{global-mode} whose meaning is
1559 to enable or disable the buffer-local minor mode @var{mode} in all
1560 buffers. To turn on the minor mode in a buffer, it uses the function
1561 @var{turn-on}; to turn off the minor mode, it calls @code{mode} with
1562 @minus{}1 as argument.
1563
1564 Globally enabling the mode also affects buffers subsequently created
1565 by visiting files, and buffers that use a major mode other than
1566 Fundamental mode; but it does not detect the creation of a new buffer
1567 in Fundamental mode.
1568
1569 This defines the customization option @var{global-mode} (@pxref{Customization}),
1570 which can be toggled in the Custom interface to turn the minor mode on
1571 and off. As with @code{define-minor-mode}, you should ensure that the
1572 @code{define-globalized-minor-mode} form is evaluated each time Emacs
1573 starts, for example by providing a @code{:require} keyword.
1574
1575 Use @code{:group @var{group}} in @var{keyword-args} to specify the
1576 custom group for the mode variable of the global minor mode.
1577 @end defmac
1578
1579 @node Mode Line Format
1580 @section Mode-Line Format
1581 @cindex mode line
1582
1583 Each Emacs window (aside from minibuffer windows) typically has a mode
1584 line at the bottom, which displays status information about the buffer
1585 displayed in the window. The mode line contains information about the
1586 buffer, such as its name, associated file, depth of recursive editing,
1587 and major and minor modes. A window can also have a @dfn{header
1588 line}, which is much like the mode line but appears at the top of the
1589 window.
1590
1591 This section describes how to control the contents of the mode line
1592 and header line. We include it in this chapter because much of the
1593 information displayed in the mode line relates to the enabled major and
1594 minor modes.
1595
1596 @menu
1597 * Base: Mode Line Basics. Basic ideas of mode line control.
1598 * Data: Mode Line Data. The data structure that controls the mode line.
1599 * Top: Mode Line Top. The top level variable, mode-line-format.
1600 * Mode Line Variables:: Variables used in that data structure.
1601 * %-Constructs:: Putting information into a mode line.
1602 * Properties in Mode:: Using text properties in the mode line.
1603 * Header Lines:: Like a mode line, but at the top.
1604 * Emulating Mode Line:: Formatting text as the mode line would.
1605 @end menu
1606
1607 @node Mode Line Basics
1608 @subsection Mode Line Basics
1609
1610 @code{mode-line-format} is a buffer-local variable that holds a
1611 @dfn{mode line construct}, a kind of template, which controls what is
1612 displayed on the mode line of the current buffer. The value of
1613 @code{header-line-format} specifies the buffer's header line in the
1614 same way. All windows for the same buffer use the same
1615 @code{mode-line-format} and @code{header-line-format}.
1616
1617 For efficiency, Emacs does not continuously recompute the mode
1618 line and header line of a window. It does so when circumstances
1619 appear to call for it---for instance, if you change the window
1620 configuration, switch buffers, narrow or widen the buffer, scroll, or
1621 change the buffer's modification status. If you modify any of the
1622 variables referenced by @code{mode-line-format} (@pxref{Mode Line
1623 Variables}), or any other variables and data structures that affect
1624 how text is displayed (@pxref{Display}), you may want to force an
1625 update of the mode line so as to display the new information or
1626 display it in the new way.
1627
1628 @defun force-mode-line-update &optional all
1629 Force redisplay of the current buffer's mode line and header line.
1630 The next redisplay will update the mode line and header line based on
1631 the latest values of all relevant variables. With optional
1632 non-@code{nil} @var{all}, force redisplay of all mode lines and header
1633 lines.
1634
1635 This function also forces recomputation of the menu bar menus
1636 and the frame title.
1637 @end defun
1638
1639 The selected window's mode line is usually displayed in a different
1640 color using the face @code{mode-line}. Other windows' mode lines
1641 appear in the face @code{mode-line-inactive} instead. @xref{Faces}.
1642
1643 @node Mode Line Data
1644 @subsection The Data Structure of the Mode Line
1645 @cindex mode-line construct
1646
1647 The mode-line contents are controlled by a data structure called a
1648 @dfn{mode-line construct}, made up of lists, strings, symbols, and
1649 numbers kept in buffer-local variables. Each data type has a specific
1650 meaning for the mode-line appearance, as described below. The same
1651 data structure is used for constructing frame titles (@pxref{Frame
1652 Titles}) and header lines (@pxref{Header Lines}).
1653
1654 A mode-line construct may be as simple as a fixed string of text,
1655 but it usually specifies how to combine fixed strings with variables'
1656 values to construct the text. Many of these variables are themselves
1657 defined to have mode-line constructs as their values.
1658
1659 Here are the meanings of various data types as mode-line constructs:
1660
1661 @table @code
1662 @cindex percent symbol in mode line
1663 @item @var{string}
1664 A string as a mode-line construct appears verbatim except for
1665 @dfn{@code{%}-constructs} in it. These stand for substitution of
1666 other data; see @ref{%-Constructs}.
1667
1668 If parts of the string have @code{face} properties, they control
1669 display of the text just as they would text in the buffer. Any
1670 characters which have no @code{face} properties are displayed, by
1671 default, in the face @code{mode-line} or @code{mode-line-inactive}
1672 (@pxref{Standard Faces,,, emacs, The GNU Emacs Manual}). The
1673 @code{help-echo} and @code{local-map} properties in @var{string} have
1674 special meanings. @xref{Properties in Mode}.
1675
1676 @item @var{symbol}
1677 A symbol as a mode-line construct stands for its value. The value of
1678 @var{symbol} is used as a mode-line construct, in place of @var{symbol}.
1679 However, the symbols @code{t} and @code{nil} are ignored, as is any
1680 symbol whose value is void.
1681
1682 There is one exception: if the value of @var{symbol} is a string, it is
1683 displayed verbatim: the @code{%}-constructs are not recognized.
1684
1685 Unless @var{symbol} is marked as ``risky'' (i.e., it has a
1686 non-@code{nil} @code{risky-local-variable} property), all text
1687 properties specified in @var{symbol}'s value are ignored. This
1688 includes the text properties of strings in @var{symbol}'s value, as
1689 well as all @code{:eval} and @code{:propertize} forms in it. (The
1690 reason for this is security: non-risky variables could be set
1691 automatically from file variables without prompting the user.)
1692
1693 @item (@var{string} @var{rest}@dots{})
1694 @itemx (@var{list} @var{rest}@dots{})
1695 A list whose first element is a string or list means to process all the
1696 elements recursively and concatenate the results. This is the most
1697 common form of mode-line construct.
1698
1699 @item (:eval @var{form})
1700 A list whose first element is the symbol @code{:eval} says to evaluate
1701 @var{form}, and use the result as a string to display. Make sure this
1702 evaluation cannot load any files, as doing so could cause infinite
1703 recursion.
1704
1705 @item (:propertize @var{elt} @var{props}@dots{})
1706 A list whose first element is the symbol @code{:propertize} says to
1707 process the mode-line construct @var{elt} recursively, then add the text
1708 properties specified by @var{props} to the result. The argument
1709 @var{props} should consist of zero or more pairs @var{text-property}
1710 @var{value}. (This feature is new as of Emacs 22.1.)
1711
1712 @item (@var{symbol} @var{then} @var{else})
1713 A list whose first element is a symbol that is not a keyword specifies
1714 a conditional. Its meaning depends on the value of @var{symbol}. If
1715 @var{symbol} has a non-@code{nil} value, the second element,
1716 @var{then}, is processed recursively as a mode-line element.
1717 Otherwise, the third element, @var{else}, is processed recursively.
1718 You may omit @var{else}; then the mode-line element displays nothing
1719 if the value of @var{symbol} is @code{nil} or void.
1720
1721 @item (@var{width} @var{rest}@dots{})
1722 A list whose first element is an integer specifies truncation or
1723 padding of the results of @var{rest}. The remaining elements
1724 @var{rest} are processed recursively as mode-line constructs and
1725 concatenated together. When @var{width} is positive, the result is
1726 space filled on the right if its width is less than @var{width}. When
1727 @var{width} is negative, the result is truncated on the right to
1728 @minus{}@var{width} columns if its width exceeds @minus{}@var{width}.
1729
1730 For example, the usual way to show what percentage of a buffer is above
1731 the top of the window is to use a list like this: @code{(-3 "%p")}.
1732 @end table
1733
1734 @node Mode Line Top
1735 @subsection The Top Level of Mode Line Control
1736
1737 The variable in overall control of the mode line is
1738 @code{mode-line-format}.
1739
1740 @defopt mode-line-format
1741 The value of this variable is a mode-line construct that controls the
1742 contents of the mode-line. It is always buffer-local in all buffers.
1743
1744 If you set this variable to @code{nil} in a buffer, that buffer does
1745 not have a mode line. (A window that is just one line tall never
1746 displays a mode line.)
1747 @end defopt
1748
1749 The default value of @code{mode-line-format} is designed to use the
1750 values of other variables such as @code{mode-line-position} and
1751 @code{mode-line-modes} (which in turn incorporates the values of the
1752 variables @code{mode-name} and @code{minor-mode-alist}). Very few
1753 modes need to alter @code{mode-line-format} itself. For most
1754 purposes, it is sufficient to alter some of the variables that
1755 @code{mode-line-format} either directly or indirectly refers to.
1756
1757 If you do alter @code{mode-line-format} itself, the new value should
1758 use the same variables that appear in the default value (@pxref{Mode
1759 Line Variables}), rather than duplicating their contents or displaying
1760 the information in another fashion. This way, customizations made by
1761 the user or by Lisp programs (such as @code{display-time} and major
1762 modes) via changes to those variables remain effective.
1763
1764 Here is an example of a @code{mode-line-format} that might be
1765 useful for @code{shell-mode}, since it contains the host name and default
1766 directory.
1767
1768 @example
1769 @group
1770 (setq mode-line-format
1771 (list "-"
1772 'mode-line-mule-info
1773 'mode-line-modified
1774 'mode-line-frame-identification
1775 "%b--"
1776 @end group
1777 @group
1778 ;; @r{Note that this is evaluated while making the list.}
1779 ;; @r{It makes a mode-line construct which is just a string.}
1780 (getenv "HOST")
1781 @end group
1782 ":"
1783 'default-directory
1784 " "
1785 'global-mode-string
1786 " %[("
1787 '(:eval (mode-line-mode-name))
1788 'mode-line-process
1789 'minor-mode-alist
1790 "%n"
1791 ")%]--"
1792 @group
1793 '(which-func-mode ("" which-func-format "--"))
1794 '(line-number-mode "L%l--")
1795 '(column-number-mode "C%c--")
1796 '(-3 "%p")
1797 "-%-"))
1798 @end group
1799 @end example
1800
1801 @noindent
1802 (The variables @code{line-number-mode}, @code{column-number-mode}
1803 and @code{which-func-mode} enable particular minor modes; as usual,
1804 these variable names are also the minor mode command names.)
1805
1806 @node Mode Line Variables
1807 @subsection Variables Used in the Mode Line
1808
1809 This section describes variables incorporated by the standard value
1810 of @code{mode-line-format} into the text of the mode line. There is
1811 nothing inherently special about these variables; any other variables
1812 could have the same effects on the mode line if
1813 @code{mode-line-format}'s value were changed to use them. However,
1814 various parts of Emacs set these variables on the understanding that
1815 they will control parts of the mode line; therefore, practically
1816 speaking, it is essential for the mode line to use them.
1817
1818 @defvar mode-line-mule-info
1819 This variable holds the value of the mode-line construct that displays
1820 information about the language environment, buffer coding system, and
1821 current input method. @xref{Non-ASCII Characters}.
1822 @end defvar
1823
1824 @defvar mode-line-modified
1825 This variable holds the value of the mode-line construct that displays
1826 whether the current buffer is modified. Its default value displays
1827 @samp{**} if the buffer is modified, @samp{--} if the buffer is not
1828 modified, @samp{%%} if the buffer is read only, and @samp{%*} if the
1829 buffer is read only and modified.
1830
1831 Changing this variable does not force an update of the mode line.
1832 @end defvar
1833
1834 @defvar mode-line-frame-identification
1835 This variable identifies the current frame. Its default value
1836 displays @code{" "} if you are using a window system which can show
1837 multiple frames, or @code{"-%F "} on an ordinary terminal which shows
1838 only one frame at a time.
1839 @end defvar
1840
1841 @defvar mode-line-buffer-identification
1842 This variable identifies the buffer being displayed in the window.
1843 Its default value displays the buffer name, padded with spaces to at
1844 least 12 columns.
1845 @end defvar
1846
1847 @defopt mode-line-position
1848 This variable indicates the position in the buffer. Its default value
1849 displays the buffer percentage and, optionally, the buffer size, the
1850 line number and the column number.
1851 @end defopt
1852
1853 @defvar vc-mode
1854 The variable @code{vc-mode}, buffer-local in each buffer, records
1855 whether the buffer's visited file is maintained with version control,
1856 and, if so, which kind. Its value is a string that appears in the mode
1857 line, or @code{nil} for no version control.
1858 @end defvar
1859
1860 @defopt mode-line-modes
1861 This variable displays the buffer's major and minor modes. Its
1862 default value also displays the recursive editing level, information
1863 on the process status, and whether narrowing is in effect.
1864 @end defopt
1865
1866 The following three variables are used in @code{mode-line-modes}:
1867
1868 @defvar mode-name
1869 This buffer-local variable holds the ``pretty'' name of the current
1870 buffer's major mode. Each major mode should set this variable so that
1871 the mode name will appear in the mode line. The value does not have
1872 to be a string, but can use any of the data types valid in a mode-line
1873 construct (@pxref{Mode Line Data}). To compute the string that will
1874 identify the mode name in the mode line, use @code{format-mode-line}
1875 (@pxref{Emulating Mode Line}).
1876 @end defvar
1877
1878 @defvar mode-line-process
1879 This buffer-local variable contains the mode-line information on process
1880 status in modes used for communicating with subprocesses. It is
1881 displayed immediately following the major mode name, with no intervening
1882 space. For example, its value in the @samp{*shell*} buffer is
1883 @code{(":%s")}, which allows the shell to display its status along
1884 with the major mode as: @samp{(Shell:run)}. Normally this variable
1885 is @code{nil}.
1886 @end defvar
1887
1888 @defvar minor-mode-alist
1889 @anchor{Definition of minor-mode-alist}
1890 This variable holds an association list whose elements specify how the
1891 mode line should indicate that a minor mode is active. Each element of
1892 the @code{minor-mode-alist} should be a two-element list:
1893
1894 @example
1895 (@var{minor-mode-variable} @var{mode-line-string})
1896 @end example
1897
1898 More generally, @var{mode-line-string} can be any mode-line spec. It
1899 appears in the mode line when the value of @var{minor-mode-variable}
1900 is non-@code{nil}, and not otherwise. These strings should begin with
1901 spaces so that they don't run together. Conventionally, the
1902 @var{minor-mode-variable} for a specific mode is set to a
1903 non-@code{nil} value when that minor mode is activated.
1904
1905 @code{minor-mode-alist} itself is not buffer-local. Each variable
1906 mentioned in the alist should be buffer-local if its minor mode can be
1907 enabled separately in each buffer.
1908 @end defvar
1909
1910 @defvar global-mode-string
1911 This variable holds a mode-line spec that, by default, appears in the
1912 mode line just after the @code{which-func-mode} minor mode if set,
1913 else after @code{mode-line-modes}. The command @code{display-time}
1914 sets @code{global-mode-string} to refer to the variable
1915 @code{display-time-string}, which holds a string containing the time
1916 and load information.
1917
1918 The @samp{%M} construct substitutes the value of
1919 @code{global-mode-string}, but that is obsolete, since the variable is
1920 included in the mode line from @code{mode-line-format}.
1921 @end defvar
1922
1923 Here is a simplified version of the default value of
1924 @code{mode-line-format}. The real default value also
1925 specifies addition of text properties.
1926
1927 @example
1928 @group
1929 ("-"
1930 mode-line-mule-info
1931 mode-line-modified
1932 mode-line-frame-identification
1933 mode-line-buffer-identification
1934 @end group
1935 " "
1936 mode-line-position
1937 (vc-mode vc-mode)
1938 " "
1939 @group
1940 mode-line-modes
1941 (which-func-mode ("" which-func-format "--"))
1942 (global-mode-string ("--" global-mode-string))
1943 "-%-")
1944 @end group
1945 @end example
1946
1947 @node %-Constructs
1948 @subsection @code{%}-Constructs in the Mode Line
1949
1950 Strings used as mode-line constructs can use certain
1951 @code{%}-constructs to substitute various kinds of data. Here is a
1952 list of the defined @code{%}-constructs, and what they mean. In any
1953 construct except @samp{%%}, you can add a decimal integer after the
1954 @samp{%} to specify a minimum field width. If the width is less, the
1955 field is padded with spaces to the right.
1956
1957 @table @code
1958 @item %b
1959 The current buffer name, obtained with the @code{buffer-name} function.
1960 @xref{Buffer Names}.
1961
1962 @item %c
1963 The current column number of point.
1964
1965 @item %e
1966 When Emacs is nearly out of memory for Lisp objects, a brief message
1967 saying so. Otherwise, this is empty.
1968
1969 @item %f
1970 The visited file name, obtained with the @code{buffer-file-name}
1971 function. @xref{Buffer File Name}.
1972
1973 @item %F
1974 The title (only on a window system) or the name of the selected frame.
1975 @xref{Basic Parameters}.
1976
1977 @item %i
1978 The size of the accessible part of the current buffer; basically
1979 @code{(- (point-max) (point-min))}.
1980
1981 @item %I
1982 Like @samp{%i}, but the size is printed in a more readable way by using
1983 @samp{k} for 10^3, @samp{M} for 10^6, @samp{G} for 10^9, etc., to
1984 abbreviate.
1985
1986 @item %l
1987 The current line number of point, counting within the accessible portion
1988 of the buffer.
1989
1990 @item %n
1991 @samp{Narrow} when narrowing is in effect; nothing otherwise (see
1992 @code{narrow-to-region} in @ref{Narrowing}).
1993
1994 @item %p
1995 The percentage of the buffer text above the @strong{top} of window, or
1996 @samp{Top}, @samp{Bottom} or @samp{All}. Note that the default
1997 mode-line specification truncates this to three characters.
1998
1999 @item %P
2000 The percentage of the buffer text that is above the @strong{bottom} of
2001 the window (which includes the text visible in the window, as well as
2002 the text above the top), plus @samp{Top} if the top of the buffer is
2003 visible on screen; or @samp{Bottom} or @samp{All}.
2004
2005 @item %s
2006 The status of the subprocess belonging to the current buffer, obtained with
2007 @code{process-status}. @xref{Process Information}.
2008
2009 @item %t
2010 Whether the visited file is a text file or a binary file. This is a
2011 meaningful distinction only on certain operating systems (@pxref{MS-DOS
2012 File Types}).
2013
2014 @item %z
2015 The mnemonics of keyboard, terminal, and buffer coding systems.
2016
2017 @item %Z
2018 Like @samp{%z}, but including the end-of-line format.
2019
2020 @item %*
2021 @samp{%} if the buffer is read only (see @code{buffer-read-only}); @*
2022 @samp{*} if the buffer is modified (see @code{buffer-modified-p}); @*
2023 @samp{-} otherwise. @xref{Buffer Modification}.
2024
2025 @item %+
2026 @samp{*} if the buffer is modified (see @code{buffer-modified-p}); @*
2027 @samp{%} if the buffer is read only (see @code{buffer-read-only}); @*
2028 @samp{-} otherwise. This differs from @samp{%*} only for a modified
2029 read-only buffer. @xref{Buffer Modification}.
2030
2031 @item %&
2032 @samp{*} if the buffer is modified, and @samp{-} otherwise.
2033
2034 @item %[
2035 An indication of the depth of recursive editing levels (not counting
2036 minibuffer levels): one @samp{[} for each editing level.
2037 @xref{Recursive Editing}.
2038
2039 @item %]
2040 One @samp{]} for each recursive editing level (not counting minibuffer
2041 levels).
2042
2043 @item %-
2044 Dashes sufficient to fill the remainder of the mode line.
2045
2046 @item %%
2047 The character @samp{%}---this is how to include a literal @samp{%} in a
2048 string in which @code{%}-constructs are allowed.
2049 @end table
2050
2051 The following two @code{%}-constructs are still supported, but they are
2052 obsolete, since you can get the same results with the variables
2053 @code{mode-name} and @code{global-mode-string}.
2054
2055 @table @code
2056 @item %m
2057 The value of @code{mode-name}.
2058
2059 @item %M
2060 The value of @code{global-mode-string}.
2061 @end table
2062
2063 @node Properties in Mode
2064 @subsection Properties in the Mode Line
2065 @cindex text properties in the mode line
2066
2067 Certain text properties are meaningful in the
2068 mode line. The @code{face} property affects the appearance of text; the
2069 @code{help-echo} property associates help strings with the text, and
2070 @code{local-map} can make the text mouse-sensitive.
2071
2072 There are four ways to specify text properties for text in the mode
2073 line:
2074
2075 @enumerate
2076 @item
2077 Put a string with a text property directly into the mode-line data
2078 structure.
2079
2080 @item
2081 Put a text property on a mode-line %-construct such as @samp{%12b}; then
2082 the expansion of the %-construct will have that same text property.
2083
2084 @item
2085 Use a @code{(:propertize @var{elt} @var{props}@dots{})} construct to
2086 give @var{elt} a text property specified by @var{props}.
2087
2088 @item
2089 Use a list containing @code{:eval @var{form}} in the mode-line data
2090 structure, and make @var{form} evaluate to a string that has a text
2091 property.
2092 @end enumerate
2093
2094 You can use the @code{local-map} property to specify a keymap. This
2095 keymap only takes real effect for mouse clicks; binding character keys
2096 and function keys to it has no effect, since it is impossible to move
2097 point into the mode line.
2098
2099 When the mode line refers to a variable which does not have a
2100 non-@code{nil} @code{risky-local-variable} property, any text
2101 properties given or specified within that variable's values are
2102 ignored. This is because such properties could otherwise specify
2103 functions to be called, and those functions could come from file
2104 local variables.
2105
2106 @node Header Lines
2107 @subsection Window Header Lines
2108 @cindex header line (of a window)
2109 @cindex window header line
2110
2111 A window can have a @dfn{header line} at the
2112 top, just as it can have a mode line at the bottom. The header line
2113 feature works just like the mode-line feature, except that it's
2114 controlled by different variables.
2115
2116 @defvar header-line-format
2117 This variable, local in every buffer, specifies how to display the
2118 header line, for windows displaying the buffer. The format of the value
2119 is the same as for @code{mode-line-format} (@pxref{Mode Line Data}).
2120 It is normally @code{nil}, so that ordinary buffers have no header line.
2121 @end defvar
2122
2123 A window that is just one line tall never displays a header line. A
2124 window that is two lines tall cannot display both a mode line and a
2125 header line at once; if it has a mode line, then it does not display a
2126 header line.
2127
2128 @node Emulating Mode Line
2129 @subsection Emulating Mode-Line Formatting
2130
2131 You can use the function @code{format-mode-line} to compute
2132 the text that would appear in a mode line or header line
2133 based on a certain mode-line specification.
2134
2135 @defun format-mode-line format &optional face window buffer
2136 This function formats a line of text according to @var{format} as if it
2137 were generating the mode line for @var{window}, but it also returns the
2138 text as a string. The argument @var{window} defaults to the selected
2139 window. If @var{buffer} is non-@code{nil}, all the information used is
2140 taken from @var{buffer}; by default, it comes from @var{window}'s
2141 buffer.
2142
2143 The value string normally has text properties that correspond to the
2144 faces, keymaps, etc., that the mode line would have. Any character for
2145 which no @code{face} property is specified by @var{format} gets a
2146 default value determined by @var{face}. If @var{face} is @code{t}, that
2147 stands for either @code{mode-line} if @var{window} is selected,
2148 otherwise @code{mode-line-inactive}. If @var{face} is @code{nil} or
2149 omitted, that stands for the default face. If @var{face} is an integer,
2150 the value returned by this function will have no text properties.
2151
2152 You can also specify other valid faces as the value of @var{face}.
2153 If specified, that face provides the @code{face} property for characters
2154 whose face is not specified by @var{format}.
2155
2156 Note that using @code{mode-line}, @code{mode-line-inactive}, or
2157 @code{header-line} as @var{face} will actually redisplay the mode line
2158 or the header line, respectively, using the current definitions of the
2159 corresponding face, in addition to returning the formatted string.
2160 (Other faces do not cause redisplay.)
2161
2162 For example, @code{(format-mode-line header-line-format)} returns the
2163 text that would appear in the selected window's header line (@code{""}
2164 if it has no header line). @code{(format-mode-line header-line-format
2165 'header-line)} returns the same text, with each character
2166 carrying the face that it will have in the header line itself, and also
2167 redraws the header line.
2168 @end defun
2169
2170 @node Imenu
2171 @section Imenu
2172
2173 @cindex Imenu
2174 @dfn{Imenu} is a feature that lets users select a definition or
2175 section in the buffer, from a menu which lists all of them, to go
2176 directly to that location in the buffer. Imenu works by constructing
2177 a buffer index which lists the names and buffer positions of the
2178 definitions, or other named portions of the buffer; then the user can
2179 choose one of them and move point to it. Major modes can add a menu
2180 bar item to use Imenu using @code{imenu-add-to-menubar}.
2181
2182 @defun imenu-add-to-menubar name
2183 This function defines a local menu bar item named @var{name}
2184 to run Imenu.
2185 @end defun
2186
2187 The user-level commands for using Imenu are described in the Emacs
2188 Manual (@pxref{Imenu,, Imenu, emacs, the Emacs Manual}). This section
2189 explains how to customize Imenu's method of finding definitions or
2190 buffer portions for a particular major mode.
2191
2192 The usual and simplest way is to set the variable
2193 @code{imenu-generic-expression}:
2194
2195 @defvar imenu-generic-expression
2196 This variable, if non-@code{nil}, is a list that specifies regular
2197 expressions for finding definitions for Imenu. Simple elements of
2198 @code{imenu-generic-expression} look like this:
2199
2200 @example
2201 (@var{menu-title} @var{regexp} @var{index})
2202 @end example
2203
2204 Here, if @var{menu-title} is non-@code{nil}, it says that the matches
2205 for this element should go in a submenu of the buffer index;
2206 @var{menu-title} itself specifies the name for the submenu. If
2207 @var{menu-title} is @code{nil}, the matches for this element go directly
2208 in the top level of the buffer index.
2209
2210 The second item in the list, @var{regexp}, is a regular expression
2211 (@pxref{Regular Expressions}); anything in the buffer that it matches
2212 is considered a definition, something to mention in the buffer index.
2213 The third item, @var{index}, is a non-negative integer that indicates
2214 which subexpression in @var{regexp} matches the definition's name.
2215
2216 An element can also look like this:
2217
2218 @example
2219 (@var{menu-title} @var{regexp} @var{index} @var{function} @var{arguments}@dots{})
2220 @end example
2221
2222 Each match for this element creates an index item, and when the index
2223 item is selected by the user, it calls @var{function} with arguments
2224 consisting of the item name, the buffer position, and @var{arguments}.
2225
2226 For Emacs Lisp mode, @code{imenu-generic-expression} could look like
2227 this:
2228
2229 @c should probably use imenu-syntax-alist and \\sw rather than [-A-Za-z0-9+]
2230 @example
2231 @group
2232 ((nil "^\\s-*(def\\(un\\|subst\\|macro\\|advice\\)\
2233 \\s-+\\([-A-Za-z0-9+]+\\)" 2)
2234 @end group
2235 @group
2236 ("*Vars*" "^\\s-*(def\\(var\\|const\\)\
2237 \\s-+\\([-A-Za-z0-9+]+\\)" 2)
2238 @end group
2239 @group
2240 ("*Types*"
2241 "^\\s-*\
2242 (def\\(type\\|struct\\|class\\|ine-condition\\)\
2243 \\s-+\\([-A-Za-z0-9+]+\\)" 2))
2244 @end group
2245 @end example
2246
2247 Setting this variable makes it buffer-local in the current buffer.
2248 @end defvar
2249
2250 @defvar imenu-case-fold-search
2251 This variable controls whether matching against the regular
2252 expressions in the value of @code{imenu-generic-expression} is
2253 case-sensitive: @code{t}, the default, means matching should ignore
2254 case.
2255
2256 Setting this variable makes it buffer-local in the current buffer.
2257 @end defvar
2258
2259 @defvar imenu-syntax-alist
2260 This variable is an alist of syntax table modifiers to use while
2261 processing @code{imenu-generic-expression}, to override the syntax table
2262 of the current buffer. Each element should have this form:
2263
2264 @example
2265 (@var{characters} . @var{syntax-description})
2266 @end example
2267
2268 The @sc{car}, @var{characters}, can be either a character or a string.
2269 The element says to give that character or characters the syntax
2270 specified by @var{syntax-description}, which is passed to
2271 @code{modify-syntax-entry} (@pxref{Syntax Table Functions}).
2272
2273 This feature is typically used to give word syntax to characters which
2274 normally have symbol syntax, and thus to simplify
2275 @code{imenu-generic-expression} and speed up matching.
2276 For example, Fortran mode uses it this way:
2277
2278 @example
2279 (setq imenu-syntax-alist '(("_$" . "w")))
2280 @end example
2281
2282 The @code{imenu-generic-expression} regular expressions can then use
2283 @samp{\\sw+} instead of @samp{\\(\\sw\\|\\s_\\)+}. Note that this
2284 technique may be inconvenient when the mode needs to limit the initial
2285 character of a name to a smaller set of characters than are allowed in
2286 the rest of a name.
2287
2288 Setting this variable makes it buffer-local in the current buffer.
2289 @end defvar
2290
2291 Another way to customize Imenu for a major mode is to set the
2292 variables @code{imenu-prev-index-position-function} and
2293 @code{imenu-extract-index-name-function}:
2294
2295 @defvar imenu-prev-index-position-function
2296 If this variable is non-@code{nil}, its value should be a function that
2297 finds the next ``definition'' to put in the buffer index, scanning
2298 backward in the buffer from point. It should return @code{nil} if it
2299 doesn't find another ``definition'' before point. Otherwise it should
2300 leave point at the place it finds a ``definition'' and return any
2301 non-@code{nil} value.
2302
2303 Setting this variable makes it buffer-local in the current buffer.
2304 @end defvar
2305
2306 @defvar imenu-extract-index-name-function
2307 If this variable is non-@code{nil}, its value should be a function to
2308 return the name for a definition, assuming point is in that definition
2309 as the @code{imenu-prev-index-position-function} function would leave
2310 it.
2311
2312 Setting this variable makes it buffer-local in the current buffer.
2313 @end defvar
2314
2315 The last way to customize Imenu for a major mode is to set the
2316 variable @code{imenu-create-index-function}:
2317
2318 @defvar imenu-create-index-function
2319 This variable specifies the function to use for creating a buffer
2320 index. The function should take no arguments, and return an index
2321 alist for the current buffer. It is called within
2322 @code{save-excursion}, so where it leaves point makes no difference.
2323
2324 The index alist can have three types of elements. Simple elements
2325 look like this:
2326
2327 @example
2328 (@var{index-name} . @var{index-position})
2329 @end example
2330
2331 Selecting a simple element has the effect of moving to position
2332 @var{index-position} in the buffer. Special elements look like this:
2333
2334 @example
2335 (@var{index-name} @var{index-position} @var{function} @var{arguments}@dots{})
2336 @end example
2337
2338 Selecting a special element performs:
2339
2340 @example
2341 (funcall @var{function}
2342 @var{index-name} @var{index-position} @var{arguments}@dots{})
2343 @end example
2344
2345 A nested sub-alist element looks like this:
2346
2347 @example
2348 (@var{menu-title} @var{sub-alist})
2349 @end example
2350
2351 It creates the submenu @var{menu-title} specified by @var{sub-alist}.
2352
2353 The default value of @code{imenu-create-index-function} is
2354 @code{imenu-default-create-index-function}. This function calls the
2355 value of @code{imenu-prev-index-position-function} and the value of
2356 @code{imenu-extract-index-name-function} to produce the index alist.
2357 However, if either of these two variables is @code{nil}, the default
2358 function uses @code{imenu-generic-expression} instead.
2359
2360 Setting this variable makes it buffer-local in the current buffer.
2361 @end defvar
2362
2363 @node Font Lock Mode
2364 @section Font Lock Mode
2365 @cindex Font Lock mode
2366
2367 @dfn{Font Lock mode} is a feature that automatically attaches
2368 @code{face} properties to certain parts of the buffer based on their
2369 syntactic role. How it parses the buffer depends on the major mode;
2370 most major modes define syntactic criteria for which faces to use in
2371 which contexts. This section explains how to customize Font Lock for a
2372 particular major mode.
2373
2374 Font Lock mode finds text to highlight in two ways: through
2375 syntactic parsing based on the syntax table, and through searching
2376 (usually for regular expressions). Syntactic fontification happens
2377 first; it finds comments and string constants and highlights them.
2378 Search-based fontification happens second.
2379
2380 @menu
2381 * Font Lock Basics:: Overview of customizing Font Lock.
2382 * Search-based Fontification:: Fontification based on regexps.
2383 * Customizing Keywords:: Customizing search-based fontification.
2384 * Other Font Lock Variables:: Additional customization facilities.
2385 * Levels of Font Lock:: Each mode can define alternative levels
2386 so that the user can select more or less.
2387 * Precalculated Fontification:: How Lisp programs that produce the buffer
2388 contents can also specify how to fontify it.
2389 * Faces for Font Lock:: Special faces specifically for Font Lock.
2390 * Syntactic Font Lock:: Fontification based on syntax tables.
2391 * Setting Syntax Properties:: Defining character syntax based on context
2392 using the Font Lock mechanism.
2393 * Multiline Font Lock:: How to coerce Font Lock into properly
2394 highlighting multiline constructs.
2395 @end menu
2396
2397 @node Font Lock Basics
2398 @subsection Font Lock Basics
2399
2400 There are several variables that control how Font Lock mode highlights
2401 text. But major modes should not set any of these variables directly.
2402 Instead, they should set @code{font-lock-defaults} as a buffer-local
2403 variable. The value assigned to this variable is used, if and when Font
2404 Lock mode is enabled, to set all the other variables.
2405
2406 @defvar font-lock-defaults
2407 This variable is set by major modes, as a buffer-local variable, to
2408 specify how to fontify text in that mode. It automatically becomes
2409 buffer-local when you set it. If its value is @code{nil}, Font-Lock
2410 mode does no highlighting, and you can use the @samp{Faces} menu
2411 (under @samp{Edit} and then @samp{Text Properties} in the menu bar) to
2412 assign faces explicitly to text in the buffer.
2413
2414 If non-@code{nil}, the value should look like this:
2415
2416 @example
2417 (@var{keywords} [@var{keywords-only} [@var{case-fold}
2418 [@var{syntax-alist} [@var{syntax-begin} @var{other-vars}@dots{}]]]])
2419 @end example
2420
2421 The first element, @var{keywords}, indirectly specifies the value of
2422 @code{font-lock-keywords} which directs search-based fontification.
2423 It can be a symbol, a variable or a function whose value is the list
2424 to use for @code{font-lock-keywords}. It can also be a list of
2425 several such symbols, one for each possible level of fontification.
2426 The first symbol specifies the @samp{mode default} level of
2427 fontification, the next symbol level 1 fontification, the next level 2,
2428 and so on. The @samp{mode default} level is normally the same as level
2429 1. It is used when @code{font-lock-maximum-decoration} has a @code{nil}
2430 value. @xref{Levels of Font Lock}.
2431
2432 The second element, @var{keywords-only}, specifies the value of the
2433 variable @code{font-lock-keywords-only}. If this is omitted or
2434 @code{nil}, syntactic fontification (of strings and comments) is also
2435 performed. If this is non-@code{nil}, such fontification is not
2436 performed. @xref{Syntactic Font Lock}.
2437
2438 The third element, @var{case-fold}, specifies the value of
2439 @code{font-lock-keywords-case-fold-search}. If it is non-@code{nil},
2440 Font Lock mode ignores case when searching as directed by
2441 @code{font-lock-keywords}.
2442
2443 If the fourth element, @var{syntax-alist}, is non-@code{nil}, it
2444 should be a list of cons cells of the form @code{(@var{char-or-string}
2445 . @var{string})}. These are used to set up a syntax table for
2446 syntactic fontification (@pxref{Syntax Table Functions}). The
2447 resulting syntax table is stored in @code{font-lock-syntax-table}.
2448
2449 The fifth element, @var{syntax-begin}, specifies the value of
2450 @code{font-lock-beginning-of-syntax-function}. We recommend setting
2451 this variable to @code{nil} and using @code{syntax-begin-function}
2452 instead.
2453
2454 All the remaining elements (if any) are collectively called
2455 @var{other-vars}. Each of these elements should have the form
2456 @code{(@var{variable} . @var{value})}---which means, make
2457 @var{variable} buffer-local and then set it to @var{value}. You can
2458 use these @var{other-vars} to set other variables that affect
2459 fontification, aside from those you can control with the first five
2460 elements. @xref{Other Font Lock Variables}.
2461 @end defvar
2462
2463 If your mode fontifies text explicitly by adding
2464 @code{font-lock-face} properties, it can specify @code{(nil t)} for
2465 @code{font-lock-defaults} to turn off all automatic fontification.
2466 However, this is not required; it is possible to fontify some things
2467 using @code{font-lock-face} properties and set up automatic
2468 fontification for other parts of the text.
2469
2470 @node Search-based Fontification
2471 @subsection Search-based Fontification
2472
2473 The most important variable for customizing Font Lock mode is
2474 @code{font-lock-keywords}. It specifies the search criteria for
2475 search-based fontification. You should specify the value of this
2476 variable with @var{keywords} in @code{font-lock-defaults}.
2477
2478 @defvar font-lock-keywords
2479 This variable's value is a list of the keywords to highlight. Be
2480 careful when composing regular expressions for this list; a poorly
2481 written pattern can dramatically slow things down!
2482 @end defvar
2483
2484 Each element of @code{font-lock-keywords} specifies how to find
2485 certain cases of text, and how to highlight those cases. Font Lock mode
2486 processes the elements of @code{font-lock-keywords} one by one, and for
2487 each element, it finds and handles all matches. Ordinarily, once
2488 part of the text has been fontified already, this cannot be overridden
2489 by a subsequent match in the same text; but you can specify different
2490 behavior using the @var{override} element of a @var{subexp-highlighter}.
2491
2492 Each element of @code{font-lock-keywords} should have one of these
2493 forms:
2494
2495 @table @code
2496 @item @var{regexp}
2497 Highlight all matches for @var{regexp} using
2498 @code{font-lock-keyword-face}. For example,
2499
2500 @example
2501 ;; @r{Highlight occurrences of the word @samp{foo}}
2502 ;; @r{using @code{font-lock-keyword-face}.}
2503 "\\<foo\\>"
2504 @end example
2505
2506 The function @code{regexp-opt} (@pxref{Regexp Functions}) is useful
2507 for calculating optimal regular expressions to match a number of
2508 different keywords.
2509
2510 @item @var{function}
2511 Find text by calling @var{function}, and highlight the matches
2512 it finds using @code{font-lock-keyword-face}.
2513
2514 When @var{function} is called, it receives one argument, the limit of
2515 the search; it should begin searching at point, and not search beyond the
2516 limit. It should return non-@code{nil} if it succeeds, and set the
2517 match data to describe the match that was found. Returning @code{nil}
2518 indicates failure of the search.
2519
2520 Fontification will call @var{function} repeatedly with the same limit,
2521 and with point where the previous invocation left it, until
2522 @var{function} fails. On failure, @var{function} need not reset point
2523 in any particular way.
2524
2525 @item (@var{matcher} . @var{subexp})
2526 In this kind of element, @var{matcher} is either a regular
2527 expression or a function, as described above. The @sc{cdr},
2528 @var{subexp}, specifies which subexpression of @var{matcher} should be
2529 highlighted (instead of the entire text that @var{matcher} matched).
2530
2531 @example
2532 ;; @r{Highlight the @samp{bar} in each occurrence of @samp{fubar},}
2533 ;; @r{using @code{font-lock-keyword-face}.}
2534 ("fu\\(bar\\)" . 1)
2535 @end example
2536
2537 If you use @code{regexp-opt} to produce the regular expression
2538 @var{matcher}, you can use @code{regexp-opt-depth} (@pxref{Regexp
2539 Functions}) to calculate the value for @var{subexp}.
2540
2541 @item (@var{matcher} . @var{facespec})
2542 In this kind of element, @var{facespec} is an expression whose value
2543 specifies the face to use for highlighting. In the simplest case,
2544 @var{facespec} is a Lisp variable (a symbol) whose value is a face
2545 name.
2546
2547 @example
2548 ;; @r{Highlight occurrences of @samp{fubar},}
2549 ;; @r{using the face which is the value of @code{fubar-face}.}
2550 ("fubar" . fubar-face)
2551 @end example
2552
2553 However, @var{facespec} can also evaluate to a list of this form:
2554
2555 @example
2556 (face @var{face} @var{prop1} @var{val1} @var{prop2} @var{val2}@dots{})
2557 @end example
2558
2559 @noindent
2560 to specify the face @var{face} and various additional text properties
2561 to put on the text that matches. If you do this, be sure to add the
2562 other text property names that you set in this way to the value of
2563 @code{font-lock-extra-managed-props} so that the properties will also
2564 be cleared out when they are no longer appropriate. Alternatively,
2565 you can set the variable @code{font-lock-unfontify-region-function} to
2566 a function that clears these properties. @xref{Other Font Lock
2567 Variables}.
2568
2569 @item (@var{matcher} . @var{subexp-highlighter})
2570 In this kind of element, @var{subexp-highlighter} is a list
2571 which specifies how to highlight matches found by @var{matcher}.
2572 It has the form:
2573
2574 @example
2575 (@var{subexp} @var{facespec} [@var{override} [@var{laxmatch}]])
2576 @end example
2577
2578 The @sc{car}, @var{subexp}, is an integer specifying which subexpression
2579 of the match to fontify (0 means the entire matching text). The second
2580 subelement, @var{facespec}, is an expression whose value specifies the
2581 face, as described above.
2582
2583 The last two values in @var{subexp-highlighter}, @var{override} and
2584 @var{laxmatch}, are optional flags. If @var{override} is @code{t},
2585 this element can override existing fontification made by previous
2586 elements of @code{font-lock-keywords}. If it is @code{keep}, then
2587 each character is fontified if it has not been fontified already by
2588 some other element. If it is @code{prepend}, the face specified by
2589 @var{facespec} is added to the beginning of the @code{font-lock-face}
2590 property. If it is @code{append}, the face is added to the end of the
2591 @code{font-lock-face} property.
2592
2593 If @var{laxmatch} is non-@code{nil}, it means there should be no error
2594 if there is no subexpression numbered @var{subexp} in @var{matcher}.
2595 Obviously, fontification of the subexpression numbered @var{subexp} will
2596 not occur. However, fontification of other subexpressions (and other
2597 regexps) will continue. If @var{laxmatch} is @code{nil}, and the
2598 specified subexpression is missing, then an error is signaled which
2599 terminates search-based fontification.
2600
2601 Here are some examples of elements of this kind, and what they do:
2602
2603 @smallexample
2604 ;; @r{Highlight occurrences of either @samp{foo} or @samp{bar}, using}
2605 ;; @r{@code{foo-bar-face}, even if they have already been highlighted.}
2606 ;; @r{@code{foo-bar-face} should be a variable whose value is a face.}
2607 ("foo\\|bar" 0 foo-bar-face t)
2608
2609 ;; @r{Highlight the first subexpression within each occurrence}
2610 ;; @r{that the function @code{fubar-match} finds,}
2611 ;; @r{using the face which is the value of @code{fubar-face}.}
2612 (fubar-match 1 fubar-face)
2613 @end smallexample
2614
2615 @item (@var{matcher} . @var{anchored-highlighter})
2616 In this kind of element, @var{anchored-highlighter} specifies how to
2617 highlight text that follows a match found by @var{matcher}. So a
2618 match found by @var{matcher} acts as the anchor for further searches
2619 specified by @var{anchored-highlighter}. @var{anchored-highlighter}
2620 is a list of the following form:
2621
2622 @example
2623 (@var{anchored-matcher} @var{pre-form} @var{post-form}
2624 @var{subexp-highlighters}@dots{})
2625 @end example
2626
2627 Here, @var{anchored-matcher}, like @var{matcher}, is either a regular
2628 expression or a function. After a match of @var{matcher} is found,
2629 point is at the end of the match. Now, Font Lock evaluates the form
2630 @var{pre-form}. Then it searches for matches of
2631 @var{anchored-matcher} and uses @var{subexp-highlighters} to highlight
2632 these. A @var{subexp-highlighter} is as described above. Finally,
2633 Font Lock evaluates @var{post-form}.
2634
2635 The forms @var{pre-form} and @var{post-form} can be used to initialize
2636 before, and cleanup after, @var{anchored-matcher} is used. Typically,
2637 @var{pre-form} is used to move point to some position relative to the
2638 match of @var{matcher}, before starting with @var{anchored-matcher}.
2639 @var{post-form} might be used to move back, before resuming with
2640 @var{matcher}.
2641
2642 After Font Lock evaluates @var{pre-form}, it does not search for
2643 @var{anchored-matcher} beyond the end of the line. However, if
2644 @var{pre-form} returns a buffer position that is greater than the
2645 position of point after @var{pre-form} is evaluated, then the position
2646 returned by @var{pre-form} is used as the limit of the search instead.
2647 It is generally a bad idea to return a position greater than the end
2648 of the line; in other words, the @var{anchored-matcher} search should
2649 not span lines.
2650
2651 For example,
2652
2653 @smallexample
2654 ;; @r{Highlight occurrences of the word @samp{item} following}
2655 ;; @r{an occurrence of the word @samp{anchor} (on the same line)}
2656 ;; @r{in the value of @code{item-face}.}
2657 ("\\<anchor\\>" "\\<item\\>" nil nil (0 item-face))
2658 @end smallexample
2659
2660 Here, @var{pre-form} and @var{post-form} are @code{nil}. Therefore
2661 searching for @samp{item} starts at the end of the match of
2662 @samp{anchor}, and searching for subsequent instances of @samp{anchor}
2663 resumes from where searching for @samp{item} concluded.
2664
2665 @item (@var{matcher} @var{highlighters}@dots{})
2666 This sort of element specifies several @var{highlighter} lists for a
2667 single @var{matcher}. A @var{highlighter} list can be of the type
2668 @var{subexp-highlighter} or @var{anchored-highlighter} as described
2669 above.
2670
2671 For example,
2672
2673 @smallexample
2674 ;; @r{Highlight occurrences of the word @samp{anchor} in the value}
2675 ;; @r{of @code{anchor-face}, and subsequent occurrences of the word}
2676 ;; @r{@samp{item} (on the same line) in the value of @code{item-face}.}
2677 ("\\<anchor\\>" (0 anchor-face)
2678 ("\\<item\\>" nil nil (0 item-face)))
2679 @end smallexample
2680
2681 @item (eval . @var{form})
2682 Here @var{form} is an expression to be evaluated the first time
2683 this value of @code{font-lock-keywords} is used in a buffer.
2684 Its value should have one of the forms described in this table.
2685 @end table
2686
2687 @strong{Warning:} Do not design an element of @code{font-lock-keywords}
2688 to match text which spans lines; this does not work reliably.
2689 For details, see @xref{Multiline Font Lock}.
2690
2691 You can use @var{case-fold} in @code{font-lock-defaults} to specify
2692 the value of @code{font-lock-keywords-case-fold-search} which says
2693 whether search-based fontification should be case-insensitive.
2694
2695 @defvar font-lock-keywords-case-fold-search
2696 Non-@code{nil} means that regular expression matching for the sake of
2697 @code{font-lock-keywords} should be case-insensitive.
2698 @end defvar
2699
2700 @node Customizing Keywords
2701 @subsection Customizing Search-Based Fontification
2702
2703 You can use @code{font-lock-add-keywords} to add additional
2704 search-based fontification rules to a major mode, and
2705 @code{font-lock-remove-keywords} to remove rules.
2706
2707 @defun font-lock-add-keywords mode keywords &optional how
2708 This function adds highlighting @var{keywords}, for the current buffer
2709 or for major mode @var{mode}. The argument @var{keywords} should be a
2710 list with the same format as the variable @code{font-lock-keywords}.
2711
2712 If @var{mode} is a symbol which is a major mode command name, such as
2713 @code{c-mode}, the effect is that enabling Font Lock mode in
2714 @var{mode} will add @var{keywords} to @code{font-lock-keywords}.
2715 Calling with a non-@code{nil} value of @var{mode} is correct only in
2716 your @file{~/.emacs} file.
2717
2718 If @var{mode} is @code{nil}, this function adds @var{keywords} to
2719 @code{font-lock-keywords} in the current buffer. This way of calling
2720 @code{font-lock-add-keywords} is usually used in mode hook functions.
2721
2722 By default, @var{keywords} are added at the beginning of
2723 @code{font-lock-keywords}. If the optional argument @var{how} is
2724 @code{set}, they are used to replace the value of
2725 @code{font-lock-keywords}. If @var{how} is any other non-@code{nil}
2726 value, they are added at the end of @code{font-lock-keywords}.
2727
2728 Some modes provide specialized support you can use in additional
2729 highlighting patterns. See the variables
2730 @code{c-font-lock-extra-types}, @code{c++-font-lock-extra-types},
2731 and @code{java-font-lock-extra-types}, for example.
2732
2733 @strong{Warning:} major mode functions must not call
2734 @code{font-lock-add-keywords} under any circumstances, either directly
2735 or indirectly, except through their mode hooks. (Doing so would lead
2736 to incorrect behavior for some minor modes.) They should set up their
2737 rules for search-based fontification by setting
2738 @code{font-lock-keywords}.
2739 @end defun
2740
2741 @defun font-lock-remove-keywords mode keywords
2742 This function removes @var{keywords} from @code{font-lock-keywords}
2743 for the current buffer or for major mode @var{mode}. As in
2744 @code{font-lock-add-keywords}, @var{mode} should be a major mode
2745 command name or @code{nil}. All the caveats and requirements for
2746 @code{font-lock-add-keywords} apply here too.
2747 @end defun
2748
2749 For example, this code
2750
2751 @smallexample
2752 (font-lock-add-keywords 'c-mode
2753 '(("\\<\\(FIXME\\):" 1 font-lock-warning-face prepend)
2754 ("\\<\\(and\\|or\\|not\\)\\>" . font-lock-keyword-face)))
2755 @end smallexample
2756
2757 @noindent
2758 adds two fontification patterns for C mode: one to fontify the word
2759 @samp{FIXME}, even in comments, and another to fontify the words
2760 @samp{and}, @samp{or} and @samp{not} as keywords.
2761
2762 @noindent
2763 That example affects only C mode proper. To add the same patterns to
2764 C mode @emph{and} all modes derived from it, do this instead:
2765
2766 @smallexample
2767 (add-hook 'c-mode-hook
2768 (lambda ()
2769 (font-lock-add-keywords nil
2770 '(("\\<\\(FIXME\\):" 1 font-lock-warning-face prepend)
2771 ("\\<\\(and\\|or\\|not\\)\\>" .
2772 font-lock-keyword-face)))))
2773 @end smallexample
2774
2775 @node Other Font Lock Variables
2776 @subsection Other Font Lock Variables
2777
2778 This section describes additional variables that a major mode can
2779 set by means of @var{other-vars} in @code{font-lock-defaults}
2780 (@pxref{Font Lock Basics}).
2781
2782 @defvar font-lock-mark-block-function
2783 If this variable is non-@code{nil}, it should be a function that is
2784 called with no arguments, to choose an enclosing range of text for
2785 refontification for the command @kbd{M-o M-o}
2786 (@code{font-lock-fontify-block}).
2787
2788 The function should report its choice by placing the region around it.
2789 A good choice is a range of text large enough to give proper results,
2790 but not too large so that refontification becomes slow. Typical values
2791 are @code{mark-defun} for programming modes or @code{mark-paragraph} for
2792 textual modes.
2793 @end defvar
2794
2795 @defvar font-lock-extra-managed-props
2796 This variable specifies additional properties (other than
2797 @code{font-lock-face}) that are being managed by Font Lock mode. It
2798 is used by @code{font-lock-default-unfontify-region}, which normally
2799 only manages the @code{font-lock-face} property. If you want Font
2800 Lock to manage other properties as well, you must specify them in a
2801 @var{facespec} in @code{font-lock-keywords} as well as add them to
2802 this list. @xref{Search-based Fontification}.
2803 @end defvar
2804
2805 @defvar font-lock-fontify-buffer-function
2806 Function to use for fontifying the buffer. The default value is
2807 @code{font-lock-default-fontify-buffer}.
2808 @end defvar
2809
2810 @defvar font-lock-unfontify-buffer-function
2811 Function to use for unfontifying the buffer. This is used when
2812 turning off Font Lock mode. The default value is
2813 @code{font-lock-default-unfontify-buffer}.
2814 @end defvar
2815
2816 @defvar font-lock-fontify-region-function
2817 Function to use for fontifying a region. It should take two
2818 arguments, the beginning and end of the region, and an optional third
2819 argument @var{verbose}. If @var{verbose} is non-@code{nil}, the
2820 function should print status messages. The default value is
2821 @code{font-lock-default-fontify-region}.
2822 @end defvar
2823
2824 @defvar font-lock-unfontify-region-function
2825 Function to use for unfontifying a region. It should take two
2826 arguments, the beginning and end of the region. The default value is
2827 @code{font-lock-default-unfontify-region}.
2828 @end defvar
2829
2830 @defun jit-lock-register function &optional contextual
2831 This function tells Font Lock mode to run the Lisp function
2832 @var{function} any time it has to fontify or refontify part of the
2833 current buffer. It calls @var{function} before calling the default
2834 fontification functions, and gives it two arguments, @var{start} and
2835 @var{end}, which specify the region to be fontified or refontified.
2836
2837 The optional argument @var{contextual}, if non-@code{nil}, forces Font
2838 Lock mode to always refontify a syntactically relevant part of the
2839 buffer, and not just the modified lines. This argument can usually be
2840 omitted.
2841 @end defun
2842
2843 @defun jit-lock-unregister function
2844 If @var{function} was previously registered as a fontification
2845 function using @code{jit-lock-register}, this function unregisters it.
2846 @end defun
2847
2848 @node Levels of Font Lock
2849 @subsection Levels of Font Lock
2850
2851 Many major modes offer three different levels of fontification. You
2852 can define multiple levels by using a list of symbols for @var{keywords}
2853 in @code{font-lock-defaults}. Each symbol specifies one level of
2854 fontification; it is up to the user to choose one of these levels,
2855 normally by setting @code{font-lock-maximum-decoration} (@pxref{Font
2856 Lock,,, emacs, the GNU Emacs Manual}). The chosen level's symbol
2857 value is used to initialize @code{font-lock-keywords}.
2858
2859 Here are the conventions for how to define the levels of
2860 fontification:
2861
2862 @itemize @bullet
2863 @item
2864 Level 1: highlight function declarations, file directives (such as include or
2865 import directives), strings and comments. The idea is speed, so only
2866 the most important and top-level components are fontified.
2867
2868 @item
2869 Level 2: in addition to level 1, highlight all language keywords,
2870 including type names that act like keywords, as well as named constant
2871 values. The idea is that all keywords (either syntactic or semantic)
2872 should be fontified appropriately.
2873
2874 @item
2875 Level 3: in addition to level 2, highlight the symbols being defined in
2876 function and variable declarations, and all builtin function names,
2877 wherever they appear.
2878 @end itemize
2879
2880 @node Precalculated Fontification
2881 @subsection Precalculated Fontification
2882
2883 Some major modes such as @code{list-buffers} and @code{occur}
2884 construct the buffer text programmatically. The easiest way for them
2885 to support Font Lock mode is to specify the faces of text when they
2886 insert the text in the buffer.
2887
2888 The way to do this is to specify the faces in the text with the
2889 special text property @code{font-lock-face} (@pxref{Special
2890 Properties}). When Font Lock mode is enabled, this property controls
2891 the display, just like the @code{face} property. When Font Lock mode
2892 is disabled, @code{font-lock-face} has no effect on the display.
2893
2894 It is ok for a mode to use @code{font-lock-face} for some text and
2895 also use the normal Font Lock machinery. But if the mode does not use
2896 the normal Font Lock machinery, it should not set the variable
2897 @code{font-lock-defaults}.
2898
2899 @node Faces for Font Lock
2900 @subsection Faces for Font Lock
2901 @cindex faces for font lock
2902 @cindex font lock faces
2903
2904 Font Lock mode can highlight using any face, but Emacs defines several
2905 faces specifically for syntactic highlighting. These @dfn{Font Lock
2906 faces} are listed below. They can also be used by major modes for
2907 syntactic highlighting outside of Font Lock mode (@pxref{Major Mode
2908 Conventions}).
2909
2910 Each of these symbols is both a face name, and a variable whose
2911 default value is the symbol itself. Thus, the default value of
2912 @code{font-lock-comment-face} is @code{font-lock-comment-face}.
2913
2914 The faces are listed with descriptions of their typical usage, and in
2915 order of greater to lesser ``prominence''. If a mode's syntactic
2916 categories do not fit well with the usage descriptions, the faces can be
2917 assigned using the ordering as a guide.
2918
2919 @table @code
2920 @item font-lock-warning-face
2921 @vindex font-lock-warning-face
2922 for a construct that is peculiar, or that greatly changes the meaning of
2923 other text, like @samp{;;;###autoload} in Emacs Lisp and @samp{#error}
2924 in C.
2925
2926 @item font-lock-function-name-face
2927 @vindex font-lock-function-name-face
2928 for the name of a function being defined or declared.
2929
2930 @item font-lock-variable-name-face
2931 @vindex font-lock-variable-name-face
2932 for the name of a variable being defined or declared.
2933
2934 @item font-lock-keyword-face
2935 @vindex font-lock-keyword-face
2936 for a keyword with special syntactic significance, like @samp{for} and
2937 @samp{if} in C.
2938
2939 @item font-lock-comment-face
2940 @vindex font-lock-comment-face
2941 for comments.
2942
2943 @item font-lock-comment-delimiter-face
2944 @vindex font-lock-comment-delimiter-face
2945 for comments delimiters, like @samp{/*} and @samp{*/} in C. On most
2946 terminals, this inherits from @code{font-lock-comment-face}.
2947
2948 @item font-lock-type-face
2949 @vindex font-lock-type-face
2950 for the names of user-defined data types.
2951
2952 @item font-lock-constant-face
2953 @vindex font-lock-constant-face
2954 for the names of constants, like @samp{NULL} in C.
2955
2956 @item font-lock-builtin-face
2957 @vindex font-lock-builtin-face
2958 for the names of built-in functions.
2959
2960 @item font-lock-preprocessor-face
2961 @vindex font-lock-preprocessor-face
2962 for preprocessor commands. This inherits, by default, from
2963 @code{font-lock-builtin-face}.
2964
2965 @item font-lock-string-face
2966 @vindex font-lock-string-face
2967 for string constants.
2968
2969 @item font-lock-doc-face
2970 @vindex font-lock-doc-face
2971 for documentation strings in the code. This inherits, by default, from
2972 @code{font-lock-string-face}.
2973
2974 @item font-lock-negation-char-face
2975 @vindex font-lock-negation-char-face
2976 for easily-overlooked negation characters.
2977 @end table
2978
2979 @node Syntactic Font Lock
2980 @subsection Syntactic Font Lock
2981 @cindex syntactic font lock
2982
2983 Syntactic fontification uses the syntax table to find comments and
2984 string constants (@pxref{Syntax Tables}). It highlights them using
2985 @code{font-lock-comment-face} and @code{font-lock-string-face}
2986 (@pxref{Faces for Font Lock}), or whatever
2987 @code{font-lock-syntactic-face-function} chooses. There are several
2988 variables that affect syntactic fontification; you should set them by
2989 means of @code{font-lock-defaults} (@pxref{Font Lock Basics}).
2990
2991 @defvar font-lock-keywords-only
2992 Non-@code{nil} means Font Lock should not do syntactic fontification;
2993 it should only fontify based on @code{font-lock-keywords}. The normal
2994 way for a mode to set this variable to @code{t} is with
2995 @var{keywords-only} in @code{font-lock-defaults}.
2996 @end defvar
2997
2998 @defvar font-lock-syntax-table
2999 This variable holds the syntax table to use for fontification of
3000 comments and strings. Specify it using @var{syntax-alist} in
3001 @code{font-lock-defaults}. If this is @code{nil}, fontification uses
3002 the buffer's syntax table.
3003 @end defvar
3004
3005 @defvar font-lock-beginning-of-syntax-function
3006 If this variable is non-@code{nil}, it should be a function to move
3007 point back to a position that is syntactically at ``top level'' and
3008 outside of strings or comments. Font Lock uses this when necessary
3009 to get the right results for syntactic fontification.
3010
3011 This function is called with no arguments. It should leave point at
3012 the beginning of any enclosing syntactic block. Typical values are
3013 @code{beginning-of-line} (used when the start of the line is known to
3014 be outside a syntactic block), or @code{beginning-of-defun} for
3015 programming modes, or @code{backward-paragraph} for textual modes.
3016
3017 If the value is @code{nil}, Font Lock uses
3018 @code{syntax-begin-function} to move back outside of any comment,
3019 string, or sexp. This variable is semi-obsolete; we recommend setting
3020 @code{syntax-begin-function} instead.
3021
3022 Specify this variable using @var{syntax-begin} in
3023 @code{font-lock-defaults}.
3024 @end defvar
3025
3026 @defvar font-lock-syntactic-face-function
3027 A function to determine which face to use for a given syntactic
3028 element (a string or a comment). The function is called with one
3029 argument, the parse state at point returned by
3030 @code{parse-partial-sexp}, and should return a face. The default
3031 value returns @code{font-lock-comment-face} for comments and
3032 @code{font-lock-string-face} for strings.
3033
3034 This can be used to highlighting different kinds of strings or
3035 comments differently. It is also sometimes abused together with
3036 @code{font-lock-syntactic-keywords} to highlight constructs that span
3037 multiple lines, but this is too esoteric to document here.
3038
3039 Specify this variable using @var{other-vars} in
3040 @code{font-lock-defaults}.
3041 @end defvar
3042
3043 @node Setting Syntax Properties
3044 @subsection Setting Syntax Properties
3045
3046 Font Lock mode can be used to update @code{syntax-table} properties
3047 automatically (@pxref{Syntax Properties}). This is useful in
3048 languages for which a single syntax table by itself is not sufficient.
3049
3050 @defvar font-lock-syntactic-keywords
3051 This variable enables and controls updating @code{syntax-table}
3052 properties by Font Lock. Its value should be a list of elements of
3053 this form:
3054
3055 @example
3056 (@var{matcher} @var{subexp} @var{syntax} @var{override} @var{laxmatch})
3057 @end example
3058
3059 The parts of this element have the same meanings as in the corresponding
3060 sort of element of @code{font-lock-keywords},
3061
3062 @example
3063 (@var{matcher} @var{subexp} @var{facespec} @var{override} @var{laxmatch})
3064 @end example
3065
3066 However, instead of specifying the value @var{facespec} to use for the
3067 @code{face} property, it specifies the value @var{syntax} to use for
3068 the @code{syntax-table} property. Here, @var{syntax} can be a string
3069 (as taken by @code{modify-syntax-entry}), a syntax table, a cons cell
3070 (as returned by @code{string-to-syntax}), or an expression whose value
3071 is one of those two types. @var{override} cannot be @code{prepend} or
3072 @code{append}.
3073
3074 For example, an element of the form:
3075
3076 @example
3077 ("\\$\\(#\\)" 1 ".")
3078 @end example
3079
3080 highlights syntactically a hash character when following a dollar
3081 character, with a SYNTAX of @code{"."} (meaning punctuation syntax).
3082 Assuming that the buffer syntax table specifies hash characters to
3083 have comment start syntax, the element will only highlight hash
3084 characters that do not follow dollar characters as comments
3085 syntactically.
3086
3087 An element of the form:
3088
3089 @example
3090 ("\\('\\).\\('\\)"
3091 (1 "\"")
3092 (2 "\""))
3093 @end example
3094
3095 highlights syntactically both single quotes which surround a single
3096 character, with a SYNTAX of @code{"\""} (meaning string quote syntax).
3097 Assuming that the buffer syntax table does not specify single quotes
3098 to have quote syntax, the element will only highlight single quotes of
3099 the form @samp{'@var{c}'} as strings syntactically. Other forms, such
3100 as @samp{foo'bar} or @samp{'fubar'}, will not be highlighted as
3101 strings.
3102
3103 Major modes normally set this variable with @var{other-vars} in
3104 @code{font-lock-defaults}.
3105 @end defvar
3106
3107 @node Multiline Font Lock
3108 @subsection Multiline Font Lock Constructs
3109 @cindex multiline font lock
3110
3111 Normally, elements of @code{font-lock-keywords} should not match
3112 across multiple lines; that doesn't work reliably, because Font Lock
3113 usually scans just part of the buffer, and it can miss a multi-line
3114 construct that crosses the line boundary where the scan starts. (The
3115 scan normally starts at the beginning of a line.)
3116
3117 Making elements that match multiline constructs work properly has
3118 two aspects: correct @emph{identification} and correct
3119 @emph{rehighlighting}. The first means that Font Lock finds all
3120 multiline constructs. The second means that Font Lock will correctly
3121 rehighlight all the relevant text when a multiline construct is
3122 changed---for example, if some of the text that was previously part of
3123 a multiline construct ceases to be part of it. The two aspects are
3124 closely related, and often getting one of them to work will appear to
3125 make the other also work. However, for reliable results you must
3126 attend explicitly to both aspects.
3127
3128 There are three ways to ensure correct identification of multiline
3129 constructs:
3130
3131 @itemize
3132 @item
3133 Add a function to @code{font-lock-extend-region-functions} that does
3134 the @emph{identification} and extends the scan so that the scanned
3135 text never starts or ends in the middle of a multiline construct.
3136 @item
3137 Use the @code{font-lock-fontify-region-function} hook similarly to
3138 extend the scan so that the scanned text never starts or ends in the
3139 middle of a multiline construct.
3140 @item
3141 Somehow identify the multiline construct right when it gets inserted
3142 into the buffer (or at any point after that but before font-lock
3143 tries to highlight it), and mark it with a @code{font-lock-multiline}
3144 which will instruct font-lock not to start or end the scan in the
3145 middle of the construct.
3146 @end itemize
3147
3148 There are three ways to do rehighlighting of multiline constructs:
3149
3150 @itemize
3151 @item
3152 Place a @code{font-lock-multiline} property on the construct. This
3153 will rehighlight the whole construct if any part of it is changed. In
3154 some cases you can do this automatically by setting the
3155 @code{font-lock-multiline} variable, which see.
3156 @item
3157 Make sure @code{jit-lock-contextually} is set and rely on it doing its
3158 job. This will only rehighlight the part of the construct that
3159 follows the actual change, and will do it after a short delay.
3160 This only works if the highlighting of the various parts of your
3161 multiline construct never depends on text in subsequent lines.
3162 Since @code{jit-lock-contextually} is activated by default, this can
3163 be an attractive solution.
3164 @item
3165 Place a @code{jit-lock-defer-multiline} property on the construct.
3166 This works only if @code{jit-lock-contextually} is used, and with the
3167 same delay before rehighlighting, but like @code{font-lock-multiline},
3168 it also handles the case where highlighting depends on
3169 subsequent lines.
3170 @end itemize
3171
3172 @menu
3173 * Font Lock Multiline:: Marking multiline chunks with a text property.
3174 * Region to Refontify:: Controlling which region gets refontified
3175 after a buffer change.
3176 @end menu
3177
3178 @node Font Lock Multiline
3179 @subsubsection Font Lock Multiline
3180
3181 One way to ensure reliable rehighlighting of multiline Font Lock
3182 constructs is to put on them the text property @code{font-lock-multiline}.
3183 It should be present and non-@code{nil} for text that is part of a
3184 multiline construct.
3185
3186 When Font Lock is about to highlight a range of text, it first
3187 extends the boundaries of the range as necessary so that they do not
3188 fall within text marked with the @code{font-lock-multiline} property.
3189 Then it removes any @code{font-lock-multiline} properties from the
3190 range, and highlights it. The highlighting specification (mostly
3191 @code{font-lock-keywords}) must reinstall this property each time,
3192 whenever it is appropriate.
3193
3194 @strong{Warning:} don't use the @code{font-lock-multiline} property
3195 on large ranges of text, because that will make rehighlighting slow.
3196
3197 @defvar font-lock-multiline
3198 If the @code{font-lock-multiline} variable is set to @code{t}, Font
3199 Lock will try to add the @code{font-lock-multiline} property
3200 automatically on multiline constructs. This is not a universal
3201 solution, however, since it slows down Font Lock somewhat. It can
3202 miss some multiline constructs, or make the property larger or smaller
3203 than necessary.
3204
3205 For elements whose @var{matcher} is a function, the function should
3206 ensure that submatch 0 covers the whole relevant multiline construct,
3207 even if only a small subpart will be highlighted. It is often just as
3208 easy to add the @code{font-lock-multiline} property by hand.
3209 @end defvar
3210
3211 The @code{font-lock-multiline} property is meant to ensure proper
3212 refontification; it does not automatically identify new multiline
3213 constructs. Identifying the requires that Font-Lock operate on large
3214 enough chunks at a time. This will happen by accident on many cases,
3215 which may give the impression that multiline constructs magically work.
3216 If you set the @code{font-lock-multiline} variable non-@code{nil},
3217 this impression will be even stronger, since the highlighting of those
3218 constructs which are found will be properly updated from then on.
3219 But that does not work reliably.
3220
3221 To find multiline constructs reliably, you must either manually
3222 place the @code{font-lock-multiline} property on the text before
3223 Font-Lock looks at it, or use
3224 @code{font-lock-fontify-region-function}.
3225
3226 @node Region to Refontify
3227 @subsubsection Region to Fontify after a Buffer Change
3228
3229 When a buffer is changed, the region that Font Lock refontifies is
3230 by default the smallest sequence of whole lines that spans the change.
3231 While this works well most of the time, sometimes it doesn't---for
3232 example, when a change alters the syntactic meaning of text on an
3233 earlier line.
3234
3235 You can enlarge (or even reduce) the region to refontify by setting
3236 the following variable:
3237
3238 @defvar font-lock-extend-after-change-region-function
3239 This buffer-local variable is either @code{nil} or a function for
3240 Font-Lock to call to determine the region to scan and fontify.
3241
3242 The function is given three parameters, the standard @var{beg},
3243 @var{end}, and @var{old-len} from @code{after-change-functions}
3244 (@pxref{Change Hooks}). It should return either a cons of the
3245 beginning and end buffer positions (in that order) of the region to
3246 fontify, or @code{nil} (which means choose the region in the standard
3247 way). This function needs to preserve point, the match-data, and the
3248 current restriction. The region it returns may start or end in the
3249 middle of a line.
3250
3251 Since this function is called after every buffer change, it should be
3252 reasonably fast.
3253 @end defvar
3254
3255 @node Auto-Indentation
3256 @section Auto-indentation of code
3257
3258 For programming languages, an important feature of a major mode is to
3259 provide automatic indentation. This is controlled in Emacs by
3260 @code{indent-line-function} (@pxref{Mode-Specific Indent}).
3261 Writing a good indentation function can be difficult and to a large
3262 extent it is still a black art.
3263
3264 Many major mode authors will start by writing a simple indentation
3265 function that works for simple cases, for example by comparing with the
3266 indentation of the previous text line. For most programming languages
3267 that are not really line-based, this tends to scale very poorly:
3268 improving such a function to let it handle more diverse situations tends
3269 to become more and more difficult, resulting in the end with a large,
3270 complex, unmaintainable indentation function which nobody dares to touch.
3271
3272 A good indentation function will usually need to actually parse the
3273 text, according to the syntax of the language. Luckily, it is not
3274 necessary to parse the text in as much detail as would be needed
3275 for a compiler, but on the other hand, the parser embedded in the
3276 indentation code will want to be somewhat friendly to syntactically
3277 incorrect code.
3278
3279 Good maintainable indentation functions usually fall into 2 categories:
3280 either parsing forward from some ``safe'' starting point until the
3281 position of interest, or parsing backward from the position of interest.
3282 Neither of the two is a clearly better choice than the other: parsing
3283 backward is often more difficult than parsing forward because
3284 programming languages are designed to be parsed forward, but for the
3285 purpose of indentation it has the advantage of not needing to
3286 guess a ``safe'' starting point, and it generally enjoys the property
3287 that only a minimum of text will be analyzed to decide the indentation
3288 of a line, so indentation will tend to be unaffected by syntax errors in
3289 some earlier unrelated piece of code. Parsing forward on the other hand
3290 is usually easier and has the advantage of making it possible to
3291 reindent efficiently a whole region at a time, with a single parse.
3292
3293 Rather than write your own indentation function from scratch, it is
3294 often preferable to try and reuse some existing ones or to rely
3295 on a generic indentation engine. There are sadly few such
3296 engines. The CC-mode indentation code (used with C, C++, Java, Awk
3297 and a few other such modes) has been made more generic over the years,
3298 so if your language seems somewhat similar to one of those languages,
3299 you might try to use that engine. @c FIXME: documentation?
3300 Another one is SMIE which takes an approach in the spirit
3301 of Lisp sexps and adapts it to non-Lisp languages.
3302
3303 @menu
3304 * SMIE:: A simple minded indentation engine
3305 @end menu
3306
3307 @node SMIE
3308 @subsection Simple Minded Indentation Engine
3309
3310 SMIE is a package that provides a generic navigation and indentation
3311 engine. Based on a very simple parser using an ``operator precedence
3312 grammar'', it lets major modes extend the sexp-based navigation of Lisp
3313 to non-Lisp languages as well as provide a simple to use but reliable
3314 auto-indentation.
3315
3316 Operator precedence grammar is a very primitive technology for parsing
3317 compared to some of the more common techniques used in compilers.
3318 It has the following characteristics: its parsing power is very limited,
3319 and it is largely unable to detect syntax errors, but it has the
3320 advantage of being algorithmically efficient and able to parse forward
3321 just as well as backward. In practice that means that SMIE can use it
3322 for indentation based on backward parsing, that it can provide both
3323 @code{forward-sexp} and @code{backward-sexp} functionality, and that it
3324 will naturally work on syntactically incorrect code without any extra
3325 effort. The downside is that it also means that most programming
3326 languages cannot be parsed correctly using SMIE, at least not without
3327 resorting to some special tricks (@pxref{SMIE Tricks}).
3328
3329 @menu
3330 * SMIE setup:: SMIE setup and features
3331 * Operator Precedence Grammars:: A very simple parsing technique
3332 * SMIE Grammar:: Defining the grammar of a language
3333 * SMIE Lexer:: Defining tokens
3334 * SMIE Tricks:: Working around the parser's limitations
3335 * SMIE Indentation:: Specifying indentation rules
3336 * SMIE Indentation Helpers:: Helper functions for indentation rules
3337 * SMIE Indentation Example:: Sample indentation rules
3338 @end menu
3339
3340 @node SMIE setup
3341 @subsubsection SMIE Setup and Features
3342
3343 SMIE is meant to be a one-stop shop for structural navigation and
3344 various other features which rely on the syntactic structure of code, in
3345 particular automatic indentation. The main entry point is
3346 @code{smie-setup} which is a function typically called while setting
3347 up a major mode.
3348
3349 @defun smie-setup grammar rules-function &rest keywords
3350 Setup SMIE navigation and indentation.
3351 @var{grammar} is a grammar table generated by @code{smie-prec2->grammar}.
3352 @var{rules-function} is a set of indentation rules for use on
3353 @code{smie-rules-function}.
3354 @var{keywords} are additional arguments, which can include the following
3355 keywords:
3356 @itemize
3357 @item
3358 @code{:forward-token} @var{fun}: Specify the forward lexer to use.
3359 @item
3360 @code{:backward-token} @var{fun}: Specify the backward lexer to use.
3361 @end itemize
3362 @end defun
3363
3364 Calling this function is sufficient to make commands such as
3365 @code{forward-sexp}, @code{backward-sexp}, and @code{transpose-sexps} be
3366 able to properly handle structural elements other than just the paired
3367 parentheses already handled by syntax tables. For example, if the
3368 provided grammar is precise enough, @code{transpose-sexps} can correctly
3369 transpose the two arguments of a @code{+} operator, taking into account
3370 the precedence rules of the language.
3371
3372 Calling `smie-setup' is also sufficient to make TAB indentation work in
3373 the expected way, extends @code{blink-matching-paren} to apply to
3374 elements like @code{begin...end}, and provides some commands that you
3375 can bind in the major mode keymap.
3376
3377 @deffn Command smie-close-block
3378 This command closes the most recently opened (and not yet closed) block.
3379 @end deffn
3380
3381 @deffn Command smie-down-list &optional arg
3382 This command is like @code{down-list} but it also pays attention to
3383 nesting of tokens other than parentheses, such as @code{begin...end}.
3384 @end deffn
3385
3386 @node Operator Precedence Grammars
3387 @subsubsection Operator Precedence Grammars
3388
3389 SMIE's precedence grammars simply give to each token a pair of
3390 precedences: the left-precedence and the right-precedence. We say
3391 @code{T1 < T2} if the right-precedence of token @code{T1} is less than
3392 the left-precedence of token @code{T2}. A good way to read this
3393 @code{<} is as a kind of parenthesis: if we find @code{... T1 something
3394 T2 ...} then that should be parsed as @code{... T1 (something T2 ...}
3395 rather than as @code{... T1 something) T2 ...}. The latter
3396 interpretation would be the case if we had @code{T1 > T2}. If we have
3397 @code{T1 = T2}, it means that token T2 follows token T1 in the same
3398 syntactic construction, so typically we have @code{"begin" = "end"}.
3399 Such pairs of precedences are sufficient to express left-associativity
3400 or right-associativity of infix operators, nesting of tokens like
3401 parentheses and many other cases.
3402
3403 @c Let's leave this undocumented to leave it more open for change!
3404 @c @defvar smie-grammar
3405 @c The value of this variable is an alist specifying the left and right
3406 @c precedence of each token. It is meant to be initialized by using one of
3407 @c the functions below.
3408 @c @end defvar
3409
3410 @defun smie-prec2->grammar table
3411 This function takes a @emph{prec2} grammar @var{table} and returns an
3412 alist suitable for use in @code{smie-setup}. The @emph{prec2}
3413 @var{table} is itself meant to be built by one of the functions below.
3414 @end defun
3415
3416 @defun smie-merge-prec2s &rest tables
3417 This function takes several @emph{prec2} @var{tables} and merges them
3418 into a new @emph{prec2} table.
3419 @end defun
3420
3421 @defun smie-precs->prec2 precs
3422 This function builds a @emph{prec2} table from a table of precedences
3423 @var{precs}. @var{precs} should be a list, sorted by precedence (for
3424 example @code{"+"} will come before @code{"*"}), of elements of the form
3425 @code{(@var{assoc} @var{op} ...)}, where each @var{op} is a token that
3426 acts as an operator; @var{assoc} is their associativity, which can be
3427 either @code{left}, @code{right}, @code{assoc}, or @code{nonassoc}.
3428 All operators in a given element share the same precedence level
3429 and associativity.
3430 @end defun
3431
3432 @defun smie-bnf->prec2 bnf &rest resolvers
3433 This function lets you specify the grammar using a BNF notation.
3434 It accepts a @var{bnf} description of the grammar along with a set of
3435 conflict resolution rules @var{resolvers}, and
3436 returns a @emph{prec2} table.
3437
3438 @var{bnf} is a list of nonterminal definitions of the form
3439 @code{(@var{nonterm} @var{rhs1} @var{rhs2} ...)} where each @var{rhs}
3440 is a (non-empty) list of terminals (aka tokens) or non-terminals.
3441
3442 Not all grammars are accepted:
3443 @itemize
3444 @item
3445 An @var{rhs} cannot be an empty list (an empty list is never needed,
3446 since SMIE allows all non-terminals to match the empty string anyway).
3447 @item
3448 An @var{rhs} cannot have 2 consecutive non-terminals: each pair of
3449 non-terminals needs to be separated by a terminal (aka token).
3450 This is a fundamental limitation of operator precedence grammars.
3451 @end itemize
3452
3453 Additionally, conflicts can occur:
3454 @itemize
3455 @item
3456 The returned @emph{prec2} table holds constraints between pairs of tokens, and
3457 for any given pair only one constraint can be present: T1 < T2,
3458 T1 = T2, or T1 > T2.
3459 @item
3460 A token can be an @code{opener} (something similar to an open-paren),
3461 a @code{closer} (like a close-paren), or @code{neither} of the two
3462 (e.g. an infix operator, or an inner token like @code{"else"}).
3463 @end itemize
3464
3465 Precedence conflicts can be resolved via @var{resolvers}, which
3466 is a list of @emph{precs} tables (see @code{smie-precs->prec2}): for
3467 each precedence conflict, if those @code{precs} tables
3468 specify a particular constraint, then the conflict is resolved by using
3469 this constraint instead, else a conflict is reported and one of the
3470 conflicting constraints is picked arbitrarily and the others are
3471 simply ignored.
3472 @end defun
3473
3474 @node SMIE Grammar
3475 @subsubsection Defining the Grammar of a Language
3476
3477 The usual way to define the SMIE grammar of a language is by
3478 defining a new global variable that holds the precedence table by
3479 giving a set of BNF rules.
3480 For example, the grammar definition for a small Pascal-like language
3481 could look like:
3482 @example
3483 @group
3484 (require 'smie)
3485 (defvar sample-smie-grammar
3486 (smie-prec2->grammar
3487 (smie-bnf->prec2
3488 @end group
3489 @group
3490 '((id)
3491 (inst ("begin" insts "end")
3492 ("if" exp "then" inst "else" inst)
3493 (id ":=" exp)
3494 (exp))
3495 (insts (insts ";" insts) (inst))
3496 (exp (exp "+" exp)
3497 (exp "*" exp)
3498 ("(" exps ")"))
3499 (exps (exps "," exps) (exp)))
3500 @end group
3501 @group
3502 '((assoc ";"))
3503 '((assoc ","))
3504 '((assoc "+") (assoc "*")))))
3505 @end group
3506 @end example
3507
3508 @noindent
3509 A few things to note:
3510
3511 @itemize
3512 @item
3513 The above grammar does not explicitly mention the syntax of function
3514 calls: SMIE will automatically allow any sequence of sexps, such as
3515 identifiers, balanced parentheses, or @code{begin ... end} blocks
3516 to appear anywhere anyway.
3517 @item
3518 The grammar category @code{id} has no right hand side: this does not
3519 mean that it can match only the empty string, since as mentioned any
3520 sequence of sexps can appear anywhere anyway.
3521 @item
3522 Because non terminals cannot appear consecutively in the BNF grammar, it
3523 is difficult to correctly handle tokens that act as terminators, so the
3524 above grammar treats @code{";"} as a statement @emph{separator} instead,
3525 which SMIE can handle very well.
3526 @item
3527 Separators used in sequences (such as @code{","} and @code{";"} above)
3528 are best defined with BNF rules such as @code{(foo (foo "separator" foo) ...)}
3529 which generate precedence conflicts which are then resolved by giving
3530 them an explicit @code{(assoc "separator")}.
3531 @item
3532 The @code{("(" exps ")")} rule was not needed to pair up parens, since
3533 SMIE will pair up any characters that are marked as having paren syntax
3534 in the syntax table. What this rule does instead (together with the
3535 definition of @code{exps}) is to make it clear that @code{","} should
3536 not appear outside of parentheses.
3537 @item
3538 Rather than have a single @emph{precs} table to resolve conflicts, it is
3539 preferable to have several tables, so as to let the BNF part of the
3540 grammar specify relative precedences where possible.
3541 @item
3542 Unless there is a very good reason to prefer @code{left} or
3543 @code{right}, it is usually preferable to mark operators as associative,
3544 using @code{assoc}. For that reason @code{"+"} and @code{"*"} are
3545 defined above as @code{assoc}, although the language defines them
3546 formally as left associative.
3547 @end itemize
3548
3549 @node SMIE Lexer
3550 @subsubsection Defining Tokens
3551
3552 SMIE comes with a predefined lexical analyzer which uses syntax tables
3553 in the following way: any sequence of characters that have word or
3554 symbol syntax is considered a token, and so is any sequence of
3555 characters that have punctuation syntax. This default lexer is
3556 often a good starting point but is rarely actually correct for any given
3557 language. For example, it will consider @code{"2,+3"} to be composed
3558 of 3 tokens: @code{"2"}, @code{",+"}, and @code{"3"}.
3559
3560 To describe the lexing rules of your language to SMIE, you need
3561 2 functions, one to fetch the next token, and another to fetch the
3562 previous token. Those functions will usually first skip whitespace and
3563 comments and then look at the next chunk of text to see if it
3564 is a special token. If so it should skip the token and
3565 return a description of this token. Usually this is simply the string
3566 extracted from the buffer, but it can be anything you want.
3567 For example:
3568 @example
3569 @group
3570 (defvar sample-keywords-regexp
3571 (regexp-opt '("+" "*" "," ";" ">" ">=" "<" "<=" ":=" "=")))
3572 @end group
3573 @group
3574 (defun sample-smie-forward-token ()
3575 (forward-comment (point-max))
3576 (cond
3577 ((looking-at sample-keywords-regexp)
3578 (goto-char (match-end 0))
3579 (match-string-no-properties 0))
3580 (t (buffer-substring-no-properties
3581 (point)
3582 (progn (skip-syntax-forward "w_")
3583 (point))))))
3584 @end group
3585 @group
3586 (defun sample-smie-backward-token ()
3587 (forward-comment (- (point)))
3588 (cond
3589 ((looking-back sample-keywords-regexp (- (point) 2) t)
3590 (goto-char (match-beginning 0))
3591 (match-string-no-properties 0))
3592 (t (buffer-substring-no-properties
3593 (point)
3594 (progn (skip-syntax-backward "w_")
3595 (point))))))
3596 @end group
3597 @end example
3598
3599 Notice how those lexers return the empty string when in front of
3600 parentheses. This is because SMIE automatically takes care of the
3601 parentheses defined in the syntax table. More specifically if the lexer
3602 returns nil or an empty string, SMIE tries to handle the corresponding
3603 text as a sexp according to syntax tables.
3604
3605 @node SMIE Tricks
3606 @subsubsection Living With a Weak Parser
3607
3608 The parsing technique used by SMIE does not allow tokens to behave
3609 differently in different contexts. For most programming languages, this
3610 manifests itself by precedence conflicts when converting the
3611 BNF grammar.
3612
3613 Sometimes, those conflicts can be worked around by expressing the
3614 grammar slightly differently. For example, for Modula-2 it might seem
3615 natural to have a BNF grammar that looks like this:
3616
3617 @example
3618 ...
3619 (inst ("IF" exp "THEN" insts "ELSE" insts "END")
3620 ("CASE" exp "OF" cases "END")
3621 ...)
3622 (cases (cases "|" cases) (caselabel ":" insts) ("ELSE" insts))
3623 ...
3624 @end example
3625
3626 But this will create conflicts for @code{"ELSE"}: on the one hand, the
3627 IF rule implies (among many other things) that @code{"ELSE" = "END"};
3628 but on the other hand, since @code{"ELSE"} appears within @code{cases},
3629 which appears left of @code{"END"}, we also have @code{"ELSE" > "END"}.
3630 We can solve the conflict either by using:
3631 @example
3632 ...
3633 (inst ("IF" exp "THEN" insts "ELSE" insts "END")
3634 ("CASE" exp "OF" cases "END")
3635 ("CASE" exp "OF" cases "ELSE" insts "END")
3636 ...)
3637 (cases (cases "|" cases) (caselabel ":" insts))
3638 ...
3639 @end example
3640 or
3641 @example
3642 ...
3643 (inst ("IF" exp "THEN" else "END")
3644 ("CASE" exp "OF" cases "END")
3645 ...)
3646 (else (insts "ELSE" insts))
3647 (cases (cases "|" cases) (caselabel ":" insts) (else))
3648 ...
3649 @end example
3650
3651 Reworking the grammar to try and solve conflicts has its downsides, tho,
3652 because SMIE assumes that the grammar reflects the logical structure of
3653 the code, so it is preferable to keep the BNF closer to the intended
3654 abstract syntax tree.
3655
3656 Other times, after careful consideration you may conclude that those
3657 conflicts are not serious and simply resolve them via the
3658 @var{resolvers} argument of @code{smie-bnf->prec2}. Usually this is
3659 because the grammar is simply ambiguous: the conflict does not affect
3660 the set of programs described by the grammar, but only the way those
3661 programs are parsed. This is typically the case for separators and
3662 associative infix operators, where you want to add a resolver like
3663 @code{'((assoc "|"))}. Another case where this can happen is for the
3664 classic @emph{dangling else} problem, where you will use @code{'((assoc
3665 "else" "then"))}. It can also happen for cases where the conflict is
3666 real and cannot really be resolved, but it is unlikely to pose a problem
3667 in practice.
3668
3669 Finally, in many cases some conflicts will remain despite all efforts to
3670 restructure the grammar. Do not despair: while the parser cannot be
3671 made more clever, you can make the lexer as smart as you want. So, the
3672 solution is then to look at the tokens involved in the conflict and to
3673 split one of those tokens into 2 (or more) different tokens. E.g. if
3674 the grammar needs to distinguish between two incompatible uses of the
3675 token @code{"begin"}, make the lexer return different tokens (say
3676 @code{"begin-fun"} and @code{"begin-plain"}) depending on which kind of
3677 @code{"begin"} it finds. This pushes the work of distinguishing the
3678 different cases to the lexer, which will thus have to look at the
3679 surrounding text to find ad-hoc clues.
3680
3681 @node SMIE Indentation
3682 @subsubsection Specifying Indentation Rules
3683
3684 Based on the provided grammar, SMIE will be able to provide automatic
3685 indentation without any extra effort. But in practice, this default
3686 indentation style will probably not be good enough. You will want to
3687 tweak it in many different cases.
3688
3689 SMIE indentation is based on the idea that indentation rules should be
3690 as local as possible. To this end, it relies on the idea of
3691 @emph{virtual} indentation, which is the indentation that a particular
3692 program point would have if it were at the beginning of a line.
3693 Of course, if that program point is indeed at the beginning of a line,
3694 its virtual indentation is its current indentation. But if not, then
3695 SMIE uses the indentation algorithm to compute the virtual indentation
3696 of that point. Now in practice, the virtual indentation of a program
3697 point does not have to be identical to the indentation it would have if
3698 we inserted a newline before it. To see how this works, the SMIE rule
3699 for indentation after a @code{@{} in C does not care whether the
3700 @code{@{} is standing on a line of its own or is at the end of the
3701 preceding line. Instead, these different cases are handled in the
3702 indentation rule that decides how to indent before a @code{@{}.
3703
3704 Another important concept is the notion of @emph{parent}: The
3705 @emph{parent} of a token, is the head token of the nearest enclosing
3706 syntactic construct. For example, the parent of an @code{else} is the
3707 @code{if} to which it belongs, and the parent of an @code{if}, in turn,
3708 is the lead token of the surrounding construct. The command
3709 @code{backward-sexp} jumps from a token to its parent, but there are
3710 some caveats: for @emph{openers} (tokens which start a construct, like
3711 @code{if}), you need to start with point before the token, while for
3712 others you need to start with point after the token.
3713 @code{backward-sexp} stops with point before the parent token if that is
3714 the @emph{opener} of the token of interest, and otherwise it stops with
3715 point after the parent token.
3716
3717 SMIE indentation rules are specified using a function that takes two
3718 arguments @var{method} and @var{arg} where the meaning of @var{arg} and the
3719 expected return value depend on @var{method}.
3720
3721 @var{method} can be:
3722 @itemize
3723 @item
3724 @code{:after}, in which case @var{arg} is a token and the function
3725 should return the @var{offset} to use for indentation after @var{arg}.
3726 @item
3727 @code{:before}, in which case @var{arg} is a token and the function
3728 should return the @var{offset} to use to indent @var{arg} itself.
3729 @item
3730 @code{:elem}, in which case the function should return either the offset
3731 to use to indent function arguments (if @var{arg} is the symbol
3732 @code{arg}) or the basic indentation step (if @var{arg} is the symbol
3733 @code{basic}).
3734 @item
3735 @code{:list-intro}, in which case @var{arg} is a token and the function
3736 should return non-@code{nil} if the token is followed by a list of
3737 expressions (not separated by any token) rather than an expression.
3738 @end itemize
3739
3740 When @var{arg} is a token, the function is called with point just before
3741 that token. A return value of nil always means to fallback on the
3742 default behavior, so the function should return nil for arguments it
3743 does not expect.
3744
3745 @var{offset} can be:
3746 @itemize
3747 @item
3748 @code{nil}: use the default indentation rule.
3749 @item
3750 @code{(column . @var{column})}: indent to column @var{column}.
3751 @item
3752 @var{number}: offset by @var{number}, relative to a base token which is
3753 the current token for @code{:after} and its parent for @code{:before}.
3754 @end itemize
3755
3756 @node SMIE Indentation Helpers
3757 @subsubsection Helper Functions for Indentation Rules
3758
3759 SMIE provides various functions designed specifically for use in the
3760 indentation rules function (several of those functions break if used in
3761 another context). These functions all start with the prefix
3762 @code{smie-rule-}.
3763
3764 @defun smie-rule-bolp
3765 Return non-@code{nil} if the current token is the first on the line.
3766 @end defun
3767
3768 @defun smie-rule-hanging-p
3769 Return non-@code{nil} if the current token is @emph{hanging}.
3770 A token is @emph{hanging} if it is the last token on the line
3771 and if it is preceded by other tokens: a lone token on a line is not
3772 hanging.
3773 @end defun
3774
3775 @defun smie-rule-next-p &rest tokens
3776 Return non-@code{nil} if the next token is among @var{tokens}.
3777 @end defun
3778
3779 @defun smie-rule-prev-p &rest tokens
3780 Return non-@code{nil} if the previous token is among @var{tokens}.
3781 @end defun
3782
3783 @defun smie-rule-parent-p &rest parents
3784 Return non-@code{nil} if the current token's parent is among @var{parents}.
3785 @end defun
3786
3787 @defun smie-rule-sibling-p
3788 Return non-nil if the current token's parent is actually a sibling.
3789 This is the case for example when the parent of a @code{","} is just the
3790 previous @code{","}.
3791 @end defun
3792
3793 @defun smie-rule-parent &optional offset
3794 Return the proper offset to align the current token with the parent.
3795 If non-@code{nil}, @var{offset} should be an integer giving an
3796 additional offset to apply.
3797 @end defun
3798
3799 @defun smie-rule-separator method
3800 Indent current token as a @emph{separator}.
3801
3802 By @emph{separator}, we mean here a token whose sole purpose is to
3803 separate various elements within some enclosing syntactic construct, and
3804 which does not have any semantic significance in itself (i.e. it would
3805 typically not exist as a node in an abstract syntax tree).
3806
3807 Such a token is expected to have an associative syntax and be closely
3808 tied to its syntactic parent. Typical examples are @code{","} in lists
3809 of arguments (enclosed inside parentheses), or @code{";"} in sequences
3810 of instructions (enclosed in a @code{@{...@}} or @code{begin...end}
3811 block).
3812
3813 @var{method} should be the method name that was passed to
3814 `smie-rules-function'.
3815 @end defun
3816
3817 @node SMIE Indentation Example
3818 @subsubsection Sample Indentation Rules
3819
3820 Here is an example of an indentation function:
3821
3822 @example
3823 (defun sample-smie-rules (kind token)
3824 (pcase (cons kind token)
3825 (`(:elem . basic) sample-indent-basic)
3826 (`(,_ . ",") (smie-rule-separator kind))
3827 (`(:after . ":=") sample-indent-basic)
3828 (`(:before . ,(or `"begin" `"(" `"@{")))
3829 (if (smie-rule-hanging-p) (smie-rule-parent)))
3830 (`(:before . "if")
3831 (and (not (smie-rule-bolp)) (smie-rule-prev-p "else")
3832 (smie-rule-parent)))))
3833 @end example
3834
3835 @noindent
3836 A few things to note:
3837
3838 @itemize
3839 @item
3840 The first case indicates the basic indentation increment to use.
3841 If @code{sample-indent-basic} is nil, then SMIE uses the global
3842 setting @code{smie-indent-basic}. The major mode could have set
3843 @code{smie-indent-basic} buffer-locally instead, but that
3844 is discouraged.
3845
3846 @item
3847 The rule for the token @code{","} make SMIE try to be more clever when
3848 the comma separator is placed at the beginning of lines. It tries to
3849 outdent the separator so as to align the code after the comma; for
3850 example:
3851
3852 @example
3853 x = longfunctionname (
3854 arg1
3855 , arg2
3856 );
3857 @end example
3858
3859 @item
3860 The rule for indentation after @code{":="} exists because otherwise
3861 SMIE would treat @code{":="} as an infix operator and would align the
3862 right argument with the left one.
3863
3864 @item
3865 The rule for indentation before @code{"begin"} is an example of the use
3866 of virtual indentation: This rule is used only when @code{"begin"} is
3867 hanging, which can happen only when @code{"begin"} is not at the
3868 beginning of a line. So this is not used when indenting
3869 @code{"begin"} itself but only when indenting something relative to this
3870 @code{"begin"}. Concretely, this rule changes the indentation from:
3871
3872 @example
3873 if x > 0 then begin
3874 dosomething(x);
3875 end
3876 @end example
3877 to
3878 @example
3879 if x > 0 then begin
3880 dosomething(x);
3881 end
3882 @end example
3883
3884 @item
3885 The rule for indentation before @code{"if"} is similar to the one for
3886 @code{"begin"}, but where the purpose is to treat @code{"else if"}
3887 as a single unit, so as to align a sequence of tests rather than indent
3888 each test further to the right. This function does this only in the
3889 case where the @code{"if"} is not placed on a separate line, hence the
3890 @code{smie-rule-bolp} test.
3891
3892 If we know that the @code{"else"} is always aligned with its @code{"if"}
3893 and is always at the beginning of a line, we can use a more efficient
3894 rule:
3895 @example
3896 ((equal token "if")
3897 (and (not (smie-rule-bolp)) (smie-rule-prev-p "else")
3898 (save-excursion
3899 (sample-smie-backward-token) ;Jump before the "else".
3900 (cons 'column (current-column)))))
3901 @end example
3902
3903 The advantage of this formulation is that it reuses the indentation of
3904 the previous @code{"else"}, rather than going all the way back to the
3905 first @code{"if"} of the sequence.
3906 @end itemize
3907
3908 @node Desktop Save Mode
3909 @section Desktop Save Mode
3910 @cindex desktop save mode
3911
3912 @dfn{Desktop Save Mode} is a feature to save the state of Emacs from
3913 one session to another. The user-level commands for using Desktop
3914 Save Mode are described in the GNU Emacs Manual (@pxref{Saving Emacs
3915 Sessions,,, emacs, the GNU Emacs Manual}). Modes whose buffers visit
3916 a file, don't have to do anything to use this feature.
3917
3918 For buffers not visiting a file to have their state saved, the major
3919 mode must bind the buffer local variable @code{desktop-save-buffer} to
3920 a non-@code{nil} value.
3921
3922 @defvar desktop-save-buffer
3923 If this buffer-local variable is non-@code{nil}, the buffer will have
3924 its state saved in the desktop file at desktop save. If the value is
3925 a function, it is called at desktop save with argument
3926 @var{desktop-dirname}, and its value is saved in the desktop file along
3927 with the state of the buffer for which it was called. When file names
3928 are returned as part of the auxiliary information, they should be
3929 formatted using the call
3930
3931 @example
3932 (desktop-file-name @var{file-name} @var{desktop-dirname})
3933 @end example
3934
3935 @end defvar
3936
3937 For buffers not visiting a file to be restored, the major mode must
3938 define a function to do the job, and that function must be listed in
3939 the alist @code{desktop-buffer-mode-handlers}.
3940
3941 @defvar desktop-buffer-mode-handlers
3942 Alist with elements
3943
3944 @example
3945 (@var{major-mode} . @var{restore-buffer-function})
3946 @end example
3947
3948 The function @var{restore-buffer-function} will be called with
3949 argument list
3950
3951 @example
3952 (@var{buffer-file-name} @var{buffer-name} @var{desktop-buffer-misc})
3953 @end example
3954
3955 and it should return the restored buffer.
3956 Here @var{desktop-buffer-misc} is the value returned by the function
3957 optionally bound to @code{desktop-save-buffer}.
3958 @end defvar
3959
3960 @ignore
3961 Local Variables:
3962 fill-column: 72
3963 End:
3964 @end ignore