Merge from trunk.
[bpt/emacs.git] / doc / lispref / modes.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990-1995, 1998-1999, 2001-2011 Free Software Foundation, Inc.
4 @c See the file elisp.texi for copying conditions.
5 @setfilename ../../info/modes
6 @node Modes, Documentation, Keymaps, Top
7 @chapter Major and Minor Modes
8 @cindex mode
9
10 A @dfn{mode} is a set of definitions that customize Emacs and can be
11 turned on and off while you edit. There are two varieties of modes:
12 @dfn{major modes}, which are mutually exclusive and used for editing
13 particular kinds of text, and @dfn{minor modes}, which provide features
14 that users can enable individually.
15
16 This chapter describes how to write both major and minor modes, how to
17 indicate them in the mode line, and how they run hooks supplied by the
18 user. For related topics such as keymaps and syntax tables, see
19 @ref{Keymaps}, and @ref{Syntax Tables}.
20
21 @menu
22 * Hooks:: How to use hooks; how to write code that provides hooks.
23 * Major Modes:: Defining major modes.
24 * Minor Modes:: Defining minor modes.
25 * Mode Line Format:: Customizing the text that appears in the mode line.
26 * Imenu:: Providing a menu of definitions made in a buffer.
27 * Font Lock Mode:: How modes can highlight text according to syntax.
28 * Auto-Indentation:: How to teach Emacs to indent for a major mode.
29 * Desktop Save Mode:: How modes can have buffer state saved between
30 Emacs sessions.
31 @end menu
32
33 @node Hooks
34 @section Hooks
35 @cindex hooks
36
37 A @dfn{hook} is a variable where you can store a function or functions
38 to be called on a particular occasion by an existing program. Emacs
39 provides hooks for the sake of customization. Most often, hooks are set
40 up in the init file (@pxref{Init File}), but Lisp programs can set them also.
41 @xref{Standard Hooks}, for a list of standard hook variables.
42
43 @cindex normal hook
44 Most of the hooks in Emacs are @dfn{normal hooks}. These variables
45 contain lists of functions to be called with no arguments. By
46 convention, whenever the hook name ends in @samp{-hook}, that tells
47 you it is normal. We try to make all hooks normal, as much as
48 possible, so that you can use them in a uniform way.
49
50 Every major mode command is supposed to run a normal hook called the
51 @dfn{mode hook} as the one of the last steps of initialization. This
52 makes it easy for a user to customize the behavior of the mode, by
53 overriding the buffer-local variable assignments already made by the
54 mode. Most minor mode functions also run a mode hook at the end. But
55 hooks are used in other contexts too. For example, the hook
56 @code{suspend-hook} runs just before Emacs suspends itself
57 (@pxref{Suspending Emacs}).
58
59 The recommended way to add a hook function to a normal hook is by
60 calling @code{add-hook} (see below). The hook functions may be any of
61 the valid kinds of functions that @code{funcall} accepts (@pxref{What
62 Is a Function}). Most normal hook variables are initially void;
63 @code{add-hook} knows how to deal with this. You can add hooks either
64 globally or buffer-locally with @code{add-hook}.
65
66 @cindex abnormal hook
67 If the hook variable's name does not end with @samp{-hook}, that
68 indicates it is probably an @dfn{abnormal hook}. That means the hook
69 functions are called with arguments, or their return values are used
70 in some way. The hook's documentation says how the functions are
71 called. You can use @code{add-hook} to add a function to an abnormal
72 hook, but you must write the function to follow the hook's calling
73 convention.
74
75 By convention, abnormal hook names end in @samp{-functions} or
76 @samp{-hooks}. If the variable's name ends in @samp{-function}, then
77 its value is just a single function, not a list of functions.
78
79 @menu
80 * Running Hooks:: How to run a hook.
81 * Setting Hooks:: How to put functions on a hook, or remove them.
82 @end menu
83
84 @node Running Hooks
85 @subsection Running Hooks
86
87 At the appropriate times, Emacs uses the @code{run-hooks} function
88 and the other functions below to run particular hooks.
89
90 @defun run-hooks &rest hookvars
91 This function takes one or more normal hook variable names as
92 arguments, and runs each hook in turn. Each argument should be a
93 symbol that is a normal hook variable. These arguments are processed
94 in the order specified.
95
96 If a hook variable has a non-@code{nil} value, that value should be a
97 list of functions. @code{run-hooks} calls all the functions, one by
98 one, with no arguments.
99
100 The hook variable's value can also be a single function---either a
101 lambda expression or a symbol with a function definition---which
102 @code{run-hooks} calls. But this usage is obsolete.
103
104 If the hook variable is buffer-local, the buffer-local variable will
105 be used instead of the global variable. However, if the buffer-local
106 variable contains the element @code{t}, the global hook variable will
107 be run as well.
108 @end defun
109
110 @defun run-hook-with-args hook &rest args
111 This function is the way to run an abnormal hook and always call all
112 of the hook functions. It calls each of the hook functions one by
113 one, passing each of them the arguments @var{args}.
114 @end defun
115
116 @defun run-hook-with-args-until-failure hook &rest args
117 This function is the way to run an abnormal hook until one of the hook
118 functions fails. It calls each of the hook functions, passing each of
119 them the arguments @var{args}, until some hook function returns
120 @code{nil}. It then stops and returns @code{nil}. If none of the
121 hook functions return @code{nil}, it returns a non-@code{nil} value.
122 @end defun
123
124 @defun run-hook-with-args-until-success hook &rest args
125 This function is the way to run an abnormal hook until a hook function
126 succeeds. It calls each of the hook functions, passing each of them
127 the arguments @var{args}, until some hook function returns
128 non-@code{nil}. Then it stops, and returns whatever was returned by
129 the last hook function that was called. If all hook functions return
130 @code{nil}, it returns @code{nil} as well.
131 @end defun
132
133 @node Setting Hooks
134 @subsection Setting Hooks
135
136 Here's an example that uses a mode hook to turn on Auto Fill mode when
137 in Lisp Interaction mode:
138
139 @example
140 (add-hook 'lisp-interaction-mode-hook 'turn-on-auto-fill)
141 @end example
142
143 @defun add-hook hook function &optional append local
144 This function is the handy way to add function @var{function} to hook
145 variable @var{hook}. You can use it for abnormal hooks as well as for
146 normal hooks. @var{function} can be any Lisp function that can accept
147 the proper number of arguments for @var{hook}. For example,
148
149 @example
150 (add-hook 'text-mode-hook 'my-text-hook-function)
151 @end example
152
153 @noindent
154 adds @code{my-text-hook-function} to the hook called @code{text-mode-hook}.
155
156 If @var{function} is already present in @var{hook} (comparing using
157 @code{equal}), then @code{add-hook} does not add it a second time.
158
159 If @var{function} has a non-@code{nil} property
160 @code{permanent-local-hook}, then @code{kill-all-local-variables} (or
161 changing major modes) won't delete it from the hook variable's local
162 value.
163
164 It is best to design your hook functions so that the order in which
165 they are executed does not matter. Any dependence on the order is
166 asking for trouble. However, the order is predictable: normally,
167 @var{function} goes at the front of the hook list, so it will be
168 executed first (barring another @code{add-hook} call). If the
169 optional argument @var{append} is non-@code{nil}, the new hook
170 function goes at the end of the hook list and will be executed last.
171
172 @code{add-hook} can handle the cases where @var{hook} is void or its
173 value is a single function; it sets or changes the value to a list of
174 functions.
175
176 If @var{local} is non-@code{nil}, that says to add @var{function} to the
177 buffer-local hook list instead of to the global hook list. This makes
178 the hook buffer-local and adds @code{t} to the buffer-local value. The
179 latter acts as a flag to run the hook functions in the default value as
180 well as in the local value.
181 @end defun
182
183 @defun remove-hook hook function &optional local
184 This function removes @var{function} from the hook variable
185 @var{hook}. It compares @var{function} with elements of @var{hook}
186 using @code{equal}, so it works for both symbols and lambda
187 expressions.
188
189 If @var{local} is non-@code{nil}, that says to remove @var{function}
190 from the buffer-local hook list instead of from the global hook list.
191 @end defun
192
193 @node Major Modes
194 @section Major Modes
195 @cindex major mode
196
197 @cindex major mode command
198 Major modes specialize Emacs for editing particular kinds of text.
199 Each buffer has one major mode at a time. Every major mode is
200 associated with a @dfn{major mode command}, whose name should end in
201 @samp{-mode}. This command takes care of switching to that mode in the
202 current buffer, by setting various buffer-local variables such as a
203 local keymap. @xref{Major Mode Conventions}.
204
205 The least specialized major mode is called @dfn{Fundamental mode},
206 which has no mode-specific definitions or variable settings.
207
208 @deffn Command fundamental-mode
209 This is the major mode command for Fundamental mode. Unlike other mode
210 commands, it does @emph{not} run any mode hooks (@pxref{Major Mode
211 Conventions}), since you are not supposed to customize this mode.
212 @end deffn
213
214 The easiest way to write a major mode is to use the macro
215 @code{define-derived-mode}, which sets up the new mode as a variant of
216 an existing major mode. @xref{Derived Modes}. We recommend using
217 @code{define-derived-mode} even if the new mode is not an obvious
218 derivative of another mode, as it automatically enforces many coding
219 conventions for you. @xref{Basic Major Modes}, for common modes to
220 derive from.
221
222 The standard GNU Emacs Lisp directory tree contains the code for
223 several major modes, in files such as @file{text-mode.el},
224 @file{texinfo.el}, @file{lisp-mode.el}, and @file{rmail.el}. You can
225 study these libraries to see how modes are written.
226
227 @defopt major-mode
228 The buffer-local value of this variable is a symbol naming the buffer's
229 current major mode. Its default value holds the default major mode for
230 new buffers. The standard default value is @code{fundamental-mode}.
231
232 If the default value is @code{nil}, then whenever Emacs creates a new
233 buffer via a command such as @kbd{C-x b} (@code{switch-to-buffer}), the
234 new buffer is put in the major mode of the previously current buffer.
235 As an exception, if the major mode of the previous buffer has a
236 @code{mode-class} symbol property with value @code{special}, the new
237 buffer is put in Fundamental mode (@pxref{Major Mode Conventions}).
238 @end defopt
239
240 @menu
241 * Major Mode Conventions:: Coding conventions for keymaps, etc.
242 * Auto Major Mode:: How Emacs chooses the major mode automatically.
243 * Mode Help:: Finding out how to use a mode.
244 * Derived Modes:: Defining a new major mode based on another major
245 mode.
246 * Basic Major Modes:: Modes that other modes are often derived from.
247 * Generic Modes:: Defining a simple major mode that supports
248 comment syntax and Font Lock mode.
249 * Mode Hooks:: Hooks run at the end of major mode commands.
250 * Example Major Modes:: Text mode and Lisp modes.
251 @end menu
252
253 @node Major Mode Conventions
254 @subsection Major Mode Conventions
255 @cindex major mode conventions
256 @cindex conventions for writing major modes
257
258 The code for every major mode should follow various coding
259 conventions, including conventions for local keymap and syntax table
260 initialization, function and variable names, and hooks.
261
262 If you use the @code{define-derived-mode} macro, it will take care of
263 many of these conventions automatically. @xref{Derived Modes}. Note
264 also that fundamental mode is an exception to many of these conventions,
265 because its definition is to present the global state of Emacs.
266
267 The following list of conventions is only partial. Each major mode
268 should aim for consistency in general with other Emacs major modes, as
269 this makes Emacs as a whole more coherent. It is impossible to list
270 here all the possible points where this issue might come up; if the
271 Emacs developers point out an area where your major mode deviates from
272 the usual conventions, please make it compatible.
273
274 @itemize @bullet
275 @item
276 Define a major mode command whose name ends in @samp{-mode}. When
277 called with no arguments, this command should switch to the new mode in
278 the current buffer by setting up the keymap, syntax table, and
279 buffer-local variables in an existing buffer. It should not change the
280 buffer's contents.
281
282 @item
283 Write a documentation string for this command that describes the special
284 commands available in this mode. @xref{Mode Help}.
285
286 The documentation string may include the special documentation
287 substrings, @samp{\[@var{command}]}, @samp{\@{@var{keymap}@}}, and
288 @samp{\<@var{keymap}>}, which allow the help display to adapt
289 automatically to the user's own key bindings. @xref{Keys in
290 Documentation}.
291
292 @item
293 The major mode command should start by calling
294 @code{kill-all-local-variables}. This runs the normal hook
295 @code{change-major-mode-hook}, then gets rid of the buffer-local
296 variables of the major mode previously in effect. @xref{Creating
297 Buffer-Local}.
298
299 @item
300 The major mode command should set the variable @code{major-mode} to the
301 major mode command symbol. This is how @code{describe-mode} discovers
302 which documentation to print.
303
304 @item
305 The major mode command should set the variable @code{mode-name} to the
306 ``pretty'' name of the mode, usually a string (but see @ref{Mode Line
307 Data}, for other possible forms). The name of the mode appears
308 in the mode line.
309
310 @item
311 @cindex functions in modes
312 Since all global names are in the same name space, all the global
313 variables, constants, and functions that are part of the mode should
314 have names that start with the major mode name (or with an abbreviation
315 of it if the name is long). @xref{Coding Conventions}.
316
317 @item
318 In a major mode for editing some kind of structured text, such as a
319 programming language, indentation of text according to structure is
320 probably useful. So the mode should set @code{indent-line-function}
321 to a suitable function, and probably customize other variables
322 for indentation. @xref{Auto-Indentation}.
323
324 @item
325 @cindex keymaps in modes
326 The major mode should usually have its own keymap, which is used as the
327 local keymap in all buffers in that mode. The major mode command should
328 call @code{use-local-map} to install this local map. @xref{Active
329 Keymaps}, for more information.
330
331 This keymap should be stored permanently in a global variable named
332 @code{@var{modename}-mode-map}. Normally the library that defines the
333 mode sets this variable.
334
335 @xref{Tips for Defining}, for advice about how to write the code to set
336 up the mode's keymap variable.
337
338 @item
339 The key sequences bound in a major mode keymap should usually start with
340 @kbd{C-c}, followed by a control character, a digit, or @kbd{@{},
341 @kbd{@}}, @kbd{<}, @kbd{>}, @kbd{:} or @kbd{;}. The other punctuation
342 characters are reserved for minor modes, and ordinary letters are
343 reserved for users.
344
345 A major mode can also rebind the keys @kbd{M-n}, @kbd{M-p} and
346 @kbd{M-s}. The bindings for @kbd{M-n} and @kbd{M-p} should normally
347 be some kind of ``moving forward and backward,'' but this does not
348 necessarily mean cursor motion.
349
350 It is legitimate for a major mode to rebind a standard key sequence if
351 it provides a command that does ``the same job'' in a way better
352 suited to the text this mode is used for. For example, a major mode
353 for editing a programming language might redefine @kbd{C-M-a} to
354 ``move to the beginning of a function'' in a way that works better for
355 that language.
356
357 It is also legitimate for a major mode to rebind a standard key
358 sequence whose standard meaning is rarely useful in that mode. For
359 instance, minibuffer modes rebind @kbd{M-r}, whose standard meaning is
360 rarely of any use in the minibuffer. Major modes such as Dired or
361 Rmail that do not allow self-insertion of text can reasonably redefine
362 letters and other printing characters as special commands.
363
364 @item
365 Major modes for editing text should not define @key{RET} to do
366 anything other than insert a newline. However, it is ok for
367 specialized modes for text that users don't directly edit, such as
368 Dired and Info modes, to redefine @key{RET} to do something entirely
369 different.
370
371 @item
372 Major modes should not alter options that are primarily a matter of user
373 preference, such as whether Auto-Fill mode is enabled. Leave this to
374 each user to decide. However, a major mode should customize other
375 variables so that Auto-Fill mode will work usefully @emph{if} the user
376 decides to use it.
377
378 @item
379 @cindex syntax tables in modes
380 The mode may have its own syntax table or may share one with other
381 related modes. If it has its own syntax table, it should store this in
382 a variable named @code{@var{modename}-mode-syntax-table}. @xref{Syntax
383 Tables}.
384
385 @item
386 If the mode handles a language that has a syntax for comments, it should
387 set the variables that define the comment syntax. @xref{Options for
388 Comments,, Options Controlling Comments, emacs, The GNU Emacs Manual}.
389
390 @item
391 @cindex abbrev tables in modes
392 The mode may have its own abbrev table or may share one with other
393 related modes. If it has its own abbrev table, it should store this
394 in a variable named @code{@var{modename}-mode-abbrev-table}. If the
395 major mode command defines any abbrevs itself, it should pass @code{t}
396 for the @var{system-flag} argument to @code{define-abbrev}.
397 @xref{Defining Abbrevs}.
398
399 @item
400 The mode should specify how to do highlighting for Font Lock mode, by
401 setting up a buffer-local value for the variable
402 @code{font-lock-defaults} (@pxref{Font Lock Mode}).
403
404 @item
405 Each face that the mode defines should, if possible, inherit from an
406 existing Emacs face. This reduces the chance of conflicting with a
407 user's face customizations. Useful faces include:
408
409 @table @asis
410 @item @code{highlight}
411 for stretches of text that should temporarily stand out.
412
413 @item @code{match}
414 for text matching a search command.
415
416 @item @code{link} and @code{link-visited}
417 for clickable text buttons that send the user to a different buffer or
418 ``location''.
419
420 @item @code{button}
421 for clickable text buttons that perform other actions.
422
423 @item @asis{Font Lock faces}
424 for other kinds of syntactic highlighting, if highlighting is not
425 handled by Font Lock mode or some Font Lock faces are not in use.
426 @xref{Faces for Font Lock}, for how to assign Font Lock faces.
427 @end table
428
429 @item
430 The mode should specify how Imenu should find the definitions or
431 sections of a buffer, by setting up a buffer-local value for the
432 variable @code{imenu-generic-expression}, for the two variables
433 @code{imenu-prev-index-position-function} and
434 @code{imenu-extract-index-name-function}, or for the variable
435 @code{imenu-create-index-function} (@pxref{Imenu}).
436
437 @item
438 The mode can specify a local value for
439 @code{eldoc-documentation-function} to tell ElDoc mode how to handle
440 this mode.
441
442 @item
443 The mode can specify how to complete various keywords by adding
444 to the special hook @code{completion-at-point-functions}.
445
446 @item
447 Use @code{defvar} or @code{defcustom} to set mode-related variables, so
448 that they are not reinitialized if they already have a value. (Such
449 reinitialization could discard customizations made by the user.)
450
451 @item
452 @cindex buffer-local variables in modes
453 To make a buffer-local binding for an Emacs customization variable, use
454 @code{make-local-variable} in the major mode command, not
455 @code{make-variable-buffer-local}. The latter function would make the
456 variable local to every buffer in which it is subsequently set, which
457 would affect buffers that do not use this mode. It is undesirable for a
458 mode to have such global effects. @xref{Buffer-Local Variables}.
459
460 With rare exceptions, the only reasonable way to use
461 @code{make-variable-buffer-local} in a Lisp package is for a variable
462 which is used only within that package. Using it on a variable used by
463 other packages would interfere with them.
464
465 @item
466 @cindex mode hook
467 @cindex major mode hook
468 Each major mode should have a normal @dfn{mode hook} named
469 @code{@var{modename}-mode-hook}. The very last thing the major mode command
470 should do is to call @code{run-mode-hooks}. This runs the mode hook,
471 and then runs the normal hook @code{after-change-major-mode-hook}.
472 @xref{Mode Hooks}.
473
474 @item
475 The major mode command may start by calling some other major mode
476 command (called the @dfn{parent mode}) and then alter some of its
477 settings. A mode that does this is called a @dfn{derived mode}. The
478 recommended way to define one is to use the @code{define-derived-mode}
479 macro, but this is not required. Such a mode should call the parent
480 mode command inside a @code{delay-mode-hooks} form. (Using
481 @code{define-derived-mode} does this automatically.) @xref{Derived
482 Modes}, and @ref{Mode Hooks}.
483
484 @item
485 If something special should be done if the user switches a buffer from
486 this mode to any other major mode, this mode can set up a buffer-local
487 value for @code{change-major-mode-hook} (@pxref{Creating Buffer-Local}).
488
489 @item
490 If this mode is appropriate only for specially-prepared text produced by
491 the mode itself (rather than by the user typing at the keyboard or by an
492 external file), then the major mode command symbol should have a
493 property named @code{mode-class} with value @code{special}, put on as
494 follows:
495
496 @kindex mode-class @r{(property)}
497 @cindex @code{special} modes
498 @example
499 (put 'funny-mode 'mode-class 'special)
500 @end example
501
502 @noindent
503 This tells Emacs that new buffers created while the current buffer is in
504 Funny mode should not be put in Funny mode, even though the default
505 value of @code{major-mode} is @code{nil}. By default, the value of
506 @code{nil} for @code{major-mode} means to use the current buffer's major
507 mode when creating new buffers (@pxref{Auto Major Mode}), but with such
508 @code{special} modes, Fundamental mode is used instead. Modes such as
509 Dired, Rmail, and Buffer List use this feature.
510
511 The @code{define-derived-mode} macro automatically marks the derived
512 mode as special if the parent mode is special. Special mode is a
513 convenient parent for such modes to inherit from; @xref{Basic Major
514 Modes}.
515
516 @item
517 If you want to make the new mode the default for files with certain
518 recognizable names, add an element to @code{auto-mode-alist} to select
519 the mode for those file names (@pxref{Auto Major Mode}). If you
520 define the mode command to autoload, you should add this element in
521 the same file that calls @code{autoload}. If you use an autoload
522 cookie for the mode command, you can also use an autoload cookie for
523 the form that adds the element (@pxref{autoload cookie}). If you do
524 not autoload the mode command, it is sufficient to add the element in
525 the file that contains the mode definition.
526
527 @item
528 In the comments that document the file, you should provide a sample
529 @code{autoload} form and an example of how to add to
530 @code{auto-mode-alist}, that users can include in their init files
531 (@pxref{Init File}).
532
533 @item
534 @cindex mode loading
535 The top-level forms in the file defining the mode should be written so
536 that they may be evaluated more than once without adverse consequences.
537 Even if you never load the file more than once, someone else will.
538 @end itemize
539
540 @node Auto Major Mode
541 @subsection How Emacs Chooses a Major Mode
542 @cindex major mode, automatic selection
543
544 Based on information in the file name or in the file itself, Emacs
545 automatically selects a major mode for the new buffer when a file is
546 visited. It also processes local variables specified in the file text.
547
548 @deffn Command normal-mode &optional find-file
549 This function establishes the proper major mode and buffer-local variable
550 bindings for the current buffer. First it calls @code{set-auto-mode}
551 (see below), then it runs @code{hack-local-variables} to parse, and
552 bind or evaluate as appropriate, the file's local variables
553 (@pxref{File Local Variables}).
554
555 If the @var{find-file} argument to @code{normal-mode} is non-@code{nil},
556 @code{normal-mode} assumes that the @code{find-file} function is calling
557 it. In this case, it may process local variables in the @samp{-*-}
558 line or at the end of the file. The variable
559 @code{enable-local-variables} controls whether to do so. @xref{File
560 Variables, , Local Variables in Files, emacs, The GNU Emacs Manual},
561 for the syntax of the local variables section of a file.
562
563 If you run @code{normal-mode} interactively, the argument
564 @var{find-file} is normally @code{nil}. In this case,
565 @code{normal-mode} unconditionally processes any file local variables.
566
567 The function calls @code{set-auto-mode} to choose a major mode. If this
568 does not specify a mode, the buffer stays in the major mode determined
569 by the default value of @code{major-mode} (see below).
570
571 @cindex file mode specification error
572 @code{normal-mode} uses @code{condition-case} around the call to the
573 major mode command, so errors are caught and reported as a @samp{File
574 mode specification error}, followed by the original error message.
575 @end deffn
576
577 @defun set-auto-mode &optional keep-mode-if-same
578 @cindex visited file mode
579 This function selects the major mode that is appropriate for the
580 current buffer. It bases its decision (in order of precedence) on the
581 @w{@samp{-*-}} line, on any @samp{mode:} local variable near the end of
582 a file, on the @w{@samp{#!}} line (using @code{interpreter-mode-alist}),
583 on the text at the beginning of the buffer (using
584 @code{magic-mode-alist}), and finally on the visited file name (using
585 @code{auto-mode-alist}). @xref{Choosing Modes, , How Major Modes are
586 Chosen, emacs, The GNU Emacs Manual}. If @code{enable-local-variables}
587 is @code{nil}, @code{set-auto-mode} does not check the @w{@samp{-*-}}
588 line, or near the end of the file, for any mode tag.
589
590 If @var{keep-mode-if-same} is non-@code{nil}, this function does not
591 call the mode command if the buffer is already in the proper major
592 mode. For instance, @code{set-visited-file-name} sets this to
593 @code{t} to avoid killing buffer local variables that the user may
594 have set.
595 @end defun
596
597 @defun set-buffer-major-mode buffer
598 This function sets the major mode of @var{buffer} to the default value of
599 @code{major-mode}; if that is @code{nil}, it uses the
600 current buffer's major mode (if that is suitable). As an exception,
601 if @var{buffer}'s name is @samp{*scratch*}, it sets the mode to
602 @code{initial-major-mode}.
603
604 The low-level primitives for creating buffers do not use this function,
605 but medium-level commands such as @code{switch-to-buffer} and
606 @code{find-file-noselect} use it whenever they create buffers.
607 @end defun
608
609 @defopt initial-major-mode
610 @cindex @samp{*scratch*}
611 The value of this variable determines the major mode of the initial
612 @samp{*scratch*} buffer. The value should be a symbol that is a major
613 mode command. The default value is @code{lisp-interaction-mode}.
614 @end defopt
615
616 @defvar interpreter-mode-alist
617 This variable specifies major modes to use for scripts that specify a
618 command interpreter in a @samp{#!} line. Its value is an alist with
619 elements of the form @code{(@var{interpreter} . @var{mode})}; for
620 example, @code{("perl" . perl-mode)} is one element present by
621 default. The element says to use mode @var{mode} if the file
622 specifies an interpreter which matches @var{interpreter}.
623 @end defvar
624
625 @defvar magic-mode-alist
626 This variable's value is an alist with elements of the form
627 @code{(@var{regexp} . @var{function})}, where @var{regexp} is a
628 regular expression and @var{function} is a function or @code{nil}.
629 After visiting a file, @code{set-auto-mode} calls @var{function} if
630 the text at the beginning of the buffer matches @var{regexp} and
631 @var{function} is non-@code{nil}; if @var{function} is @code{nil},
632 @code{auto-mode-alist} gets to decide the mode.
633 @end defvar
634
635 @defvar magic-fallback-mode-alist
636 This works like @code{magic-mode-alist}, except that it is handled
637 only if @code{auto-mode-alist} does not specify a mode for this file.
638 @end defvar
639
640 @defvar auto-mode-alist
641 This variable contains an association list of file name patterns
642 (regular expressions) and corresponding major mode commands. Usually,
643 the file name patterns test for suffixes, such as @samp{.el} and
644 @samp{.c}, but this need not be the case. An ordinary element of the
645 alist looks like @code{(@var{regexp} . @var{mode-function})}.
646
647 For example,
648
649 @smallexample
650 @group
651 (("\\`/tmp/fol/" . text-mode)
652 ("\\.texinfo\\'" . texinfo-mode)
653 ("\\.texi\\'" . texinfo-mode)
654 @end group
655 @group
656 ("\\.el\\'" . emacs-lisp-mode)
657 ("\\.c\\'" . c-mode)
658 ("\\.h\\'" . c-mode)
659 @dots{})
660 @end group
661 @end smallexample
662
663 When you visit a file whose expanded file name (@pxref{File Name
664 Expansion}), with version numbers and backup suffixes removed using
665 @code{file-name-sans-versions} (@pxref{File Name Components}), matches
666 a @var{regexp}, @code{set-auto-mode} calls the corresponding
667 @var{mode-function}. This feature enables Emacs to select the proper
668 major mode for most files.
669
670 If an element of @code{auto-mode-alist} has the form @code{(@var{regexp}
671 @var{function} t)}, then after calling @var{function}, Emacs searches
672 @code{auto-mode-alist} again for a match against the portion of the file
673 name that did not match before. This feature is useful for
674 uncompression packages: an entry of the form @code{("\\.gz\\'"
675 @var{function} t)} can uncompress the file and then put the uncompressed
676 file in the proper mode according to the name sans @samp{.gz}.
677
678 Here is an example of how to prepend several pattern pairs to
679 @code{auto-mode-alist}. (You might use this sort of expression in your
680 init file.)
681
682 @smallexample
683 @group
684 (setq auto-mode-alist
685 (append
686 ;; @r{File name (within directory) starts with a dot.}
687 '(("/\\.[^/]*\\'" . fundamental-mode)
688 ;; @r{File name has no dot.}
689 ("/[^\\./]*\\'" . fundamental-mode)
690 ;; @r{File name ends in @samp{.C}.}
691 ("\\.C\\'" . c++-mode))
692 auto-mode-alist))
693 @end group
694 @end smallexample
695 @end defvar
696
697 @node Mode Help
698 @subsection Getting Help about a Major Mode
699 @cindex mode help
700 @cindex help for major mode
701 @cindex documentation for major mode
702
703 The @code{describe-mode} function is provides information about major
704 modes. It is normally bound to @kbd{C-h m}. It uses the value of the
705 variable @code{major-mode} (which is why every major mode command needs
706 to set this variable).
707
708 @deffn Command describe-mode
709 This function displays the documentation of the current major mode.
710
711 The @code{describe-mode} function calls the @code{documentation}
712 function using the value of @code{major-mode} as an argument. Thus, it
713 displays the documentation string of the major mode command.
714 (@xref{Accessing Documentation}.)
715 @end deffn
716
717 @defvar major-mode
718 This buffer-local variable holds the symbol for the current buffer's
719 major mode. This symbol should have a function definition that is the
720 command to switch to that major mode. The @code{describe-mode}
721 function uses the documentation string of the function as the
722 documentation of the major mode.
723 @end defvar
724
725 @node Derived Modes
726 @subsection Defining Derived Modes
727 @cindex derived mode
728
729 The recommended way to define a new major mode is to derive it from an
730 existing one using @code{define-derived-mode}. If there is no closely
731 related mode, you should inherit from either @code{text-mode},
732 @code{special-mode}, or @code{prog-mode}. @xref{Basic Major Modes}. If
733 none of these are suitable, you can inherit from @code{fundamental-mode}
734 (@pxref{Major Modes}).
735
736 @defmac define-derived-mode variant parent name docstring keyword-args@dots{} body@dots{}
737 This macro defines @var{variant} as a major mode command, using
738 @var{name} as the string form of the mode name. @var{variant} and
739 @var{parent} should be unquoted symbols.
740
741 The new command @var{variant} is defined to call the function
742 @var{parent}, then override certain aspects of that parent mode:
743
744 @itemize @bullet
745 @item
746 The new mode has its own sparse keymap, named
747 @code{@var{variant}-map}. @code{define-derived-mode}
748 makes the parent mode's keymap the parent of the new map, unless
749 @code{@var{variant}-map} is already set and already has a parent.
750
751 @item
752 The new mode has its own syntax table, kept in the variable
753 @code{@var{variant}-syntax-table}, unless you override this using the
754 @code{:syntax-table} keyword (see below). @code{define-derived-mode}
755 makes the parent mode's syntax-table the parent of
756 @code{@var{variant}-syntax-table}, unless the latter is already set
757 and already has a parent different from the standard syntax table.
758
759 @item
760 The new mode has its own abbrev table, kept in the variable
761 @code{@var{variant}-abbrev-table}, unless you override this using the
762 @code{:abbrev-table} keyword (see below).
763
764 @item
765 The new mode has its own mode hook, @code{@var{variant}-hook}. It
766 runs this hook, after running the hooks of its ancestor modes, with
767 @code{run-mode-hooks}, as the last thing it does. @xref{Mode Hooks}.
768 @end itemize
769
770 In addition, you can specify how to override other aspects of
771 @var{parent} with @var{body}. The command @var{variant}
772 evaluates the forms in @var{body} after setting up all its usual
773 overrides, just before running the mode hooks.
774
775 If @var{parent} has a non-@code{nil} @code{mode-class} symbol
776 property, then @code{define-derived-mode} sets the @code{mode-class}
777 property of @var{variant} to the same value. This ensures, for
778 example, that if @var{parent} is a special mode, then @var{variant} is
779 also a special mode (@pxref{Major Mode Conventions}).
780
781 You can also specify @code{nil} for @var{parent}. This gives the new
782 mode no parent. Then @code{define-derived-mode} behaves as described
783 above, but, of course, omits all actions connected with @var{parent}.
784
785 The argument @var{docstring} specifies the documentation string for
786 the new mode. @code{define-derived-mode} adds some general
787 information about the mode's hook, followed by the mode's keymap, at
788 the end of this docstring. If you omit @var{docstring},
789 @code{define-derived-mode} generates a documentation string.
790
791 The @var{keyword-args} are pairs of keywords and values. The values
792 are evaluated. The following keywords are currently supported:
793
794 @table @code
795 @item :syntax-table
796 You can use this to explicitly specify a syntax table for the new
797 mode. If you specify a @code{nil} value, the new mode uses the same
798 syntax table as @var{parent}, or the standard syntax table if
799 @var{parent} is @code{nil}. (Note that this does @emph{not} follow
800 the convention used for non-keyword arguments that a @code{nil} value
801 is equivalent with not specifying the argument.)
802
803 @item :abbrev-table
804 You can use this to explicitly specify an abbrev table for the new
805 mode. If you specify a @code{nil} value, the new mode uses the same
806 abbrev table as @var{parent}, or @code{fundamental-mode-abbrev-table}
807 if @var{parent} is @code{nil}. (Again, a @code{nil} value is
808 @emph{not} equivalent to not specifying this keyword.)
809
810 @item :group
811 If this is specified, the value should be the customization group for
812 this mode. (Not all major modes have one.) Only the (still
813 experimental and unadvertised) command @code{customize-mode} currently
814 uses this. @code{define-derived-mode} does @emph{not} automatically
815 define the specified customization group.
816 @end table
817
818 Here is a hypothetical example:
819
820 @example
821 (define-derived-mode hypertext-mode
822 text-mode "Hypertext"
823 "Major mode for hypertext.
824 \\@{hypertext-mode-map@}"
825 (setq case-fold-search nil))
826
827 (define-key hypertext-mode-map
828 [down-mouse-3] 'do-hyper-link)
829 @end example
830
831 Do not write an @code{interactive} spec in the definition;
832 @code{define-derived-mode} does that automatically.
833 @end defmac
834
835 @defun derived-mode-p &rest modes
836 This function returns non-@code{nil} if the current major mode is
837 derived from any of the major modes given by the symbols @var{modes}.
838 @end defun
839
840 @node Basic Major Modes
841 @subsection Basic Major Modes
842
843 Apart from Fundamental mode, there are three major modes that other
844 major modes commonly derive from: Text mode, Prog mode, and Special
845 mode. While Text mode is useful in its own right (e.g. for editing
846 files ending in @file{.txt}), Prog mode and Special mode exist mainly to
847 let other modes derive from them.
848
849 @vindex prog-mode-hook
850 As far as possible, new major modes should be derived, either directly
851 or indirectly, from one of these three modes. One reason is that this
852 allows users to customize a single mode hook
853 (e.g. @code{prog-mode-hook}) for an entire family of relevant modes
854 (e.g. all programming language modes).
855
856 @deffn Command text-mode
857 Text mode is a major mode for editing human languages. It defines the
858 @samp{"} and @samp{\} characters as having punctuation syntax
859 (@pxref{Syntax Class Table}), and binds @kbd{M-@key{TAB}} to
860 @code{ispell-complete-word} (@pxref{Spelling,,, emacs, The GNU Emacs
861 Manual}).
862
863 An example of a major mode derived from Text mode is HTML mode.
864 @xref{HTML Mode,,SGML and HTML Modes, emacs, The GNU Emacs Manual}.
865 @end deffn
866
867 @deffn Command prog-mode
868 Prog mode is a basic major mode for buffers containing programming
869 language source code. Most of the programming language major modes
870 built into Emacs are derived from it.
871
872 Prog mode binds @code{parse-sexp-ignore-comments} to @code{t}
873 (@pxref{Motion via Parsing}) and @code{bidi-paragraph-direction} to
874 @code{left-to-right} (@pxref{Bidirectional Display}).
875 @end deffn
876
877 @deffn Command special-mode
878 Special mode is a basic major mode for buffers containing text that is
879 produced specially by Emacs, rather than from a file. Major modes
880 derived from Special mode are given a @code{mode-class} property of
881 @code{special} (@pxref{Major Mode Conventions}).
882
883 Special mode sets the buffer to read-only. Its keymap defines several
884 common bindings, including @kbd{q} for @code{quit-window}, @kbd{z} for
885 @code{kill-this-buffer}, and @kbd{g} for @code{revert-buffer}
886 (@pxref{Reverting}).
887
888 An example of a major mode derived from Special mode is Buffer Menu
889 mode, which is used by the @samp{*Buffer List*} buffer. @xref{List
890 Buffers,,Listing Existing Buffers, emacs, The GNU Emacs Manual}.
891 @end deffn
892
893 @node Generic Modes
894 @subsection Generic Modes
895 @cindex generic mode
896
897 @dfn{Generic modes} are simple major modes with basic support for
898 comment syntax and Font Lock mode. To define a generic mode, use the
899 macro @code{define-generic-mode}. See the file @file{generic-x.el}
900 for some examples of the use of @code{define-generic-mode}.
901
902 @defmac define-generic-mode mode comment-list keyword-list font-lock-list auto-mode-list function-list &optional docstring
903 This macro defines a generic mode command named @var{mode} (a symbol,
904 not quoted). The optional argument @var{docstring} is the
905 documentation for the mode command. If you do not supply it,
906 @code{define-generic-mode} generates one by default.
907
908 The argument @var{comment-list} is a list in which each element is
909 either a character, a string of one or two characters, or a cons cell.
910 A character or a string is set up in the mode's syntax table as a
911 ``comment starter.'' If the entry is a cons cell, the @sc{car} is set
912 up as a ``comment starter'' and the @sc{cdr} as a ``comment ender.''
913 (Use @code{nil} for the latter if you want comments to end at the end
914 of the line.) Note that the syntax table mechanism has limitations
915 about what comment starters and enders are actually possible.
916 @xref{Syntax Tables}.
917
918 The argument @var{keyword-list} is a list of keywords to highlight
919 with @code{font-lock-keyword-face}. Each keyword should be a string.
920 Meanwhile, @var{font-lock-list} is a list of additional expressions to
921 highlight. Each element of this list should have the same form as an
922 element of @code{font-lock-keywords}. @xref{Search-based
923 Fontification}.
924
925 The argument @var{auto-mode-list} is a list of regular expressions to
926 add to the variable @code{auto-mode-alist}. They are added by the execution
927 of the @code{define-generic-mode} form, not by expanding the macro call.
928
929 Finally, @var{function-list} is a list of functions for the mode
930 command to call for additional setup. It calls these functions just
931 before it runs the mode hook variable @code{@var{mode}-hook}.
932 @end defmac
933
934 @node Mode Hooks
935 @subsection Mode Hooks
936
937 Every major mode command should finish by running its mode hook and
938 the mode-independent normal hook @code{after-change-major-mode-hook}.
939 It does this by calling @code{run-mode-hooks}. If the major mode is a
940 derived mode, that is if it calls another major mode (the parent mode)
941 in its body, it should do this inside @code{delay-mode-hooks} so that
942 the parent won't run these hooks itself. Instead, the derived mode's
943 call to @code{run-mode-hooks} runs the parent's mode hook too.
944 @xref{Major Mode Conventions}.
945
946 Emacs versions before Emacs 22 did not have @code{delay-mode-hooks}.
947 When user-implemented major modes have not been updated to use it,
948 they won't entirely follow these conventions: they may run the
949 parent's mode hook too early, or fail to run
950 @code{after-change-major-mode-hook}. If you encounter such a major
951 mode, please correct it to follow these conventions.
952
953 When you defined a major mode using @code{define-derived-mode}, it
954 automatically makes sure these conventions are followed. If you
955 define a major mode ``by hand,'' not using @code{define-derived-mode},
956 use the following functions to handle these conventions automatically.
957
958 @defun run-mode-hooks &rest hookvars
959 Major modes should run their mode hook using this function. It is
960 similar to @code{run-hooks} (@pxref{Hooks}), but it also runs
961 @code{after-change-major-mode-hook}.
962
963 When this function is called during the execution of a
964 @code{delay-mode-hooks} form, it does not run the hooks immediately.
965 Instead, it arranges for the next call to @code{run-mode-hooks} to run
966 them.
967 @end defun
968
969 @defmac delay-mode-hooks body@dots{}
970 When one major mode command calls another, it should do so inside of
971 @code{delay-mode-hooks}.
972
973 This macro executes @var{body}, but tells all @code{run-mode-hooks}
974 calls during the execution of @var{body} to delay running their hooks.
975 The hooks will actually run during the next call to
976 @code{run-mode-hooks} after the end of the @code{delay-mode-hooks}
977 construct.
978 @end defmac
979
980 @defvar after-change-major-mode-hook
981 This is a normal hook run by @code{run-mode-hooks}. It is run at the
982 very end of every properly-written major mode command.
983 @end defvar
984
985 @node Example Major Modes
986 @subsection Major Mode Examples
987
988 Text mode is perhaps the simplest mode besides Fundamental mode.
989 Here are excerpts from @file{text-mode.el} that illustrate many of
990 the conventions listed above:
991
992 @smallexample
993 @group
994 ;; @r{Create the syntax table for this mode.}
995 (defvar text-mode-syntax-table
996 (let ((st (make-syntax-table)))
997 (modify-syntax-entry ?\" ". " st)
998 (modify-syntax-entry ?\\ ". " st)
999 ;; Add `p' so M-c on `hello' leads to `Hello', not `hello'.
1000 (modify-syntax-entry ?' "w p" st)
1001 st)
1002 "Syntax table used while in `text-mode'.")
1003 @end group
1004
1005 ;; @r{Create the keymap for this mode.}
1006 @group
1007 (defvar text-mode-map
1008 (let ((map (make-sparse-keymap)))
1009 (define-key map "\e\t" 'ispell-complete-word)
1010 (define-key map "\es" 'center-line)
1011 (define-key map "\eS" 'center-paragraph)
1012 map)
1013 "Keymap for `text-mode'.
1014 Many other modes, such as Mail mode, Outline mode
1015 and Indented Text mode, inherit all the commands
1016 defined in this map.")
1017 @end group
1018 @end smallexample
1019
1020 Here is how the actual mode command is defined now:
1021
1022 @smallexample
1023 @group
1024 (define-derived-mode text-mode nil "Text"
1025 "Major mode for editing text written for humans to read.
1026 In this mode, paragraphs are delimited only by blank or white lines.
1027 You can thus get the full benefit of adaptive filling
1028 (see the variable `adaptive-fill-mode').
1029 \\@{text-mode-map@}
1030 Turning on Text mode runs the normal hook `text-mode-hook'."
1031 @end group
1032 @group
1033 (set (make-local-variable 'text-mode-variant) t)
1034 ;; @r{These two lines are a feature added recently.}
1035 (set (make-local-variable 'require-final-newline)
1036 mode-require-final-newline)
1037 (set (make-local-variable 'indent-line-function) 'indent-relative))
1038 @end group
1039 @end smallexample
1040
1041 @noindent
1042 (The last line is redundant nowadays, since @code{indent-relative} is
1043 the default value, and we'll delete it in a future version.)
1044
1045 Here is how it was defined formerly, before
1046 @code{define-derived-mode} existed:
1047
1048 @smallexample
1049 @group
1050 ;; @r{This isn't needed nowadays, since @code{define-derived-mode} does it.}
1051 (define-abbrev-table 'text-mode-abbrev-table ()
1052 "Abbrev table used while in text mode.")
1053 @end group
1054
1055 @group
1056 (defun text-mode ()
1057 "Major mode for editing text intended for humans to read...
1058 Special commands: \\@{text-mode-map@}
1059 @end group
1060 @group
1061 Turning on text-mode runs the hook `text-mode-hook'."
1062 (interactive)
1063 (kill-all-local-variables)
1064 (use-local-map text-mode-map)
1065 @end group
1066 @group
1067 (setq local-abbrev-table text-mode-abbrev-table)
1068 (set-syntax-table text-mode-syntax-table)
1069 @end group
1070 @group
1071 ;; @r{These four lines are absent from the current version}
1072 ;; @r{not because this is done some other way, but because}
1073 ;; @r{nowadays Text mode uses the normal definition of paragraphs.}
1074 (set (make-local-variable 'paragraph-start)
1075 (concat "[ \t]*$\\|" page-delimiter))
1076 (set (make-local-variable 'paragraph-separate) paragraph-start)
1077 (set (make-local-variable 'indent-line-function) 'indent-relative-maybe)
1078 @end group
1079 @group
1080 (setq mode-name "Text")
1081 (setq major-mode 'text-mode)
1082 (run-mode-hooks 'text-mode-hook)) ; @r{Finally, this permits the user to}
1083 ; @r{customize the mode with a hook.}
1084 @end group
1085 @end smallexample
1086
1087 @cindex @file{lisp-mode.el}
1088 The three Lisp modes (Lisp mode, Emacs Lisp mode, and Lisp
1089 Interaction mode) have more features than Text mode and the code is
1090 correspondingly more complicated. Here are excerpts from
1091 @file{lisp-mode.el} that illustrate how these modes are written.
1092
1093 @cindex syntax table example
1094 @smallexample
1095 @group
1096 ;; @r{Create mode-specific table variables.}
1097 (defvar lisp-mode-syntax-table nil "")
1098 (defvar lisp-mode-abbrev-table nil "")
1099 @end group
1100
1101 @group
1102 (defvar emacs-lisp-mode-syntax-table
1103 (let ((table (make-syntax-table)))
1104 (let ((i 0))
1105 @end group
1106
1107 @group
1108 ;; @r{Set syntax of chars up to @samp{0} to say they are}
1109 ;; @r{part of symbol names but not words.}
1110 ;; @r{(The digit @samp{0} is @code{48} in the @acronym{ASCII} character set.)}
1111 (while (< i ?0)
1112 (modify-syntax-entry i "_ " table)
1113 (setq i (1+ i)))
1114 ;; @r{@dots{} similar code follows for other character ranges.}
1115 @end group
1116 @group
1117 ;; @r{Then set the syntax codes for characters that are special in Lisp.}
1118 (modify-syntax-entry ? " " table)
1119 (modify-syntax-entry ?\t " " table)
1120 (modify-syntax-entry ?\f " " table)
1121 (modify-syntax-entry ?\n "> " table)
1122 @end group
1123 @group
1124 ;; @r{Give CR the same syntax as newline, for selective-display.}
1125 (modify-syntax-entry ?\^m "> " table)
1126 (modify-syntax-entry ?\; "< " table)
1127 (modify-syntax-entry ?` "' " table)
1128 (modify-syntax-entry ?' "' " table)
1129 (modify-syntax-entry ?, "' " table)
1130 @end group
1131 @group
1132 ;; @r{@dots{}likewise for many other characters@dots{}}
1133 (modify-syntax-entry ?\( "() " table)
1134 (modify-syntax-entry ?\) ")( " table)
1135 (modify-syntax-entry ?\[ "(] " table)
1136 (modify-syntax-entry ?\] ")[ " table))
1137 table))
1138 @end group
1139 @group
1140 ;; @r{Create an abbrev table for lisp-mode.}
1141 (define-abbrev-table 'lisp-mode-abbrev-table ())
1142 @end group
1143 @end smallexample
1144
1145 The three modes for Lisp share much of their code. For instance,
1146 each calls the following function to set various variables:
1147
1148 @smallexample
1149 @group
1150 (defun lisp-mode-variables (lisp-syntax)
1151 (when lisp-syntax
1152 (set-syntax-table lisp-mode-syntax-table))
1153 (setq local-abbrev-table lisp-mode-abbrev-table)
1154 @dots{}
1155 @end group
1156 @end smallexample
1157
1158 In Lisp and most programming languages, we want the paragraph
1159 commands to treat only blank lines as paragraph separators. And the
1160 modes should understand the Lisp conventions for comments. The rest of
1161 @code{lisp-mode-variables} sets this up:
1162
1163 @smallexample
1164 @group
1165 (set (make-local-variable 'paragraph-start)
1166 (concat page-delimiter "\\|$" ))
1167 (set (make-local-variable 'paragraph-separate)
1168 paragraph-start)
1169 @dots{}
1170 @end group
1171 @group
1172 (set (make-local-variable 'comment-indent-function)
1173 'lisp-comment-indent))
1174 @dots{}
1175 @end group
1176 @end smallexample
1177
1178 Each of the different Lisp modes has a slightly different keymap. For
1179 example, Lisp mode binds @kbd{C-c C-z} to @code{run-lisp}, but the other
1180 Lisp modes do not. However, all Lisp modes have some commands in
1181 common. The following code sets up the common commands:
1182
1183 @smallexample
1184 @group
1185 (defvar shared-lisp-mode-map
1186 (let ((map (make-sparse-keymap)))
1187 (define-key shared-lisp-mode-map "\e\C-q" 'indent-sexp)
1188 (define-key shared-lisp-mode-map "\177"
1189 'backward-delete-char-untabify)
1190 map)
1191 "Keymap for commands shared by all sorts of Lisp modes.")
1192 @end group
1193 @end smallexample
1194
1195 @noindent
1196 And here is the code to set up the keymap for Lisp mode:
1197
1198 @smallexample
1199 @group
1200 (defvar lisp-mode-map
1201 (let ((map (make-sparse-keymap)))
1202 (set-keymap-parent map shared-lisp-mode-map)
1203 (define-key map "\e\C-x" 'lisp-eval-defun)
1204 (define-key map "\C-c\C-z" 'run-lisp)
1205 map)
1206 "Keymap for ordinary Lisp mode...")
1207 @end group
1208 @end smallexample
1209
1210 Finally, here is the complete major mode command definition for Lisp
1211 mode.
1212
1213 @smallexample
1214 @group
1215 (defun lisp-mode ()
1216 "Major mode for editing Lisp code for Lisps other than GNU Emacs Lisp.
1217 Commands:
1218 Delete converts tabs to spaces as it moves back.
1219 Blank lines separate paragraphs. Semicolons start comments.
1220 \\@{lisp-mode-map@}
1221 Note that `run-lisp' may be used either to start an inferior Lisp job
1222 or to switch back to an existing one.
1223 @end group
1224
1225 @group
1226 Entry to this mode calls the value of `lisp-mode-hook'
1227 if that value is non-nil."
1228 (interactive)
1229 (kill-all-local-variables)
1230 @end group
1231 @group
1232 (use-local-map lisp-mode-map) ; @r{Select the mode's keymap.}
1233 (setq major-mode 'lisp-mode) ; @r{This is how @code{describe-mode}}
1234 ; @r{finds out what to describe.}
1235 (setq mode-name "Lisp") ; @r{This goes into the mode line.}
1236 (lisp-mode-variables t) ; @r{This defines various variables.}
1237 (set (make-local-variable 'comment-start-skip)
1238 "\\(\\(^\\|[^\\\\\n]\\)\\(\\\\\\\\\\)*\\)\\(;+\\|#|\\) *")
1239 (set (make-local-variable 'font-lock-keywords-case-fold-search) t)
1240 @end group
1241 @group
1242 (setq imenu-case-fold-search t)
1243 (set-syntax-table lisp-mode-syntax-table)
1244 (run-mode-hooks 'lisp-mode-hook)) ; @r{This permits the user to use a}
1245 ; @r{hook to customize the mode.}
1246 @end group
1247 @end smallexample
1248
1249 @node Minor Modes
1250 @section Minor Modes
1251 @cindex minor mode
1252
1253 A @dfn{minor mode} provides features that users may enable or disable
1254 independently of the choice of major mode. Minor modes can be enabled
1255 individually or in combination. Minor modes would be better named
1256 ``generally available, optional feature modes,'' except that such a name
1257 would be unwieldy.
1258
1259 A minor mode is not usually meant as a variation of a single major mode.
1260 Usually they are general and can apply to many major modes. For
1261 example, Auto Fill mode works with any major mode that permits text
1262 insertion. To be general, a minor mode must be effectively independent
1263 of the things major modes do.
1264
1265 A minor mode is often much more difficult to implement than a major
1266 mode. One reason is that you should be able to activate and deactivate
1267 minor modes in any order. A minor mode should be able to have its
1268 desired effect regardless of the major mode and regardless of the other
1269 minor modes in effect.
1270
1271 Often the biggest problem in implementing a minor mode is finding a
1272 way to insert the necessary hook into the rest of Emacs. Minor mode
1273 keymaps make this easier than it used to be.
1274
1275 @defvar minor-mode-list
1276 The value of this variable is a list of all minor mode commands.
1277 @end defvar
1278
1279 @menu
1280 * Minor Mode Conventions:: Tips for writing a minor mode.
1281 * Keymaps and Minor Modes:: How a minor mode can have its own keymap.
1282 * Defining Minor Modes:: A convenient facility for defining minor modes.
1283 @end menu
1284
1285 @node Minor Mode Conventions
1286 @subsection Conventions for Writing Minor Modes
1287 @cindex minor mode conventions
1288 @cindex conventions for writing minor modes
1289
1290 There are conventions for writing minor modes just as there are for
1291 major modes. Several of the major mode conventions apply to minor
1292 modes as well: those regarding the name of the mode initialization
1293 function, the names of global symbols, the use of a hook at the end of
1294 the initialization function, and the use of keymaps and other tables.
1295
1296 In addition, there are several conventions that are specific to
1297 minor modes. (The easiest way to follow all the conventions is to use
1298 the macro @code{define-minor-mode}; @ref{Defining Minor Modes}.)
1299
1300 @itemize @bullet
1301 @item
1302 @cindex mode variable
1303 Make a variable whose name ends in @samp{-mode} to control the minor
1304 mode. We call this the @dfn{mode variable}. The minor mode command
1305 should set this variable (@code{nil} to disable; anything else to
1306 enable).
1307
1308 If possible, implement the mode so that setting the variable
1309 automatically enables or disables the mode. Then the minor mode command
1310 does not need to do anything except set the variable.
1311
1312 This variable is used in conjunction with the @code{minor-mode-alist} to
1313 display the minor mode name in the mode line. It can also enable
1314 or disable a minor mode keymap. Individual commands or hooks can also
1315 check the variable's value.
1316
1317 If you want the minor mode to be enabled separately in each buffer,
1318 make the variable buffer-local.
1319
1320 @item
1321 Define a command whose name is the same as the mode variable.
1322 Its job is to enable and disable the mode by setting the variable.
1323
1324 The command should accept one optional argument. If the argument is
1325 @code{nil}, it should toggle the mode (turn it on if it is off, and
1326 off if it is on). It should turn the mode on if the argument is a
1327 positive integer, the symbol @code{t}, or a list whose @sc{car} is one
1328 of those. It should turn the mode off if the argument is a negative
1329 integer or zero, the symbol @code{-}, or a list whose @sc{car} is a
1330 negative integer or zero. The meaning of other arguments is not
1331 specified.
1332
1333 Here is an example taken from the definition of @code{transient-mark-mode}.
1334 It shows the use of @code{transient-mark-mode} as a variable that enables or
1335 disables the mode's behavior, and also shows the proper way to toggle,
1336 enable or disable the minor mode based on the raw prefix argument value.
1337
1338 @smallexample
1339 @group
1340 (setq transient-mark-mode
1341 (if (null arg) (not transient-mark-mode)
1342 (> (prefix-numeric-value arg) 0)))
1343 @end group
1344 @end smallexample
1345
1346 @item
1347 Add an element to @code{minor-mode-alist} for each minor mode
1348 (@pxref{Definition of minor-mode-alist}), if you want to indicate the
1349 minor mode in the mode line. This element should be a list of the
1350 following form:
1351
1352 @smallexample
1353 (@var{mode-variable} @var{string})
1354 @end smallexample
1355
1356 Here @var{mode-variable} is the variable that controls enabling of the
1357 minor mode, and @var{string} is a short string, starting with a space,
1358 to represent the mode in the mode line. These strings must be short so
1359 that there is room for several of them at once.
1360
1361 When you add an element to @code{minor-mode-alist}, use @code{assq} to
1362 check for an existing element, to avoid duplication. For example:
1363
1364 @smallexample
1365 @group
1366 (unless (assq 'leif-mode minor-mode-alist)
1367 (setq minor-mode-alist
1368 (cons '(leif-mode " Leif") minor-mode-alist)))
1369 @end group
1370 @end smallexample
1371
1372 @noindent
1373 or like this, using @code{add-to-list} (@pxref{List Variables}):
1374
1375 @smallexample
1376 @group
1377 (add-to-list 'minor-mode-alist '(leif-mode " Leif"))
1378 @end group
1379 @end smallexample
1380 @end itemize
1381
1382 Global minor modes distributed with Emacs should if possible support
1383 enabling and disabling via Custom (@pxref{Customization}). To do this,
1384 the first step is to define the mode variable with @code{defcustom}, and
1385 specify @code{:type 'boolean}.
1386
1387 If just setting the variable is not sufficient to enable the mode, you
1388 should also specify a @code{:set} method which enables the mode by
1389 invoking the mode command. Note in the variable's documentation string that
1390 setting the variable other than via Custom may not take effect.
1391
1392 Also mark the definition with an autoload cookie (@pxref{autoload cookie}),
1393 and specify a @code{:require} so that customizing the variable will load
1394 the library that defines the mode. This will copy suitable definitions
1395 into @file{loaddefs.el} so that users can use @code{customize-option} to
1396 enable the mode. For example:
1397
1398 @smallexample
1399 @group
1400
1401 ;;;###autoload
1402 (defcustom msb-mode nil
1403 "Toggle msb-mode.
1404 Setting this variable directly does not take effect;
1405 use either \\[customize] or the function `msb-mode'."
1406 :set 'custom-set-minor-mode
1407 :initialize 'custom-initialize-default
1408 :version "20.4"
1409 :type 'boolean
1410 :group 'msb
1411 :require 'msb)
1412 @end group
1413 @end smallexample
1414
1415 @node Keymaps and Minor Modes
1416 @subsection Keymaps and Minor Modes
1417
1418 Each minor mode can have its own keymap, which is active when the mode
1419 is enabled. To set up a keymap for a minor mode, add an element to the
1420 alist @code{minor-mode-map-alist}. @xref{Definition of minor-mode-map-alist}.
1421
1422 @cindex @code{self-insert-command}, minor modes
1423 One use of minor mode keymaps is to modify the behavior of certain
1424 self-inserting characters so that they do something else as well as
1425 self-insert. In general, this is the only way to do that, since the
1426 facilities for customizing @code{self-insert-command} are limited to
1427 special cases (designed for abbrevs and Auto Fill mode). (Do not try
1428 substituting your own definition of @code{self-insert-command} for the
1429 standard one. The editor command loop handles this function specially.)
1430
1431 The key sequences bound in a minor mode should consist of @kbd{C-c}
1432 followed by one of @kbd{.,/?`'"[]\|~!#$%^&*()-_+=}. (The other
1433 punctuation characters are reserved for major modes.)
1434
1435 @node Defining Minor Modes
1436 @subsection Defining Minor Modes
1437
1438 The macro @code{define-minor-mode} offers a convenient way of
1439 implementing a mode in one self-contained definition.
1440
1441 @defmac define-minor-mode mode doc [init-value [lighter [keymap]]] keyword-args@dots{} body@dots{}
1442 This macro defines a new minor mode whose name is @var{mode} (a
1443 symbol). It defines a command named @var{mode} to toggle the minor
1444 mode, with @var{doc} as its documentation string. It also defines a
1445 variable named @var{mode}, which is set to @code{t} or @code{nil} by
1446 enabling or disabling the mode. The variable is initialized to
1447 @var{init-value}. Except in unusual circumstances (see below), this
1448 value must be @code{nil}.
1449
1450 The string @var{lighter} says what to display in the mode line
1451 when the mode is enabled; if it is @code{nil}, the mode is not displayed
1452 in the mode line.
1453
1454 The optional argument @var{keymap} specifies the keymap for the minor
1455 mode. If non-@code{nil}, it should be a variable name (whose value is
1456 a keymap), a keymap, or an alist of the form
1457
1458 @example
1459 (@var{key-sequence} . @var{definition})
1460 @end example
1461
1462 @noindent
1463 where each @var{key-sequence} and @var{definition} are arguments
1464 suitable for passing to @code{define-key} (@pxref{Changing Key
1465 Bindings}). If @var{keymap} is a keymap or an alist, this also
1466 defines the variable @code{@var{mode}-map}.
1467
1468 The above three arguments @var{init-value}, @var{lighter}, and
1469 @var{keymap} can be (partially) omitted when @var{keyword-args} are
1470 used. The @var{keyword-args} consist of keywords followed by
1471 corresponding values. A few keywords have special meanings:
1472
1473 @table @code
1474 @item :group @var{group}
1475 Custom group name to use in all generated @code{defcustom} forms.
1476 Defaults to @var{mode} without the possible trailing @samp{-mode}.
1477 @strong{Warning:} don't use this default group name unless you have
1478 written a @code{defgroup} to define that group properly. @xref{Group
1479 Definitions}.
1480
1481 @item :global @var{global}
1482 If non-@code{nil}, this specifies that the minor mode should be global
1483 rather than buffer-local. It defaults to @code{nil}.
1484
1485 One of the effects of making a minor mode global is that the
1486 @var{mode} variable becomes a customization variable. Toggling it
1487 through the Custom interface turns the mode on and off, and its value
1488 can be saved for future Emacs sessions (@pxref{Saving
1489 Customizations,,, emacs, The GNU Emacs Manual}. For the saved
1490 variable to work, you should ensure that the @code{define-minor-mode}
1491 form is evaluated each time Emacs starts; for packages that are not
1492 part of Emacs, the easiest way to do this is to specify a
1493 @code{:require} keyword.
1494
1495 @item :init-value @var{init-value}
1496 This is equivalent to specifying @var{init-value} positionally.
1497
1498 @item :lighter @var{lighter}
1499 This is equivalent to specifying @var{lighter} positionally.
1500
1501 @item :keymap @var{keymap}
1502 This is equivalent to specifying @var{keymap} positionally.
1503 @end table
1504
1505 Any other keyword arguments are passed directly to the
1506 @code{defcustom} generated for the variable @var{mode}.
1507
1508 The command named @var{mode} first performs the standard actions such
1509 as setting the variable named @var{mode} and then executes the
1510 @var{body} forms, if any. It finishes by running the mode hook
1511 variable @code{@var{mode}-hook}.
1512 @end defmac
1513
1514 The initial value must be @code{nil} except in cases where (1) the
1515 mode is preloaded in Emacs, or (2) it is painless for loading to
1516 enable the mode even though the user did not request it. For
1517 instance, if the mode has no effect unless something else is enabled,
1518 and will always be loaded by that time, enabling it by default is
1519 harmless. But these are unusual circumstances. Normally, the
1520 initial value must be @code{nil}.
1521
1522 @findex easy-mmode-define-minor-mode
1523 The name @code{easy-mmode-define-minor-mode} is an alias
1524 for this macro.
1525
1526 Here is an example of using @code{define-minor-mode}:
1527
1528 @smallexample
1529 (define-minor-mode hungry-mode
1530 "Toggle Hungry mode.
1531 With no argument, this command toggles the mode.
1532 Non-null prefix argument turns on the mode.
1533 Null prefix argument turns off the mode.
1534
1535 When Hungry mode is enabled, the control delete key
1536 gobbles all preceding whitespace except the last.
1537 See the command \\[hungry-electric-delete]."
1538 ;; The initial value.
1539 nil
1540 ;; The indicator for the mode line.
1541 " Hungry"
1542 ;; The minor mode bindings.
1543 '(([C-backspace] . hungry-electric-delete))
1544 :group 'hunger)
1545 @end smallexample
1546
1547 @noindent
1548 This defines a minor mode named ``Hungry mode,'' a command named
1549 @code{hungry-mode} to toggle it, a variable named @code{hungry-mode}
1550 which indicates whether the mode is enabled, and a variable named
1551 @code{hungry-mode-map} which holds the keymap that is active when the
1552 mode is enabled. It initializes the keymap with a key binding for
1553 @kbd{C-@key{DEL}}. It puts the variable @code{hungry-mode} into
1554 custom group @code{hunger}. There are no @var{body} forms---many
1555 minor modes don't need any.
1556
1557 Here's an equivalent way to write it:
1558
1559 @smallexample
1560 (define-minor-mode hungry-mode
1561 "Toggle Hungry mode.
1562 With no argument, this command toggles the mode.
1563 Non-null prefix argument turns on the mode.
1564 Null prefix argument turns off the mode.
1565
1566 When Hungry mode is enabled, the control delete key
1567 gobbles all preceding whitespace except the last.
1568 See the command \\[hungry-electric-delete]."
1569 ;; The initial value.
1570 :init-value nil
1571 ;; The indicator for the mode line.
1572 :lighter " Hungry"
1573 ;; The minor mode bindings.
1574 :keymap
1575 '(([C-backspace] . hungry-electric-delete)
1576 ([C-M-backspace]
1577 . (lambda ()
1578 (interactive)
1579 (hungry-electric-delete t))))
1580 :group 'hunger)
1581 @end smallexample
1582
1583 @defmac define-globalized-minor-mode global-mode mode turn-on keyword-args@dots{}
1584 This defines a global toggle named @var{global-mode} whose meaning is
1585 to enable or disable the buffer-local minor mode @var{mode} in all
1586 buffers. To turn on the minor mode in a buffer, it uses the function
1587 @var{turn-on}; to turn off the minor mode, it calls @code{mode} with
1588 @minus{}1 as argument.
1589
1590 Globally enabling the mode also affects buffers subsequently created
1591 by visiting files, and buffers that use a major mode other than
1592 Fundamental mode; but it does not detect the creation of a new buffer
1593 in Fundamental mode.
1594
1595 This defines the customization option @var{global-mode} (@pxref{Customization}),
1596 which can be toggled in the Custom interface to turn the minor mode on
1597 and off. As with @code{define-minor-mode}, you should ensure that the
1598 @code{define-globalized-minor-mode} form is evaluated each time Emacs
1599 starts, for example by providing a @code{:require} keyword.
1600
1601 Use @code{:group @var{group}} in @var{keyword-args} to specify the
1602 custom group for the mode variable of the global minor mode.
1603 @end defmac
1604
1605 @node Mode Line Format
1606 @section Mode-Line Format
1607 @cindex mode line
1608
1609 Each Emacs window (aside from minibuffer windows) typically has a mode
1610 line at the bottom, which displays status information about the buffer
1611 displayed in the window. The mode line contains information about the
1612 buffer, such as its name, associated file, depth of recursive editing,
1613 and major and minor modes. A window can also have a @dfn{header
1614 line}, which is much like the mode line but appears at the top of the
1615 window.
1616
1617 This section describes how to control the contents of the mode line
1618 and header line. We include it in this chapter because much of the
1619 information displayed in the mode line relates to the enabled major and
1620 minor modes.
1621
1622 @menu
1623 * Base: Mode Line Basics. Basic ideas of mode line control.
1624 * Data: Mode Line Data. The data structure that controls the mode line.
1625 * Top: Mode Line Top. The top level variable, mode-line-format.
1626 * Mode Line Variables:: Variables used in that data structure.
1627 * %-Constructs:: Putting information into a mode line.
1628 * Properties in Mode:: Using text properties in the mode line.
1629 * Header Lines:: Like a mode line, but at the top.
1630 * Emulating Mode Line:: Formatting text as the mode line would.
1631 @end menu
1632
1633 @node Mode Line Basics
1634 @subsection Mode Line Basics
1635
1636 @code{mode-line-format} is a buffer-local variable that holds a
1637 @dfn{mode line construct}, a kind of template, which controls what is
1638 displayed on the mode line of the current buffer. The value of
1639 @code{header-line-format} specifies the buffer's header line in the
1640 same way. All windows for the same buffer use the same
1641 @code{mode-line-format} and @code{header-line-format}.
1642
1643 For efficiency, Emacs does not continuously recompute the mode
1644 line and header line of a window. It does so when circumstances
1645 appear to call for it---for instance, if you change the window
1646 configuration, switch buffers, narrow or widen the buffer, scroll, or
1647 change the buffer's modification status. If you modify any of the
1648 variables referenced by @code{mode-line-format} (@pxref{Mode Line
1649 Variables}), or any other variables and data structures that affect
1650 how text is displayed (@pxref{Display}), you may want to force an
1651 update of the mode line so as to display the new information or
1652 display it in the new way.
1653
1654 @defun force-mode-line-update &optional all
1655 Force redisplay of the current buffer's mode line and header line.
1656 The next redisplay will update the mode line and header line based on
1657 the latest values of all relevant variables. With optional
1658 non-@code{nil} @var{all}, force redisplay of all mode lines and header
1659 lines.
1660
1661 This function also forces recomputation of the menu bar menus
1662 and the frame title.
1663 @end defun
1664
1665 The selected window's mode line is usually displayed in a different
1666 color using the face @code{mode-line}. Other windows' mode lines
1667 appear in the face @code{mode-line-inactive} instead. @xref{Faces}.
1668
1669 @node Mode Line Data
1670 @subsection The Data Structure of the Mode Line
1671 @cindex mode-line construct
1672
1673 The mode-line contents are controlled by a data structure called a
1674 @dfn{mode-line construct}, made up of lists, strings, symbols, and
1675 numbers kept in buffer-local variables. Each data type has a specific
1676 meaning for the mode-line appearance, as described below. The same
1677 data structure is used for constructing frame titles (@pxref{Frame
1678 Titles}) and header lines (@pxref{Header Lines}).
1679
1680 A mode-line construct may be as simple as a fixed string of text,
1681 but it usually specifies how to combine fixed strings with variables'
1682 values to construct the text. Many of these variables are themselves
1683 defined to have mode-line constructs as their values.
1684
1685 Here are the meanings of various data types as mode-line constructs:
1686
1687 @table @code
1688 @cindex percent symbol in mode line
1689 @item @var{string}
1690 A string as a mode-line construct appears verbatim except for
1691 @dfn{@code{%}-constructs} in it. These stand for substitution of
1692 other data; see @ref{%-Constructs}.
1693
1694 If parts of the string have @code{face} properties, they control
1695 display of the text just as they would text in the buffer. Any
1696 characters which have no @code{face} properties are displayed, by
1697 default, in the face @code{mode-line} or @code{mode-line-inactive}
1698 (@pxref{Standard Faces,,, emacs, The GNU Emacs Manual}). The
1699 @code{help-echo} and @code{local-map} properties in @var{string} have
1700 special meanings. @xref{Properties in Mode}.
1701
1702 @item @var{symbol}
1703 A symbol as a mode-line construct stands for its value. The value of
1704 @var{symbol} is used as a mode-line construct, in place of @var{symbol}.
1705 However, the symbols @code{t} and @code{nil} are ignored, as is any
1706 symbol whose value is void.
1707
1708 There is one exception: if the value of @var{symbol} is a string, it is
1709 displayed verbatim: the @code{%}-constructs are not recognized.
1710
1711 Unless @var{symbol} is marked as ``risky'' (i.e., it has a
1712 non-@code{nil} @code{risky-local-variable} property), all text
1713 properties specified in @var{symbol}'s value are ignored. This
1714 includes the text properties of strings in @var{symbol}'s value, as
1715 well as all @code{:eval} and @code{:propertize} forms in it. (The
1716 reason for this is security: non-risky variables could be set
1717 automatically from file variables without prompting the user.)
1718
1719 @item (@var{string} @var{rest}@dots{})
1720 @itemx (@var{list} @var{rest}@dots{})
1721 A list whose first element is a string or list means to process all the
1722 elements recursively and concatenate the results. This is the most
1723 common form of mode-line construct.
1724
1725 @item (:eval @var{form})
1726 A list whose first element is the symbol @code{:eval} says to evaluate
1727 @var{form}, and use the result as a string to display. Make sure this
1728 evaluation cannot load any files, as doing so could cause infinite
1729 recursion.
1730
1731 @item (:propertize @var{elt} @var{props}@dots{})
1732 A list whose first element is the symbol @code{:propertize} says to
1733 process the mode-line construct @var{elt} recursively, then add the text
1734 properties specified by @var{props} to the result. The argument
1735 @var{props} should consist of zero or more pairs @var{text-property}
1736 @var{value}. (This feature is new as of Emacs 22.1.)
1737
1738 @item (@var{symbol} @var{then} @var{else})
1739 A list whose first element is a symbol that is not a keyword specifies
1740 a conditional. Its meaning depends on the value of @var{symbol}. If
1741 @var{symbol} has a non-@code{nil} value, the second element,
1742 @var{then}, is processed recursively as a mode-line element.
1743 Otherwise, the third element, @var{else}, is processed recursively.
1744 You may omit @var{else}; then the mode-line element displays nothing
1745 if the value of @var{symbol} is @code{nil} or void.
1746
1747 @item (@var{width} @var{rest}@dots{})
1748 A list whose first element is an integer specifies truncation or
1749 padding of the results of @var{rest}. The remaining elements
1750 @var{rest} are processed recursively as mode-line constructs and
1751 concatenated together. When @var{width} is positive, the result is
1752 space filled on the right if its width is less than @var{width}. When
1753 @var{width} is negative, the result is truncated on the right to
1754 @minus{}@var{width} columns if its width exceeds @minus{}@var{width}.
1755
1756 For example, the usual way to show what percentage of a buffer is above
1757 the top of the window is to use a list like this: @code{(-3 "%p")}.
1758 @end table
1759
1760 @node Mode Line Top
1761 @subsection The Top Level of Mode Line Control
1762
1763 The variable in overall control of the mode line is
1764 @code{mode-line-format}.
1765
1766 @defopt mode-line-format
1767 The value of this variable is a mode-line construct that controls the
1768 contents of the mode-line. It is always buffer-local in all buffers.
1769
1770 If you set this variable to @code{nil} in a buffer, that buffer does
1771 not have a mode line. (A window that is just one line tall never
1772 displays a mode line.)
1773 @end defopt
1774
1775 The default value of @code{mode-line-format} is designed to use the
1776 values of other variables such as @code{mode-line-position} and
1777 @code{mode-line-modes} (which in turn incorporates the values of the
1778 variables @code{mode-name} and @code{minor-mode-alist}). Very few
1779 modes need to alter @code{mode-line-format} itself. For most
1780 purposes, it is sufficient to alter some of the variables that
1781 @code{mode-line-format} either directly or indirectly refers to.
1782
1783 If you do alter @code{mode-line-format} itself, the new value should
1784 use the same variables that appear in the default value (@pxref{Mode
1785 Line Variables}), rather than duplicating their contents or displaying
1786 the information in another fashion. This way, customizations made by
1787 the user or by Lisp programs (such as @code{display-time} and major
1788 modes) via changes to those variables remain effective.
1789
1790 Here is an example of a @code{mode-line-format} that might be
1791 useful for @code{shell-mode}, since it contains the host name and default
1792 directory.
1793
1794 @example
1795 @group
1796 (setq mode-line-format
1797 (list "-"
1798 'mode-line-mule-info
1799 'mode-line-modified
1800 'mode-line-frame-identification
1801 "%b--"
1802 @end group
1803 @group
1804 ;; @r{Note that this is evaluated while making the list.}
1805 ;; @r{It makes a mode-line construct which is just a string.}
1806 (getenv "HOST")
1807 @end group
1808 ":"
1809 'default-directory
1810 " "
1811 'global-mode-string
1812 " %[("
1813 '(:eval (mode-line-mode-name))
1814 'mode-line-process
1815 'minor-mode-alist
1816 "%n"
1817 ")%]--"
1818 @group
1819 '(which-func-mode ("" which-func-format "--"))
1820 '(line-number-mode "L%l--")
1821 '(column-number-mode "C%c--")
1822 '(-3 "%p")
1823 "-%-"))
1824 @end group
1825 @end example
1826
1827 @noindent
1828 (The variables @code{line-number-mode}, @code{column-number-mode}
1829 and @code{which-func-mode} enable particular minor modes; as usual,
1830 these variable names are also the minor mode command names.)
1831
1832 @node Mode Line Variables
1833 @subsection Variables Used in the Mode Line
1834
1835 This section describes variables incorporated by the standard value
1836 of @code{mode-line-format} into the text of the mode line. There is
1837 nothing inherently special about these variables; any other variables
1838 could have the same effects on the mode line if
1839 @code{mode-line-format}'s value were changed to use them. However,
1840 various parts of Emacs set these variables on the understanding that
1841 they will control parts of the mode line; therefore, practically
1842 speaking, it is essential for the mode line to use them.
1843
1844 @defvar mode-line-mule-info
1845 This variable holds the value of the mode-line construct that displays
1846 information about the language environment, buffer coding system, and
1847 current input method. @xref{Non-ASCII Characters}.
1848 @end defvar
1849
1850 @defvar mode-line-modified
1851 This variable holds the value of the mode-line construct that displays
1852 whether the current buffer is modified. Its default value displays
1853 @samp{**} if the buffer is modified, @samp{--} if the buffer is not
1854 modified, @samp{%%} if the buffer is read only, and @samp{%*} if the
1855 buffer is read only and modified.
1856
1857 Changing this variable does not force an update of the mode line.
1858 @end defvar
1859
1860 @defvar mode-line-frame-identification
1861 This variable identifies the current frame. Its default value
1862 displays @code{" "} if you are using a window system which can show
1863 multiple frames, or @code{"-%F "} on an ordinary terminal which shows
1864 only one frame at a time.
1865 @end defvar
1866
1867 @defvar mode-line-buffer-identification
1868 This variable identifies the buffer being displayed in the window.
1869 Its default value displays the buffer name, padded with spaces to at
1870 least 12 columns.
1871 @end defvar
1872
1873 @defopt mode-line-position
1874 This variable indicates the position in the buffer. Its default value
1875 displays the buffer percentage and, optionally, the buffer size, the
1876 line number and the column number.
1877 @end defopt
1878
1879 @defvar vc-mode
1880 The variable @code{vc-mode}, buffer-local in each buffer, records
1881 whether the buffer's visited file is maintained with version control,
1882 and, if so, which kind. Its value is a string that appears in the mode
1883 line, or @code{nil} for no version control.
1884 @end defvar
1885
1886 @defopt mode-line-modes
1887 This variable displays the buffer's major and minor modes. Its
1888 default value also displays the recursive editing level, information
1889 on the process status, and whether narrowing is in effect.
1890 @end defopt
1891
1892 @defopt mode-line-remote
1893 This variable is used to show whether @code{default-directory} for the
1894 current buffer is remote.
1895 @end defopt
1896
1897 @defopt mode-line-client
1898 This variable is used to identify @code{emacsclient} frames.
1899 @end defopt
1900
1901 The following three variables are used in @code{mode-line-modes}:
1902
1903 @defvar mode-name
1904 This buffer-local variable holds the ``pretty'' name of the current
1905 buffer's major mode. Each major mode should set this variable so that
1906 the mode name will appear in the mode line. The value does not have
1907 to be a string, but can use any of the data types valid in a mode-line
1908 construct (@pxref{Mode Line Data}). To compute the string that will
1909 identify the mode name in the mode line, use @code{format-mode-line}
1910 (@pxref{Emulating Mode Line}).
1911 @end defvar
1912
1913 @defvar mode-line-process
1914 This buffer-local variable contains the mode-line information on process
1915 status in modes used for communicating with subprocesses. It is
1916 displayed immediately following the major mode name, with no intervening
1917 space. For example, its value in the @samp{*shell*} buffer is
1918 @code{(":%s")}, which allows the shell to display its status along
1919 with the major mode as: @samp{(Shell:run)}. Normally this variable
1920 is @code{nil}.
1921 @end defvar
1922
1923 @defvar minor-mode-alist
1924 @anchor{Definition of minor-mode-alist}
1925 This variable holds an association list whose elements specify how the
1926 mode line should indicate that a minor mode is active. Each element of
1927 the @code{minor-mode-alist} should be a two-element list:
1928
1929 @example
1930 (@var{minor-mode-variable} @var{mode-line-string})
1931 @end example
1932
1933 More generally, @var{mode-line-string} can be any mode-line spec. It
1934 appears in the mode line when the value of @var{minor-mode-variable}
1935 is non-@code{nil}, and not otherwise. These strings should begin with
1936 spaces so that they don't run together. Conventionally, the
1937 @var{minor-mode-variable} for a specific mode is set to a
1938 non-@code{nil} value when that minor mode is activated.
1939
1940 @code{minor-mode-alist} itself is not buffer-local. Each variable
1941 mentioned in the alist should be buffer-local if its minor mode can be
1942 enabled separately in each buffer.
1943 @end defvar
1944
1945 @defvar global-mode-string
1946 This variable holds a mode-line spec that, by default, appears in the
1947 mode line just after the @code{which-func-mode} minor mode if set,
1948 else after @code{mode-line-modes}. The command @code{display-time}
1949 sets @code{global-mode-string} to refer to the variable
1950 @code{display-time-string}, which holds a string containing the time
1951 and load information.
1952
1953 The @samp{%M} construct substitutes the value of
1954 @code{global-mode-string}, but that is obsolete, since the variable is
1955 included in the mode line from @code{mode-line-format}.
1956 @end defvar
1957
1958 Here is a simplified version of the default value of
1959 @code{mode-line-format}. The real default value also
1960 specifies addition of text properties.
1961
1962 @example
1963 @group
1964 ("-"
1965 mode-line-mule-info
1966 mode-line-modified
1967 mode-line-frame-identification
1968 mode-line-buffer-identification
1969 @end group
1970 " "
1971 mode-line-position
1972 (vc-mode vc-mode)
1973 " "
1974 @group
1975 mode-line-modes
1976 (which-func-mode ("" which-func-format "--"))
1977 (global-mode-string ("--" global-mode-string))
1978 "-%-")
1979 @end group
1980 @end example
1981
1982 @node %-Constructs
1983 @subsection @code{%}-Constructs in the Mode Line
1984
1985 Strings used as mode-line constructs can use certain
1986 @code{%}-constructs to substitute various kinds of data. Here is a
1987 list of the defined @code{%}-constructs, and what they mean. In any
1988 construct except @samp{%%}, you can add a decimal integer after the
1989 @samp{%} to specify a minimum field width. If the width is less, the
1990 field is padded with spaces to the right.
1991
1992 @table @code
1993 @item %b
1994 The current buffer name, obtained with the @code{buffer-name} function.
1995 @xref{Buffer Names}.
1996
1997 @item %c
1998 The current column number of point.
1999
2000 @item %e
2001 When Emacs is nearly out of memory for Lisp objects, a brief message
2002 saying so. Otherwise, this is empty.
2003
2004 @item %f
2005 The visited file name, obtained with the @code{buffer-file-name}
2006 function. @xref{Buffer File Name}.
2007
2008 @item %F
2009 The title (only on a window system) or the name of the selected frame.
2010 @xref{Basic Parameters}.
2011
2012 @item %i
2013 The size of the accessible part of the current buffer; basically
2014 @code{(- (point-max) (point-min))}.
2015
2016 @item %I
2017 Like @samp{%i}, but the size is printed in a more readable way by using
2018 @samp{k} for 10^3, @samp{M} for 10^6, @samp{G} for 10^9, etc., to
2019 abbreviate.
2020
2021 @item %l
2022 The current line number of point, counting within the accessible portion
2023 of the buffer.
2024
2025 @item %n
2026 @samp{Narrow} when narrowing is in effect; nothing otherwise (see
2027 @code{narrow-to-region} in @ref{Narrowing}).
2028
2029 @item %p
2030 The percentage of the buffer text above the @strong{top} of window, or
2031 @samp{Top}, @samp{Bottom} or @samp{All}. Note that the default
2032 mode-line specification truncates this to three characters.
2033
2034 @item %P
2035 The percentage of the buffer text that is above the @strong{bottom} of
2036 the window (which includes the text visible in the window, as well as
2037 the text above the top), plus @samp{Top} if the top of the buffer is
2038 visible on screen; or @samp{Bottom} or @samp{All}.
2039
2040 @item %s
2041 The status of the subprocess belonging to the current buffer, obtained with
2042 @code{process-status}. @xref{Process Information}.
2043
2044 @item %t
2045 Whether the visited file is a text file or a binary file. This is a
2046 meaningful distinction only on certain operating systems (@pxref{MS-DOS
2047 File Types}).
2048
2049 @item %z
2050 The mnemonics of keyboard, terminal, and buffer coding systems.
2051
2052 @item %Z
2053 Like @samp{%z}, but including the end-of-line format.
2054
2055 @item %*
2056 @samp{%} if the buffer is read only (see @code{buffer-read-only}); @*
2057 @samp{*} if the buffer is modified (see @code{buffer-modified-p}); @*
2058 @samp{-} otherwise. @xref{Buffer Modification}.
2059
2060 @item %+
2061 @samp{*} if the buffer is modified (see @code{buffer-modified-p}); @*
2062 @samp{%} if the buffer is read only (see @code{buffer-read-only}); @*
2063 @samp{-} otherwise. This differs from @samp{%*} only for a modified
2064 read-only buffer. @xref{Buffer Modification}.
2065
2066 @item %&
2067 @samp{*} if the buffer is modified, and @samp{-} otherwise.
2068
2069 @item %[
2070 An indication of the depth of recursive editing levels (not counting
2071 minibuffer levels): one @samp{[} for each editing level.
2072 @xref{Recursive Editing}.
2073
2074 @item %]
2075 One @samp{]} for each recursive editing level (not counting minibuffer
2076 levels).
2077
2078 @item %-
2079 Dashes sufficient to fill the remainder of the mode line.
2080
2081 @item %%
2082 The character @samp{%}---this is how to include a literal @samp{%} in a
2083 string in which @code{%}-constructs are allowed.
2084 @end table
2085
2086 The following two @code{%}-constructs are still supported, but they are
2087 obsolete, since you can get the same results with the variables
2088 @code{mode-name} and @code{global-mode-string}.
2089
2090 @table @code
2091 @item %m
2092 The value of @code{mode-name}.
2093
2094 @item %M
2095 The value of @code{global-mode-string}.
2096 @end table
2097
2098 @node Properties in Mode
2099 @subsection Properties in the Mode Line
2100 @cindex text properties in the mode line
2101
2102 Certain text properties are meaningful in the
2103 mode line. The @code{face} property affects the appearance of text; the
2104 @code{help-echo} property associates help strings with the text, and
2105 @code{local-map} can make the text mouse-sensitive.
2106
2107 There are four ways to specify text properties for text in the mode
2108 line:
2109
2110 @enumerate
2111 @item
2112 Put a string with a text property directly into the mode-line data
2113 structure.
2114
2115 @item
2116 Put a text property on a mode-line %-construct such as @samp{%12b}; then
2117 the expansion of the %-construct will have that same text property.
2118
2119 @item
2120 Use a @code{(:propertize @var{elt} @var{props}@dots{})} construct to
2121 give @var{elt} a text property specified by @var{props}.
2122
2123 @item
2124 Use a list containing @code{:eval @var{form}} in the mode-line data
2125 structure, and make @var{form} evaluate to a string that has a text
2126 property.
2127 @end enumerate
2128
2129 You can use the @code{local-map} property to specify a keymap. This
2130 keymap only takes real effect for mouse clicks; binding character keys
2131 and function keys to it has no effect, since it is impossible to move
2132 point into the mode line.
2133
2134 When the mode line refers to a variable which does not have a
2135 non-@code{nil} @code{risky-local-variable} property, any text
2136 properties given or specified within that variable's values are
2137 ignored. This is because such properties could otherwise specify
2138 functions to be called, and those functions could come from file
2139 local variables.
2140
2141 @node Header Lines
2142 @subsection Window Header Lines
2143 @cindex header line (of a window)
2144 @cindex window header line
2145
2146 A window can have a @dfn{header line} at the
2147 top, just as it can have a mode line at the bottom. The header line
2148 feature works just like the mode-line feature, except that it's
2149 controlled by different variables.
2150
2151 @defvar header-line-format
2152 This variable, local in every buffer, specifies how to display the
2153 header line, for windows displaying the buffer. The format of the value
2154 is the same as for @code{mode-line-format} (@pxref{Mode Line Data}).
2155 It is normally @code{nil}, so that ordinary buffers have no header line.
2156 @end defvar
2157
2158 A window that is just one line tall never displays a header line. A
2159 window that is two lines tall cannot display both a mode line and a
2160 header line at once; if it has a mode line, then it does not display a
2161 header line.
2162
2163 @node Emulating Mode Line
2164 @subsection Emulating Mode-Line Formatting
2165
2166 You can use the function @code{format-mode-line} to compute
2167 the text that would appear in a mode line or header line
2168 based on a certain mode-line specification.
2169
2170 @defun format-mode-line format &optional face window buffer
2171 This function formats a line of text according to @var{format} as if it
2172 were generating the mode line for @var{window}, but it also returns the
2173 text as a string. The argument @var{window} defaults to the selected
2174 window. If @var{buffer} is non-@code{nil}, all the information used is
2175 taken from @var{buffer}; by default, it comes from @var{window}'s
2176 buffer.
2177
2178 The value string normally has text properties that correspond to the
2179 faces, keymaps, etc., that the mode line would have. Any character for
2180 which no @code{face} property is specified by @var{format} gets a
2181 default value determined by @var{face}. If @var{face} is @code{t}, that
2182 stands for either @code{mode-line} if @var{window} is selected,
2183 otherwise @code{mode-line-inactive}. If @var{face} is @code{nil} or
2184 omitted, that stands for the default face. If @var{face} is an integer,
2185 the value returned by this function will have no text properties.
2186
2187 You can also specify other valid faces as the value of @var{face}.
2188 If specified, that face provides the @code{face} property for characters
2189 whose face is not specified by @var{format}.
2190
2191 Note that using @code{mode-line}, @code{mode-line-inactive}, or
2192 @code{header-line} as @var{face} will actually redisplay the mode line
2193 or the header line, respectively, using the current definitions of the
2194 corresponding face, in addition to returning the formatted string.
2195 (Other faces do not cause redisplay.)
2196
2197 For example, @code{(format-mode-line header-line-format)} returns the
2198 text that would appear in the selected window's header line (@code{""}
2199 if it has no header line). @code{(format-mode-line header-line-format
2200 'header-line)} returns the same text, with each character
2201 carrying the face that it will have in the header line itself, and also
2202 redraws the header line.
2203 @end defun
2204
2205 @node Imenu
2206 @section Imenu
2207
2208 @cindex Imenu
2209 @dfn{Imenu} is a feature that lets users select a definition or
2210 section in the buffer, from a menu which lists all of them, to go
2211 directly to that location in the buffer. Imenu works by constructing
2212 a buffer index which lists the names and buffer positions of the
2213 definitions, or other named portions of the buffer; then the user can
2214 choose one of them and move point to it. Major modes can add a menu
2215 bar item to use Imenu using @code{imenu-add-to-menubar}.
2216
2217 @defun imenu-add-to-menubar name
2218 This function defines a local menu bar item named @var{name}
2219 to run Imenu.
2220 @end defun
2221
2222 The user-level commands for using Imenu are described in the Emacs
2223 Manual (@pxref{Imenu,, Imenu, emacs, the Emacs Manual}). This section
2224 explains how to customize Imenu's method of finding definitions or
2225 buffer portions for a particular major mode.
2226
2227 The usual and simplest way is to set the variable
2228 @code{imenu-generic-expression}:
2229
2230 @defvar imenu-generic-expression
2231 This variable, if non-@code{nil}, is a list that specifies regular
2232 expressions for finding definitions for Imenu. Simple elements of
2233 @code{imenu-generic-expression} look like this:
2234
2235 @example
2236 (@var{menu-title} @var{regexp} @var{index})
2237 @end example
2238
2239 Here, if @var{menu-title} is non-@code{nil}, it says that the matches
2240 for this element should go in a submenu of the buffer index;
2241 @var{menu-title} itself specifies the name for the submenu. If
2242 @var{menu-title} is @code{nil}, the matches for this element go directly
2243 in the top level of the buffer index.
2244
2245 The second item in the list, @var{regexp}, is a regular expression
2246 (@pxref{Regular Expressions}); anything in the buffer that it matches
2247 is considered a definition, something to mention in the buffer index.
2248 The third item, @var{index}, is a non-negative integer that indicates
2249 which subexpression in @var{regexp} matches the definition's name.
2250
2251 An element can also look like this:
2252
2253 @example
2254 (@var{menu-title} @var{regexp} @var{index} @var{function} @var{arguments}@dots{})
2255 @end example
2256
2257 Each match for this element creates an index item, and when the index
2258 item is selected by the user, it calls @var{function} with arguments
2259 consisting of the item name, the buffer position, and @var{arguments}.
2260
2261 For Emacs Lisp mode, @code{imenu-generic-expression} could look like
2262 this:
2263
2264 @c should probably use imenu-syntax-alist and \\sw rather than [-A-Za-z0-9+]
2265 @example
2266 @group
2267 ((nil "^\\s-*(def\\(un\\|subst\\|macro\\|advice\\)\
2268 \\s-+\\([-A-Za-z0-9+]+\\)" 2)
2269 @end group
2270 @group
2271 ("*Vars*" "^\\s-*(def\\(var\\|const\\)\
2272 \\s-+\\([-A-Za-z0-9+]+\\)" 2)
2273 @end group
2274 @group
2275 ("*Types*"
2276 "^\\s-*\
2277 (def\\(type\\|struct\\|class\\|ine-condition\\)\
2278 \\s-+\\([-A-Za-z0-9+]+\\)" 2))
2279 @end group
2280 @end example
2281
2282 Setting this variable makes it buffer-local in the current buffer.
2283 @end defvar
2284
2285 @defvar imenu-case-fold-search
2286 This variable controls whether matching against the regular
2287 expressions in the value of @code{imenu-generic-expression} is
2288 case-sensitive: @code{t}, the default, means matching should ignore
2289 case.
2290
2291 Setting this variable makes it buffer-local in the current buffer.
2292 @end defvar
2293
2294 @defvar imenu-syntax-alist
2295 This variable is an alist of syntax table modifiers to use while
2296 processing @code{imenu-generic-expression}, to override the syntax table
2297 of the current buffer. Each element should have this form:
2298
2299 @example
2300 (@var{characters} . @var{syntax-description})
2301 @end example
2302
2303 The @sc{car}, @var{characters}, can be either a character or a string.
2304 The element says to give that character or characters the syntax
2305 specified by @var{syntax-description}, which is passed to
2306 @code{modify-syntax-entry} (@pxref{Syntax Table Functions}).
2307
2308 This feature is typically used to give word syntax to characters which
2309 normally have symbol syntax, and thus to simplify
2310 @code{imenu-generic-expression} and speed up matching.
2311 For example, Fortran mode uses it this way:
2312
2313 @example
2314 (setq imenu-syntax-alist '(("_$" . "w")))
2315 @end example
2316
2317 The @code{imenu-generic-expression} regular expressions can then use
2318 @samp{\\sw+} instead of @samp{\\(\\sw\\|\\s_\\)+}. Note that this
2319 technique may be inconvenient when the mode needs to limit the initial
2320 character of a name to a smaller set of characters than are allowed in
2321 the rest of a name.
2322
2323 Setting this variable makes it buffer-local in the current buffer.
2324 @end defvar
2325
2326 Another way to customize Imenu for a major mode is to set the
2327 variables @code{imenu-prev-index-position-function} and
2328 @code{imenu-extract-index-name-function}:
2329
2330 @defvar imenu-prev-index-position-function
2331 If this variable is non-@code{nil}, its value should be a function that
2332 finds the next ``definition'' to put in the buffer index, scanning
2333 backward in the buffer from point. It should return @code{nil} if it
2334 doesn't find another ``definition'' before point. Otherwise it should
2335 leave point at the place it finds a ``definition'' and return any
2336 non-@code{nil} value.
2337
2338 Setting this variable makes it buffer-local in the current buffer.
2339 @end defvar
2340
2341 @defvar imenu-extract-index-name-function
2342 If this variable is non-@code{nil}, its value should be a function to
2343 return the name for a definition, assuming point is in that definition
2344 as the @code{imenu-prev-index-position-function} function would leave
2345 it.
2346
2347 Setting this variable makes it buffer-local in the current buffer.
2348 @end defvar
2349
2350 The last way to customize Imenu for a major mode is to set the
2351 variable @code{imenu-create-index-function}:
2352
2353 @defvar imenu-create-index-function
2354 This variable specifies the function to use for creating a buffer
2355 index. The function should take no arguments, and return an index
2356 alist for the current buffer. It is called within
2357 @code{save-excursion}, so where it leaves point makes no difference.
2358
2359 The index alist can have three types of elements. Simple elements
2360 look like this:
2361
2362 @example
2363 (@var{index-name} . @var{index-position})
2364 @end example
2365
2366 Selecting a simple element has the effect of moving to position
2367 @var{index-position} in the buffer. Special elements look like this:
2368
2369 @example
2370 (@var{index-name} @var{index-position} @var{function} @var{arguments}@dots{})
2371 @end example
2372
2373 Selecting a special element performs:
2374
2375 @example
2376 (funcall @var{function}
2377 @var{index-name} @var{index-position} @var{arguments}@dots{})
2378 @end example
2379
2380 A nested sub-alist element looks like this:
2381
2382 @example
2383 (@var{menu-title} @var{sub-alist})
2384 @end example
2385
2386 It creates the submenu @var{menu-title} specified by @var{sub-alist}.
2387
2388 The default value of @code{imenu-create-index-function} is
2389 @code{imenu-default-create-index-function}. This function calls the
2390 value of @code{imenu-prev-index-position-function} and the value of
2391 @code{imenu-extract-index-name-function} to produce the index alist.
2392 However, if either of these two variables is @code{nil}, the default
2393 function uses @code{imenu-generic-expression} instead.
2394
2395 Setting this variable makes it buffer-local in the current buffer.
2396 @end defvar
2397
2398 @node Font Lock Mode
2399 @section Font Lock Mode
2400 @cindex Font Lock mode
2401
2402 @dfn{Font Lock mode} is a feature that automatically attaches
2403 @code{face} properties to certain parts of the buffer based on their
2404 syntactic role. How it parses the buffer depends on the major mode;
2405 most major modes define syntactic criteria for which faces to use in
2406 which contexts. This section explains how to customize Font Lock for a
2407 particular major mode.
2408
2409 Font Lock mode finds text to highlight in two ways: through
2410 syntactic parsing based on the syntax table, and through searching
2411 (usually for regular expressions). Syntactic fontification happens
2412 first; it finds comments and string constants and highlights them.
2413 Search-based fontification happens second.
2414
2415 @menu
2416 * Font Lock Basics:: Overview of customizing Font Lock.
2417 * Search-based Fontification:: Fontification based on regexps.
2418 * Customizing Keywords:: Customizing search-based fontification.
2419 * Other Font Lock Variables:: Additional customization facilities.
2420 * Levels of Font Lock:: Each mode can define alternative levels
2421 so that the user can select more or less.
2422 * Precalculated Fontification:: How Lisp programs that produce the buffer
2423 contents can also specify how to fontify it.
2424 * Faces for Font Lock:: Special faces specifically for Font Lock.
2425 * Syntactic Font Lock:: Fontification based on syntax tables.
2426 * Setting Syntax Properties:: Defining character syntax based on context
2427 using the Font Lock mechanism.
2428 * Multiline Font Lock:: How to coerce Font Lock into properly
2429 highlighting multiline constructs.
2430 @end menu
2431
2432 @node Font Lock Basics
2433 @subsection Font Lock Basics
2434
2435 There are several variables that control how Font Lock mode highlights
2436 text. But major modes should not set any of these variables directly.
2437 Instead, they should set @code{font-lock-defaults} as a buffer-local
2438 variable. The value assigned to this variable is used, if and when Font
2439 Lock mode is enabled, to set all the other variables.
2440
2441 @defvar font-lock-defaults
2442 This variable is set by major modes, as a buffer-local variable, to
2443 specify how to fontify text in that mode. It automatically becomes
2444 buffer-local when you set it. If its value is @code{nil}, Font-Lock
2445 mode does no highlighting, and you can use the @samp{Faces} menu
2446 (under @samp{Edit} and then @samp{Text Properties} in the menu bar) to
2447 assign faces explicitly to text in the buffer.
2448
2449 If non-@code{nil}, the value should look like this:
2450
2451 @example
2452 (@var{keywords} [@var{keywords-only} [@var{case-fold}
2453 [@var{syntax-alist} [@var{syntax-begin} @var{other-vars}@dots{}]]]])
2454 @end example
2455
2456 The first element, @var{keywords}, indirectly specifies the value of
2457 @code{font-lock-keywords} which directs search-based fontification.
2458 It can be a symbol, a variable or a function whose value is the list
2459 to use for @code{font-lock-keywords}. It can also be a list of
2460 several such symbols, one for each possible level of fontification.
2461 The first symbol specifies the @samp{mode default} level of
2462 fontification, the next symbol level 1 fontification, the next level 2,
2463 and so on. The @samp{mode default} level is normally the same as level
2464 1. It is used when @code{font-lock-maximum-decoration} has a @code{nil}
2465 value. @xref{Levels of Font Lock}.
2466
2467 The second element, @var{keywords-only}, specifies the value of the
2468 variable @code{font-lock-keywords-only}. If this is omitted or
2469 @code{nil}, syntactic fontification (of strings and comments) is also
2470 performed. If this is non-@code{nil}, such fontification is not
2471 performed. @xref{Syntactic Font Lock}.
2472
2473 The third element, @var{case-fold}, specifies the value of
2474 @code{font-lock-keywords-case-fold-search}. If it is non-@code{nil},
2475 Font Lock mode ignores case when searching as directed by
2476 @code{font-lock-keywords}.
2477
2478 If the fourth element, @var{syntax-alist}, is non-@code{nil}, it
2479 should be a list of cons cells of the form @code{(@var{char-or-string}
2480 . @var{string})}. These are used to set up a syntax table for
2481 syntactic fontification (@pxref{Syntax Table Functions}). The
2482 resulting syntax table is stored in @code{font-lock-syntax-table}.
2483
2484 The fifth element, @var{syntax-begin}, specifies the value of
2485 @code{font-lock-beginning-of-syntax-function}. We recommend setting
2486 this variable to @code{nil} and using @code{syntax-begin-function}
2487 instead.
2488
2489 All the remaining elements (if any) are collectively called
2490 @var{other-vars}. Each of these elements should have the form
2491 @code{(@var{variable} . @var{value})}---which means, make
2492 @var{variable} buffer-local and then set it to @var{value}. You can
2493 use these @var{other-vars} to set other variables that affect
2494 fontification, aside from those you can control with the first five
2495 elements. @xref{Other Font Lock Variables}.
2496 @end defvar
2497
2498 If your mode fontifies text explicitly by adding
2499 @code{font-lock-face} properties, it can specify @code{(nil t)} for
2500 @code{font-lock-defaults} to turn off all automatic fontification.
2501 However, this is not required; it is possible to fontify some things
2502 using @code{font-lock-face} properties and set up automatic
2503 fontification for other parts of the text.
2504
2505 @node Search-based Fontification
2506 @subsection Search-based Fontification
2507
2508 The most important variable for customizing Font Lock mode is
2509 @code{font-lock-keywords}. It specifies the search criteria for
2510 search-based fontification. You should specify the value of this
2511 variable with @var{keywords} in @code{font-lock-defaults}.
2512
2513 @defvar font-lock-keywords
2514 This variable's value is a list of the keywords to highlight. Be
2515 careful when composing regular expressions for this list; a poorly
2516 written pattern can dramatically slow things down!
2517 @end defvar
2518
2519 Each element of @code{font-lock-keywords} specifies how to find
2520 certain cases of text, and how to highlight those cases. Font Lock mode
2521 processes the elements of @code{font-lock-keywords} one by one, and for
2522 each element, it finds and handles all matches. Ordinarily, once
2523 part of the text has been fontified already, this cannot be overridden
2524 by a subsequent match in the same text; but you can specify different
2525 behavior using the @var{override} element of a @var{subexp-highlighter}.
2526
2527 Each element of @code{font-lock-keywords} should have one of these
2528 forms:
2529
2530 @table @code
2531 @item @var{regexp}
2532 Highlight all matches for @var{regexp} using
2533 @code{font-lock-keyword-face}. For example,
2534
2535 @example
2536 ;; @r{Highlight occurrences of the word @samp{foo}}
2537 ;; @r{using @code{font-lock-keyword-face}.}
2538 "\\<foo\\>"
2539 @end example
2540
2541 The function @code{regexp-opt} (@pxref{Regexp Functions}) is useful
2542 for calculating optimal regular expressions to match a number of
2543 different keywords.
2544
2545 @item @var{function}
2546 Find text by calling @var{function}, and highlight the matches
2547 it finds using @code{font-lock-keyword-face}.
2548
2549 When @var{function} is called, it receives one argument, the limit of
2550 the search; it should begin searching at point, and not search beyond the
2551 limit. It should return non-@code{nil} if it succeeds, and set the
2552 match data to describe the match that was found. Returning @code{nil}
2553 indicates failure of the search.
2554
2555 Fontification will call @var{function} repeatedly with the same limit,
2556 and with point where the previous invocation left it, until
2557 @var{function} fails. On failure, @var{function} need not reset point
2558 in any particular way.
2559
2560 @item (@var{matcher} . @var{subexp})
2561 In this kind of element, @var{matcher} is either a regular
2562 expression or a function, as described above. The @sc{cdr},
2563 @var{subexp}, specifies which subexpression of @var{matcher} should be
2564 highlighted (instead of the entire text that @var{matcher} matched).
2565
2566 @example
2567 ;; @r{Highlight the @samp{bar} in each occurrence of @samp{fubar},}
2568 ;; @r{using @code{font-lock-keyword-face}.}
2569 ("fu\\(bar\\)" . 1)
2570 @end example
2571
2572 If you use @code{regexp-opt} to produce the regular expression
2573 @var{matcher}, you can use @code{regexp-opt-depth} (@pxref{Regexp
2574 Functions}) to calculate the value for @var{subexp}.
2575
2576 @item (@var{matcher} . @var{facespec})
2577 In this kind of element, @var{facespec} is an expression whose value
2578 specifies the face to use for highlighting. In the simplest case,
2579 @var{facespec} is a Lisp variable (a symbol) whose value is a face
2580 name.
2581
2582 @example
2583 ;; @r{Highlight occurrences of @samp{fubar},}
2584 ;; @r{using the face which is the value of @code{fubar-face}.}
2585 ("fubar" . fubar-face)
2586 @end example
2587
2588 However, @var{facespec} can also evaluate to a list of this form:
2589
2590 @example
2591 (face @var{face} @var{prop1} @var{val1} @var{prop2} @var{val2}@dots{})
2592 @end example
2593
2594 @noindent
2595 to specify the face @var{face} and various additional text properties
2596 to put on the text that matches. If you do this, be sure to add the
2597 other text property names that you set in this way to the value of
2598 @code{font-lock-extra-managed-props} so that the properties will also
2599 be cleared out when they are no longer appropriate. Alternatively,
2600 you can set the variable @code{font-lock-unfontify-region-function} to
2601 a function that clears these properties. @xref{Other Font Lock
2602 Variables}.
2603
2604 @item (@var{matcher} . @var{subexp-highlighter})
2605 In this kind of element, @var{subexp-highlighter} is a list
2606 which specifies how to highlight matches found by @var{matcher}.
2607 It has the form:
2608
2609 @example
2610 (@var{subexp} @var{facespec} [@var{override} [@var{laxmatch}]])
2611 @end example
2612
2613 The @sc{car}, @var{subexp}, is an integer specifying which subexpression
2614 of the match to fontify (0 means the entire matching text). The second
2615 subelement, @var{facespec}, is an expression whose value specifies the
2616 face, as described above.
2617
2618 The last two values in @var{subexp-highlighter}, @var{override} and
2619 @var{laxmatch}, are optional flags. If @var{override} is @code{t},
2620 this element can override existing fontification made by previous
2621 elements of @code{font-lock-keywords}. If it is @code{keep}, then
2622 each character is fontified if it has not been fontified already by
2623 some other element. If it is @code{prepend}, the face specified by
2624 @var{facespec} is added to the beginning of the @code{font-lock-face}
2625 property. If it is @code{append}, the face is added to the end of the
2626 @code{font-lock-face} property.
2627
2628 If @var{laxmatch} is non-@code{nil}, it means there should be no error
2629 if there is no subexpression numbered @var{subexp} in @var{matcher}.
2630 Obviously, fontification of the subexpression numbered @var{subexp} will
2631 not occur. However, fontification of other subexpressions (and other
2632 regexps) will continue. If @var{laxmatch} is @code{nil}, and the
2633 specified subexpression is missing, then an error is signaled which
2634 terminates search-based fontification.
2635
2636 Here are some examples of elements of this kind, and what they do:
2637
2638 @smallexample
2639 ;; @r{Highlight occurrences of either @samp{foo} or @samp{bar}, using}
2640 ;; @r{@code{foo-bar-face}, even if they have already been highlighted.}
2641 ;; @r{@code{foo-bar-face} should be a variable whose value is a face.}
2642 ("foo\\|bar" 0 foo-bar-face t)
2643
2644 ;; @r{Highlight the first subexpression within each occurrence}
2645 ;; @r{that the function @code{fubar-match} finds,}
2646 ;; @r{using the face which is the value of @code{fubar-face}.}
2647 (fubar-match 1 fubar-face)
2648 @end smallexample
2649
2650 @item (@var{matcher} . @var{anchored-highlighter})
2651 In this kind of element, @var{anchored-highlighter} specifies how to
2652 highlight text that follows a match found by @var{matcher}. So a
2653 match found by @var{matcher} acts as the anchor for further searches
2654 specified by @var{anchored-highlighter}. @var{anchored-highlighter}
2655 is a list of the following form:
2656
2657 @example
2658 (@var{anchored-matcher} @var{pre-form} @var{post-form}
2659 @var{subexp-highlighters}@dots{})
2660 @end example
2661
2662 Here, @var{anchored-matcher}, like @var{matcher}, is either a regular
2663 expression or a function. After a match of @var{matcher} is found,
2664 point is at the end of the match. Now, Font Lock evaluates the form
2665 @var{pre-form}. Then it searches for matches of
2666 @var{anchored-matcher} and uses @var{subexp-highlighters} to highlight
2667 these. A @var{subexp-highlighter} is as described above. Finally,
2668 Font Lock evaluates @var{post-form}.
2669
2670 The forms @var{pre-form} and @var{post-form} can be used to initialize
2671 before, and cleanup after, @var{anchored-matcher} is used. Typically,
2672 @var{pre-form} is used to move point to some position relative to the
2673 match of @var{matcher}, before starting with @var{anchored-matcher}.
2674 @var{post-form} might be used to move back, before resuming with
2675 @var{matcher}.
2676
2677 After Font Lock evaluates @var{pre-form}, it does not search for
2678 @var{anchored-matcher} beyond the end of the line. However, if
2679 @var{pre-form} returns a buffer position that is greater than the
2680 position of point after @var{pre-form} is evaluated, then the position
2681 returned by @var{pre-form} is used as the limit of the search instead.
2682 It is generally a bad idea to return a position greater than the end
2683 of the line; in other words, the @var{anchored-matcher} search should
2684 not span lines.
2685
2686 For example,
2687
2688 @smallexample
2689 ;; @r{Highlight occurrences of the word @samp{item} following}
2690 ;; @r{an occurrence of the word @samp{anchor} (on the same line)}
2691 ;; @r{in the value of @code{item-face}.}
2692 ("\\<anchor\\>" "\\<item\\>" nil nil (0 item-face))
2693 @end smallexample
2694
2695 Here, @var{pre-form} and @var{post-form} are @code{nil}. Therefore
2696 searching for @samp{item} starts at the end of the match of
2697 @samp{anchor}, and searching for subsequent instances of @samp{anchor}
2698 resumes from where searching for @samp{item} concluded.
2699
2700 @item (@var{matcher} @var{highlighters}@dots{})
2701 This sort of element specifies several @var{highlighter} lists for a
2702 single @var{matcher}. A @var{highlighter} list can be of the type
2703 @var{subexp-highlighter} or @var{anchored-highlighter} as described
2704 above.
2705
2706 For example,
2707
2708 @smallexample
2709 ;; @r{Highlight occurrences of the word @samp{anchor} in the value}
2710 ;; @r{of @code{anchor-face}, and subsequent occurrences of the word}
2711 ;; @r{@samp{item} (on the same line) in the value of @code{item-face}.}
2712 ("\\<anchor\\>" (0 anchor-face)
2713 ("\\<item\\>" nil nil (0 item-face)))
2714 @end smallexample
2715
2716 @item (eval . @var{form})
2717 Here @var{form} is an expression to be evaluated the first time
2718 this value of @code{font-lock-keywords} is used in a buffer.
2719 Its value should have one of the forms described in this table.
2720 @end table
2721
2722 @strong{Warning:} Do not design an element of @code{font-lock-keywords}
2723 to match text which spans lines; this does not work reliably.
2724 For details, see @xref{Multiline Font Lock}.
2725
2726 You can use @var{case-fold} in @code{font-lock-defaults} to specify
2727 the value of @code{font-lock-keywords-case-fold-search} which says
2728 whether search-based fontification should be case-insensitive.
2729
2730 @defvar font-lock-keywords-case-fold-search
2731 Non-@code{nil} means that regular expression matching for the sake of
2732 @code{font-lock-keywords} should be case-insensitive.
2733 @end defvar
2734
2735 @node Customizing Keywords
2736 @subsection Customizing Search-Based Fontification
2737
2738 You can use @code{font-lock-add-keywords} to add additional
2739 search-based fontification rules to a major mode, and
2740 @code{font-lock-remove-keywords} to remove rules.
2741
2742 @defun font-lock-add-keywords mode keywords &optional how
2743 This function adds highlighting @var{keywords}, for the current buffer
2744 or for major mode @var{mode}. The argument @var{keywords} should be a
2745 list with the same format as the variable @code{font-lock-keywords}.
2746
2747 If @var{mode} is a symbol which is a major mode command name, such as
2748 @code{c-mode}, the effect is that enabling Font Lock mode in
2749 @var{mode} will add @var{keywords} to @code{font-lock-keywords}.
2750 Calling with a non-@code{nil} value of @var{mode} is correct only in
2751 your @file{~/.emacs} file.
2752
2753 If @var{mode} is @code{nil}, this function adds @var{keywords} to
2754 @code{font-lock-keywords} in the current buffer. This way of calling
2755 @code{font-lock-add-keywords} is usually used in mode hook functions.
2756
2757 By default, @var{keywords} are added at the beginning of
2758 @code{font-lock-keywords}. If the optional argument @var{how} is
2759 @code{set}, they are used to replace the value of
2760 @code{font-lock-keywords}. If @var{how} is any other non-@code{nil}
2761 value, they are added at the end of @code{font-lock-keywords}.
2762
2763 Some modes provide specialized support you can use in additional
2764 highlighting patterns. See the variables
2765 @code{c-font-lock-extra-types}, @code{c++-font-lock-extra-types},
2766 and @code{java-font-lock-extra-types}, for example.
2767
2768 @strong{Warning:} major mode commands must not call
2769 @code{font-lock-add-keywords} under any circumstances, either directly
2770 or indirectly, except through their mode hooks. (Doing so would lead to
2771 incorrect behavior for some minor modes.) They should set up their
2772 rules for search-based fontification by setting
2773 @code{font-lock-keywords}.
2774 @end defun
2775
2776 @defun font-lock-remove-keywords mode keywords
2777 This function removes @var{keywords} from @code{font-lock-keywords}
2778 for the current buffer or for major mode @var{mode}. As in
2779 @code{font-lock-add-keywords}, @var{mode} should be a major mode
2780 command name or @code{nil}. All the caveats and requirements for
2781 @code{font-lock-add-keywords} apply here too.
2782 @end defun
2783
2784 For example, this code
2785
2786 @smallexample
2787 (font-lock-add-keywords 'c-mode
2788 '(("\\<\\(FIXME\\):" 1 font-lock-warning-face prepend)
2789 ("\\<\\(and\\|or\\|not\\)\\>" . font-lock-keyword-face)))
2790 @end smallexample
2791
2792 @noindent
2793 adds two fontification patterns for C mode: one to fontify the word
2794 @samp{FIXME}, even in comments, and another to fontify the words
2795 @samp{and}, @samp{or} and @samp{not} as keywords.
2796
2797 @noindent
2798 That example affects only C mode proper. To add the same patterns to
2799 C mode @emph{and} all modes derived from it, do this instead:
2800
2801 @smallexample
2802 (add-hook 'c-mode-hook
2803 (lambda ()
2804 (font-lock-add-keywords nil
2805 '(("\\<\\(FIXME\\):" 1 font-lock-warning-face prepend)
2806 ("\\<\\(and\\|or\\|not\\)\\>" .
2807 font-lock-keyword-face)))))
2808 @end smallexample
2809
2810 @node Other Font Lock Variables
2811 @subsection Other Font Lock Variables
2812
2813 This section describes additional variables that a major mode can
2814 set by means of @var{other-vars} in @code{font-lock-defaults}
2815 (@pxref{Font Lock Basics}).
2816
2817 @defvar font-lock-mark-block-function
2818 If this variable is non-@code{nil}, it should be a function that is
2819 called with no arguments, to choose an enclosing range of text for
2820 refontification for the command @kbd{M-o M-o}
2821 (@code{font-lock-fontify-block}).
2822
2823 The function should report its choice by placing the region around it.
2824 A good choice is a range of text large enough to give proper results,
2825 but not too large so that refontification becomes slow. Typical values
2826 are @code{mark-defun} for programming modes or @code{mark-paragraph} for
2827 textual modes.
2828 @end defvar
2829
2830 @defvar font-lock-extra-managed-props
2831 This variable specifies additional properties (other than
2832 @code{font-lock-face}) that are being managed by Font Lock mode. It
2833 is used by @code{font-lock-default-unfontify-region}, which normally
2834 only manages the @code{font-lock-face} property. If you want Font
2835 Lock to manage other properties as well, you must specify them in a
2836 @var{facespec} in @code{font-lock-keywords} as well as add them to
2837 this list. @xref{Search-based Fontification}.
2838 @end defvar
2839
2840 @defvar font-lock-fontify-buffer-function
2841 Function to use for fontifying the buffer. The default value is
2842 @code{font-lock-default-fontify-buffer}.
2843 @end defvar
2844
2845 @defvar font-lock-unfontify-buffer-function
2846 Function to use for unfontifying the buffer. This is used when
2847 turning off Font Lock mode. The default value is
2848 @code{font-lock-default-unfontify-buffer}.
2849 @end defvar
2850
2851 @defvar font-lock-fontify-region-function
2852 Function to use for fontifying a region. It should take two
2853 arguments, the beginning and end of the region, and an optional third
2854 argument @var{verbose}. If @var{verbose} is non-@code{nil}, the
2855 function should print status messages. The default value is
2856 @code{font-lock-default-fontify-region}.
2857 @end defvar
2858
2859 @defvar font-lock-unfontify-region-function
2860 Function to use for unfontifying a region. It should take two
2861 arguments, the beginning and end of the region. The default value is
2862 @code{font-lock-default-unfontify-region}.
2863 @end defvar
2864
2865 @defun jit-lock-register function &optional contextual
2866 This function tells Font Lock mode to run the Lisp function
2867 @var{function} any time it has to fontify or refontify part of the
2868 current buffer. It calls @var{function} before calling the default
2869 fontification functions, and gives it two arguments, @var{start} and
2870 @var{end}, which specify the region to be fontified or refontified.
2871
2872 The optional argument @var{contextual}, if non-@code{nil}, forces Font
2873 Lock mode to always refontify a syntactically relevant part of the
2874 buffer, and not just the modified lines. This argument can usually be
2875 omitted.
2876 @end defun
2877
2878 @defun jit-lock-unregister function
2879 If @var{function} was previously registered as a fontification
2880 function using @code{jit-lock-register}, this function unregisters it.
2881 @end defun
2882
2883 @node Levels of Font Lock
2884 @subsection Levels of Font Lock
2885
2886 Many major modes offer three different levels of fontification. You
2887 can define multiple levels by using a list of symbols for @var{keywords}
2888 in @code{font-lock-defaults}. Each symbol specifies one level of
2889 fontification; it is up to the user to choose one of these levels,
2890 normally by setting @code{font-lock-maximum-decoration} (@pxref{Font
2891 Lock,,, emacs, the GNU Emacs Manual}). The chosen level's symbol
2892 value is used to initialize @code{font-lock-keywords}.
2893
2894 Here are the conventions for how to define the levels of
2895 fontification:
2896
2897 @itemize @bullet
2898 @item
2899 Level 1: highlight function declarations, file directives (such as include or
2900 import directives), strings and comments. The idea is speed, so only
2901 the most important and top-level components are fontified.
2902
2903 @item
2904 Level 2: in addition to level 1, highlight all language keywords,
2905 including type names that act like keywords, as well as named constant
2906 values. The idea is that all keywords (either syntactic or semantic)
2907 should be fontified appropriately.
2908
2909 @item
2910 Level 3: in addition to level 2, highlight the symbols being defined in
2911 function and variable declarations, and all builtin function names,
2912 wherever they appear.
2913 @end itemize
2914
2915 @node Precalculated Fontification
2916 @subsection Precalculated Fontification
2917
2918 Some major modes such as @code{list-buffers} and @code{occur}
2919 construct the buffer text programmatically. The easiest way for them
2920 to support Font Lock mode is to specify the faces of text when they
2921 insert the text in the buffer.
2922
2923 The way to do this is to specify the faces in the text with the
2924 special text property @code{font-lock-face} (@pxref{Special
2925 Properties}). When Font Lock mode is enabled, this property controls
2926 the display, just like the @code{face} property. When Font Lock mode
2927 is disabled, @code{font-lock-face} has no effect on the display.
2928
2929 It is ok for a mode to use @code{font-lock-face} for some text and
2930 also use the normal Font Lock machinery. But if the mode does not use
2931 the normal Font Lock machinery, it should not set the variable
2932 @code{font-lock-defaults}.
2933
2934 @node Faces for Font Lock
2935 @subsection Faces for Font Lock
2936 @cindex faces for font lock
2937 @cindex font lock faces
2938
2939 Font Lock mode can highlight using any face, but Emacs defines several
2940 faces specifically for syntactic highlighting. These @dfn{Font Lock
2941 faces} are listed below. They can also be used by major modes for
2942 syntactic highlighting outside of Font Lock mode (@pxref{Major Mode
2943 Conventions}).
2944
2945 Each of these symbols is both a face name, and a variable whose
2946 default value is the symbol itself. Thus, the default value of
2947 @code{font-lock-comment-face} is @code{font-lock-comment-face}.
2948
2949 The faces are listed with descriptions of their typical usage, and in
2950 order of greater to lesser ``prominence''. If a mode's syntactic
2951 categories do not fit well with the usage descriptions, the faces can be
2952 assigned using the ordering as a guide.
2953
2954 @table @code
2955 @item font-lock-warning-face
2956 @vindex font-lock-warning-face
2957 for a construct that is peculiar, or that greatly changes the meaning of
2958 other text, like @samp{;;;###autoload} in Emacs Lisp and @samp{#error}
2959 in C.
2960
2961 @item font-lock-function-name-face
2962 @vindex font-lock-function-name-face
2963 for the name of a function being defined or declared.
2964
2965 @item font-lock-variable-name-face
2966 @vindex font-lock-variable-name-face
2967 for the name of a variable being defined or declared.
2968
2969 @item font-lock-keyword-face
2970 @vindex font-lock-keyword-face
2971 for a keyword with special syntactic significance, like @samp{for} and
2972 @samp{if} in C.
2973
2974 @item font-lock-comment-face
2975 @vindex font-lock-comment-face
2976 for comments.
2977
2978 @item font-lock-comment-delimiter-face
2979 @vindex font-lock-comment-delimiter-face
2980 for comments delimiters, like @samp{/*} and @samp{*/} in C. On most
2981 terminals, this inherits from @code{font-lock-comment-face}.
2982
2983 @item font-lock-type-face
2984 @vindex font-lock-type-face
2985 for the names of user-defined data types.
2986
2987 @item font-lock-constant-face
2988 @vindex font-lock-constant-face
2989 for the names of constants, like @samp{NULL} in C.
2990
2991 @item font-lock-builtin-face
2992 @vindex font-lock-builtin-face
2993 for the names of built-in functions.
2994
2995 @item font-lock-preprocessor-face
2996 @vindex font-lock-preprocessor-face
2997 for preprocessor commands. This inherits, by default, from
2998 @code{font-lock-builtin-face}.
2999
3000 @item font-lock-string-face
3001 @vindex font-lock-string-face
3002 for string constants.
3003
3004 @item font-lock-doc-face
3005 @vindex font-lock-doc-face
3006 for documentation strings in the code. This inherits, by default, from
3007 @code{font-lock-string-face}.
3008
3009 @item font-lock-negation-char-face
3010 @vindex font-lock-negation-char-face
3011 for easily-overlooked negation characters.
3012 @end table
3013
3014 @node Syntactic Font Lock
3015 @subsection Syntactic Font Lock
3016 @cindex syntactic font lock
3017
3018 Syntactic fontification uses the syntax table to find comments and
3019 string constants (@pxref{Syntax Tables}). It highlights them using
3020 @code{font-lock-comment-face} and @code{font-lock-string-face}
3021 (@pxref{Faces for Font Lock}), or whatever
3022 @code{font-lock-syntactic-face-function} chooses. There are several
3023 variables that affect syntactic fontification; you should set them by
3024 means of @code{font-lock-defaults} (@pxref{Font Lock Basics}).
3025
3026 @defvar font-lock-keywords-only
3027 Non-@code{nil} means Font Lock should not do syntactic fontification;
3028 it should only fontify based on @code{font-lock-keywords}. The normal
3029 way for a mode to set this variable to @code{t} is with
3030 @var{keywords-only} in @code{font-lock-defaults}.
3031 @end defvar
3032
3033 @defvar font-lock-syntax-table
3034 This variable holds the syntax table to use for fontification of
3035 comments and strings. Specify it using @var{syntax-alist} in
3036 @code{font-lock-defaults}. If this is @code{nil}, fontification uses
3037 the buffer's syntax table.
3038 @end defvar
3039
3040 @defvar font-lock-beginning-of-syntax-function
3041 If this variable is non-@code{nil}, it should be a function to move
3042 point back to a position that is syntactically at ``top level'' and
3043 outside of strings or comments. Font Lock uses this when necessary
3044 to get the right results for syntactic fontification.
3045
3046 This function is called with no arguments. It should leave point at
3047 the beginning of any enclosing syntactic block. Typical values are
3048 @code{beginning-of-line} (used when the start of the line is known to
3049 be outside a syntactic block), or @code{beginning-of-defun} for
3050 programming modes, or @code{backward-paragraph} for textual modes.
3051
3052 If the value is @code{nil}, Font Lock uses
3053 @code{syntax-begin-function} to move back outside of any comment,
3054 string, or sexp. This variable is semi-obsolete; we recommend setting
3055 @code{syntax-begin-function} instead.
3056
3057 Specify this variable using @var{syntax-begin} in
3058 @code{font-lock-defaults}.
3059 @end defvar
3060
3061 @defvar font-lock-syntactic-face-function
3062 A function to determine which face to use for a given syntactic
3063 element (a string or a comment). The function is called with one
3064 argument, the parse state at point returned by
3065 @code{parse-partial-sexp}, and should return a face. The default
3066 value returns @code{font-lock-comment-face} for comments and
3067 @code{font-lock-string-face} for strings.
3068
3069 This can be used to highlighting different kinds of strings or
3070 comments differently. It is also sometimes abused together with
3071 @code{font-lock-syntactic-keywords} to highlight constructs that span
3072 multiple lines, but this is too esoteric to document here.
3073
3074 Specify this variable using @var{other-vars} in
3075 @code{font-lock-defaults}.
3076 @end defvar
3077
3078 @node Setting Syntax Properties
3079 @subsection Setting Syntax Properties
3080
3081 Font Lock mode can be used to update @code{syntax-table} properties
3082 automatically (@pxref{Syntax Properties}). This is useful in
3083 languages for which a single syntax table by itself is not sufficient.
3084
3085 @defvar font-lock-syntactic-keywords
3086 This variable enables and controls updating @code{syntax-table}
3087 properties by Font Lock. Its value should be a list of elements of
3088 this form:
3089
3090 @example
3091 (@var{matcher} @var{subexp} @var{syntax} @var{override} @var{laxmatch})
3092 @end example
3093
3094 The parts of this element have the same meanings as in the corresponding
3095 sort of element of @code{font-lock-keywords},
3096
3097 @example
3098 (@var{matcher} @var{subexp} @var{facespec} @var{override} @var{laxmatch})
3099 @end example
3100
3101 However, instead of specifying the value @var{facespec} to use for the
3102 @code{face} property, it specifies the value @var{syntax} to use for
3103 the @code{syntax-table} property. Here, @var{syntax} can be a string
3104 (as taken by @code{modify-syntax-entry}), a syntax table, a cons cell
3105 (as returned by @code{string-to-syntax}), or an expression whose value
3106 is one of those two types. @var{override} cannot be @code{prepend} or
3107 @code{append}.
3108
3109 For example, an element of the form:
3110
3111 @example
3112 ("\\$\\(#\\)" 1 ".")
3113 @end example
3114
3115 highlights syntactically a hash character when following a dollar
3116 character, with a SYNTAX of @code{"."} (meaning punctuation syntax).
3117 Assuming that the buffer syntax table specifies hash characters to
3118 have comment start syntax, the element will only highlight hash
3119 characters that do not follow dollar characters as comments
3120 syntactically.
3121
3122 An element of the form:
3123
3124 @example
3125 ("\\('\\).\\('\\)"
3126 (1 "\"")
3127 (2 "\""))
3128 @end example
3129
3130 highlights syntactically both single quotes which surround a single
3131 character, with a SYNTAX of @code{"\""} (meaning string quote syntax).
3132 Assuming that the buffer syntax table does not specify single quotes
3133 to have quote syntax, the element will only highlight single quotes of
3134 the form @samp{'@var{c}'} as strings syntactically. Other forms, such
3135 as @samp{foo'bar} or @samp{'fubar'}, will not be highlighted as
3136 strings.
3137
3138 Major modes normally set this variable with @var{other-vars} in
3139 @code{font-lock-defaults}.
3140 @end defvar
3141
3142 @node Multiline Font Lock
3143 @subsection Multiline Font Lock Constructs
3144 @cindex multiline font lock
3145
3146 Normally, elements of @code{font-lock-keywords} should not match
3147 across multiple lines; that doesn't work reliably, because Font Lock
3148 usually scans just part of the buffer, and it can miss a multi-line
3149 construct that crosses the line boundary where the scan starts. (The
3150 scan normally starts at the beginning of a line.)
3151
3152 Making elements that match multiline constructs work properly has
3153 two aspects: correct @emph{identification} and correct
3154 @emph{rehighlighting}. The first means that Font Lock finds all
3155 multiline constructs. The second means that Font Lock will correctly
3156 rehighlight all the relevant text when a multiline construct is
3157 changed---for example, if some of the text that was previously part of
3158 a multiline construct ceases to be part of it. The two aspects are
3159 closely related, and often getting one of them to work will appear to
3160 make the other also work. However, for reliable results you must
3161 attend explicitly to both aspects.
3162
3163 There are three ways to ensure correct identification of multiline
3164 constructs:
3165
3166 @itemize
3167 @item
3168 Add a function to @code{font-lock-extend-region-functions} that does
3169 the @emph{identification} and extends the scan so that the scanned
3170 text never starts or ends in the middle of a multiline construct.
3171 @item
3172 Use the @code{font-lock-fontify-region-function} hook similarly to
3173 extend the scan so that the scanned text never starts or ends in the
3174 middle of a multiline construct.
3175 @item
3176 Somehow identify the multiline construct right when it gets inserted
3177 into the buffer (or at any point after that but before font-lock
3178 tries to highlight it), and mark it with a @code{font-lock-multiline}
3179 which will instruct font-lock not to start or end the scan in the
3180 middle of the construct.
3181 @end itemize
3182
3183 There are three ways to do rehighlighting of multiline constructs:
3184
3185 @itemize
3186 @item
3187 Place a @code{font-lock-multiline} property on the construct. This
3188 will rehighlight the whole construct if any part of it is changed. In
3189 some cases you can do this automatically by setting the
3190 @code{font-lock-multiline} variable, which see.
3191 @item
3192 Make sure @code{jit-lock-contextually} is set and rely on it doing its
3193 job. This will only rehighlight the part of the construct that
3194 follows the actual change, and will do it after a short delay.
3195 This only works if the highlighting of the various parts of your
3196 multiline construct never depends on text in subsequent lines.
3197 Since @code{jit-lock-contextually} is activated by default, this can
3198 be an attractive solution.
3199 @item
3200 Place a @code{jit-lock-defer-multiline} property on the construct.
3201 This works only if @code{jit-lock-contextually} is used, and with the
3202 same delay before rehighlighting, but like @code{font-lock-multiline},
3203 it also handles the case where highlighting depends on
3204 subsequent lines.
3205 @end itemize
3206
3207 @menu
3208 * Font Lock Multiline:: Marking multiline chunks with a text property.
3209 * Region to Refontify:: Controlling which region gets refontified
3210 after a buffer change.
3211 @end menu
3212
3213 @node Font Lock Multiline
3214 @subsubsection Font Lock Multiline
3215
3216 One way to ensure reliable rehighlighting of multiline Font Lock
3217 constructs is to put on them the text property @code{font-lock-multiline}.
3218 It should be present and non-@code{nil} for text that is part of a
3219 multiline construct.
3220
3221 When Font Lock is about to highlight a range of text, it first
3222 extends the boundaries of the range as necessary so that they do not
3223 fall within text marked with the @code{font-lock-multiline} property.
3224 Then it removes any @code{font-lock-multiline} properties from the
3225 range, and highlights it. The highlighting specification (mostly
3226 @code{font-lock-keywords}) must reinstall this property each time,
3227 whenever it is appropriate.
3228
3229 @strong{Warning:} don't use the @code{font-lock-multiline} property
3230 on large ranges of text, because that will make rehighlighting slow.
3231
3232 @defvar font-lock-multiline
3233 If the @code{font-lock-multiline} variable is set to @code{t}, Font
3234 Lock will try to add the @code{font-lock-multiline} property
3235 automatically on multiline constructs. This is not a universal
3236 solution, however, since it slows down Font Lock somewhat. It can
3237 miss some multiline constructs, or make the property larger or smaller
3238 than necessary.
3239
3240 For elements whose @var{matcher} is a function, the function should
3241 ensure that submatch 0 covers the whole relevant multiline construct,
3242 even if only a small subpart will be highlighted. It is often just as
3243 easy to add the @code{font-lock-multiline} property by hand.
3244 @end defvar
3245
3246 The @code{font-lock-multiline} property is meant to ensure proper
3247 refontification; it does not automatically identify new multiline
3248 constructs. Identifying the requires that Font-Lock operate on large
3249 enough chunks at a time. This will happen by accident on many cases,
3250 which may give the impression that multiline constructs magically work.
3251 If you set the @code{font-lock-multiline} variable non-@code{nil},
3252 this impression will be even stronger, since the highlighting of those
3253 constructs which are found will be properly updated from then on.
3254 But that does not work reliably.
3255
3256 To find multiline constructs reliably, you must either manually
3257 place the @code{font-lock-multiline} property on the text before
3258 Font-Lock looks at it, or use
3259 @code{font-lock-fontify-region-function}.
3260
3261 @node Region to Refontify
3262 @subsubsection Region to Fontify after a Buffer Change
3263
3264 When a buffer is changed, the region that Font Lock refontifies is
3265 by default the smallest sequence of whole lines that spans the change.
3266 While this works well most of the time, sometimes it doesn't---for
3267 example, when a change alters the syntactic meaning of text on an
3268 earlier line.
3269
3270 You can enlarge (or even reduce) the region to refontify by setting
3271 the following variable:
3272
3273 @defvar font-lock-extend-after-change-region-function
3274 This buffer-local variable is either @code{nil} or a function for
3275 Font-Lock to call to determine the region to scan and fontify.
3276
3277 The function is given three parameters, the standard @var{beg},
3278 @var{end}, and @var{old-len} from @code{after-change-functions}
3279 (@pxref{Change Hooks}). It should return either a cons of the
3280 beginning and end buffer positions (in that order) of the region to
3281 fontify, or @code{nil} (which means choose the region in the standard
3282 way). This function needs to preserve point, the match-data, and the
3283 current restriction. The region it returns may start or end in the
3284 middle of a line.
3285
3286 Since this function is called after every buffer change, it should be
3287 reasonably fast.
3288 @end defvar
3289
3290 @node Auto-Indentation
3291 @section Auto-indentation of code
3292
3293 For programming languages, an important feature of a major mode is to
3294 provide automatic indentation. This is controlled in Emacs by
3295 @code{indent-line-function} (@pxref{Mode-Specific Indent}).
3296 Writing a good indentation function can be difficult and to a large
3297 extent it is still a black art.
3298
3299 Many major mode authors will start by writing a simple indentation
3300 function that works for simple cases, for example by comparing with the
3301 indentation of the previous text line. For most programming languages
3302 that are not really line-based, this tends to scale very poorly:
3303 improving such a function to let it handle more diverse situations tends
3304 to become more and more difficult, resulting in the end with a large,
3305 complex, unmaintainable indentation function which nobody dares to touch.
3306
3307 A good indentation function will usually need to actually parse the
3308 text, according to the syntax of the language. Luckily, it is not
3309 necessary to parse the text in as much detail as would be needed
3310 for a compiler, but on the other hand, the parser embedded in the
3311 indentation code will want to be somewhat friendly to syntactically
3312 incorrect code.
3313
3314 Good maintainable indentation functions usually fall into 2 categories:
3315 either parsing forward from some ``safe'' starting point until the
3316 position of interest, or parsing backward from the position of interest.
3317 Neither of the two is a clearly better choice than the other: parsing
3318 backward is often more difficult than parsing forward because
3319 programming languages are designed to be parsed forward, but for the
3320 purpose of indentation it has the advantage of not needing to
3321 guess a ``safe'' starting point, and it generally enjoys the property
3322 that only a minimum of text will be analyzed to decide the indentation
3323 of a line, so indentation will tend to be unaffected by syntax errors in
3324 some earlier unrelated piece of code. Parsing forward on the other hand
3325 is usually easier and has the advantage of making it possible to
3326 reindent efficiently a whole region at a time, with a single parse.
3327
3328 Rather than write your own indentation function from scratch, it is
3329 often preferable to try and reuse some existing ones or to rely
3330 on a generic indentation engine. There are sadly few such
3331 engines. The CC-mode indentation code (used with C, C++, Java, Awk
3332 and a few other such modes) has been made more generic over the years,
3333 so if your language seems somewhat similar to one of those languages,
3334 you might try to use that engine. @c FIXME: documentation?
3335 Another one is SMIE which takes an approach in the spirit
3336 of Lisp sexps and adapts it to non-Lisp languages.
3337
3338 @menu
3339 * SMIE:: A simple minded indentation engine
3340 @end menu
3341
3342 @node SMIE
3343 @subsection Simple Minded Indentation Engine
3344
3345 SMIE is a package that provides a generic navigation and indentation
3346 engine. Based on a very simple parser using an ``operator precedence
3347 grammar'', it lets major modes extend the sexp-based navigation of Lisp
3348 to non-Lisp languages as well as provide a simple to use but reliable
3349 auto-indentation.
3350
3351 Operator precedence grammar is a very primitive technology for parsing
3352 compared to some of the more common techniques used in compilers.
3353 It has the following characteristics: its parsing power is very limited,
3354 and it is largely unable to detect syntax errors, but it has the
3355 advantage of being algorithmically efficient and able to parse forward
3356 just as well as backward. In practice that means that SMIE can use it
3357 for indentation based on backward parsing, that it can provide both
3358 @code{forward-sexp} and @code{backward-sexp} functionality, and that it
3359 will naturally work on syntactically incorrect code without any extra
3360 effort. The downside is that it also means that most programming
3361 languages cannot be parsed correctly using SMIE, at least not without
3362 resorting to some special tricks (@pxref{SMIE Tricks}).
3363
3364 @menu
3365 * SMIE setup:: SMIE setup and features
3366 * Operator Precedence Grammars:: A very simple parsing technique
3367 * SMIE Grammar:: Defining the grammar of a language
3368 * SMIE Lexer:: Defining tokens
3369 * SMIE Tricks:: Working around the parser's limitations
3370 * SMIE Indentation:: Specifying indentation rules
3371 * SMIE Indentation Helpers:: Helper functions for indentation rules
3372 * SMIE Indentation Example:: Sample indentation rules
3373 @end menu
3374
3375 @node SMIE setup
3376 @subsubsection SMIE Setup and Features
3377
3378 SMIE is meant to be a one-stop shop for structural navigation and
3379 various other features which rely on the syntactic structure of code, in
3380 particular automatic indentation. The main entry point is
3381 @code{smie-setup} which is a function typically called while setting
3382 up a major mode.
3383
3384 @defun smie-setup grammar rules-function &rest keywords
3385 Setup SMIE navigation and indentation.
3386 @var{grammar} is a grammar table generated by @code{smie-prec2->grammar}.
3387 @var{rules-function} is a set of indentation rules for use on
3388 @code{smie-rules-function}.
3389 @var{keywords} are additional arguments, which can include the following
3390 keywords:
3391 @itemize
3392 @item
3393 @code{:forward-token} @var{fun}: Specify the forward lexer to use.
3394 @item
3395 @code{:backward-token} @var{fun}: Specify the backward lexer to use.
3396 @end itemize
3397 @end defun
3398
3399 Calling this function is sufficient to make commands such as
3400 @code{forward-sexp}, @code{backward-sexp}, and @code{transpose-sexps} be
3401 able to properly handle structural elements other than just the paired
3402 parentheses already handled by syntax tables. For example, if the
3403 provided grammar is precise enough, @code{transpose-sexps} can correctly
3404 transpose the two arguments of a @code{+} operator, taking into account
3405 the precedence rules of the language.
3406
3407 Calling `smie-setup' is also sufficient to make TAB indentation work in
3408 the expected way, extends @code{blink-matching-paren} to apply to
3409 elements like @code{begin...end}, and provides some commands that you
3410 can bind in the major mode keymap.
3411
3412 @deffn Command smie-close-block
3413 This command closes the most recently opened (and not yet closed) block.
3414 @end deffn
3415
3416 @deffn Command smie-down-list &optional arg
3417 This command is like @code{down-list} but it also pays attention to
3418 nesting of tokens other than parentheses, such as @code{begin...end}.
3419 @end deffn
3420
3421 @node Operator Precedence Grammars
3422 @subsubsection Operator Precedence Grammars
3423
3424 SMIE's precedence grammars simply give to each token a pair of
3425 precedences: the left-precedence and the right-precedence. We say
3426 @code{T1 < T2} if the right-precedence of token @code{T1} is less than
3427 the left-precedence of token @code{T2}. A good way to read this
3428 @code{<} is as a kind of parenthesis: if we find @code{... T1 something
3429 T2 ...} then that should be parsed as @code{... T1 (something T2 ...}
3430 rather than as @code{... T1 something) T2 ...}. The latter
3431 interpretation would be the case if we had @code{T1 > T2}. If we have
3432 @code{T1 = T2}, it means that token T2 follows token T1 in the same
3433 syntactic construction, so typically we have @code{"begin" = "end"}.
3434 Such pairs of precedences are sufficient to express left-associativity
3435 or right-associativity of infix operators, nesting of tokens like
3436 parentheses and many other cases.
3437
3438 @c Let's leave this undocumented to leave it more open for change!
3439 @c @defvar smie-grammar
3440 @c The value of this variable is an alist specifying the left and right
3441 @c precedence of each token. It is meant to be initialized by using one of
3442 @c the functions below.
3443 @c @end defvar
3444
3445 @defun smie-prec2->grammar table
3446 This function takes a @emph{prec2} grammar @var{table} and returns an
3447 alist suitable for use in @code{smie-setup}. The @emph{prec2}
3448 @var{table} is itself meant to be built by one of the functions below.
3449 @end defun
3450
3451 @defun smie-merge-prec2s &rest tables
3452 This function takes several @emph{prec2} @var{tables} and merges them
3453 into a new @emph{prec2} table.
3454 @end defun
3455
3456 @defun smie-precs->prec2 precs
3457 This function builds a @emph{prec2} table from a table of precedences
3458 @var{precs}. @var{precs} should be a list, sorted by precedence (for
3459 example @code{"+"} will come before @code{"*"}), of elements of the form
3460 @code{(@var{assoc} @var{op} ...)}, where each @var{op} is a token that
3461 acts as an operator; @var{assoc} is their associativity, which can be
3462 either @code{left}, @code{right}, @code{assoc}, or @code{nonassoc}.
3463 All operators in a given element share the same precedence level
3464 and associativity.
3465 @end defun
3466
3467 @defun smie-bnf->prec2 bnf &rest resolvers
3468 This function lets you specify the grammar using a BNF notation.
3469 It accepts a @var{bnf} description of the grammar along with a set of
3470 conflict resolution rules @var{resolvers}, and
3471 returns a @emph{prec2} table.
3472
3473 @var{bnf} is a list of nonterminal definitions of the form
3474 @code{(@var{nonterm} @var{rhs1} @var{rhs2} ...)} where each @var{rhs}
3475 is a (non-empty) list of terminals (aka tokens) or non-terminals.
3476
3477 Not all grammars are accepted:
3478 @itemize
3479 @item
3480 An @var{rhs} cannot be an empty list (an empty list is never needed,
3481 since SMIE allows all non-terminals to match the empty string anyway).
3482 @item
3483 An @var{rhs} cannot have 2 consecutive non-terminals: each pair of
3484 non-terminals needs to be separated by a terminal (aka token).
3485 This is a fundamental limitation of operator precedence grammars.
3486 @end itemize
3487
3488 Additionally, conflicts can occur:
3489 @itemize
3490 @item
3491 The returned @emph{prec2} table holds constraints between pairs of tokens, and
3492 for any given pair only one constraint can be present: T1 < T2,
3493 T1 = T2, or T1 > T2.
3494 @item
3495 A token can be an @code{opener} (something similar to an open-paren),
3496 a @code{closer} (like a close-paren), or @code{neither} of the two
3497 (e.g. an infix operator, or an inner token like @code{"else"}).
3498 @end itemize
3499
3500 Precedence conflicts can be resolved via @var{resolvers}, which
3501 is a list of @emph{precs} tables (see @code{smie-precs->prec2}): for
3502 each precedence conflict, if those @code{precs} tables
3503 specify a particular constraint, then the conflict is resolved by using
3504 this constraint instead, else a conflict is reported and one of the
3505 conflicting constraints is picked arbitrarily and the others are
3506 simply ignored.
3507 @end defun
3508
3509 @node SMIE Grammar
3510 @subsubsection Defining the Grammar of a Language
3511
3512 The usual way to define the SMIE grammar of a language is by
3513 defining a new global variable that holds the precedence table by
3514 giving a set of BNF rules.
3515 For example, the grammar definition for a small Pascal-like language
3516 could look like:
3517 @example
3518 @group
3519 (require 'smie)
3520 (defvar sample-smie-grammar
3521 (smie-prec2->grammar
3522 (smie-bnf->prec2
3523 @end group
3524 @group
3525 '((id)
3526 (inst ("begin" insts "end")
3527 ("if" exp "then" inst "else" inst)
3528 (id ":=" exp)
3529 (exp))
3530 (insts (insts ";" insts) (inst))
3531 (exp (exp "+" exp)
3532 (exp "*" exp)
3533 ("(" exps ")"))
3534 (exps (exps "," exps) (exp)))
3535 @end group
3536 @group
3537 '((assoc ";"))
3538 '((assoc ","))
3539 '((assoc "+") (assoc "*")))))
3540 @end group
3541 @end example
3542
3543 @noindent
3544 A few things to note:
3545
3546 @itemize
3547 @item
3548 The above grammar does not explicitly mention the syntax of function
3549 calls: SMIE will automatically allow any sequence of sexps, such as
3550 identifiers, balanced parentheses, or @code{begin ... end} blocks
3551 to appear anywhere anyway.
3552 @item
3553 The grammar category @code{id} has no right hand side: this does not
3554 mean that it can match only the empty string, since as mentioned any
3555 sequence of sexps can appear anywhere anyway.
3556 @item
3557 Because non terminals cannot appear consecutively in the BNF grammar, it
3558 is difficult to correctly handle tokens that act as terminators, so the
3559 above grammar treats @code{";"} as a statement @emph{separator} instead,
3560 which SMIE can handle very well.
3561 @item
3562 Separators used in sequences (such as @code{","} and @code{";"} above)
3563 are best defined with BNF rules such as @code{(foo (foo "separator" foo) ...)}
3564 which generate precedence conflicts which are then resolved by giving
3565 them an explicit @code{(assoc "separator")}.
3566 @item
3567 The @code{("(" exps ")")} rule was not needed to pair up parens, since
3568 SMIE will pair up any characters that are marked as having paren syntax
3569 in the syntax table. What this rule does instead (together with the
3570 definition of @code{exps}) is to make it clear that @code{","} should
3571 not appear outside of parentheses.
3572 @item
3573 Rather than have a single @emph{precs} table to resolve conflicts, it is
3574 preferable to have several tables, so as to let the BNF part of the
3575 grammar specify relative precedences where possible.
3576 @item
3577 Unless there is a very good reason to prefer @code{left} or
3578 @code{right}, it is usually preferable to mark operators as associative,
3579 using @code{assoc}. For that reason @code{"+"} and @code{"*"} are
3580 defined above as @code{assoc}, although the language defines them
3581 formally as left associative.
3582 @end itemize
3583
3584 @node SMIE Lexer
3585 @subsubsection Defining Tokens
3586
3587 SMIE comes with a predefined lexical analyzer which uses syntax tables
3588 in the following way: any sequence of characters that have word or
3589 symbol syntax is considered a token, and so is any sequence of
3590 characters that have punctuation syntax. This default lexer is
3591 often a good starting point but is rarely actually correct for any given
3592 language. For example, it will consider @code{"2,+3"} to be composed
3593 of 3 tokens: @code{"2"}, @code{",+"}, and @code{"3"}.
3594
3595 To describe the lexing rules of your language to SMIE, you need
3596 2 functions, one to fetch the next token, and another to fetch the
3597 previous token. Those functions will usually first skip whitespace and
3598 comments and then look at the next chunk of text to see if it
3599 is a special token. If so it should skip the token and
3600 return a description of this token. Usually this is simply the string
3601 extracted from the buffer, but it can be anything you want.
3602 For example:
3603 @example
3604 @group
3605 (defvar sample-keywords-regexp
3606 (regexp-opt '("+" "*" "," ";" ">" ">=" "<" "<=" ":=" "=")))
3607 @end group
3608 @group
3609 (defun sample-smie-forward-token ()
3610 (forward-comment (point-max))
3611 (cond
3612 ((looking-at sample-keywords-regexp)
3613 (goto-char (match-end 0))
3614 (match-string-no-properties 0))
3615 (t (buffer-substring-no-properties
3616 (point)
3617 (progn (skip-syntax-forward "w_")
3618 (point))))))
3619 @end group
3620 @group
3621 (defun sample-smie-backward-token ()
3622 (forward-comment (- (point)))
3623 (cond
3624 ((looking-back sample-keywords-regexp (- (point) 2) t)
3625 (goto-char (match-beginning 0))
3626 (match-string-no-properties 0))
3627 (t (buffer-substring-no-properties
3628 (point)
3629 (progn (skip-syntax-backward "w_")
3630 (point))))))
3631 @end group
3632 @end example
3633
3634 Notice how those lexers return the empty string when in front of
3635 parentheses. This is because SMIE automatically takes care of the
3636 parentheses defined in the syntax table. More specifically if the lexer
3637 returns nil or an empty string, SMIE tries to handle the corresponding
3638 text as a sexp according to syntax tables.
3639
3640 @node SMIE Tricks
3641 @subsubsection Living With a Weak Parser
3642
3643 The parsing technique used by SMIE does not allow tokens to behave
3644 differently in different contexts. For most programming languages, this
3645 manifests itself by precedence conflicts when converting the
3646 BNF grammar.
3647
3648 Sometimes, those conflicts can be worked around by expressing the
3649 grammar slightly differently. For example, for Modula-2 it might seem
3650 natural to have a BNF grammar that looks like this:
3651
3652 @example
3653 ...
3654 (inst ("IF" exp "THEN" insts "ELSE" insts "END")
3655 ("CASE" exp "OF" cases "END")
3656 ...)
3657 (cases (cases "|" cases)
3658 (caselabel ":" insts)
3659 ("ELSE" insts))
3660 ...
3661 @end example
3662
3663 But this will create conflicts for @code{"ELSE"}: on the one hand, the
3664 IF rule implies (among many other things) that @code{"ELSE" = "END"};
3665 but on the other hand, since @code{"ELSE"} appears within @code{cases},
3666 which appears left of @code{"END"}, we also have @code{"ELSE" > "END"}.
3667 We can solve the conflict either by using:
3668 @example
3669 ...
3670 (inst ("IF" exp "THEN" insts "ELSE" insts "END")
3671 ("CASE" exp "OF" cases "END")
3672 ("CASE" exp "OF" cases "ELSE" insts "END")
3673 ...)
3674 (cases (cases "|" cases) (caselabel ":" insts))
3675 ...
3676 @end example
3677 or
3678 @example
3679 ...
3680 (inst ("IF" exp "THEN" else "END")
3681 ("CASE" exp "OF" cases "END")
3682 ...)
3683 (else (insts "ELSE" insts))
3684 (cases (cases "|" cases) (caselabel ":" insts) (else))
3685 ...
3686 @end example
3687
3688 Reworking the grammar to try and solve conflicts has its downsides, tho,
3689 because SMIE assumes that the grammar reflects the logical structure of
3690 the code, so it is preferable to keep the BNF closer to the intended
3691 abstract syntax tree.
3692
3693 Other times, after careful consideration you may conclude that those
3694 conflicts are not serious and simply resolve them via the
3695 @var{resolvers} argument of @code{smie-bnf->prec2}. Usually this is
3696 because the grammar is simply ambiguous: the conflict does not affect
3697 the set of programs described by the grammar, but only the way those
3698 programs are parsed. This is typically the case for separators and
3699 associative infix operators, where you want to add a resolver like
3700 @code{'((assoc "|"))}. Another case where this can happen is for the
3701 classic @emph{dangling else} problem, where you will use @code{'((assoc
3702 "else" "then"))}. It can also happen for cases where the conflict is
3703 real and cannot really be resolved, but it is unlikely to pose a problem
3704 in practice.
3705
3706 Finally, in many cases some conflicts will remain despite all efforts to
3707 restructure the grammar. Do not despair: while the parser cannot be
3708 made more clever, you can make the lexer as smart as you want. So, the
3709 solution is then to look at the tokens involved in the conflict and to
3710 split one of those tokens into 2 (or more) different tokens. E.g. if
3711 the grammar needs to distinguish between two incompatible uses of the
3712 token @code{"begin"}, make the lexer return different tokens (say
3713 @code{"begin-fun"} and @code{"begin-plain"}) depending on which kind of
3714 @code{"begin"} it finds. This pushes the work of distinguishing the
3715 different cases to the lexer, which will thus have to look at the
3716 surrounding text to find ad-hoc clues.
3717
3718 @node SMIE Indentation
3719 @subsubsection Specifying Indentation Rules
3720
3721 Based on the provided grammar, SMIE will be able to provide automatic
3722 indentation without any extra effort. But in practice, this default
3723 indentation style will probably not be good enough. You will want to
3724 tweak it in many different cases.
3725
3726 SMIE indentation is based on the idea that indentation rules should be
3727 as local as possible. To this end, it relies on the idea of
3728 @emph{virtual} indentation, which is the indentation that a particular
3729 program point would have if it were at the beginning of a line.
3730 Of course, if that program point is indeed at the beginning of a line,
3731 its virtual indentation is its current indentation. But if not, then
3732 SMIE uses the indentation algorithm to compute the virtual indentation
3733 of that point. Now in practice, the virtual indentation of a program
3734 point does not have to be identical to the indentation it would have if
3735 we inserted a newline before it. To see how this works, the SMIE rule
3736 for indentation after a @code{@{} in C does not care whether the
3737 @code{@{} is standing on a line of its own or is at the end of the
3738 preceding line. Instead, these different cases are handled in the
3739 indentation rule that decides how to indent before a @code{@{}.
3740
3741 Another important concept is the notion of @emph{parent}: The
3742 @emph{parent} of a token, is the head token of the nearest enclosing
3743 syntactic construct. For example, the parent of an @code{else} is the
3744 @code{if} to which it belongs, and the parent of an @code{if}, in turn,
3745 is the lead token of the surrounding construct. The command
3746 @code{backward-sexp} jumps from a token to its parent, but there are
3747 some caveats: for @emph{openers} (tokens which start a construct, like
3748 @code{if}), you need to start with point before the token, while for
3749 others you need to start with point after the token.
3750 @code{backward-sexp} stops with point before the parent token if that is
3751 the @emph{opener} of the token of interest, and otherwise it stops with
3752 point after the parent token.
3753
3754 SMIE indentation rules are specified using a function that takes two
3755 arguments @var{method} and @var{arg} where the meaning of @var{arg} and the
3756 expected return value depend on @var{method}.
3757
3758 @var{method} can be:
3759 @itemize
3760 @item
3761 @code{:after}, in which case @var{arg} is a token and the function
3762 should return the @var{offset} to use for indentation after @var{arg}.
3763 @item
3764 @code{:before}, in which case @var{arg} is a token and the function
3765 should return the @var{offset} to use to indent @var{arg} itself.
3766 @item
3767 @code{:elem}, in which case the function should return either the offset
3768 to use to indent function arguments (if @var{arg} is the symbol
3769 @code{arg}) or the basic indentation step (if @var{arg} is the symbol
3770 @code{basic}).
3771 @item
3772 @code{:list-intro}, in which case @var{arg} is a token and the function
3773 should return non-@code{nil} if the token is followed by a list of
3774 expressions (not separated by any token) rather than an expression.
3775 @end itemize
3776
3777 When @var{arg} is a token, the function is called with point just before
3778 that token. A return value of nil always means to fallback on the
3779 default behavior, so the function should return nil for arguments it
3780 does not expect.
3781
3782 @var{offset} can be:
3783 @itemize
3784 @item
3785 @code{nil}: use the default indentation rule.
3786 @item
3787 @code{(column . @var{column})}: indent to column @var{column}.
3788 @item
3789 @var{number}: offset by @var{number}, relative to a base token which is
3790 the current token for @code{:after} and its parent for @code{:before}.
3791 @end itemize
3792
3793 @node SMIE Indentation Helpers
3794 @subsubsection Helper Functions for Indentation Rules
3795
3796 SMIE provides various functions designed specifically for use in the
3797 indentation rules function (several of those functions break if used in
3798 another context). These functions all start with the prefix
3799 @code{smie-rule-}.
3800
3801 @defun smie-rule-bolp
3802 Return non-@code{nil} if the current token is the first on the line.
3803 @end defun
3804
3805 @defun smie-rule-hanging-p
3806 Return non-@code{nil} if the current token is @emph{hanging}.
3807 A token is @emph{hanging} if it is the last token on the line
3808 and if it is preceded by other tokens: a lone token on a line is not
3809 hanging.
3810 @end defun
3811
3812 @defun smie-rule-next-p &rest tokens
3813 Return non-@code{nil} if the next token is among @var{tokens}.
3814 @end defun
3815
3816 @defun smie-rule-prev-p &rest tokens
3817 Return non-@code{nil} if the previous token is among @var{tokens}.
3818 @end defun
3819
3820 @defun smie-rule-parent-p &rest parents
3821 Return non-@code{nil} if the current token's parent is among @var{parents}.
3822 @end defun
3823
3824 @defun smie-rule-sibling-p
3825 Return non-nil if the current token's parent is actually a sibling.
3826 This is the case for example when the parent of a @code{","} is just the
3827 previous @code{","}.
3828 @end defun
3829
3830 @defun smie-rule-parent &optional offset
3831 Return the proper offset to align the current token with the parent.
3832 If non-@code{nil}, @var{offset} should be an integer giving an
3833 additional offset to apply.
3834 @end defun
3835
3836 @defun smie-rule-separator method
3837 Indent current token as a @emph{separator}.
3838
3839 By @emph{separator}, we mean here a token whose sole purpose is to
3840 separate various elements within some enclosing syntactic construct, and
3841 which does not have any semantic significance in itself (i.e. it would
3842 typically not exist as a node in an abstract syntax tree).
3843
3844 Such a token is expected to have an associative syntax and be closely
3845 tied to its syntactic parent. Typical examples are @code{","} in lists
3846 of arguments (enclosed inside parentheses), or @code{";"} in sequences
3847 of instructions (enclosed in a @code{@{...@}} or @code{begin...end}
3848 block).
3849
3850 @var{method} should be the method name that was passed to
3851 `smie-rules-function'.
3852 @end defun
3853
3854 @node SMIE Indentation Example
3855 @subsubsection Sample Indentation Rules
3856
3857 Here is an example of an indentation function:
3858
3859 @example
3860 (defun sample-smie-rules (kind token)
3861 (pcase (cons kind token)
3862 (`(:elem . basic) sample-indent-basic)
3863 (`(,_ . ",") (smie-rule-separator kind))
3864 (`(:after . ":=") sample-indent-basic)
3865 (`(:before . ,(or `"begin" `"(" `"@{")))
3866 (if (smie-rule-hanging-p) (smie-rule-parent)))
3867 (`(:before . "if")
3868 (and (not (smie-rule-bolp)) (smie-rule-prev-p "else")
3869 (smie-rule-parent)))))
3870 @end example
3871
3872 @noindent
3873 A few things to note:
3874
3875 @itemize
3876 @item
3877 The first case indicates the basic indentation increment to use.
3878 If @code{sample-indent-basic} is nil, then SMIE uses the global
3879 setting @code{smie-indent-basic}. The major mode could have set
3880 @code{smie-indent-basic} buffer-locally instead, but that
3881 is discouraged.
3882
3883 @item
3884 The rule for the token @code{","} make SMIE try to be more clever when
3885 the comma separator is placed at the beginning of lines. It tries to
3886 outdent the separator so as to align the code after the comma; for
3887 example:
3888
3889 @example
3890 x = longfunctionname (
3891 arg1
3892 , arg2
3893 );
3894 @end example
3895
3896 @item
3897 The rule for indentation after @code{":="} exists because otherwise
3898 SMIE would treat @code{":="} as an infix operator and would align the
3899 right argument with the left one.
3900
3901 @item
3902 The rule for indentation before @code{"begin"} is an example of the use
3903 of virtual indentation: This rule is used only when @code{"begin"} is
3904 hanging, which can happen only when @code{"begin"} is not at the
3905 beginning of a line. So this is not used when indenting
3906 @code{"begin"} itself but only when indenting something relative to this
3907 @code{"begin"}. Concretely, this rule changes the indentation from:
3908
3909 @example
3910 if x > 0 then begin
3911 dosomething(x);
3912 end
3913 @end example
3914 to
3915 @example
3916 if x > 0 then begin
3917 dosomething(x);
3918 end
3919 @end example
3920
3921 @item
3922 The rule for indentation before @code{"if"} is similar to the one for
3923 @code{"begin"}, but where the purpose is to treat @code{"else if"}
3924 as a single unit, so as to align a sequence of tests rather than indent
3925 each test further to the right. This function does this only in the
3926 case where the @code{"if"} is not placed on a separate line, hence the
3927 @code{smie-rule-bolp} test.
3928
3929 If we know that the @code{"else"} is always aligned with its @code{"if"}
3930 and is always at the beginning of a line, we can use a more efficient
3931 rule:
3932 @example
3933 ((equal token "if")
3934 (and (not (smie-rule-bolp))
3935 (smie-rule-prev-p "else")
3936 (save-excursion
3937 (sample-smie-backward-token)
3938 (cons 'column (current-column)))))
3939 @end example
3940
3941 The advantage of this formulation is that it reuses the indentation of
3942 the previous @code{"else"}, rather than going all the way back to the
3943 first @code{"if"} of the sequence.
3944 @end itemize
3945
3946 @node Desktop Save Mode
3947 @section Desktop Save Mode
3948 @cindex desktop save mode
3949
3950 @dfn{Desktop Save Mode} is a feature to save the state of Emacs from
3951 one session to another. The user-level commands for using Desktop
3952 Save Mode are described in the GNU Emacs Manual (@pxref{Saving Emacs
3953 Sessions,,, emacs, the GNU Emacs Manual}). Modes whose buffers visit
3954 a file, don't have to do anything to use this feature.
3955
3956 For buffers not visiting a file to have their state saved, the major
3957 mode must bind the buffer local variable @code{desktop-save-buffer} to
3958 a non-@code{nil} value.
3959
3960 @defvar desktop-save-buffer
3961 If this buffer-local variable is non-@code{nil}, the buffer will have
3962 its state saved in the desktop file at desktop save. If the value is
3963 a function, it is called at desktop save with argument
3964 @var{desktop-dirname}, and its value is saved in the desktop file along
3965 with the state of the buffer for which it was called. When file names
3966 are returned as part of the auxiliary information, they should be
3967 formatted using the call
3968
3969 @example
3970 (desktop-file-name @var{file-name} @var{desktop-dirname})
3971 @end example
3972
3973 @end defvar
3974
3975 For buffers not visiting a file to be restored, the major mode must
3976 define a function to do the job, and that function must be listed in
3977 the alist @code{desktop-buffer-mode-handlers}.
3978
3979 @defvar desktop-buffer-mode-handlers
3980 Alist with elements
3981
3982 @example
3983 (@var{major-mode} . @var{restore-buffer-function})
3984 @end example
3985
3986 The function @var{restore-buffer-function} will be called with
3987 argument list
3988
3989 @example
3990 (@var{buffer-file-name} @var{buffer-name} @var{desktop-buffer-misc})
3991 @end example
3992
3993 and it should return the restored buffer.
3994 Here @var{desktop-buffer-misc} is the value returned by the function
3995 optionally bound to @code{desktop-save-buffer}.
3996 @end defvar
3997
3998 @ignore
3999 Local Variables:
4000 fill-column: 72
4001 End:
4002 @end ignore