Merge from emacs-23; up to 2010-06-11T21:26:13Z!lekktu@gmail.com.
[bpt/emacs.git] / doc / lispref / symbols.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990-1995, 1998-1999, 2001-2011 Free Software Foundation, Inc.
4 @c See the file elisp.texi for copying conditions.
5 @setfilename ../../info/symbols
6 @node Symbols, Evaluation, Hash Tables, Top
7 @chapter Symbols
8 @cindex symbol
9
10 A @dfn{symbol} is an object with a unique name. This chapter
11 describes symbols, their components, their property lists, and how they
12 are created and interned. Separate chapters describe the use of symbols
13 as variables and as function names; see @ref{Variables}, and
14 @ref{Functions}. For the precise read syntax for symbols, see
15 @ref{Symbol Type}.
16
17 You can test whether an arbitrary Lisp object is a symbol
18 with @code{symbolp}:
19
20 @defun symbolp object
21 This function returns @code{t} if @var{object} is a symbol, @code{nil}
22 otherwise.
23 @end defun
24
25 @menu
26 * Symbol Components:: Symbols have names, values, function definitions
27 and property lists.
28 * Definitions:: A definition says how a symbol will be used.
29 * Creating Symbols:: How symbols are kept unique.
30 * Property Lists:: Each symbol has a property list
31 for recording miscellaneous information.
32 @end menu
33
34 @node Symbol Components, Definitions, Symbols, Symbols
35 @section Symbol Components
36 @cindex symbol components
37
38 Each symbol has four components (or ``cells''), each of which
39 references another object:
40
41 @table @asis
42 @item Print name
43 @cindex print name cell
44 The @dfn{print name cell} holds a string that names the symbol for
45 reading and printing. See @code{symbol-name} in @ref{Creating Symbols}.
46
47 @item Value
48 @cindex value cell
49 The @dfn{value cell} holds the current value of the symbol as a
50 variable. When a symbol is used as a form, the value of the form is the
51 contents of the symbol's value cell. See @code{symbol-value} in
52 @ref{Accessing Variables}.
53
54 @item Function
55 @cindex function cell
56 The @dfn{function cell} holds the function definition of the symbol.
57 When a symbol is used as a function, its function definition is used in
58 its place. This cell is also used to make a symbol stand for a keymap
59 or a keyboard macro, for editor command execution. Because each symbol
60 has separate value and function cells, variables names and function names do
61 not conflict. See @code{symbol-function} in @ref{Function Cells}.
62
63 @item Property list
64 @cindex property list cell
65 The @dfn{property list cell} holds the property list of the symbol. See
66 @code{symbol-plist} in @ref{Property Lists}.
67 @end table
68
69 The print name cell always holds a string, and cannot be changed. The
70 other three cells can be set individually to any specified Lisp object.
71
72 The print name cell holds the string that is the name of the symbol.
73 Since symbols are represented textually by their names, it is important
74 not to have two symbols with the same name. The Lisp reader ensures
75 this: every time it reads a symbol, it looks for an existing symbol with
76 the specified name before it creates a new one. (In GNU Emacs Lisp,
77 this lookup uses a hashing algorithm and an obarray; see @ref{Creating
78 Symbols}.)
79
80 The value cell holds the symbol's value as a variable
81 (@pxref{Variables}). That is what you get if you evaluate the symbol as
82 a Lisp expression (@pxref{Evaluation}). Any Lisp object is a legitimate
83 value. Certain symbols have values that cannot be changed; these
84 include @code{nil} and @code{t}, and any symbol whose name starts with
85 @samp{:} (those are called @dfn{keywords}). @xref{Constant Variables}.
86
87 We often refer to ``the function @code{foo}'' when we really mean
88 the function stored in the function cell of the symbol @code{foo}. We
89 make the distinction explicit only when necessary. In normal
90 usage, the function cell usually contains a function
91 (@pxref{Functions}) or a macro (@pxref{Macros}), as that is what the
92 Lisp interpreter expects to see there (@pxref{Evaluation}). Keyboard
93 macros (@pxref{Keyboard Macros}), keymaps (@pxref{Keymaps}) and
94 autoload objects (@pxref{Autoloading}) are also sometimes stored in
95 the function cells of symbols.
96
97 The property list cell normally should hold a correctly formatted
98 property list (@pxref{Property Lists}), as a number of functions expect
99 to see a property list there.
100
101 The function cell or the value cell may be @dfn{void}, which means
102 that the cell does not reference any object. (This is not the same
103 thing as holding the symbol @code{void}, nor the same as holding the
104 symbol @code{nil}.) Examining a function or value cell that is void
105 results in an error, such as @samp{Symbol's value as variable is void}.
106
107 The four functions @code{symbol-name}, @code{symbol-value},
108 @code{symbol-plist}, and @code{symbol-function} return the contents of
109 the four cells of a symbol. Here as an example we show the contents of
110 the four cells of the symbol @code{buffer-file-name}:
111
112 @example
113 (symbol-name 'buffer-file-name)
114 @result{} "buffer-file-name"
115 (symbol-value 'buffer-file-name)
116 @result{} "/gnu/elisp/symbols.texi"
117 (symbol-function 'buffer-file-name)
118 @result{} #<subr buffer-file-name>
119 (symbol-plist 'buffer-file-name)
120 @result{} (variable-documentation 29529)
121 @end example
122
123 @noindent
124 Because this symbol is the variable which holds the name of the file
125 being visited in the current buffer, the value cell contents we see are
126 the name of the source file of this chapter of the Emacs Lisp Manual.
127 The property list cell contains the list @code{(variable-documentation
128 29529)} which tells the documentation functions where to find the
129 documentation string for the variable @code{buffer-file-name} in the
130 @file{DOC-@var{version}} file. (29529 is the offset from the beginning
131 of the @file{DOC-@var{version}} file to where that documentation string
132 begins---see @ref{Documentation Basics}.) The function cell contains
133 the function for returning the name of the file.
134 @code{buffer-file-name} names a primitive function, which has no read
135 syntax and prints in hash notation (@pxref{Primitive Function Type}). A
136 symbol naming a function written in Lisp would have a lambda expression
137 (or a byte-code object) in this cell.
138
139 @node Definitions, Creating Symbols, Symbol Components, Symbols
140 @section Defining Symbols
141 @cindex definitions of symbols
142
143 A @dfn{definition} in Lisp is a special form that announces your
144 intention to use a certain symbol in a particular way. In Emacs Lisp,
145 you can define a symbol as a variable, or define it as a function (or
146 macro), or both independently.
147
148 A definition construct typically specifies a value or meaning for the
149 symbol for one kind of use, plus documentation for its meaning when used
150 in this way. Thus, when you define a symbol as a variable, you can
151 supply an initial value for the variable, plus documentation for the
152 variable.
153
154 @code{defvar} and @code{defconst} are special forms that define a
155 symbol as a global variable. They are documented in detail in
156 @ref{Defining Variables}. For defining user option variables that can
157 be customized, use @code{defcustom} (@pxref{Customization}).
158
159 @code{defun} defines a symbol as a function, creating a lambda
160 expression and storing it in the function cell of the symbol. This
161 lambda expression thus becomes the function definition of the symbol.
162 (The term ``function definition,'' meaning the contents of the function
163 cell, is derived from the idea that @code{defun} gives the symbol its
164 definition as a function.) @code{defsubst} and @code{defalias} are two
165 other ways of defining a function. @xref{Functions}.
166
167 @code{defmacro} defines a symbol as a macro. It creates a macro
168 object and stores it in the function cell of the symbol. Note that a
169 given symbol can be a macro or a function, but not both at once, because
170 both macro and function definitions are kept in the function cell, and
171 that cell can hold only one Lisp object at any given time.
172 @xref{Macros}.
173
174 In Emacs Lisp, a definition is not required in order to use a symbol
175 as a variable or function. Thus, you can make a symbol a global
176 variable with @code{setq}, whether you define it first or not. The real
177 purpose of definitions is to guide programmers and programming tools.
178 They inform programmers who read the code that certain symbols are
179 @emph{intended} to be used as variables, or as functions. In addition,
180 utilities such as @file{etags} and @file{make-docfile} recognize
181 definitions, and add appropriate information to tag tables and the
182 @file{DOC-@var{version}} file. @xref{Accessing Documentation}.
183
184 @node Creating Symbols, Property Lists, Definitions, Symbols
185 @section Creating and Interning Symbols
186 @cindex reading symbols
187
188 To understand how symbols are created in GNU Emacs Lisp, you must know
189 how Lisp reads them. Lisp must ensure that it finds the same symbol
190 every time it reads the same set of characters. Failure to do so would
191 cause complete confusion.
192
193 @cindex symbol name hashing
194 @cindex hashing
195 @cindex obarray
196 @cindex bucket (in obarray)
197 When the Lisp reader encounters a symbol, it reads all the characters
198 of the name. Then it ``hashes'' those characters to find an index in a
199 table called an @dfn{obarray}. Hashing is an efficient method of
200 looking something up. For example, instead of searching a telephone
201 book cover to cover when looking up Jan Jones, you start with the J's
202 and go from there. That is a simple version of hashing. Each element
203 of the obarray is a @dfn{bucket} which holds all the symbols with a
204 given hash code; to look for a given name, it is sufficient to look
205 through all the symbols in the bucket for that name's hash code. (The
206 same idea is used for general Emacs hash tables, but they are a
207 different data type; see @ref{Hash Tables}.)
208
209 @cindex interning
210 If a symbol with the desired name is found, the reader uses that
211 symbol. If the obarray does not contain a symbol with that name, the
212 reader makes a new symbol and adds it to the obarray. Finding or adding
213 a symbol with a certain name is called @dfn{interning} it, and the
214 symbol is then called an @dfn{interned symbol}.
215
216 Interning ensures that each obarray has just one symbol with any
217 particular name. Other like-named symbols may exist, but not in the
218 same obarray. Thus, the reader gets the same symbols for the same
219 names, as long as you keep reading with the same obarray.
220
221 Interning usually happens automatically in the reader, but sometimes
222 other programs need to do it. For example, after the @kbd{M-x} command
223 obtains the command name as a string using the minibuffer, it then
224 interns the string, to get the interned symbol with that name.
225
226 @cindex symbol equality
227 @cindex uninterned symbol
228 No obarray contains all symbols; in fact, some symbols are not in any
229 obarray. They are called @dfn{uninterned symbols}. An uninterned
230 symbol has the same four cells as other symbols; however, the only way
231 to gain access to it is by finding it in some other object or as the
232 value of a variable.
233
234 Creating an uninterned symbol is useful in generating Lisp code,
235 because an uninterned symbol used as a variable in the code you generate
236 cannot clash with any variables used in other Lisp programs.
237
238 In Emacs Lisp, an obarray is actually a vector. Each element of the
239 vector is a bucket; its value is either an interned symbol whose name
240 hashes to that bucket, or 0 if the bucket is empty. Each interned
241 symbol has an internal link (invisible to the user) to the next symbol
242 in the bucket. Because these links are invisible, there is no way to
243 find all the symbols in an obarray except using @code{mapatoms} (below).
244 The order of symbols in a bucket is not significant.
245
246 In an empty obarray, every element is 0, so you can create an obarray
247 with @code{(make-vector @var{length} 0)}. @strong{This is the only
248 valid way to create an obarray.} Prime numbers as lengths tend
249 to result in good hashing; lengths one less than a power of two are also
250 good.
251
252 @strong{Do not try to put symbols in an obarray yourself.} This does
253 not work---only @code{intern} can enter a symbol in an obarray properly.
254
255 @cindex CL note---symbol in obarrays
256 @quotation
257 @b{Common Lisp note:} In Common Lisp, a single symbol may be interned in
258 several obarrays.
259 @end quotation
260
261 Most of the functions below take a name and sometimes an obarray as
262 arguments. A @code{wrong-type-argument} error is signaled if the name
263 is not a string, or if the obarray is not a vector.
264
265 @defun symbol-name symbol
266 This function returns the string that is @var{symbol}'s name. For example:
267
268 @example
269 @group
270 (symbol-name 'foo)
271 @result{} "foo"
272 @end group
273 @end example
274
275 @strong{Warning:} Changing the string by substituting characters does
276 change the name of the symbol, but fails to update the obarray, so don't
277 do it!
278 @end defun
279
280 @defun make-symbol name
281 This function returns a newly-allocated, uninterned symbol whose name is
282 @var{name} (which must be a string). Its value and function definition
283 are void, and its property list is @code{nil}. In the example below,
284 the value of @code{sym} is not @code{eq} to @code{foo} because it is a
285 distinct uninterned symbol whose name is also @samp{foo}.
286
287 @example
288 (setq sym (make-symbol "foo"))
289 @result{} foo
290 (eq sym 'foo)
291 @result{} nil
292 @end example
293 @end defun
294
295 @defun intern name &optional obarray
296 This function returns the interned symbol whose name is @var{name}. If
297 there is no such symbol in the obarray @var{obarray}, @code{intern}
298 creates a new one, adds it to the obarray, and returns it. If
299 @var{obarray} is omitted, the value of the global variable
300 @code{obarray} is used.
301
302 @example
303 (setq sym (intern "foo"))
304 @result{} foo
305 (eq sym 'foo)
306 @result{} t
307
308 (setq sym1 (intern "foo" other-obarray))
309 @result{} foo
310 (eq sym1 'foo)
311 @result{} nil
312 @end example
313 @end defun
314
315 @cindex CL note---interning existing symbol
316 @quotation
317 @b{Common Lisp note:} In Common Lisp, you can intern an existing symbol
318 in an obarray. In Emacs Lisp, you cannot do this, because the argument
319 to @code{intern} must be a string, not a symbol.
320 @end quotation
321
322 @defun intern-soft name &optional obarray
323 This function returns the symbol in @var{obarray} whose name is
324 @var{name}, or @code{nil} if @var{obarray} has no symbol with that name.
325 Therefore, you can use @code{intern-soft} to test whether a symbol with
326 a given name is already interned. If @var{obarray} is omitted, the
327 value of the global variable @code{obarray} is used.
328
329 The argument @var{name} may also be a symbol; in that case,
330 the function returns @var{name} if @var{name} is interned
331 in the specified obarray, and otherwise @code{nil}.
332
333 @smallexample
334 (intern-soft "frazzle") ; @r{No such symbol exists.}
335 @result{} nil
336 (make-symbol "frazzle") ; @r{Create an uninterned one.}
337 @result{} frazzle
338 @group
339 (intern-soft "frazzle") ; @r{That one cannot be found.}
340 @result{} nil
341 @end group
342 @group
343 (setq sym (intern "frazzle")) ; @r{Create an interned one.}
344 @result{} frazzle
345 @end group
346 @group
347 (intern-soft "frazzle") ; @r{That one can be found!}
348 @result{} frazzle
349 @end group
350 @group
351 (eq sym 'frazzle) ; @r{And it is the same one.}
352 @result{} t
353 @end group
354 @end smallexample
355 @end defun
356
357 @defvar obarray
358 This variable is the standard obarray for use by @code{intern} and
359 @code{read}.
360 @end defvar
361
362 @defun mapatoms function &optional obarray
363 @anchor{Definition of mapatoms}
364 This function calls @var{function} once with each symbol in the obarray
365 @var{obarray}. Then it returns @code{nil}. If @var{obarray} is
366 omitted, it defaults to the value of @code{obarray}, the standard
367 obarray for ordinary symbols.
368
369 @smallexample
370 (setq count 0)
371 @result{} 0
372 (defun count-syms (s)
373 (setq count (1+ count)))
374 @result{} count-syms
375 (mapatoms 'count-syms)
376 @result{} nil
377 count
378 @result{} 1871
379 @end smallexample
380
381 See @code{documentation} in @ref{Accessing Documentation}, for another
382 example using @code{mapatoms}.
383 @end defun
384
385 @defun unintern symbol obarray
386 This function deletes @var{symbol} from the obarray @var{obarray}. If
387 @code{symbol} is not actually in the obarray, @code{unintern} does
388 nothing. If @var{obarray} is @code{nil}, the current obarray is used.
389
390 If you provide a string instead of a symbol as @var{symbol}, it stands
391 for a symbol name. Then @code{unintern} deletes the symbol (if any) in
392 the obarray which has that name. If there is no such symbol,
393 @code{unintern} does nothing.
394
395 If @code{unintern} does delete a symbol, it returns @code{t}. Otherwise
396 it returns @code{nil}.
397 @end defun
398
399 @node Property Lists,, Creating Symbols, Symbols
400 @section Property Lists
401 @cindex property list
402 @cindex plist
403
404 A @dfn{property list} (@dfn{plist} for short) is a list of paired
405 elements. Each of the pairs associates a property name (usually a
406 symbol) with a property or value.
407
408 Every symbol has a cell that stores a property list (@pxref{Symbol
409 Components}). This property list is used to record information about
410 the symbol, such as its variable documentation and the name of the
411 file where it was defined.
412
413 Property lists can also be used in other contexts. For instance,
414 you can assign property lists to character positions in a string or
415 buffer. @xref{Text Properties}.
416
417 The property names and values in a property list can be any Lisp
418 objects, but the names are usually symbols. Property list functions
419 compare the property names using @code{eq}. Here is an example of a
420 property list, found on the symbol @code{progn} when the compiler is
421 loaded:
422
423 @example
424 (lisp-indent-function 0 byte-compile byte-compile-progn)
425 @end example
426
427 @noindent
428 Here @code{lisp-indent-function} and @code{byte-compile} are property
429 names, and the other two elements are the corresponding values.
430
431 @menu
432 * Plists and Alists:: Comparison of the advantages of property
433 lists and association lists.
434 * Symbol Plists:: Functions to access symbols' property lists.
435 * Other Plists:: Accessing property lists stored elsewhere.
436 @end menu
437
438 @node Plists and Alists
439 @subsection Property Lists and Association Lists
440 @cindex plist vs. alist
441 @cindex alist vs. plist
442
443 @cindex property lists vs association lists
444 Association lists (@pxref{Association Lists}) are very similar to
445 property lists. In contrast to association lists, the order of the
446 pairs in the property list is not significant since the property names
447 must be distinct.
448
449 Property lists are better than association lists for attaching
450 information to various Lisp function names or variables. If your
451 program keeps all of its associations in one association list, it will
452 typically need to search that entire list each time it checks for an
453 association. This could be slow. By contrast, if you keep the same
454 information in the property lists of the function names or variables
455 themselves, each search will scan only the length of one property list,
456 which is usually short. This is why the documentation for a variable is
457 recorded in a property named @code{variable-documentation}. The byte
458 compiler likewise uses properties to record those functions needing
459 special treatment.
460
461 However, association lists have their own advantages. Depending on
462 your application, it may be faster to add an association to the front of
463 an association list than to update a property. All properties for a
464 symbol are stored in the same property list, so there is a possibility
465 of a conflict between different uses of a property name. (For this
466 reason, it is a good idea to choose property names that are probably
467 unique, such as by beginning the property name with the program's usual
468 name-prefix for variables and functions.) An association list may be
469 used like a stack where associations are pushed on the front of the list
470 and later discarded; this is not possible with a property list.
471
472 @node Symbol Plists
473 @subsection Property List Functions for Symbols
474
475 @defun symbol-plist symbol
476 This function returns the property list of @var{symbol}.
477 @end defun
478
479 @defun setplist symbol plist
480 This function sets @var{symbol}'s property list to @var{plist}.
481 Normally, @var{plist} should be a well-formed property list, but this is
482 not enforced. The return value is @var{plist}.
483
484 @smallexample
485 (setplist 'foo '(a 1 b (2 3) c nil))
486 @result{} (a 1 b (2 3) c nil)
487 (symbol-plist 'foo)
488 @result{} (a 1 b (2 3) c nil)
489 @end smallexample
490
491 For symbols in special obarrays, which are not used for ordinary
492 purposes, it may make sense to use the property list cell in a
493 nonstandard fashion; in fact, the abbrev mechanism does so
494 (@pxref{Abbrevs}).
495 @end defun
496
497 @defun get symbol property
498 This function finds the value of the property named @var{property} in
499 @var{symbol}'s property list. If there is no such property, @code{nil}
500 is returned. Thus, there is no distinction between a value of
501 @code{nil} and the absence of the property.
502
503 The name @var{property} is compared with the existing property names
504 using @code{eq}, so any object is a legitimate property.
505
506 See @code{put} for an example.
507 @end defun
508
509 @defun put symbol property value
510 This function puts @var{value} onto @var{symbol}'s property list under
511 the property name @var{property}, replacing any previous property value.
512 The @code{put} function returns @var{value}.
513
514 @smallexample
515 (put 'fly 'verb 'transitive)
516 @result{}'transitive
517 (put 'fly 'noun '(a buzzing little bug))
518 @result{} (a buzzing little bug)
519 (get 'fly 'verb)
520 @result{} transitive
521 (symbol-plist 'fly)
522 @result{} (verb transitive noun (a buzzing little bug))
523 @end smallexample
524 @end defun
525
526 @node Other Plists
527 @subsection Property Lists Outside Symbols
528
529 These functions are useful for manipulating property lists
530 that are stored in places other than symbols:
531
532 @defun plist-get plist property
533 This returns the value of the @var{property} property stored in the
534 property list @var{plist}. It accepts a malformed @var{plist}
535 argument. If @var{property} is not found in the @var{plist}, it
536 returns @code{nil}. For example,
537
538 @example
539 (plist-get '(foo 4) 'foo)
540 @result{} 4
541 (plist-get '(foo 4 bad) 'foo)
542 @result{} 4
543 (plist-get '(foo 4 bad) 'bad)
544 @result{} @code{nil}
545 (plist-get '(foo 4 bad) 'bar)
546 @result{} nil
547 @end example
548 @end defun
549
550 @defun plist-put plist property value
551 This stores @var{value} as the value of the @var{property} property in
552 the property list @var{plist}. It may modify @var{plist} destructively,
553 or it may construct a new list structure without altering the old. The
554 function returns the modified property list, so you can store that back
555 in the place where you got @var{plist}. For example,
556
557 @example
558 (setq my-plist '(bar t foo 4))
559 @result{} (bar t foo 4)
560 (setq my-plist (plist-put my-plist 'foo 69))
561 @result{} (bar t foo 69)
562 (setq my-plist (plist-put my-plist 'quux '(a)))
563 @result{} (bar t foo 69 quux (a))
564 @end example
565 @end defun
566
567 You could define @code{put} in terms of @code{plist-put} as follows:
568
569 @example
570 (defun put (symbol prop value)
571 (setplist symbol
572 (plist-put (symbol-plist symbol) prop value)))
573 @end example
574
575 @defun lax-plist-get plist property
576 Like @code{plist-get} except that it compares properties
577 using @code{equal} instead of @code{eq}.
578 @end defun
579
580 @defun lax-plist-put plist property value
581 Like @code{plist-put} except that it compares properties
582 using @code{equal} instead of @code{eq}.
583 @end defun
584
585 @defun plist-member plist property
586 This returns non-@code{nil} if @var{plist} contains the given
587 @var{property}. Unlike @code{plist-get}, this allows you to distinguish
588 between a missing property and a property with the value @code{nil}.
589 The value is actually the tail of @var{plist} whose @code{car} is
590 @var{property}.
591 @end defun