entered into RCS
[bpt/emacs.git] / lispref / symbols.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990, 1991, 1992, 1993, 1994 Free Software Foundation, Inc.
4 @c See the file elisp.texi for copying conditions.
5 @setfilename ../info/symbols
6 @node Symbols, Evaluation, Sequences Arrays Vectors, Top
7 @chapter Symbols
8 @cindex symbol
9
10 A @dfn{symbol} is an object with a unique name. This chapter
11 describes symbols, their components, their property lists, and how they
12 are created and interned. Separate chapters describe the use of symbols
13 as variables and as function names; see @ref{Variables}, and
14 @ref{Functions}. For the precise read syntax for symbols, see
15 @ref{Symbol Type}.
16
17 You can test whether an arbitrary Lisp object is a symbol
18 with @code{symbolp}:
19
20 @defun symbolp object
21 This function returns @code{t} if @var{object} is a symbol, @code{nil}
22 otherwise.
23 @end defun
24
25 @menu
26 * Symbol Components:: Symbols have names, values, function definitions
27 and property lists.
28 * Definitions:: A definition says how a symbol will be used.
29 * Creating Symbols:: How symbols are kept unique.
30 * Property Lists:: Each symbol has a property list
31 for recording miscellaneous information.
32 @end menu
33
34 @node Symbol Components, Definitions, Symbols, Symbols
35 @section Symbol Components
36 @cindex symbol components
37
38 Each symbol has four components (or ``cells''), each of which
39 references another object:
40
41 @table @asis
42 @item Print name
43 @cindex print name cell
44 The @dfn{print name cell} holds a string that names the symbol for
45 reading and printing. See @code{symbol-name} in @ref{Creating Symbols}.
46
47 @item Value
48 @cindex value cell
49 The @dfn{value cell} holds the current value of the symbol as a
50 variable. When a symbol is used as a form, the value of the form is the
51 contents of the symbol's value cell. See @code{symbol-value} in
52 @ref{Accessing Variables}.
53
54 @item Function
55 @cindex function cell
56 The @dfn{function cell} holds the function definition of the symbol.
57 When a symbol is used as a function, its function definition is used in
58 its place. This cell is also used to make a symbol stand for a keymap
59 or a keyboard macro, for editor command execution. Because each symbol
60 has separate value and function cells, variables and function names do
61 not conflict. See @code{symbol-function} in @ref{Function Cells}.
62
63 @item Property list
64 @cindex property list cell
65 The @dfn{property list cell} holds the property list of the symbol. See
66 @code{symbol-plist} in @ref{Property Lists}.
67 @end table
68
69 The print name cell always holds a string, and cannot be changed. The
70 other three cells can be set individually to any specified Lisp object.
71
72 The print name cell holds the string that is the name of the symbol.
73 Since symbols are represented textually by their names, it is important
74 not to have two symbols with the same name. The Lisp reader ensures
75 this: every time it reads a symbol, it looks for an existing symbol with
76 the specified name before it creates a new one. (In GNU Emacs Lisp,
77 this lookup uses a hashing algorithm and an obarray; see @ref{Creating
78 Symbols}.)
79
80 In normal usage, the function cell usually contains a function or
81 macro, as that is what the Lisp interpreter expects to see there
82 (@pxref{Evaluation}). Keyboard macros (@pxref{Keyboard Macros}),
83 keymaps (@pxref{Keymaps}) and autoload objects (@pxref{Autoloading}) are
84 also sometimes stored in the function cell of symbols. We often refer
85 to ``the function @code{foo}'' when we really mean the function stored
86 in the function cell of the symbol @code{foo}. We make the distinction
87 only when necessary.
88
89 The property list cell normally should hold a correctly formatted
90 property list (@pxref{Property Lists}), as a number of functions expect
91 to see a property list there.
92
93 The function cell or the value cell may be @dfn{void}, which means
94 that the cell does not reference any object. (This is not the same
95 thing as holding the symbol @code{void}, nor the same as holding the
96 symbol @code{nil}.) Examining a cell that is void results in an error,
97 such as @samp{Symbol's value as variable is void}.
98
99 The four functions @code{symbol-name}, @code{symbol-value},
100 @code{symbol-plist}, and @code{symbol-function} return the contents of
101 the four cells of a symbol. Here as an example we show the contents of
102 the four cells of the symbol @code{buffer-file-name}:
103
104 @example
105 (symbol-name 'buffer-file-name)
106 @result{} "buffer-file-name"
107 (symbol-value 'buffer-file-name)
108 @result{} "/gnu/elisp/symbols.texi"
109 (symbol-plist 'buffer-file-name)
110 @result{} (variable-documentation 29529)
111 (symbol-function 'buffer-file-name)
112 @result{} #<subr buffer-file-name>
113 @end example
114
115 @noindent
116 Because this symbol is the variable which holds the name of the file
117 being visited in the current buffer, the value cell contents we see are
118 the name of the source file of this chapter of the Emacs Lisp Manual.
119 The property list cell contains the list @code{(variable-documentation
120 29529)} which tells the documentation functions where to find the
121 documentation string for the variable @code{buffer-file-name} in the
122 @file{DOC} file. (29529 is the offset from the beginning of the
123 @file{DOC} file to where that documentation string begins.) The
124 function cell contains the function for returning the name of the file.
125 @code{buffer-file-name} names a primitive function, which has no read
126 syntax and prints in hash notation (@pxref{Primitive Function Type}). A
127 symbol naming a function written in Lisp would have a lambda expression
128 (or a byte-code object) in this cell.
129
130 @node Definitions, Creating Symbols, Symbol Components, Symbols
131 @section Defining Symbols
132 @cindex definition of a symbol
133
134 A @dfn{definition} in Lisp is a special form that announces your
135 intention to use a certain symbol in a particular way. In Emacs Lisp,
136 you can define a symbol as a variable, or define it as a function (or
137 macro), or both independently.
138
139 A definition construct typically specifies a value or meaning for the
140 symbol for one kind of use, plus documentation for its meaning when used
141 in this way. Thus, when you define a symbol as a variable, you can
142 supply an initial value for the variable, plus documentation for the
143 variable.
144
145 @code{defvar} and @code{defconst} are special forms that define a
146 symbol as a global variable. They are documented in detail in
147 @ref{Defining Variables}.
148
149 @code{defun} defines a symbol as a function, creating a lambda
150 expression and storing it in the function cell of the symbol. This
151 lambda expression thus becomes the function definition of the symbol.
152 (The term ``function definition'', meaning the contents of the function
153 cell, is derived from the idea that @code{defun} gives the symbol its
154 definition as a function.) @xref{Functions}.
155
156 @code{defmacro} defines a symbol as a macro. It creates a macro
157 object and stores it in the function cell of the symbol. Note that a
158 given symbol can be a macro or a function, but not both at once, because
159 both macro and function definitions are kept in the function cell, and
160 that cell can hold only one Lisp object at any given time.
161 @xref{Macros}.
162
163 In GNU Emacs Lisp, a definition is not required in order to use a
164 symbol as a variable or function. Thus, you can make a symbol a global
165 variable with @code{setq}, whether you define it first or not. The real
166 purpose of definitions is to guide programmers and programming tools.
167 They inform programmers who read the code that certain symbols are
168 @emph{intended} to be used as variables, or as functions. In addition,
169 utilities such as @file{etags} and @file{make-docfile} recognize
170 definitions, and add appropriate information to tag tables and the
171 @file{emacs/etc/DOC-@var{version}} file. @xref{Accessing Documentation}.
172
173 @node Creating Symbols, Property Lists, Definitions, Symbols
174 @section Creating and Interning Symbols
175 @cindex reading symbols
176
177 To understand how symbols are created in GNU Emacs Lisp, you must know
178 how Lisp reads them. Lisp must ensure that it finds the same symbol
179 every time it reads the same set of characters. Failure to do so would
180 cause complete confusion.
181
182 @cindex symbol name hashing
183 @cindex hashing
184 @cindex obarray
185 @cindex bucket (in obarray)
186 When the Lisp reader encounters a symbol, it reads all the characters
187 of the name. Then it ``hashes'' those characters to find an index in a
188 table called an @dfn{obarray}. Hashing is an efficient method of
189 looking something up. For example, instead of searching a telephone
190 book cover to cover when looking up Jan Jones, you start with the J's
191 and go from there. That is a simple version of hashing. Each element
192 of the obarray is a @dfn{bucket} which holds all the symbols with a
193 given hash code; to look for a given name, it is sufficient to look
194 through all the symbols in the bucket for that name's hash code.
195
196 @cindex interning
197 If a symbol with the desired name is found, the reader uses that
198 symbol. If the obarray does not contain a symbol with that name, the
199 reader makes a new symbol and adds it to the obarray. Finding or adding
200 a symbol with a certain name is called @dfn{interning} it, and the
201 symbol is then called an @dfn{interned symbol}.
202
203 Interning ensures that each obarray has just one symbol with any
204 particular name. Other like-named symbols may exist, but not in the
205 same obarray. Thus, the reader gets the same symbols for the same
206 names, as long as you keep reading with the same obarray.
207
208 @cindex symbol equality
209 @cindex uninterned symbol
210 No obarray contains all symbols; in fact, some symbols are not in any
211 obarray. They are called @dfn{uninterned symbols}. An uninterned
212 symbol has the same four cells as other symbols; however, the only way
213 to gain access to it is by finding it in some other object or as the
214 value of a variable.
215
216 In Emacs Lisp, an obarray is actually a vector. Each element of the
217 vector is a bucket; its value is either an interned symbol whose name
218 hashes to that bucket, or 0 if the bucket is empty. Each interned
219 symbol has an internal link (invisible to the user) to the next symbol
220 in the bucket. Because these links are invisible, there is no way to
221 find all the symbols in an obarray except using @code{mapatoms} (below).
222 The order of symbols in a bucket is not significant.
223
224 In an empty obarray, every element is 0, and you can create an obarray
225 with @code{(make-vector @var{length} 0)}. @strong{This is the only
226 valid way to create an obarray.} Prime numbers as lengths tend
227 to result in good hashing; lengths one less than a power of two are also
228 good.
229
230 @strong{Do not try to put symbols in an obarray yourself.} This does
231 not work---only @code{intern} can enter a symbol in an obarray properly.
232 @strong{Do not try to intern one symbol in two obarrays.} This would
233 garble both obarrays, because a symbol has just one slot to hold the
234 following symbol in the obarray bucket. The results would be
235 unpredictable.
236
237 It is possible for two different symbols to have the same name in
238 different obarrays; these symbols are not @code{eq} or @code{equal}.
239 However, this normally happens only as part of the abbrev mechanism
240 (@pxref{Abbrevs}).
241
242 @cindex CL note---symbol in obarrays
243 @quotation
244 @b{Common Lisp note:} In Common Lisp, a single symbol may be interned in
245 several obarrays.
246 @end quotation
247
248 Most of the functions below take a name and sometimes an obarray as
249 arguments. A @code{wrong-type-argument} error is signaled if the name
250 is not a string, or if the obarray is not a vector.
251
252 @defun symbol-name symbol
253 This function returns the string that is @var{symbol}'s name. For example:
254
255 @example
256 @group
257 (symbol-name 'foo)
258 @result{} "foo"
259 @end group
260 @end example
261
262 Changing the string by substituting characters, etc, does change the
263 name of the symbol, but fails to update the obarray, so don't do it!
264 @end defun
265
266 @defun make-symbol name
267 This function returns a newly-allocated, uninterned symbol whose name is
268 @var{name} (which must be a string). Its value and function definition
269 are void, and its property list is @code{nil}. In the example below,
270 the value of @code{sym} is not @code{eq} to @code{foo} because it is a
271 distinct uninterned symbol whose name is also @samp{foo}.
272
273 @example
274 (setq sym (make-symbol "foo"))
275 @result{} foo
276 (eq sym 'foo)
277 @result{} nil
278 @end example
279 @end defun
280
281 @defun intern name &optional obarray
282 This function returns the interned symbol whose name is @var{name}. If
283 there is no such symbol in the obarray @var{obarray}, @code{intern}
284 creates a new one, adds it to the obarray, and returns it. If
285 @var{obarray} is omitted, the value of the global variable
286 @code{obarray} is used.
287
288 @example
289 (setq sym (intern "foo"))
290 @result{} foo
291 (eq sym 'foo)
292 @result{} t
293
294 (setq sym1 (intern "foo" other-obarray))
295 @result{} foo
296 (eq sym 'foo)
297 @result{} nil
298 @end example
299 @end defun
300
301 @defun intern-soft name &optional obarray
302 This function returns the symbol in @var{obarray} whose name is
303 @var{name}, or @code{nil} if @var{obarray} has no symbol with that name.
304 Therefore, you can use @code{intern-soft} to test whether a symbol with
305 a given name is already interned. If @var{obarray} is omitted, the
306 value of the global variable @code{obarray} is used.
307
308 @smallexample
309 (intern-soft "frazzle") ; @r{No such symbol exists.}
310 @result{} nil
311 (make-symbol "frazzle") ; @r{Create an uninterned one.}
312 @result{} frazzle
313 (intern-soft "frazzle") ; @r{That one cannot be found.}
314 @result{} nil
315 (setq sym (intern "frazzle")) ; @r{Create an interned one.}
316 @result{} frazzle
317 (intern-soft "frazzle") ; @r{That one can be found!}
318 @result{} frazzle
319 @group
320 (eq sym 'frazzle) ; @r{And it is the same one.}
321 @result{} t
322 @end group
323 @end smallexample
324 @end defun
325
326 @defvar obarray
327 This variable is the standard obarray for use by @code{intern} and
328 @code{read}.
329 @end defvar
330
331 @defun mapatoms function &optional obarray
332 This function calls @var{function} for each symbol in the obarray
333 @var{obarray}. It returns @code{nil}. If @var{obarray} is omitted, it
334 defaults to the value of @code{obarray}, the standard obarray for
335 ordinary symbols.
336
337 @smallexample
338 (setq count 0)
339 @result{} 0
340 (defun count-syms (s)
341 (setq count (1+ count)))
342 @result{} count-syms
343 (mapatoms 'count-syms)
344 @result{} nil
345 count
346 @result{} 1871
347 @end smallexample
348
349 See @code{documentation} in @ref{Accessing Documentation}, for another
350 example using @code{mapatoms}.
351 @end defun
352
353 @node Property Lists,, Creating Symbols, Symbols
354 @section Property Lists
355 @cindex property list
356 @cindex plist
357
358 A @dfn{property list} (@dfn{plist} for short) is a list of paired
359 elements stored in the property list cell of a symbol. Each of the
360 pairs associates a property name (usually a symbol) with a property or
361 value. Property lists are generally used to record information about a
362 symbol, such as its documentation as a variable, the name of the file
363 where it was defined, or perhaps even the grammatical class of the
364 symbol (representing a word) in a language-understanding system.
365
366 Character positions in a string or buffer can also have property lists.
367 @xref{Text Properties}.
368
369 The property names and values in a property list can be any Lisp
370 objects, but the names are usually symbols. They are compared using
371 @code{eq}. Here is an example of a property list, found on the symbol
372 @code{progn} when the compiler is loaded:
373
374 @example
375 (lisp-indent-function 0 byte-compile byte-compile-progn)
376 @end example
377
378 @noindent
379 Here @code{lisp-indent-function} and @code{byte-compile} are property
380 names, and the other two elements are the corresponding values.
381
382 @cindex property lists vs association lists
383 Association lists (@pxref{Association Lists}) are very similar to
384 property lists. In contrast to association lists, the order of the
385 pairs in the property list is not significant since the property names
386 must be distinct.
387
388 Property lists are better than association lists for attaching
389 information to various Lisp function names or variables. If all the
390 associations are recorded in one association list, the program will need
391 to search that entire list each time a function or variable is to be
392 operated on. By contrast, if the information is recorded in the
393 property lists of the function names or variables themselves, each
394 search will scan only the length of one property list, which is usually
395 short. This is why the documentation for a variable is recorded in a
396 property named @code{variable-documentation}. The byte compiler
397 likewise uses properties to record those functions needing special
398 treatment.
399
400 However, association lists have their own advantages. Depending on
401 your application, it may be faster to add an association to the front of
402 an association list than to update a property. All properties for a
403 symbol are stored in the same property list, so there is a possibility
404 of a conflict between different uses of a property name. (For this
405 reason, it is a good idea to choose property names that are probably
406 unique, such as by including the name of the library in the property
407 name.) An association list may be used like a stack where associations
408 are pushed on the front of the list and later discarded; this is not
409 possible with a property list.
410
411 @defun symbol-plist symbol
412 This function returns the property list of @var{symbol}.
413 @end defun
414
415 @defun setplist symbol plist
416 This function sets @var{symbol}'s property list to @var{plist}.
417 Normally, @var{plist} should be a well-formed property list, but this is
418 not enforced.
419
420 @smallexample
421 (setplist 'foo '(a 1 b (2 3) c nil))
422 @result{} (a 1 b (2 3) c nil)
423 (symbol-plist 'foo)
424 @result{} (a 1 b (2 3) c nil)
425 @end smallexample
426
427 For symbols in special obarrays, which are not used for ordinary
428 purposes, it may make sense to use the property list cell in a
429 nonstandard fashion; in fact, the abbrev mechanism does so
430 (@pxref{Abbrevs}).
431 @end defun
432
433 @defun get symbol property
434 This function finds the value of the property named @var{property} in
435 @var{symbol}'s property list. If there is no such property, @code{nil}
436 is returned. Thus, there is no distinction between a value of
437 @code{nil} and the absence of the property.
438
439 The name @var{property} is compared with the existing property names
440 using @code{eq}, so any object is a legitimate property.
441
442 See @code{put} for an example.
443 @end defun
444
445 @defun put symbol property value
446 This function puts @var{value} onto @var{symbol}'s property list under
447 the property name @var{property}, replacing any previous property value.
448 The @code{put} function returns @var{value}.
449
450 @smallexample
451 (put 'fly 'verb 'transitive)
452 @result{}'transitive
453 (put 'fly 'noun '(a buzzing little bug))
454 @result{} (a buzzing little bug)
455 (get 'fly 'verb)
456 @result{} transitive
457 (symbol-plist 'fly)
458 @result{} (verb transitive noun (a buzzing little bug))
459 @end smallexample
460 @end defun