Reposition @anchor's.
[bpt/emacs.git] / lispref / hash.texi
CommitLineData
7d15d35d
RS
1@c -*-texinfo-*-
2@c This is part of the GNU Emacs Lisp Reference Manual.
816c421e 3@c Copyright (C) 1999, 2003 Free Software Foundation, Inc.
7d15d35d
RS
4@c See the file elisp.texi for copying conditions.
5@setfilename ../info/hash
6@node Hash Tables, Symbols, Sequences Arrays Vectors, Top
7@chapter Hash Tables
8@cindex hash tables
9
10 A hash table is a very fast kind of lookup table, somewhat like
11an alist in that it maps keys to corresponding values. It differs
12from an alist in these ways:
13
14@itemize @bullet
15@item
1107d4ca
DL
16Lookup in a hash table is extremely fast for large tables---in fact, the
17time required is essentially @emph{independent} of how many elements are
18stored in the table. For smaller tables (a few tens of elements)
19alists may still be faster because hash tables have a more-or-less
20constant overhead.
7d15d35d
RS
21
22@item
23The correspondences in a hash table are in no particular order.
24
25@item
26There is no way to share structure between two hash tables,
27the way two alists can share a common tail.
28@end itemize
29
30 Emacs Lisp (starting with Emacs 21) provides a general-purpose hash
31table data type, along with a series of functions for operating on them.
32Hash tables have no read syntax, and print in hash notation, like this:
33
34@example
35(make-hash-table)
36 @result{} #<hash-table 'eql nil 0/65 0x83af980>
37@end example
38
00510a6b
GM
39@noindent
40(The term ``hash notation'' refers to the initial @samp{#}
41character---@pxref{Printed Representation}---and has nothing to do with
42the term ``hash table.'')
43
7d15d35d
RS
44 Obarrays are also a kind of hash table, but they are a different type
45of object and are used only for recording interned symbols
46(@pxref{Creating Symbols}).
47
48@menu
49* Creating Hash::
50* Hash Access::
51* Defining Hash::
52* Other Hash::
53@end menu
54
55@node Creating Hash
56@section Creating Hash Tables
57
58 The principal function for creating a hash table is
59@code{make-hash-table}.
60
61@tindex make-hash-table
62@defun make-hash-table &rest keyword-args
63This function creates a new hash table according to the specified
64arguments. The arguments should consist of alternating keywords
65(particular symbols recognized specially) and values corresponding to
66them.
67
68Several keywords make sense in @code{make-hash-table}, but the only two
711331aa 69that you really need to know about are @code{:test} and @code{:weakness}.
7d15d35d
RS
70
71@table @code
72@item :test @var{test}
73This specifies the method of key lookup for this hash table. The
74default is @code{eql}; @code{eq} and @code{equal} are other
75alternatives:
76
77@table @code
78@item eql
b02495f1
LT
79Keys which are numbers are ``the same'' if they are @code{equal}, that
80is, if they are equal in value and either both are integers or both
e57d0aa8
LT
81are floating point numbers; otherwise, two distinct objects are never
82``the same''.
7d15d35d
RS
83
84@item eq
85Any two distinct Lisp objects are ``different'' as keys.
86
87@item equal
88Two Lisp objects are ``the same'', as keys, if they are equal
89according to @code{equal}.
90@end table
91
92You can use @code{define-hash-table-test} (@pxref{Defining Hash}) to
93define additional possibilities for @var{test}.
94
95@item :weakness @var{weak}
96The weakness of a hash table specifies whether the presence of a key or
97value in the hash table preserves it from garbage collection.
98
99The value, @var{weak}, must be one of @code{nil}, @code{key},
18925e78 100@code{value}, @code{key-or-value}, @code{key-and-value}, or @code{t}
1c673658 101which is an alias for @code{key-and-value}. If @var{weak} is @code{key}
18925e78
GM
102then the hash table does not prevent its keys from being collected as
103garbage (if they are not referenced anywhere else); if a particular key
104does get collected, the corresponding association is removed from the
105hash table.
106
107If @var{weak} is @code{value}, then the hash table does not prevent
108values from being collected as garbage (if they are not referenced
109anywhere else); if a particular value does get collected, the
7d15d35d
RS
110corresponding association is removed from the hash table.
111
2c6d3eef
RS
112If @var{weak} is @code{key-and-value} or @code{t}, both the key and
113the value must be live in order to preserve the association. Thus,
114the hash table does not protect either keys or values from garbage
115collection; if either one is collected as garbage, that removes the
116association.
a9749dab 117
2c6d3eef
RS
118If @var{weak} is @code{key-or-value}, either the key or
119the value can preserve the association. Thus, associations are
120removed from the hash table when both their key and value would be
121collected as garbage (if not for references from weak hash tables).
18925e78 122
7d15d35d 123The default for @var{weak} is @code{nil}, so that all keys and values
2c6d3eef 124referenced in the hash table are preserved from garbage collection.
7d15d35d
RS
125
126@item :size @var{size}
127This specifies a hint for how many associations you plan to store in the
128hash table. If you know the approximate number, you can make things a
711331aa 129little more efficient by specifying it this way. If you specify too
7d15d35d 130small a size, the hash table will grow automatically when necessary, but
00510a6b 131doing that takes some extra time.
7d15d35d
RS
132
133The default size is 65.
134
135@item :rehash-size @var{rehash-size}
136When you add an association to a hash table and the table is ``full,''
137it grows automatically. This value specifies how to make the hash table
138larger, at that time.
139
a40d4712
PR
140If @var{rehash-size} is an integer, it should be positive, and the hash
141table grows by adding that much to the nominal size. If
142@var{rehash-size} is a floating point number, it had better be greater
143than 1, and the hash table grows by multiplying the old size by that
144number.
7d15d35d
RS
145
146The default value is 1.5.
147
148@item :rehash-threshold @var{threshold}
149This specifies the criterion for when the hash table is ``full.'' The
150value, @var{threshold}, should be a positive floating point number, no
151greater than 1. The hash table is ``full'' whenever the actual number of
152entries exceeds this fraction of the nominal size. The default for
153@var{threshold} is 0.8.
154@end table
155@end defun
156
157@tindex makehash
158@defun makehash &optional test
159This is equivalent to @code{make-hash-table}, but with a different style
160argument list. The argument @var{test} specifies the method
161of key lookup.
162
b02495f1 163This function is obsolete. Use @code{make-hash-table} instead.
7d15d35d
RS
164@end defun
165
166@node Hash Access
167@section Hash Table Access
168
169 This section describes the functions for accessing and storing
170associations in a hash table.
171
172@tindex gethash
173@defun gethash key table &optional default
174This function looks up @var{key} in @var{table}, and returns its
175associated @var{value}---or @var{default}, if @var{key} has no
176association in @var{table}.
177@end defun
178
179@tindex puthash
177c0ea7 180@defun puthash key value table
7d15d35d
RS
181This function enters an association for @var{key} in @var{table}, with
182value @var{value}. If @var{key} already has an association in
183@var{table}, @var{value} replaces the old associated value.
184@end defun
185
186@tindex remhash
187@defun remhash key table
188This function removes the association for @var{key} from @var{table}, if
189there is one. If @var{key} has no association, @code{remhash} does
190nothing.
b02495f1
LT
191
192@b{Common Lisp note:} In Common Lisp, @code{remhash} returns
193non-@code{nil} if it actually removed an association and @code{nil}
194otherwise. In Emacs Lisp, @code{remhash} always returns @code{nil}.
7d15d35d
RS
195@end defun
196
197@tindex clrhash
198@defun clrhash table
199This function removes all the associations from hash table @var{table},
200so that it becomes empty. This is also called @dfn{clearing} the hash
201table.
b02495f1
LT
202
203@b{Common Lisp note:} In Common Lisp, @code{clrhash} returns the empty
204@var{table}. In Emacs Lisp, it returns @code{nil}.
7d15d35d
RS
205@end defun
206
207@tindex maphash
208@defun maphash function table
7baeca0c 209@anchor{Definition of maphash}
7d15d35d
RS
210This function calls @var{function} once for each of the associations in
211@var{table}. The function @var{function} should accept two
212arguments---a @var{key} listed in @var{table}, and its associated
213@var{value}.
214@end defun
215
216@node Defining Hash
217@section Defining Hash Comparisons
218@cindex hash code
219
220 You can define new methods of key lookup by means of
221@code{define-hash-table-test}. In order to use this feature, you need
222to understand how hash tables work, and what a @dfn{hash code} means.
223
224 You can think of a hash table conceptually as a large array of many
225slots, each capable of holding one association. To look up a key,
226@code{gethash} first computes an integer, the hash code, from the key.
227It reduces this integer modulo the length of the array, to produce an
228index in the array. Then it looks in that slot, and if necessary in
229other nearby slots, to see if it has found the key being sought.
230
231 Thus, to define a new method of key lookup, you need to specify both a
232function to compute the hash code from a key, and a function to compare
233two keys directly.
234
235@tindex define-hash-table-test
236@defun define-hash-table-test name test-fn hash-fn
237This function defines a new hash table test, named @var{name}.
238
239After defining @var{name} in this way, you can use it as the @var{test}
240argument in @code{make-hash-table}. When you do that, the hash table
241will use @var{test-fn} to compare key values, and @var{hash-fn} to compute
242a ``hash code'' from a key value.
243
244The function @var{test-fn} should accept two arguments, two keys, and
245return non-@code{nil} if they are considered ``the same.''
246
247The function @var{hash-fn} should accept one argument, a key, and return
248an integer that is the ``hash code'' of that key. For good results, the
249function should use the whole range of integer values for hash codes,
250including negative integers.
251
252The specified functions are stored in the property list of @var{name}
253under the property @code{hash-table-test}; the property value's form is
254@code{(@var{test-fn} @var{hash-fn})}.
a9749dab
RS
255@end defun
256
257@tindex sxhash
258@defun sxhash obj
259This function returns a hash code for Lisp object @var{obj}.
260This is an integer which reflects the contents of @var{obj}
261and the other Lisp objects it points to.
262
263If two objects @var{obj1} and @var{obj2} are equal, then @code{(sxhash
264@var{obj1})} and @code{(sxhash @var{obj2})} are the same integer.
265
266If the two objects are not equal, the values returned by @code{sxhash}
b02495f1
LT
267are usually different, but not always; once in a rare while, by luck,
268you will encounter two distinct-looking objects that give the same
a9749dab
RS
269result from @code{sxhash}.
270@end defun
7d15d35d 271
a9749dab 272 This example creates a hash table whose keys are strings that are
7d15d35d
RS
273compared case-insensitively.
274
275@example
276(defun case-fold-string= (a b)
277 (compare-strings a nil nil b nil nil t))
278
279(defun case-fold-string-hash (a)
280 (sxhash (upcase a)))
281
177c0ea7 282(define-hash-table-test 'case-fold 'case-fold-string=
7d15d35d
RS
283 'case-fold-string-hash))
284
285(make-hash-table :test 'case-fold)
286@end example
7d15d35d 287
a9749dab
RS
288 Here is how you could define a hash table test equivalent to the
289predefined test value @code{equal}. The keys can be any Lisp object,
290and equal-looking objects are considered the same key.
7d15d35d 291
a9749dab
RS
292@example
293(define-hash-table-test 'contents-hash 'equal 'sxhash)
7d15d35d 294
a9749dab
RS
295(make-hash-table :test 'contents-hash)
296@end example
7d15d35d
RS
297
298@node Other Hash
299@section Other Hash Table Functions
300
301 Here are some other functions for working with hash tables.
302
303@tindex hash-table-p
304@defun hash-table-p table
305This returns non-@code{nil} if @var{table} is a hash table object.
306@end defun
307
308@tindex copy-hash-table
309@defun copy-hash-table table
310This function creates and returns a copy of @var{table}. Only the table
311itself is copied---the keys and values are shared.
312@end defun
313
314@tindex hash-table-count
315@defun hash-table-count table
316This function returns the actual number of entries in @var{table}.
317@end defun
318
711331aa
GM
319@tindex hash-table-test
320@defun hash-table-test table
a40d4712
PR
321This returns the @var{test} value that was given when @var{table} was
322created, to specify how to hash and compare keys. See
7d15d35d
RS
323@code{make-hash-table} (@pxref{Creating Hash}).
324@end defun
325
326@tindex hash-table-weakness
327@defun hash-table-weakness table
328This function returns the @var{weak} value that was specified for hash
329table @var{table}.
330@end defun
331
332@tindex hash-table-rehash-size
333@defun hash-table-rehash-size table
334This returns the rehash size of @var{table}.
335@end defun
336
337@tindex hash-table-rehash-threshold
338@defun hash-table-rehash-threshold table
339This returns the rehash threshold of @var{table}.
340@end defun
341
711331aa
GM
342@tindex hash-table-size
343@defun hash-table-size table
7d15d35d
RS
344This returns the current nominal size of @var{table}.
345@end defun
ab5796a9
MB
346
347@ignore
348 arch-tag: 3b5107f9-d2f0-47d5-ad61-3498496bea0e
349@end ignore