(Top): Update node listing.
[bpt/emacs.git] / doc / lispref / hash.texi
CommitLineData
b8d4c8d0
GM
1@c -*-texinfo-*-
2@c This is part of the GNU Emacs Lisp Reference Manual.
3@c Copyright (C) 1999, 2001, 2002, 2003, 2004, 2005,
6ed161e1 4@c 2006, 2007, 2008, 2009 Free Software Foundation, Inc.
b8d4c8d0 5@c See the file elisp.texi for copying conditions.
6336d8c3 6@setfilename ../../info/hash
b8d4c8d0
GM
7@node Hash Tables, Symbols, Sequences Arrays Vectors, Top
8@chapter Hash Tables
9@cindex hash tables
10@cindex lookup tables
11
12 A hash table is a very fast kind of lookup table, somewhat like an
13alist (@pxref{Association Lists}) in that it maps keys to
14corresponding values. It differs from an alist in these ways:
15
16@itemize @bullet
17@item
18Lookup in a hash table is extremely fast for large tables---in fact, the
19time required is essentially @emph{independent} of how many elements are
20stored in the table. For smaller tables (a few tens of elements)
21alists may still be faster because hash tables have a more-or-less
22constant overhead.
23
24@item
25The correspondences in a hash table are in no particular order.
26
27@item
28There is no way to share structure between two hash tables,
29the way two alists can share a common tail.
30@end itemize
31
32 Emacs Lisp provides a general-purpose hash table data type, along
33with a series of functions for operating on them. Hash tables have no
34read syntax, and print in hash notation, like this:
35
36@example
37(make-hash-table)
38 @result{} #<hash-table 'eql nil 0/65 0x83af980>
39@end example
40
41@noindent
42(The term ``hash notation'' refers to the initial @samp{#}
43character---@pxref{Printed Representation}---and has nothing to do with
44the term ``hash table.'')
45
46 Obarrays are also a kind of hash table, but they are a different type
47of object and are used only for recording interned symbols
48(@pxref{Creating Symbols}).
49
50@menu
51* Creating Hash:: Functions to create hash tables.
52* Hash Access:: Reading and writing the hash table contents.
53* Defining Hash:: Defining new comparison methods
54* Other Hash:: Miscellaneous.
55@end menu
56
57@node Creating Hash
58@section Creating Hash Tables
59@cindex creating hash tables
60
61 The principal function for creating a hash table is
62@code{make-hash-table}.
63
64@defun make-hash-table &rest keyword-args
65This function creates a new hash table according to the specified
66arguments. The arguments should consist of alternating keywords
67(particular symbols recognized specially) and values corresponding to
68them.
69
70Several keywords make sense in @code{make-hash-table}, but the only two
71that you really need to know about are @code{:test} and @code{:weakness}.
72
73@table @code
74@item :test @var{test}
75This specifies the method of key lookup for this hash table. The
76default is @code{eql}; @code{eq} and @code{equal} are other
77alternatives:
78
79@table @code
80@item eql
81Keys which are numbers are ``the same'' if they are @code{equal}, that
82is, if they are equal in value and either both are integers or both
83are floating point numbers; otherwise, two distinct objects are never
84``the same.''
85
86@item eq
87Any two distinct Lisp objects are ``different'' as keys.
88
89@item equal
90Two Lisp objects are ``the same,'' as keys, if they are equal
91according to @code{equal}.
92@end table
93
94You can use @code{define-hash-table-test} (@pxref{Defining Hash}) to
95define additional possibilities for @var{test}.
96
97@item :weakness @var{weak}
98The weakness of a hash table specifies whether the presence of a key or
99value in the hash table preserves it from garbage collection.
100
101The value, @var{weak}, must be one of @code{nil}, @code{key},
102@code{value}, @code{key-or-value}, @code{key-and-value}, or @code{t}
103which is an alias for @code{key-and-value}. If @var{weak} is @code{key}
104then the hash table does not prevent its keys from being collected as
105garbage (if they are not referenced anywhere else); if a particular key
106does get collected, the corresponding association is removed from the
107hash table.
108
109If @var{weak} is @code{value}, then the hash table does not prevent
110values from being collected as garbage (if they are not referenced
111anywhere else); if a particular value does get collected, the
112corresponding association is removed from the hash table.
113
114If @var{weak} is @code{key-and-value} or @code{t}, both the key and
115the value must be live in order to preserve the association. Thus,
116the hash table does not protect either keys or values from garbage
117collection; if either one is collected as garbage, that removes the
118association.
119
120If @var{weak} is @code{key-or-value}, either the key or
121the value can preserve the association. Thus, associations are
122removed from the hash table when both their key and value would be
123collected as garbage (if not for references from weak hash tables).
124
125The default for @var{weak} is @code{nil}, so that all keys and values
126referenced in the hash table are preserved from garbage collection.
127
128@item :size @var{size}
129This specifies a hint for how many associations you plan to store in the
130hash table. If you know the approximate number, you can make things a
131little more efficient by specifying it this way. If you specify too
132small a size, the hash table will grow automatically when necessary, but
133doing that takes some extra time.
134
135The default size is 65.
136
137@item :rehash-size @var{rehash-size}
138When you add an association to a hash table and the table is ``full,''
139it grows automatically. This value specifies how to make the hash table
140larger, at that time.
141
142If @var{rehash-size} is an integer, it should be positive, and the hash
143table grows by adding that much to the nominal size. If
144@var{rehash-size} is a floating point number, it had better be greater
145than 1, and the hash table grows by multiplying the old size by that
146number.
147
148The default value is 1.5.
149
150@item :rehash-threshold @var{threshold}
151This specifies the criterion for when the hash table is ``full'' (so
152it should be made larger). The value, @var{threshold}, should be a
153positive floating point number, no greater than 1. The hash table is
154``full'' whenever the actual number of entries exceeds this fraction
155of the nominal size. The default for @var{threshold} is 0.8.
156@end table
157@end defun
158
159@defun makehash &optional test
160This is equivalent to @code{make-hash-table}, but with a different style
161argument list. The argument @var{test} specifies the method
162of key lookup.
163
164This function is obsolete. Use @code{make-hash-table} instead.
165@end defun
166
167@node Hash Access
168@section Hash Table Access
169
170 This section describes the functions for accessing and storing
171associations in a hash table. In general, any Lisp object can be used
172as a hash key, unless the comparison method imposes limits. Any Lisp
173object can also be used as the value.
174
175@defun gethash key table &optional default
176This function looks up @var{key} in @var{table}, and returns its
177associated @var{value}---or @var{default}, if @var{key} has no
178association in @var{table}.
179@end defun
180
181@defun puthash key value table
182This function enters an association for @var{key} in @var{table}, with
183value @var{value}. If @var{key} already has an association in
184@var{table}, @var{value} replaces the old associated value.
185@end defun
186
187@defun remhash key table
188This function removes the association for @var{key} from @var{table}, if
189there is one. If @var{key} has no association, @code{remhash} does
190nothing.
191
192@b{Common Lisp note:} In Common Lisp, @code{remhash} returns
193non-@code{nil} if it actually removed an association and @code{nil}
194otherwise. In Emacs Lisp, @code{remhash} always returns @code{nil}.
195@end defun
196
197@defun clrhash table
198This function removes all the associations from hash table @var{table},
199so that it becomes empty. This is also called @dfn{clearing} the hash
200table.
201
202@b{Common Lisp note:} In Common Lisp, @code{clrhash} returns the empty
203@var{table}. In Emacs Lisp, it returns @code{nil}.
204@end defun
205
206@defun maphash function table
207@anchor{Definition of maphash}
208This function calls @var{function} once for each of the associations in
209@var{table}. The function @var{function} should accept two
210arguments---a @var{key} listed in @var{table}, and its associated
211@var{value}. @code{maphash} returns @code{nil}.
212@end defun
213
214@node Defining Hash
215@section Defining Hash Comparisons
216@cindex hash code
217@cindex define hash comparisons
218
219 You can define new methods of key lookup by means of
220@code{define-hash-table-test}. In order to use this feature, you need
221to understand how hash tables work, and what a @dfn{hash code} means.
222
223 You can think of a hash table conceptually as a large array of many
224slots, each capable of holding one association. To look up a key,
225@code{gethash} first computes an integer, the hash code, from the key.
226It reduces this integer modulo the length of the array, to produce an
227index in the array. Then it looks in that slot, and if necessary in
228other nearby slots, to see if it has found the key being sought.
229
230 Thus, to define a new method of key lookup, you need to specify both a
231function to compute the hash code from a key, and a function to compare
232two keys directly.
233
234@defun define-hash-table-test name test-fn hash-fn
235This function defines a new hash table test, named @var{name}.
236
237After defining @var{name} in this way, you can use it as the @var{test}
238argument in @code{make-hash-table}. When you do that, the hash table
239will use @var{test-fn} to compare key values, and @var{hash-fn} to compute
240a ``hash code'' from a key value.
241
242The function @var{test-fn} should accept two arguments, two keys, and
243return non-@code{nil} if they are considered ``the same.''
244
245The function @var{hash-fn} should accept one argument, a key, and return
246an integer that is the ``hash code'' of that key. For good results, the
247function should use the whole range of integer values for hash codes,
248including negative integers.
249
250The specified functions are stored in the property list of @var{name}
251under the property @code{hash-table-test}; the property value's form is
252@code{(@var{test-fn} @var{hash-fn})}.
253@end defun
254
255@defun sxhash obj
256This function returns a hash code for Lisp object @var{obj}.
257This is an integer which reflects the contents of @var{obj}
258and the other Lisp objects it points to.
259
260If two objects @var{obj1} and @var{obj2} are equal, then @code{(sxhash
261@var{obj1})} and @code{(sxhash @var{obj2})} are the same integer.
262
263If the two objects are not equal, the values returned by @code{sxhash}
264are usually different, but not always; once in a rare while, by luck,
265you will encounter two distinct-looking objects that give the same
266result from @code{sxhash}.
267@end defun
268
269 This example creates a hash table whose keys are strings that are
270compared case-insensitively.
271
272@example
273(defun case-fold-string= (a b)
274 (compare-strings a nil nil b nil nil t))
275(defun case-fold-string-hash (a)
276 (sxhash (upcase a)))
277
278(define-hash-table-test 'case-fold
279 'case-fold-string= 'case-fold-string-hash)
280
281(make-hash-table :test 'case-fold)
282@end example
283
284 Here is how you could define a hash table test equivalent to the
285predefined test value @code{equal}. The keys can be any Lisp object,
286and equal-looking objects are considered the same key.
287
288@example
289(define-hash-table-test 'contents-hash 'equal 'sxhash)
290
291(make-hash-table :test 'contents-hash)
292@end example
293
294@node Other Hash
295@section Other Hash Table Functions
296
297 Here are some other functions for working with hash tables.
298
299@defun hash-table-p table
300This returns non-@code{nil} if @var{table} is a hash table object.
301@end defun
302
303@defun copy-hash-table table
304This function creates and returns a copy of @var{table}. Only the table
305itself is copied---the keys and values are shared.
306@end defun
307
308@defun hash-table-count table
309This function returns the actual number of entries in @var{table}.
310@end defun
311
312@defun hash-table-test table
313This returns the @var{test} value that was given when @var{table} was
314created, to specify how to hash and compare keys. See
315@code{make-hash-table} (@pxref{Creating Hash}).
316@end defun
317
318@defun hash-table-weakness table
319This function returns the @var{weak} value that was specified for hash
320table @var{table}.
321@end defun
322
323@defun hash-table-rehash-size table
324This returns the rehash size of @var{table}.
325@end defun
326
327@defun hash-table-rehash-threshold table
328This returns the rehash threshold of @var{table}.
329@end defun
330
331@defun hash-table-size table
332This returns the current nominal size of @var{table}.
333@end defun
334
335@ignore
336 arch-tag: 3b5107f9-d2f0-47d5-ad61-3498496bea0e
337@end ignore