Added scm_is_complex, scm_is_number, scm_c_make_rectangular,
[bpt/guile.git] / doc / ref / api-data.texi
CommitLineData
07d83abe
MV
1@c -*-texinfo-*-
2@c This is part of the GNU Guile Reference Manual.
3@c Copyright (C) 1996, 1997, 2000, 2001, 2002, 2003, 2004
4@c Free Software Foundation, Inc.
5@c See the file guile.texi for copying conditions.
6
7@page
8@node Simple Data Types
9@section Simple Generic Data Types
10
11This chapter describes those of Guile's simple data types which are
12primarily used for their role as items of generic data. By
13@dfn{simple} we mean data types that are not primarily used as
14containers to hold other data --- i.e.@: pairs, lists, vectors and so on.
15For the documentation of such @dfn{compound} data types, see
16@ref{Compound Data Types}.
17
18@c One of the great strengths of Scheme is that there is no straightforward
19@c distinction between ``data'' and ``functionality''. For example,
20@c Guile's support for dynamic linking could be described:
21
22@c @itemize @bullet
23@c @item
24@c either in a ``data-centric'' way, as the behaviour and properties of the
25@c ``dynamically linked object'' data type, and the operations that may be
26@c applied to instances of this type
27
28@c @item
29@c or in a ``functionality-centric'' way, as the set of procedures that
30@c constitute Guile's support for dynamic linking, in the context of the
31@c module system.
32@c @end itemize
33
34@c The contents of this chapter are, therefore, a matter of judgment. By
35@c @dfn{generic}, we mean to select those data types whose typical use as
36@c @emph{data} in a wide variety of programming contexts is more important
37@c than their use in the implementation of a particular piece of
38@c @emph{functionality}. The last section of this chapter provides
39@c references for all the data types that are documented not here but in a
40@c ``functionality-centric'' way elsewhere in the manual.
41
42@menu
43* Booleans:: True/false values.
44* Numbers:: Numerical data types.
45* Characters:: New character names.
46* Strings:: Special things about strings.
47* Regular Expressions:: Pattern matching and substitution.
48* Symbols:: Symbols.
49* Keywords:: Self-quoting, customizable display keywords.
50* Other Types:: "Functionality-centric" data types.
51@end menu
52
53
54@node Booleans
55@subsection Booleans
56@tpindex Booleans
57
58The two boolean values are @code{#t} for true and @code{#f} for false.
59
60Boolean values are returned by predicate procedures, such as the general
61equality predicates @code{eq?}, @code{eqv?} and @code{equal?}
62(@pxref{Equality}) and numerical and string comparison operators like
63@code{string=?} (@pxref{String Comparison}) and @code{<=}
64(@pxref{Comparison}).
65
66@lisp
67(<= 3 8)
68@result{} #t
69
70(<= 3 -3)
71@result{} #f
72
73(equal? "house" "houses")
74@result{} #f
75
76(eq? #f #f)
77@result{}
78#t
79@end lisp
80
81In test condition contexts like @code{if} and @code{cond} (@pxref{if
82cond case}), where a group of subexpressions will be evaluated only if a
83@var{condition} expression evaluates to ``true'', ``true'' means any
84value at all except @code{#f}.
85
86@lisp
87(if #t "yes" "no")
88@result{} "yes"
89
90(if 0 "yes" "no")
91@result{} "yes"
92
93(if #f "yes" "no")
94@result{} "no"
95@end lisp
96
97A result of this asymmetry is that typical Scheme source code more often
98uses @code{#f} explicitly than @code{#t}: @code{#f} is necessary to
99represent an @code{if} or @code{cond} false value, whereas @code{#t} is
100not necessary to represent an @code{if} or @code{cond} true value.
101
102It is important to note that @code{#f} is @strong{not} equivalent to any
103other Scheme value. In particular, @code{#f} is not the same as the
104number 0 (like in C and C++), and not the same as the ``empty list''
105(like in some Lisp dialects).
106
107In C, the two Scheme boolean values are available as the two constants
108@code{SCM_BOOL_T} for @code{#t} and @code{SCM_BOOL_F} for @code{#f}.
109Care must be taken with the false value @code{SCM_BOOL_F}: it is not
110false when used in C conditionals. In order to test for it, use
111@code{scm_is_false} or @code{scm_is_true}.
112
113@rnindex not
114@deffn {Scheme Procedure} not x
115@deffnx {C Function} scm_not (x)
116Return @code{#t} if @var{x} is @code{#f}, else return @code{#f}.
117@end deffn
118
119@rnindex boolean?
120@deffn {Scheme Procedure} boolean? obj
121@deffnx {C Function} scm_boolean_p (obj)
122Return @code{#t} if @var{obj} is either @code{#t} or @code{#f}, else
123return @code{#f}.
124@end deffn
125
126@deftypevr {C Macro} SCM SCM_BOOL_T
127The @code{SCM} representation of the Scheme object @code{#t}.
128@end deftypevr
129
130@deftypevr {C Macro} SCM SCM_BOOL_F
131The @code{SCM} representation of the Scheme object @code{#f}.
132@end deftypevr
133
134@deftypefn {C Function} int scm_is_true (SCM obj)
135Return @code{0} if @var{obj} is @code{#f}, else return @code{1}.
136@end deftypefn
137
138@deftypefn {C Function} int scm_is_false (SCM obj)
139Return @code{1} if @var{obj} is @code{#f}, else return @code{0}.
140@end deftypefn
141
142@deftypefn {C Function} int scm_is_bool (SCM obj)
143Return @code{1} if @var{obj} is either @code{#t} or @code{#f}, else
144return @code{0}.
145@end deftypefn
146
147@deftypefn {C Function} SCM scm_from_bool (int val)
148Return @code{#f} if @var{val} is @code{0}, else return @code{#t}.
149@end deftypefn
150
151@deftypefn {C Function} int scm_to_bool (SCM val)
152Return @code{1} if @var{val} is @code{SCM_BOOL_T}, return @code{0}
153when @var{val} is @code{SCM_BOOL_F}, else signal a `wrong type' error.
154
155You should probably use @code{scm_is_true} instead of this function
156when you just want to test a @code{SCM} value for trueness.
157@end deftypefn
158
159@node Numbers
160@subsection Numerical data types
161@tpindex Numbers
162
163Guile supports a rich ``tower'' of numerical types --- integer,
164rational, real and complex --- and provides an extensive set of
165mathematical and scientific functions for operating on numerical
166data. This section of the manual documents those types and functions.
167
168You may also find it illuminating to read R5RS's presentation of numbers
169in Scheme, which is particularly clear and accessible: see
170@ref{Numbers,,,r5rs,R5RS}.
171
172@menu
173* Numerical Tower:: Scheme's numerical "tower".
174* Integers:: Whole numbers.
175* Reals and Rationals:: Real and rational numbers.
176* Complex Numbers:: Complex numbers.
177* Exactness:: Exactness and inexactness.
178* Number Syntax:: Read syntax for numerical data.
179* Integer Operations:: Operations on integer values.
180* Comparison:: Comparison predicates.
181* Conversion:: Converting numbers to and from strings.
182* Complex:: Complex number operations.
183* Arithmetic:: Arithmetic functions.
184* Scientific:: Scientific functions.
185* Primitive Numerics:: Primitive numeric functions.
186* Bitwise Operations:: Logical AND, OR, NOT, and so on.
187* Random:: Random number generation.
188@end menu
189
190
191@node Numerical Tower
192@subsubsection Scheme's Numerical ``Tower''
193@rnindex number?
194
195Scheme's numerical ``tower'' consists of the following categories of
196numbers:
197
198@table @dfn
199@item integers
200Whole numbers, positive or negative; e.g.@: --5, 0, 18.
201
202@item rationals
203The set of numbers that can be expressed as @math{@var{p}/@var{q}}
204where @var{p} and @var{q} are integers; e.g.@: @math{9/16} works, but
205pi (an irrational number) doesn't. These include integers
206(@math{@var{n}/1}).
207
208@item real numbers
209The set of numbers that describes all possible positions along a
210one-dimensional line. This includes rationals as well as irrational
211numbers.
212
213@item complex numbers
214The set of numbers that describes all possible positions in a two
215dimensional space. This includes real as well as imaginary numbers
216(@math{@var{a}+@var{b}i}, where @var{a} is the @dfn{real part},
217@var{b} is the @dfn{imaginary part}, and @math{i} is the square root of
218@minus{}1.)
219@end table
220
221It is called a tower because each category ``sits on'' the one that
222follows it, in the sense that every integer is also a rational, every
223rational is also real, and every real number is also a complex number
224(but with zero imaginary part).
225
226In addition to the classification into integers, rationals, reals and
227complex numbers, Scheme also distinguishes between whether a number is
228represented exactly or not. For example, the result of
229@m{2\sin(\pi/4),sin(pi/4)} is exactly @m{\sqrt{2},2^(1/2)} but Guile
230can neither represent @m{\pi/4,pi/4} nor @m{\sqrt{2},2^(1/2)} exactly.
231Instead, it stores an inexact approximation, using the C type
232@code{double}.
233
234Guile can represent exact rationals of any magnitude, inexact
235rationals that fit into a C @code{double}, and inexact complex numbers
236with @code{double} real and imaginary parts.
237
238The @code{number?} predicate may be applied to any Scheme value to
239discover whether the value is any of the supported numerical types.
240
241@deffn {Scheme Procedure} number? obj
242@deffnx {C Function} scm_number_p (obj)
243Return @code{#t} if @var{obj} is any kind of number, else @code{#f}.
244@end deffn
245
246For example:
247
248@lisp
249(number? 3)
250@result{} #t
251
252(number? "hello there!")
253@result{} #f
254
255(define pi 3.141592654)
256(number? pi)
257@result{} #t
258@end lisp
259
5615f696
MV
260@deftypefn {C Function} int scm_is_number (SCM obj)
261This is equivalent to @code{scm_is_true (scm_number_p (obj))}.
262@end deftypefn
263
07d83abe
MV
264The next few subsections document each of Guile's numerical data types
265in detail.
266
267@node Integers
268@subsubsection Integers
269
270@tpindex Integer numbers
271
272@rnindex integer?
273
274Integers are whole numbers, that is numbers with no fractional part,
275such as 2, 83, and @minus{}3789.
276
277Integers in Guile can be arbitrarily big, as shown by the following
278example.
279
280@lisp
281(define (factorial n)
282 (let loop ((n n) (product 1))
283 (if (= n 0)
284 product
285 (loop (- n 1) (* product n)))))
286
287(factorial 3)
288@result{} 6
289
290(factorial 20)
291@result{} 2432902008176640000
292
293(- (factorial 45))
294@result{} -119622220865480194561963161495657715064383733760000000000
295@end lisp
296
297Readers whose background is in programming languages where integers are
298limited by the need to fit into just 4 or 8 bytes of memory may find
299this surprising, or suspect that Guile's representation of integers is
300inefficient. In fact, Guile achieves a near optimal balance of
301convenience and efficiency by using the host computer's native
302representation of integers where possible, and a more general
303representation where the required number does not fit in the native
304form. Conversion between these two representations is automatic and
305completely invisible to the Scheme level programmer.
306
307The infinities @samp{+inf.0} and @samp{-inf.0} are considered to be
308inexact integers. They are explained in detail in the next section,
309together with reals and rationals.
310
311C has a host of different integer types, and Guile offers a host of
312functions to convert between them and the @code{SCM} representation.
313For example, a C @code{int} can be handled with @code{scm_to_int} and
314@code{scm_from_int}. Guile also defines a few C integer types of its
315own, to help with differences between systems.
316
317C integer types that are not covered can be handled with the generic
318@code{scm_to_signed_integer} and @code{scm_from_signed_integer} for
319signed types, or with @code{scm_to_unsigned_integer} and
320@code{scm_from_unsigned_integer} for unsigned types.
321
322Scheme integers can be exact and inexact. For example, a number
323written as @code{3.0} with an explicit decimal-point is inexact, but
324it is also an integer. The functions @code{integer?} and
325@code{scm_is_integer} report true for such a number, but the functions
326@code{scm_is_signed_integer} and @code{scm_is_unsigned_integer} only
327allow exact integers and thus report false. Likewise, the conversion
328functions like @code{scm_to_signed_integer} only accept exact
329integers.
330
331The motivation for this behavior is that the inexactness of a number
332should not be lost silently. If you want to allow inexact integers,
333you can explicitely insert a call to @code{inexact->exact} or to its C
334equivalent @code{scm_inexact_to_exact}. (Only inexact integers will
335be converted by this call into exact integers; inexact non-integers
336will become exact fractions.)
337
338@deffn {Scheme Procedure} integer? x
339@deffnx {C Function} scm_integer_p (x)
340Return @code{#t} if @var{x} is an exactor inexact integer number, else
341@code{#f}.
342
343@lisp
344(integer? 487)
345@result{} #t
346
347(integer? 3.0)
348@result{} #t
349
350(integer? -3.4)
351@result{} #f
352
353(integer? +inf.0)
354@result{} #t
355@end lisp
356@end deffn
357
358@deftypefn {C Function} int scm_is_integer (SCM x)
359This is equivalent to @code{scm_is_true (scm_integer_p (x))}.
360@end deftypefn
361
362@defvr {C Type} scm_t_int8
363@defvrx {C Type} scm_t_uint8
364@defvrx {C Type} scm_t_int16
365@defvrx {C Type} scm_t_uint16
366@defvrx {C Type} scm_t_int32
367@defvrx {C Type} scm_t_uint32
368@defvrx {C Type} scm_t_int64
369@defvrx {C Type} scm_t_uint64
370@defvrx {C Type} scm_t_intmax
371@defvrx {C Type} scm_t_uintmax
372The C types are equivalent to the corresponding ISO C types but are
373defined on all platforms, with the exception of @code{scm_t_int64} and
374@code{scm_t_uint64}, which are only defined when a 64-bit type is
375available. For example, @code{scm_t_int8} is equivalent to
376@code{int8_t}.
377
378You can regard these definitions as a stop-gap measure until all
379platforms provide these types. If you know that all the platforms
380that you are interested in already provide these types, it is better
381to use them directly instead of the types provided by Guile.
382@end defvr
383
384@deftypefn {C Function} int scm_is_signed_integer (SCM x, scm_t_intmax min, scm_t_intmax max)
385@deftypefnx {C Function} int scm_is_unsigned_integer (SCM x, scm_t_uintmax min, scm_t_uintmax max)
386Return @code{1} when @var{x} represents an exact integer that is
387between @var{min} and @var{max}, inclusive.
388
389These functions can be used to check whether a @code{SCM} value will
390fit into a given range, such as the range of a given C integer type.
391If you just want to convert a @code{SCM} value to a given C integer
392type, use one of the conversion functions directly.
393@end deftypefn
394
395@deftypefn {C Function} scm_t_intmax scm_to_signed_integer (SCM x, scm_t_intmax min, scm_t_intmax max)
396@deftypefnx {C Function} scm_t_uintmax scm_to_unsigned_integer (SCM x, scm_t_uintmax min, scm_t_uintmax max)
397When @var{x} represents an exact integer that is between @var{min} and
398@var{max} inclusive, return that integer. Else signal an error,
399either a `wrong-type' error when @var{x} is not an exact integer, or
400an `out-of-range' error when it doesn't fit the given range.
401@end deftypefn
402
403@deftypefn {C Function} SCM scm_from_signed_integer (scm_t_intmax x)
404@deftypefnx {C Function} SCM scm_from_unsigned_integer (scm_t_uintmax x)
405Return the @code{SCM} value that represents the integer @var{x}. This
406function will always succeed and will always return an exact number.
407@end deftypefn
408
409@deftypefn {C Function} char scm_to_char (SCM x)
410@deftypefnx {C Function} {signed char} scm_to_schar (SCM x)
411@deftypefnx {C Function} {unsigned char} scm_to_uchar (SCM x)
412@deftypefnx {C Function} short scm_to_short (SCM x)
413@deftypefnx {C Function} {unsigned short} scm_to_ushort (SCM x)
414@deftypefnx {C Function} int scm_to_int (SCM x)
415@deftypefnx {C Function} {unsigned int} scm_to_uint (SCM x)
416@deftypefnx {C Function} long scm_to_long (SCM x)
417@deftypefnx {C Function} {unsigned long} scm_to_ulong (SCM x)
418@deftypefnx {C Function} {long long} scm_to_long_long (SCM x)
419@deftypefnx {C Function} {unsigned long long} scm_to_ulong_long (SCM x)
420@deftypefnx {C Function} size_t scm_to_size_t (SCM x)
421@deftypefnx {C Function} ssize_t scm_to_ssize_t (SCM x)
422@deftypefnx {C Function} scm_t_int8 scm_to_int8 (SCM x)
423@deftypefnx {C Function} scm_t_uint8 scm_to_uint8 (SCM x)
424@deftypefnx {C Function} scm_t_int16 scm_to_int16 (SCM x)
425@deftypefnx {C Function} scm_t_uint16 scm_to_uint16 (SCM x)
426@deftypefnx {C Function} scm_t_int32 scm_to_int32 (SCM x)
427@deftypefnx {C Function} scm_t_uint32 scm_to_uint32 (SCM x)
428@deftypefnx {C Function} scm_t_int64 scm_to_int64 (SCM x)
429@deftypefnx {C Function} scm_t_uint64 scm_to_uint64 (SCM x)
430@deftypefnx {C Function} scm_t_intmax scm_to_intmax (SCM x)
431@deftypefnx {C Function} scm_t_uintmax scm_to_uintmax (SCM x)
432When @var{x} represents an exact integer that fits into the indicated
433C type, return that integer. Else signal an error, either a
434`wrong-type' error when @var{x} is not an exact integer, or an
435`out-of-range' error when it doesn't fit the given range.
436
437The functions @code{scm_to_long_long}, @code{scm_to_ulong_long},
438@code{scm_to_int64}, and @code{scm_to_uint64} are only available when
439the corresponding types are.
440@end deftypefn
441
442@deftypefn {C Function} SCM scm_from_char (char x)
443@deftypefnx {C Function} SCM scm_from_schar (signed char x)
444@deftypefnx {C Function} SCM scm_from_uchar (unsigned char x)
445@deftypefnx {C Function} SCM scm_from_short (short x)
446@deftypefnx {C Function} SCM scm_from_ushort (unsigned short x)
447@deftypefnx {C Function} SCM scm_from_int (int x)
448@deftypefnx {C Function} SCM scm_from_uint (unsigned int x)
449@deftypefnx {C Function} SCM scm_from_long (long x)
450@deftypefnx {C Function} SCM scm_from_ulong (unsigned long x)
451@deftypefnx {C Function} SCM scm_from_long_long (long long x)
452@deftypefnx {C Function} SCM scm_from_ulong_long (unsigned long long x)
453@deftypefnx {C Function} SCM scm_from_size_t (size_t x)
454@deftypefnx {C Function} SCM scm_from_ssize_t (ssize_t x)
455@deftypefnx {C Function} SCM scm_from_int8 (scm_t_int8 x)
456@deftypefnx {C Function} SCM scm_from_uint8 (scm_t_uint8 x)
457@deftypefnx {C Function} SCM scm_from_int16 (scm_t_int16 x)
458@deftypefnx {C Function} SCM scm_from_uint16 (scm_t_uint16 x)
459@deftypefnx {C Function} SCM scm_from_int32 (scm_t_int32 x)
460@deftypefnx {C Function} SCM scm_from_uint32 (scm_t_uint32 x)
461@deftypefnx {C Function} SCM scm_from_int64 (scm_t_int64 x)
462@deftypefnx {C Function} SCM scm_from_uint64 (scm_t_uint64 x)
463@deftypefnx {C Function} SCM scm_from_intmax (scm_t_intmax x)
464@deftypefnx {C Function} SCM scm_from_uintmax (scm_t_uintmax x)
465Return the @code{SCM} value that represents the integer @var{x}.
466These functions will always succeed and will always return an exact
467number.
468@end deftypefn
469
470@node Reals and Rationals
471@subsubsection Real and Rational Numbers
472@tpindex Real numbers
473@tpindex Rational numbers
474
475@rnindex real?
476@rnindex rational?
477
478Mathematically, the real numbers are the set of numbers that describe
479all possible points along a continuous, infinite, one-dimensional line.
480The rational numbers are the set of all numbers that can be written as
481fractions @var{p}/@var{q}, where @var{p} and @var{q} are integers.
482All rational numbers are also real, but there are real numbers that
483are not rational, for example the square root of 2, and pi.
484
485Guile can represent both exact and inexact rational numbers, but it
486can not represent irrational numbers. Exact rationals are represented
487by storing the numerator and denominator as two exact integers.
488Inexact rationals are stored as floating point numbers using the C
489type @code{double}.
490
491Exact rationals are written as a fraction of integers. There must be
492no whitespace around the slash:
493
494@lisp
4951/2
496-22/7
497@end lisp
498
499Even though the actual encoding of inexact rationals is in binary, it
500may be helpful to think of it as a decimal number with a limited
501number of significant figures and a decimal point somewhere, since
502this corresponds to the standard notation for non-whole numbers. For
503example:
504
505@lisp
5060.34
507-0.00000142857931198
508-5648394822220000000000.0
5094.0
510@end lisp
511
512The limited precision of Guile's encoding means that any ``real'' number
513in Guile can be written in a rational form, by multiplying and then dividing
514by sufficient powers of 10 (or in fact, 2). For example,
515@samp{-0.00000142857931198} is the same as @minus{}142857931198 divided by
516100000000000000000. In Guile's current incarnation, therefore, the
517@code{rational?} and @code{real?} predicates are equivalent.
518
519
520Dividing by an exact zero leads to a error message, as one might
521expect. However, dividing by an inexact zero does not produce an
522error. Instead, the result of the division is either plus or minus
523infinity, depending on the sign of the divided number.
524
525The infinities are written @samp{+inf.0} and @samp{-inf.0},
526respectivly. This syntax is also recognized by @code{read} as an
527extension to the usual Scheme syntax.
528
529Dividing zero by zero yields something that is not a number at all:
530@samp{+nan.0}. This is the special `not a number' value.
531
532On platforms that follow @acronym{IEEE} 754 for their floating point
533arithmetic, the @samp{+inf.0}, @samp{-inf.0}, and @samp{+nan.0} values
534are implemented using the corresponding @acronym{IEEE} 754 values.
535They behave in arithmetic operations like @acronym{IEEE} 754 describes
536it, i.e., @code{(= +nan.0 +nan.0)} @result{} @code{#f}.
537
538The infinities are inexact integers and are considered to be both even
539and odd. While @samp{+nan.0} is not @code{=} to itself, it is
540@code{eqv?} to itself.
541
542To test for the special values, use the functions @code{inf?} and
543@code{nan?}.
544
545@deffn {Scheme Procedure} real? obj
546@deffnx {C Function} scm_real_p (obj)
547Return @code{#t} if @var{obj} is a real number, else @code{#f}. Note
548that the sets of integer and rational values form subsets of the set
549of real numbers, so the predicate will also be fulfilled if @var{obj}
550is an integer number or a rational number.
551@end deffn
552
553@deffn {Scheme Procedure} rational? x
554@deffnx {C Function} scm_rational_p (x)
555Return @code{#t} if @var{x} is a rational number, @code{#f} otherwise.
556Note that the set of integer values forms a subset of the set of
557rational numbers, i. e. the predicate will also be fulfilled if
558@var{x} is an integer number.
559
560Since Guile can not represent irrational numbers, every number
561satisfying @code{real?} also satisfies @code{rational?} in Guile.
562@end deffn
563
564@deffn {Scheme Procedure} rationalize x eps
565@deffnx {C Function} scm_rationalize (x, eps)
566Returns the @emph{simplest} rational number differing
567from @var{x} by no more than @var{eps}.
568
569As required by @acronym{R5RS}, @code{rationalize} only returns an
570exact result when both its arguments are exact. Thus, you might need
571to use @code{inexact->exact} on the arguments.
572
573@lisp
574(rationalize (inexact->exact 1.2) 1/100)
575@result{} 6/5
576@end lisp
577
578@end deffn
579
d3df9759
MV
580@deffn {Scheme Procedure} inf? x
581@deffnx {C Function} scm_inf_p (x)
07d83abe
MV
582Return @code{#t} if @var{x} is either @samp{+inf.0} or @samp{-inf.0},
583@code{#f} otherwise.
584@end deffn
585
586@deffn {Scheme Procedure} nan? x
d3df9759 587@deffnx {C Function} scm_nan_p (x)
07d83abe
MV
588Return @code{#t} if @var{x} is @samp{+nan.0}, @code{#f} otherwise.
589@end deffn
590
d3df9759
MV
591@deffn {Scheme Procedure} numerator x
592@deffnx {C Function} scm_numerator (x)
593Return the numerator of the rational number @var{x}.
594@end deffn
595
596@deffn {Scheme Procedure} denominator x
597@deffnx {C Function} scm_denominator (x)
598Return the denominator of the rational number @var{x}.
599@end deffn
600
601@deftypefn {C Function} int scm_is_real (SCM val)
602@deftypefnx {C Function} int scm_is_rational (SCM val)
603Equivalent to @code{scm_is_true (scm_real_p (val))} and
604@code{scm_is_true (scm_rational_p (val))}, respectively.
605@end deftypefn
606
607@deftypefn {C Function} double scm_to_double (SCM val)
608Returns the number closest to @var{val} that is representable as a
609@code{double}. Returns infinity for a @var{val} that is too large in
610magnitude. The argument @var{val} must be a real number.
611@end deftypefn
612
613@deftypefn {C Function} SCM scm_from_double (double val)
614Return the @code{SCM} value that representats @var{val}. The returned
615value is inexact according to the predicate @code{inexact?}, but it
616will be exactly equal to @var{val}.
617@end deftypefn
618
07d83abe
MV
619@node Complex Numbers
620@subsubsection Complex Numbers
621@tpindex Complex numbers
622
623@rnindex complex?
624
625Complex numbers are the set of numbers that describe all possible points
626in a two-dimensional space. The two coordinates of a particular point
627in this space are known as the @dfn{real} and @dfn{imaginary} parts of
628the complex number that describes that point.
629
630In Guile, complex numbers are written in rectangular form as the sum of
631their real and imaginary parts, using the symbol @code{i} to indicate
632the imaginary part.
633
634@lisp
6353+4i
636@result{}
6373.0+4.0i
638
639(* 3-8i 2.3+0.3i)
640@result{}
6419.3-17.5i
642@end lisp
643
644Guile represents a complex number with a non-zero imaginary part as a
645pair of inexact rationals, so the real and imaginary parts of a
646complex number have the same properties of inexactness and limited
647precision as single inexact rational numbers. Guile can not represent
648exact complex numbers with non-zero imaginary parts.
649
5615f696
MV
650@deffn {Scheme Procedure} complex? z
651@deffnx {C Function} scm_complex_p (z)
07d83abe
MV
652Return @code{#t} if @var{x} is a complex number, @code{#f}
653otherwise. Note that the sets of real, rational and integer
654values form subsets of the set of complex numbers, i. e. the
655predicate will also be fulfilled if @var{x} is a real,
656rational or integer number.
657@end deffn
658
07d83abe
MV
659@node Exactness
660@subsubsection Exact and Inexact Numbers
661@tpindex Exact numbers
662@tpindex Inexact numbers
663
664@rnindex exact?
665@rnindex inexact?
666@rnindex exact->inexact
667@rnindex inexact->exact
668
669R5RS requires that a calculation involving inexact numbers always
670produces an inexact result. To meet this requirement, Guile
671distinguishes between an exact integer value such as @samp{5} and the
672corresponding inexact real value which, to the limited precision
673available, has no fractional part, and is printed as @samp{5.0}. Guile
674will only convert the latter value to the former when forced to do so by
675an invocation of the @code{inexact->exact} procedure.
676
677@deffn {Scheme Procedure} exact? z
678@deffnx {C Function} scm_exact_p (z)
679Return @code{#t} if the number @var{z} is exact, @code{#f}
680otherwise.
681
682@lisp
683(exact? 2)
684@result{} #t
685
686(exact? 0.5)
687@result{} #f
688
689(exact? (/ 2))
690@result{} #t
691@end lisp
692
693@end deffn
694
695@deffn {Scheme Procedure} inexact? z
696@deffnx {C Function} scm_inexact_p (z)
697Return @code{#t} if the number @var{z} is inexact, @code{#f}
698else.
699@end deffn
700
701@deffn {Scheme Procedure} inexact->exact z
702@deffnx {C Function} scm_inexact_to_exact (z)
703Return an exact number that is numerically closest to @var{z}, when
704there is one. For inexact rationals, Guile returns the exact rational
705that is numerically equal to the inexact rational. Inexact complex
706numbers with a non-zero imaginary part can not be made exact.
707
708@lisp
709(inexact->exact 0.5)
710@result{} 1/2
711@end lisp
712
713The following happens because 12/10 is not exactly representable as a
714@code{double} (on most platforms). However, when reading a decimal
715number that has been marked exact with the ``#e'' prefix, Guile is
716able to represent it correctly.
717
718@lisp
719(inexact->exact 1.2)
720@result{} 5404319552844595/4503599627370496
721
722#e1.2
723@result{} 6/5
724@end lisp
725
726@end deffn
727
728@c begin (texi-doc-string "guile" "exact->inexact")
729@deffn {Scheme Procedure} exact->inexact z
730@deffnx {C Function} scm_exact_to_inexact (z)
731Convert the number @var{z} to its inexact representation.
732@end deffn
733
734
735@node Number Syntax
736@subsubsection Read Syntax for Numerical Data
737
738The read syntax for integers is a string of digits, optionally
739preceded by a minus or plus character, a code indicating the
740base in which the integer is encoded, and a code indicating whether
741the number is exact or inexact. The supported base codes are:
742
743@table @code
744@item #b
745@itemx #B
746the integer is written in binary (base 2)
747
748@item #o
749@itemx #O
750the integer is written in octal (base 8)
751
752@item #d
753@itemx #D
754the integer is written in decimal (base 10)
755
756@item #x
757@itemx #X
758the integer is written in hexadecimal (base 16)
759@end table
760
761If the base code is omitted, the integer is assumed to be decimal. The
762following examples show how these base codes are used.
763
764@lisp
765-13
766@result{} -13
767
768#d-13
769@result{} -13
770
771#x-13
772@result{} -19
773
774#b+1101
775@result{} 13
776
777#o377
778@result{} 255
779@end lisp
780
781The codes for indicating exactness (which can, incidentally, be applied
782to all numerical values) are:
783
784@table @code
785@item #e
786@itemx #E
787the number is exact
788
789@item #i
790@itemx #I
791the number is inexact.
792@end table
793
794If the exactness indicator is omitted, the number is exact unless it
795contains a radix point. Since Guile can not represent exact complex
796numbers, an error is signalled when asking for them.
797
798@lisp
799(exact? 1.2)
800@result{} #f
801
802(exact? #e1.2)
803@result{} #t
804
805(exact? #e+1i)
806ERROR: Wrong type argument
807@end lisp
808
809Guile also understands the syntax @samp{+inf.0} and @samp{-inf.0} for
810plus and minus infinity, respectively. The value must be written
811exactly as shown, that is, they always must have a sign and exactly
812one zero digit after the decimal point. It also understands
813@samp{+nan.0} and @samp{-nan.0} for the special `not-a-number' value.
814The sign is ignored for `not-a-number' and the value is always printed
815as @samp{+nan.0}.
816
817@node Integer Operations
818@subsubsection Operations on Integer Values
819@rnindex odd?
820@rnindex even?
821@rnindex quotient
822@rnindex remainder
823@rnindex modulo
824@rnindex gcd
825@rnindex lcm
826
827@deffn {Scheme Procedure} odd? n
828@deffnx {C Function} scm_odd_p (n)
829Return @code{#t} if @var{n} is an odd number, @code{#f}
830otherwise.
831@end deffn
832
833@deffn {Scheme Procedure} even? n
834@deffnx {C Function} scm_even_p (n)
835Return @code{#t} if @var{n} is an even number, @code{#f}
836otherwise.
837@end deffn
838
839@c begin (texi-doc-string "guile" "quotient")
840@c begin (texi-doc-string "guile" "remainder")
841@deffn {Scheme Procedure} quotient n d
842@deffnx {Scheme Procedure} remainder n d
843@deffnx {C Function} scm_quotient (n, d)
844@deffnx {C Function} scm_remainder (n, d)
845Return the quotient or remainder from @var{n} divided by @var{d}. The
846quotient is rounded towards zero, and the remainder will have the same
847sign as @var{n}. In all cases quotient and remainder satisfy
848@math{@var{n} = @var{q}*@var{d} + @var{r}}.
849
850@lisp
851(remainder 13 4) @result{} 1
852(remainder -13 4) @result{} -1
853@end lisp
854@end deffn
855
856@c begin (texi-doc-string "guile" "modulo")
857@deffn {Scheme Procedure} modulo n d
858@deffnx {C Function} scm_modulo (n, d)
859Return the remainder from @var{n} divided by @var{d}, with the same
860sign as @var{d}.
861
862@lisp
863(modulo 13 4) @result{} 1
864(modulo -13 4) @result{} 3
865(modulo 13 -4) @result{} -3
866(modulo -13 -4) @result{} -1
867@end lisp
868@end deffn
869
870@c begin (texi-doc-string "guile" "gcd")
871@deffn {Scheme Procedure} gcd
872@deffnx {C Function} scm_gcd (x, y)
873Return the greatest common divisor of all arguments.
874If called without arguments, 0 is returned.
875
876The C function @code{scm_gcd} always takes two arguments, while the
877Scheme function can take an arbitrary number.
878@end deffn
879
880@c begin (texi-doc-string "guile" "lcm")
881@deffn {Scheme Procedure} lcm
882@deffnx {C Function} scm_lcm (x, y)
883Return the least common multiple of the arguments.
884If called without arguments, 1 is returned.
885
886The C function @code{scm_lcm} always takes two arguments, while the
887Scheme function can take an arbitrary number.
888@end deffn
889
890
891@node Comparison
892@subsubsection Comparison Predicates
893@rnindex zero?
894@rnindex positive?
895@rnindex negative?
896
897The C comparison functions below always takes two arguments, while the
898Scheme functions can take an arbitrary number. Also keep in mind that
899the C functions return one of the Scheme boolean values
900@code{SCM_BOOL_T} or @code{SCM_BOOL_F} which are both true as far as C
901is concerned. Thus, always write @code{scm_is_true (scm_num_eq_p (x,
902y))} when testing the two Scheme numbers @code{x} and @code{y} for
903equality, for example.
904
905@c begin (texi-doc-string "guile" "=")
906@deffn {Scheme Procedure} =
907@deffnx {C Function} scm_num_eq_p (x, y)
908Return @code{#t} if all parameters are numerically equal.
909@end deffn
910
911@c begin (texi-doc-string "guile" "<")
912@deffn {Scheme Procedure} <
913@deffnx {C Function} scm_less_p (x, y)
914Return @code{#t} if the list of parameters is monotonically
915increasing.
916@end deffn
917
918@c begin (texi-doc-string "guile" ">")
919@deffn {Scheme Procedure} >
920@deffnx {C Function} scm_gr_p (x, y)
921Return @code{#t} if the list of parameters is monotonically
922decreasing.
923@end deffn
924
925@c begin (texi-doc-string "guile" "<=")
926@deffn {Scheme Procedure} <=
927@deffnx {C Function} scm_leq_p (x, y)
928Return @code{#t} if the list of parameters is monotonically
929non-decreasing.
930@end deffn
931
932@c begin (texi-doc-string "guile" ">=")
933@deffn {Scheme Procedure} >=
934@deffnx {C Function} scm_geq_p (x, y)
935Return @code{#t} if the list of parameters is monotonically
936non-increasing.
937@end deffn
938
939@c begin (texi-doc-string "guile" "zero?")
940@deffn {Scheme Procedure} zero? z
941@deffnx {C Function} scm_zero_p (z)
942Return @code{#t} if @var{z} is an exact or inexact number equal to
943zero.
944@end deffn
945
946@c begin (texi-doc-string "guile" "positive?")
947@deffn {Scheme Procedure} positive? x
948@deffnx {C Function} scm_positive_p (x)
949Return @code{#t} if @var{x} is an exact or inexact number greater than
950zero.
951@end deffn
952
953@c begin (texi-doc-string "guile" "negative?")
954@deffn {Scheme Procedure} negative? x
955@deffnx {C Function} scm_negative_p (x)
956Return @code{#t} if @var{x} is an exact or inexact number less than
957zero.
958@end deffn
959
960
961@node Conversion
962@subsubsection Converting Numbers To and From Strings
963@rnindex number->string
964@rnindex string->number
965
966@deffn {Scheme Procedure} number->string n [radix]
967@deffnx {C Function} scm_number_to_string (n, radix)
968Return a string holding the external representation of the
969number @var{n} in the given @var{radix}. If @var{n} is
970inexact, a radix of 10 will be used.
971@end deffn
972
973@deffn {Scheme Procedure} string->number string [radix]
974@deffnx {C Function} scm_string_to_number (string, radix)
975Return a number of the maximally precise representation
976expressed by the given @var{string}. @var{radix} must be an
977exact integer, either 2, 8, 10, or 16. If supplied, @var{radix}
978is a default radix that may be overridden by an explicit radix
979prefix in @var{string} (e.g. "#o177"). If @var{radix} is not
980supplied, then the default radix is 10. If string is not a
981syntactically valid notation for a number, then
982@code{string->number} returns @code{#f}.
983@end deffn
984
985
986@node Complex
987@subsubsection Complex Number Operations
988@rnindex make-rectangular
989@rnindex make-polar
990@rnindex real-part
991@rnindex imag-part
992@rnindex magnitude
993@rnindex angle
994
995@deffn {Scheme Procedure} make-rectangular real imaginary
996@deffnx {C Function} scm_make_rectangular (real, imaginary)
997Return a complex number constructed of the given @var{real} and
998@var{imaginary} parts.
999@end deffn
1000
1001@deffn {Scheme Procedure} make-polar x y
1002@deffnx {C Function} scm_make_polar (x, y)
1003Return the complex number @var{x} * e^(i * @var{y}).
1004@end deffn
1005
1006@c begin (texi-doc-string "guile" "real-part")
1007@deffn {Scheme Procedure} real-part z
1008@deffnx {C Function} scm_real_part (z)
1009Return the real part of the number @var{z}.
1010@end deffn
1011
1012@c begin (texi-doc-string "guile" "imag-part")
1013@deffn {Scheme Procedure} imag-part z
1014@deffnx {C Function} scm_imag_part (z)
1015Return the imaginary part of the number @var{z}.
1016@end deffn
1017
1018@c begin (texi-doc-string "guile" "magnitude")
1019@deffn {Scheme Procedure} magnitude z
1020@deffnx {C Function} scm_magnitude (z)
1021Return the magnitude of the number @var{z}. This is the same as
1022@code{abs} for real arguments, but also allows complex numbers.
1023@end deffn
1024
1025@c begin (texi-doc-string "guile" "angle")
1026@deffn {Scheme Procedure} angle z
1027@deffnx {C Function} scm_angle (z)
1028Return the angle of the complex number @var{z}.
1029@end deffn
1030
5615f696
MV
1031@deftypefn {C Function} SCM scm_c_make_rectangular (double re, double im)
1032@deftypefnx {C Function} SCM scm_c_make_polar (double x, double y)
1033Like @code{scm_make_rectangular} or @code{scm_make_polar},
1034respectively, but these functions take @code{double}s as their
1035arguments.
1036@end deftypefn
1037
1038@deftypefn {C Function} double scm_c_real_part (z)
1039@deftypefnx {C Function} double scm_c_imag_part (z)
1040Returns the real or imaginary part of @var{z} as a @code{double}.
1041@end deftypefn
1042
1043@deftypefn {C Function} double scm_c_magnitude (z)
1044@deftypefnx {C Function} double scm_c_angle (z)
1045Returns the magnitude or angle of @var{z} as a @code{double}.
1046@end deftypefn
1047
07d83abe
MV
1048
1049@node Arithmetic
1050@subsubsection Arithmetic Functions
1051@rnindex max
1052@rnindex min
1053@rnindex +
1054@rnindex *
1055@rnindex -
1056@rnindex /
1057@rnindex abs
1058@rnindex floor
1059@rnindex ceiling
1060@rnindex truncate
1061@rnindex round
1062
1063The C arithmetic functions below always takes two arguments, while the
1064Scheme functions can take an arbitrary number. When you need to
1065invoke them with just one argument, for example to compute the
1066equivalent od @code{(- x)}, pass @code{SCM_UNDEFINED} as the second
1067one: @code{scm_difference (x, SCM_UNDEFINED)}.
1068
1069@c begin (texi-doc-string "guile" "+")
1070@deffn {Scheme Procedure} + z1 @dots{}
1071@deffnx {C Function} scm_sum (z1, z2)
1072Return the sum of all parameter values. Return 0 if called without any
1073parameters.
1074@end deffn
1075
1076@c begin (texi-doc-string "guile" "-")
1077@deffn {Scheme Procedure} - z1 z2 @dots{}
1078@deffnx {C Function} scm_difference (z1, z2)
1079If called with one argument @var{z1}, -@var{z1} is returned. Otherwise
1080the sum of all but the first argument are subtracted from the first
1081argument.
1082@end deffn
1083
1084@c begin (texi-doc-string "guile" "*")
1085@deffn {Scheme Procedure} * z1 @dots{}
1086@deffnx {C Function} scm_product (z1, z2)
1087Return the product of all arguments. If called without arguments, 1 is
1088returned.
1089@end deffn
1090
1091@c begin (texi-doc-string "guile" "/")
1092@deffn {Scheme Procedure} / z1 z2 @dots{}
1093@deffnx {C Function} scm_divide (z1, z2)
1094Divide the first argument by the product of the remaining arguments. If
1095called with one argument @var{z1}, 1/@var{z1} is returned.
1096@end deffn
1097
1098@c begin (texi-doc-string "guile" "abs")
1099@deffn {Scheme Procedure} abs x
1100@deffnx {C Function} scm_abs (x)
1101Return the absolute value of @var{x}.
1102
1103@var{x} must be a number with zero imaginary part. To calculate the
1104magnitude of a complex number, use @code{magnitude} instead.
1105@end deffn
1106
1107@c begin (texi-doc-string "guile" "max")
1108@deffn {Scheme Procedure} max x1 x2 @dots{}
1109@deffnx {C Function} scm_max (x1, x2)
1110Return the maximum of all parameter values.
1111@end deffn
1112
1113@c begin (texi-doc-string "guile" "min")
1114@deffn {Scheme Procedure} min x1 x2 @dots{}
1115@deffnx {C Function} scm_min (x1, x2)
1116Return the minimum of all parameter values.
1117@end deffn
1118
1119@c begin (texi-doc-string "guile" "truncate")
1120@deffn {Scheme Procedure} truncate
1121@deffnx {C Function} scm_truncate_number (x)
1122Round the inexact number @var{x} towards zero.
1123@end deffn
1124
1125@c begin (texi-doc-string "guile" "round")
1126@deffn {Scheme Procedure} round x
1127@deffnx {C Function} scm_round_number (x)
1128Round the inexact number @var{x} to the nearest integer. When exactly
1129halfway between two integers, round to the even one.
1130@end deffn
1131
1132@c begin (texi-doc-string "guile" "floor")
1133@deffn {Scheme Procedure} floor x
1134@deffnx {C Function} scm_floor (x)
1135Round the number @var{x} towards minus infinity.
1136@end deffn
1137
1138@c begin (texi-doc-string "guile" "ceiling")
1139@deffn {Scheme Procedure} ceiling x
1140@deffnx {C Function} scm_ceiling (x)
1141Round the number @var{x} towards infinity.
1142@end deffn
1143
1144
1145@node Scientific
1146@subsubsection Scientific Functions
1147
1148The following procedures accept any kind of number as arguments,
1149including complex numbers.
1150
1151@rnindex sqrt
1152@c begin (texi-doc-string "guile" "sqrt")
1153@deffn {Scheme Procedure} sqrt z
1154Return the square root of @var{z}.
1155@end deffn
1156
1157@rnindex expt
1158@c begin (texi-doc-string "guile" "expt")
1159@deffn {Scheme Procedure} expt z1 z2
1160Return @var{z1} raised to the power of @var{z2}.
1161@end deffn
1162
1163@rnindex sin
1164@c begin (texi-doc-string "guile" "sin")
1165@deffn {Scheme Procedure} sin z
1166Return the sine of @var{z}.
1167@end deffn
1168
1169@rnindex cos
1170@c begin (texi-doc-string "guile" "cos")
1171@deffn {Scheme Procedure} cos z
1172Return the cosine of @var{z}.
1173@end deffn
1174
1175@rnindex tan
1176@c begin (texi-doc-string "guile" "tan")
1177@deffn {Scheme Procedure} tan z
1178Return the tangent of @var{z}.
1179@end deffn
1180
1181@rnindex asin
1182@c begin (texi-doc-string "guile" "asin")
1183@deffn {Scheme Procedure} asin z
1184Return the arcsine of @var{z}.
1185@end deffn
1186
1187@rnindex acos
1188@c begin (texi-doc-string "guile" "acos")
1189@deffn {Scheme Procedure} acos z
1190Return the arccosine of @var{z}.
1191@end deffn
1192
1193@rnindex atan
1194@c begin (texi-doc-string "guile" "atan")
1195@deffn {Scheme Procedure} atan z
1196@deffnx {Scheme Procedure} atan y x
1197Return the arctangent of @var{z}, or of @math{@var{y}/@var{x}}.
1198@end deffn
1199
1200@rnindex exp
1201@c begin (texi-doc-string "guile" "exp")
1202@deffn {Scheme Procedure} exp z
1203Return e to the power of @var{z}, where e is the base of natural
1204logarithms (2.71828@dots{}).
1205@end deffn
1206
1207@rnindex log
1208@c begin (texi-doc-string "guile" "log")
1209@deffn {Scheme Procedure} log z
1210Return the natural logarithm of @var{z}.
1211@end deffn
1212
1213@c begin (texi-doc-string "guile" "log10")
1214@deffn {Scheme Procedure} log10 z
1215Return the base 10 logarithm of @var{z}.
1216@end deffn
1217
1218@c begin (texi-doc-string "guile" "sinh")
1219@deffn {Scheme Procedure} sinh z
1220Return the hyperbolic sine of @var{z}.
1221@end deffn
1222
1223@c begin (texi-doc-string "guile" "cosh")
1224@deffn {Scheme Procedure} cosh z
1225Return the hyperbolic cosine of @var{z}.
1226@end deffn
1227
1228@c begin (texi-doc-string "guile" "tanh")
1229@deffn {Scheme Procedure} tanh z
1230Return the hyperbolic tangent of @var{z}.
1231@end deffn
1232
1233@c begin (texi-doc-string "guile" "asinh")
1234@deffn {Scheme Procedure} asinh z
1235Return the hyperbolic arcsine of @var{z}.
1236@end deffn
1237
1238@c begin (texi-doc-string "guile" "acosh")
1239@deffn {Scheme Procedure} acosh z
1240Return the hyperbolic arccosine of @var{z}.
1241@end deffn
1242
1243@c begin (texi-doc-string "guile" "atanh")
1244@deffn {Scheme Procedure} atanh z
1245Return the hyperbolic arctangent of @var{z}.
1246@end deffn
1247
1248
1249@node Primitive Numerics
1250@subsubsection Primitive Numeric Functions
1251
1252Many of Guile's numeric procedures which accept any kind of numbers as
1253arguments, including complex numbers, are implemented as Scheme
1254procedures that use the following real number-based primitives. These
1255primitives signal an error if they are called with complex arguments.
1256
1257@c begin (texi-doc-string "guile" "$abs")
1258@deffn {Scheme Procedure} $abs x
1259Return the absolute value of @var{x}.
1260@end deffn
1261
1262@c begin (texi-doc-string "guile" "$sqrt")
1263@deffn {Scheme Procedure} $sqrt x
1264Return the square root of @var{x}.
1265@end deffn
1266
1267@deffn {Scheme Procedure} $expt x y
1268@deffnx {C Function} scm_sys_expt (x, y)
1269Return @var{x} raised to the power of @var{y}. This
1270procedure does not accept complex arguments.
1271@end deffn
1272
1273@c begin (texi-doc-string "guile" "$sin")
1274@deffn {Scheme Procedure} $sin x
1275Return the sine of @var{x}.
1276@end deffn
1277
1278@c begin (texi-doc-string "guile" "$cos")
1279@deffn {Scheme Procedure} $cos x
1280Return the cosine of @var{x}.
1281@end deffn
1282
1283@c begin (texi-doc-string "guile" "$tan")
1284@deffn {Scheme Procedure} $tan x
1285Return the tangent of @var{x}.
1286@end deffn
1287
1288@c begin (texi-doc-string "guile" "$asin")
1289@deffn {Scheme Procedure} $asin x
1290Return the arcsine of @var{x}.
1291@end deffn
1292
1293@c begin (texi-doc-string "guile" "$acos")
1294@deffn {Scheme Procedure} $acos x
1295Return the arccosine of @var{x}.
1296@end deffn
1297
1298@c begin (texi-doc-string "guile" "$atan")
1299@deffn {Scheme Procedure} $atan x
1300Return the arctangent of @var{x} in the range @minus{}@math{PI/2} to
1301@math{PI/2}.
1302@end deffn
1303
1304@deffn {Scheme Procedure} $atan2 x y
1305@deffnx {C Function} scm_sys_atan2 (x, y)
1306Return the arc tangent of the two arguments @var{x} and
1307@var{y}. This is similar to calculating the arc tangent of
1308@var{x} / @var{y}, except that the signs of both arguments
1309are used to determine the quadrant of the result. This
1310procedure does not accept complex arguments.
1311@end deffn
1312
1313@c begin (texi-doc-string "guile" "$exp")
1314@deffn {Scheme Procedure} $exp x
1315Return e to the power of @var{x}, where e is the base of natural
1316logarithms (2.71828@dots{}).
1317@end deffn
1318
1319@c begin (texi-doc-string "guile" "$log")
1320@deffn {Scheme Procedure} $log x
1321Return the natural logarithm of @var{x}.
1322@end deffn
1323
1324@c begin (texi-doc-string "guile" "$sinh")
1325@deffn {Scheme Procedure} $sinh x
1326Return the hyperbolic sine of @var{x}.
1327@end deffn
1328
1329@c begin (texi-doc-string "guile" "$cosh")
1330@deffn {Scheme Procedure} $cosh x
1331Return the hyperbolic cosine of @var{x}.
1332@end deffn
1333
1334@c begin (texi-doc-string "guile" "$tanh")
1335@deffn {Scheme Procedure} $tanh x
1336Return the hyperbolic tangent of @var{x}.
1337@end deffn
1338
1339@c begin (texi-doc-string "guile" "$asinh")
1340@deffn {Scheme Procedure} $asinh x
1341Return the hyperbolic arcsine of @var{x}.
1342@end deffn
1343
1344@c begin (texi-doc-string "guile" "$acosh")
1345@deffn {Scheme Procedure} $acosh x
1346Return the hyperbolic arccosine of @var{x}.
1347@end deffn
1348
1349@c begin (texi-doc-string "guile" "$atanh")
1350@deffn {Scheme Procedure} $atanh x
1351Return the hyperbolic arctangent of @var{x}.
1352@end deffn
1353
1354C functions for the above are provided by the standard mathematics
1355library. Naturally these expect and return @code{double} arguments
1356(@pxref{Mathematics,,, libc, GNU C Library Reference Manual}).
1357
1358@multitable {xx} {Scheme Procedure} {C Function}
1359@item @tab Scheme Procedure @tab C Function
1360
1361@item @tab @code{$abs} @tab @code{fabs}
1362@item @tab @code{$sqrt} @tab @code{sqrt}
1363@item @tab @code{$sin} @tab @code{sin}
1364@item @tab @code{$cos} @tab @code{cos}
1365@item @tab @code{$tan} @tab @code{tan}
1366@item @tab @code{$asin} @tab @code{asin}
1367@item @tab @code{$acos} @tab @code{acos}
1368@item @tab @code{$atan} @tab @code{atan}
1369@item @tab @code{$atan2} @tab @code{atan2}
1370@item @tab @code{$exp} @tab @code{exp}
1371@item @tab @code{$expt} @tab @code{pow}
1372@item @tab @code{$log} @tab @code{log}
1373@item @tab @code{$sinh} @tab @code{sinh}
1374@item @tab @code{$cosh} @tab @code{cosh}
1375@item @tab @code{$tanh} @tab @code{tanh}
1376@item @tab @code{$asinh} @tab @code{asinh}
1377@item @tab @code{$acosh} @tab @code{acosh}
1378@item @tab @code{$atanh} @tab @code{atanh}
1379@end multitable
1380
1381@code{asinh}, @code{acosh} and @code{atanh} are C99 standard but might
1382not be available on older systems. Guile provides the following
1383equivalents (on all systems).
1384
1385@deftypefn {C Function} double scm_asinh (double x)
1386@deftypefnx {C Function} double scm_acosh (double x)
1387@deftypefnx {C Function} double scm_atanh (double x)
1388Return the hyperbolic arcsine, arccosine or arctangent of @var{x}
1389respectively.
1390@end deftypefn
1391
1392
1393@node Bitwise Operations
1394@subsubsection Bitwise Operations
1395
1396For the following bitwise functions, negative numbers are treated as
1397infinite precision twos-complements. For instance @math{-6} is bits
1398@math{@dots{}111010}, with infinitely many ones on the left. It can
1399be seen that adding 6 (binary 110) to such a bit pattern gives all
1400zeros.
1401
1402@deffn {Scheme Procedure} logand n1 n2 @dots{}
1403@deffnx {C Function} scm_logand (n1, n2)
1404Return the bitwise @sc{and} of the integer arguments.
1405
1406@lisp
1407(logand) @result{} -1
1408(logand 7) @result{} 7
1409(logand #b111 #b011 #b001) @result{} 1
1410@end lisp
1411@end deffn
1412
1413@deffn {Scheme Procedure} logior n1 n2 @dots{}
1414@deffnx {C Function} scm_logior (n1, n2)
1415Return the bitwise @sc{or} of the integer arguments.
1416
1417@lisp
1418(logior) @result{} 0
1419(logior 7) @result{} 7
1420(logior #b000 #b001 #b011) @result{} 3
1421@end lisp
1422@end deffn
1423
1424@deffn {Scheme Procedure} logxor n1 n2 @dots{}
1425@deffnx {C Function} scm_loxor (n1, n2)
1426Return the bitwise @sc{xor} of the integer arguments. A bit is
1427set in the result if it is set in an odd number of arguments.
1428
1429@lisp
1430(logxor) @result{} 0
1431(logxor 7) @result{} 7
1432(logxor #b000 #b001 #b011) @result{} 2
1433(logxor #b000 #b001 #b011 #b011) @result{} 1
1434@end lisp
1435@end deffn
1436
1437@deffn {Scheme Procedure} lognot n
1438@deffnx {C Function} scm_lognot (n)
1439Return the integer which is the ones-complement of the integer
1440argument, ie.@: each 0 bit is changed to 1 and each 1 bit to 0.
1441
1442@lisp
1443(number->string (lognot #b10000000) 2)
1444 @result{} "-10000001"
1445(number->string (lognot #b0) 2)
1446 @result{} "-1"
1447@end lisp
1448@end deffn
1449
1450@deffn {Scheme Procedure} logtest j k
1451@deffnx {C Function} scm_logtest (j, k)
1452@lisp
1453(logtest j k) @equiv{} (not (zero? (logand j k)))
1454
1455(logtest #b0100 #b1011) @result{} #f
1456(logtest #b0100 #b0111) @result{} #t
1457@end lisp
1458@end deffn
1459
1460@deffn {Scheme Procedure} logbit? index j
1461@deffnx {C Function} scm_logbit_p (index, j)
1462@lisp
1463(logbit? index j) @equiv{} (logtest (integer-expt 2 index) j)
1464
1465(logbit? 0 #b1101) @result{} #t
1466(logbit? 1 #b1101) @result{} #f
1467(logbit? 2 #b1101) @result{} #t
1468(logbit? 3 #b1101) @result{} #t
1469(logbit? 4 #b1101) @result{} #f
1470@end lisp
1471@end deffn
1472
1473@deffn {Scheme Procedure} ash n cnt
1474@deffnx {C Function} scm_ash (n, cnt)
1475Return @var{n} shifted left by @var{cnt} bits, or shifted right if
1476@var{cnt} is negative. This is an ``arithmetic'' shift.
1477
1478This is effectively a multiplication by @m{2^{cnt}, 2^@var{cnt}}, and
1479when @var{cnt} is negative it's a division, rounded towards negative
1480infinity. (Note that this is not the same rounding as @code{quotient}
1481does.)
1482
1483With @var{n} viewed as an infinite precision twos complement,
1484@code{ash} means a left shift introducing zero bits, or a right shift
1485dropping bits.
1486
1487@lisp
1488(number->string (ash #b1 3) 2) @result{} "1000"
1489(number->string (ash #b1010 -1) 2) @result{} "101"
1490
1491;; -23 is bits ...11101001, -6 is bits ...111010
1492(ash -23 -2) @result{} -6
1493@end lisp
1494@end deffn
1495
1496@deffn {Scheme Procedure} logcount n
1497@deffnx {C Function} scm_logcount (n)
1498Return the number of bits in integer @var{n}. If integer is
1499positive, the 1-bits in its binary representation are counted.
1500If negative, the 0-bits in its two's-complement binary
1501representation are counted. If 0, 0 is returned.
1502
1503@lisp
1504(logcount #b10101010)
1505 @result{} 4
1506(logcount 0)
1507 @result{} 0
1508(logcount -2)
1509 @result{} 1
1510@end lisp
1511@end deffn
1512
1513@deffn {Scheme Procedure} integer-length n
1514@deffnx {C Function} scm_integer_length (n)
1515Return the number of bits necessary to represent @var{n}.
1516
1517For positive @var{n} this is how many bits to the most significant one
1518bit. For negative @var{n} it's how many bits to the most significant
1519zero bit in twos complement form.
1520
1521@lisp
1522(integer-length #b10101010) @result{} 8
1523(integer-length #b1111) @result{} 4
1524(integer-length 0) @result{} 0
1525(integer-length -1) @result{} 0
1526(integer-length -256) @result{} 8
1527(integer-length -257) @result{} 9
1528@end lisp
1529@end deffn
1530
1531@deffn {Scheme Procedure} integer-expt n k
1532@deffnx {C Function} scm_integer_expt (n, k)
1533Return @var{n} raised to the non-negative integer exponent
1534@var{k}.
1535
1536@lisp
1537(integer-expt 2 5)
1538 @result{} 32
1539(integer-expt -3 3)
1540 @result{} -27
1541@end lisp
1542@end deffn
1543
1544@deffn {Scheme Procedure} bit-extract n start end
1545@deffnx {C Function} scm_bit_extract (n, start, end)
1546Return the integer composed of the @var{start} (inclusive)
1547through @var{end} (exclusive) bits of @var{n}. The
1548@var{start}th bit becomes the 0-th bit in the result.
1549
1550@lisp
1551(number->string (bit-extract #b1101101010 0 4) 2)
1552 @result{} "1010"
1553(number->string (bit-extract #b1101101010 4 9) 2)
1554 @result{} "10110"
1555@end lisp
1556@end deffn
1557
1558
1559@node Random
1560@subsubsection Random Number Generation
1561
1562Pseudo-random numbers are generated from a random state object, which
1563can be created with @code{seed->random-state}. The @var{state}
1564parameter to the various functions below is optional, it defaults to
1565the state object in the @code{*random-state*} variable.
1566
1567@deffn {Scheme Procedure} copy-random-state [state]
1568@deffnx {C Function} scm_copy_random_state (state)
1569Return a copy of the random state @var{state}.
1570@end deffn
1571
1572@deffn {Scheme Procedure} random n [state]
1573@deffnx {C Function} scm_random (n, state)
1574Return a number in [0, @var{n}).
1575
1576Accepts a positive integer or real n and returns a
1577number of the same type between zero (inclusive) and
1578@var{n} (exclusive). The values returned have a uniform
1579distribution.
1580@end deffn
1581
1582@deffn {Scheme Procedure} random:exp [state]
1583@deffnx {C Function} scm_random_exp (state)
1584Return an inexact real in an exponential distribution with mean
15851. For an exponential distribution with mean @var{u} use @code{(*
1586@var{u} (random:exp))}.
1587@end deffn
1588
1589@deffn {Scheme Procedure} random:hollow-sphere! vect [state]
1590@deffnx {C Function} scm_random_hollow_sphere_x (vect, state)
1591Fills @var{vect} with inexact real random numbers the sum of whose
1592squares is equal to 1.0. Thinking of @var{vect} as coordinates in
1593space of dimension @var{n} @math{=} @code{(vector-length @var{vect})},
1594the coordinates are uniformly distributed over the surface of the unit
1595n-sphere.
1596@end deffn
1597
1598@deffn {Scheme Procedure} random:normal [state]
1599@deffnx {C Function} scm_random_normal (state)
1600Return an inexact real in a normal distribution. The distribution
1601used has mean 0 and standard deviation 1. For a normal distribution
1602with mean @var{m} and standard deviation @var{d} use @code{(+ @var{m}
1603(* @var{d} (random:normal)))}.
1604@end deffn
1605
1606@deffn {Scheme Procedure} random:normal-vector! vect [state]
1607@deffnx {C Function} scm_random_normal_vector_x (vect, state)
1608Fills @var{vect} with inexact real random numbers that are
1609independent and standard normally distributed
1610(i.e., with mean 0 and variance 1).
1611@end deffn
1612
1613@deffn {Scheme Procedure} random:solid-sphere! vect [state]
1614@deffnx {C Function} scm_random_solid_sphere_x (vect, state)
1615Fills @var{vect} with inexact real random numbers the sum of whose
1616squares is less than 1.0. Thinking of @var{vect} as coordinates in
1617space of dimension @var{n} @math{=} @code{(vector-length @var{vect})},
1618the coordinates are uniformly distributed within the unit
1619@var{n}-sphere. The sum of the squares of the numbers is returned.
1620@c FIXME: What does this mean, particularly the n-sphere part?
1621@end deffn
1622
1623@deffn {Scheme Procedure} random:uniform [state]
1624@deffnx {C Function} scm_random_uniform (state)
1625Return a uniformly distributed inexact real random number in
1626[0,1).
1627@end deffn
1628
1629@deffn {Scheme Procedure} seed->random-state seed
1630@deffnx {C Function} scm_seed_to_random_state (seed)
1631Return a new random state using @var{seed}.
1632@end deffn
1633
1634@defvar *random-state*
1635The global random state used by the above functions when the
1636@var{state} parameter is not given.
1637@end defvar
1638
1639
1640@node Characters
1641@subsection Characters
1642@tpindex Characters
1643
1644@noindent
1645[@strong{FIXME}: how do you specify regular (non-control) characters?]
1646
1647Most of the ``control characters'' (those below codepoint 32) in the
1648@acronym{ASCII} character set, as well as the space, may be referred
1649to by name: for example, @code{#\tab}, @code{#\esc}, @code{#\stx}, and
1650so on. The following table describes the @acronym{ASCII} names for
1651each character.
1652
1653@multitable @columnfractions .25 .25 .25 .25
1654@item 0 = @code{#\nul}
1655 @tab 1 = @code{#\soh}
1656 @tab 2 = @code{#\stx}
1657 @tab 3 = @code{#\etx}
1658@item 4 = @code{#\eot}
1659 @tab 5 = @code{#\enq}
1660 @tab 6 = @code{#\ack}
1661 @tab 7 = @code{#\bel}
1662@item 8 = @code{#\bs}
1663 @tab 9 = @code{#\ht}
1664 @tab 10 = @code{#\nl}
1665 @tab 11 = @code{#\vt}
1666@item 12 = @code{#\np}
1667 @tab 13 = @code{#\cr}
1668 @tab 14 = @code{#\so}
1669 @tab 15 = @code{#\si}
1670@item 16 = @code{#\dle}
1671 @tab 17 = @code{#\dc1}
1672 @tab 18 = @code{#\dc2}
1673 @tab 19 = @code{#\dc3}
1674@item 20 = @code{#\dc4}
1675 @tab 21 = @code{#\nak}
1676 @tab 22 = @code{#\syn}
1677 @tab 23 = @code{#\etb}
1678@item 24 = @code{#\can}
1679 @tab 25 = @code{#\em}
1680 @tab 26 = @code{#\sub}
1681 @tab 27 = @code{#\esc}
1682@item 28 = @code{#\fs}
1683 @tab 29 = @code{#\gs}
1684 @tab 30 = @code{#\rs}
1685 @tab 31 = @code{#\us}
1686@item 32 = @code{#\sp}
1687@end multitable
1688
1689The ``delete'' character (octal 177) may be referred to with the name
1690@code{#\del}.
1691
1692Several characters have more than one name:
1693
1694@multitable {@code{#\backspace}} {Original}
1695@item Alias @tab Original
1696@item @code{#\space} @tab @code{#\sp}
1697@item @code{#\newline} @tab @code{#\nl}
1698@item @code{#\tab} @tab @code{#\ht}
1699@item @code{#\backspace} @tab @code{#\bs}
1700@item @code{#\return} @tab @code{#\cr}
1701@item @code{#\page} @tab @code{#\np}
1702@item @code{#\null} @tab @code{#\nul}
1703@end multitable
1704
1705@rnindex char?
1706@deffn {Scheme Procedure} char? x
1707@deffnx {C Function} scm_char_p (x)
1708Return @code{#t} iff @var{x} is a character, else @code{#f}.
1709@end deffn
1710
1711@rnindex char=?
1712@deffn {Scheme Procedure} char=? x y
1713Return @code{#t} iff @var{x} is the same character as @var{y}, else @code{#f}.
1714@end deffn
1715
1716@rnindex char<?
1717@deffn {Scheme Procedure} char<? x y
1718Return @code{#t} iff @var{x} is less than @var{y} in the @acronym{ASCII} sequence,
1719else @code{#f}.
1720@end deffn
1721
1722@rnindex char<=?
1723@deffn {Scheme Procedure} char<=? x y
1724Return @code{#t} iff @var{x} is less than or equal to @var{y} in the
1725@acronym{ASCII} sequence, else @code{#f}.
1726@end deffn
1727
1728@rnindex char>?
1729@deffn {Scheme Procedure} char>? x y
1730Return @code{#t} iff @var{x} is greater than @var{y} in the @acronym{ASCII}
1731sequence, else @code{#f}.
1732@end deffn
1733
1734@rnindex char>=?
1735@deffn {Scheme Procedure} char>=? x y
1736Return @code{#t} iff @var{x} is greater than or equal to @var{y} in the
1737@acronym{ASCII} sequence, else @code{#f}.
1738@end deffn
1739
1740@rnindex char-ci=?
1741@deffn {Scheme Procedure} char-ci=? x y
1742Return @code{#t} iff @var{x} is the same character as @var{y} ignoring
1743case, else @code{#f}.
1744@end deffn
1745
1746@rnindex char-ci<?
1747@deffn {Scheme Procedure} char-ci<? x y
1748Return @code{#t} iff @var{x} is less than @var{y} in the @acronym{ASCII} sequence
1749ignoring case, else @code{#f}.
1750@end deffn
1751
1752@rnindex char-ci<=?
1753@deffn {Scheme Procedure} char-ci<=? x y
1754Return @code{#t} iff @var{x} is less than or equal to @var{y} in the
1755@acronym{ASCII} sequence ignoring case, else @code{#f}.
1756@end deffn
1757
1758@rnindex char-ci>?
1759@deffn {Scheme Procedure} char-ci>? x y
1760Return @code{#t} iff @var{x} is greater than @var{y} in the @acronym{ASCII}
1761sequence ignoring case, else @code{#f}.
1762@end deffn
1763
1764@rnindex char-ci>=?
1765@deffn {Scheme Procedure} char-ci>=? x y
1766Return @code{#t} iff @var{x} is greater than or equal to @var{y} in the
1767@acronym{ASCII} sequence ignoring case, else @code{#f}.
1768@end deffn
1769
1770@rnindex char-alphabetic?
1771@deffn {Scheme Procedure} char-alphabetic? chr
1772@deffnx {C Function} scm_char_alphabetic_p (chr)
1773Return @code{#t} iff @var{chr} is alphabetic, else @code{#f}.
1774Alphabetic means the same thing as the @code{isalpha} C library function.
1775@end deffn
1776
1777@rnindex char-numeric?
1778@deffn {Scheme Procedure} char-numeric? chr
1779@deffnx {C Function} scm_char_numeric_p (chr)
1780Return @code{#t} iff @var{chr} is numeric, else @code{#f}.
1781Numeric means the same thing as the @code{isdigit} C library function.
1782@end deffn
1783
1784@rnindex char-whitespace?
1785@deffn {Scheme Procedure} char-whitespace? chr
1786@deffnx {C Function} scm_char_whitespace_p (chr)
1787Return @code{#t} iff @var{chr} is whitespace, else @code{#f}.
1788Whitespace means the same thing as the @code{isspace} C library function.
1789@end deffn
1790
1791@rnindex char-upper-case?
1792@deffn {Scheme Procedure} char-upper-case? chr
1793@deffnx {C Function} scm_char_upper_case_p (chr)
1794Return @code{#t} iff @var{chr} is uppercase, else @code{#f}.
1795Uppercase means the same thing as the @code{isupper} C library function.
1796@end deffn
1797
1798@rnindex char-lower-case?
1799@deffn {Scheme Procedure} char-lower-case? chr
1800@deffnx {C Function} scm_char_lower_case_p (chr)
1801Return @code{#t} iff @var{chr} is lowercase, else @code{#f}.
1802Lowercase means the same thing as the @code{islower} C library function.
1803@end deffn
1804
1805@deffn {Scheme Procedure} char-is-both? chr
1806@deffnx {C Function} scm_char_is_both_p (chr)
1807Return @code{#t} iff @var{chr} is either uppercase or lowercase, else
1808@code{#f}. Uppercase and lowercase are as defined by the
1809@code{isupper} and @code{islower} C library functions.
1810@end deffn
1811
1812@rnindex char->integer
1813@deffn {Scheme Procedure} char->integer chr
1814@deffnx {C Function} scm_char_to_integer (chr)
1815Return the number corresponding to ordinal position of @var{chr} in the
1816@acronym{ASCII} sequence.
1817@end deffn
1818
1819@rnindex integer->char
1820@deffn {Scheme Procedure} integer->char n
1821@deffnx {C Function} scm_integer_to_char (n)
1822Return the character at position @var{n} in the @acronym{ASCII} sequence.
1823@end deffn
1824
1825@rnindex char-upcase
1826@deffn {Scheme Procedure} char-upcase chr
1827@deffnx {C Function} scm_char_upcase (chr)
1828Return the uppercase character version of @var{chr}.
1829@end deffn
1830
1831@rnindex char-downcase
1832@deffn {Scheme Procedure} char-downcase chr
1833@deffnx {C Function} scm_char_downcase (chr)
1834Return the lowercase character version of @var{chr}.
1835@end deffn
1836
1837@xref{Classification of Characters,,,libc,GNU C Library Reference
1838Manual}, for information about the @code{is*} Standard C functions
1839mentioned above.
1840
1841
1842@node Strings
1843@subsection Strings
1844@tpindex Strings
1845
1846Strings are fixed-length sequences of characters. They can be created
1847by calling constructor procedures, but they can also literally get
1848entered at the @acronym{REPL} or in Scheme source files.
1849
1850@c Guile provides a rich set of string processing procedures, because text
1851@c handling is very important when Guile is used as a scripting language.
1852
1853Strings always carry the information about how many characters they are
1854composed of with them, so there is no special end-of-string character,
1855like in C. That means that Scheme strings can contain any character,
1856even the @samp{NUL} character @samp{\0}. But note: Since most operating
1857system calls dealing with strings (such as for file operations) expect
1858strings to be zero-terminated, they might do unexpected things when
1859called with string containing unusual characters.
1860
1861@menu
1862* String Syntax:: Read syntax for strings.
1863* String Predicates:: Testing strings for certain properties.
1864* String Constructors:: Creating new string objects.
1865* List/String Conversion:: Converting from/to lists of characters.
1866* String Selection:: Select portions from strings.
1867* String Modification:: Modify parts or whole strings.
1868* String Comparison:: Lexicographic ordering predicates.
1869* String Searching:: Searching in strings.
1870* Alphabetic Case Mapping:: Convert the alphabetic case of strings.
1871* Appending Strings:: Appending strings to form a new string.
1872@end menu
1873
1874@node String Syntax
1875@subsubsection String Read Syntax
1876
1877@c In the following @code is used to get a good font in TeX etc, but
1878@c is omitted for Info format, so as not to risk any confusion over
1879@c whether surrounding ` ' quotes are part of the escape or are
1880@c special in a string (they're not).
1881
1882The read syntax for strings is an arbitrarily long sequence of
1883characters enclosed in double quotes (@nicode{"}). @footnote{Actually,
1884the current implementation restricts strings to a length of
1885@math{2^24}, or 16,777,216, characters. Sorry.}
1886
1887Backslash is an escape character and can be used to insert the
1888following special characters. @nicode{\"} and @nicode{\\} are R5RS
1889standard, the rest are Guile extensions, notice they follow C string
1890syntax.
1891
1892@table @asis
1893@item @nicode{\\}
1894Backslash character.
1895
1896@item @nicode{\"}
1897Double quote character (an unescaped @nicode{"} is otherwise the end
1898of the string).
1899
1900@item @nicode{\0}
1901NUL character (ASCII 0).
1902
1903@item @nicode{\a}
1904Bell character (ASCII 7).
1905
1906@item @nicode{\f}
1907Formfeed character (ASCII 12).
1908
1909@item @nicode{\n}
1910Newline character (ASCII 10).
1911
1912@item @nicode{\r}
1913Carriage return character (ASCII 13).
1914
1915@item @nicode{\t}
1916Tab character (ASCII 9).
1917
1918@item @nicode{\v}
1919Vertical tab character (ASCII 11).
1920
1921@item @nicode{\xHH}
1922Character code given by two hexadecimal digits. For example
1923@nicode{\x7f} for an ASCII DEL (127).
1924@end table
1925
1926@noindent
1927The following are examples of string literals:
1928
1929@lisp
1930"foo"
1931"bar plonk"
1932"Hello World"
1933"\"Hi\", he said."
1934@end lisp
1935
1936
1937@node String Predicates
1938@subsubsection String Predicates
1939
1940The following procedures can be used to check whether a given string
1941fulfills some specified property.
1942
1943@rnindex string?
1944@deffn {Scheme Procedure} string? obj
1945@deffnx {C Function} scm_string_p (obj)
1946Return @code{#t} if @var{obj} is a string, else @code{#f}.
1947@end deffn
1948
1949@deffn {Scheme Procedure} string-null? str
1950@deffnx {C Function} scm_string_null_p (str)
1951Return @code{#t} if @var{str}'s length is zero, and
1952@code{#f} otherwise.
1953@lisp
1954(string-null? "") @result{} #t
1955y @result{} "foo"
1956(string-null? y) @result{} #f
1957@end lisp
1958@end deffn
1959
1960@node String Constructors
1961@subsubsection String Constructors
1962
1963The string constructor procedures create new string objects, possibly
1964initializing them with some specified character data.
1965
1966@c FIXME::martin: list->string belongs into `List/String Conversion'
1967
1968@rnindex string
1969@rnindex list->string
1970@deffn {Scheme Procedure} string . chrs
1971@deffnx {Scheme Procedure} list->string chrs
1972@deffnx {C Function} scm_string (chrs)
1973Return a newly allocated string composed of the arguments,
1974@var{chrs}.
1975@end deffn
1976
1977@rnindex make-string
1978@deffn {Scheme Procedure} make-string k [chr]
1979@deffnx {C Function} scm_make_string (k, chr)
1980Return a newly allocated string of
1981length @var{k}. If @var{chr} is given, then all elements of
1982the string are initialized to @var{chr}, otherwise the contents
1983of the @var{string} are unspecified.
1984@end deffn
1985
1986@node List/String Conversion
1987@subsubsection List/String conversion
1988
1989When processing strings, it is often convenient to first convert them
1990into a list representation by using the procedure @code{string->list},
1991work with the resulting list, and then convert it back into a string.
1992These procedures are useful for similar tasks.
1993
1994@rnindex string->list
1995@deffn {Scheme Procedure} string->list str
1996@deffnx {C Function} scm_string_to_list (str)
1997Return a newly allocated list of the characters that make up
1998the given string @var{str}. @code{string->list} and
1999@code{list->string} are inverses as far as @samp{equal?} is
2000concerned.
2001@end deffn
2002
2003@deffn {Scheme Procedure} string-split str chr
2004@deffnx {C Function} scm_string_split (str, chr)
2005Split the string @var{str} into the a list of the substrings delimited
2006by appearances of the character @var{chr}. Note that an empty substring
2007between separator characters will result in an empty string in the
2008result list.
2009
2010@lisp
2011(string-split "root:x:0:0:root:/root:/bin/bash" #\:)
2012@result{}
2013("root" "x" "0" "0" "root" "/root" "/bin/bash")
2014
2015(string-split "::" #\:)
2016@result{}
2017("" "" "")
2018
2019(string-split "" #\:)
2020@result{}
2021("")
2022@end lisp
2023@end deffn
2024
2025
2026@node String Selection
2027@subsubsection String Selection
2028
2029Portions of strings can be extracted by these procedures.
2030@code{string-ref} delivers individual characters whereas
2031@code{substring} can be used to extract substrings from longer strings.
2032
2033@rnindex string-length
2034@deffn {Scheme Procedure} string-length string
2035@deffnx {C Function} scm_string_length (string)
2036Return the number of characters in @var{string}.
2037@end deffn
2038
2039@rnindex string-ref
2040@deffn {Scheme Procedure} string-ref str k
2041@deffnx {C Function} scm_string_ref (str, k)
2042Return character @var{k} of @var{str} using zero-origin
2043indexing. @var{k} must be a valid index of @var{str}.
2044@end deffn
2045
2046@rnindex string-copy
2047@deffn {Scheme Procedure} string-copy str
2048@deffnx {C Function} scm_string_copy (str)
2049Return a newly allocated copy of the given @var{string}.
2050@end deffn
2051
2052@rnindex substring
2053@deffn {Scheme Procedure} substring str start [end]
2054@deffnx {C Function} scm_substring (str, start, end)
2055Return a newly allocated string formed from the characters
2056of @var{str} beginning with index @var{start} (inclusive) and
2057ending with index @var{end} (exclusive).
2058@var{str} must be a string, @var{start} and @var{end} must be
2059exact integers satisfying:
2060
20610 <= @var{start} <= @var{end} <= @code{(string-length @var{str})}.
2062@end deffn
2063
2064@node String Modification
2065@subsubsection String Modification
2066
2067These procedures are for modifying strings in-place. This means that the
2068result of the operation is not a new string; instead, the original string's
2069memory representation is modified.
2070
2071@rnindex string-set!
2072@deffn {Scheme Procedure} string-set! str k chr
2073@deffnx {C Function} scm_string_set_x (str, k, chr)
2074Store @var{chr} in element @var{k} of @var{str} and return
2075an unspecified value. @var{k} must be a valid index of
2076@var{str}.
2077@end deffn
2078
2079@rnindex string-fill!
2080@deffn {Scheme Procedure} string-fill! str chr
2081@deffnx {C Function} scm_string_fill_x (str, chr)
2082Store @var{char} in every element of the given @var{string} and
2083return an unspecified value.
2084@end deffn
2085
2086@deffn {Scheme Procedure} substring-fill! str start end fill
2087@deffnx {C Function} scm_substring_fill_x (str, start, end, fill)
2088Change every character in @var{str} between @var{start} and
2089@var{end} to @var{fill}.
2090
2091@lisp
2092(define y "abcdefg")
2093(substring-fill! y 1 3 #\r)
2094y
2095@result{} "arrdefg"
2096@end lisp
2097@end deffn
2098
2099@deffn {Scheme Procedure} substring-move! str1 start1 end1 str2 start2
2100@deffnx {C Function} scm_substring_move_x (str1, start1, end1, str2, start2)
2101Copy the substring of @var{str1} bounded by @var{start1} and @var{end1}
2102into @var{str2} beginning at position @var{start2}.
2103@var{str1} and @var{str2} can be the same string.
2104@end deffn
2105
2106
2107@node String Comparison
2108@subsubsection String Comparison
2109
2110The procedures in this section are similar to the character ordering
2111predicates (@pxref{Characters}), but are defined on character sequences.
2112They all return @code{#t} on success and @code{#f} on failure. The
2113predicates ending in @code{-ci} ignore the character case when comparing
2114strings.
2115
2116
2117@rnindex string=?
2118@deffn {Scheme Procedure} string=? s1 s2
2119Lexicographic equality predicate; return @code{#t} if the two
2120strings are the same length and contain the same characters in
2121the same positions, otherwise return @code{#f}.
2122
2123The procedure @code{string-ci=?} treats upper and lower case
2124letters as though they were the same character, but
2125@code{string=?} treats upper and lower case as distinct
2126characters.
2127@end deffn
2128
2129@rnindex string<?
2130@deffn {Scheme Procedure} string<? s1 s2
2131Lexicographic ordering predicate; return @code{#t} if @var{s1}
2132is lexicographically less than @var{s2}.
2133@end deffn
2134
2135@rnindex string<=?
2136@deffn {Scheme Procedure} string<=? s1 s2
2137Lexicographic ordering predicate; return @code{#t} if @var{s1}
2138is lexicographically less than or equal to @var{s2}.
2139@end deffn
2140
2141@rnindex string>?
2142@deffn {Scheme Procedure} string>? s1 s2
2143Lexicographic ordering predicate; return @code{#t} if @var{s1}
2144is lexicographically greater than @var{s2}.
2145@end deffn
2146
2147@rnindex string>=?
2148@deffn {Scheme Procedure} string>=? s1 s2
2149Lexicographic ordering predicate; return @code{#t} if @var{s1}
2150is lexicographically greater than or equal to @var{s2}.
2151@end deffn
2152
2153@rnindex string-ci=?
2154@deffn {Scheme Procedure} string-ci=? s1 s2
2155Case-insensitive string equality predicate; return @code{#t} if
2156the two strings are the same length and their component
2157characters match (ignoring case) at each position; otherwise
2158return @code{#f}.
2159@end deffn
2160
2161@rnindex string-ci<
2162@deffn {Scheme Procedure} string-ci<? s1 s2
2163Case insensitive lexicographic ordering predicate; return
2164@code{#t} if @var{s1} is lexicographically less than @var{s2}
2165regardless of case.
2166@end deffn
2167
2168@rnindex string<=?
2169@deffn {Scheme Procedure} string-ci<=? s1 s2
2170Case insensitive lexicographic ordering predicate; return
2171@code{#t} if @var{s1} is lexicographically less than or equal
2172to @var{s2} regardless of case.
2173@end deffn
2174
2175@rnindex string-ci>?
2176@deffn {Scheme Procedure} string-ci>? s1 s2
2177Case insensitive lexicographic ordering predicate; return
2178@code{#t} if @var{s1} is lexicographically greater than
2179@var{s2} regardless of case.
2180@end deffn
2181
2182@rnindex string-ci>=?
2183@deffn {Scheme Procedure} string-ci>=? s1 s2
2184Case insensitive lexicographic ordering predicate; return
2185@code{#t} if @var{s1} is lexicographically greater than or
2186equal to @var{s2} regardless of case.
2187@end deffn
2188
2189
2190@node String Searching
2191@subsubsection String Searching
2192
2193When searching for the index of a character in a string, these
2194procedures can be used.
2195
2196@deffn {Scheme Procedure} string-index str chr [frm [to]]
2197@deffnx {C Function} scm_string_index (str, chr, frm, to)
2198Return the index of the first occurrence of @var{chr} in
2199@var{str}. The optional integer arguments @var{frm} and
2200@var{to} limit the search to a portion of the string. This
2201procedure essentially implements the @code{index} or
2202@code{strchr} functions from the C library.
2203
2204@lisp
2205(string-index "weiner" #\e)
2206@result{} 1
2207
2208(string-index "weiner" #\e 2)
2209@result{} 4
2210
2211(string-index "weiner" #\e 2 4)
2212@result{} #f
2213@end lisp
2214@end deffn
2215
2216@deffn {Scheme Procedure} string-rindex str chr [frm [to]]
2217@deffnx {C Function} scm_string_rindex (str, chr, frm, to)
2218Like @code{string-index}, but search from the right of the
2219string rather than from the left. This procedure essentially
2220implements the @code{rindex} or @code{strrchr} functions from
2221the C library.
2222
2223@lisp
2224(string-rindex "weiner" #\e)
2225@result{} 4
2226
2227(string-rindex "weiner" #\e 2 4)
2228@result{} #f
2229
2230(string-rindex "weiner" #\e 2 5)
2231@result{} 4
2232@end lisp
2233@end deffn
2234
2235@node Alphabetic Case Mapping
2236@subsubsection Alphabetic Case Mapping
2237
2238These are procedures for mapping strings to their upper- or lower-case
2239equivalents, respectively, or for capitalizing strings.
2240
2241@deffn {Scheme Procedure} string-upcase str
2242@deffnx {C Function} scm_string_upcase (str)
2243Return a freshly allocated string containing the characters of
2244@var{str} in upper case.
2245@end deffn
2246
2247@deffn {Scheme Procedure} string-upcase! str
2248@deffnx {C Function} scm_string_upcase_x (str)
2249Destructively upcase every character in @var{str} and return
2250@var{str}.
2251@lisp
2252y @result{} "arrdefg"
2253(string-upcase! y) @result{} "ARRDEFG"
2254y @result{} "ARRDEFG"
2255@end lisp
2256@end deffn
2257
2258@deffn {Scheme Procedure} string-downcase str
2259@deffnx {C Function} scm_string_downcase (str)
2260Return a freshly allocation string containing the characters in
2261@var{str} in lower case.
2262@end deffn
2263
2264@deffn {Scheme Procedure} string-downcase! str
2265@deffnx {C Function} scm_string_downcase_x (str)
2266Destructively downcase every character in @var{str} and return
2267@var{str}.
2268@lisp
2269y @result{} "ARRDEFG"
2270(string-downcase! y) @result{} "arrdefg"
2271y @result{} "arrdefg"
2272@end lisp
2273@end deffn
2274
2275@deffn {Scheme Procedure} string-capitalize str
2276@deffnx {C Function} scm_string_capitalize (str)
2277Return a freshly allocated string with the characters in
2278@var{str}, where the first character of every word is
2279capitalized.
2280@end deffn
2281
2282@deffn {Scheme Procedure} string-capitalize! str
2283@deffnx {C Function} scm_string_capitalize_x (str)
2284Upcase the first character of every word in @var{str}
2285destructively and return @var{str}.
2286
2287@lisp
2288y @result{} "hello world"
2289(string-capitalize! y) @result{} "Hello World"
2290y @result{} "Hello World"
2291@end lisp
2292@end deffn
2293
2294
2295@node Appending Strings
2296@subsubsection Appending Strings
2297
2298The procedure @code{string-append} appends several strings together to
2299form a longer result string.
2300
2301@rnindex string-append
2302@deffn {Scheme Procedure} string-append . args
2303@deffnx {C Function} scm_string_append (args)
2304Return a newly allocated string whose characters form the
2305concatenation of the given strings, @var{args}.
2306
2307@example
2308(let ((h "hello "))
2309 (string-append h "world"))
2310@result{} "hello world"
2311@end example
2312@end deffn
2313
2314
2315@node Regular Expressions
2316@subsection Regular Expressions
2317@tpindex Regular expressions
2318
2319@cindex regular expressions
2320@cindex regex
2321@cindex emacs regexp
2322
2323A @dfn{regular expression} (or @dfn{regexp}) is a pattern that
2324describes a whole class of strings. A full description of regular
2325expressions and their syntax is beyond the scope of this manual;
2326an introduction can be found in the Emacs manual (@pxref{Regexps,
2327, Syntax of Regular Expressions, emacs, The GNU Emacs Manual}), or
2328in many general Unix reference books.
2329
2330If your system does not include a POSIX regular expression library,
2331and you have not linked Guile with a third-party regexp library such
2332as Rx, these functions will not be available. You can tell whether
2333your Guile installation includes regular expression support by
2334checking whether @code{(provided? 'regex)} returns true.
2335
2336The following regexp and string matching features are provided by the
2337@code{(ice-9 regex)} module. Before using the described functions,
2338you should load this module by executing @code{(use-modules (ice-9
2339regex))}.
2340
2341@menu
2342* Regexp Functions:: Functions that create and match regexps.
2343* Match Structures:: Finding what was matched by a regexp.
2344* Backslash Escapes:: Removing the special meaning of regexp
2345 meta-characters.
2346@end menu
2347
2348
2349@node Regexp Functions
2350@subsubsection Regexp Functions
2351
2352By default, Guile supports POSIX extended regular expressions.
2353That means that the characters @samp{(}, @samp{)}, @samp{+} and
2354@samp{?} are special, and must be escaped if you wish to match the
2355literal characters.
2356
2357This regular expression interface was modeled after that
2358implemented by SCSH, the Scheme Shell. It is intended to be
2359upwardly compatible with SCSH regular expressions.
2360
2361@deffn {Scheme Procedure} string-match pattern str [start]
2362Compile the string @var{pattern} into a regular expression and compare
2363it with @var{str}. The optional numeric argument @var{start} specifies
2364the position of @var{str} at which to begin matching.
2365
2366@code{string-match} returns a @dfn{match structure} which
2367describes what, if anything, was matched by the regular
2368expression. @xref{Match Structures}. If @var{str} does not match
2369@var{pattern} at all, @code{string-match} returns @code{#f}.
2370@end deffn
2371
2372Two examples of a match follow. In the first example, the pattern
2373matches the four digits in the match string. In the second, the pattern
2374matches nothing.
2375
2376@example
2377(string-match "[0-9][0-9][0-9][0-9]" "blah2002")
2378@result{} #("blah2002" (4 . 8))
2379
2380(string-match "[A-Za-z]" "123456")
2381@result{} #f
2382@end example
2383
2384Each time @code{string-match} is called, it must compile its
2385@var{pattern} argument into a regular expression structure. This
2386operation is expensive, which makes @code{string-match} inefficient if
2387the same regular expression is used several times (for example, in a
2388loop). For better performance, you can compile a regular expression in
2389advance and then match strings against the compiled regexp.
2390
2391@deffn {Scheme Procedure} make-regexp pat flag@dots{}
2392@deffnx {C Function} scm_make_regexp (pat, flaglst)
2393Compile the regular expression described by @var{pat}, and
2394return the compiled regexp structure. If @var{pat} does not
2395describe a legal regular expression, @code{make-regexp} throws
2396a @code{regular-expression-syntax} error.
2397
2398The @var{flag} arguments change the behavior of the compiled
2399regular expression. The following values may be supplied:
2400
2401@defvar regexp/icase
2402Consider uppercase and lowercase letters to be the same when
2403matching.
2404@end defvar
2405
2406@defvar regexp/newline
2407If a newline appears in the target string, then permit the
2408@samp{^} and @samp{$} operators to match immediately after or
2409immediately before the newline, respectively. Also, the
2410@samp{.} and @samp{[^...]} operators will never match a newline
2411character. The intent of this flag is to treat the target
2412string as a buffer containing many lines of text, and the
2413regular expression as a pattern that may match a single one of
2414those lines.
2415@end defvar
2416
2417@defvar regexp/basic
2418Compile a basic (``obsolete'') regexp instead of the extended
2419(``modern'') regexps that are the default. Basic regexps do
2420not consider @samp{|}, @samp{+} or @samp{?} to be special
2421characters, and require the @samp{@{...@}} and @samp{(...)}
2422metacharacters to be backslash-escaped (@pxref{Backslash
2423Escapes}). There are several other differences between basic
2424and extended regular expressions, but these are the most
2425significant.
2426@end defvar
2427
2428@defvar regexp/extended
2429Compile an extended regular expression rather than a basic
2430regexp. This is the default behavior; this flag will not
2431usually be needed. If a call to @code{make-regexp} includes
2432both @code{regexp/basic} and @code{regexp/extended} flags, the
2433one which comes last will override the earlier one.
2434@end defvar
2435@end deffn
2436
2437@deffn {Scheme Procedure} regexp-exec rx str [start [flags]]
2438@deffnx {C Function} scm_regexp_exec (rx, str, start, flags)
2439Match the compiled regular expression @var{rx} against
2440@code{str}. If the optional integer @var{start} argument is
2441provided, begin matching from that position in the string.
2442Return a match structure describing the results of the match,
2443or @code{#f} if no match could be found.
2444
2445The @var{flags} arguments change the matching behavior.
2446The following flags may be supplied:
2447
2448@defvar regexp/notbol
2449Operator @samp{^} always fails (unless @code{regexp/newline}
2450is used). Use this when the beginning of the string should
2451not be considered the beginning of a line.
2452@end defvar
2453
2454@defvar regexp/noteol
2455Operator @samp{$} always fails (unless @code{regexp/newline}
2456is used). Use this when the end of the string should not be
2457considered the end of a line.
2458@end defvar
2459@end deffn
2460
2461@lisp
2462;; Regexp to match uppercase letters
2463(define r (make-regexp "[A-Z]*"))
2464
2465;; Regexp to match letters, ignoring case
2466(define ri (make-regexp "[A-Z]*" regexp/icase))
2467
2468;; Search for bob using regexp r
2469(match:substring (regexp-exec r "bob"))
2470@result{} "" ; no match
2471
2472;; Search for bob using regexp ri
2473(match:substring (regexp-exec ri "Bob"))
2474@result{} "Bob" ; matched case insensitive
2475@end lisp
2476
2477@deffn {Scheme Procedure} regexp? obj
2478@deffnx {C Function} scm_regexp_p (obj)
2479Return @code{#t} if @var{obj} is a compiled regular expression,
2480or @code{#f} otherwise.
2481@end deffn
2482
2483Regular expressions are commonly used to find patterns in one string and
2484replace them with the contents of another string.
2485
2486@c begin (scm-doc-string "regex.scm" "regexp-substitute")
2487@deffn {Scheme Procedure} regexp-substitute port match [item@dots{}]
2488Write to the output port @var{port} selected contents of the match
2489structure @var{match}. Each @var{item} specifies what should be
2490written, and may be one of the following arguments:
2491
2492@itemize @bullet
2493@item
2494A string. String arguments are written out verbatim.
2495
2496@item
2497An integer. The submatch with that number is written.
2498
2499@item
2500The symbol @samp{pre}. The portion of the matched string preceding
2501the regexp match is written.
2502
2503@item
2504The symbol @samp{post}. The portion of the matched string following
2505the regexp match is written.
2506@end itemize
2507
2508The @var{port} argument may be @code{#f}, in which case nothing is
2509written; instead, @code{regexp-substitute} constructs a string from the
2510specified @var{item}s and returns that.
2511@end deffn
2512
2513The following example takes a regular expression that matches a standard
2514@sc{yyyymmdd}-format date such as @code{"20020828"}. The
2515@code{regexp-substitute} call returns a string computed from the
2516information in the match structure, consisting of the fields and text
2517from the original string reordered and reformatted.
2518
2519@lisp
2520(define date-regex "([0-9][0-9][0-9][0-9])([0-9][0-9])([0-9][0-9])")
2521(define s "Date 20020429 12am.")
2522(define sm (string-match date-regex s))
2523(regexp-substitute #f sm 'pre 2 "-" 3 "-" 1 'post " (" 0 ")")
2524@result{} "Date 04-29-2002 12am. (20020429)"
2525@end lisp
2526
2527@c begin (scm-doc-string "regex.scm" "regexp-substitute")
2528@deffn {Scheme Procedure} regexp-substitute/global port regexp target [item@dots{}]
2529Similar to @code{regexp-substitute}, but can be used to perform global
2530substitutions on @var{str}. Instead of taking a match structure as an
2531argument, @code{regexp-substitute/global} takes two string arguments: a
2532@var{regexp} string describing a regular expression, and a @var{target}
2533string which should be matched against this regular expression.
2534
2535Each @var{item} behaves as in @code{regexp-substitute}, with the
2536following exceptions:
2537
2538@itemize @bullet
2539@item
2540A function may be supplied. When this function is called, it will be
2541passed one argument: a match structure for a given regular expression
2542match. It should return a string to be written out to @var{port}.
2543
2544@item
2545The @samp{post} symbol causes @code{regexp-substitute/global} to recurse
2546on the unmatched portion of @var{str}. This @emph{must} be supplied in
2547order to perform global search-and-replace on @var{str}; if it is not
2548present among the @var{item}s, then @code{regexp-substitute/global} will
2549return after processing a single match.
2550@end itemize
2551@end deffn
2552
2553The example above for @code{regexp-substitute} could be rewritten as
2554follows to remove the @code{string-match} stage:
2555
2556@lisp
2557(define date-regex "([0-9][0-9][0-9][0-9])([0-9][0-9])([0-9][0-9])")
2558(define s "Date 20020429 12am.")
2559(regexp-substitute/global #f date-regex s
2560 'pre 2 "-" 3 "-" 1 'post " (" 0 ")")
2561@result{} "Date 04-29-2002 12am. (20020429)"
2562@end lisp
2563
2564
2565@node Match Structures
2566@subsubsection Match Structures
2567
2568@cindex match structures
2569
2570A @dfn{match structure} is the object returned by @code{string-match} and
2571@code{regexp-exec}. It describes which portion of a string, if any,
2572matched the given regular expression. Match structures include: a
2573reference to the string that was checked for matches; the starting and
2574ending positions of the regexp match; and, if the regexp included any
2575parenthesized subexpressions, the starting and ending positions of each
2576submatch.
2577
2578In each of the regexp match functions described below, the @code{match}
2579argument must be a match structure returned by a previous call to
2580@code{string-match} or @code{regexp-exec}. Most of these functions
2581return some information about the original target string that was
2582matched against a regular expression; we will call that string
2583@var{target} for easy reference.
2584
2585@c begin (scm-doc-string "regex.scm" "regexp-match?")
2586@deffn {Scheme Procedure} regexp-match? obj
2587Return @code{#t} if @var{obj} is a match structure returned by a
2588previous call to @code{regexp-exec}, or @code{#f} otherwise.
2589@end deffn
2590
2591@c begin (scm-doc-string "regex.scm" "match:substring")
2592@deffn {Scheme Procedure} match:substring match [n]
2593Return the portion of @var{target} matched by subexpression number
2594@var{n}. Submatch 0 (the default) represents the entire regexp match.
2595If the regular expression as a whole matched, but the subexpression
2596number @var{n} did not match, return @code{#f}.
2597@end deffn
2598
2599@lisp
2600(define s (string-match "[0-9][0-9][0-9][0-9]" "blah2002foo"))
2601(match:substring s)
2602@result{} "2002"
2603
2604;; match starting at offset 6 in the string
2605(match:substring
2606 (string-match "[0-9][0-9][0-9][0-9]" "blah987654" 6))
2607@result{} "7654"
2608@end lisp
2609
2610@c begin (scm-doc-string "regex.scm" "match:start")
2611@deffn {Scheme Procedure} match:start match [n]
2612Return the starting position of submatch number @var{n}.
2613@end deffn
2614
2615In the following example, the result is 4, since the match starts at
2616character index 4:
2617
2618@lisp
2619(define s (string-match "[0-9][0-9][0-9][0-9]" "blah2002foo"))
2620(match:start s)
2621@result{} 4
2622@end lisp
2623
2624@c begin (scm-doc-string "regex.scm" "match:end")
2625@deffn {Scheme Procedure} match:end match [n]
2626Return the ending position of submatch number @var{n}.
2627@end deffn
2628
2629In the following example, the result is 8, since the match runs between
2630characters 4 and 8 (i.e. the ``2002'').
2631
2632@lisp
2633(define s (string-match "[0-9][0-9][0-9][0-9]" "blah2002foo"))
2634(match:end s)
2635@result{} 8
2636@end lisp
2637
2638@c begin (scm-doc-string "regex.scm" "match:prefix")
2639@deffn {Scheme Procedure} match:prefix match
2640Return the unmatched portion of @var{target} preceding the regexp match.
2641
2642@lisp
2643(define s (string-match "[0-9][0-9][0-9][0-9]" "blah2002foo"))
2644(match:prefix s)
2645@result{} "blah"
2646@end lisp
2647@end deffn
2648
2649@c begin (scm-doc-string "regex.scm" "match:suffix")
2650@deffn {Scheme Procedure} match:suffix match
2651Return the unmatched portion of @var{target} following the regexp match.
2652@end deffn
2653
2654@lisp
2655(define s (string-match "[0-9][0-9][0-9][0-9]" "blah2002foo"))
2656(match:suffix s)
2657@result{} "foo"
2658@end lisp
2659
2660@c begin (scm-doc-string "regex.scm" "match:count")
2661@deffn {Scheme Procedure} match:count match
2662Return the number of parenthesized subexpressions from @var{match}.
2663Note that the entire regular expression match itself counts as a
2664subexpression, and failed submatches are included in the count.
2665@end deffn
2666
2667@c begin (scm-doc-string "regex.scm" "match:string")
2668@deffn {Scheme Procedure} match:string match
2669Return the original @var{target} string.
2670@end deffn
2671
2672@lisp
2673(define s (string-match "[0-9][0-9][0-9][0-9]" "blah2002foo"))
2674(match:string s)
2675@result{} "blah2002foo"
2676@end lisp
2677
2678
2679@node Backslash Escapes
2680@subsubsection Backslash Escapes
2681
2682Sometimes you will want a regexp to match characters like @samp{*} or
2683@samp{$} exactly. For example, to check whether a particular string
2684represents a menu entry from an Info node, it would be useful to match
2685it against a regexp like @samp{^* [^:]*::}. However, this won't work;
2686because the asterisk is a metacharacter, it won't match the @samp{*} at
2687the beginning of the string. In this case, we want to make the first
2688asterisk un-magic.
2689
2690You can do this by preceding the metacharacter with a backslash
2691character @samp{\}. (This is also called @dfn{quoting} the
2692metacharacter, and is known as a @dfn{backslash escape}.) When Guile
2693sees a backslash in a regular expression, it considers the following
2694glyph to be an ordinary character, no matter what special meaning it
2695would ordinarily have. Therefore, we can make the above example work by
2696changing the regexp to @samp{^\* [^:]*::}. The @samp{\*} sequence tells
2697the regular expression engine to match only a single asterisk in the
2698target string.
2699
2700Since the backslash is itself a metacharacter, you may force a regexp to
2701match a backslash in the target string by preceding the backslash with
2702itself. For example, to find variable references in a @TeX{} program,
2703you might want to find occurrences of the string @samp{\let\} followed
2704by any number of alphabetic characters. The regular expression
2705@samp{\\let\\[A-Za-z]*} would do this: the double backslashes in the
2706regexp each match a single backslash in the target string.
2707
2708@c begin (scm-doc-string "regex.scm" "regexp-quote")
2709@deffn {Scheme Procedure} regexp-quote str
2710Quote each special character found in @var{str} with a backslash, and
2711return the resulting string.
2712@end deffn
2713
2714@strong{Very important:} Using backslash escapes in Guile source code
2715(as in Emacs Lisp or C) can be tricky, because the backslash character
2716has special meaning for the Guile reader. For example, if Guile
2717encounters the character sequence @samp{\n} in the middle of a string
2718while processing Scheme code, it replaces those characters with a
2719newline character. Similarly, the character sequence @samp{\t} is
2720replaced by a horizontal tab. Several of these @dfn{escape sequences}
2721are processed by the Guile reader before your code is executed.
2722Unrecognized escape sequences are ignored: if the characters @samp{\*}
2723appear in a string, they will be translated to the single character
2724@samp{*}.
2725
2726This translation is obviously undesirable for regular expressions, since
2727we want to be able to include backslashes in a string in order to
2728escape regexp metacharacters. Therefore, to make sure that a backslash
2729is preserved in a string in your Guile program, you must use @emph{two}
2730consecutive backslashes:
2731
2732@lisp
2733(define Info-menu-entry-pattern (make-regexp "^\\* [^:]*"))
2734@end lisp
2735
2736The string in this example is preprocessed by the Guile reader before
2737any code is executed. The resulting argument to @code{make-regexp} is
2738the string @samp{^\* [^:]*}, which is what we really want.
2739
2740This also means that in order to write a regular expression that matches
2741a single backslash character, the regular expression string in the
2742source code must include @emph{four} backslashes. Each consecutive pair
2743of backslashes gets translated by the Guile reader to a single
2744backslash, and the resulting double-backslash is interpreted by the
2745regexp engine as matching a single backslash character. Hence:
2746
2747@lisp
2748(define tex-variable-pattern (make-regexp "\\\\let\\\\=[A-Za-z]*"))
2749@end lisp
2750
2751The reason for the unwieldiness of this syntax is historical. Both
2752regular expression pattern matchers and Unix string processing systems
2753have traditionally used backslashes with the special meanings
2754described above. The POSIX regular expression specification and ANSI C
2755standard both require these semantics. Attempting to abandon either
2756convention would cause other kinds of compatibility problems, possibly
2757more severe ones. Therefore, without extending the Scheme reader to
2758support strings with different quoting conventions (an ungainly and
2759confusing extension when implemented in other languages), we must adhere
2760to this cumbersome escape syntax.
2761
2762
2763@node Symbols
2764@subsection Symbols
2765@tpindex Symbols
2766
2767Symbols in Scheme are widely used in three ways: as items of discrete
2768data, as lookup keys for alists and hash tables, and to denote variable
2769references.
2770
2771A @dfn{symbol} is similar to a string in that it is defined by a
2772sequence of characters. The sequence of characters is known as the
2773symbol's @dfn{name}. In the usual case --- that is, where the symbol's
2774name doesn't include any characters that could be confused with other
2775elements of Scheme syntax --- a symbol is written in a Scheme program by
2776writing the sequence of characters that make up the name, @emph{without}
2777any quotation marks or other special syntax. For example, the symbol
2778whose name is ``multiply-by-2'' is written, simply:
2779
2780@lisp
2781multiply-by-2
2782@end lisp
2783
2784Notice how this differs from a @emph{string} with contents
2785``multiply-by-2'', which is written with double quotation marks, like
2786this:
2787
2788@lisp
2789"multiply-by-2"
2790@end lisp
2791
2792Looking beyond how they are written, symbols are different from strings
2793in two important respects.
2794
2795The first important difference is uniqueness. If the same-looking
2796string is read twice from two different places in a program, the result
2797is two @emph{different} string objects whose contents just happen to be
2798the same. If, on the other hand, the same-looking symbol is read twice
2799from two different places in a program, the result is the @emph{same}
2800symbol object both times.
2801
2802Given two read symbols, you can use @code{eq?} to test whether they are
2803the same (that is, have the same name). @code{eq?} is the most
2804efficient comparison operator in Scheme, and comparing two symbols like
2805this is as fast as comparing, for example, two numbers. Given two
2806strings, on the other hand, you must use @code{equal?} or
2807@code{string=?}, which are much slower comparison operators, to
2808determine whether the strings have the same contents.
2809
2810@lisp
2811(define sym1 (quote hello))
2812(define sym2 (quote hello))
2813(eq? sym1 sym2) @result{} #t
2814
2815(define str1 "hello")
2816(define str2 "hello")
2817(eq? str1 str2) @result{} #f
2818(equal? str1 str2) @result{} #t
2819@end lisp
2820
2821The second important difference is that symbols, unlike strings, are not
2822self-evaluating. This is why we need the @code{(quote @dots{})}s in the
2823example above: @code{(quote hello)} evaluates to the symbol named
2824"hello" itself, whereas an unquoted @code{hello} is @emph{read} as the
2825symbol named "hello" and evaluated as a variable reference @dots{} about
2826which more below (@pxref{Symbol Variables}).
2827
2828@menu
2829* Symbol Data:: Symbols as discrete data.
2830* Symbol Keys:: Symbols as lookup keys.
2831* Symbol Variables:: Symbols as denoting variables.
2832* Symbol Primitives:: Operations related to symbols.
2833* Symbol Props:: Function slots and property lists.
2834* Symbol Read Syntax:: Extended read syntax for symbols.
2835* Symbol Uninterned:: Uninterned symbols.
2836@end menu
2837
2838
2839@node Symbol Data
2840@subsubsection Symbols as Discrete Data
2841
2842Numbers and symbols are similar to the extent that they both lend
2843themselves to @code{eq?} comparison. But symbols are more descriptive
2844than numbers, because a symbol's name can be used directly to describe
2845the concept for which that symbol stands.
2846
2847For example, imagine that you need to represent some colours in a
2848computer program. Using numbers, you would have to choose arbitrarily
2849some mapping between numbers and colours, and then take care to use that
2850mapping consistently:
2851
2852@lisp
2853;; 1=red, 2=green, 3=purple
2854
2855(if (eq? (colour-of car) 1)
2856 ...)
2857@end lisp
2858
2859@noindent
2860You can make the mapping more explicit and the code more readable by
2861defining constants:
2862
2863@lisp
2864(define red 1)
2865(define green 2)
2866(define purple 3)
2867
2868(if (eq? (colour-of car) red)
2869 ...)
2870@end lisp
2871
2872@noindent
2873But the simplest and clearest approach is not to use numbers at all, but
2874symbols whose names specify the colours that they refer to:
2875
2876@lisp
2877(if (eq? (colour-of car) 'red)
2878 ...)
2879@end lisp
2880
2881The descriptive advantages of symbols over numbers increase as the set
2882of concepts that you want to describe grows. Suppose that a car object
2883can have other properties as well, such as whether it has or uses:
2884
2885@itemize @bullet
2886@item
2887automatic or manual transmission
2888@item
2889leaded or unleaded fuel
2890@item
2891power steering (or not).
2892@end itemize
2893
2894@noindent
2895Then a car's combined property set could be naturally represented and
2896manipulated as a list of symbols:
2897
2898@lisp
2899(properties-of car1)
2900@result{}
2901(red manual unleaded power-steering)
2902
2903(if (memq 'power-steering (properties-of car1))
2904 (display "Unfit people can drive this car.\n")
2905 (display "You'll need strong arms to drive this car!\n"))
2906@print{}
2907Unfit people can drive this car.
2908@end lisp
2909
2910Remember, the fundamental property of symbols that we are relying on
2911here is that an occurrence of @code{'red} in one part of a program is an
2912@emph{indistinguishable} symbol from an occurrence of @code{'red} in
2913another part of a program; this means that symbols can usefully be
2914compared using @code{eq?}. At the same time, symbols have naturally
2915descriptive names. This combination of efficiency and descriptive power
2916makes them ideal for use as discrete data.
2917
2918
2919@node Symbol Keys
2920@subsubsection Symbols as Lookup Keys
2921
2922Given their efficiency and descriptive power, it is natural to use
2923symbols as the keys in an association list or hash table.
2924
2925To illustrate this, consider a more structured representation of the car
2926properties example from the preceding subsection. Rather than
2927mixing all the properties up together in a flat list, we could use an
2928association list like this:
2929
2930@lisp
2931(define car1-properties '((colour . red)
2932 (transmission . manual)
2933 (fuel . unleaded)
2934 (steering . power-assisted)))
2935@end lisp
2936
2937Notice how this structure is more explicit and extensible than the flat
2938list. For example it makes clear that @code{manual} refers to the
2939transmission rather than, say, the windows or the locking of the car.
2940It also allows further properties to use the same symbols among their
2941possible values without becoming ambiguous:
2942
2943@lisp
2944(define car1-properties '((colour . red)
2945 (transmission . manual)
2946 (fuel . unleaded)
2947 (steering . power-assisted)
2948 (seat-colour . red)
2949 (locking . manual)))
2950@end lisp
2951
2952With a representation like this, it is easy to use the efficient
2953@code{assq-XXX} family of procedures (@pxref{Association Lists}) to
2954extract or change individual pieces of information:
2955
2956@lisp
2957(assq-ref car1-properties 'fuel) @result{} unleaded
2958(assq-ref car1-properties 'transmission) @result{} manual
2959
2960(assq-set! car1-properties 'seat-colour 'black)
2961@result{}
2962((colour . red)
2963 (transmission . manual)
2964 (fuel . unleaded)
2965 (steering . power-assisted)
2966 (seat-colour . black)
2967 (locking . manual)))
2968@end lisp
2969
2970Hash tables also have keys, and exactly the same arguments apply to the
2971use of symbols in hash tables as in association lists. The hash value
2972that Guile uses to decide where to add a symbol-keyed entry to a hash
2973table can be obtained by calling the @code{symbol-hash} procedure:
2974
2975@deffn {Scheme Procedure} symbol-hash symbol
2976@deffnx {C Function} scm_symbol_hash (symbol)
2977Return a hash value for @var{symbol}.
2978@end deffn
2979
2980See @ref{Hash Tables} for information about hash tables in general, and
2981for why you might choose to use a hash table rather than an association
2982list.
2983
2984
2985@node Symbol Variables
2986@subsubsection Symbols as Denoting Variables
2987
2988When an unquoted symbol in a Scheme program is evaluated, it is
2989interpreted as a variable reference, and the result of the evaluation is
2990the appropriate variable's value.
2991
2992For example, when the expression @code{(string-length "abcd")} is read
2993and evaluated, the sequence of characters @code{string-length} is read
2994as the symbol whose name is "string-length". This symbol is associated
2995with a variable whose value is the procedure that implements string
2996length calculation. Therefore evaluation of the @code{string-length}
2997symbol results in that procedure.
2998
2999The details of the connection between an unquoted symbol and the
3000variable to which it refers are explained elsewhere. See @ref{Binding
3001Constructs}, for how associations between symbols and variables are
3002created, and @ref{Modules}, for how those associations are affected by
3003Guile's module system.
3004
3005
3006@node Symbol Primitives
3007@subsubsection Operations Related to Symbols
3008
3009Given any Scheme value, you can determine whether it is a symbol using
3010the @code{symbol?} primitive:
3011
3012@rnindex symbol?
3013@deffn {Scheme Procedure} symbol? obj
3014@deffnx {C Function} scm_symbol_p (obj)
3015Return @code{#t} if @var{obj} is a symbol, otherwise return
3016@code{#f}.
3017@end deffn
3018
3019Once you know that you have a symbol, you can obtain its name as a
3020string by calling @code{symbol->string}. Note that Guile differs by
3021default from R5RS on the details of @code{symbol->string} as regards
3022case-sensitivity:
3023
3024@rnindex symbol->string
3025@deffn {Scheme Procedure} symbol->string s
3026@deffnx {C Function} scm_symbol_to_string (s)
3027Return the name of symbol @var{s} as a string. By default, Guile reads
3028symbols case-sensitively, so the string returned will have the same case
3029variation as the sequence of characters that caused @var{s} to be
3030created.
3031
3032If Guile is set to read symbols case-insensitively (as specified by
3033R5RS), and @var{s} comes into being as part of a literal expression
3034(@pxref{Literal expressions,,,r5rs, The Revised^5 Report on Scheme}) or
3035by a call to the @code{read} or @code{string-ci->symbol} procedures,
3036Guile converts any alphabetic characters in the symbol's name to
3037lower case before creating the symbol object, so the string returned
3038here will be in lower case.
3039
3040If @var{s} was created by @code{string->symbol}, the case of characters
3041in the string returned will be the same as that in the string that was
3042passed to @code{string->symbol}, regardless of Guile's case-sensitivity
3043setting at the time @var{s} was created.
3044
3045It is an error to apply mutation procedures like @code{string-set!} to
3046strings returned by this procedure.
3047@end deffn
3048
3049Most symbols are created by writing them literally in code. However it
3050is also possible to create symbols programmatically using the following
3051@code{string->symbol} and @code{string-ci->symbol} procedures:
3052
3053@rnindex string->symbol
3054@deffn {Scheme Procedure} string->symbol string
3055@deffnx {C Function} scm_string_to_symbol (string)
3056Return the symbol whose name is @var{string}. This procedure can create
3057symbols with names containing special characters or letters in the
3058non-standard case, but it is usually a bad idea to create such symbols
3059because in some implementations of Scheme they cannot be read as
3060themselves.
3061@end deffn
3062
3063@deffn {Scheme Procedure} string-ci->symbol str
3064@deffnx {C Function} scm_string_ci_to_symbol (str)
3065Return the symbol whose name is @var{str}. If Guile is currently
3066reading symbols case-insensitively, @var{str} is converted to lowercase
3067before the returned symbol is looked up or created.
3068@end deffn
3069
3070The following examples illustrate Guile's detailed behaviour as regards
3071the case-sensitivity of symbols:
3072
3073@lisp
3074(read-enable 'case-insensitive) ; R5RS compliant behaviour
3075
3076(symbol->string 'flying-fish) @result{} "flying-fish"
3077(symbol->string 'Martin) @result{} "martin"
3078(symbol->string
3079 (string->symbol "Malvina")) @result{} "Malvina"
3080
3081(eq? 'mISSISSIppi 'mississippi) @result{} #t
3082(string->symbol "mISSISSIppi") @result{} mISSISSIppi
3083(eq? 'bitBlt (string->symbol "bitBlt")) @result{} #f
3084(eq? 'LolliPop
3085 (string->symbol (symbol->string 'LolliPop))) @result{} #t
3086(string=? "K. Harper, M.D."
3087 (symbol->string
3088 (string->symbol "K. Harper, M.D."))) @result{} #t
3089
3090(read-disable 'case-insensitive) ; Guile default behaviour
3091
3092(symbol->string 'flying-fish) @result{} "flying-fish"
3093(symbol->string 'Martin) @result{} "Martin"
3094(symbol->string
3095 (string->symbol "Malvina")) @result{} "Malvina"
3096
3097(eq? 'mISSISSIppi 'mississippi) @result{} #f
3098(string->symbol "mISSISSIppi") @result{} mISSISSIppi
3099(eq? 'bitBlt (string->symbol "bitBlt")) @result{} #t
3100(eq? 'LolliPop
3101 (string->symbol (symbol->string 'LolliPop))) @result{} #t
3102(string=? "K. Harper, M.D."
3103 (symbol->string
3104 (string->symbol "K. Harper, M.D."))) @result{} #t
3105@end lisp
3106
3107From C, there are lower level functions that construct a Scheme symbol
3108from a null terminated C string or from a sequence of bytes whose length
3109is specified explicitly.
3110
3111@deffn {C Function} scm_str2symbol (const char * name)
3112@deffnx {C Function} scm_mem2symbol (const char * name, size_t len)
3113Construct and return a Scheme symbol whose name is specified by
3114@var{name}. For @code{scm_str2symbol} @var{name} must be null
3115terminated; For @code{scm_mem2symbol} the length of @var{name} is
3116specified explicitly by @var{len}.
3117@end deffn
3118
3119Finally, some applications, especially those that generate new Scheme
3120code dynamically, need to generate symbols for use in the generated
3121code. The @code{gensym} primitive meets this need:
3122
3123@deffn {Scheme Procedure} gensym [prefix]
3124@deffnx {C Function} scm_gensym (prefix)
3125Create a new symbol with a name constructed from a prefix and a counter
3126value. The string @var{prefix} can be specified as an optional
3127argument. Default prefix is @samp{@w{ g}}. The counter is increased by 1
3128at each call. There is no provision for resetting the counter.
3129@end deffn
3130
3131The symbols generated by @code{gensym} are @emph{likely} to be unique,
3132since their names begin with a space and it is only otherwise possible
3133to generate such symbols if a programmer goes out of their way to do
3134so. Uniqueness can be guaranteed by instead using uninterned symbols
3135(@pxref{Symbol Uninterned}), though they can't be usefully written out
3136and read back in.
3137
3138
3139@node Symbol Props
3140@subsubsection Function Slots and Property Lists
3141
3142In traditional Lisp dialects, symbols are often understood as having
3143three kinds of value at once:
3144
3145@itemize @bullet
3146@item
3147a @dfn{variable} value, which is used when the symbol appears in
3148code in a variable reference context
3149
3150@item
3151a @dfn{function} value, which is used when the symbol appears in
3152code in a function name position (i.e. as the first element in an
3153unquoted list)
3154
3155@item
3156a @dfn{property list} value, which is used when the symbol is given as
3157the first argument to Lisp's @code{put} or @code{get} functions.
3158@end itemize
3159
3160Although Scheme (as one of its simplifications with respect to Lisp)
3161does away with the distinction between variable and function namespaces,
3162Guile currently retains some elements of the traditional structure in
3163case they turn out to be useful when implementing translators for other
3164languages, in particular Emacs Lisp.
3165
3166Specifically, Guile symbols have two extra slots. for a symbol's
3167property list, and for its ``function value.'' The following procedures
3168are provided to access these slots.
3169
3170@deffn {Scheme Procedure} symbol-fref symbol
3171@deffnx {C Function} scm_symbol_fref (symbol)
3172Return the contents of @var{symbol}'s @dfn{function slot}.
3173@end deffn
3174
3175@deffn {Scheme Procedure} symbol-fset! symbol value
3176@deffnx {C Function} scm_symbol_fset_x (symbol, value)
3177Set the contents of @var{symbol}'s function slot to @var{value}.
3178@end deffn
3179
3180@deffn {Scheme Procedure} symbol-pref symbol
3181@deffnx {C Function} scm_symbol_pref (symbol)
3182Return the @dfn{property list} currently associated with @var{symbol}.
3183@end deffn
3184
3185@deffn {Scheme Procedure} symbol-pset! symbol value
3186@deffnx {C Function} scm_symbol_pset_x (symbol, value)
3187Set @var{symbol}'s property list to @var{value}.
3188@end deffn
3189
3190@deffn {Scheme Procedure} symbol-property sym prop
3191From @var{sym}'s property list, return the value for property
3192@var{prop}. The assumption is that @var{sym}'s property list is an
3193association list whose keys are distinguished from each other using
3194@code{equal?}; @var{prop} should be one of the keys in that list. If
3195the property list has no entry for @var{prop}, @code{symbol-property}
3196returns @code{#f}.
3197@end deffn
3198
3199@deffn {Scheme Procedure} set-symbol-property! sym prop val
3200In @var{sym}'s property list, set the value for property @var{prop} to
3201@var{val}, or add a new entry for @var{prop}, with value @var{val}, if
3202none already exists. For the structure of the property list, see
3203@code{symbol-property}.
3204@end deffn
3205
3206@deffn {Scheme Procedure} symbol-property-remove! sym prop
3207From @var{sym}'s property list, remove the entry for property
3208@var{prop}, if there is one. For the structure of the property list,
3209see @code{symbol-property}.
3210@end deffn
3211
3212Support for these extra slots may be removed in a future release, and it
3213is probably better to avoid using them. (In release 1.6, Guile itself
3214uses the property list slot sparingly, and the function slot not at
3215all.) For a more modern and Schemely approach to properties, see
3216@ref{Object Properties}.
3217
3218
3219@node Symbol Read Syntax
3220@subsubsection Extended Read Syntax for Symbols
3221
3222The read syntax for a symbol is a sequence of letters, digits, and
3223@dfn{extended alphabetic characters}, beginning with a character that
3224cannot begin a number. In addition, the special cases of @code{+},
3225@code{-}, and @code{...} are read as symbols even though numbers can
3226begin with @code{+}, @code{-} or @code{.}.
3227
3228Extended alphabetic characters may be used within identifiers as if
3229they were letters. The set of extended alphabetic characters is:
3230
3231@example
3232! $ % & * + - . / : < = > ? @@ ^ _ ~
3233@end example
3234
3235In addition to the standard read syntax defined above (which is taken
3236from R5RS (@pxref{Formal syntax,,,r5rs,The Revised^5 Report on
3237Scheme})), Guile provides an extended symbol read syntax that allows the
3238inclusion of unusual characters such as space characters, newlines and
3239parentheses. If (for whatever reason) you need to write a symbol
3240containing characters not mentioned above, you can do so as follows.
3241
3242@itemize @bullet
3243@item
3244Begin the symbol with the characters @code{#@{},
3245
3246@item
3247write the characters of the symbol and
3248
3249@item
3250finish the symbol with the characters @code{@}#}.
3251@end itemize
3252
3253Here are a few examples of this form of read syntax. The first symbol
3254needs to use extended syntax because it contains a space character, the
3255second because it contains a line break, and the last because it looks
3256like a number.
3257
3258@lisp
3259#@{foo bar@}#
3260
3261#@{what
3262ever@}#
3263
3264#@{4242@}#
3265@end lisp
3266
3267Although Guile provides this extended read syntax for symbols,
3268widespread usage of it is discouraged because it is not portable and not
3269very readable.
3270
3271
3272@node Symbol Uninterned
3273@subsubsection Uninterned Symbols
3274
3275What makes symbols useful is that they are automatically kept unique.
3276There are no two symbols that are distinct objects but have the same
3277name. But of course, there is no rule without exception. In addition
3278to the normal symbols that have been discussed up to now, you can also
3279create special @dfn{uninterned} symbols that behave slightly
3280differently.
3281
3282To understand what is different about them and why they might be useful,
3283we look at how normal symbols are actually kept unique.
3284
3285Whenever Guile wants to find the symbol with a specific name, for
3286example during @code{read} or when executing @code{string->symbol}, it
3287first looks into a table of all existing symbols to find out whether a
3288symbol with the given name already exists. When this is the case, Guile
3289just returns that symbol. When not, a new symbol with the name is
3290created and entered into the table so that it can be found later.
3291
3292Sometimes you might want to create a symbol that is guaranteed `fresh',
3293i.e. a symbol that did not exist previously. You might also want to
3294somehow guarantee that no one else will ever unintentionally stumble
3295across your symbol in the future. These properties of a symbol are
3296often needed when generating code during macro expansion. When
3297introducing new temporary variables, you want to guarantee that they
3298don't conflict with variables in other people's code.
3299
3300The simplest way to arrange for this is to create a new symbol but
3301not enter it into the global table of all symbols. That way, no one
3302will ever get access to your symbol by chance. Symbols that are not in
3303the table are called @dfn{uninterned}. Of course, symbols that
3304@emph{are} in the table are called @dfn{interned}.
3305
3306You create new uninterned symbols with the function @code{make-symbol}.
3307You can test whether a symbol is interned or not with
3308@code{symbol-interned?}.
3309
3310Uninterned symbols break the rule that the name of a symbol uniquely
3311identifies the symbol object. Because of this, they can not be written
3312out and read back in like interned symbols. Currently, Guile has no
3313support for reading uninterned symbols. Note that the function
3314@code{gensym} does not return uninterned symbols for this reason.
3315
3316@deffn {Scheme Procedure} make-symbol name
3317@deffnx {C Function} scm_make_symbol (name)
3318Return a new uninterned symbol with the name @var{name}. The returned
3319symbol is guaranteed to be unique and future calls to
3320@code{string->symbol} will not return it.
3321@end deffn
3322
3323@deffn {Scheme Procedure} symbol-interned? symbol
3324@deffnx {C Function} scm_symbol_interned_p (symbol)
3325Return @code{#t} if @var{symbol} is interned, otherwise return
3326@code{#f}.
3327@end deffn
3328
3329For example:
3330
3331@lisp
3332(define foo-1 (string->symbol "foo"))
3333(define foo-2 (string->symbol "foo"))
3334(define foo-3 (make-symbol "foo"))
3335(define foo-4 (make-symbol "foo"))
3336
3337(eq? foo-1 foo-2)
3338@result{} #t
3339; Two interned symbols with the same name are the same object,
3340
3341(eq? foo-1 foo-3)
3342@result{} #f
3343; but a call to make-symbol with the same name returns a
3344; distinct object.
3345
3346(eq? foo-3 foo-4)
3347@result{} #f
3348; A call to make-symbol always returns a new object, even for
3349; the same name.
3350
3351foo-3
3352@result{} #<uninterned-symbol foo 8085290>
3353; Uninterned symbols print differently from interned symbols,
3354
3355(symbol? foo-3)
3356@result{} #t
3357; but they are still symbols,
3358
3359(symbol-interned? foo-3)
3360@result{} #f
3361; just not interned.
3362@end lisp
3363
3364
3365@node Keywords
3366@subsection Keywords
3367@tpindex Keywords
3368
3369Keywords are self-evaluating objects with a convenient read syntax that
3370makes them easy to type.
3371
3372Guile's keyword support conforms to R5RS, and adds a (switchable) read
3373syntax extension to permit keywords to begin with @code{:} as well as
3374@code{#:}.
3375
3376@menu
3377* Why Use Keywords?:: Motivation for keyword usage.
3378* Coding With Keywords:: How to use keywords.
3379* Keyword Read Syntax:: Read syntax for keywords.
3380* Keyword Procedures:: Procedures for dealing with keywords.
3381* Keyword Primitives:: The underlying primitive procedures.
3382@end menu
3383
3384@node Why Use Keywords?
3385@subsubsection Why Use Keywords?
3386
3387Keywords are useful in contexts where a program or procedure wants to be
3388able to accept a large number of optional arguments without making its
3389interface unmanageable.
3390
3391To illustrate this, consider a hypothetical @code{make-window}
3392procedure, which creates a new window on the screen for drawing into
3393using some graphical toolkit. There are many parameters that the caller
3394might like to specify, but which could also be sensibly defaulted, for
3395example:
3396
3397@itemize @bullet
3398@item
3399color depth -- Default: the color depth for the screen
3400
3401@item
3402background color -- Default: white
3403
3404@item
3405width -- Default: 600
3406
3407@item
3408height -- Default: 400
3409@end itemize
3410
3411If @code{make-window} did not use keywords, the caller would have to
3412pass in a value for each possible argument, remembering the correct
3413argument order and using a special value to indicate the default value
3414for that argument:
3415
3416@lisp
3417(make-window 'default ;; Color depth
3418 'default ;; Background color
3419 800 ;; Width
3420 100 ;; Height
3421 @dots{}) ;; More make-window arguments
3422@end lisp
3423
3424With keywords, on the other hand, defaulted arguments are omitted, and
3425non-default arguments are clearly tagged by the appropriate keyword. As
3426a result, the invocation becomes much clearer:
3427
3428@lisp
3429(make-window #:width 800 #:height 100)
3430@end lisp
3431
3432On the other hand, for a simpler procedure with few arguments, the use
3433of keywords would be a hindrance rather than a help. The primitive
3434procedure @code{cons}, for example, would not be improved if it had to
3435be invoked as
3436
3437@lisp
3438(cons #:car x #:cdr y)
3439@end lisp
3440
3441So the decision whether to use keywords or not is purely pragmatic: use
3442them if they will clarify the procedure invocation at point of call.
3443
3444@node Coding With Keywords
3445@subsubsection Coding With Keywords
3446
3447If a procedure wants to support keywords, it should take a rest argument
3448and then use whatever means is convenient to extract keywords and their
3449corresponding arguments from the contents of that rest argument.
3450
3451The following example illustrates the principle: the code for
3452@code{make-window} uses a helper procedure called
3453@code{get-keyword-value} to extract individual keyword arguments from
3454the rest argument.
3455
3456@lisp
3457(define (get-keyword-value args keyword default)
3458 (let ((kv (memq keyword args)))
3459 (if (and kv (>= (length kv) 2))
3460 (cadr kv)
3461 default)))
3462
3463(define (make-window . args)
3464 (let ((depth (get-keyword-value args #:depth screen-depth))
3465 (bg (get-keyword-value args #:bg "white"))
3466 (width (get-keyword-value args #:width 800))
3467 (height (get-keyword-value args #:height 100))
3468 @dots{})
3469 @dots{}))
3470@end lisp
3471
3472But you don't need to write @code{get-keyword-value}. The @code{(ice-9
3473optargs)} module provides a set of powerful macros that you can use to
3474implement keyword-supporting procedures like this:
3475
3476@lisp
3477(use-modules (ice-9 optargs))
3478
3479(define (make-window . args)
3480 (let-keywords args #f ((depth screen-depth)
3481 (bg "white")
3482 (width 800)
3483 (height 100))
3484 ...))
3485@end lisp
3486
3487@noindent
3488Or, even more economically, like this:
3489
3490@lisp
3491(use-modules (ice-9 optargs))
3492
3493(define* (make-window #:key (depth screen-depth)
3494 (bg "white")
3495 (width 800)
3496 (height 100))
3497 ...)
3498@end lisp
3499
3500For further details on @code{let-keywords}, @code{define*} and other
3501facilities provided by the @code{(ice-9 optargs)} module, see
3502@ref{Optional Arguments}.
3503
3504
3505@node Keyword Read Syntax
3506@subsubsection Keyword Read Syntax
3507
3508Guile, by default, only recognizes the keyword syntax specified by R5RS.
3509A token of the form @code{#:NAME}, where @code{NAME} has the same syntax
3510as a Scheme symbol (@pxref{Symbol Read Syntax}), is the external
3511representation of the keyword named @code{NAME}. Keyword objects print
3512using this syntax as well, so values containing keyword objects can be
3513read back into Guile. When used in an expression, keywords are
3514self-quoting objects.
3515
3516If the @code{keyword} read option is set to @code{'prefix}, Guile also
3517recognizes the alternative read syntax @code{:NAME}. Otherwise, tokens
3518of the form @code{:NAME} are read as symbols, as required by R5RS.
3519
3520To enable and disable the alternative non-R5RS keyword syntax, you use
3521the @code{read-set!} procedure documented in @ref{User level options
3522interfaces} and @ref{Reader options}.
3523
3524@smalllisp
3525(read-set! keywords 'prefix)
3526
3527#:type
3528@result{}
3529#:type
3530
3531:type
3532@result{}
3533#:type
3534
3535(read-set! keywords #f)
3536
3537#:type
3538@result{}
3539#:type
3540
3541:type
3542@print{}
3543ERROR: In expression :type:
3544ERROR: Unbound variable: :type
3545ABORT: (unbound-variable)
3546@end smalllisp
3547
3548@node Keyword Procedures
3549@subsubsection Keyword Procedures
3550
3551The following procedures can be used for converting symbols to keywords
3552and back.
3553
3554@deffn {Scheme Procedure} symbol->keyword sym
3555Return a keyword with the same characters as in @var{sym}.
3556@end deffn
3557
3558@deffn {Scheme Procedure} keyword->symbol kw
3559Return a symbol with the same characters as in @var{kw}.
3560@end deffn
3561
3562
3563@node Keyword Primitives
3564@subsubsection Keyword Primitives
3565
3566Internally, a keyword is implemented as something like a tagged symbol,
3567where the tag identifies the keyword as being self-evaluating, and the
3568symbol, known as the keyword's @dfn{dash symbol} has the same name as
3569the keyword name but prefixed by a single dash. For example, the
3570keyword @code{#:name} has the corresponding dash symbol @code{-name}.
3571
3572Most keyword objects are constructed automatically by the reader when it
3573reads a token beginning with @code{#:}. However, if you need to
3574construct a keyword object programmatically, you can do so by calling
3575@code{make-keyword-from-dash-symbol} with the corresponding dash symbol
3576(as the reader does). The dash symbol for a keyword object can be
3577retrieved using the @code{keyword-dash-symbol} procedure.
3578
3579@deffn {Scheme Procedure} make-keyword-from-dash-symbol symbol
3580@deffnx {C Function} scm_make_keyword_from_dash_symbol (symbol)
3581Make a keyword object from a @var{symbol} that starts with a dash.
3582For example,
3583
3584@example
3585(make-keyword-from-dash-symbol '-foo)
3586@result{} #:foo
3587@end example
3588@end deffn
3589
3590@deffn {Scheme Procedure} keyword? obj
3591@deffnx {C Function} scm_keyword_p (obj)
3592Return @code{#t} if the argument @var{obj} is a keyword, else
3593@code{#f}.
3594@end deffn
3595
3596@deffn {Scheme Procedure} keyword-dash-symbol keyword
3597@deffnx {C Function} scm_keyword_dash_symbol (keyword)
3598Return the dash symbol for @var{keyword}.
3599This is the inverse of @code{make-keyword-from-dash-symbol}.
3600For example,
3601
3602@example
3603(keyword-dash-symbol #:foo)
3604@result{} -foo
3605@end example
3606@end deffn
3607
3608@deftypefn {C Function} SCM scm_c_make_keyword (char *@var{str})
3609Make a keyword object from a string. For example,
3610
3611@example
3612scm_c_make_keyword ("foo")
3613@result{} #:foo
3614@end example
3615@c
3616@c FIXME: What can be said about the string argument? Currently it's
3617@c not used after creation, but should that be documented?
3618@end deftypefn
3619
3620
3621@node Other Types
3622@subsection ``Functionality-Centric'' Data Types
3623
3624Procedures and macros are documented in their own chapter: see
3625@ref{Procedures and Macros}.
3626
3627Variable objects are documented as part of the description of Guile's
3628module system: see @ref{Variables}.
3629
3630Asyncs, dynamic roots and fluids are described in the chapter on
3631scheduling: see @ref{Scheduling}.
3632
3633Hooks are documented in the chapter on general utility functions: see
3634@ref{Hooks}.
3635
3636Ports are described in the chapter on I/O: see @ref{Input and Output}.
3637
3638
3639@c Local Variables:
3640@c TeX-master: "guile.texi"
3641@c End: