* doc/lispref/internals.texi (Writing Emacs Primitives): Remove unnecessary reference...
[bpt/emacs.git] / doc / lispref / internals.texi
CommitLineData
b8d4c8d0
GM
1@c -*-texinfo-*-
2@c This is part of the GNU Emacs Lisp Reference Manual.
ab422c4d
PE
3@c Copyright (C) 1990-1993, 1998-1999, 2001-2013 Free Software
4@c Foundation, Inc.
b8d4c8d0 5@c See the file elisp.texi for copying conditions.
ecc6530d 6@node GNU Emacs Internals
b8d4c8d0
GM
7@appendix GNU Emacs Internals
8
9This chapter describes how the runnable Emacs executable is dumped with
10the preloaded Lisp libraries in it, how storage is allocated, and some
11internal aspects of GNU Emacs that may be of interest to C programmers.
12
13@menu
14* Building Emacs:: How the dumped Emacs is made.
333f9019 15* Pure Storage:: Kludge to make preloaded Lisp functions shareable.
b8d4c8d0
GM
16* Garbage Collection:: Reclaiming space for Lisp objects no longer used.
17* Memory Usage:: Info about total size of Lisp objects made so far.
18* Writing Emacs Primitives:: Writing C code for Emacs.
19* Object Internals:: Data formats of buffers, windows, processes.
d92d9c95 20* C Integer Types:: How C integer types are used inside Emacs.
b8d4c8d0
GM
21@end menu
22
23@node Building Emacs
520f2425 24@section Building Emacs
b8d4c8d0
GM
25@cindex building Emacs
26@pindex temacs
27
28 This section explains the steps involved in building the Emacs
29executable. You don't have to know this material to build and install
30Emacs, since the makefiles do all these things automatically. This
d15aac68 31information is pertinent to Emacs developers.
b8d4c8d0
GM
32
33 Compilation of the C source files in the @file{src} directory
34produces an executable file called @file{temacs}, also called a
d15aac68
CY
35@dfn{bare impure Emacs}. It contains the Emacs Lisp interpreter and
36I/O routines, but not the editing commands.
b8d4c8d0
GM
37
38@cindex @file{loadup.el}
d15aac68
CY
39 The command @w{@command{temacs -l loadup}} would run @file{temacs}
40and direct it to load @file{loadup.el}. The @code{loadup} library
41loads additional Lisp libraries, which set up the normal Emacs editing
42environment. After this step, the Emacs executable is no longer
43@dfn{bare}.
b8d4c8d0
GM
44
45@cindex dumping Emacs
d15aac68
CY
46 Because it takes some time to load the standard Lisp files, the
47@file{temacs} executable usually isn't run directly by users.
48Instead, as one of the last steps of building Emacs, the command
49@samp{temacs -batch -l loadup dump} is run. The special @samp{dump}
50argument causes @command{temacs} to dump out an executable program,
51called @file{emacs}, which has all the standard Lisp files preloaded.
52(The @samp{-batch} argument prevents @file{temacs} from trying to
53initialize any of its data on the terminal, so that the tables of
54terminal information are empty in the dumped Emacs.)
55
56@cindex preloaded Lisp files
b578c9cc 57@vindex preloaded-file-list
d15aac68
CY
58 The dumped @file{emacs} executable (also called a @dfn{pure} Emacs)
59is the one which is installed. The variable
60@code{preloaded-file-list} stores a list of the Lisp files preloaded
61into the dumped Emacs. If you port Emacs to a new operating system,
62and are not able to implement dumping, then Emacs must load
63@file{loadup.el} each time it starts.
b8d4c8d0
GM
64
65@cindex @file{site-load.el}
b8d4c8d0 66 You can specify additional files to preload by writing a library named
520f2425
GM
67@file{site-load.el} that loads them. You may need to rebuild Emacs
68with an added definition
b8d4c8d0
GM
69
70@example
71#define SITELOAD_PURESIZE_EXTRA @var{n}
72@end example
73
74@noindent
520f2425
GM
75to make @var{n} added bytes of pure space to hold the additional files;
76see @file{src/puresize.h}.
b8d4c8d0
GM
77(Try adding increments of 20000 until it is big enough.) However, the
78advantage of preloading additional files decreases as machines get
79faster. On modern machines, it is usually not advisable.
80
81 After @file{loadup.el} reads @file{site-load.el}, it finds the
82documentation strings for primitive and preloaded functions (and
83variables) in the file @file{etc/DOC} where they are stored, by
84calling @code{Snarf-documentation} (@pxref{Definition of
85Snarf-documentation,, Accessing Documentation}).
86
87@cindex @file{site-init.el}
88@cindex preloading additional functions and variables
89 You can specify other Lisp expressions to execute just before dumping
90by putting them in a library named @file{site-init.el}. This file is
91executed after the documentation strings are found.
92
93 If you want to preload function or variable definitions, there are
94three ways you can do this and make their documentation strings
95accessible when you subsequently run Emacs:
96
97@itemize @bullet
98@item
99Arrange to scan these files when producing the @file{etc/DOC} file,
100and load them with @file{site-load.el}.
101
102@item
103Load the files with @file{site-init.el}, then copy the files into the
104installation directory for Lisp files when you install Emacs.
105
106@item
5906db45
GM
107Specify a @code{nil} value for @code{byte-compile-dynamic-docstrings}
108as a local variable in each of these files, and load them with either
109@file{site-load.el} or @file{site-init.el}. (This method has the
110drawback that the documentation strings take up space in Emacs all the
111time.)
b8d4c8d0
GM
112@end itemize
113
114 It is not advisable to put anything in @file{site-load.el} or
115@file{site-init.el} that would alter any of the features that users
116expect in an ordinary unmodified Emacs. If you feel you must override
117normal features for your site, do it with @file{default.el}, so that
118users can override your changes if they wish. @xref{Startup Summary}.
119
520f2425
GM
120 In a package that can be preloaded, it is sometimes necessary (or
121useful) to delay certain evaluations until Emacs subsequently starts
122up. The vast majority of such cases relate to the values of
123customizable variables. For example, @code{tutorial-directory} is a
124variable defined in @file{startup.el}, which is preloaded. The default
125value is set based on @code{data-directory}. The variable needs to
126access the value of @code{data-directory} when Emacs starts, not when
127it is dumped, because the Emacs executable has probably been installed
128in a different location since it was dumped.
129
130@defun custom-initialize-delay symbol value
131This function delays the initialization of @var{symbol} to the next
132Emacs start. You normally use this function by specifying it as the
133@code{:initialize} property of a customizable variable. (The argument
f003f294 134@var{value} is unused, and is provided only for compatibility with the
520f2425
GM
135form Custom expects.)
136@end defun
b8d4c8d0 137
520f2425
GM
138In the unlikely event that you need a more general functionality than
139@code{custom-initialize-delay} provides, you can use
140@code{before-init-hook} (@pxref{Startup Summary}).
b8d4c8d0
GM
141
142@defun dump-emacs to-file from-file
143@cindex unexec
144This function dumps the current state of Emacs into an executable file
145@var{to-file}. It takes symbols from @var{from-file} (this is normally
146the executable file @file{temacs}).
147
148If you want to use this function in an Emacs that was already dumped,
149you must run Emacs with @samp{-batch}.
150@end defun
151
152@node Pure Storage
520f2425 153@section Pure Storage
b8d4c8d0
GM
154@cindex pure storage
155
156 Emacs Lisp uses two kinds of storage for user-created Lisp objects:
157@dfn{normal storage} and @dfn{pure storage}. Normal storage is where
999dd333
GM
158all the new data created during an Emacs session are kept
159(@pxref{Garbage Collection}). Pure storage is used for certain data
160in the preloaded standard Lisp files---data that should never change
161during actual use of Emacs.
b8d4c8d0 162
d15aac68 163 Pure storage is allocated only while @command{temacs} is loading the
b8d4c8d0
GM
164standard preloaded Lisp libraries. In the file @file{emacs}, it is
165marked as read-only (on operating systems that permit this), so that
166the memory space can be shared by all the Emacs jobs running on the
167machine at once. Pure storage is not expandable; a fixed amount is
168allocated when Emacs is compiled, and if that is not sufficient for
169the preloaded libraries, @file{temacs} allocates dynamic memory for
999dd333
GM
170the part that didn't fit. The resulting image will work, but garbage
171collection (@pxref{Garbage Collection}) is disabled in this situation,
172causing a memory leak. Such an overflow normally won't happen unless
173you try to preload additional libraries or add features to the
174standard ones. Emacs will display a warning about the overflow when
175it starts. If this happens, you should increase the compilation
176parameter @code{SYSTEM_PURESIZE_EXTRA} in the file
177@file{src/puresize.h} and rebuild Emacs.
b8d4c8d0
GM
178
179@defun purecopy object
180This function makes a copy in pure storage of @var{object}, and returns
181it. It copies a string by simply making a new string with the same
182characters, but without text properties, in pure storage. It
183recursively copies the contents of vectors and cons cells. It does
184not make copies of other objects such as symbols, but just returns
185them unchanged. It signals an error if asked to copy markers.
186
187This function is a no-op except while Emacs is being built and dumped;
999dd333 188it is usually called only in preloaded Lisp files.
b8d4c8d0
GM
189@end defun
190
191@defvar pure-bytes-used
192The value of this variable is the number of bytes of pure storage
193allocated so far. Typically, in a dumped Emacs, this number is very
194close to the total amount of pure storage available---if it were not,
195we would preallocate less.
196@end defvar
197
198@defvar purify-flag
199This variable determines whether @code{defun} should make a copy of the
200function definition in pure storage. If it is non-@code{nil}, then the
201function definition is copied into pure storage.
202
203This flag is @code{t} while loading all of the basic functions for
333f9019 204building Emacs initially (allowing those functions to be shareable and
b8d4c8d0
GM
205non-collectible). Dumping Emacs as an executable always writes
206@code{nil} in this variable, regardless of the value it actually has
207before and after dumping.
208
209You should not change this flag in a running Emacs.
210@end defvar
211
212@node Garbage Collection
520f2425 213@section Garbage Collection
b8d4c8d0
GM
214
215@cindex memory allocation
d15aac68
CY
216 When a program creates a list or the user defines a new function
217(such as by loading a library), that data is placed in normal storage.
218If normal storage runs low, then Emacs asks the operating system to
219allocate more memory. Different types of Lisp objects, such as
f3372c87
DA
220symbols, cons cells, small vectors, markers, etc., are segregated in
221distinct blocks in memory. (Large vectors, long strings, buffers and
222certain other editing types, which are fairly large, are allocated in
223individual blocks, one per object; small strings are packed into blocks
224of 8k bytes, and small vectors are packed into blocks of 4k bytes).
225
226@cindex vector-like objects, storage
227@cindex storage of vector-like Lisp objects
228 Beyond the basic vector, a lot of objects like window, buffer, and
229frame are managed as if they were vectors. The corresponding C data
230structures include the @code{struct vectorlike_header} field whose
74934dcc
DA
231@code{size} member contains the subtype enumerated by @code{enum pvec_type}
232and an information about how many @code{Lisp_Object} fields this structure
233contains and what the size of the rest data is. This information is
234needed to calculate the memory footprint of an object, and used
235by the vector allocation code while iterating over the vector blocks.
b8d4c8d0 236
d15aac68
CY
237@cindex garbage collection
238 It is quite common to use some storage for a while, then release it
239by (for example) killing a buffer or deleting the last pointer to an
240object. Emacs provides a @dfn{garbage collector} to reclaim this
241abandoned storage. The garbage collector operates by finding and
242marking all Lisp objects that are still accessible to Lisp programs.
243To begin with, it assumes all the symbols, their values and associated
244function definitions, and any data presently on the stack, are
245accessible. Any objects that can be reached indirectly through other
246accessible objects are also accessible.
b8d4c8d0
GM
247
248 When marking is finished, all objects still unmarked are garbage. No
249matter what the Lisp program or the user does, it is impossible to refer
250to them, since there is no longer a way to reach them. Their space
251might as well be reused, since no one will miss them. The second
252(``sweep'') phase of the garbage collector arranges to reuse them.
253
254@c ??? Maybe add something describing weak hash tables here?
255
256@cindex free list
257 The sweep phase puts unused cons cells onto a @dfn{free list}
258for future allocation; likewise for symbols and markers. It compacts
259the accessible strings so they occupy fewer 8k blocks; then it frees the
f3372c87
DA
260other 8k blocks. Unreachable vectors from vector blocks are coalesced
261to create largest possible free areas; if a free area spans a complete
2624k block, that block is freed. Otherwise, the free area is recorded
263in a free list array, where each entry corresponds to a free list
264of areas of the same size. Large vectors, buffers, and other large
265objects are allocated and freed individually.
b8d4c8d0
GM
266
267@cindex CL note---allocate more storage
268@quotation
269@b{Common Lisp note:} Unlike other Lisps, GNU Emacs Lisp does not
270call the garbage collector when the free list is empty. Instead, it
271simply requests the operating system to allocate more storage, and
272processing continues until @code{gc-cons-threshold} bytes have been
273used.
274
275This means that you can make sure that the garbage collector will not
276run during a certain portion of a Lisp program by calling the garbage
277collector explicitly just before it (provided that portion of the
278program does not use so much space as to force a second garbage
279collection).
280@end quotation
281
282@deffn Command garbage-collect
283This command runs a garbage collection, and returns information on
284the amount of space in use. (Garbage collection can also occur
285spontaneously if you use more than @code{gc-cons-threshold} bytes of
286Lisp data since the previous garbage collection.)
287
74934dcc
DA
288@code{garbage-collect} returns a list with information on amount of space in
289use, where each entry has the form @samp{(@var{name} @var{size} @var{used})}
290or @samp{(@var{name} @var{size} @var{used} @var{free})}. In the entry,
291@var{name} is a symbol describing the kind of objects this entry represents,
292@var{size} is the number of bytes used by each one, @var{used} is the number
293of those objects that were found live in the heap, and optional @var{free} is
294the number of those objects that are not live but that Emacs keeps around for
295future allocations. So an overall result is:
b8d4c8d0
GM
296
297@example
f24f2e22 298((@code{conses} @var{cons-size} @var{used-conses} @var{free-conses})
74934dcc
DA
299 (@code{symbols} @var{symbol-size} @var{used-symbols} @var{free-symbols})
300 (@code{miscs} @var{misc-size} @var{used-miscs} @var{free-miscs})
301 (@code{strings} @var{string-size} @var{used-strings} @var{free-strings})
302 (@code{string-bytes} @var{byte-size} @var{used-bytes})
303 (@code{vectors} @var{vector-size} @var{used-vectors})
304 (@code{vector-slots} @var{slot-size} @var{used-slots} @var{free-slots})
305 (@code{floats} @var{float-size} @var{used-floats} @var{free-floats})
306 (@code{intervals} @var{interval-size} @var{used-intervals} @var{free-intervals})
307 (@code{buffers} @var{buffer-size} @var{used-buffers})
308 (@code{heap} @var{unit-size} @var{total-size} @var{free-size}))
b8d4c8d0
GM
309@end example
310
311Here is an example:
312
313@example
b8d4c8d0 314(garbage-collect)
74934dcc
DA
315 @result{} ((conses 16 49126 8058) (symbols 48 14607 0)
316 (miscs 40 34 56) (strings 32 2942 2607)
317 (string-bytes 1 78607) (vectors 16 7247)
318 (vector-slots 8 341609 29474) (floats 8 71 102)
319 (intervals 56 27 26) (buffers 944 8)
320 (heap 1024 11715 2678))
b8d4c8d0
GM
321@end example
322
74934dcc
DA
323Below is a table explaining each element. Note that last @code{heap} entry
324is optional and present only if an underlying @code{malloc} implementation
325provides @code{mallinfo} function.
b8d4c8d0
GM
326
327@table @var
74934dcc 328@item cons-size
39670ef4 329Internal size of a cons cell, i.e., @code{sizeof (struct Lisp_Cons)}.
74934dcc 330
b8d4c8d0
GM
331@item used-conses
332The number of cons cells in use.
333
334@item free-conses
74934dcc
DA
335The number of cons cells for which space has been obtained from
336the operating system, but that are not currently being used.
b8d4c8d0 337
74934dcc 338@item symbol-size
39670ef4 339Internal size of a symbol, i.e., @code{sizeof (struct Lisp_Symbol)}.
74934dcc
DA
340
341@item used-symbols
b8d4c8d0
GM
342The number of symbols in use.
343
74934dcc
DA
344@item free-symbols
345The number of symbols for which space has been obtained from
346the operating system, but that are not currently being used.
347
348@item misc-size
39670ef4 349Internal size of a miscellaneous entity, i.e.,
74934dcc
DA
350@code{sizeof (union Lisp_Misc)}, which is a size of the
351largest type enumerated in @code{enum Lisp_Misc_Type}.
b8d4c8d0
GM
352
353@item used-miscs
74934dcc
DA
354The number of miscellaneous objects in use. These include markers
355and overlays, plus certain objects not visible to users.
b8d4c8d0
GM
356
357@item free-miscs
358The number of miscellaneous objects for which space has been obtained
359from the operating system, but that are not currently being used.
360
74934dcc 361@item string-size
39670ef4 362Internal size of a string header, i.e., @code{sizeof (struct Lisp_String)}.
74934dcc
DA
363
364@item used-strings
365The number of string headers in use.
366
367@item free-strings
368The number of string headers for which space has been obtained
369from the operating system, but that are not currently being used.
370
371@item byte-size
372This is used for convenience and equals to @code{sizeof (char)}.
373
374@item used-bytes
375The total size of all string data in bytes.
376
377@item vector-size
39670ef4 378Internal size of a vector header, i.e., @code{sizeof (struct Lisp_Vector)}.
74934dcc
DA
379
380@item used-vectors
381The number of vector headers allocated from the vector blocks.
382
383@item slot-size
384Internal size of a vector slot, always equal to @code{sizeof (Lisp_Object)}.
385
386@item used-slots
387The number of slots in all used vectors.
388
389@item free-slots
390The number of free slots in all vector blocks.
b8d4c8d0 391
74934dcc 392@item float-size
39670ef4 393Internal size of a float object, i.e., @code{sizeof (struct Lisp_Float)}.
74934dcc 394(Do not confuse it with the native platform @code{float} or @code{double}.)
b8d4c8d0
GM
395
396@item used-floats
b8d4c8d0
GM
397The number of floats in use.
398
399@item free-floats
74934dcc
DA
400The number of floats for which space has been obtained from
401the operating system, but that are not currently being used.
402
403@item interval-size
39670ef4 404Internal size of an interval object, i.e., @code{sizeof (struct interval)}.
b8d4c8d0
GM
405
406@item used-intervals
74934dcc 407The number of intervals in use.
b8d4c8d0
GM
408
409@item free-intervals
74934dcc
DA
410The number of intervals for which space has been obtained from
411the operating system, but that are not currently being used.
b8d4c8d0 412
74934dcc 413@item buffer-size
39670ef4 414Internal size of a buffer, i.e., @code{sizeof (struct buffer)}.
74934dcc 415(Do not confuse with the value returned by @code{buffer-size} function.)
b8d4c8d0 416
74934dcc
DA
417@item used-buffers
418The number of buffer objects in use. This includes killed buffers
39670ef4 419invisible to users, i.e., all buffers in @code{all_buffers} list.
74934dcc
DA
420
421@item unit-size
422The unit of heap space measurement, always equal to 1024 bytes.
423
424@item total-size
425Total heap size, in @var{unit-size} units.
426
427@item free-size
428Heap space which is not currently used, in @var{unit-size} units.
b8d4c8d0
GM
429@end table
430
999dd333 431If there was overflow in pure space (@pxref{Pure Storage}),
b8d4c8d0 432@code{garbage-collect} returns @code{nil}, because a real garbage
9658dc2a 433collection cannot be done.
b8d4c8d0
GM
434@end deffn
435
436@defopt garbage-collection-messages
437If this variable is non-@code{nil}, Emacs displays a message at the
438beginning and end of garbage collection. The default value is
999dd333 439@code{nil}.
b8d4c8d0
GM
440@end defopt
441
442@defvar post-gc-hook
443This is a normal hook that is run at the end of garbage collection.
444Garbage collection is inhibited while the hook functions run, so be
445careful writing them.
446@end defvar
447
448@defopt gc-cons-threshold
449The value of this variable is the number of bytes of storage that must
450be allocated for Lisp objects after one garbage collection in order to
74934dcc
DA
451trigger another garbage collection. You can use the result returned by
452@code{garbage-collect} to get an information about size of the particular
453object type; space allocated to the contents of buffers does not count.
454Note that the subsequent garbage collection does not happen immediately
455when the threshold is exhausted, but only the next time the Lisp interpreter
456is called.
457
458The initial threshold value is @code{GC_DEFAULT_THRESHOLD}, defined in
459@file{alloc.c}. Since it's defined in @code{word_size} units, the value
460is 400,000 for the default 32-bit configuration and 800,000 for the 64-bit
461one. If you specify a larger value, garbage collection will happen less
462often. This reduces the amount of time spent garbage collecting, but
463increases total memory use. You may want to do this when running a program
464that creates lots of Lisp data.
465
466You can make collections more frequent by specifying a smaller value, down
467to 1/10th of @code{GC_DEFAULT_THRESHOLD}. A value less than this minimum
468will remain in effect only until the subsequent garbage collection, at which
469time @code{garbage-collect} will set the threshold back to the minimum.
b8d4c8d0
GM
470@end defopt
471
472@defopt gc-cons-percentage
473The value of this variable specifies the amount of consing before a
474garbage collection occurs, as a fraction of the current heap size.
475This criterion and @code{gc-cons-threshold} apply in parallel, and
476garbage collection occurs only when both criteria are satisfied.
477
478As the heap size increases, the time to perform a garbage collection
479increases. Thus, it can be desirable to do them less frequently in
480proportion.
481@end defopt
482
483 The value returned by @code{garbage-collect} describes the amount of
484memory used by Lisp data, broken down by data type. By contrast, the
485function @code{memory-limit} provides information on the total amount of
486memory Emacs is currently using.
487
b8d4c8d0
GM
488@defun memory-limit
489This function returns the address of the last byte Emacs has allocated,
490divided by 1024. We divide the value by 1024 to make sure it fits in a
491Lisp integer.
492
493You can use this to get a general idea of how your actions affect the
494memory usage.
495@end defun
496
497@defvar memory-full
d15aac68 498This variable is @code{t} if Emacs is nearly out of memory for Lisp
b8d4c8d0
GM
499objects, and @code{nil} otherwise.
500@end defvar
501
502@defun memory-use-counts
503This returns a list of numbers that count the number of objects
504created in this Emacs session. Each of these counters increments for
505a certain kind of object. See the documentation string for details.
506@end defun
507
508@defvar gcs-done
509This variable contains the total number of garbage collections
510done so far in this Emacs session.
511@end defvar
512
513@defvar gc-elapsed
514This variable contains the total number of seconds of elapsed time
515during garbage collection so far in this Emacs session, as a floating
516point number.
517@end defvar
518
519@node Memory Usage
520@section Memory Usage
521@cindex memory usage
522
523 These functions and variables give information about the total amount
524of memory allocation that Emacs has done, broken down by data type.
525Note the difference between these and the values returned by
01a6dcc8 526@code{garbage-collect}; those count objects that currently exist, but
b8d4c8d0
GM
527these count the number or size of all allocations, including those for
528objects that have since been freed.
529
530@defvar cons-cells-consed
531The total number of cons cells that have been allocated so far
532in this Emacs session.
533@end defvar
534
535@defvar floats-consed
536The total number of floats that have been allocated so far
537in this Emacs session.
538@end defvar
539
540@defvar vector-cells-consed
541The total number of vector cells that have been allocated so far
542in this Emacs session.
543@end defvar
544
545@defvar symbols-consed
546The total number of symbols that have been allocated so far
547in this Emacs session.
548@end defvar
549
550@defvar string-chars-consed
551The total number of string characters that have been allocated so far
9658dc2a 552in this session.
b8d4c8d0
GM
553@end defvar
554
555@defvar misc-objects-consed
556The total number of miscellaneous objects that have been allocated so
9658dc2a 557far in this session. These include markers and overlays, plus
b8d4c8d0
GM
558certain objects not visible to users.
559@end defvar
560
561@defvar intervals-consed
562The total number of intervals that have been allocated so far
563in this Emacs session.
564@end defvar
565
566@defvar strings-consed
567The total number of strings that have been allocated so far in this
568Emacs session.
569@end defvar
570
571@node Writing Emacs Primitives
520f2425 572@section Writing Emacs Primitives
b8d4c8d0
GM
573@cindex primitive function internals
574@cindex writing Emacs primitives
575
1df7defd 576 Lisp primitives are Lisp functions implemented in C@. The details of
b8d4c8d0
GM
577interfacing the C function so that Lisp can call it are handled by a few
578C macros. The only way to really understand how to write new C code is
579to read the source, but we can explain some things here.
580
581 An example of a special form is the definition of @code{or}, from
582@file{eval.c}. (An ordinary function would have the same general
583appearance.)
584
585@cindex garbage collection protection
586@smallexample
587@group
588DEFUN ("or", For, Sor, 0, UNEVALLED, 0,
d15aac68
CY
589 doc: /* Eval args until one of them yields non-nil, then return
590that value.
591The remaining args are not evalled at all.
b8d4c8d0
GM
592If all args return nil, return nil.
593@end group
594@group
595usage: (or CONDITIONS ...) */)
b88746ba 596 (Lisp_Object args)
b8d4c8d0
GM
597@{
598 register Lisp_Object val = Qnil;
599 struct gcpro gcpro1;
600@end group
601
602@group
603 GCPRO1 (args);
604@end group
605
606@group
607 while (CONSP (args))
608 @{
7d2d7cc0 609 val = eval_sub (XCAR (args));
b8d4c8d0
GM
610 if (!NILP (val))
611 break;
612 args = XCDR (args);
613 @}
614@end group
615
616@group
617 UNGCPRO;
618 return val;
619@}
620@end group
621@end smallexample
622
623@cindex @code{DEFUN}, C macro to define Lisp primitives
624 Let's start with a precise explanation of the arguments to the
625@code{DEFUN} macro. Here is a template for them:
626
627@example
628DEFUN (@var{lname}, @var{fname}, @var{sname}, @var{min}, @var{max}, @var{interactive}, @var{doc})
629@end example
630
631@table @var
632@item lname
633This is the name of the Lisp symbol to define as the function name; in
634the example above, it is @code{or}.
635
636@item fname
d15aac68
CY
637This is the C function name for this function. This is the name that
638is used in C code for calling the function. The name is, by
639convention, @samp{F} prepended to the Lisp name, with all dashes
640(@samp{-}) in the Lisp name changed to underscores. Thus, to call
641this function from C code, call @code{For}.
b8d4c8d0
GM
642
643@item sname
644This is a C variable name to use for a structure that holds the data for
645the subr object that represents the function in Lisp. This structure
646conveys the Lisp symbol name to the initialization routine that will
647create the symbol and store the subr object as its definition. By
648convention, this name is always @var{fname} with @samp{F} replaced with
649@samp{S}.
650
651@item min
652This is the minimum number of arguments that the function requires. The
653function @code{or} allows a minimum of zero arguments.
654
655@item max
656This is the maximum number of arguments that the function accepts, if
657there is a fixed maximum. Alternatively, it can be @code{UNEVALLED},
658indicating a special form that receives unevaluated arguments, or
659@code{MANY}, indicating an unlimited number of evaluated arguments (the
660equivalent of @code{&rest}). Both @code{UNEVALLED} and @code{MANY} are
9658dc2a
GM
661macros. If @var{max} is a number, it must be more than @var{min} but
662less than 8.
b8d4c8d0 663
84ebefe9 664@cindex interactive specification in primitives
b8d4c8d0 665@item interactive
84ebefe9
XF
666This is an interactive specification, a string such as might be used
667as the argument of @code{interactive} in a Lisp function. In the case
668of @code{or}, it is 0 (a null pointer), indicating that @code{or}
669cannot be called interactively. A value of @code{""} indicates a function that should receive no
670arguments when called interactively. For example:
671
672@smallexample
673@group
674DEFUN ("baz", Fbaz, Sbaz, 0, 0, "",
675 doc: /* @dots{} */)
676@end group
677@end smallexample
678
679If the value begins with a @samp{"(}, the string is evaluated as a
680Lisp form. For example:
681
682@smallexample
683@group
684DEFUN ("foo", Ffoo, Sfoo, 0, UNEVALLED, "(list
685 (read-char-by-name \"Insert character (Unicode name or hex): \")\
686 (prefix-numeric-value current-prefix-arg)\
687 t))",
688 doc: /* @dots{} /*)
689@end group
690@end smallexample
b8d4c8d0
GM
691
692@item doc
693This is the documentation string. It uses C comment syntax rather
694than C string syntax because comment syntax requires nothing special
695to include multiple lines. The @samp{doc:} identifies the comment
696that follows as the documentation string. The @samp{/*} and @samp{*/}
697delimiters that begin and end the comment are not part of the
698documentation string.
699
700If the last line of the documentation string begins with the keyword
701@samp{usage:}, the rest of the line is treated as the argument list
702for documentation purposes. This way, you can use different argument
703names in the documentation string from the ones used in the C code.
704@samp{usage:} is required if the function has an unlimited number of
705arguments.
706
707All the usual rules for documentation strings in Lisp code
708(@pxref{Documentation Tips}) apply to C code documentation strings
709too.
710@end table
711
b88746ba 712 After the call to the @code{DEFUN} macro, you must write the
d15aac68
CY
713argument list for the C function, including the types for the
714arguments. If the primitive accepts a fixed maximum number of Lisp
715arguments, there must be one C argument for each Lisp argument, and
716each argument must be of type @code{Lisp_Object}. (Various macros and
717functions for creating values of type @code{Lisp_Object} are declared
718in the file @file{lisp.h}.) If the primitive has no upper limit on
719the number of Lisp arguments, it must have exactly two C arguments:
720the first is the number of Lisp arguments, and the second is the
721address of a block containing their values. These have types
39670ef4 722@code{int} and @w{@code{Lisp_Object *}} respectively. Since
74934dcc
DA
723@code{Lisp_Object} can hold any Lisp object of any data type, you
724can determine the actual data type only at run time; so if you want
725a primitive to accept only a certain type of argument, you must check
726the type explicitly using a suitable predicate (@pxref{Type Predicates}).
727@cindex type checking internals
b8d4c8d0
GM
728
729@cindex @code{GCPRO} and @code{UNGCPRO}
730@cindex protect C variables from garbage collection
731 Within the function @code{For} itself, note the use of the macros
d15aac68
CY
732@code{GCPRO1} and @code{UNGCPRO}. These macros are defined for the
733sake of the few platforms which do not use Emacs' default
734stack-marking garbage collector. The @code{GCPRO1} macro ``protects''
735a variable from garbage collection, explicitly informing the garbage
736collector that that variable and all its contents must be as
737accessible. GC protection is necessary in any function which can
738perform Lisp evaluation by calling @code{eval_sub} or @code{Feval} as
739a subroutine, either directly or indirectly.
b8d4c8d0
GM
740
741 It suffices to ensure that at least one pointer to each object is
d15aac68 742GC-protected. Thus, a particular local variable can do without
b8d4c8d0 743protection if it is certain that the object it points to will be
7d2d7cc0 744preserved by some other pointer (such as another local variable that
d15aac68
CY
745has a @code{GCPRO}). Otherwise, the local variable needs a
746@code{GCPRO}.
b8d4c8d0
GM
747
748 The macro @code{GCPRO1} protects just one local variable. If you
749want to protect two variables, use @code{GCPRO2} instead; repeating
750@code{GCPRO1} will not work. Macros @code{GCPRO3}, @code{GCPRO4},
751@code{GCPRO5}, and @code{GCPRO6} also exist. All these macros
752implicitly use local variables such as @code{gcpro1}; you must declare
753these explicitly, with type @code{struct gcpro}. Thus, if you use
754@code{GCPRO2}, you must declare @code{gcpro1} and @code{gcpro2}.
b8d4c8d0
GM
755
756 @code{UNGCPRO} cancels the protection of the variables that are
757protected in the current function. It is necessary to do this
758explicitly.
759
b8d4c8d0
GM
760 You must not use C initializers for static or global variables unless
761the variables are never written once Emacs is dumped. These variables
762with initializers are allocated in an area of memory that becomes
763read-only (on certain operating systems) as a result of dumping Emacs.
764@xref{Pure Storage}.
765
b8d4c8d0
GM
766@cindex @code{defsubr}, Lisp symbol for a primitive
767 Defining the C function is not enough to make a Lisp primitive
768available; you must also create the Lisp symbol for the primitive and
769store a suitable subr object in its function cell. The code looks like
770this:
771
772@example
7d2d7cc0 773defsubr (&@var{sname});
b8d4c8d0
GM
774@end example
775
776@noindent
7d2d7cc0 777Here @var{sname} is the name you used as the third argument to @code{DEFUN}.
b8d4c8d0
GM
778
779 If you add a new primitive to a file that already has Lisp primitives
780defined in it, find the function (near the end of the file) named
781@code{syms_of_@var{something}}, and add the call to @code{defsubr}
782there. If the file doesn't have this function, or if you create a new
783file, add to it a @code{syms_of_@var{filename}} (e.g.,
784@code{syms_of_myfile}). Then find the spot in @file{emacs.c} where all
785of these functions are called, and add a call to
786@code{syms_of_@var{filename}} there.
787
788@anchor{Defining Lisp variables in C}
789@vindex byte-boolean-vars
790@cindex defining Lisp variables in C
791@cindex @code{DEFVAR_INT}, @code{DEFVAR_LISP}, @code{DEFVAR_BOOL}
792 The function @code{syms_of_@var{filename}} is also the place to define
793any C variables that are to be visible as Lisp variables.
794@code{DEFVAR_LISP} makes a C variable of type @code{Lisp_Object} visible
795in Lisp. @code{DEFVAR_INT} makes a C variable of type @code{int}
796visible in Lisp with a value that is always an integer.
797@code{DEFVAR_BOOL} makes a C variable of type @code{int} visible in Lisp
798with a value that is either @code{t} or @code{nil}. Note that variables
799defined with @code{DEFVAR_BOOL} are automatically added to the list
800@code{byte-boolean-vars} used by the byte compiler.
801
7d2d7cc0
GM
802@cindex defining customization variables in C
803 If you want to make a Lisp variables that is defined in C behave
804like one declared with @code{defcustom}, add an appropriate entry to
805@file{cus-start.el}.
806
b8d4c8d0
GM
807@cindex @code{staticpro}, protection from GC
808 If you define a file-scope C variable of type @code{Lisp_Object},
809you must protect it from garbage-collection by calling @code{staticpro}
810in @code{syms_of_@var{filename}}, like this:
811
812@example
813staticpro (&@var{variable});
814@end example
815
816 Here is another example function, with more complicated arguments.
817This comes from the code in @file{window.c}, and it demonstrates the use
818of macros and functions to manipulate Lisp objects.
819
820@smallexample
821@group
822DEFUN ("coordinates-in-window-p", Fcoordinates_in_window_p,
7d2d7cc0
GM
823 Scoordinates_in_window_p, 2, 2, 0,
824 doc: /* Return non-nil if COORDINATES are in WINDOW.
825 ...
b8d4c8d0
GM
826@end group
827@group
7d2d7cc0
GM
828 or `right-margin' is returned. */)
829 (register Lisp_Object coordinates, Lisp_Object window)
b8d4c8d0 830@{
7d2d7cc0
GM
831 struct window *w;
832 struct frame *f;
b8d4c8d0 833 int x, y;
7d2d7cc0 834 Lisp_Object lx, ly;
b8d4c8d0
GM
835@end group
836
837@group
7d2d7cc0
GM
838 CHECK_LIVE_WINDOW (window);
839 w = XWINDOW (window);
840 f = XFRAME (w->frame);
841 CHECK_CONS (coordinates);
842 lx = Fcar (coordinates);
843 ly = Fcdr (coordinates);
844 CHECK_NUMBER_OR_FLOAT (lx);
845 CHECK_NUMBER_OR_FLOAT (ly);
846 x = FRAME_PIXEL_X_FROM_CANON_X (f, lx) + FRAME_INTERNAL_BORDER_WIDTH(f);
847 y = FRAME_PIXEL_Y_FROM_CANON_Y (f, ly) + FRAME_INTERNAL_BORDER_WIDTH(f);
b8d4c8d0
GM
848@end group
849
850@group
7d2d7cc0 851 switch (coordinates_in_window (w, x, y))
b8d4c8d0 852 @{
7d2d7cc0 853 case ON_NOTHING: /* NOT in window at all. */
b8d4c8d0
GM
854 return Qnil;
855@end group
856
7d2d7cc0 857 ...
b8d4c8d0
GM
858
859@group
7d2d7cc0 860 case ON_MODE_LINE: /* In mode line of window. */
b8d4c8d0
GM
861 return Qmode_line;
862@end group
863
7d2d7cc0
GM
864 ...
865
b8d4c8d0 866@group
7d2d7cc0
GM
867 case ON_SCROLL_BAR: /* On scroll-bar of window. */
868 /* Historically we are supposed to return nil in this case. */
869 return Qnil;
b8d4c8d0
GM
870@end group
871
872@group
873 default:
874 abort ();
875 @}
876@}
877@end group
878@end smallexample
879
880 Note that C code cannot call functions by name unless they are defined
1df7defd 881in C@. The way to call a function written in Lisp is to use
b8d4c8d0
GM
882@code{Ffuncall}, which embodies the Lisp function @code{funcall}. Since
883the Lisp function @code{funcall} accepts an unlimited number of
884arguments, in C it takes two: the number of Lisp-level arguments, and a
885one-dimensional array containing their values. The first Lisp-level
886argument is the Lisp function to call, and the rest are the arguments to
887pass to it. Since @code{Ffuncall} can call the evaluator, you must
888protect pointers from garbage collection around the call to
889@code{Ffuncall}.
890
891 The C functions @code{call0}, @code{call1}, @code{call2}, and so on,
892provide handy ways to call a Lisp function conveniently with a fixed
893number of arguments. They work by calling @code{Ffuncall}.
894
895 @file{eval.c} is a very good file to look through for examples;
896@file{lisp.h} contains the definitions for some important macros and
897functions.
898
899 If you define a function which is side-effect free, update the code
7d2d7cc0 900in @file{byte-opt.el} that binds @code{side-effect-free-fns} and
b8d4c8d0
GM
901@code{side-effect-and-error-free-fns} so that the compiler optimizer
902knows about it.
903
904@node Object Internals
520f2425 905@section Object Internals
b8d4c8d0
GM
906@cindex object internals
907
74934dcc 908 Emacs Lisp provides a rich set of the data types. Some of them, like cons
f24f2e22 909cells, integers and strings, are common to nearly all Lisp dialects. Some
74934dcc
DA
910others, like markers and buffers, are quite special and needed to provide
911the basic support to write editor commands in Lisp. To implement such
912a variety of object types and provide an efficient way to pass objects between
913the subsystems of an interpreter, there is a set of C data structures and
914a special type to represent the pointers to all of them, which is known as
915@dfn{tagged pointer}.
916
917 In C, the tagged pointer is an object of type @code{Lisp_Object}. Any
918initialized variable of such a type always holds the value of one of the
919following basic data types: integer, symbol, string, cons cell, float,
920vectorlike or miscellaneous object. Each of these data types has the
921corresponding tag value. All tags are enumerated by @code{enum Lisp_Type}
922and placed into a 3-bit bitfield of the @code{Lisp_Object}. The rest of the
39670ef4 923bits is the value itself. Integer values are immediate, i.e., directly
74934dcc
DA
924represented by those @dfn{value bits}, and all other objects are represented
925by the C pointers to a corresponding object allocated from the heap. Width
926of the @code{Lisp_Object} is platform- and configuration-dependent: usually
39670ef4 927it's equal to the width of an underlying platform pointer (i.e., 32-bit on
74934dcc
DA
928a 32-bit machine and 64-bit on a 64-bit one), but also there is a special
929configuration where @code{Lisp_Object} is 64-bit but all pointers are 32-bit.
930The latter trick was designed to overcome the limited range of values for
931Lisp integers on a 32-bit system by using 64-bit @code{long long} type for
932@code{Lisp_Object}.
933
934 The following C data structures are defined in @file{lisp.h} to represent
935the basic data types beyond integers:
936
937@table @code
938@item struct Lisp_Cons
939Cons cell, an object used to construct lists.
940
941@item struct Lisp_String
942String, the basic object to represent a sequence of characters.
943
944@item struct Lisp_Vector
945Array, a fixed-size set of Lisp objects which may be accessed by an index.
946
947@item struct Lisp_Symbol
948Symbol, the unique-named entity commonly used as an identifier.
949
950@item struct Lisp_Float
951Floating point value.
952
953@item union Lisp_Misc
954Miscellaneous kinds of objects which don't fit into any of the above.
955@end table
956
957 These types are the first-class citizens of an internal type system.
958Since the tag space is limited, all other types are the subtypes of either
959@code{Lisp_Vectorlike} or @code{Lisp_Misc}. Vector subtypes are enumerated
960by @code{enum pvec_type}, and nearly all complex objects like windows, buffers,
961frames, and processes fall into this category. The rest of special types,
962including markers and overlays, are enumerated by @code{enum Lisp_Misc_Type}
963and form the set of subtypes of @code{Lisp_Misc}.
964
965 Below there is a description of a few subtypes of @code{Lisp_Vectorlike}.
966Buffer object represents the text to display and edit. Window is the part
967of display structure which shows the buffer or used as a container to
968recursively place other windows on the same frame. (Do not confuse Emacs Lisp
969window object with the window as an entity managed by the user interface
970system like X; in Emacs terminology, the latter is called frame.) Finally,
971process object is used to manage the subprocesses.
b8d4c8d0
GM
972
973@menu
974* Buffer Internals:: Components of a buffer structure.
975* Window Internals:: Components of a window structure.
976* Process Internals:: Components of a process structure.
977@end menu
978
979@node Buffer Internals
520f2425 980@subsection Buffer Internals
b8d4c8d0
GM
981@cindex internals, of buffer
982@cindex buffer internals
983
7d2d7cc0 984 Two structures (see @file{buffer.h}) are used to represent buffers
1df7defd 985in C@. The @code{buffer_text} structure contains fields describing the
7d2d7cc0
GM
986text of a buffer; the @code{buffer} structure holds other fields. In
987the case of indirect buffers, two or more @code{buffer} structures
988reference the same @code{buffer_text} structure.
b8d4c8d0 989
c773345a 990Here are some of the fields in @code{struct buffer_text}:
b8d4c8d0
GM
991
992@table @code
993@item beg
c773345a 994The address of the buffer contents.
b8d4c8d0
GM
995
996@item gpt
c773345a
CY
997@itemx gpt_byte
998The character and byte positions of the buffer gap. @xref{Buffer
999Gap}.
b8d4c8d0
GM
1000
1001@item z
c773345a
CY
1002@itemx z_byte
1003The character and byte positions of the end of the buffer text.
b8d4c8d0
GM
1004
1005@item gap_size
c773345a 1006The size of buffer's gap. @xref{Buffer Gap}.
b8d4c8d0
GM
1007
1008@item modiff
c773345a
CY
1009@itemx save_modiff
1010@itemx chars_modiff
1011@itemx overlay_modiff
1012These fields count the number of buffer-modification events performed
1013in this buffer. @code{modiff} is incremented after each
1014buffer-modification event, and is never otherwise changed;
1015@code{save_modiff} contains the value of @code{modiff} the last time
1016the buffer was visited or saved; @code{chars_modiff} counts only
1017modifications to the characters in the buffer, ignoring all other
1018kinds of changes; and @code{overlay_modiff} counts only modifications
1019to the overlays.
b8d4c8d0
GM
1020
1021@item beg_unchanged
c773345a
CY
1022@itemx end_unchanged
1023The number of characters at the start and end of the text that are
1024known to be unchanged since the last complete redisplay.
b8d4c8d0
GM
1025
1026@item unchanged_modified
c773345a
CY
1027@itemx overlay_unchanged_modified
1028The values of @code{modiff} and @code{overlay_modiff}, respectively,
da6062e6 1029after the last complete redisplay. If their current values match
c773345a 1030@code{modiff} or @code{overlay_modiff}, that means
b8d4c8d0
GM
1031@code{beg_unchanged} and @code{end_unchanged} contain no useful
1032information.
1033
1034@item markers
1035The markers that refer to this buffer. This is actually a single
1036marker, and successive elements in its marker @code{chain} are the other
1037markers referring to this buffer text.
1038
1039@item intervals
c773345a 1040The interval tree which records the text properties of this buffer.
b8d4c8d0
GM
1041@end table
1042
c773345a 1043Some of the fields of @code{struct buffer} are:
b8d4c8d0
GM
1044
1045@table @code
7d2d7cc0 1046@item header
74934dcc
DA
1047A header of type @code{struct vectorlike_header} is common to all
1048vectorlike objects.
b8d4c8d0
GM
1049
1050@item own_text
c773345a
CY
1051A @code{struct buffer_text} structure that ordinarily holds the buffer
1052contents. In indirect buffers, this field is not used.
b8d4c8d0
GM
1053
1054@item text
c773345a
CY
1055A pointer to the @code{buffer_text} structure for this buffer. In an
1056ordinary buffer, this is the @code{own_text} field above. In an
1057indirect buffer, this is the @code{own_text} field of the base buffer.
b8d4c8d0 1058
74934dcc
DA
1059@item next
1060A pointer to the next buffer, in the chain of all buffers, including
1061killed buffers. This chain is used only for allocation and garbage
1062collection, in order to collect killed buffers properly.
1063
b8d4c8d0 1064@item pt
c773345a
CY
1065@itemx pt_byte
1066The character and byte positions of point in a buffer.
b8d4c8d0
GM
1067
1068@item begv
c773345a
CY
1069@itemx begv_byte
1070The character and byte positions of the beginning of the accessible
1071range of text in the buffer.
b8d4c8d0
GM
1072
1073@item zv
c773345a
CY
1074@itemx zv_byte
1075The character and byte positions of the end of the accessible range of
1076text in the buffer.
b8d4c8d0
GM
1077
1078@item base_buffer
1079In an indirect buffer, this points to the base buffer. In an ordinary
1080buffer, it is null.
1081
c773345a
CY
1082@item local_flags
1083This field contains flags indicating that certain variables are local
1084in this buffer. Such variables are declared in the C code using
1085@code{DEFVAR_PER_BUFFER}, and their buffer-local bindings are stored
1086in fields in the buffer structure itself. (Some of these fields are
b8d4c8d0
GM
1087described in this table.)
1088
1089@item modtime
c773345a
CY
1090The modification time of the visited file. It is set when the file is
1091written or read. Before writing the buffer into a file, this field is
1092compared to the modification time of the file to see if the file has
1093changed on disk. @xref{Buffer Modification}.
b8d4c8d0
GM
1094
1095@item auto_save_modified
c773345a 1096The time when the buffer was last auto-saved.
b8d4c8d0
GM
1097
1098@item last_window_start
c773345a
CY
1099The @code{window-start} position in the buffer as of the last time the
1100buffer was displayed in a window.
b8d4c8d0
GM
1101
1102@item clip_changed
c773345a
CY
1103This flag indicates that narrowing has changed in the buffer.
1104@xref{Narrowing}.
b8d4c8d0
GM
1105
1106@item prevent_redisplay_optimizations_p
c773345a
CY
1107This flag indicates that redisplay optimizations should not be used to
1108display this buffer.
b8d4c8d0 1109
c773345a
CY
1110@item overlay_center
1111This field holds the current overlay center position. @xref{Managing
1112Overlays}.
1113
1114@item overlays_before
1115@itemx overlays_after
1116These fields hold, respectively, a list of overlays that end at or
1117before the current overlay center, and a list of overlays that end
1118after the current overlay center. @xref{Managing Overlays}.
1119@code{overlays_before} is sorted in order of decreasing end position,
1120and @code{overlays_after} is sorted in order of increasing beginning
1121position.
b8d4c8d0 1122
7d2d7cc0
GM
1123@c FIXME? the following are now all Lisp_Object BUFFER_INTERNAL_FIELD (foo).
1124
b8d4c8d0 1125@item name
c773345a
CY
1126A Lisp string that names the buffer. It is guaranteed to be unique.
1127@xref{Buffer Names}.
b8d4c8d0 1128
c773345a
CY
1129@item save_length
1130The length of the file this buffer is visiting, when last read or
1131saved. This and other fields concerned with saving are not kept in
1132the @code{buffer_text} structure because indirect buffers are never
1133saved.
b8d4c8d0
GM
1134
1135@item directory
c773345a
CY
1136The directory for expanding relative file names. This is the value of
1137the buffer-local variable @code{default-directory} (@pxref{File Name Expansion}).
b8d4c8d0 1138
c773345a
CY
1139@item filename
1140The name of the file visited in this buffer, or @code{nil}. This is
1141the value of the buffer-local variable @code{buffer-file-name}
1142(@pxref{Buffer File Name}).
b8d4c8d0 1143
c773345a
CY
1144@item undo_list
1145@itemx backed_up
1146@itemx auto_save_file_name
7d2d7cc0 1147@itemx auto_save_file_format
c773345a
CY
1148@itemx read_only
1149@itemx file_format
1150@itemx file_truename
1151@itemx invisibility_spec
1152@itemx display_count
1153@itemx display_time
1154These fields store the values of Lisp variables that are automatically
1155buffer-local (@pxref{Buffer-Local Variables}), whose corresponding
1156variable names have the additional prefix @code{buffer-} and have
1157underscores replaced with dashes. For instance, @code{undo_list}
29157371 1158stores the value of @code{buffer-undo-list}.
b8d4c8d0
GM
1159
1160@item mark
c773345a
CY
1161The mark for the buffer. The mark is a marker, hence it is also
1162included on the list @code{markers}. @xref{The Mark}.
b8d4c8d0
GM
1163
1164@item local_var_alist
c773345a
CY
1165The association list describing the buffer-local variable bindings of
1166this buffer, not including the built-in buffer-local bindings that
1167have special slots in the buffer object. (Those slots are omitted
1168from this table.) @xref{Buffer-Local Variables}.
b8d4c8d0
GM
1169
1170@item major_mode
1171Symbol naming the major mode of this buffer, e.g., @code{lisp-mode}.
1172
1173@item mode_name
c773345a 1174Pretty name of the major mode, e.g., @code{"Lisp"}.
b8d4c8d0
GM
1175
1176@item keymap
c773345a
CY
1177@itemx abbrev_table
1178@itemx syntax_table
1179@itemx category_table
1180@itemx display_table
1181These fields store the buffer's local keymap (@pxref{Keymaps}), abbrev
1182table (@pxref{Abbrev Tables}), syntax table (@pxref{Syntax Tables}),
1183category table (@pxref{Categories}), and display table (@pxref{Display
1184Tables}).
b8d4c8d0
GM
1185
1186@item downcase_table
c773345a
CY
1187@itemx upcase_table
1188@itemx case_canon_table
1189These fields store the conversion tables for converting text to lower
1190case, upper case, and for canonicalizing text for case-fold search.
b8d4c8d0
GM
1191@xref{Case Tables}.
1192
b8d4c8d0
GM
1193@item minor_modes
1194An alist of the minor modes of this buffer.
1195
b8d4c8d0 1196@item pt_marker
c773345a
CY
1197@itemx begv_marker
1198@itemx zv_marker
1199These fields are only used in an indirect buffer, or in a buffer that
1200is the base of an indirect buffer. Each holds a marker that records
1201@code{pt}, @code{begv}, and @code{zv} respectively, for this buffer
b8d4c8d0
GM
1202when the buffer is not current.
1203
c773345a
CY
1204@item mode_line_format
1205@itemx header_line_format
1206@itemx case_fold_search
1207@itemx tab_width
1208@itemx fill_column
1209@itemx left_margin
1210@itemx auto_fill_function
c773345a
CY
1211@itemx truncate_lines
1212@itemx word_wrap
1213@itemx ctl_arrow
7d2d7cc0
GM
1214@itemx bidi_display_reordering
1215@itemx bidi_paragraph_direction
c773345a
CY
1216@itemx selective_display
1217@itemx selective_display_ellipses
1218@itemx overwrite_mode
1219@itemx abbrev_mode
c773345a
CY
1220@itemx mark_active
1221@itemx enable_multibyte_characters
1222@itemx buffer_file_coding_system
c773345a
CY
1223@itemx cache_long_line_scans
1224@itemx point_before_scroll
1225@itemx left_fringe_width
1226@itemx right_fringe_width
1227@itemx fringes_outside_margins
1228@itemx scroll_bar_width
1229@itemx indicate_empty_lines
1230@itemx indicate_buffer_boundaries
1231@itemx fringe_indicator_alist
1232@itemx fringe_cursor_alist
1233@itemx scroll_up_aggressively
1234@itemx scroll_down_aggressively
1235@itemx cursor_type
1236@itemx cursor_in_non_selected_windows
1237These fields store the values of Lisp variables that are automatically
1238buffer-local (@pxref{Buffer-Local Variables}), whose corresponding
1239variable names have underscores replaced with dashes. For instance,
1240@code{mode_line_format} stores the value of @code{mode-line-format}.
b8d4c8d0
GM
1241
1242@item last_selected_window
1243This is the last window that was selected with this buffer in it, or @code{nil}
1244if that window no longer displays this buffer.
b8d4c8d0
GM
1245@end table
1246
1247@node Window Internals
520f2425 1248@subsection Window Internals
b8d4c8d0
GM
1249@cindex internals, of window
1250@cindex window internals
1251
7d2d7cc0
GM
1252 The fields of a window (for a complete list, see the definition of
1253@code{struct window} in @file{window.h}) include:
b8d4c8d0
GM
1254
1255@table @code
1256@item frame
1257The frame that this window is on.
1258
1259@item mini_p
1260Non-@code{nil} if this window is a minibuffer window.
1261
1262@item parent
1263Internally, Emacs arranges windows in a tree; each group of siblings has
1264a parent window whose area includes all the siblings. This field points
1265to a window's parent.
1266
1267Parent windows do not display buffers, and play little role in display
1268except to shape their child windows. Emacs Lisp programs usually have
1269no access to the parent windows; they operate on the windows at the
1270leaves of the tree, which actually display buffers.
1271
b8d4c8d0 1272@item hchild
ee2d5b5e
CY
1273@itemx vchild
1274These fields contain the window's leftmost child and its topmost child
1275respectively. @code{hchild} is used if the window is subdivided
1276horizontally by child windows, and @code{vchild} if it is subdivided
7d2d7cc0 1277vertically. In a live window, only one of @code{hchild}, @code{vchild},
1df7defd 1278and @code{buffer} (q.v.@:) is non-@code{nil}.
b8d4c8d0
GM
1279
1280@item next
ee2d5b5e
CY
1281@itemx prev
1282The next sibling and previous sibling of this window. @code{next} is
520f2425
GM
1283@code{nil} if the window is the right-most or bottom-most in its group;
1284@code{prev} is @code{nil} if it is the left-most or top-most in its
ee2d5b5e
CY
1285group.
1286
1287@item left_col
1288The left-hand edge of the window, measured in columns, relative to the
1289leftmost column in the frame (column 0).
1290
1291@item top_line
1292The top edge of the window, measured in lines, relative to the topmost
1293line in the frame (line 0).
1294
1295@item total_cols
1296@itemx total_lines
1297The width and height of the window, measured in columns and lines
1298respectively. The width includes the scroll bar and fringes, and/or
1299the separator line on the right of the window (if any).
b8d4c8d0
GM
1300
1301@item buffer
ee2d5b5e 1302The buffer that the window is displaying.
b8d4c8d0
GM
1303
1304@item start
ee2d5b5e
CY
1305A marker pointing to the position in the buffer that is the first
1306character displayed in the window.
b8d4c8d0
GM
1307
1308@item pointm
1309@cindex window point internals
1310This is the value of point in the current buffer when this window is
1311selected; when it is not selected, it retains its previous value.
1312
1313@item force_start
1314If this flag is non-@code{nil}, it says that the window has been
1315scrolled explicitly by the Lisp program. This affects what the next
1316redisplay does if point is off the screen: instead of scrolling the
1317window to show the text around point, it moves point to a location that
1318is on the screen.
1319
1320@item frozen_window_start_p
1321This field is set temporarily to 1 to indicate to redisplay that
1322@code{start} of this window should not be changed, even if point
1323gets invisible.
1324
1325@item start_at_line_beg
1326Non-@code{nil} means current value of @code{start} was the beginning of a line
1327when it was chosen.
1328
b8d4c8d0
GM
1329@item use_time
1330This is the last time that the window was selected. The function
1331@code{get-lru-window} uses this field.
1332
1333@item sequence_number
1334A unique number assigned to this window when it was created.
1335
1336@item last_modified
1337The @code{modiff} field of the window's buffer, as of the last time
1338a redisplay completed in this window.
1339
1340@item last_overlay_modified
1341The @code{overlay_modiff} field of the window's buffer, as of the last
1342time a redisplay completed in this window.
1343
1344@item last_point
1345The buffer's value of point, as of the last time a redisplay completed
1346in this window.
1347
1348@item last_had_star
1349A non-@code{nil} value means the window's buffer was ``modified'' when the
1350window was last updated.
1351
1352@item vertical_scroll_bar
1353This window's vertical scroll bar.
1354
7d2d7cc0
GM
1355@item left_margin_cols
1356@itemx right_margin_cols
ee2d5b5e 1357The widths of the left and right margins in this window. A value of
7d2d7cc0
GM
1358@code{nil} means no margin.
1359
1360@item left_fringe_width
1361@itemx right_fringe_width
1362The widths of the left and right fringes in this window. A value of
1363@code{nil} or @code{t} means use the values of the frame.
1364
1365@item fringes_outside_margins
1366A non-@code{nil} value means the fringes outside the display margins;
1367othersize they are between the margin and the text.
b8d4c8d0
GM
1368
1369@item window_end_pos
1370This is computed as @code{z} minus the buffer position of the last glyph
1371in the current matrix of the window. The value is only valid if
1372@code{window_end_valid} is not @code{nil}.
1373
1374@item window_end_bytepos
1375The byte position corresponding to @code{window_end_pos}.
1376
1377@item window_end_vpos
1378The window-relative vertical position of the line containing
1379@code{window_end_pos}.
1380
1381@item window_end_valid
1382This field is set to a non-@code{nil} value if @code{window_end_pos} is truly
7d2d7cc0 1383valid. This is @code{nil} if nontrivial redisplay is pre-empted, since in that
b8d4c8d0
GM
1384case the display that @code{window_end_pos} was computed for did not get
1385onto the screen.
1386
b8d4c8d0
GM
1387@item cursor
1388A structure describing where the cursor is in this window.
1389
1390@item last_cursor
1391The value of @code{cursor} as of the last redisplay that finished.
1392
1393@item phys_cursor
1394A structure describing where the cursor of this window physically is.
1395
1396@item phys_cursor_type
7d2d7cc0
GM
1397@c FIXME What is this?
1398@c itemx phys_cursor_ascent
1399@itemx phys_cursor_height
1400@itemx phys_cursor_width
1401The type, height, and width of the cursor that was last displayed on
1402this window.
b8d4c8d0
GM
1403
1404@item phys_cursor_on_p
1405This field is non-zero if the cursor is physically on.
1406
1407@item cursor_off_p
7d2d7cc0
GM
1408Non-zero means the cursor in this window is logically off. This is
1409used for blinking the cursor.
b8d4c8d0
GM
1410
1411@item last_cursor_off_p
1412This field contains the value of @code{cursor_off_p} as of the time of
1413the last redisplay.
1414
1415@item must_be_updated_p
1416This is set to 1 during redisplay when this window must be updated.
1417
1418@item hscroll
1419This is the number of columns that the display in the window is scrolled
1420horizontally to the left. Normally, this is 0.
1421
1422@item vscroll
1423Vertical scroll amount, in pixels. Normally, this is 0.
1424
1425@item dedicated
1426Non-@code{nil} if this window is dedicated to its buffer.
1427
1428@item display_table
1429The window's display table, or @code{nil} if none is specified for it.
1430
1431@item update_mode_line
1432Non-@code{nil} means this window's mode line needs to be updated.
1433
1434@item base_line_number
1435The line number of a certain position in the buffer, or @code{nil}.
1436This is used for displaying the line number of point in the mode line.
1437
1438@item base_line_pos
1439The position in the buffer for which the line number is known, or
7d2d7cc0
GM
1440@code{nil} meaning none is known. If it is a buffer, don't display
1441the line number as long as the window shows that buffer.
b8d4c8d0
GM
1442
1443@item region_showing
1444If the region (or part of it) is highlighted in this window, this field
1445holds the mark position that made one end of that region. Otherwise,
1446this field is @code{nil}.
1447
1448@item column_number_displayed
1449The column number currently displayed in this window's mode line, or @code{nil}
1450if column numbers are not being displayed.
1451
1452@item current_matrix
7d2d7cc0
GM
1453@itemx desired_matrix
1454Glyph matrices describing the current and desired display of this window.
b8d4c8d0
GM
1455@end table
1456
1457@node Process Internals
520f2425 1458@subsection Process Internals
b8d4c8d0
GM
1459@cindex internals, of process
1460@cindex process internals
1461
7d2d7cc0
GM
1462 The fields of a process (for a complete list, see the definition of
1463@code{struct Lisp_Process} in @file{process.h}) include:
b8d4c8d0
GM
1464
1465@table @code
1466@item name
1467A string, the name of the process.
1468
1469@item command
1470A list containing the command arguments that were used to start this
c73e02fa
GM
1471process. For a network or serial process, it is @code{nil} if the
1472process is running or @code{t} if the process is stopped.
b8d4c8d0
GM
1473
1474@item filter
7d2d7cc0
GM
1475If non-@code{nil}, a function used to accept output from the process
1476instead of a buffer.
b8d4c8d0
GM
1477
1478@item sentinel
7d2d7cc0
GM
1479If non-@code{nil}, a function called whenever the state of the process
1480changes.
b8d4c8d0
GM
1481
1482@item buffer
1483The associated buffer of the process.
1484
1485@item pid
1486An integer, the operating system's process @acronym{ID}.
7d2d7cc0 1487Pseudo-processes such as network or serial connections use a value of 0.
b8d4c8d0
GM
1488
1489@item childp
7d2d7cc0
GM
1490A flag, @code{t} if this is really a child process. For a network or
1491serial connection, it is a plist based on the arguments to
1492@code{make-network-process} or @code{make-serial-process}.
b8d4c8d0
GM
1493
1494@item mark
1495A marker indicating the position of the end of the last output from this
1496process inserted into the buffer. This is often but not always the end
1497of the buffer.
1498
1499@item kill_without_query
ee2d5b5e
CY
1500If this is non-zero, killing Emacs while this process is still running
1501does not ask for confirmation about killing the process.
b8d4c8d0 1502
7d2d7cc0
GM
1503@item raw_status
1504The raw process status, as returned by the @code{wait} system call.
b8d4c8d0
GM
1505
1506@item status
1507The process status, as @code{process-status} should return it.
1508
1509@item tick
1510@itemx update_tick
1511If these two fields are not equal, a change in the status of the process
1512needs to be reported, either by running the sentinel or by inserting a
1513message in the process buffer.
1514
1515@item pty_flag
a2eb421b 1516Non-@code{nil} if communication with the subprocess uses a pty;
b8d4c8d0
GM
1517@code{nil} if it uses a pipe.
1518
1519@item infd
1520The file descriptor for input from the process.
1521
1522@item outfd
1523The file descriptor for output to the process.
1524
b8d4c8d0
GM
1525@item tty_name
1526The name of the terminal that the subprocess is using,
1527or @code{nil} if it is using pipes.
1528
1529@item decode_coding_system
1530Coding-system for decoding the input from this process.
1531
1532@item decoding_buf
1533A working buffer for decoding.
1534
1535@item decoding_carryover
1536Size of carryover in decoding.
1537
1538@item encode_coding_system
1539Coding-system for encoding the output to this process.
1540
1541@item encoding_buf
1542A working buffer for encoding.
1543
b8d4c8d0
GM
1544@item inherit_coding_system_flag
1545Flag to set @code{coding-system} of the process buffer from the
1546coding system used to decode process output.
c73e02fa
GM
1547
1548@item type
1549Symbol indicating the type of process: @code{real}, @code{network},
7d2d7cc0 1550@code{serial}.
c73e02fa 1551
b8d4c8d0 1552@end table
7d2d7cc0 1553
d92d9c95
PE
1554@node C Integer Types
1555@section C Integer Types
1556@cindex integer types (C programming language)
1557
1558Here are some guidelines for use of integer types in the Emacs C
1559source code. These guidelines sometimes give competing advice; common
1560sense is advised.
1561
1562@itemize @bullet
1563@item
1564Avoid arbitrary limits. For example, avoid @code{int len = strlen
1565(s);} unless the length of @code{s} is required for other reasons to
1566fit in @code{int} range.
1567
1568@item
1569Do not assume that signed integer arithmetic wraps around on overflow.
1570This is no longer true of Emacs porting targets: signed integer
1571overflow has undefined behavior in practice, and can dump core or
1572even cause earlier or later code to behave ``illogically''. Unsigned
1573overflow does wrap around reliably, modulo a power of two.
1574
1575@item
1576Prefer signed types to unsigned, as code gets confusing when signed
1577and unsigned types are combined. Many other guidelines assume that
1578types are signed; in the rarer cases where unsigned types are needed,
1579similar advice may apply to the unsigned counterparts (e.g.,
1580@code{size_t} instead of @code{ptrdiff_t}, or @code{uintptr_t} instead
1581of @code{intptr_t}).
1582
1583@item
1584Prefer @code{int} for Emacs character codes, in the range 0 ..@: 0x3FFFFF.
1585
1586@item
1587Prefer @code{ptrdiff_t} for sizes, i.e., for integers bounded by the
1588maximum size of any individual C object or by the maximum number of
1589elements in any C array. This is part of Emacs's general preference
1590for signed types. Using @code{ptrdiff_t} limits objects to
1591@code{PTRDIFF_MAX} bytes, but larger objects would cause trouble
1592anyway since they would break pointer subtraction, so this does not
1593impose an arbitrary limit.
1594
1595@item
1596Prefer @code{intptr_t} for internal representations of pointers, or
1597for integers bounded only by the number of objects that can exist at
1598any given time or by the total number of bytes that can be allocated.
1599Currently Emacs sometimes uses other types when @code{intptr_t} would
1600be better; fixing this is lower priority, as the code works as-is on
1601Emacs's current porting targets.
1602
1603@item
1604Prefer the Emacs-defined type @code{EMACS_INT} for representing values
1605converted to or from Emacs Lisp fixnums, as fixnum arithmetic is based
1606on @code{EMACS_INT}.
1607
1608@item
1609When representing a system value (such as a file size or a count of
1610seconds since the Epoch), prefer the corresponding system type (e.g.,
1611@code{off_t}, @code{time_t}). Do not assume that a system type is
1612signed, unless this assumption is known to be safe. For example,
1613although @code{off_t} is always signed, @code{time_t} need not be.
1614
1615@item
1616Prefer the Emacs-defined type @code{printmax_t} for representing
1617values that might be any signed integer value that can be printed,
1618using a @code{printf}-family function.
1619
1620@item
1621Prefer @code{intmax_t} for representing values that might be any
1622signed integer value.
1623
1624@item
1625In bitfields, prefer @code{unsigned int} or @code{signed int} to
1626@code{int}, as @code{int} is less portable: it might be signed, and
1627might not be. Single-bit bit fields are invariably @code{unsigned
1628int} so that their values are 0 and 1.
1629
1630@item
1631In C, Emacs commonly uses @code{bool}, 1, and 0 for boolean values.
1632Using @code{bool} for booleans can make programs easier to read and a
1633bit faster than using @code{int}. Although it is also OK to use
1634@code{int}, this older style is gradually being phased out. When
1635using @code{bool}, respect the limitations of the replacement
1636implementation of @code{bool}, as documented in the source file
1637@file{lib/stdbool.in.h}, so that Emacs remains portable to pre-C99
1638platforms.
1639@end itemize
1640
7d2d7cc0 1641@c FIXME Mention src/globals.h somewhere in this file?