Commit | Line | Data |
---|---|---|
b8d4c8d0 GM |
1 | @c -*-texinfo-*- |
2 | @c This is part of the GNU Emacs Lisp Reference Manual. | |
ab422c4d PE |
3 | @c Copyright (C) 1990-1993, 1998-1999, 2001-2013 Free Software |
4 | @c Foundation, Inc. | |
b8d4c8d0 | 5 | @c See the file elisp.texi for copying conditions. |
ecc6530d | 6 | @node GNU Emacs Internals |
b8d4c8d0 GM |
7 | @appendix GNU Emacs Internals |
8 | ||
9 | This chapter describes how the runnable Emacs executable is dumped with | |
10 | the preloaded Lisp libraries in it, how storage is allocated, and some | |
11 | internal aspects of GNU Emacs that may be of interest to C programmers. | |
12 | ||
13 | @menu | |
14 | * Building Emacs:: How the dumped Emacs is made. | |
333f9019 | 15 | * Pure Storage:: Kludge to make preloaded Lisp functions shareable. |
b8d4c8d0 GM |
16 | * Garbage Collection:: Reclaiming space for Lisp objects no longer used. |
17 | * Memory Usage:: Info about total size of Lisp objects made so far. | |
18 | * Writing Emacs Primitives:: Writing C code for Emacs. | |
19 | * Object Internals:: Data formats of buffers, windows, processes. | |
d92d9c95 | 20 | * C Integer Types:: How C integer types are used inside Emacs. |
b8d4c8d0 GM |
21 | @end menu |
22 | ||
23 | @node Building Emacs | |
520f2425 | 24 | @section Building Emacs |
b8d4c8d0 GM |
25 | @cindex building Emacs |
26 | @pindex temacs | |
27 | ||
28 | This section explains the steps involved in building the Emacs | |
29 | executable. You don't have to know this material to build and install | |
30 | Emacs, since the makefiles do all these things automatically. This | |
d15aac68 | 31 | information is pertinent to Emacs developers. |
b8d4c8d0 GM |
32 | |
33 | Compilation of the C source files in the @file{src} directory | |
34 | produces an executable file called @file{temacs}, also called a | |
d15aac68 CY |
35 | @dfn{bare impure Emacs}. It contains the Emacs Lisp interpreter and |
36 | I/O routines, but not the editing commands. | |
b8d4c8d0 GM |
37 | |
38 | @cindex @file{loadup.el} | |
d15aac68 CY |
39 | The command @w{@command{temacs -l loadup}} would run @file{temacs} |
40 | and direct it to load @file{loadup.el}. The @code{loadup} library | |
41 | loads additional Lisp libraries, which set up the normal Emacs editing | |
42 | environment. After this step, the Emacs executable is no longer | |
43 | @dfn{bare}. | |
b8d4c8d0 GM |
44 | |
45 | @cindex dumping Emacs | |
d15aac68 CY |
46 | Because it takes some time to load the standard Lisp files, the |
47 | @file{temacs} executable usually isn't run directly by users. | |
48 | Instead, as one of the last steps of building Emacs, the command | |
49 | @samp{temacs -batch -l loadup dump} is run. The special @samp{dump} | |
50 | argument causes @command{temacs} to dump out an executable program, | |
51 | called @file{emacs}, which has all the standard Lisp files preloaded. | |
52 | (The @samp{-batch} argument prevents @file{temacs} from trying to | |
53 | initialize any of its data on the terminal, so that the tables of | |
54 | terminal information are empty in the dumped Emacs.) | |
55 | ||
56 | @cindex preloaded Lisp files | |
b578c9cc | 57 | @vindex preloaded-file-list |
d15aac68 CY |
58 | The dumped @file{emacs} executable (also called a @dfn{pure} Emacs) |
59 | is the one which is installed. The variable | |
60 | @code{preloaded-file-list} stores a list of the Lisp files preloaded | |
61 | into the dumped Emacs. If you port Emacs to a new operating system, | |
62 | and are not able to implement dumping, then Emacs must load | |
63 | @file{loadup.el} each time it starts. | |
b8d4c8d0 GM |
64 | |
65 | @cindex @file{site-load.el} | |
b8d4c8d0 | 66 | You can specify additional files to preload by writing a library named |
520f2425 GM |
67 | @file{site-load.el} that loads them. You may need to rebuild Emacs |
68 | with an added definition | |
b8d4c8d0 GM |
69 | |
70 | @example | |
71 | #define SITELOAD_PURESIZE_EXTRA @var{n} | |
72 | @end example | |
73 | ||
74 | @noindent | |
520f2425 GM |
75 | to make @var{n} added bytes of pure space to hold the additional files; |
76 | see @file{src/puresize.h}. | |
b8d4c8d0 GM |
77 | (Try adding increments of 20000 until it is big enough.) However, the |
78 | advantage of preloading additional files decreases as machines get | |
79 | faster. On modern machines, it is usually not advisable. | |
80 | ||
81 | After @file{loadup.el} reads @file{site-load.el}, it finds the | |
82 | documentation strings for primitive and preloaded functions (and | |
83 | variables) in the file @file{etc/DOC} where they are stored, by | |
84 | calling @code{Snarf-documentation} (@pxref{Definition of | |
85 | Snarf-documentation,, Accessing Documentation}). | |
86 | ||
87 | @cindex @file{site-init.el} | |
88 | @cindex preloading additional functions and variables | |
89 | You can specify other Lisp expressions to execute just before dumping | |
90 | by putting them in a library named @file{site-init.el}. This file is | |
91 | executed after the documentation strings are found. | |
92 | ||
93 | If you want to preload function or variable definitions, there are | |
94 | three ways you can do this and make their documentation strings | |
95 | accessible when you subsequently run Emacs: | |
96 | ||
97 | @itemize @bullet | |
98 | @item | |
99 | Arrange to scan these files when producing the @file{etc/DOC} file, | |
100 | and load them with @file{site-load.el}. | |
101 | ||
102 | @item | |
103 | Load the files with @file{site-init.el}, then copy the files into the | |
104 | installation directory for Lisp files when you install Emacs. | |
105 | ||
106 | @item | |
5906db45 GM |
107 | Specify a @code{nil} value for @code{byte-compile-dynamic-docstrings} |
108 | as a local variable in each of these files, and load them with either | |
109 | @file{site-load.el} or @file{site-init.el}. (This method has the | |
110 | drawback that the documentation strings take up space in Emacs all the | |
111 | time.) | |
b8d4c8d0 GM |
112 | @end itemize |
113 | ||
114 | It is not advisable to put anything in @file{site-load.el} or | |
115 | @file{site-init.el} that would alter any of the features that users | |
116 | expect in an ordinary unmodified Emacs. If you feel you must override | |
117 | normal features for your site, do it with @file{default.el}, so that | |
118 | users can override your changes if they wish. @xref{Startup Summary}. | |
06a4f110 GM |
119 | Note that if either @file{site-load.el} or @file{site-init.el} changes |
120 | @code{load-path}, the changes will be lost after dumping. | |
121 | @xref{Library Search}. To make a permanent change to | |
122 | @code{load-path}, use the @option{--enable-locallisppath} option | |
123 | of @command{configure}. | |
b8d4c8d0 | 124 | |
520f2425 GM |
125 | In a package that can be preloaded, it is sometimes necessary (or |
126 | useful) to delay certain evaluations until Emacs subsequently starts | |
127 | up. The vast majority of such cases relate to the values of | |
128 | customizable variables. For example, @code{tutorial-directory} is a | |
129 | variable defined in @file{startup.el}, which is preloaded. The default | |
130 | value is set based on @code{data-directory}. The variable needs to | |
131 | access the value of @code{data-directory} when Emacs starts, not when | |
132 | it is dumped, because the Emacs executable has probably been installed | |
133 | in a different location since it was dumped. | |
134 | ||
135 | @defun custom-initialize-delay symbol value | |
136 | This function delays the initialization of @var{symbol} to the next | |
137 | Emacs start. You normally use this function by specifying it as the | |
138 | @code{:initialize} property of a customizable variable. (The argument | |
f003f294 | 139 | @var{value} is unused, and is provided only for compatibility with the |
520f2425 GM |
140 | form Custom expects.) |
141 | @end defun | |
b8d4c8d0 | 142 | |
520f2425 GM |
143 | In the unlikely event that you need a more general functionality than |
144 | @code{custom-initialize-delay} provides, you can use | |
145 | @code{before-init-hook} (@pxref{Startup Summary}). | |
b8d4c8d0 GM |
146 | |
147 | @defun dump-emacs to-file from-file | |
148 | @cindex unexec | |
149 | This function dumps the current state of Emacs into an executable file | |
150 | @var{to-file}. It takes symbols from @var{from-file} (this is normally | |
151 | the executable file @file{temacs}). | |
152 | ||
153 | If you want to use this function in an Emacs that was already dumped, | |
154 | you must run Emacs with @samp{-batch}. | |
155 | @end defun | |
156 | ||
157 | @node Pure Storage | |
520f2425 | 158 | @section Pure Storage |
b8d4c8d0 GM |
159 | @cindex pure storage |
160 | ||
161 | Emacs Lisp uses two kinds of storage for user-created Lisp objects: | |
162 | @dfn{normal storage} and @dfn{pure storage}. Normal storage is where | |
999dd333 GM |
163 | all the new data created during an Emacs session are kept |
164 | (@pxref{Garbage Collection}). Pure storage is used for certain data | |
165 | in the preloaded standard Lisp files---data that should never change | |
166 | during actual use of Emacs. | |
b8d4c8d0 | 167 | |
d15aac68 | 168 | Pure storage is allocated only while @command{temacs} is loading the |
b8d4c8d0 GM |
169 | standard preloaded Lisp libraries. In the file @file{emacs}, it is |
170 | marked as read-only (on operating systems that permit this), so that | |
171 | the memory space can be shared by all the Emacs jobs running on the | |
172 | machine at once. Pure storage is not expandable; a fixed amount is | |
173 | allocated when Emacs is compiled, and if that is not sufficient for | |
174 | the preloaded libraries, @file{temacs} allocates dynamic memory for | |
999dd333 GM |
175 | the part that didn't fit. The resulting image will work, but garbage |
176 | collection (@pxref{Garbage Collection}) is disabled in this situation, | |
177 | causing a memory leak. Such an overflow normally won't happen unless | |
178 | you try to preload additional libraries or add features to the | |
179 | standard ones. Emacs will display a warning about the overflow when | |
180 | it starts. If this happens, you should increase the compilation | |
181 | parameter @code{SYSTEM_PURESIZE_EXTRA} in the file | |
182 | @file{src/puresize.h} and rebuild Emacs. | |
b8d4c8d0 GM |
183 | |
184 | @defun purecopy object | |
185 | This function makes a copy in pure storage of @var{object}, and returns | |
186 | it. It copies a string by simply making a new string with the same | |
187 | characters, but without text properties, in pure storage. It | |
188 | recursively copies the contents of vectors and cons cells. It does | |
189 | not make copies of other objects such as symbols, but just returns | |
190 | them unchanged. It signals an error if asked to copy markers. | |
191 | ||
192 | This function is a no-op except while Emacs is being built and dumped; | |
999dd333 | 193 | it is usually called only in preloaded Lisp files. |
b8d4c8d0 GM |
194 | @end defun |
195 | ||
196 | @defvar pure-bytes-used | |
197 | The value of this variable is the number of bytes of pure storage | |
198 | allocated so far. Typically, in a dumped Emacs, this number is very | |
199 | close to the total amount of pure storage available---if it were not, | |
200 | we would preallocate less. | |
201 | @end defvar | |
202 | ||
203 | @defvar purify-flag | |
204 | This variable determines whether @code{defun} should make a copy of the | |
205 | function definition in pure storage. If it is non-@code{nil}, then the | |
206 | function definition is copied into pure storage. | |
207 | ||
208 | This flag is @code{t} while loading all of the basic functions for | |
333f9019 | 209 | building Emacs initially (allowing those functions to be shareable and |
b8d4c8d0 GM |
210 | non-collectible). Dumping Emacs as an executable always writes |
211 | @code{nil} in this variable, regardless of the value it actually has | |
212 | before and after dumping. | |
213 | ||
214 | You should not change this flag in a running Emacs. | |
215 | @end defvar | |
216 | ||
217 | @node Garbage Collection | |
520f2425 | 218 | @section Garbage Collection |
b8d4c8d0 GM |
219 | |
220 | @cindex memory allocation | |
d15aac68 CY |
221 | When a program creates a list or the user defines a new function |
222 | (such as by loading a library), that data is placed in normal storage. | |
223 | If normal storage runs low, then Emacs asks the operating system to | |
224 | allocate more memory. Different types of Lisp objects, such as | |
f3372c87 DA |
225 | symbols, cons cells, small vectors, markers, etc., are segregated in |
226 | distinct blocks in memory. (Large vectors, long strings, buffers and | |
227 | certain other editing types, which are fairly large, are allocated in | |
228 | individual blocks, one per object; small strings are packed into blocks | |
229 | of 8k bytes, and small vectors are packed into blocks of 4k bytes). | |
230 | ||
231 | @cindex vector-like objects, storage | |
232 | @cindex storage of vector-like Lisp objects | |
233 | Beyond the basic vector, a lot of objects like window, buffer, and | |
234 | frame are managed as if they were vectors. The corresponding C data | |
235 | structures include the @code{struct vectorlike_header} field whose | |
74934dcc DA |
236 | @code{size} member contains the subtype enumerated by @code{enum pvec_type} |
237 | and an information about how many @code{Lisp_Object} fields this structure | |
238 | contains and what the size of the rest data is. This information is | |
239 | needed to calculate the memory footprint of an object, and used | |
240 | by the vector allocation code while iterating over the vector blocks. | |
b8d4c8d0 | 241 | |
d15aac68 CY |
242 | @cindex garbage collection |
243 | It is quite common to use some storage for a while, then release it | |
244 | by (for example) killing a buffer or deleting the last pointer to an | |
245 | object. Emacs provides a @dfn{garbage collector} to reclaim this | |
246 | abandoned storage. The garbage collector operates by finding and | |
247 | marking all Lisp objects that are still accessible to Lisp programs. | |
248 | To begin with, it assumes all the symbols, their values and associated | |
249 | function definitions, and any data presently on the stack, are | |
250 | accessible. Any objects that can be reached indirectly through other | |
251 | accessible objects are also accessible. | |
b8d4c8d0 GM |
252 | |
253 | When marking is finished, all objects still unmarked are garbage. No | |
254 | matter what the Lisp program or the user does, it is impossible to refer | |
255 | to them, since there is no longer a way to reach them. Their space | |
256 | might as well be reused, since no one will miss them. The second | |
257 | (``sweep'') phase of the garbage collector arranges to reuse them. | |
258 | ||
259 | @c ??? Maybe add something describing weak hash tables here? | |
260 | ||
261 | @cindex free list | |
262 | The sweep phase puts unused cons cells onto a @dfn{free list} | |
263 | for future allocation; likewise for symbols and markers. It compacts | |
264 | the accessible strings so they occupy fewer 8k blocks; then it frees the | |
f3372c87 DA |
265 | other 8k blocks. Unreachable vectors from vector blocks are coalesced |
266 | to create largest possible free areas; if a free area spans a complete | |
267 | 4k block, that block is freed. Otherwise, the free area is recorded | |
268 | in a free list array, where each entry corresponds to a free list | |
269 | of areas of the same size. Large vectors, buffers, and other large | |
270 | objects are allocated and freed individually. | |
b8d4c8d0 GM |
271 | |
272 | @cindex CL note---allocate more storage | |
273 | @quotation | |
274 | @b{Common Lisp note:} Unlike other Lisps, GNU Emacs Lisp does not | |
275 | call the garbage collector when the free list is empty. Instead, it | |
276 | simply requests the operating system to allocate more storage, and | |
277 | processing continues until @code{gc-cons-threshold} bytes have been | |
278 | used. | |
279 | ||
280 | This means that you can make sure that the garbage collector will not | |
281 | run during a certain portion of a Lisp program by calling the garbage | |
282 | collector explicitly just before it (provided that portion of the | |
283 | program does not use so much space as to force a second garbage | |
284 | collection). | |
285 | @end quotation | |
286 | ||
287 | @deffn Command garbage-collect | |
288 | This command runs a garbage collection, and returns information on | |
289 | the amount of space in use. (Garbage collection can also occur | |
290 | spontaneously if you use more than @code{gc-cons-threshold} bytes of | |
291 | Lisp data since the previous garbage collection.) | |
292 | ||
74934dcc DA |
293 | @code{garbage-collect} returns a list with information on amount of space in |
294 | use, where each entry has the form @samp{(@var{name} @var{size} @var{used})} | |
295 | or @samp{(@var{name} @var{size} @var{used} @var{free})}. In the entry, | |
296 | @var{name} is a symbol describing the kind of objects this entry represents, | |
297 | @var{size} is the number of bytes used by each one, @var{used} is the number | |
298 | of those objects that were found live in the heap, and optional @var{free} is | |
299 | the number of those objects that are not live but that Emacs keeps around for | |
300 | future allocations. So an overall result is: | |
b8d4c8d0 GM |
301 | |
302 | @example | |
f24f2e22 | 303 | ((@code{conses} @var{cons-size} @var{used-conses} @var{free-conses}) |
74934dcc DA |
304 | (@code{symbols} @var{symbol-size} @var{used-symbols} @var{free-symbols}) |
305 | (@code{miscs} @var{misc-size} @var{used-miscs} @var{free-miscs}) | |
306 | (@code{strings} @var{string-size} @var{used-strings} @var{free-strings}) | |
307 | (@code{string-bytes} @var{byte-size} @var{used-bytes}) | |
308 | (@code{vectors} @var{vector-size} @var{used-vectors}) | |
309 | (@code{vector-slots} @var{slot-size} @var{used-slots} @var{free-slots}) | |
310 | (@code{floats} @var{float-size} @var{used-floats} @var{free-floats}) | |
311 | (@code{intervals} @var{interval-size} @var{used-intervals} @var{free-intervals}) | |
312 | (@code{buffers} @var{buffer-size} @var{used-buffers}) | |
313 | (@code{heap} @var{unit-size} @var{total-size} @var{free-size})) | |
b8d4c8d0 GM |
314 | @end example |
315 | ||
316 | Here is an example: | |
317 | ||
318 | @example | |
b8d4c8d0 | 319 | (garbage-collect) |
74934dcc DA |
320 | @result{} ((conses 16 49126 8058) (symbols 48 14607 0) |
321 | (miscs 40 34 56) (strings 32 2942 2607) | |
322 | (string-bytes 1 78607) (vectors 16 7247) | |
323 | (vector-slots 8 341609 29474) (floats 8 71 102) | |
324 | (intervals 56 27 26) (buffers 944 8) | |
325 | (heap 1024 11715 2678)) | |
b8d4c8d0 GM |
326 | @end example |
327 | ||
74934dcc DA |
328 | Below is a table explaining each element. Note that last @code{heap} entry |
329 | is optional and present only if an underlying @code{malloc} implementation | |
330 | provides @code{mallinfo} function. | |
b8d4c8d0 GM |
331 | |
332 | @table @var | |
74934dcc | 333 | @item cons-size |
39670ef4 | 334 | Internal size of a cons cell, i.e., @code{sizeof (struct Lisp_Cons)}. |
74934dcc | 335 | |
b8d4c8d0 GM |
336 | @item used-conses |
337 | The number of cons cells in use. | |
338 | ||
339 | @item free-conses | |
74934dcc DA |
340 | The number of cons cells for which space has been obtained from |
341 | the operating system, but that are not currently being used. | |
b8d4c8d0 | 342 | |
74934dcc | 343 | @item symbol-size |
39670ef4 | 344 | Internal size of a symbol, i.e., @code{sizeof (struct Lisp_Symbol)}. |
74934dcc DA |
345 | |
346 | @item used-symbols | |
b8d4c8d0 GM |
347 | The number of symbols in use. |
348 | ||
74934dcc DA |
349 | @item free-symbols |
350 | The number of symbols for which space has been obtained from | |
351 | the operating system, but that are not currently being used. | |
352 | ||
353 | @item misc-size | |
39670ef4 | 354 | Internal size of a miscellaneous entity, i.e., |
74934dcc DA |
355 | @code{sizeof (union Lisp_Misc)}, which is a size of the |
356 | largest type enumerated in @code{enum Lisp_Misc_Type}. | |
b8d4c8d0 GM |
357 | |
358 | @item used-miscs | |
74934dcc DA |
359 | The number of miscellaneous objects in use. These include markers |
360 | and overlays, plus certain objects not visible to users. | |
b8d4c8d0 GM |
361 | |
362 | @item free-miscs | |
363 | The number of miscellaneous objects for which space has been obtained | |
364 | from the operating system, but that are not currently being used. | |
365 | ||
74934dcc | 366 | @item string-size |
39670ef4 | 367 | Internal size of a string header, i.e., @code{sizeof (struct Lisp_String)}. |
74934dcc DA |
368 | |
369 | @item used-strings | |
370 | The number of string headers in use. | |
371 | ||
372 | @item free-strings | |
373 | The number of string headers for which space has been obtained | |
374 | from the operating system, but that are not currently being used. | |
375 | ||
376 | @item byte-size | |
377 | This is used for convenience and equals to @code{sizeof (char)}. | |
378 | ||
379 | @item used-bytes | |
380 | The total size of all string data in bytes. | |
381 | ||
382 | @item vector-size | |
39670ef4 | 383 | Internal size of a vector header, i.e., @code{sizeof (struct Lisp_Vector)}. |
74934dcc DA |
384 | |
385 | @item used-vectors | |
386 | The number of vector headers allocated from the vector blocks. | |
387 | ||
388 | @item slot-size | |
389 | Internal size of a vector slot, always equal to @code{sizeof (Lisp_Object)}. | |
390 | ||
391 | @item used-slots | |
392 | The number of slots in all used vectors. | |
393 | ||
394 | @item free-slots | |
395 | The number of free slots in all vector blocks. | |
b8d4c8d0 | 396 | |
74934dcc | 397 | @item float-size |
39670ef4 | 398 | Internal size of a float object, i.e., @code{sizeof (struct Lisp_Float)}. |
74934dcc | 399 | (Do not confuse it with the native platform @code{float} or @code{double}.) |
b8d4c8d0 GM |
400 | |
401 | @item used-floats | |
b8d4c8d0 GM |
402 | The number of floats in use. |
403 | ||
404 | @item free-floats | |
74934dcc DA |
405 | The number of floats for which space has been obtained from |
406 | the operating system, but that are not currently being used. | |
407 | ||
408 | @item interval-size | |
39670ef4 | 409 | Internal size of an interval object, i.e., @code{sizeof (struct interval)}. |
b8d4c8d0 GM |
410 | |
411 | @item used-intervals | |
74934dcc | 412 | The number of intervals in use. |
b8d4c8d0 GM |
413 | |
414 | @item free-intervals | |
74934dcc DA |
415 | The number of intervals for which space has been obtained from |
416 | the operating system, but that are not currently being used. | |
b8d4c8d0 | 417 | |
74934dcc | 418 | @item buffer-size |
39670ef4 | 419 | Internal size of a buffer, i.e., @code{sizeof (struct buffer)}. |
74934dcc | 420 | (Do not confuse with the value returned by @code{buffer-size} function.) |
b8d4c8d0 | 421 | |
74934dcc DA |
422 | @item used-buffers |
423 | The number of buffer objects in use. This includes killed buffers | |
39670ef4 | 424 | invisible to users, i.e., all buffers in @code{all_buffers} list. |
74934dcc DA |
425 | |
426 | @item unit-size | |
427 | The unit of heap space measurement, always equal to 1024 bytes. | |
428 | ||
429 | @item total-size | |
430 | Total heap size, in @var{unit-size} units. | |
431 | ||
432 | @item free-size | |
433 | Heap space which is not currently used, in @var{unit-size} units. | |
b8d4c8d0 GM |
434 | @end table |
435 | ||
999dd333 | 436 | If there was overflow in pure space (@pxref{Pure Storage}), |
b8d4c8d0 | 437 | @code{garbage-collect} returns @code{nil}, because a real garbage |
9658dc2a | 438 | collection cannot be done. |
b8d4c8d0 GM |
439 | @end deffn |
440 | ||
441 | @defopt garbage-collection-messages | |
442 | If this variable is non-@code{nil}, Emacs displays a message at the | |
443 | beginning and end of garbage collection. The default value is | |
999dd333 | 444 | @code{nil}. |
b8d4c8d0 GM |
445 | @end defopt |
446 | ||
447 | @defvar post-gc-hook | |
448 | This is a normal hook that is run at the end of garbage collection. | |
449 | Garbage collection is inhibited while the hook functions run, so be | |
450 | careful writing them. | |
451 | @end defvar | |
452 | ||
453 | @defopt gc-cons-threshold | |
454 | The value of this variable is the number of bytes of storage that must | |
455 | be allocated for Lisp objects after one garbage collection in order to | |
74934dcc DA |
456 | trigger another garbage collection. You can use the result returned by |
457 | @code{garbage-collect} to get an information about size of the particular | |
458 | object type; space allocated to the contents of buffers does not count. | |
459 | Note that the subsequent garbage collection does not happen immediately | |
460 | when the threshold is exhausted, but only the next time the Lisp interpreter | |
461 | is called. | |
462 | ||
463 | The initial threshold value is @code{GC_DEFAULT_THRESHOLD}, defined in | |
464 | @file{alloc.c}. Since it's defined in @code{word_size} units, the value | |
465 | is 400,000 for the default 32-bit configuration and 800,000 for the 64-bit | |
466 | one. If you specify a larger value, garbage collection will happen less | |
467 | often. This reduces the amount of time spent garbage collecting, but | |
468 | increases total memory use. You may want to do this when running a program | |
469 | that creates lots of Lisp data. | |
470 | ||
471 | You can make collections more frequent by specifying a smaller value, down | |
472 | to 1/10th of @code{GC_DEFAULT_THRESHOLD}. A value less than this minimum | |
473 | will remain in effect only until the subsequent garbage collection, at which | |
474 | time @code{garbage-collect} will set the threshold back to the minimum. | |
b8d4c8d0 GM |
475 | @end defopt |
476 | ||
477 | @defopt gc-cons-percentage | |
478 | The value of this variable specifies the amount of consing before a | |
479 | garbage collection occurs, as a fraction of the current heap size. | |
480 | This criterion and @code{gc-cons-threshold} apply in parallel, and | |
481 | garbage collection occurs only when both criteria are satisfied. | |
482 | ||
483 | As the heap size increases, the time to perform a garbage collection | |
484 | increases. Thus, it can be desirable to do them less frequently in | |
485 | proportion. | |
486 | @end defopt | |
487 | ||
488 | The value returned by @code{garbage-collect} describes the amount of | |
489 | memory used by Lisp data, broken down by data type. By contrast, the | |
490 | function @code{memory-limit} provides information on the total amount of | |
491 | memory Emacs is currently using. | |
492 | ||
b8d4c8d0 GM |
493 | @defun memory-limit |
494 | This function returns the address of the last byte Emacs has allocated, | |
495 | divided by 1024. We divide the value by 1024 to make sure it fits in a | |
496 | Lisp integer. | |
497 | ||
498 | You can use this to get a general idea of how your actions affect the | |
499 | memory usage. | |
500 | @end defun | |
501 | ||
502 | @defvar memory-full | |
d15aac68 | 503 | This variable is @code{t} if Emacs is nearly out of memory for Lisp |
b8d4c8d0 GM |
504 | objects, and @code{nil} otherwise. |
505 | @end defvar | |
506 | ||
507 | @defun memory-use-counts | |
508 | This returns a list of numbers that count the number of objects | |
509 | created in this Emacs session. Each of these counters increments for | |
510 | a certain kind of object. See the documentation string for details. | |
511 | @end defun | |
512 | ||
513 | @defvar gcs-done | |
514 | This variable contains the total number of garbage collections | |
515 | done so far in this Emacs session. | |
516 | @end defvar | |
517 | ||
518 | @defvar gc-elapsed | |
519 | This variable contains the total number of seconds of elapsed time | |
520 | during garbage collection so far in this Emacs session, as a floating | |
521 | point number. | |
522 | @end defvar | |
523 | ||
524 | @node Memory Usage | |
525 | @section Memory Usage | |
526 | @cindex memory usage | |
527 | ||
528 | These functions and variables give information about the total amount | |
529 | of memory allocation that Emacs has done, broken down by data type. | |
530 | Note the difference between these and the values returned by | |
01a6dcc8 | 531 | @code{garbage-collect}; those count objects that currently exist, but |
b8d4c8d0 GM |
532 | these count the number or size of all allocations, including those for |
533 | objects that have since been freed. | |
534 | ||
535 | @defvar cons-cells-consed | |
536 | The total number of cons cells that have been allocated so far | |
537 | in this Emacs session. | |
538 | @end defvar | |
539 | ||
540 | @defvar floats-consed | |
541 | The total number of floats that have been allocated so far | |
542 | in this Emacs session. | |
543 | @end defvar | |
544 | ||
545 | @defvar vector-cells-consed | |
546 | The total number of vector cells that have been allocated so far | |
547 | in this Emacs session. | |
548 | @end defvar | |
549 | ||
550 | @defvar symbols-consed | |
551 | The total number of symbols that have been allocated so far | |
552 | in this Emacs session. | |
553 | @end defvar | |
554 | ||
555 | @defvar string-chars-consed | |
556 | The total number of string characters that have been allocated so far | |
9658dc2a | 557 | in this session. |
b8d4c8d0 GM |
558 | @end defvar |
559 | ||
560 | @defvar misc-objects-consed | |
561 | The total number of miscellaneous objects that have been allocated so | |
9658dc2a | 562 | far in this session. These include markers and overlays, plus |
b8d4c8d0 GM |
563 | certain objects not visible to users. |
564 | @end defvar | |
565 | ||
566 | @defvar intervals-consed | |
567 | The total number of intervals that have been allocated so far | |
568 | in this Emacs session. | |
569 | @end defvar | |
570 | ||
571 | @defvar strings-consed | |
572 | The total number of strings that have been allocated so far in this | |
573 | Emacs session. | |
574 | @end defvar | |
575 | ||
576 | @node Writing Emacs Primitives | |
520f2425 | 577 | @section Writing Emacs Primitives |
b8d4c8d0 GM |
578 | @cindex primitive function internals |
579 | @cindex writing Emacs primitives | |
580 | ||
1df7defd | 581 | Lisp primitives are Lisp functions implemented in C@. The details of |
b8d4c8d0 GM |
582 | interfacing the C function so that Lisp can call it are handled by a few |
583 | C macros. The only way to really understand how to write new C code is | |
584 | to read the source, but we can explain some things here. | |
585 | ||
586 | An example of a special form is the definition of @code{or}, from | |
587 | @file{eval.c}. (An ordinary function would have the same general | |
588 | appearance.) | |
589 | ||
590 | @cindex garbage collection protection | |
591 | @smallexample | |
592 | @group | |
593 | DEFUN ("or", For, Sor, 0, UNEVALLED, 0, | |
d15aac68 CY |
594 | doc: /* Eval args until one of them yields non-nil, then return |
595 | that value. | |
596 | The remaining args are not evalled at all. | |
b8d4c8d0 GM |
597 | If all args return nil, return nil. |
598 | @end group | |
599 | @group | |
600 | usage: (or CONDITIONS ...) */) | |
b88746ba | 601 | (Lisp_Object args) |
b8d4c8d0 GM |
602 | @{ |
603 | register Lisp_Object val = Qnil; | |
604 | struct gcpro gcpro1; | |
605 | @end group | |
606 | ||
607 | @group | |
608 | GCPRO1 (args); | |
609 | @end group | |
610 | ||
611 | @group | |
612 | while (CONSP (args)) | |
613 | @{ | |
7d2d7cc0 | 614 | val = eval_sub (XCAR (args)); |
b8d4c8d0 GM |
615 | if (!NILP (val)) |
616 | break; | |
617 | args = XCDR (args); | |
618 | @} | |
619 | @end group | |
620 | ||
621 | @group | |
622 | UNGCPRO; | |
623 | return val; | |
624 | @} | |
625 | @end group | |
626 | @end smallexample | |
627 | ||
628 | @cindex @code{DEFUN}, C macro to define Lisp primitives | |
629 | Let's start with a precise explanation of the arguments to the | |
630 | @code{DEFUN} macro. Here is a template for them: | |
631 | ||
632 | @example | |
633 | DEFUN (@var{lname}, @var{fname}, @var{sname}, @var{min}, @var{max}, @var{interactive}, @var{doc}) | |
634 | @end example | |
635 | ||
636 | @table @var | |
637 | @item lname | |
638 | This is the name of the Lisp symbol to define as the function name; in | |
639 | the example above, it is @code{or}. | |
640 | ||
641 | @item fname | |
d15aac68 CY |
642 | This is the C function name for this function. This is the name that |
643 | is used in C code for calling the function. The name is, by | |
644 | convention, @samp{F} prepended to the Lisp name, with all dashes | |
645 | (@samp{-}) in the Lisp name changed to underscores. Thus, to call | |
646 | this function from C code, call @code{For}. | |
b8d4c8d0 GM |
647 | |
648 | @item sname | |
649 | This is a C variable name to use for a structure that holds the data for | |
650 | the subr object that represents the function in Lisp. This structure | |
651 | conveys the Lisp symbol name to the initialization routine that will | |
652 | create the symbol and store the subr object as its definition. By | |
653 | convention, this name is always @var{fname} with @samp{F} replaced with | |
654 | @samp{S}. | |
655 | ||
656 | @item min | |
657 | This is the minimum number of arguments that the function requires. The | |
658 | function @code{or} allows a minimum of zero arguments. | |
659 | ||
660 | @item max | |
661 | This is the maximum number of arguments that the function accepts, if | |
662 | there is a fixed maximum. Alternatively, it can be @code{UNEVALLED}, | |
663 | indicating a special form that receives unevaluated arguments, or | |
664 | @code{MANY}, indicating an unlimited number of evaluated arguments (the | |
665 | equivalent of @code{&rest}). Both @code{UNEVALLED} and @code{MANY} are | |
9658dc2a GM |
666 | macros. If @var{max} is a number, it must be more than @var{min} but |
667 | less than 8. | |
b8d4c8d0 | 668 | |
84ebefe9 | 669 | @cindex interactive specification in primitives |
b8d4c8d0 | 670 | @item interactive |
84ebefe9 XF |
671 | This is an interactive specification, a string such as might be used |
672 | as the argument of @code{interactive} in a Lisp function. In the case | |
673 | of @code{or}, it is 0 (a null pointer), indicating that @code{or} | |
b02b5481 GM |
674 | cannot be called interactively. A value of @code{""} indicates a |
675 | function that should receive no arguments when called interactively. | |
84ebefe9 XF |
676 | If the value begins with a @samp{"(}, the string is evaluated as a |
677 | Lisp form. For example: | |
678 | ||
b02b5481 | 679 | @example |
84ebefe9 | 680 | @group |
b02b5481 GM |
681 | DEFUN ("foo", Ffoo, Sfoo, 0, UNEVALLED, |
682 | "(list (read-char-by-name \"Insert character: \")\ | |
84ebefe9 XF |
683 | (prefix-numeric-value current-prefix-arg)\ |
684 | t))", | |
685 | doc: /* @dots{} /*) | |
686 | @end group | |
b02b5481 | 687 | @end example |
b8d4c8d0 GM |
688 | |
689 | @item doc | |
690 | This is the documentation string. It uses C comment syntax rather | |
691 | than C string syntax because comment syntax requires nothing special | |
692 | to include multiple lines. The @samp{doc:} identifies the comment | |
693 | that follows as the documentation string. The @samp{/*} and @samp{*/} | |
694 | delimiters that begin and end the comment are not part of the | |
695 | documentation string. | |
696 | ||
697 | If the last line of the documentation string begins with the keyword | |
698 | @samp{usage:}, the rest of the line is treated as the argument list | |
699 | for documentation purposes. This way, you can use different argument | |
700 | names in the documentation string from the ones used in the C code. | |
701 | @samp{usage:} is required if the function has an unlimited number of | |
702 | arguments. | |
703 | ||
704 | All the usual rules for documentation strings in Lisp code | |
705 | (@pxref{Documentation Tips}) apply to C code documentation strings | |
706 | too. | |
707 | @end table | |
708 | ||
b88746ba | 709 | After the call to the @code{DEFUN} macro, you must write the |
d15aac68 CY |
710 | argument list for the C function, including the types for the |
711 | arguments. If the primitive accepts a fixed maximum number of Lisp | |
712 | arguments, there must be one C argument for each Lisp argument, and | |
713 | each argument must be of type @code{Lisp_Object}. (Various macros and | |
714 | functions for creating values of type @code{Lisp_Object} are declared | |
715 | in the file @file{lisp.h}.) If the primitive has no upper limit on | |
716 | the number of Lisp arguments, it must have exactly two C arguments: | |
717 | the first is the number of Lisp arguments, and the second is the | |
718 | address of a block containing their values. These have types | |
39670ef4 | 719 | @code{int} and @w{@code{Lisp_Object *}} respectively. Since |
74934dcc DA |
720 | @code{Lisp_Object} can hold any Lisp object of any data type, you |
721 | can determine the actual data type only at run time; so if you want | |
722 | a primitive to accept only a certain type of argument, you must check | |
723 | the type explicitly using a suitable predicate (@pxref{Type Predicates}). | |
724 | @cindex type checking internals | |
b8d4c8d0 GM |
725 | |
726 | @cindex @code{GCPRO} and @code{UNGCPRO} | |
727 | @cindex protect C variables from garbage collection | |
728 | Within the function @code{For} itself, note the use of the macros | |
d15aac68 CY |
729 | @code{GCPRO1} and @code{UNGCPRO}. These macros are defined for the |
730 | sake of the few platforms which do not use Emacs' default | |
731 | stack-marking garbage collector. The @code{GCPRO1} macro ``protects'' | |
732 | a variable from garbage collection, explicitly informing the garbage | |
733 | collector that that variable and all its contents must be as | |
734 | accessible. GC protection is necessary in any function which can | |
735 | perform Lisp evaluation by calling @code{eval_sub} or @code{Feval} as | |
736 | a subroutine, either directly or indirectly. | |
b8d4c8d0 GM |
737 | |
738 | It suffices to ensure that at least one pointer to each object is | |
d15aac68 | 739 | GC-protected. Thus, a particular local variable can do without |
b8d4c8d0 | 740 | protection if it is certain that the object it points to will be |
7d2d7cc0 | 741 | preserved by some other pointer (such as another local variable that |
d15aac68 CY |
742 | has a @code{GCPRO}). Otherwise, the local variable needs a |
743 | @code{GCPRO}. | |
b8d4c8d0 GM |
744 | |
745 | The macro @code{GCPRO1} protects just one local variable. If you | |
746 | want to protect two variables, use @code{GCPRO2} instead; repeating | |
747 | @code{GCPRO1} will not work. Macros @code{GCPRO3}, @code{GCPRO4}, | |
748 | @code{GCPRO5}, and @code{GCPRO6} also exist. All these macros | |
749 | implicitly use local variables such as @code{gcpro1}; you must declare | |
750 | these explicitly, with type @code{struct gcpro}. Thus, if you use | |
751 | @code{GCPRO2}, you must declare @code{gcpro1} and @code{gcpro2}. | |
b8d4c8d0 GM |
752 | |
753 | @code{UNGCPRO} cancels the protection of the variables that are | |
754 | protected in the current function. It is necessary to do this | |
755 | explicitly. | |
756 | ||
b8d4c8d0 GM |
757 | You must not use C initializers for static or global variables unless |
758 | the variables are never written once Emacs is dumped. These variables | |
759 | with initializers are allocated in an area of memory that becomes | |
760 | read-only (on certain operating systems) as a result of dumping Emacs. | |
761 | @xref{Pure Storage}. | |
762 | ||
b8d4c8d0 GM |
763 | @cindex @code{defsubr}, Lisp symbol for a primitive |
764 | Defining the C function is not enough to make a Lisp primitive | |
765 | available; you must also create the Lisp symbol for the primitive and | |
766 | store a suitable subr object in its function cell. The code looks like | |
767 | this: | |
768 | ||
769 | @example | |
7d2d7cc0 | 770 | defsubr (&@var{sname}); |
b8d4c8d0 GM |
771 | @end example |
772 | ||
773 | @noindent | |
7d2d7cc0 | 774 | Here @var{sname} is the name you used as the third argument to @code{DEFUN}. |
b8d4c8d0 GM |
775 | |
776 | If you add a new primitive to a file that already has Lisp primitives | |
777 | defined in it, find the function (near the end of the file) named | |
778 | @code{syms_of_@var{something}}, and add the call to @code{defsubr} | |
779 | there. If the file doesn't have this function, or if you create a new | |
780 | file, add to it a @code{syms_of_@var{filename}} (e.g., | |
781 | @code{syms_of_myfile}). Then find the spot in @file{emacs.c} where all | |
782 | of these functions are called, and add a call to | |
783 | @code{syms_of_@var{filename}} there. | |
784 | ||
785 | @anchor{Defining Lisp variables in C} | |
786 | @vindex byte-boolean-vars | |
787 | @cindex defining Lisp variables in C | |
788 | @cindex @code{DEFVAR_INT}, @code{DEFVAR_LISP}, @code{DEFVAR_BOOL} | |
789 | The function @code{syms_of_@var{filename}} is also the place to define | |
790 | any C variables that are to be visible as Lisp variables. | |
791 | @code{DEFVAR_LISP} makes a C variable of type @code{Lisp_Object} visible | |
792 | in Lisp. @code{DEFVAR_INT} makes a C variable of type @code{int} | |
793 | visible in Lisp with a value that is always an integer. | |
794 | @code{DEFVAR_BOOL} makes a C variable of type @code{int} visible in Lisp | |
795 | with a value that is either @code{t} or @code{nil}. Note that variables | |
796 | defined with @code{DEFVAR_BOOL} are automatically added to the list | |
797 | @code{byte-boolean-vars} used by the byte compiler. | |
798 | ||
7d2d7cc0 GM |
799 | @cindex defining customization variables in C |
800 | If you want to make a Lisp variables that is defined in C behave | |
801 | like one declared with @code{defcustom}, add an appropriate entry to | |
802 | @file{cus-start.el}. | |
803 | ||
b8d4c8d0 GM |
804 | @cindex @code{staticpro}, protection from GC |
805 | If you define a file-scope C variable of type @code{Lisp_Object}, | |
806 | you must protect it from garbage-collection by calling @code{staticpro} | |
807 | in @code{syms_of_@var{filename}}, like this: | |
808 | ||
809 | @example | |
810 | staticpro (&@var{variable}); | |
811 | @end example | |
812 | ||
813 | Here is another example function, with more complicated arguments. | |
814 | This comes from the code in @file{window.c}, and it demonstrates the use | |
815 | of macros and functions to manipulate Lisp objects. | |
816 | ||
817 | @smallexample | |
818 | @group | |
819 | DEFUN ("coordinates-in-window-p", Fcoordinates_in_window_p, | |
7d2d7cc0 GM |
820 | Scoordinates_in_window_p, 2, 2, 0, |
821 | doc: /* Return non-nil if COORDINATES are in WINDOW. | |
822 | ... | |
b8d4c8d0 GM |
823 | @end group |
824 | @group | |
7d2d7cc0 GM |
825 | or `right-margin' is returned. */) |
826 | (register Lisp_Object coordinates, Lisp_Object window) | |
b8d4c8d0 | 827 | @{ |
7d2d7cc0 GM |
828 | struct window *w; |
829 | struct frame *f; | |
b8d4c8d0 | 830 | int x, y; |
7d2d7cc0 | 831 | Lisp_Object lx, ly; |
b8d4c8d0 GM |
832 | @end group |
833 | ||
834 | @group | |
7d2d7cc0 GM |
835 | CHECK_LIVE_WINDOW (window); |
836 | w = XWINDOW (window); | |
837 | f = XFRAME (w->frame); | |
838 | CHECK_CONS (coordinates); | |
839 | lx = Fcar (coordinates); | |
840 | ly = Fcdr (coordinates); | |
841 | CHECK_NUMBER_OR_FLOAT (lx); | |
842 | CHECK_NUMBER_OR_FLOAT (ly); | |
843 | x = FRAME_PIXEL_X_FROM_CANON_X (f, lx) + FRAME_INTERNAL_BORDER_WIDTH(f); | |
844 | y = FRAME_PIXEL_Y_FROM_CANON_Y (f, ly) + FRAME_INTERNAL_BORDER_WIDTH(f); | |
b8d4c8d0 GM |
845 | @end group |
846 | ||
847 | @group | |
7d2d7cc0 | 848 | switch (coordinates_in_window (w, x, y)) |
b8d4c8d0 | 849 | @{ |
7d2d7cc0 | 850 | case ON_NOTHING: /* NOT in window at all. */ |
b8d4c8d0 GM |
851 | return Qnil; |
852 | @end group | |
853 | ||
7d2d7cc0 | 854 | ... |
b8d4c8d0 GM |
855 | |
856 | @group | |
7d2d7cc0 | 857 | case ON_MODE_LINE: /* In mode line of window. */ |
b8d4c8d0 GM |
858 | return Qmode_line; |
859 | @end group | |
860 | ||
7d2d7cc0 GM |
861 | ... |
862 | ||
b8d4c8d0 | 863 | @group |
7d2d7cc0 GM |
864 | case ON_SCROLL_BAR: /* On scroll-bar of window. */ |
865 | /* Historically we are supposed to return nil in this case. */ | |
866 | return Qnil; | |
b8d4c8d0 GM |
867 | @end group |
868 | ||
869 | @group | |
870 | default: | |
871 | abort (); | |
872 | @} | |
873 | @} | |
874 | @end group | |
875 | @end smallexample | |
876 | ||
877 | Note that C code cannot call functions by name unless they are defined | |
1df7defd | 878 | in C@. The way to call a function written in Lisp is to use |
b8d4c8d0 GM |
879 | @code{Ffuncall}, which embodies the Lisp function @code{funcall}. Since |
880 | the Lisp function @code{funcall} accepts an unlimited number of | |
881 | arguments, in C it takes two: the number of Lisp-level arguments, and a | |
882 | one-dimensional array containing their values. The first Lisp-level | |
883 | argument is the Lisp function to call, and the rest are the arguments to | |
884 | pass to it. Since @code{Ffuncall} can call the evaluator, you must | |
885 | protect pointers from garbage collection around the call to | |
886 | @code{Ffuncall}. | |
887 | ||
888 | The C functions @code{call0}, @code{call1}, @code{call2}, and so on, | |
889 | provide handy ways to call a Lisp function conveniently with a fixed | |
890 | number of arguments. They work by calling @code{Ffuncall}. | |
891 | ||
892 | @file{eval.c} is a very good file to look through for examples; | |
893 | @file{lisp.h} contains the definitions for some important macros and | |
894 | functions. | |
895 | ||
896 | If you define a function which is side-effect free, update the code | |
7d2d7cc0 | 897 | in @file{byte-opt.el} that binds @code{side-effect-free-fns} and |
b8d4c8d0 GM |
898 | @code{side-effect-and-error-free-fns} so that the compiler optimizer |
899 | knows about it. | |
900 | ||
901 | @node Object Internals | |
520f2425 | 902 | @section Object Internals |
b8d4c8d0 GM |
903 | @cindex object internals |
904 | ||
74934dcc | 905 | Emacs Lisp provides a rich set of the data types. Some of them, like cons |
f24f2e22 | 906 | cells, integers and strings, are common to nearly all Lisp dialects. Some |
74934dcc DA |
907 | others, like markers and buffers, are quite special and needed to provide |
908 | the basic support to write editor commands in Lisp. To implement such | |
909 | a variety of object types and provide an efficient way to pass objects between | |
910 | the subsystems of an interpreter, there is a set of C data structures and | |
911 | a special type to represent the pointers to all of them, which is known as | |
912 | @dfn{tagged pointer}. | |
913 | ||
914 | In C, the tagged pointer is an object of type @code{Lisp_Object}. Any | |
915 | initialized variable of such a type always holds the value of one of the | |
916 | following basic data types: integer, symbol, string, cons cell, float, | |
917 | vectorlike or miscellaneous object. Each of these data types has the | |
918 | corresponding tag value. All tags are enumerated by @code{enum Lisp_Type} | |
919 | and placed into a 3-bit bitfield of the @code{Lisp_Object}. The rest of the | |
39670ef4 | 920 | bits is the value itself. Integer values are immediate, i.e., directly |
74934dcc DA |
921 | represented by those @dfn{value bits}, and all other objects are represented |
922 | by the C pointers to a corresponding object allocated from the heap. Width | |
923 | of the @code{Lisp_Object} is platform- and configuration-dependent: usually | |
39670ef4 | 924 | it's equal to the width of an underlying platform pointer (i.e., 32-bit on |
74934dcc DA |
925 | a 32-bit machine and 64-bit on a 64-bit one), but also there is a special |
926 | configuration where @code{Lisp_Object} is 64-bit but all pointers are 32-bit. | |
927 | The latter trick was designed to overcome the limited range of values for | |
928 | Lisp integers on a 32-bit system by using 64-bit @code{long long} type for | |
929 | @code{Lisp_Object}. | |
930 | ||
931 | The following C data structures are defined in @file{lisp.h} to represent | |
932 | the basic data types beyond integers: | |
933 | ||
934 | @table @code | |
935 | @item struct Lisp_Cons | |
936 | Cons cell, an object used to construct lists. | |
937 | ||
938 | @item struct Lisp_String | |
939 | String, the basic object to represent a sequence of characters. | |
940 | ||
941 | @item struct Lisp_Vector | |
942 | Array, a fixed-size set of Lisp objects which may be accessed by an index. | |
943 | ||
944 | @item struct Lisp_Symbol | |
945 | Symbol, the unique-named entity commonly used as an identifier. | |
946 | ||
947 | @item struct Lisp_Float | |
948 | Floating point value. | |
949 | ||
950 | @item union Lisp_Misc | |
951 | Miscellaneous kinds of objects which don't fit into any of the above. | |
952 | @end table | |
953 | ||
954 | These types are the first-class citizens of an internal type system. | |
955 | Since the tag space is limited, all other types are the subtypes of either | |
956 | @code{Lisp_Vectorlike} or @code{Lisp_Misc}. Vector subtypes are enumerated | |
957 | by @code{enum pvec_type}, and nearly all complex objects like windows, buffers, | |
958 | frames, and processes fall into this category. The rest of special types, | |
959 | including markers and overlays, are enumerated by @code{enum Lisp_Misc_Type} | |
960 | and form the set of subtypes of @code{Lisp_Misc}. | |
961 | ||
962 | Below there is a description of a few subtypes of @code{Lisp_Vectorlike}. | |
963 | Buffer object represents the text to display and edit. Window is the part | |
964 | of display structure which shows the buffer or used as a container to | |
965 | recursively place other windows on the same frame. (Do not confuse Emacs Lisp | |
966 | window object with the window as an entity managed by the user interface | |
967 | system like X; in Emacs terminology, the latter is called frame.) Finally, | |
968 | process object is used to manage the subprocesses. | |
b8d4c8d0 GM |
969 | |
970 | @menu | |
971 | * Buffer Internals:: Components of a buffer structure. | |
972 | * Window Internals:: Components of a window structure. | |
973 | * Process Internals:: Components of a process structure. | |
974 | @end menu | |
975 | ||
976 | @node Buffer Internals | |
520f2425 | 977 | @subsection Buffer Internals |
b8d4c8d0 GM |
978 | @cindex internals, of buffer |
979 | @cindex buffer internals | |
980 | ||
7d2d7cc0 | 981 | Two structures (see @file{buffer.h}) are used to represent buffers |
1df7defd | 982 | in C@. The @code{buffer_text} structure contains fields describing the |
7d2d7cc0 GM |
983 | text of a buffer; the @code{buffer} structure holds other fields. In |
984 | the case of indirect buffers, two or more @code{buffer} structures | |
985 | reference the same @code{buffer_text} structure. | |
b8d4c8d0 | 986 | |
c773345a | 987 | Here are some of the fields in @code{struct buffer_text}: |
b8d4c8d0 GM |
988 | |
989 | @table @code | |
990 | @item beg | |
c773345a | 991 | The address of the buffer contents. |
b8d4c8d0 GM |
992 | |
993 | @item gpt | |
c773345a CY |
994 | @itemx gpt_byte |
995 | The character and byte positions of the buffer gap. @xref{Buffer | |
996 | Gap}. | |
b8d4c8d0 GM |
997 | |
998 | @item z | |
c773345a CY |
999 | @itemx z_byte |
1000 | The character and byte positions of the end of the buffer text. | |
b8d4c8d0 GM |
1001 | |
1002 | @item gap_size | |
c773345a | 1003 | The size of buffer's gap. @xref{Buffer Gap}. |
b8d4c8d0 GM |
1004 | |
1005 | @item modiff | |
c773345a CY |
1006 | @itemx save_modiff |
1007 | @itemx chars_modiff | |
1008 | @itemx overlay_modiff | |
1009 | These fields count the number of buffer-modification events performed | |
1010 | in this buffer. @code{modiff} is incremented after each | |
1011 | buffer-modification event, and is never otherwise changed; | |
1012 | @code{save_modiff} contains the value of @code{modiff} the last time | |
1013 | the buffer was visited or saved; @code{chars_modiff} counts only | |
1014 | modifications to the characters in the buffer, ignoring all other | |
1015 | kinds of changes; and @code{overlay_modiff} counts only modifications | |
1016 | to the overlays. | |
b8d4c8d0 GM |
1017 | |
1018 | @item beg_unchanged | |
c773345a CY |
1019 | @itemx end_unchanged |
1020 | The number of characters at the start and end of the text that are | |
1021 | known to be unchanged since the last complete redisplay. | |
b8d4c8d0 GM |
1022 | |
1023 | @item unchanged_modified | |
c773345a CY |
1024 | @itemx overlay_unchanged_modified |
1025 | The values of @code{modiff} and @code{overlay_modiff}, respectively, | |
da6062e6 | 1026 | after the last complete redisplay. If their current values match |
c773345a | 1027 | @code{modiff} or @code{overlay_modiff}, that means |
b8d4c8d0 GM |
1028 | @code{beg_unchanged} and @code{end_unchanged} contain no useful |
1029 | information. | |
1030 | ||
1031 | @item markers | |
1032 | The markers that refer to this buffer. This is actually a single | |
1033 | marker, and successive elements in its marker @code{chain} are the other | |
1034 | markers referring to this buffer text. | |
1035 | ||
1036 | @item intervals | |
c773345a | 1037 | The interval tree which records the text properties of this buffer. |
b8d4c8d0 GM |
1038 | @end table |
1039 | ||
c773345a | 1040 | Some of the fields of @code{struct buffer} are: |
b8d4c8d0 GM |
1041 | |
1042 | @table @code | |
7d2d7cc0 | 1043 | @item header |
74934dcc DA |
1044 | A header of type @code{struct vectorlike_header} is common to all |
1045 | vectorlike objects. | |
b8d4c8d0 GM |
1046 | |
1047 | @item own_text | |
c773345a CY |
1048 | A @code{struct buffer_text} structure that ordinarily holds the buffer |
1049 | contents. In indirect buffers, this field is not used. | |
b8d4c8d0 GM |
1050 | |
1051 | @item text | |
c773345a CY |
1052 | A pointer to the @code{buffer_text} structure for this buffer. In an |
1053 | ordinary buffer, this is the @code{own_text} field above. In an | |
1054 | indirect buffer, this is the @code{own_text} field of the base buffer. | |
b8d4c8d0 | 1055 | |
74934dcc DA |
1056 | @item next |
1057 | A pointer to the next buffer, in the chain of all buffers, including | |
1058 | killed buffers. This chain is used only for allocation and garbage | |
1059 | collection, in order to collect killed buffers properly. | |
1060 | ||
b8d4c8d0 | 1061 | @item pt |
c773345a CY |
1062 | @itemx pt_byte |
1063 | The character and byte positions of point in a buffer. | |
b8d4c8d0 GM |
1064 | |
1065 | @item begv | |
c773345a CY |
1066 | @itemx begv_byte |
1067 | The character and byte positions of the beginning of the accessible | |
1068 | range of text in the buffer. | |
b8d4c8d0 GM |
1069 | |
1070 | @item zv | |
c773345a CY |
1071 | @itemx zv_byte |
1072 | The character and byte positions of the end of the accessible range of | |
1073 | text in the buffer. | |
b8d4c8d0 GM |
1074 | |
1075 | @item base_buffer | |
1076 | In an indirect buffer, this points to the base buffer. In an ordinary | |
1077 | buffer, it is null. | |
1078 | ||
c773345a CY |
1079 | @item local_flags |
1080 | This field contains flags indicating that certain variables are local | |
1081 | in this buffer. Such variables are declared in the C code using | |
1082 | @code{DEFVAR_PER_BUFFER}, and their buffer-local bindings are stored | |
1083 | in fields in the buffer structure itself. (Some of these fields are | |
b8d4c8d0 GM |
1084 | described in this table.) |
1085 | ||
1086 | @item modtime | |
c773345a CY |
1087 | The modification time of the visited file. It is set when the file is |
1088 | written or read. Before writing the buffer into a file, this field is | |
1089 | compared to the modification time of the file to see if the file has | |
1090 | changed on disk. @xref{Buffer Modification}. | |
b8d4c8d0 GM |
1091 | |
1092 | @item auto_save_modified | |
c773345a | 1093 | The time when the buffer was last auto-saved. |
b8d4c8d0 GM |
1094 | |
1095 | @item last_window_start | |
c773345a CY |
1096 | The @code{window-start} position in the buffer as of the last time the |
1097 | buffer was displayed in a window. | |
b8d4c8d0 GM |
1098 | |
1099 | @item clip_changed | |
c773345a CY |
1100 | This flag indicates that narrowing has changed in the buffer. |
1101 | @xref{Narrowing}. | |
b8d4c8d0 GM |
1102 | |
1103 | @item prevent_redisplay_optimizations_p | |
c773345a CY |
1104 | This flag indicates that redisplay optimizations should not be used to |
1105 | display this buffer. | |
b8d4c8d0 | 1106 | |
c773345a CY |
1107 | @item overlay_center |
1108 | This field holds the current overlay center position. @xref{Managing | |
1109 | Overlays}. | |
1110 | ||
1111 | @item overlays_before | |
1112 | @itemx overlays_after | |
1113 | These fields hold, respectively, a list of overlays that end at or | |
1114 | before the current overlay center, and a list of overlays that end | |
1115 | after the current overlay center. @xref{Managing Overlays}. | |
1116 | @code{overlays_before} is sorted in order of decreasing end position, | |
1117 | and @code{overlays_after} is sorted in order of increasing beginning | |
1118 | position. | |
b8d4c8d0 | 1119 | |
7d2d7cc0 GM |
1120 | @c FIXME? the following are now all Lisp_Object BUFFER_INTERNAL_FIELD (foo). |
1121 | ||
b8d4c8d0 | 1122 | @item name |
c773345a CY |
1123 | A Lisp string that names the buffer. It is guaranteed to be unique. |
1124 | @xref{Buffer Names}. | |
b8d4c8d0 | 1125 | |
c773345a CY |
1126 | @item save_length |
1127 | The length of the file this buffer is visiting, when last read or | |
1128 | saved. This and other fields concerned with saving are not kept in | |
1129 | the @code{buffer_text} structure because indirect buffers are never | |
1130 | saved. | |
b8d4c8d0 GM |
1131 | |
1132 | @item directory | |
c773345a CY |
1133 | The directory for expanding relative file names. This is the value of |
1134 | the buffer-local variable @code{default-directory} (@pxref{File Name Expansion}). | |
b8d4c8d0 | 1135 | |
c773345a CY |
1136 | @item filename |
1137 | The name of the file visited in this buffer, or @code{nil}. This is | |
1138 | the value of the buffer-local variable @code{buffer-file-name} | |
1139 | (@pxref{Buffer File Name}). | |
b8d4c8d0 | 1140 | |
c773345a CY |
1141 | @item undo_list |
1142 | @itemx backed_up | |
1143 | @itemx auto_save_file_name | |
7d2d7cc0 | 1144 | @itemx auto_save_file_format |
c773345a CY |
1145 | @itemx read_only |
1146 | @itemx file_format | |
1147 | @itemx file_truename | |
1148 | @itemx invisibility_spec | |
1149 | @itemx display_count | |
1150 | @itemx display_time | |
1151 | These fields store the values of Lisp variables that are automatically | |
1152 | buffer-local (@pxref{Buffer-Local Variables}), whose corresponding | |
1153 | variable names have the additional prefix @code{buffer-} and have | |
1154 | underscores replaced with dashes. For instance, @code{undo_list} | |
29157371 | 1155 | stores the value of @code{buffer-undo-list}. |
b8d4c8d0 GM |
1156 | |
1157 | @item mark | |
c773345a CY |
1158 | The mark for the buffer. The mark is a marker, hence it is also |
1159 | included on the list @code{markers}. @xref{The Mark}. | |
b8d4c8d0 GM |
1160 | |
1161 | @item local_var_alist | |
c773345a CY |
1162 | The association list describing the buffer-local variable bindings of |
1163 | this buffer, not including the built-in buffer-local bindings that | |
1164 | have special slots in the buffer object. (Those slots are omitted | |
1165 | from this table.) @xref{Buffer-Local Variables}. | |
b8d4c8d0 GM |
1166 | |
1167 | @item major_mode | |
1168 | Symbol naming the major mode of this buffer, e.g., @code{lisp-mode}. | |
1169 | ||
1170 | @item mode_name | |
c773345a | 1171 | Pretty name of the major mode, e.g., @code{"Lisp"}. |
b8d4c8d0 GM |
1172 | |
1173 | @item keymap | |
c773345a CY |
1174 | @itemx abbrev_table |
1175 | @itemx syntax_table | |
1176 | @itemx category_table | |
1177 | @itemx display_table | |
1178 | These fields store the buffer's local keymap (@pxref{Keymaps}), abbrev | |
1179 | table (@pxref{Abbrev Tables}), syntax table (@pxref{Syntax Tables}), | |
1180 | category table (@pxref{Categories}), and display table (@pxref{Display | |
1181 | Tables}). | |
b8d4c8d0 GM |
1182 | |
1183 | @item downcase_table | |
c773345a CY |
1184 | @itemx upcase_table |
1185 | @itemx case_canon_table | |
1186 | These fields store the conversion tables for converting text to lower | |
1187 | case, upper case, and for canonicalizing text for case-fold search. | |
b8d4c8d0 GM |
1188 | @xref{Case Tables}. |
1189 | ||
b8d4c8d0 GM |
1190 | @item minor_modes |
1191 | An alist of the minor modes of this buffer. | |
1192 | ||
b8d4c8d0 | 1193 | @item pt_marker |
c773345a CY |
1194 | @itemx begv_marker |
1195 | @itemx zv_marker | |
1196 | These fields are only used in an indirect buffer, or in a buffer that | |
1197 | is the base of an indirect buffer. Each holds a marker that records | |
1198 | @code{pt}, @code{begv}, and @code{zv} respectively, for this buffer | |
b8d4c8d0 GM |
1199 | when the buffer is not current. |
1200 | ||
c773345a CY |
1201 | @item mode_line_format |
1202 | @itemx header_line_format | |
1203 | @itemx case_fold_search | |
1204 | @itemx tab_width | |
1205 | @itemx fill_column | |
1206 | @itemx left_margin | |
1207 | @itemx auto_fill_function | |
c773345a CY |
1208 | @itemx truncate_lines |
1209 | @itemx word_wrap | |
1210 | @itemx ctl_arrow | |
7d2d7cc0 GM |
1211 | @itemx bidi_display_reordering |
1212 | @itemx bidi_paragraph_direction | |
c773345a CY |
1213 | @itemx selective_display |
1214 | @itemx selective_display_ellipses | |
1215 | @itemx overwrite_mode | |
1216 | @itemx abbrev_mode | |
c773345a CY |
1217 | @itemx mark_active |
1218 | @itemx enable_multibyte_characters | |
1219 | @itemx buffer_file_coding_system | |
c773345a CY |
1220 | @itemx cache_long_line_scans |
1221 | @itemx point_before_scroll | |
1222 | @itemx left_fringe_width | |
1223 | @itemx right_fringe_width | |
1224 | @itemx fringes_outside_margins | |
1225 | @itemx scroll_bar_width | |
1226 | @itemx indicate_empty_lines | |
1227 | @itemx indicate_buffer_boundaries | |
1228 | @itemx fringe_indicator_alist | |
1229 | @itemx fringe_cursor_alist | |
1230 | @itemx scroll_up_aggressively | |
1231 | @itemx scroll_down_aggressively | |
1232 | @itemx cursor_type | |
1233 | @itemx cursor_in_non_selected_windows | |
1234 | These fields store the values of Lisp variables that are automatically | |
1235 | buffer-local (@pxref{Buffer-Local Variables}), whose corresponding | |
1236 | variable names have underscores replaced with dashes. For instance, | |
1237 | @code{mode_line_format} stores the value of @code{mode-line-format}. | |
b8d4c8d0 GM |
1238 | |
1239 | @item last_selected_window | |
1240 | This is the last window that was selected with this buffer in it, or @code{nil} | |
1241 | if that window no longer displays this buffer. | |
b8d4c8d0 GM |
1242 | @end table |
1243 | ||
1244 | @node Window Internals | |
520f2425 | 1245 | @subsection Window Internals |
b8d4c8d0 GM |
1246 | @cindex internals, of window |
1247 | @cindex window internals | |
1248 | ||
7d2d7cc0 GM |
1249 | The fields of a window (for a complete list, see the definition of |
1250 | @code{struct window} in @file{window.h}) include: | |
b8d4c8d0 GM |
1251 | |
1252 | @table @code | |
1253 | @item frame | |
1254 | The frame that this window is on. | |
1255 | ||
1256 | @item mini_p | |
1257 | Non-@code{nil} if this window is a minibuffer window. | |
1258 | ||
1259 | @item parent | |
1260 | Internally, Emacs arranges windows in a tree; each group of siblings has | |
1261 | a parent window whose area includes all the siblings. This field points | |
1262 | to a window's parent. | |
1263 | ||
1264 | Parent windows do not display buffers, and play little role in display | |
1265 | except to shape their child windows. Emacs Lisp programs usually have | |
1266 | no access to the parent windows; they operate on the windows at the | |
1267 | leaves of the tree, which actually display buffers. | |
1268 | ||
b8d4c8d0 | 1269 | @item hchild |
ee2d5b5e CY |
1270 | @itemx vchild |
1271 | These fields contain the window's leftmost child and its topmost child | |
1272 | respectively. @code{hchild} is used if the window is subdivided | |
1273 | horizontally by child windows, and @code{vchild} if it is subdivided | |
7d2d7cc0 | 1274 | vertically. In a live window, only one of @code{hchild}, @code{vchild}, |
1df7defd | 1275 | and @code{buffer} (q.v.@:) is non-@code{nil}. |
b8d4c8d0 GM |
1276 | |
1277 | @item next | |
ee2d5b5e CY |
1278 | @itemx prev |
1279 | The next sibling and previous sibling of this window. @code{next} is | |
520f2425 GM |
1280 | @code{nil} if the window is the right-most or bottom-most in its group; |
1281 | @code{prev} is @code{nil} if it is the left-most or top-most in its | |
ee2d5b5e CY |
1282 | group. |
1283 | ||
1284 | @item left_col | |
1285 | The left-hand edge of the window, measured in columns, relative to the | |
1286 | leftmost column in the frame (column 0). | |
1287 | ||
1288 | @item top_line | |
1289 | The top edge of the window, measured in lines, relative to the topmost | |
1290 | line in the frame (line 0). | |
1291 | ||
1292 | @item total_cols | |
1293 | @itemx total_lines | |
1294 | The width and height of the window, measured in columns and lines | |
1295 | respectively. The width includes the scroll bar and fringes, and/or | |
1296 | the separator line on the right of the window (if any). | |
b8d4c8d0 GM |
1297 | |
1298 | @item buffer | |
ee2d5b5e | 1299 | The buffer that the window is displaying. |
b8d4c8d0 GM |
1300 | |
1301 | @item start | |
ee2d5b5e CY |
1302 | A marker pointing to the position in the buffer that is the first |
1303 | character displayed in the window. | |
b8d4c8d0 GM |
1304 | |
1305 | @item pointm | |
1306 | @cindex window point internals | |
1307 | This is the value of point in the current buffer when this window is | |
1308 | selected; when it is not selected, it retains its previous value. | |
1309 | ||
1310 | @item force_start | |
1311 | If this flag is non-@code{nil}, it says that the window has been | |
1312 | scrolled explicitly by the Lisp program. This affects what the next | |
1313 | redisplay does if point is off the screen: instead of scrolling the | |
1314 | window to show the text around point, it moves point to a location that | |
1315 | is on the screen. | |
1316 | ||
1317 | @item frozen_window_start_p | |
1318 | This field is set temporarily to 1 to indicate to redisplay that | |
1319 | @code{start} of this window should not be changed, even if point | |
1320 | gets invisible. | |
1321 | ||
1322 | @item start_at_line_beg | |
1323 | Non-@code{nil} means current value of @code{start} was the beginning of a line | |
1324 | when it was chosen. | |
1325 | ||
b8d4c8d0 GM |
1326 | @item use_time |
1327 | This is the last time that the window was selected. The function | |
1328 | @code{get-lru-window} uses this field. | |
1329 | ||
1330 | @item sequence_number | |
1331 | A unique number assigned to this window when it was created. | |
1332 | ||
1333 | @item last_modified | |
1334 | The @code{modiff} field of the window's buffer, as of the last time | |
1335 | a redisplay completed in this window. | |
1336 | ||
1337 | @item last_overlay_modified | |
1338 | The @code{overlay_modiff} field of the window's buffer, as of the last | |
1339 | time a redisplay completed in this window. | |
1340 | ||
1341 | @item last_point | |
1342 | The buffer's value of point, as of the last time a redisplay completed | |
1343 | in this window. | |
1344 | ||
1345 | @item last_had_star | |
1346 | A non-@code{nil} value means the window's buffer was ``modified'' when the | |
1347 | window was last updated. | |
1348 | ||
1349 | @item vertical_scroll_bar | |
1350 | This window's vertical scroll bar. | |
1351 | ||
7d2d7cc0 GM |
1352 | @item left_margin_cols |
1353 | @itemx right_margin_cols | |
ee2d5b5e | 1354 | The widths of the left and right margins in this window. A value of |
7d2d7cc0 GM |
1355 | @code{nil} means no margin. |
1356 | ||
1357 | @item left_fringe_width | |
1358 | @itemx right_fringe_width | |
1359 | The widths of the left and right fringes in this window. A value of | |
1360 | @code{nil} or @code{t} means use the values of the frame. | |
1361 | ||
1362 | @item fringes_outside_margins | |
1363 | A non-@code{nil} value means the fringes outside the display margins; | |
1364 | othersize they are between the margin and the text. | |
b8d4c8d0 GM |
1365 | |
1366 | @item window_end_pos | |
1367 | This is computed as @code{z} minus the buffer position of the last glyph | |
1368 | in the current matrix of the window. The value is only valid if | |
1369 | @code{window_end_valid} is not @code{nil}. | |
1370 | ||
1371 | @item window_end_bytepos | |
1372 | The byte position corresponding to @code{window_end_pos}. | |
1373 | ||
1374 | @item window_end_vpos | |
1375 | The window-relative vertical position of the line containing | |
1376 | @code{window_end_pos}. | |
1377 | ||
1378 | @item window_end_valid | |
1379 | This field is set to a non-@code{nil} value if @code{window_end_pos} is truly | |
7d2d7cc0 | 1380 | valid. This is @code{nil} if nontrivial redisplay is pre-empted, since in that |
b8d4c8d0 GM |
1381 | case the display that @code{window_end_pos} was computed for did not get |
1382 | onto the screen. | |
1383 | ||
b8d4c8d0 GM |
1384 | @item cursor |
1385 | A structure describing where the cursor is in this window. | |
1386 | ||
1387 | @item last_cursor | |
1388 | The value of @code{cursor} as of the last redisplay that finished. | |
1389 | ||
1390 | @item phys_cursor | |
1391 | A structure describing where the cursor of this window physically is. | |
1392 | ||
1393 | @item phys_cursor_type | |
7d2d7cc0 GM |
1394 | @c FIXME What is this? |
1395 | @c itemx phys_cursor_ascent | |
1396 | @itemx phys_cursor_height | |
1397 | @itemx phys_cursor_width | |
1398 | The type, height, and width of the cursor that was last displayed on | |
1399 | this window. | |
b8d4c8d0 GM |
1400 | |
1401 | @item phys_cursor_on_p | |
1402 | This field is non-zero if the cursor is physically on. | |
1403 | ||
1404 | @item cursor_off_p | |
7d2d7cc0 GM |
1405 | Non-zero means the cursor in this window is logically off. This is |
1406 | used for blinking the cursor. | |
b8d4c8d0 GM |
1407 | |
1408 | @item last_cursor_off_p | |
1409 | This field contains the value of @code{cursor_off_p} as of the time of | |
1410 | the last redisplay. | |
1411 | ||
1412 | @item must_be_updated_p | |
1413 | This is set to 1 during redisplay when this window must be updated. | |
1414 | ||
1415 | @item hscroll | |
1416 | This is the number of columns that the display in the window is scrolled | |
1417 | horizontally to the left. Normally, this is 0. | |
1418 | ||
1419 | @item vscroll | |
1420 | Vertical scroll amount, in pixels. Normally, this is 0. | |
1421 | ||
1422 | @item dedicated | |
1423 | Non-@code{nil} if this window is dedicated to its buffer. | |
1424 | ||
1425 | @item display_table | |
1426 | The window's display table, or @code{nil} if none is specified for it. | |
1427 | ||
1428 | @item update_mode_line | |
1429 | Non-@code{nil} means this window's mode line needs to be updated. | |
1430 | ||
1431 | @item base_line_number | |
1432 | The line number of a certain position in the buffer, or @code{nil}. | |
1433 | This is used for displaying the line number of point in the mode line. | |
1434 | ||
1435 | @item base_line_pos | |
1436 | The position in the buffer for which the line number is known, or | |
7d2d7cc0 GM |
1437 | @code{nil} meaning none is known. If it is a buffer, don't display |
1438 | the line number as long as the window shows that buffer. | |
b8d4c8d0 GM |
1439 | |
1440 | @item region_showing | |
1441 | If the region (or part of it) is highlighted in this window, this field | |
1442 | holds the mark position that made one end of that region. Otherwise, | |
1443 | this field is @code{nil}. | |
1444 | ||
1445 | @item column_number_displayed | |
1446 | The column number currently displayed in this window's mode line, or @code{nil} | |
1447 | if column numbers are not being displayed. | |
1448 | ||
1449 | @item current_matrix | |
7d2d7cc0 GM |
1450 | @itemx desired_matrix |
1451 | Glyph matrices describing the current and desired display of this window. | |
b8d4c8d0 GM |
1452 | @end table |
1453 | ||
1454 | @node Process Internals | |
520f2425 | 1455 | @subsection Process Internals |
b8d4c8d0 GM |
1456 | @cindex internals, of process |
1457 | @cindex process internals | |
1458 | ||
7d2d7cc0 GM |
1459 | The fields of a process (for a complete list, see the definition of |
1460 | @code{struct Lisp_Process} in @file{process.h}) include: | |
b8d4c8d0 GM |
1461 | |
1462 | @table @code | |
1463 | @item name | |
1464 | A string, the name of the process. | |
1465 | ||
1466 | @item command | |
1467 | A list containing the command arguments that were used to start this | |
c73e02fa GM |
1468 | process. For a network or serial process, it is @code{nil} if the |
1469 | process is running or @code{t} if the process is stopped. | |
b8d4c8d0 GM |
1470 | |
1471 | @item filter | |
7d2d7cc0 GM |
1472 | If non-@code{nil}, a function used to accept output from the process |
1473 | instead of a buffer. | |
b8d4c8d0 GM |
1474 | |
1475 | @item sentinel | |
7d2d7cc0 GM |
1476 | If non-@code{nil}, a function called whenever the state of the process |
1477 | changes. | |
b8d4c8d0 GM |
1478 | |
1479 | @item buffer | |
1480 | The associated buffer of the process. | |
1481 | ||
1482 | @item pid | |
1483 | An integer, the operating system's process @acronym{ID}. | |
7d2d7cc0 | 1484 | Pseudo-processes such as network or serial connections use a value of 0. |
b8d4c8d0 GM |
1485 | |
1486 | @item childp | |
7d2d7cc0 GM |
1487 | A flag, @code{t} if this is really a child process. For a network or |
1488 | serial connection, it is a plist based on the arguments to | |
1489 | @code{make-network-process} or @code{make-serial-process}. | |
b8d4c8d0 GM |
1490 | |
1491 | @item mark | |
1492 | A marker indicating the position of the end of the last output from this | |
1493 | process inserted into the buffer. This is often but not always the end | |
1494 | of the buffer. | |
1495 | ||
1496 | @item kill_without_query | |
ee2d5b5e CY |
1497 | If this is non-zero, killing Emacs while this process is still running |
1498 | does not ask for confirmation about killing the process. | |
b8d4c8d0 | 1499 | |
7d2d7cc0 GM |
1500 | @item raw_status |
1501 | The raw process status, as returned by the @code{wait} system call. | |
b8d4c8d0 GM |
1502 | |
1503 | @item status | |
1504 | The process status, as @code{process-status} should return it. | |
1505 | ||
1506 | @item tick | |
1507 | @itemx update_tick | |
1508 | If these two fields are not equal, a change in the status of the process | |
1509 | needs to be reported, either by running the sentinel or by inserting a | |
1510 | message in the process buffer. | |
1511 | ||
1512 | @item pty_flag | |
a2eb421b | 1513 | Non-@code{nil} if communication with the subprocess uses a pty; |
b8d4c8d0 GM |
1514 | @code{nil} if it uses a pipe. |
1515 | ||
1516 | @item infd | |
1517 | The file descriptor for input from the process. | |
1518 | ||
1519 | @item outfd | |
1520 | The file descriptor for output to the process. | |
1521 | ||
b8d4c8d0 GM |
1522 | @item tty_name |
1523 | The name of the terminal that the subprocess is using, | |
1524 | or @code{nil} if it is using pipes. | |
1525 | ||
1526 | @item decode_coding_system | |
1527 | Coding-system for decoding the input from this process. | |
1528 | ||
1529 | @item decoding_buf | |
1530 | A working buffer for decoding. | |
1531 | ||
1532 | @item decoding_carryover | |
1533 | Size of carryover in decoding. | |
1534 | ||
1535 | @item encode_coding_system | |
1536 | Coding-system for encoding the output to this process. | |
1537 | ||
1538 | @item encoding_buf | |
1539 | A working buffer for encoding. | |
1540 | ||
b8d4c8d0 GM |
1541 | @item inherit_coding_system_flag |
1542 | Flag to set @code{coding-system} of the process buffer from the | |
1543 | coding system used to decode process output. | |
c73e02fa GM |
1544 | |
1545 | @item type | |
1546 | Symbol indicating the type of process: @code{real}, @code{network}, | |
7d2d7cc0 | 1547 | @code{serial}. |
c73e02fa | 1548 | |
b8d4c8d0 | 1549 | @end table |
7d2d7cc0 | 1550 | |
d92d9c95 PE |
1551 | @node C Integer Types |
1552 | @section C Integer Types | |
1553 | @cindex integer types (C programming language) | |
1554 | ||
1555 | Here are some guidelines for use of integer types in the Emacs C | |
1556 | source code. These guidelines sometimes give competing advice; common | |
1557 | sense is advised. | |
1558 | ||
1559 | @itemize @bullet | |
1560 | @item | |
1561 | Avoid arbitrary limits. For example, avoid @code{int len = strlen | |
1562 | (s);} unless the length of @code{s} is required for other reasons to | |
1563 | fit in @code{int} range. | |
1564 | ||
1565 | @item | |
1566 | Do not assume that signed integer arithmetic wraps around on overflow. | |
1567 | This is no longer true of Emacs porting targets: signed integer | |
1568 | overflow has undefined behavior in practice, and can dump core or | |
1569 | even cause earlier or later code to behave ``illogically''. Unsigned | |
1570 | overflow does wrap around reliably, modulo a power of two. | |
1571 | ||
1572 | @item | |
1573 | Prefer signed types to unsigned, as code gets confusing when signed | |
1574 | and unsigned types are combined. Many other guidelines assume that | |
1575 | types are signed; in the rarer cases where unsigned types are needed, | |
1576 | similar advice may apply to the unsigned counterparts (e.g., | |
1577 | @code{size_t} instead of @code{ptrdiff_t}, or @code{uintptr_t} instead | |
1578 | of @code{intptr_t}). | |
1579 | ||
1580 | @item | |
1581 | Prefer @code{int} for Emacs character codes, in the range 0 ..@: 0x3FFFFF. | |
1582 | ||
1583 | @item | |
1584 | Prefer @code{ptrdiff_t} for sizes, i.e., for integers bounded by the | |
1585 | maximum size of any individual C object or by the maximum number of | |
1586 | elements in any C array. This is part of Emacs's general preference | |
1587 | for signed types. Using @code{ptrdiff_t} limits objects to | |
1588 | @code{PTRDIFF_MAX} bytes, but larger objects would cause trouble | |
1589 | anyway since they would break pointer subtraction, so this does not | |
1590 | impose an arbitrary limit. | |
1591 | ||
1592 | @item | |
1593 | Prefer @code{intptr_t} for internal representations of pointers, or | |
1594 | for integers bounded only by the number of objects that can exist at | |
1595 | any given time or by the total number of bytes that can be allocated. | |
1596 | Currently Emacs sometimes uses other types when @code{intptr_t} would | |
1597 | be better; fixing this is lower priority, as the code works as-is on | |
1598 | Emacs's current porting targets. | |
1599 | ||
1600 | @item | |
1601 | Prefer the Emacs-defined type @code{EMACS_INT} for representing values | |
1602 | converted to or from Emacs Lisp fixnums, as fixnum arithmetic is based | |
1603 | on @code{EMACS_INT}. | |
1604 | ||
1605 | @item | |
1606 | When representing a system value (such as a file size or a count of | |
1607 | seconds since the Epoch), prefer the corresponding system type (e.g., | |
1608 | @code{off_t}, @code{time_t}). Do not assume that a system type is | |
1609 | signed, unless this assumption is known to be safe. For example, | |
1610 | although @code{off_t} is always signed, @code{time_t} need not be. | |
1611 | ||
1612 | @item | |
1613 | Prefer the Emacs-defined type @code{printmax_t} for representing | |
1614 | values that might be any signed integer value that can be printed, | |
1615 | using a @code{printf}-family function. | |
1616 | ||
1617 | @item | |
1618 | Prefer @code{intmax_t} for representing values that might be any | |
1619 | signed integer value. | |
1620 | ||
1621 | @item | |
1622 | In bitfields, prefer @code{unsigned int} or @code{signed int} to | |
1623 | @code{int}, as @code{int} is less portable: it might be signed, and | |
1624 | might not be. Single-bit bit fields are invariably @code{unsigned | |
1625 | int} so that their values are 0 and 1. | |
1626 | ||
1627 | @item | |
1628 | In C, Emacs commonly uses @code{bool}, 1, and 0 for boolean values. | |
1629 | Using @code{bool} for booleans can make programs easier to read and a | |
1630 | bit faster than using @code{int}. Although it is also OK to use | |
1631 | @code{int}, this older style is gradually being phased out. When | |
1632 | using @code{bool}, respect the limitations of the replacement | |
1633 | implementation of @code{bool}, as documented in the source file | |
1634 | @file{lib/stdbool.in.h}, so that Emacs remains portable to pre-C99 | |
1635 | platforms. | |
1636 | @end itemize | |
1637 | ||
7d2d7cc0 | 1638 | @c FIXME Mention src/globals.h somewhere in this file? |