Convert consecutive FSF copyright years to ranges.
[bpt/emacs.git] / doc / lispref / processes.texi
CommitLineData
b8d4c8d0
GM
1@c -*-texinfo-*-
2@c This is part of the GNU Emacs Lisp Reference Manual.
73b0cd50 3@c Copyright (C) 1990-1995, 1998-1999, 2001-2011
d24880de 4@c Free Software Foundation, Inc.
b8d4c8d0 5@c See the file elisp.texi for copying conditions.
6336d8c3 6@setfilename ../../info/processes
b8d4c8d0
GM
7@node Processes, Display, Abbrevs, Top
8@chapter Processes
9@cindex child process
10@cindex parent process
11@cindex subprocess
12@cindex process
13
14 In the terminology of operating systems, a @dfn{process} is a space in
15which a program can execute. Emacs runs in a process. Emacs Lisp
16programs can invoke other programs in processes of their own. These are
17called @dfn{subprocesses} or @dfn{child processes} of the Emacs process,
18which is their @dfn{parent process}.
19
20 A subprocess of Emacs may be @dfn{synchronous} or @dfn{asynchronous},
21depending on how it is created. When you create a synchronous
22subprocess, the Lisp program waits for the subprocess to terminate
23before continuing execution. When you create an asynchronous
24subprocess, it can run in parallel with the Lisp program. This kind of
25subprocess is represented within Emacs by a Lisp object which is also
26called a ``process.'' Lisp programs can use this object to communicate
27with the subprocess or to control it. For example, you can send
28signals, obtain status information, receive output from the process, or
29send input to it.
30
31@defun processp object
23dd4ecd
EZ
32This function returns @code{t} if @var{object} represents an Emacs
33subprocess, @code{nil} otherwise.
b8d4c8d0
GM
34@end defun
35
23dd4ecd
EZ
36 In addition to subprocesses of the current Emacs session, you can
37also access other processes running on your machine. @xref{System
38Processes}.
39
b8d4c8d0
GM
40@menu
41* Subprocess Creation:: Functions that start subprocesses.
42* Shell Arguments:: Quoting an argument to pass it to a shell.
43* Synchronous Processes:: Details of using synchronous subprocesses.
44* Asynchronous Processes:: Starting up an asynchronous subprocess.
45* Deleting Processes:: Eliminating an asynchronous subprocess.
46* Process Information:: Accessing run-status and other attributes.
47* Input to Processes:: Sending input to an asynchronous subprocess.
48* Signals to Processes:: Stopping, continuing or interrupting
49 an asynchronous subprocess.
50* Output from Processes:: Collecting output from an asynchronous subprocess.
51* Sentinels:: Sentinels run when process run-status changes.
52* Query Before Exit:: Whether to query if exiting will kill a process.
23dd4ecd 53* System Processes:: Accessing other processes running on your system.
d24880de 54* Transaction Queues:: Transaction-based communication with subprocesses.
b8d4c8d0
GM
55* Network:: Opening network connections.
56* Network Servers:: Network servers let Emacs accept net connections.
57* Datagrams:: UDP network connections.
58* Low-Level Network:: Lower-level but more general function
59 to create connections and servers.
d24880de 60* Misc Network:: Additional relevant functions for net connections.
c73e02fa 61* Serial Ports:: Communicating with serial ports.
b8d4c8d0
GM
62* Byte Packing:: Using bindat to pack and unpack binary data.
63@end menu
64
65@node Subprocess Creation
66@section Functions that Create Subprocesses
67
583d8b3c 68 There are three primitives that create a new subprocess in which to run
b8d4c8d0
GM
69a program. One of them, @code{start-process}, creates an asynchronous
70process and returns a process object (@pxref{Asynchronous Processes}).
71The other two, @code{call-process} and @code{call-process-region},
72create a synchronous process and do not return a process object
73(@pxref{Synchronous Processes}).
74
75 Synchronous and asynchronous processes are explained in the following
76sections. Since the three functions are all called in a similar
77fashion, their common arguments are described here.
78
79@cindex execute program
80@cindex @code{PATH} environment variable
81@cindex @code{HOME} environment variable
82 In all cases, the function's @var{program} argument specifies the
83program to be run. An error is signaled if the file is not found or
84cannot be executed. If the file name is relative, the variable
85@code{exec-path} contains a list of directories to search. Emacs
86initializes @code{exec-path} when it starts up, based on the value of
87the environment variable @code{PATH}. The standard file name
88constructs, @samp{~}, @samp{.}, and @samp{..}, are interpreted as
89usual in @code{exec-path}, but environment variable substitutions
90(@samp{$HOME}, etc.) are not recognized; use
91@code{substitute-in-file-name} to perform them (@pxref{File Name
92Expansion}). @code{nil} in this list refers to
93@code{default-directory}.
94
95 Executing a program can also try adding suffixes to the specified
96name:
97
98@defvar exec-suffixes
99This variable is a list of suffixes (strings) to try adding to the
100specified program file name. The list should include @code{""} if you
101want the name to be tried exactly as specified. The default value is
102system-dependent.
103@end defvar
104
105 @strong{Please note:} The argument @var{program} contains only the
106name of the program; it may not contain any command-line arguments. You
107must use @var{args} to provide those.
108
109 Each of the subprocess-creating functions has a @var{buffer-or-name}
110argument which specifies where the standard output from the program will
111go. It should be a buffer or a buffer name; if it is a buffer name,
112that will create the buffer if it does not already exist. It can also
113be @code{nil}, which says to discard the output unless a filter function
114handles it. (@xref{Filter Functions}, and @ref{Read and Print}.)
115Normally, you should avoid having multiple processes send output to the
116same buffer because their output would be intermixed randomly.
117
118@cindex program arguments
119 All three of the subprocess-creating functions have a @code{&rest}
120argument, @var{args}. The @var{args} must all be strings, and they are
121supplied to @var{program} as separate command line arguments. Wildcard
122characters and other shell constructs have no special meanings in these
123strings, since the strings are passed directly to the specified program.
124
125 The subprocess gets its current directory from the value of
126@code{default-directory} (@pxref{File Name Expansion}).
127
128@cindex environment variables, subprocesses
129 The subprocess inherits its environment from Emacs, but you can
130specify overrides for it with @code{process-environment}. @xref{System
131Environment}.
132
133@defvar exec-directory
134@pindex movemail
135The value of this variable is a string, the name of a directory that
136contains programs that come with GNU Emacs, programs intended for Emacs
137to invoke. The program @code{movemail} is an example of such a program;
138Rmail uses it to fetch new mail from an inbox.
139@end defvar
140
141@defopt exec-path
142The value of this variable is a list of directories to search for
143programs to run in subprocesses. Each element is either the name of a
144directory (i.e., a string), or @code{nil}, which stands for the default
145directory (which is the value of @code{default-directory}).
146@cindex program directories
147
148The value of @code{exec-path} is used by @code{call-process} and
149@code{start-process} when the @var{program} argument is not an absolute
150file name.
151@end defopt
152
153@node Shell Arguments
154@section Shell Arguments
155@cindex arguments for shell commands
156@cindex shell command arguments
157
158 Lisp programs sometimes need to run a shell and give it a command
159that contains file names that were specified by the user. These
160programs ought to be able to support any valid file name. But the shell
161gives special treatment to certain characters, and if these characters
162occur in the file name, they will confuse the shell. To handle these
163characters, use the function @code{shell-quote-argument}:
164
165@defun shell-quote-argument argument
166This function returns a string which represents, in shell syntax,
167an argument whose actual contents are @var{argument}. It should
168work reliably to concatenate the return value into a shell command
169and then pass it to a shell for execution.
170
171Precisely what this function does depends on your operating system. The
172function is designed to work with the syntax of your system's standard
173shell; if you use an unusual shell, you will need to redefine this
174function.
175
176@example
177;; @r{This example shows the behavior on GNU and Unix systems.}
178(shell-quote-argument "foo > bar")
179 @result{} "foo\\ \\>\\ bar"
180
181;; @r{This example shows the behavior on MS-DOS and MS-Windows.}
182(shell-quote-argument "foo > bar")
183 @result{} "\"foo > bar\""
184@end example
185
186Here's an example of using @code{shell-quote-argument} to construct
187a shell command:
188
189@example
190(concat "diff -c "
191 (shell-quote-argument oldfile)
192 " "
193 (shell-quote-argument newfile))
194@end example
195@end defun
196
3c73e30e
EZ
197@cindex quoting and unquoting command-line arguments
198@cindex minibuffer input, and command-line arguments
199@cindex @code{call-process}, command-line arguments from minibuffer
200@cindex @code{start-process}, command-line arguments from minibuffer
4bb49a92
EZ
201 The following two functions are useful for combining a list of
202individual command-line argument strings into a single string, and
203taking a string apart into a list of individual command-line
204arguments. These functions are mainly intended to be used for
205converting user input in the minibuffer, a Lisp string, into a list of
206string arguments to be passed to @code{call-process} or
3c73e30e
EZ
207@code{start-process}, or for the converting such lists of arguments in
208a single Lisp string to be presented in the minibuffer or echo area.
a873ee3d
EZ
209
210@defun split-string-and-unquote string &optional separators
211This function splits @var{string} into substrings at matches for the
212regular expression @var{separators}, like @code{split-string} does
0999039c
CY
213(@pxref{Creating Strings}); in addition, it removes quoting from the
214substrings. It then makes a list of the substrings and returns it.
a873ee3d 215
4185820c
CY
216If @var{separators} is omitted or @code{nil}, it defaults to
217@code{"\\s-+"}, which is a regular expression that matches one or more
218characters with whitespace syntax (@pxref{Syntax Class Table}).
a873ee3d 219
3c73e30e 220This function supports two types of quoting: enclosing a whole string
0999039c
CY
221in double quotes @code{"@dots{}"}, and quoting individual characters
222with a backslash escape @samp{\}. The latter is also used in Lisp
223strings, so this function can handle those as well.
a873ee3d
EZ
224@end defun
225
226@defun combine-and-quote-strings list-of-strings &optional separator
227This function concatenates @var{list-of-strings} into a single string,
0999039c
CY
228quoting each string as necessary. It also sticks the @var{separator}
229string between each pair of strings; if @var{separator} is omitted or
230@code{nil}, it defaults to @code{" "}. The return value is the
231resulting string.
a873ee3d
EZ
232
233The strings in @var{list-of-strings} that need quoting are those that
234include @var{separator} as their substring. Quoting a string encloses
235it in double quotes @code{"@dots{}"}. In the simplest case, if you
4bb49a92
EZ
236are consing a command from the individual command-line arguments,
237every argument that includes embedded blanks will be quoted.
a873ee3d
EZ
238@end defun
239
b8d4c8d0
GM
240@node Synchronous Processes
241@section Creating a Synchronous Process
242@cindex synchronous subprocess
243
244 After a @dfn{synchronous process} is created, Emacs waits for the
245process to terminate before continuing. Starting Dired on GNU or
246Unix@footnote{On other systems, Emacs uses a Lisp emulation of
247@code{ls}; see @ref{Contents of Directories}.} is an example of this: it
248runs @code{ls} in a synchronous process, then modifies the output
249slightly. Because the process is synchronous, the entire directory
250listing arrives in the buffer before Emacs tries to do anything with it.
251
252 While Emacs waits for the synchronous subprocess to terminate, the
253user can quit by typing @kbd{C-g}. The first @kbd{C-g} tries to kill
254the subprocess with a @code{SIGINT} signal; but it waits until the
255subprocess actually terminates before quitting. If during that time the
256user types another @kbd{C-g}, that kills the subprocess instantly with
257@code{SIGKILL} and quits immediately (except on MS-DOS, where killing
258other processes doesn't work). @xref{Quitting}.
259
260 The synchronous subprocess functions return an indication of how the
261process terminated.
262
263 The output from a synchronous subprocess is generally decoded using a
264coding system, much like text read from a file. The input sent to a
265subprocess by @code{call-process-region} is encoded using a coding
266system, much like text written into a file. @xref{Coding Systems}.
267
268@defun call-process program &optional infile destination display &rest args
269This function calls @var{program} in a separate process and waits for
270it to finish.
271
272The standard input for the process comes from file @var{infile} if
273@var{infile} is not @code{nil}, and from the null device otherwise.
274The argument @var{destination} says where to put the process output.
275Here are the possibilities:
276
277@table @asis
278@item a buffer
279Insert the output in that buffer, before point. This includes both the
280standard output stream and the standard error stream of the process.
281
282@item a string
283Insert the output in a buffer with that name, before point.
284
285@item @code{t}
286Insert the output in the current buffer, before point.
287
288@item @code{nil}
289Discard the output.
290
291@item 0
292Discard the output, and return @code{nil} immediately without waiting
293for the subprocess to finish.
294
295In this case, the process is not truly synchronous, since it can run in
296parallel with Emacs; but you can think of it as synchronous in that
297Emacs is essentially finished with the subprocess as soon as this
298function returns.
299
300MS-DOS doesn't support asynchronous subprocesses, so this option doesn't
301work there.
302
303@item @code{(@var{real-destination} @var{error-destination})}
304Keep the standard output stream separate from the standard error stream;
305deal with the ordinary output as specified by @var{real-destination},
306and dispose of the error output according to @var{error-destination}.
307If @var{error-destination} is @code{nil}, that means to discard the
308error output, @code{t} means mix it with the ordinary output, and a
309string specifies a file name to redirect error output into.
310
311You can't directly specify a buffer to put the error output in; that is
312too difficult to implement. But you can achieve this result by sending
313the error output to a temporary file and then inserting the file into a
314buffer.
315@end table
316
317If @var{display} is non-@code{nil}, then @code{call-process} redisplays
318the buffer as output is inserted. (However, if the coding system chosen
319for decoding output is @code{undecided}, meaning deduce the encoding
320from the actual data, then redisplay sometimes cannot continue once
321non-@acronym{ASCII} characters are encountered. There are fundamental
322reasons why it is hard to fix this; see @ref{Output from Processes}.)
323
324Otherwise the function @code{call-process} does no redisplay, and the
325results become visible on the screen only when Emacs redisplays that
326buffer in the normal course of events.
327
328The remaining arguments, @var{args}, are strings that specify command
329line arguments for the program.
330
331The value returned by @code{call-process} (unless you told it not to
332wait) indicates the reason for process termination. A number gives the
333exit status of the subprocess; 0 means success, and any other value
334means failure. If the process terminated with a signal,
335@code{call-process} returns a string describing the signal.
336
337In the examples below, the buffer @samp{foo} is current.
338
339@smallexample
340@group
341(call-process "pwd" nil t)
342 @result{} 0
343
344---------- Buffer: foo ----------
345/usr/user/lewis/manual
346---------- Buffer: foo ----------
347@end group
348
349@group
350(call-process "grep" nil "bar" nil "lewis" "/etc/passwd")
351 @result{} 0
352
353---------- Buffer: bar ----------
354lewis:5LTsHm66CSWKg:398:21:Bil Lewis:/user/lewis:/bin/csh
355
356---------- Buffer: bar ----------
357@end group
358@end smallexample
359
360Here is a good example of the use of @code{call-process}, which used to
361be found in the definition of @code{insert-directory}:
362
363@smallexample
364@group
365(call-process insert-directory-program nil t nil @var{switches}
366 (if full-directory-p
367 (concat (file-name-as-directory file) ".")
368 file))
369@end group
370@end smallexample
371@end defun
372
373@defun process-file program &optional infile buffer display &rest args
374This function processes files synchronously in a separate process. It
375is similar to @code{call-process} but may invoke a file handler based
376on the value of the variable @code{default-directory}. The current
377working directory of the subprocess is @code{default-directory}.
378
379The arguments are handled in almost the same way as for
380@code{call-process}, with the following differences:
381
382Some file handlers may not support all combinations and forms of the
383arguments @var{infile}, @var{buffer}, and @var{display}. For example,
384some file handlers might behave as if @var{display} were @code{nil},
385regardless of the value actually passed. As another example, some
386file handlers might not support separating standard output and error
387output by way of the @var{buffer} argument.
388
389If a file handler is invoked, it determines the program to run based
390on the first argument @var{program}. For instance, consider that a
391handler for remote files is invoked. Then the path that is used for
392searching the program might be different than @code{exec-path}.
393
394The second argument @var{infile} may invoke a file handler. The file
395handler could be different from the handler chosen for the
396@code{process-file} function itself. (For example,
397@code{default-directory} could be on a remote host, whereas
398@var{infile} is on another remote host. Or @code{default-directory}
399could be non-special, whereas @var{infile} is on a remote host.)
400
401If @var{buffer} is a list of the form @code{(@var{real-destination}
402@var{error-destination})}, and @var{error-destination} names a file,
403then the same remarks as for @var{infile} apply.
404
405The remaining arguments (@var{args}) will be passed to the process
406verbatim. Emacs is not involved in processing file names that are
407present in @var{args}. To avoid confusion, it may be best to avoid
408absolute file names in @var{args}, but rather to specify all file
409names as relative to @code{default-directory}. The function
410@code{file-relative-name} is useful for constructing such relative
411file names.
412@end defun
413
3691a0a6
MA
414@defvar process-file-side-effects
415This variable indicates, whether a call of @code{process-file} changes
416remote files.
417
418Per default, this variable is always set to @code{t}, meaning that a
419call of @code{process-file} could potentially change any file on a
420remote host. When set to @code{nil}, a file handler could optimize
421its behaviour with respect to remote file attributes caching.
422
423This variable should never be changed by @code{setq}. Instead of, it
424shall be set only by let-binding.
425@end defvar
426
b8d4c8d0
GM
427@defun call-process-region start end program &optional delete destination display &rest args
428This function sends the text from @var{start} to @var{end} as
429standard input to a process running @var{program}. It deletes the text
430sent if @var{delete} is non-@code{nil}; this is useful when
431@var{destination} is @code{t}, to insert the output in the current
432buffer in place of the input.
433
434The arguments @var{destination} and @var{display} control what to do
435with the output from the subprocess, and whether to update the display
436as it comes in. For details, see the description of
437@code{call-process}, above. If @var{destination} is the integer 0,
438@code{call-process-region} discards the output and returns @code{nil}
439immediately, without waiting for the subprocess to finish (this only
440works if asynchronous subprocesses are supported).
441
442The remaining arguments, @var{args}, are strings that specify command
443line arguments for the program.
444
445The return value of @code{call-process-region} is just like that of
446@code{call-process}: @code{nil} if you told it to return without
447waiting; otherwise, a number or string which indicates how the
448subprocess terminated.
449
450In the following example, we use @code{call-process-region} to run the
451@code{cat} utility, with standard input being the first five characters
452in buffer @samp{foo} (the word @samp{input}). @code{cat} copies its
453standard input into its standard output. Since the argument
454@var{destination} is @code{t}, this output is inserted in the current
455buffer.
456
457@smallexample
458@group
459---------- Buffer: foo ----------
460input@point{}
461---------- Buffer: foo ----------
462@end group
463
464@group
465(call-process-region 1 6 "cat" nil t)
466 @result{} 0
467
468---------- Buffer: foo ----------
469inputinput@point{}
470---------- Buffer: foo ----------
471@end group
472@end smallexample
473
474 The @code{shell-command-on-region} command uses
475@code{call-process-region} like this:
476
477@smallexample
478@group
479(call-process-region
480 start end
481 shell-file-name ; @r{Name of program.}
482 nil ; @r{Do not delete region.}
483 buffer ; @r{Send output to @code{buffer}.}
484 nil ; @r{No redisplay during output.}
485 "-c" command) ; @r{Arguments for the shell.}
486@end group
487@end smallexample
488@end defun
489
490@defun call-process-shell-command command &optional infile destination display &rest args
491This function executes the shell command @var{command} synchronously
492in a separate process. The final arguments @var{args} are additional
493arguments to add at the end of @var{command}. The other arguments
494are handled as in @code{call-process}.
495@end defun
496
497@defun process-file-shell-command command &optional infile destination display &rest args
498This function is like @code{call-process-shell-command}, but uses
499@code{process-file} internally. Depending on @code{default-directory},
500@var{command} can be executed also on remote hosts.
501@end defun
502
503@defun shell-command-to-string command
504This function executes @var{command} (a string) as a shell command,
505then returns the command's output as a string.
506@end defun
507
583d8b3c
EZ
508@defun process-lines program &rest args
509This function runs @var{program} in a separate process, waits for it
510to finish, and returns its output as a list of strings. Each string
511in the list holds a single line of text output by the program; the
512end-of-line characters are stripped from each line. The arguments
513beyond @var{program}, @var{args}, are strings that specify
514command-line arguments with which to run the program.
515
516If @var{program} exits with a non-zero exit status, this function
517signals an error.
518
519This function works by calling @code{call-process}, so program output
520is decoded in the same way as for @code{call-process}.
521@end defun
522
b8d4c8d0
GM
523@node Asynchronous Processes
524@section Creating an Asynchronous Process
525@cindex asynchronous subprocess
526
527 After an @dfn{asynchronous process} is created, Emacs and the subprocess
528both continue running immediately. The process thereafter runs
529in parallel with Emacs, and the two can communicate with each other
530using the functions described in the following sections. However,
531communication is only partially asynchronous: Emacs sends data to the
532process only when certain functions are called, and Emacs accepts data
533from the process only when Emacs is waiting for input or for a time
534delay.
535
536 Here we describe how to create an asynchronous process.
537
538@defun start-process name buffer-or-name program &rest args
539This function creates a new asynchronous subprocess and starts the
540program @var{program} running in it. It returns a process object that
541stands for the new subprocess in Lisp. The argument @var{name}
542specifies the name for the process object; if a process with this name
543already exists, then @var{name} is modified (by appending @samp{<1>},
544etc.) to be unique. The buffer @var{buffer-or-name} is the buffer to
545associate with the process.
546
547The remaining arguments, @var{args}, are strings that specify command
548line arguments for the program.
549
550In the example below, the first process is started and runs (rather,
551sleeps) for 100 seconds. Meanwhile, the second process is started, and
552given the name @samp{my-process<1>} for the sake of uniqueness. It
553inserts the directory listing at the end of the buffer @samp{foo},
554before the first process finishes. Then it finishes, and a message to
555that effect is inserted in the buffer. Much later, the first process
556finishes, and another message is inserted in the buffer for it.
557
558@smallexample
559@group
560(start-process "my-process" "foo" "sleep" "100")
561 @result{} #<process my-process>
562@end group
563
564@group
565(start-process "my-process" "foo" "ls" "-l" "/user/lewis/bin")
566 @result{} #<process my-process<1>>
567
568---------- Buffer: foo ----------
569total 2
570lrwxrwxrwx 1 lewis 14 Jul 22 10:12 gnuemacs --> /emacs
571-rwxrwxrwx 1 lewis 19 Jul 30 21:02 lemon
572
573Process my-process<1> finished
574
575Process my-process finished
576---------- Buffer: foo ----------
577@end group
578@end smallexample
579@end defun
580
581@defun start-file-process name buffer-or-name program &rest args
582Like @code{start-process}, this function starts a new asynchronous
583subprocess running @var{program} in it, and returns its process
584object---when @code{default-directory} is not a magic file name.
585
586If @code{default-directory} is magic, the function invokes its file
587handler instead. This handler ought to run @var{program}, perhaps on
588the local host, perhaps on a remote host that corresponds to
589@code{default-directory}. In the latter case, the local part of
590@code{default-directory} becomes the working directory of the process.
591
592This function does not try to invoke file name handlers for
593@var{program} or for the @var{program-args}.
594
595Depending on the implementation of the file handler, it might not be
596possible to apply @code{process-filter} or @code{process-sentinel} to
597the resulting process object (@pxref{Filter Functions}, @pxref{Sentinels}).
598
599Some file handlers may not support @code{start-file-process} (for
600example @code{ange-ftp-hook-function}). In such cases, the function
601does nothing and returns @code{nil}.
602@end defun
603
03a74b84 604@defun start-process-shell-command name buffer-or-name command
b8d4c8d0
GM
605This function is like @code{start-process} except that it uses a shell
606to execute the specified command. The argument @var{command} is a shell
03a74b84 607command name. The variable @code{shell-file-name} specifies which shell to
b8d4c8d0
GM
608use.
609
610The point of running a program through the shell, rather than directly
611with @code{start-process}, is so that you can employ shell features such
612as wildcards in the arguments. It follows that if you include an
613arbitrary user-specified arguments in the command, you should quote it
614with @code{shell-quote-argument} first, so that any special shell
615characters do @emph{not} have their special shell meanings. @xref{Shell
616Arguments}.
617@end defun
618
03a74b84 619@defun start-file-process-shell-command name buffer-or-name command
b8d4c8d0
GM
620This function is like @code{start-process-shell-command}, but uses
621@code{start-file-process} internally. By this, @var{command} can be
622executed also on remote hosts, depending on @code{default-directory}.
623@end defun
624
625@defvar process-connection-type
626@cindex pipes
627@cindex @acronym{PTY}s
628This variable controls the type of device used to communicate with
629asynchronous subprocesses. If it is non-@code{nil}, then @acronym{PTY}s are
630used, when available. Otherwise, pipes are used.
631
632@acronym{PTY}s are usually preferable for processes visible to the user, as
633in Shell mode, because they allow job control (@kbd{C-c}, @kbd{C-z},
634etc.) to work between the process and its children, whereas pipes do
635not. For subprocesses used for internal purposes by programs, it is
636often better to use a pipe, because they are more efficient. In
637addition, the total number of @acronym{PTY}s is limited on many systems and
638it is good not to waste them.
639
640The value of @code{process-connection-type} takes effect when
641@code{start-process} is called. So you can specify how to communicate
642with one subprocess by binding the variable around the call to
643@code{start-process}.
644
645@smallexample
646@group
647(let ((process-connection-type nil)) ; @r{Use a pipe.}
648 (start-process @dots{}))
649@end group
650@end smallexample
651
652To determine whether a given subprocess actually got a pipe or a
653@acronym{PTY}, use the function @code{process-tty-name} (@pxref{Process
654Information}).
655@end defvar
656
657@node Deleting Processes
658@section Deleting Processes
659@cindex deleting processes
660
661 @dfn{Deleting a process} disconnects Emacs immediately from the
662subprocess. Processes are deleted automatically after they terminate,
663but not necessarily right away. You can delete a process explicitly
664at any time. If you delete a terminated process explicitly before it
665is deleted automatically, no harm results. Deleting a running
666process sends a signal to terminate it (and its child processes if
667any), and calls the process sentinel if it has one. @xref{Sentinels}.
668
669 When a process is deleted, the process object itself continues to
670exist as long as other Lisp objects point to it. All the Lisp
671primitives that work on process objects accept deleted processes, but
672those that do I/O or send signals will report an error. The process
673mark continues to point to the same place as before, usually into a
674buffer where output from the process was being inserted.
675
676@defopt delete-exited-processes
677This variable controls automatic deletion of processes that have
678terminated (due to calling @code{exit} or to a signal). If it is
679@code{nil}, then they continue to exist until the user runs
680@code{list-processes}. Otherwise, they are deleted immediately after
681they exit.
682@end defopt
683
684@defun delete-process process
685This function deletes a process, killing it with a @code{SIGKILL}
686signal. The argument may be a process, the name of a process, a
687buffer, or the name of a buffer. (A buffer or buffer-name stands for
688the process that @code{get-buffer-process} returns.) Calling
689@code{delete-process} on a running process terminates it, updates the
690process status, and runs the sentinel (if any) immediately. If the
691process has already terminated, calling @code{delete-process} has no
692effect on its status, or on the running of its sentinel (which will
693happen sooner or later).
694
695@smallexample
696@group
697(delete-process "*shell*")
698 @result{} nil
699@end group
700@end smallexample
701@end defun
702
703@node Process Information
704@section Process Information
705
706 Several functions return information about processes.
707@code{list-processes} is provided for interactive use.
708
709@deffn Command list-processes &optional query-only
710This command displays a listing of all living processes. In addition,
711it finally deletes any process whose status was @samp{Exited} or
712@samp{Signaled}. It returns @code{nil}.
713
714If @var{query-only} is non-@code{nil} then it lists only processes
715whose query flag is non-@code{nil}. @xref{Query Before Exit}.
716@end deffn
717
718@defun process-list
719This function returns a list of all processes that have not been deleted.
720
721@smallexample
722@group
723(process-list)
724 @result{} (#<process display-time> #<process shell>)
725@end group
726@end smallexample
727@end defun
728
729@defun get-process name
730This function returns the process named @var{name}, or @code{nil} if
731there is none. An error is signaled if @var{name} is not a string.
732
733@smallexample
734@group
735(get-process "shell")
736 @result{} #<process shell>
737@end group
738@end smallexample
739@end defun
740
741@defun process-command process
742This function returns the command that was executed to start
743@var{process}. This is a list of strings, the first string being the
744program executed and the rest of the strings being the arguments that
745were given to the program.
746
747@smallexample
748@group
749(process-command (get-process "shell"))
750 @result{} ("/bin/csh" "-i")
751@end group
752@end smallexample
753@end defun
754
c73e02fa
GM
755@defun process-contact process &optional key
756
757This function returns information about how a network or serial
758process was set up. For a network process, when @var{key} is
759@code{nil}, it returns @code{(@var{hostname} @var{service})} which
760specifies what you connected to. For a serial process, when @var{key}
761is @code{nil}, it returns @code{(@var{port} @var{speed})}. For an
762ordinary child process, this function always returns @code{t}.
763
764If @var{key} is @code{t}, the value is the complete status information
765for the connection, server, or serial port; that is, the list of
766keywords and values specified in @code{make-network-process} or
767@code{make-serial-process}, except that some of the values represent
768the current status instead of what you specified.
769
770For a network process:
771
772@table @code
773@item :buffer
774The associated value is the process buffer.
775@item :filter
776The associated value is the process filter function.
777@item :sentinel
778The associated value is the process sentinel function.
779@item :remote
780In a connection, the address in internal format of the remote peer.
781@item :local
782The local address, in internal format.
783@item :service
784In a server, if you specified @code{t} for @var{service},
785this value is the actual port number.
786@end table
787
788@code{:local} and @code{:remote} are included even if they were not
789specified explicitly in @code{make-network-process}.
790
791For a serial process, see @code{make-serial-process} and
792@code{serial-process-configure} for a list of keys.
793
794If @var{key} is a keyword, the function returns the value corresponding
795to that keyword.
796@end defun
797
b8d4c8d0
GM
798@defun process-id process
799This function returns the @acronym{PID} of @var{process}. This is an
800integer that distinguishes the process @var{process} from all other
801processes running on the same computer at the current time. The
802@acronym{PID} of a process is chosen by the operating system kernel when the
803process is started and remains constant as long as the process exists.
804@end defun
805
806@defun process-name process
807This function returns the name of @var{process}.
808@end defun
809
810@defun process-status process-name
811This function returns the status of @var{process-name} as a symbol.
170ddd97
CY
812The argument @var{process-name} must be a process, a buffer, or a
813process name (a string).
b8d4c8d0
GM
814
815The possible values for an actual subprocess are:
816
817@table @code
818@item run
819for a process that is running.
820@item stop
821for a process that is stopped but continuable.
822@item exit
823for a process that has exited.
824@item signal
825for a process that has received a fatal signal.
826@item open
827for a network connection that is open.
828@item closed
829for a network connection that is closed. Once a connection
830is closed, you cannot reopen it, though you might be able to open
831a new connection to the same place.
832@item connect
833for a non-blocking connection that is waiting to complete.
834@item failed
835for a non-blocking connection that has failed to complete.
836@item listen
837for a network server that is listening.
838@item nil
839if @var{process-name} is not the name of an existing process.
840@end table
841
842@smallexample
843@group
b8d4c8d0
GM
844(process-status (get-buffer "*shell*"))
845 @result{} run
846@end group
847@group
848x
849 @result{} #<process xx<1>>
850(process-status x)
851 @result{} exit
852@end group
853@end smallexample
854
855For a network connection, @code{process-status} returns one of the symbols
856@code{open} or @code{closed}. The latter means that the other side
857closed the connection, or Emacs did @code{delete-process}.
858@end defun
859
c73e02fa
GM
860@defun process-type process
861This function returns the symbol @code{network} for a network
862connection or server, @code{serial} for a serial port connection, or
863@code{real} for a real subprocess.
864@end defun
865
b8d4c8d0
GM
866@defun process-exit-status process
867This function returns the exit status of @var{process} or the signal
868number that killed it. (Use the result of @code{process-status} to
869determine which of those it is.) If @var{process} has not yet
870terminated, the value is 0.
871@end defun
872
873@defun process-tty-name process
874This function returns the terminal name that @var{process} is using for
875its communication with Emacs---or @code{nil} if it is using pipes
876instead of a terminal (see @code{process-connection-type} in
bace45fe
MA
877@ref{Asynchronous Processes}). If @var{process} represents a program
878running on a remote host, the terminal name used by that program on
879the remote host is provided as process property @code{remote-tty}.
b8d4c8d0
GM
880@end defun
881
882@defun process-coding-system process
883@anchor{Coding systems for a subprocess}
884This function returns a cons cell describing the coding systems in use
885for decoding output from @var{process} and for encoding input to
886@var{process} (@pxref{Coding Systems}). The value has this form:
887
888@example
889(@var{coding-system-for-decoding} . @var{coding-system-for-encoding})
890@end example
891@end defun
892
893@defun set-process-coding-system process &optional decoding-system encoding-system
894This function specifies the coding systems to use for subsequent output
895from and input to @var{process}. It will use @var{decoding-system} to
896decode subprocess output, and @var{encoding-system} to encode subprocess
897input.
898@end defun
899
900 Every process also has a property list that you can use to store
901miscellaneous values associated with the process.
902
903@defun process-get process propname
904This function returns the value of the @var{propname} property
905of @var{process}.
906@end defun
907
908@defun process-put process propname value
909This function sets the value of the @var{propname} property
910of @var{process} to @var{value}.
911@end defun
912
913@defun process-plist process
914This function returns the process plist of @var{process}.
915@end defun
916
917@defun set-process-plist process plist
918This function sets the process plist of @var{process} to @var{plist}.
919@end defun
920
921@node Input to Processes
922@section Sending Input to Processes
923@cindex process input
924
925 Asynchronous subprocesses receive input when it is sent to them by
926Emacs, which is done with the functions in this section. You must
927specify the process to send input to, and the input data to send. The
928data appears on the ``standard input'' of the subprocess.
929
930 Some operating systems have limited space for buffered input in a
931@acronym{PTY}. On these systems, Emacs sends an @acronym{EOF}
932periodically amidst the other characters, to force them through. For
933most programs, these @acronym{EOF}s do no harm.
934
935 Subprocess input is normally encoded using a coding system before the
936subprocess receives it, much like text written into a file. You can use
937@code{set-process-coding-system} to specify which coding system to use
938(@pxref{Process Information}). Otherwise, the coding system comes from
939@code{coding-system-for-write}, if that is non-@code{nil}; or else from
940the defaulting mechanism (@pxref{Default Coding Systems}).
941
942 Sometimes the system is unable to accept input for that process,
943because the input buffer is full. When this happens, the send functions
944wait a short while, accepting output from subprocesses, and then try
945again. This gives the subprocess a chance to read more of its pending
946input and make space in the buffer. It also allows filters, sentinels
947and timers to run---so take account of that in writing your code.
948
949 In these functions, the @var{process} argument can be a process or
950the name of a process, or a buffer or buffer name (which stands
951for a process via @code{get-buffer-process}). @code{nil} means
952the current buffer's process.
953
954@defun process-send-string process string
955This function sends @var{process} the contents of @var{string} as
956standard input. If it is @code{nil}, the current buffer's process is used.
957
958 The function returns @code{nil}.
959
960@smallexample
961@group
962(process-send-string "shell<1>" "ls\n")
963 @result{} nil
964@end group
965
966
967@group
968---------- Buffer: *shell* ----------
969...
970introduction.texi syntax-tables.texi~
971introduction.texi~ text.texi
972introduction.txt text.texi~
973...
974---------- Buffer: *shell* ----------
975@end group
976@end smallexample
977@end defun
978
979@defun process-send-region process start end
980This function sends the text in the region defined by @var{start} and
981@var{end} as standard input to @var{process}.
982
983An error is signaled unless both @var{start} and @var{end} are
984integers or markers that indicate positions in the current buffer. (It
985is unimportant which number is larger.)
986@end defun
987
988@defun process-send-eof &optional process
989This function makes @var{process} see an end-of-file in its
990input. The @acronym{EOF} comes after any text already sent to it.
991
992The function returns @var{process}.
993
994@smallexample
995@group
996(process-send-eof "shell")
997 @result{} "shell"
998@end group
999@end smallexample
1000@end defun
1001
106e6894
CY
1002@defun process-running-child-p &optional process
1003This function will tell you whether a @var{process} has given control of
b8d4c8d0
GM
1004its terminal to its own child process. The value is @code{t} if this is
1005true, or if Emacs cannot tell; it is @code{nil} if Emacs can be certain
1006that this is not so.
1007@end defun
1008
1009@node Signals to Processes
1010@section Sending Signals to Processes
1011@cindex process signals
1012@cindex sending signals
1013@cindex signals
1014
1015 @dfn{Sending a signal} to a subprocess is a way of interrupting its
1016activities. There are several different signals, each with its own
1017meaning. The set of signals and their names is defined by the operating
1018system. For example, the signal @code{SIGINT} means that the user has
1019typed @kbd{C-c}, or that some analogous thing has happened.
1020
1021 Each signal has a standard effect on the subprocess. Most signals
1022kill the subprocess, but some stop or resume execution instead. Most
1023signals can optionally be handled by programs; if the program handles
1024the signal, then we can say nothing in general about its effects.
1025
1026 You can send signals explicitly by calling the functions in this
1027section. Emacs also sends signals automatically at certain times:
1028killing a buffer sends a @code{SIGHUP} signal to all its associated
1029processes; killing Emacs sends a @code{SIGHUP} signal to all remaining
1030processes. (@code{SIGHUP} is a signal that usually indicates that the
1031user hung up the phone.)
1032
1033 Each of the signal-sending functions takes two optional arguments:
1034@var{process} and @var{current-group}.
1035
1036 The argument @var{process} must be either a process, a process
1037name, a buffer, a buffer name, or @code{nil}. A buffer or buffer name
1038stands for a process through @code{get-buffer-process}. @code{nil}
1039stands for the process associated with the current buffer. An error
1040is signaled if @var{process} does not identify a process.
1041
1042 The argument @var{current-group} is a flag that makes a difference
1043when you are running a job-control shell as an Emacs subprocess. If it
1044is non-@code{nil}, then the signal is sent to the current process-group
1045of the terminal that Emacs uses to communicate with the subprocess. If
1046the process is a job-control shell, this means the shell's current
1047subjob. If it is @code{nil}, the signal is sent to the process group of
1048the immediate subprocess of Emacs. If the subprocess is a job-control
1049shell, this is the shell itself.
1050
1051 The flag @var{current-group} has no effect when a pipe is used to
1052communicate with the subprocess, because the operating system does not
1053support the distinction in the case of pipes. For the same reason,
1054job-control shells won't work when a pipe is used. See
1055@code{process-connection-type} in @ref{Asynchronous Processes}.
1056
1057@defun interrupt-process &optional process current-group
1058This function interrupts the process @var{process} by sending the
1059signal @code{SIGINT}. Outside of Emacs, typing the ``interrupt
1060character'' (normally @kbd{C-c} on some systems, and @code{DEL} on
1061others) sends this signal. When the argument @var{current-group} is
1062non-@code{nil}, you can think of this function as ``typing @kbd{C-c}''
1063on the terminal by which Emacs talks to the subprocess.
1064@end defun
1065
1066@defun kill-process &optional process current-group
1067This function kills the process @var{process} by sending the
1068signal @code{SIGKILL}. This signal kills the subprocess immediately,
1069and cannot be handled by the subprocess.
1070@end defun
1071
1072@defun quit-process &optional process current-group
1073This function sends the signal @code{SIGQUIT} to the process
1074@var{process}. This signal is the one sent by the ``quit
1075character'' (usually @kbd{C-b} or @kbd{C-\}) when you are not inside
1076Emacs.
1077@end defun
1078
1079@defun stop-process &optional process current-group
1080This function stops the process @var{process} by sending the
1081signal @code{SIGTSTP}. Use @code{continue-process} to resume its
1082execution.
1083
1084Outside of Emacs, on systems with job control, the ``stop character''
1085(usually @kbd{C-z}) normally sends this signal. When
1086@var{current-group} is non-@code{nil}, you can think of this function as
1087``typing @kbd{C-z}'' on the terminal Emacs uses to communicate with the
1088subprocess.
1089@end defun
1090
1091@defun continue-process &optional process current-group
1092This function resumes execution of the process @var{process} by sending
1093it the signal @code{SIGCONT}. This presumes that @var{process} was
1094stopped previously.
1095@end defun
1096
b8d4c8d0
GM
1097@defun signal-process process signal
1098This function sends a signal to process @var{process}. The argument
1099@var{signal} specifies which signal to send; it should be an integer.
1100
1101The @var{process} argument can be a system process @acronym{ID}; that
1102allows you to send signals to processes that are not children of
23dd4ecd 1103Emacs. @xref{System Processes}.
b8d4c8d0
GM
1104@end defun
1105
1106@node Output from Processes
1107@section Receiving Output from Processes
1108@cindex process output
1109@cindex output from processes
1110
1111 There are two ways to receive the output that a subprocess writes to
1112its standard output stream. The output can be inserted in a buffer,
1113which is called the associated buffer of the process, or a function
1114called the @dfn{filter function} can be called to act on the output. If
1115the process has no buffer and no filter function, its output is
1116discarded.
1117
1118 When a subprocess terminates, Emacs reads any pending output,
1119then stops reading output from that subprocess. Therefore, if the
1120subprocess has children that are still live and still producing
1121output, Emacs won't receive that output.
1122
1123 Output from a subprocess can arrive only while Emacs is waiting: when
1124reading terminal input, in @code{sit-for} and @code{sleep-for}
1125(@pxref{Waiting}), and in @code{accept-process-output} (@pxref{Accepting
1126Output}). This minimizes the problem of timing errors that usually
1127plague parallel programming. For example, you can safely create a
1128process and only then specify its buffer or filter function; no output
1129can arrive before you finish, if the code in between does not call any
1130primitive that waits.
1131
1132@defvar process-adaptive-read-buffering
1133On some systems, when Emacs reads the output from a subprocess, the
1134output data is read in very small blocks, potentially resulting in
1135very poor performance. This behavior can be remedied to some extent
1136by setting the variable @var{process-adaptive-read-buffering} to a
1137non-@code{nil} value (the default), as it will automatically delay reading
1138from such processes, thus allowing them to produce more output before
1139Emacs tries to read it.
1140@end defvar
1141
1142 It is impossible to separate the standard output and standard error
1143streams of the subprocess, because Emacs normally spawns the subprocess
1144inside a pseudo-TTY, and a pseudo-TTY has only one output channel. If
1145you want to keep the output to those streams separate, you should
1146redirect one of them to a file---for example, by using an appropriate
1147shell command.
1148
1149@menu
1150* Process Buffers:: If no filter, output is put in a buffer.
1151* Filter Functions:: Filter functions accept output from the process.
1152* Decoding Output:: Filters can get unibyte or multibyte strings.
1153* Accepting Output:: How to wait until process output arrives.
1154@end menu
1155
1156@node Process Buffers
1157@subsection Process Buffers
1158
1159 A process can (and usually does) have an @dfn{associated buffer},
1160which is an ordinary Emacs buffer that is used for two purposes: storing
1161the output from the process, and deciding when to kill the process. You
1162can also use the buffer to identify a process to operate on, since in
1163normal practice only one process is associated with any given buffer.
1164Many applications of processes also use the buffer for editing input to
1165be sent to the process, but this is not built into Emacs Lisp.
1166
1167 Unless the process has a filter function (@pxref{Filter Functions}),
1168its output is inserted in the associated buffer. The position to insert
1169the output is determined by the @code{process-mark}, which is then
1170updated to point to the end of the text just inserted. Usually, but not
1171always, the @code{process-mark} is at the end of the buffer.
1172
16d1ff5f
CY
1173@findex process-kill-buffer-query-function
1174 Killing the associated buffer of a process also kills the process.
1175Emacs asks for confirmation first, if the process's
1176@code{process-query-on-exit-flag} is non-@code{nil} (@pxref{Query
1177Before Exit}). This confirmation is done by the function
1178@code{process-kill-buffer-query-function}, which is run from
1179@code{kill-buffer-query-functions} (@pxref{Killing Buffers}).
1180
b8d4c8d0
GM
1181@defun process-buffer process
1182This function returns the associated buffer of the process
1183@var{process}.
1184
1185@smallexample
1186@group
1187(process-buffer (get-process "shell"))
1188 @result{} #<buffer *shell*>
1189@end group
1190@end smallexample
1191@end defun
1192
1193@defun process-mark process
1194This function returns the process marker for @var{process}, which is the
1195marker that says where to insert output from the process.
1196
1197If @var{process} does not have a buffer, @code{process-mark} returns a
1198marker that points nowhere.
1199
1200Insertion of process output in a buffer uses this marker to decide where
1201to insert, and updates it to point after the inserted text. That is why
1202successive batches of output are inserted consecutively.
1203
1204Filter functions normally should use this marker in the same fashion
1205as is done by direct insertion of output in the buffer. A good
1206example of a filter function that uses @code{process-mark} is found at
1207the end of the following section.
1208
1209When the user is expected to enter input in the process buffer for
1210transmission to the process, the process marker separates the new input
1211from previous output.
1212@end defun
1213
1214@defun set-process-buffer process buffer
1215This function sets the buffer associated with @var{process} to
1216@var{buffer}. If @var{buffer} is @code{nil}, the process becomes
1217associated with no buffer.
1218@end defun
1219
1220@defun get-buffer-process buffer-or-name
1221This function returns a nondeleted process associated with the buffer
1222specified by @var{buffer-or-name}. If there are several processes
1223associated with it, this function chooses one (currently, the one most
1224recently created, but don't count on that). Deletion of a process
1225(see @code{delete-process}) makes it ineligible for this function to
1226return.
1227
1228It is usually a bad idea to have more than one process associated with
1229the same buffer.
1230
1231@smallexample
1232@group
1233(get-buffer-process "*shell*")
1234 @result{} #<process shell>
1235@end group
1236@end smallexample
1237
1238Killing the process's buffer deletes the process, which kills the
1239subprocess with a @code{SIGHUP} signal (@pxref{Signals to Processes}).
1240@end defun
1241
1242@node Filter Functions
1243@subsection Process Filter Functions
1244@cindex filter function
1245@cindex process filter
1246
1247 A process @dfn{filter function} is a function that receives the
1248standard output from the associated process. If a process has a filter,
1249then @emph{all} output from that process is passed to the filter. The
1250process buffer is used directly for output from the process only when
1251there is no filter.
1252
1253 The filter function can only be called when Emacs is waiting for
1254something, because process output arrives only at such times. Emacs
1255waits when reading terminal input, in @code{sit-for} and
1256@code{sleep-for} (@pxref{Waiting}), and in @code{accept-process-output}
1257(@pxref{Accepting Output}).
1258
1259 A filter function must accept two arguments: the associated process
1260and a string, which is output just received from it. The function is
1261then free to do whatever it chooses with the output.
1262
1263 Quitting is normally inhibited within a filter function---otherwise,
1264the effect of typing @kbd{C-g} at command level or to quit a user
1265command would be unpredictable. If you want to permit quitting inside
1266a filter function, bind @code{inhibit-quit} to @code{nil}. In most
1267cases, the right way to do this is with the macro
1268@code{with-local-quit}. @xref{Quitting}.
1269
1270 If an error happens during execution of a filter function, it is
1271caught automatically, so that it doesn't stop the execution of whatever
1272program was running when the filter function was started. However, if
1273@code{debug-on-error} is non-@code{nil}, the error-catching is turned
1274off. This makes it possible to use the Lisp debugger to debug the
1275filter function. @xref{Debugger}.
1276
1277 Many filter functions sometimes or always insert the text in the
1278process's buffer, mimicking the actions of Emacs when there is no
1279filter. Such filter functions need to use @code{set-buffer} in order to
1280be sure to insert in that buffer. To avoid setting the current buffer
1281semipermanently, these filter functions must save and restore the
78e33835
CY
1282current buffer. They should also check whether the buffer is still
1283alive, update the process marker, and in some cases update the value
1284of point. Here is how to do these things:
b8d4c8d0
GM
1285
1286@smallexample
1287@group
1288(defun ordinary-insertion-filter (proc string)
78e33835
CY
1289 (when (buffer-live-p (process-buffer proc))
1290 (with-current-buffer (process-buffer proc)
1291 (let ((moving (= (point) (process-mark proc))))
b8d4c8d0
GM
1292@end group
1293@group
78e33835 1294 (save-excursion
1e5c1c80 1295 ;; @r{Insert the text, advancing the process marker.}
78e33835
CY
1296 (goto-char (process-mark proc))
1297 (insert string)
1298 (set-marker (process-mark proc) (point)))
1299 (if moving (goto-char (process-mark proc)))))))
b8d4c8d0
GM
1300@end group
1301@end smallexample
1302
1303@noindent
1304The reason to use @code{with-current-buffer}, rather than using
1305@code{save-excursion} to save and restore the current buffer, is so as
1306to preserve the change in point made by the second call to
1307@code{goto-char}.
1308
1309 To make the filter force the process buffer to be visible whenever new
1310text arrives, insert the following line just before the
1311@code{with-current-buffer} construct:
1312
1313@smallexample
1314(display-buffer (process-buffer proc))
1315@end smallexample
1316
1317 To force point to the end of the new output, no matter where it was
1318previously, eliminate the variable @code{moving} and call
1319@code{goto-char} unconditionally.
1320
1321 In earlier Emacs versions, every filter function that did regular
1322expression searching or matching had to explicitly save and restore the
1323match data. Now Emacs does this automatically for filter functions;
1324they never need to do it explicitly. @xref{Match Data}.
1325
b8d4c8d0
GM
1326 The output to the function may come in chunks of any size. A program
1327that produces the same output twice in a row may send it as one batch of
1328200 characters one time, and five batches of 40 characters the next. If
1329the filter looks for certain text strings in the subprocess output, make
1330sure to handle the case where one of these strings is split across two
fd211f0b
CY
1331or more batches of output; one way to do this is to insert the
1332received text into a temporary buffer, which can then be searched.
b8d4c8d0
GM
1333
1334@defun set-process-filter process filter
1335This function gives @var{process} the filter function @var{filter}. If
1336@var{filter} is @code{nil}, it gives the process no filter.
1337@end defun
1338
1339@defun process-filter process
1340This function returns the filter function of @var{process}, or @code{nil}
1341if it has none.
1342@end defun
1343
1344 Here is an example of use of a filter function:
1345
1346@smallexample
1347@group
1348(defun keep-output (process output)
1349 (setq kept (cons output kept)))
1350 @result{} keep-output
1351@end group
1352@group
1353(setq kept nil)
1354 @result{} nil
1355@end group
1356@group
1357(set-process-filter (get-process "shell") 'keep-output)
1358 @result{} keep-output
1359@end group
1360@group
1361(process-send-string "shell" "ls ~/other\n")
1362 @result{} nil
1363kept
1364 @result{} ("lewis@@slug[8] % "
1365@end group
1366@group
1367"FINAL-W87-SHORT.MSS backup.otl kolstad.mss~
1368address.txt backup.psf kolstad.psf
1369backup.bib~ david.mss resume-Dec-86.mss~
1370backup.err david.psf resume-Dec.psf
1371backup.mss dland syllabus.mss
1372"
1373"#backups.mss# backup.mss~ kolstad.mss
1374")
1375@end group
1376@end smallexample
1377
1378@ignore @c The code in this example doesn't show the right way to do things.
1379Here is another, more realistic example, which demonstrates how to use
1380the process mark to do insertion in the same fashion as is done when
1381there is no filter function:
1382
1383@smallexample
1384@group
1385;; @r{Insert input in the buffer specified by @code{my-shell-buffer}}
1386;; @r{and make sure that buffer is shown in some window.}
1387(defun my-process-filter (proc str)
1388 (let ((cur (selected-window))
1389 (pop-up-windows t))
1390 (pop-to-buffer my-shell-buffer)
1391@end group
1392@group
1393 (goto-char (point-max))
1394 (insert str)
1395 (set-marker (process-mark proc) (point-max))
1396 (select-window cur)))
1397@end group
1398@end smallexample
1399@end ignore
1400
1401@node Decoding Output
1402@subsection Decoding Process Output
1403@cindex decode process output
1404
1405 When Emacs writes process output directly into a multibyte buffer,
1406it decodes the output according to the process output coding system.
1407If the coding system is @code{raw-text} or @code{no-conversion}, Emacs
1408converts the unibyte output to multibyte using
1409@code{string-to-multibyte}, and inserts the resulting multibyte text.
1410
1411 You can use @code{set-process-coding-system} to specify which coding
1412system to use (@pxref{Process Information}). Otherwise, the coding
1413system comes from @code{coding-system-for-read}, if that is
1414non-@code{nil}; or else from the defaulting mechanism (@pxref{Default
3355f04d
EZ
1415Coding Systems}). If the text output by a process contains null
1416bytes, Emacs by default uses @code{no-conversion} for it; see
1417@ref{Lisp and Coding Systems, inhibit-null-byte-detection}, for how to
1418control this behavior.
b8d4c8d0
GM
1419
1420 @strong{Warning:} Coding systems such as @code{undecided} which
1421determine the coding system from the data do not work entirely
1422reliably with asynchronous subprocess output. This is because Emacs
1423has to process asynchronous subprocess output in batches, as it
1424arrives. Emacs must try to detect the proper coding system from one
1425batch at a time, and this does not always work. Therefore, if at all
1426possible, specify a coding system that determines both the character
1427code conversion and the end of line conversion---that is, one like
1428@code{latin-1-unix}, rather than @code{undecided} or @code{latin-1}.
1429
4972c361
SM
1430@c Let's keep the index entries that were there for
1431@c set-process-filter-multibyte and process-filter-multibyte-p,
b8d4c8d0
GM
1432@cindex filter multibyte flag, of process
1433@cindex process filter multibyte flag
1434 When Emacs calls a process filter function, it provides the process
1435output as a multibyte string or as a unibyte string according to the
4972c361
SM
1436process's filter coding system. Emacs
1437decodes the output according to the process output coding system,
1438which usually produces a multibyte string, except for coding systems
1439such as @code{binary} and @code{raw-text}
b8d4c8d0
GM
1440
1441@node Accepting Output
1442@subsection Accepting Output from Processes
1443@cindex accept input from processes
1444
1445 Output from asynchronous subprocesses normally arrives only while
1446Emacs is waiting for some sort of external event, such as elapsed time
1447or terminal input. Occasionally it is useful in a Lisp program to
1448explicitly permit output to arrive at a specific point, or even to wait
1449until output arrives from a process.
1450
1451@defun accept-process-output &optional process seconds millisec just-this-one
1452This function allows Emacs to read pending output from processes. The
1453output is inserted in the associated buffers or given to their filter
1454functions. If @var{process} is non-@code{nil} then this function does
1455not return until some output has been received from @var{process}.
1456
1457@c Emacs 19 feature
1458The arguments @var{seconds} and @var{millisec} let you specify timeout
1459periods. The former specifies a period measured in seconds and the
1460latter specifies one measured in milliseconds. The two time periods
1461thus specified are added together, and @code{accept-process-output}
1462returns after that much time, whether or not there has been any
1463subprocess output.
1464
1465The argument @var{millisec} is semi-obsolete nowadays because
1466@var{seconds} can be a floating point number to specify waiting a
1467fractional number of seconds. If @var{seconds} is 0, the function
1468accepts whatever output is pending but does not wait.
1469
1470@c Emacs 22.1 feature
1471If @var{process} is a process, and the argument @var{just-this-one} is
1472non-@code{nil}, only output from that process is handled, suspending output
1473from other processes until some output has been received from that
1474process or the timeout expires. If @var{just-this-one} is an integer,
1475also inhibit running timers. This feature is generally not
1476recommended, but may be necessary for specific applications, such as
1477speech synthesis.
1478
1479The function @code{accept-process-output} returns non-@code{nil} if it
1480did get some output, or @code{nil} if the timeout expired before output
1481arrived.
1482@end defun
1483
1484@node Sentinels
1485@section Sentinels: Detecting Process Status Changes
1486@cindex process sentinel
1487@cindex sentinel (of process)
1488
1489 A @dfn{process sentinel} is a function that is called whenever the
1490associated process changes status for any reason, including signals
1491(whether sent by Emacs or caused by the process's own actions) that
1492terminate, stop, or continue the process. The process sentinel is
1493also called if the process exits. The sentinel receives two
1494arguments: the process for which the event occurred, and a string
1495describing the type of event.
1496
1497 The string describing the event looks like one of the following:
1498
1499@itemize @bullet
1500@item
1501@code{"finished\n"}.
1502
1503@item
1504@code{"exited abnormally with code @var{exitcode}\n"}.
1505
1506@item
1507@code{"@var{name-of-signal}\n"}.
1508
1509@item
1510@code{"@var{name-of-signal} (core dumped)\n"}.
1511@end itemize
1512
1513 A sentinel runs only while Emacs is waiting (e.g., for terminal
1514input, or for time to elapse, or for process output). This avoids the
1515timing errors that could result from running them at random places in
1516the middle of other Lisp programs. A program can wait, so that
1517sentinels will run, by calling @code{sit-for} or @code{sleep-for}
1518(@pxref{Waiting}), or @code{accept-process-output} (@pxref{Accepting
1519Output}). Emacs also allows sentinels to run when the command loop is
1520reading input. @code{delete-process} calls the sentinel when it
1521terminates a running process.
1522
1523 Emacs does not keep a queue of multiple reasons to call the sentinel
1524of one process; it records just the current status and the fact that
1525there has been a change. Therefore two changes in status, coming in
1526quick succession, can call the sentinel just once. However, process
1527termination will always run the sentinel exactly once. This is
1528because the process status can't change again after termination.
1529
1530 Emacs explicitly checks for output from the process before running
1531the process sentinel. Once the sentinel runs due to process
1532termination, no further output can arrive from the process.
1533
1534 A sentinel that writes the output into the buffer of the process
1535should check whether the buffer is still alive. If it tries to insert
1536into a dead buffer, it will get an error. If the buffer is dead,
1537@code{(buffer-name (process-buffer @var{process}))} returns @code{nil}.
1538
1539 Quitting is normally inhibited within a sentinel---otherwise, the
1540effect of typing @kbd{C-g} at command level or to quit a user command
1541would be unpredictable. If you want to permit quitting inside a
1542sentinel, bind @code{inhibit-quit} to @code{nil}. In most cases, the
1543right way to do this is with the macro @code{with-local-quit}.
1544@xref{Quitting}.
1545
1546 If an error happens during execution of a sentinel, it is caught
1547automatically, so that it doesn't stop the execution of whatever
1548programs was running when the sentinel was started. However, if
1549@code{debug-on-error} is non-@code{nil}, the error-catching is turned
1550off. This makes it possible to use the Lisp debugger to debug the
1551sentinel. @xref{Debugger}.
1552
1553 While a sentinel is running, the process sentinel is temporarily
1554set to @code{nil} so that the sentinel won't run recursively.
1555For this reason it is not possible for a sentinel to specify
1556a new sentinel.
1557
1558 In earlier Emacs versions, every sentinel that did regular expression
1559searching or matching had to explicitly save and restore the match data.
1560Now Emacs does this automatically for sentinels; they never need to do
1561it explicitly. @xref{Match Data}.
1562
1563@defun set-process-sentinel process sentinel
1564This function associates @var{sentinel} with @var{process}. If
1565@var{sentinel} is @code{nil}, then the process will have no sentinel.
1566The default behavior when there is no sentinel is to insert a message in
1567the process's buffer when the process status changes.
1568
1569Changes in process sentinel take effect immediately---if the sentinel
1570is slated to be run but has not been called yet, and you specify a new
1571sentinel, the eventual call to the sentinel will use the new one.
1572
1573@smallexample
1574@group
1575(defun msg-me (process event)
1576 (princ
1577 (format "Process: %s had the event `%s'" process event)))
1578(set-process-sentinel (get-process "shell") 'msg-me)
1579 @result{} msg-me
1580@end group
1581@group
1582(kill-process (get-process "shell"))
1583 @print{} Process: #<process shell> had the event `killed'
1584 @result{} #<process shell>
1585@end group
1586@end smallexample
1587@end defun
1588
1589@defun process-sentinel process
1590This function returns the sentinel of @var{process}, or @code{nil} if it
1591has none.
1592@end defun
1593
1594@defun waiting-for-user-input-p
1595While a sentinel or filter function is running, this function returns
1596non-@code{nil} if Emacs was waiting for keyboard input from the user at
1597the time the sentinel or filter function was called, @code{nil} if it
1598was not.
1599@end defun
1600
1601@node Query Before Exit
1602@section Querying Before Exit
1603
1604 When Emacs exits, it terminates all its subprocesses by sending them
1605the @code{SIGHUP} signal. Because subprocesses may be doing
1606valuable work, Emacs normally asks the user to confirm that it is ok
1607to terminate them. Each process has a query flag which, if
1608non-@code{nil}, says that Emacs should ask for confirmation before
1609exiting and thus killing that process. The default for the query flag
1610is @code{t}, meaning @emph{do} query.
1611
1612@defun process-query-on-exit-flag process
1613This returns the query flag of @var{process}.
1614@end defun
1615
1616@defun set-process-query-on-exit-flag process flag
1617This function sets the query flag of @var{process} to @var{flag}. It
1618returns @var{flag}.
1619
1620@smallexample
1621@group
1622;; @r{Don't query about the shell process}
1623(set-process-query-on-exit-flag (get-process "shell") nil)
1624 @result{} t
1625@end group
1626@end smallexample
1627@end defun
1628
1629@defun process-kill-without-query process &optional do-query
1630This function clears the query flag of @var{process}, so that
1631Emacs will not query the user on account of that process.
1632
1633Actually, the function does more than that: it returns the old value of
1634the process's query flag, and sets the query flag to @var{do-query}.
1635Please don't use this function to do those things any more---please
1636use the newer, cleaner functions @code{process-query-on-exit-flag} and
1637@code{set-process-query-on-exit-flag} in all but the simplest cases.
1638The only way you should use @code{process-kill-without-query} nowadays
1639is like this:
1640
1641@smallexample
1642@group
1643;; @r{Don't query about the shell process}
1644(process-kill-without-query (get-process "shell"))
1645@end group
1646@end smallexample
1647@end defun
1648
23dd4ecd
EZ
1649@node System Processes
1650@section Accessing Other Processes
1651@cindex system processes
1652
1653 In addition to accessing and manipulating processes that are
1654subprocesses of the current Emacs session, Emacs Lisp programs can
1655also access other processes running on the same machine. We call
1656these @dfn{system processes}, to distinguish between them and Emacs
1657subprocesses.
1658
1659 Emacs provides several primitives for accessing system processes.
1660Not all platforms support these primitives; on those which don't,
1661these primitives return @code{nil}.
1662
1663@defun list-system-processes
1664This function returns a list of all the processes running on the
1665system. Each process is identified by its @acronym{PID}, a numerical
1666process ID that is assigned by the OS and distinguishes the process
1667from all the other processes running on the same machine at the same
1668time.
1669@end defun
1670
a20878b6 1671@defun process-attributes pid
23dd4ecd
EZ
1672This function returns an alist of attributes for the process specified
1673by its process ID @var{pid}. Each association in the alist is of the
1674form @code{(@var{key} . @var{value})}, where @var{key} designates the
1675attribute and @var{value} is the value of that attribute. The various
1676attribute @var{key}'s that this function can return are listed below.
1677Not all platforms support all of these attributes; if an attribute is
1678not supported, its association will not appear in the returned alist.
1679Values that are numbers can be either integer or floating-point,
1680depending on the magnitude of the value.
1681
1682@table @code
1683@item euid
1684The effective user ID of the user who invoked the process. The
1685corresponding @var{value} is a number. If the process was invoked by
1686the same user who runs the current Emacs session, the value is
1687identical to what @code{user-uid} returns (@pxref{User
1688Identification}).
1689
1690@item user
1691User name corresponding to the process's effective user ID, a string.
1692
1693@item egid
1694The group ID of the effective user ID, a number.
1695
1696@item group
1697Group name corresponding to the effective user's group ID, a string.
1698
1699@item comm
1700The name of the command that runs in the process. This is a string
1701that usually specifies the name of the executable file of the process,
1702without the leading directories. However, some special system
1703processes can report strings that do not correspond to an executable
1704file of a program.
1705
1706@item state
1707The state code of the process. This is a short string that encodes
1708the scheduling state of the process. Here's a list of the most
1709frequently seen codes:
1710
1711@table @code
1dca458f 1712@item "D"
23dd4ecd 1713uninterruptible sleep (usually I/O)
1dca458f 1714@item "R"
23dd4ecd 1715running
1dca458f 1716@item "S"
23dd4ecd 1717interruptible sleep (waiting for some event)
1dca458f 1718@item "T"
23dd4ecd 1719stopped, e.g., by a job control signal
1dca458f
EZ
1720@item "Z"
1721``zombie'': a process that terminated, but was not reaped by its parent
23dd4ecd
EZ
1722@end table
1723
1724@noindent
1725For the full list of the possible states, see the manual page of the
1726@command{ps} command.
1727
1728@item ppid
1729The process ID of the parent process, a number.
1730
1731@item pgrp
1732The process group ID of the process, a number.
1733
1734@item sess
1735The session ID of the process. This is a number that is the process
1736ID of the process's @dfn{session leader}.
1737
1738@item ttname
1739A string that is the name of the process's controlling terminal. On
1740Unix and GNU systems, this is normally the file name of the
1741corresponding terminal device, such as @file{/dev/pts65}.
1742
1743@item tpgid
1744The numerical process group ID of the foreground process group that
1745uses the process's terminal.
1746
1747@item minflt
1748The number of minor page faults caused by the process since its
1749beginning. (Minor page faults are those that don't involve reading
1750from disk.)
1751
1752@item majflt
1753The number of major page faults caused by the process since its
1754beginning. (Major page faults require a disk to be read, and are thus
1755more expensive than minor page faults.)
1756
1757@item cminflt
1758@itemx cmajflt
1759Like @code{minflt} and @code{majflt}, but include the number of page
1760faults for all the child processes of the given process.
1761
1762@item utime
1763Time spent by the process in the user context, for running the
1764application's code. The corresponding @var{value} is in the
1765@w{@code{(@var{high} @var{low} @var{microsec})}} format, the same
1766format used by functions @code{current-time} (@pxref{Time of Day,
1767current-time}) and @code{file-attributes} (@pxref{File Attributes}).
1768
1769@item stime
1770Time spent by the process in the system (kernel) context, for
1771processing system calls. The corresponding @var{value} is in the same
1772format as for @code{utime}.
1773
af34ad36
EZ
1774@item time
1775The sum of @code{utime} and @code{stime}. The corresponding
1776@var{value} is in the same format as for @code{utime}.
1777
23dd4ecd
EZ
1778@item cutime
1779@itemx cstime
af34ad36
EZ
1780@itemx ctime
1781Like @code{utime}, @code{stime}, and @code{time}, but include the
1782times of all the child processes of the given process.
23dd4ecd
EZ
1783
1784@item pri
1785The numerical priority of the process.
1786
1787@item nice
1dca458f
EZ
1788The @dfn{nice value} of the process, a number. (Processes with smaller
1789nice values get scheduled more favorably.)
23dd4ecd
EZ
1790
1791@item thcount
1792The number of threads in the process.
1793
1794@item start
1795The time the process was started, in the @w{@code{(@var{high}
1796@var{low} @var{microsec})}} format used by @code{current-time} and
1797@code{file-attributes}.
1798
1799@item etime
1800The time elapsed since the process started, in the @w{@code{(@var{high}
1801@var{low} @var{microsec})}} format.
1802
1803@item vsize
1804The virtual memory size of the process, measured in kilobytes.
1805
1806@item rss
1807The size of the process's @dfn{resident set}, the number of kilobytes
1808occupied by the process in the machine's physical memory.
1809
1810@item pcpu
1811The percentage of the CPU time used by the process since it started.
1812The corresponding @var{value} is a floating-point number between 0 and
1813100.
1814
1815@item pmem
1816The percentage of the total physical memory installed on the machine
1817used by the process's resident set. The value is a floating-point
1818number between 0 and 100.
1819
1820@item args
1821The command-line with which the process was invoked. This is a string
1822in which individual command-line arguments are separated by blanks;
1823whitespace characters that are embedded in the arguments are quoted as
1824appropriate for the system's shell: escaped by backslash characters on
1825GNU and Unix, and enclosed in double quote characters on Windows.
1826Thus, this command-line string can be directly used in primitives such
1827as @code{shell-command}.
1828@end table
1829
1830@end defun
1831
1832
b8d4c8d0
GM
1833@node Transaction Queues
1834@section Transaction Queues
1835@cindex transaction queue
1836
1837You can use a @dfn{transaction queue} to communicate with a subprocess
1838using transactions. First use @code{tq-create} to create a transaction
1839queue communicating with a specified process. Then you can call
1840@code{tq-enqueue} to send a transaction.
1841
1842@defun tq-create process
1843This function creates and returns a transaction queue communicating with
1844@var{process}. The argument @var{process} should be a subprocess
1845capable of sending and receiving streams of bytes. It may be a child
1846process, or it may be a TCP connection to a server, possibly on another
1847machine.
1848@end defun
1849
1850@defun tq-enqueue queue question regexp closure fn &optional delay-question
1851This function sends a transaction to queue @var{queue}. Specifying the
1852queue has the effect of specifying the subprocess to talk to.
1853
1854The argument @var{question} is the outgoing message that starts the
1855transaction. The argument @var{fn} is the function to call when the
1856corresponding answer comes back; it is called with two arguments:
1857@var{closure}, and the answer received.
1858
1859The argument @var{regexp} is a regular expression that should match
1860text at the end of the entire answer, but nothing before; that's how
1861@code{tq-enqueue} determines where the answer ends.
1862
4185820c
CY
1863If the argument @var{delay-question} is non-@code{nil}, delay sending
1864this question until the process has finished replying to any previous
b8d4c8d0
GM
1865questions. This produces more reliable results with some processes.
1866
1867The return value of @code{tq-enqueue} itself is not meaningful.
1868@end defun
1869
1870@defun tq-close queue
1871Shut down transaction queue @var{queue}, waiting for all pending transactions
1872to complete, and then terminate the connection or child process.
1873@end defun
1874
1875Transaction queues are implemented by means of a filter function.
1876@xref{Filter Functions}.
1877
1878@node Network
1879@section Network Connections
1880@cindex network connection
1881@cindex TCP
1882@cindex UDP
1883
1884 Emacs Lisp programs can open stream (TCP) and datagram (UDP) network
1885connections to other processes on the same machine or other machines.
1886A network connection is handled by Lisp much like a subprocess, and is
1887represented by a process object. However, the process you are
1888communicating with is not a child of the Emacs process, so it has no
1889process @acronym{ID}, and you can't kill it or send it signals. All you
1890can do is send and receive data. @code{delete-process} closes the
1891connection, but does not kill the program at the other end; that
1892program must decide what to do about closure of the connection.
1893
1894 Lisp programs can listen for connections by creating network
1895servers. A network server is also represented by a kind of process
1896object, but unlike a network connection, the network server never
1897transfers data itself. When it receives a connection request, it
1898creates a new network connection to represent the connection just
1899made. (The network connection inherits certain information, including
1900the process plist, from the server.) The network server then goes
1901back to listening for more connection requests.
1902
1903 Network connections and servers are created by calling
1904@code{make-network-process} with an argument list consisting of
1905keyword/argument pairs, for example @code{:server t} to create a
1906server process, or @code{:type 'datagram} to create a datagram
1907connection. @xref{Low-Level Network}, for details. You can also use
1908the @code{open-network-stream} function described below.
1909
c73e02fa
GM
1910 To distinguish the different types of processes, the
1911@code{process-type} function returns the symbol @code{network} for a
1912network connection or server, @code{serial} for a serial port
1913connection, or @code{real} for a real subprocess.
1914
1915 The @code{process-status} function returns @code{open},
1916@code{closed}, @code{connect}, and @code{failed} for network
1917connections. For a network server, the status is always
b8d4c8d0
GM
1918@code{listen}. None of those values is possible for a real
1919subprocess. @xref{Process Information}.
1920
1921 You can stop and resume operation of a network process by calling
1922@code{stop-process} and @code{continue-process}. For a server
1923process, being stopped means not accepting new connections. (Up to 5
1924connection requests will be queued for when you resume the server; you
1925can increase this limit, unless it is imposed by the operating
1926system.) For a network stream connection, being stopped means not
1927processing input (any arriving input waits until you resume the
1928connection). For a datagram connection, some number of packets may be
1929queued but input may be lost. You can use the function
1930@code{process-command} to determine whether a network connection or
1931server is stopped; a non-@code{nil} value means yes.
1932
1933@defun open-network-stream name buffer-or-name host service
1934This function opens a TCP connection, and returns a process object
1935that represents the connection.
1936
1937The @var{name} argument specifies the name for the process object. It
1938is modified as necessary to make it unique.
1939
1940The @var{buffer-or-name} argument is the buffer to associate with the
1941connection. Output from the connection is inserted in the buffer,
1942unless you specify a filter function to handle the output. If
1943@var{buffer-or-name} is @code{nil}, it means that the connection is not
1944associated with any buffer.
1945
1946The arguments @var{host} and @var{service} specify where to connect to;
1947@var{host} is the host name (a string), and @var{service} is the name of
1948a defined network service (a string) or a port number (an integer).
1949@end defun
1950
b8d4c8d0
GM
1951@node Network Servers
1952@section Network Servers
1953@cindex network servers
1954
1955 You create a server by calling @code{make-network-process} with
1956@code{:server t}. The server will listen for connection requests from
1957clients. When it accepts a client connection request, that creates a
1958new network connection, itself a process object, with the following
1959parameters:
1960
1961@itemize @bullet
1962@item
1963The connection's process name is constructed by concatenating the
1964server process' @var{name} with a client identification string. The
1965client identification string for an IPv4 connection looks like
1966@samp{<@var{a}.@var{b}.@var{c}.@var{d}:@var{p}>}. Otherwise, it is a
1967unique number in brackets, as in @samp{<@var{nnn}>}. The number
1968is unique for each connection in the Emacs session.
1969
1970@item
1971If the server's filter is non-@code{nil}, the connection process does
1972not get a separate process buffer; otherwise, Emacs creates a new
1973buffer for the purpose. The buffer name is the server's buffer name
1974or process name, concatenated with the client identification string.
1975
1976The server's process buffer value is never used directly by Emacs, but
1977it is passed to the log function, which can log connections by
1978inserting text there.
1979
1980@item
1981The communication type and the process filter and sentinel are
1982inherited from those of the server. The server never directly
1983uses its filter and sentinel; their sole purpose is to initialize
1984connections made to the server.
1985
1986@item
1987The connection's process contact info is set according to the client's
1988addressing information (typically an IP address and a port number).
1989This information is associated with the @code{process-contact}
1990keywords @code{:host}, @code{:service}, @code{:remote}.
1991
1992@item
1993The connection's local address is set up according to the port
1994number used for the connection.
1995
1996@item
1997The client process' plist is initialized from the server's plist.
1998@end itemize
1999
2000@node Datagrams
2001@section Datagrams
2002@cindex datagrams
2003
2004 A datagram connection communicates with individual packets rather
2005than streams of data. Each call to @code{process-send} sends one
2006datagram packet (@pxref{Input to Processes}), and each datagram
2007received results in one call to the filter function.
2008
2009 The datagram connection doesn't have to talk with the same remote
2010peer all the time. It has a @dfn{remote peer address} which specifies
2011where to send datagrams to. Each time an incoming datagram is passed
2012to the filter function, the peer address is set to the address that
2013datagram came from; that way, if the filter function sends a datagram,
2014it will go back to that place. You can specify the remote peer
2015address when you create the datagram connection using the
2016@code{:remote} keyword. You can change it later on by calling
2017@code{set-process-datagram-address}.
2018
2019@defun process-datagram-address process
2020If @var{process} is a datagram connection or server, this function
2021returns its remote peer address.
2022@end defun
2023
2024@defun set-process-datagram-address process address
2025If @var{process} is a datagram connection or server, this function
2026sets its remote peer address to @var{address}.
2027@end defun
2028
2029@node Low-Level Network
2030@section Low-Level Network Access
2031
2032 You can also create network connections by operating at a lower
2033level than that of @code{open-network-stream}, using
2034@code{make-network-process}.
2035
2036@menu
2037* Proc: Network Processes. Using @code{make-network-process}.
2038* Options: Network Options. Further control over network connections.
2039* Features: Network Feature Testing.
2040 Determining which network features work on
2041 the machine you are using.
2042@end menu
2043
2044@node Network Processes
2045@subsection @code{make-network-process}
2046
2047 The basic function for creating network connections and network
2048servers is @code{make-network-process}. It can do either of those
2049jobs, depending on the arguments you give it.
2050
2051@defun make-network-process &rest args
2052This function creates a network connection or server and returns the
2053process object that represents it. The arguments @var{args} are a
2054list of keyword/argument pairs. Omitting a keyword is always
2055equivalent to specifying it with value @code{nil}, except for
2056@code{:coding}, @code{:filter-multibyte}, and @code{:reuseaddr}. Here
2057are the meaningful keywords:
2058
2059@table @asis
2060@item :name @var{name}
2061Use the string @var{name} as the process name. It is modified if
2062necessary to make it unique.
2063
2064@item :type @var{type}
2065Specify the communication type. A value of @code{nil} specifies a
2066stream connection (the default); @code{datagram} specifies a datagram
f7704e26
CY
2067connection; @code{seqpacket} specifies a ``sequenced packet stream''
2068connection. Both connections and servers can be of these types.
b8d4c8d0
GM
2069
2070@item :server @var{server-flag}
2071If @var{server-flag} is non-@code{nil}, create a server. Otherwise,
2072create a connection. For a stream type server, @var{server-flag} may
2073be an integer which then specifies the length of the queue of pending
2074connections to the server. The default queue length is 5.
2075
2076@item :host @var{host}
2077Specify the host to connect to. @var{host} should be a host name or
2078Internet address, as a string, or the symbol @code{local} to specify
2079the local host. If you specify @var{host} for a server, it must
2080specify a valid address for the local host, and only clients
2081connecting to that address will be accepted.
2082
2083@item :service @var{service}
2084@var{service} specifies a port number to connect to, or, for a server,
2085the port number to listen on. It should be a service name that
2086translates to a port number, or an integer specifying the port number
2087directly. For a server, it can also be @code{t}, which means to let
2088the system select an unused port number.
2089
2090@item :family @var{family}
2091@var{family} specifies the address (and protocol) family for
2092communication. @code{nil} means determine the proper address family
2093automatically for the given @var{host} and @var{service}.
2094@code{local} specifies a Unix socket, in which case @var{host} is
2095ignored. @code{ipv4} and @code{ipv6} specify to use IPv4 and IPv6
2096respectively.
2097
2098@item :local @var{local-address}
2099For a server process, @var{local-address} is the address to listen on.
2100It overrides @var{family}, @var{host} and @var{service}, and you
2101may as well not specify them.
2102
2103@item :remote @var{remote-address}
2104For a connection, @var{remote-address} is the address to connect to.
2105It overrides @var{family}, @var{host} and @var{service}, and you
2106may as well not specify them.
2107
2108For a datagram server, @var{remote-address} specifies the initial
2109setting of the remote datagram address.
2110
2111The format of @var{local-address} or @var{remote-address} depends on
2112the address family:
2113
2114@itemize -
2115@item
2116An IPv4 address is represented as a five-element vector of four 8-bit
2117integers and one 16-bit integer
2118@code{[@var{a} @var{b} @var{c} @var{d} @var{p}]} corresponding to
2119numeric IPv4 address @var{a}.@var{b}.@var{c}.@var{d} and port number
2120@var{p}.
2121
2122@item
2123An IPv6 address is represented as a nine-element vector of 16-bit
2124integers @code{[@var{a} @var{b} @var{c} @var{d} @var{e} @var{f}
2125@var{g} @var{h} @var{p}]} corresponding to numeric IPv6 address
2126@var{a}:@var{b}:@var{c}:@var{d}:@var{e}:@var{f}:@var{g}:@var{h} and
2127port number @var{p}.
2128
2129@item
2130A local address is represented as a string which specifies the address
2131in the local address space.
2132
2133@item
2134An ``unsupported family'' address is represented by a cons
2135@code{(@var{f} . @var{av})}, where @var{f} is the family number and
2136@var{av} is a vector specifying the socket address using one element
2137per address data byte. Do not rely on this format in portable code,
2138as it may depend on implementation defined constants, data sizes, and
2139data structure alignment.
2140@end itemize
2141
2142@item :nowait @var{bool}
2143If @var{bool} is non-@code{nil} for a stream connection, return
2144without waiting for the connection to complete. When the connection
2145succeeds or fails, Emacs will call the sentinel function, with a
2146second argument matching @code{"open"} (if successful) or
2147@code{"failed"}. The default is to block, so that
2148@code{make-network-process} does not return until the connection
2149has succeeded or failed.
2150
2151@item :stop @var{stopped}
2152Start the network connection or server in the `stopped' state if
2153@var{stopped} is non-@code{nil}.
2154
2155@item :buffer @var{buffer}
2156Use @var{buffer} as the process buffer.
2157
2158@item :coding @var{coding}
2159Use @var{coding} as the coding system for this process. To specify
2160different coding systems for decoding data from the connection and for
2161encoding data sent to it, specify @code{(@var{decoding} .
2162@var{encoding})} for @var{coding}.
2163
2164If you don't specify this keyword at all, the default
2165is to determine the coding systems from the data.
2166
2167@item :noquery @var{query-flag}
2168Initialize the process query flag to @var{query-flag}.
2169@xref{Query Before Exit}.
2170
2171@item :filter @var{filter}
2172Initialize the process filter to @var{filter}.
2173
b8d4c8d0
GM
2174@item :sentinel @var{sentinel}
2175Initialize the process sentinel to @var{sentinel}.
2176
2177@item :log @var{log}
2178Initialize the log function of a server process to @var{log}. The log
2179function is called each time the server accepts a network connection
2180from a client. The arguments passed to the log function are
2181@var{server}, @var{connection}, and @var{message}, where @var{server}
2182is the server process, @var{connection} is the new process for the
2183connection, and @var{message} is a string describing what has
2184happened.
2185
2186@item :plist @var{plist}
2187Initialize the process plist to @var{plist}.
2188@end table
2189
2190The original argument list, modified with the actual connection
2191information, is available via the @code{process-contact} function.
2192@end defun
2193
2194@node Network Options
2195@subsection Network Options
2196
2197 The following network options can be specified when you create a
2198network process. Except for @code{:reuseaddr}, you can also set or
2199modify these options later, using @code{set-network-process-option}.
2200
2201 For a server process, the options specified with
2202@code{make-network-process} are not inherited by the client
2203connections, so you will need to set the necessary options for each
2204child connection as it is created.
2205
2206@table @asis
2207@item :bindtodevice @var{device-name}
2208If @var{device-name} is a non-empty string identifying a network
2209interface name (see @code{network-interface-list}), only handle
2210packets received on that interface. If @var{device-name} is @code{nil}
2211(the default), handle packets received on any interface.
2212
2213Using this option may require special privileges on some systems.
2214
2215@item :broadcast @var{broadcast-flag}
2216If @var{broadcast-flag} is non-@code{nil} for a datagram process, the
2217process will receive datagram packet sent to a broadcast address, and
2218be able to send packets to a broadcast address. Ignored for a stream
2219connection.
2220
2221@item :dontroute @var{dontroute-flag}
2222If @var{dontroute-flag} is non-@code{nil}, the process can only send
2223to hosts on the same network as the local host.
2224
2225@item :keepalive @var{keepalive-flag}
2226If @var{keepalive-flag} is non-@code{nil} for a stream connection,
2227enable exchange of low-level keep-alive messages.
2228
2229@item :linger @var{linger-arg}
2230If @var{linger-arg} is non-@code{nil}, wait for successful
2231transmission of all queued packets on the connection before it is
2232deleted (see @code{delete-process}). If @var{linger-arg} is an
2233integer, it specifies the maximum time in seconds to wait for queued
2234packets to be sent before closing the connection. Default is
2235@code{nil} which means to discard unsent queued packets when the
2236process is deleted.
2237
2238@item :oobinline @var{oobinline-flag}
2239If @var{oobinline-flag} is non-@code{nil} for a stream connection,
2240receive out-of-band data in the normal data stream. Otherwise, ignore
2241out-of-band data.
2242
2243@item :priority @var{priority}
2244Set the priority for packets sent on this connection to the integer
2245@var{priority}. The interpretation of this number is protocol
2246specific, such as setting the TOS (type of service) field on IP
2247packets sent on this connection. It may also have system dependent
2248effects, such as selecting a specific output queue on the network
2249interface.
2250
2251@item :reuseaddr @var{reuseaddr-flag}
2252If @var{reuseaddr-flag} is non-@code{nil} (the default) for a stream
2253server process, allow this server to reuse a specific port number (see
2254@code{:service}) unless another process on this host is already
2255listening on that port. If @var{reuseaddr-flag} is @code{nil}, there
2256may be a period of time after the last use of that port (by any
2257process on the host), where it is not possible to make a new server on
2258that port.
2259@end table
2260
106e6894 2261@defun set-network-process-option process option value &optional no-error
b8d4c8d0
GM
2262This function sets or modifies a network option for network process
2263@var{process}. See @code{make-network-process} for details of options
106e6894
CY
2264@var{option} and their corresponding values @var{value}. If
2265@var{no-error} is non-@code{nil}, this function returns @code{nil}
2266instead of signaling an error if @var{option} is not a supported
2267option. If the function successfully completes, it returns @code{t}.
b8d4c8d0
GM
2268
2269The current setting of an option is available via the
2270@code{process-contact} function.
2271@end defun
2272
2273@node Network Feature Testing
2274@subsection Testing Availability of Network Features
2275
2276 To test for the availability of a given network feature, use
2277@code{featurep} like this:
2278
2279@example
2280(featurep 'make-network-process '(@var{keyword} @var{value}))
2281@end example
2282
2283@noindent
2284The result of the first form is @code{t} if it works to specify
2285@var{keyword} with value @var{value} in @code{make-network-process}.
2286The result of the second form is @code{t} if @var{keyword} is
2287supported by @code{make-network-process}. Here are some of the
2288@var{keyword}---@var{value} pairs you can test in
2289this way.
2290
2291@table @code
2292@item (:nowait t)
2293Non-@code{nil} if non-blocking connect is supported.
2294@item (:type datagram)
2295Non-@code{nil} if datagrams are supported.
2296@item (:family local)
2297Non-@code{nil} if local (a.k.a.@: ``UNIX domain'') sockets are supported.
2298@item (:family ipv6)
2299Non-@code{nil} if IPv6 is supported.
2300@item (:service t)
2301Non-@code{nil} if the system can select the port for a server.
2302@end table
2303
2304 To test for the availability of a given network option, use
2305@code{featurep} like this:
2306
2307@example
2308(featurep 'make-network-process '@var{keyword})
2309@end example
2310
2311@noindent
2312Here are some of the options you can test in this way.
2313
2314@table @code
2315@item :bindtodevice
2316@itemx :broadcast
2317@itemx :dontroute
2318@itemx :keepalive
2319@itemx :linger
2320@itemx :oobinline
2321@itemx :priority
2322@itemx :reuseaddr
2323That particular network option is supported by
2324@code{make-network-process} and @code{set-network-process-option}.
2325@end table
2326
2327@node Misc Network
2328@section Misc Network Facilities
2329
2330 These additional functions are useful for creating and operating
305a7ef2
EZ
2331on network connections. Note that they are supported only on some
2332systems.
b8d4c8d0
GM
2333
2334@defun network-interface-list
2335This function returns a list describing the network interfaces
2336of the machine you are using. The value is an alist whose
2337elements have the form @code{(@var{name} . @var{address})}.
2338@var{address} has the same form as the @var{local-address}
2339and @var{remote-address} arguments to @code{make-network-process}.
2340@end defun
2341
2342@defun network-interface-info ifname
2343This function returns information about the network interface named
2344@var{ifname}. The value is a list of the form
2345@code{(@var{addr} @var{bcast} @var{netmask} @var{hwaddr} @var{flags})}.
2346
2347@table @var
2348@item addr
2349The Internet protocol address.
2350@item bcast
2351The broadcast address.
2352@item netmask
2353The network mask.
2354@item hwaddr
2355The layer 2 address (Ethernet MAC address, for instance).
2356@item flags
2357The current flags of the interface.
2358@end table
2359@end defun
2360
2361@defun format-network-address address &optional omit-port
2362This function converts the Lisp representation of a network address to
2363a string.
2364
2365A five-element vector @code{[@var{a} @var{b} @var{c} @var{d} @var{p}]}
2366represents an IPv4 address @var{a}.@var{b}.@var{c}.@var{d} and port
2367number @var{p}. @code{format-network-address} converts that to the
2368string @code{"@var{a}.@var{b}.@var{c}.@var{d}:@var{p}"}.
2369
2370A nine-element vector @code{[@var{a} @var{b} @var{c} @var{d} @var{e}
2371@var{f} @var{g} @var{h} @var{p}]} represents an IPv6 address along
2372with a port number. @code{format-network-address} converts that to
2373the string
2374@code{"[@var{a}:@var{b}:@var{c}:@var{d}:@var{e}:@var{f}:@var{g}:@var{h}]:@var{p}"}.
2375
2376If the vector does not include the port number, @var{p}, or if
2377@var{omit-port} is non-@code{nil}, the result does not include the
2378@code{:@var{p}} suffix.
2379@end defun
2380
c73e02fa
GM
2381@node Serial Ports
2382@section Communicating with Serial Ports
2383@cindex @file{/dev/tty}
2384@cindex @file{COM1}
545c2782 2385@cindex serial connections
c73e02fa
GM
2386
2387 Emacs can communicate with serial ports. For interactive use,
2388@kbd{M-x serial-term} opens a terminal window. In a Lisp program,
2389@code{make-serial-process} creates a process object.
2390
2391 The serial port can be configured at run-time, without having to
2392close and re-open it. The function @code{serial-process-configure}
2393lets you change the speed, bytesize, and other parameters. In a
2394terminal window created by @code{serial-term}, you can click on the
2395mode line for configuration.
2396
2397 A serial connection is represented by a process object which can be
2398used similar to a subprocess or network process. You can send and
2399receive data and configure the serial port. A serial process object
80e6b6df
EZ
2400has no process ID, you can't send signals to it, and the status codes
2401are different from other types of processes.
c73e02fa
GM
2402@code{delete-process} on the process object or @code{kill-buffer} on
2403the process buffer close the connection, but this does not affect the
2404device connected to the serial port.
2405
2406 The function @code{process-type} returns the symbol @code{serial}
80e6b6df 2407for a process object representing a serial port connection.
c73e02fa
GM
2408
2409 Serial ports are available on GNU/Linux, Unix, and Windows systems.
2410
80e6b6df 2411@deffn Command serial-term port speed
c73e02fa 2412Start a terminal-emulator for a serial port in a new buffer.
80e6b6df
EZ
2413@var{port} is the name of the serial port to which to connect. For
2414example, this could be @file{/dev/ttyS0} on Unix. On Windows, this
2415could be @file{COM1}, or @file{\\.\COM10} (double the backslashes in
2416Lisp strings).
c73e02fa
GM
2417
2418@var{speed} is the speed of the serial port in bits per second. 9600
80e6b6df
EZ
2419is a common value. The buffer is in Term mode; see @ref{Term Mode,,,
2420emacs, The GNU Emacs Manual}, for the commands to use in that buffer.
2421You can change the speed and the configuration in the mode line menu.
2422@end deffn
c73e02fa
GM
2423
2424@defun make-serial-process &rest args
80e6b6df
EZ
2425This function creates a process and a buffer. Arguments are specified
2426as keyword/argument pairs. Here's the list of the meaningful keywords:
c73e02fa
GM
2427
2428@table @code
80e6b6df
EZ
2429@item :port @var{port}@r{ (mandatory)}
2430This is the name of the serial port. On Unix and GNU systems, this is
2431a file name such as @file{/dev/ttyS0}. On Windows, this could be
2432@file{COM1}, or @file{\\.\COM10} for ports higher than @file{COM9}
2433(double the backslashes in Lisp strings).
2434
2435@item :speed @var{speed}@r{ (mandatory)}
2436The speed of the serial port in bits per second. This function calls
2437@code{serial-process-configure} to handle the speed.
2438
2439@item :name @var{name}
2440The name of the process. If @var{name} is not given, @var{port} will
2441serve as the process name as well.
2442
2443@item :buffer @var{buffer}
2444The buffer to associate with the process. The value could be either a
2445buffer or a string that names a buffer. Process output goes at the
2446end of that buffer, unless you specify an output stream or filter
2447function to handle the output. If @var{buffer} is not given, the
2448process buffer's name is taken from the value of the @code{:name}
2449keyword.
2450
2451@item :coding @var{coding}
c73e02fa
GM
2452If @var{coding} is a symbol, it specifies the coding system used for
2453both reading and writing for this process. If @var{coding} is a cons
2454@code{(decoding . encoding)}, @var{decoding} is used for reading, and
80e6b6df
EZ
2455@var{encoding} is used for writing. If not specified, the default is
2456to determine the coding systems from data itself.
c73e02fa 2457
80e6b6df
EZ
2458@item :noquery @var{query-flag}
2459Initialize the process query flag to @var{query-flag}. @xref{Query
2460Before Exit}. The flags defaults to @code{nil} if unspecified.
c73e02fa 2461
80e6b6df 2462@item :stop @var{bool}
c73e02fa
GM
2463Start process in the @code{stopped} state if @var{bool} is
2464non-@code{nil}. In the stopped state, a serial process does not
2465accept incoming data, but you can send outgoing data. The stopped
2466state is cleared by @code{continue-process} and set by
2467@code{stop-process}.
2468
80e6b6df 2469@item :filter @var{filter}
c73e02fa
GM
2470Install @var{filter} as the process filter.
2471
80e6b6df 2472@item :sentinel @var{sentinel}
c73e02fa
GM
2473Install @var{sentinel} as the process sentinel.
2474
80e6b6df 2475@item :plist @var{plist}
c73e02fa
GM
2476Install @var{plist} as the initial plist of the process.
2477
2478@item :speed
2479@itemx :bytesize
2480@itemx :parity
2481@itemx :stopbits
2482@itemx :flowcontrol
2483These arguments are handled by @code{serial-process-configure}, which
2484is called by @code{make-serial-process}.
2485@end table
2486
2487The original argument list, possibly modified by later configuration,
2488is available via the function @code{process-contact}.
2489
2490Examples:
2491
2492@example
2493(make-serial-process :port "/dev/ttyS0" :speed 9600)
2494
2495(make-serial-process :port "COM1" :speed 115200 :stopbits 2)
2496
80e6b6df
EZ
2497(make-serial-process :port "\\\\.\\COM13" :speed 1200
2498 :bytesize 7 :parity 'odd)
c73e02fa 2499
57eedec1
GM
2500(make-serial-process :port "/dev/tty.BlueConsole-SPP-1"
2501 :speed nil)
c73e02fa
GM
2502@end example
2503@end defun
2504
2505@defun serial-process-configure &rest args
80e6b6df
EZ
2506@cindex baud, in serial connections
2507@cindex bytesize, in serial connections
2508@cindex parity, in serial connections
2509@cindex stopbits, in serial connections
2510@cindex flowcontrol, in serial connections
2511
2512This functions configures a serial port connection. Arguments are
2513specified as keyword/argument pairs. Attributes that are not given
2514are re-initialized from the process's current configuration (available
2515via the function @code{process-contact}) or set to reasonable default
2516values. The following arguments are defined:
c73e02fa
GM
2517
2518@table @code
80e6b6df
EZ
2519@item :process @var{process}
2520@itemx :name @var{name}
2521@itemx :buffer @var{buffer}
2522@itemx :port @var{port}
c73e02fa
GM
2523Any of these arguments can be given to identify the process that is to
2524be configured. If none of these arguments is given, the current
2525buffer's process is used.
2526
2527@item :speed @var{speed}
545c2782
EZ
2528The speed of the serial port in bits per second, a.k.a.@: @dfn{baud
2529rate}. The value can be any number, but most serial ports work only
2530at a few defined values between 1200 and 115200, with 9600 being the
2531most common value. If @var{speed} is @code{nil}, the function ignores
2532all other arguments and does not configure the port. This may be
2533useful for special serial ports such as Bluetooth-to-serial converters
2534which can only be configured through AT commands sent through the
538395d9
EZ
2535connection. The value of @code{nil} for @var{speed} is valid only for
2536connections that were already opened by a previous call to
80e6b6df 2537@code{make-serial-process} or @code{serial-term}.
c73e02fa
GM
2538
2539@item :bytesize @var{bytesize}
80e6b6df
EZ
2540The number of bits per byte, which can be 7 or 8. If @var{bytesize}
2541is not given or @code{nil}, it defaults to 8.
c73e02fa
GM
2542
2543@item :parity @var{parity}
80e6b6df 2544The value can be @code{nil} (don't use parity), the symbol
c73e02fa 2545@code{odd} (use odd parity), or the symbol @code{even} (use even
80e6b6df 2546parity). If @var{parity} is not given, it defaults to no parity.
c73e02fa
GM
2547
2548@item :stopbits @var{stopbits}
80e6b6df
EZ
2549The number of stopbits used to terminate a transmission
2550of each byte. @var{stopbits} can be 1 or 2. If @var{stopbits} is not
2551given or @code{nil}, it defaults to 1.
c73e02fa
GM
2552
2553@item :flowcontrol @var{flowcontrol}
80e6b6df
EZ
2554The type of flow control to use for this connection, which is either
2555@code{nil} (don't use flow control), the symbol @code{hw} (use RTS/CTS
2556hardware flow control), or the symbol @code{sw} (use XON/XOFF software
2557flow control). If @var{flowcontrol} is not given, it defaults to no
2558flow control.
c73e02fa
GM
2559@end table
2560
2561@code{serial-process-configure} is called by @code{make-serial-process} for the
2562initial configuration of the serial port.
2563
2564Examples:
2565
2566@example
2567(serial-process-configure :process "/dev/ttyS0" :speed 1200)
2568
57eedec1
GM
2569(serial-process-configure :buffer "COM1" :stopbits 1
2570 :parity 'odd :flowcontrol 'hw)
c73e02fa
GM
2571
2572(serial-process-configure :port "\\\\.\\COM13" :bytesize 7)
2573@end example
2574@end defun
2575
b8d4c8d0
GM
2576@node Byte Packing
2577@section Packing and Unpacking Byte Arrays
2578@cindex byte packing and unpacking
2579
2580 This section describes how to pack and unpack arrays of bytes,
2581usually for binary network protocols. These functions convert byte arrays
2582to alists, and vice versa. The byte array can be represented as a
2583unibyte string or as a vector of integers, while the alist associates
2584symbols either with fixed-size objects or with recursive sub-alists.
2585
2586@cindex serializing
2587@cindex deserializing
2588@cindex packing
2589@cindex unpacking
2590 Conversion from byte arrays to nested alists is also known as
2591@dfn{deserializing} or @dfn{unpacking}, while going in the opposite
2592direction is also known as @dfn{serializing} or @dfn{packing}.
2593
2594@menu
2595* Bindat Spec:: Describing data layout.
2596* Bindat Functions:: Doing the unpacking and packing.
2597* Bindat Examples:: Samples of what bindat.el can do for you!
2598@end menu
2599
2600@node Bindat Spec
2601@subsection Describing Data Layout
2602
2603 To control unpacking and packing, you write a @dfn{data layout
2604specification}, a special nested list describing named and typed
2605@dfn{fields}. This specification controls length of each field to be
2606processed, and how to pack or unpack it. We normally keep bindat specs
2607in variables whose names end in @samp{-bindat-spec}; that kind of name
2608is automatically recognized as ``risky.''
2609
2610@cindex endianness
2611@cindex big endian
2612@cindex little endian
2613@cindex network byte ordering
2614 A field's @dfn{type} describes the size (in bytes) of the object
2615that the field represents and, in the case of multibyte fields, how
2616the bytes are ordered within the field. The two possible orderings
2617are ``big endian'' (also known as ``network byte ordering'') and
2618``little endian.'' For instance, the number @code{#x23cd} (decimal
26199165) in big endian would be the two bytes @code{#x23} @code{#xcd};
2620and in little endian, @code{#xcd} @code{#x23}. Here are the possible
2621type values:
2622
2623@table @code
2624@item u8
2625@itemx byte
2626Unsigned byte, with length 1.
2627
2628@item u16
2629@itemx word
2630@itemx short
2631Unsigned integer in network byte order, with length 2.
2632
2633@item u24
2634Unsigned integer in network byte order, with length 3.
2635
2636@item u32
2637@itemx dword
2638@itemx long
2639Unsigned integer in network byte order, with length 4.
2640Note: These values may be limited by Emacs' integer implementation limits.
2641
2642@item u16r
2643@itemx u24r
2644@itemx u32r
2645Unsigned integer in little endian order, with length 2, 3 and 4, respectively.
2646
2647@item str @var{len}
2648String of length @var{len}.
2649
2650@item strz @var{len}
2651Zero-terminated string, in a fixed-size field with length @var{len}.
2652
2653@item vec @var{len} [@var{type}]
2654Vector of @var{len} elements of type @var{type}, or bytes if not
2655@var{type} is specified.
2656The @var{type} is any of the simple types above, or another vector
2657specified as a list @code{(vec @var{len} [@var{type}])}.
2658
2659@item ip
2660Four-byte vector representing an Internet address. For example:
2661@code{[127 0 0 1]} for localhost.
2662
2663@item bits @var{len}
2664List of set bits in @var{len} bytes. The bytes are taken in big
2665endian order and the bits are numbered starting with @code{8 *
2666@var{len} @minus{} 1} and ending with zero. For example: @code{bits
26672} unpacks @code{#x28} @code{#x1c} to @code{(2 3 4 11 13)} and
2668@code{#x1c} @code{#x28} to @code{(3 5 10 11 12)}.
2669
2670@item (eval @var{form})
2671@var{form} is a Lisp expression evaluated at the moment the field is
2672unpacked or packed. The result of the evaluation should be one of the
2673above-listed type specifications.
2674@end table
2675
2676For a fixed-size field, the length @var{len} is given as an integer
2677specifying the number of bytes in the field.
2678
2679When the length of a field is not fixed, it typically depends on the
2680value of a preceding field. In this case, the length @var{len} can be
2681given either as a list @code{(@var{name} ...)} identifying a
2682@dfn{field name} in the format specified for @code{bindat-get-field}
2683below, or by an expression @code{(eval @var{form})} where @var{form}
2684should evaluate to an integer, specifying the field length.
2685
2686A field specification generally has the form @code{([@var{name}]
2687@var{handler})}. The square braces indicate that @var{name} is
2688optional. (Don't use names that are symbols meaningful as type
2689specifications (above) or handler specifications (below), since that
2690would be ambiguous.) @var{name} can be a symbol or the expression
2691@code{(eval @var{form})}, in which case @var{form} should evaluate to
2692a symbol.
2693
2694@var{handler} describes how to unpack or pack the field and can be one
2695of the following:
2696
2697@table @code
2698@item @var{type}
2699Unpack/pack this field according to the type specification @var{type}.
2700
2701@item eval @var{form}
2702Evaluate @var{form}, a Lisp expression, for side-effect only. If the
2703field name is specified, the value is bound to that field name.
2704
2705@item fill @var{len}
2706Skip @var{len} bytes. In packing, this leaves them unchanged,
2707which normally means they remain zero. In unpacking, this means
2708they are ignored.
2709
2710@item align @var{len}
2711Skip to the next multiple of @var{len} bytes.
2712
2713@item struct @var{spec-name}
2714Process @var{spec-name} as a sub-specification. This describes a
2715structure nested within another structure.
2716
2717@item union @var{form} (@var{tag} @var{spec})@dots{}
2718@c ??? I don't see how one would actually use this.
2719@c ??? what kind of expression would be useful for @var{form}?
2720Evaluate @var{form}, a Lisp expression, find the first @var{tag}
2721that matches it, and process its associated data layout specification
2722@var{spec}. Matching can occur in one of three ways:
2723
2724@itemize
2725@item
2726If a @var{tag} has the form @code{(eval @var{expr})}, evaluate
2727@var{expr} with the variable @code{tag} dynamically bound to the value
2728of @var{form}. A non-@code{nil} result indicates a match.
2729
2730@item
2731@var{tag} matches if it is @code{equal} to the value of @var{form}.
2732
2733@item
2734@var{tag} matches unconditionally if it is @code{t}.
2735@end itemize
2736
2737@item repeat @var{count} @var{field-specs}@dots{}
2738Process the @var{field-specs} recursively, in order, then repeat
2739starting from the first one, processing all the specs @var{count}
2740times overall. The @var{count} is given using the same formats as a
2741field length---if an @code{eval} form is used, it is evaluated just once.
2742For correct operation, each spec in @var{field-specs} must include a name.
2743@end table
2744
2745For the @code{(eval @var{form})} forms used in a bindat specification,
2746the @var{form} can access and update these dynamically bound variables
2747during evaluation:
2748
2749@table @code
2750@item last
2751Value of the last field processed.
2752
2753@item bindat-raw
2754The data as a byte array.
2755
2756@item bindat-idx
2757Current index (within @code{bindat-raw}) for unpacking or packing.
2758
2759@item struct
2760The alist containing the structured data that have been unpacked so
2761far, or the entire structure being packed. You can use
2762@code{bindat-get-field} to access specific fields of this structure.
2763
2764@item count
2765@itemx index
2766Inside a @code{repeat} block, these contain the maximum number of
2767repetitions (as specified by the @var{count} parameter), and the
2768current repetition number (counting from 0). Setting @code{count} to
2769zero will terminate the inner-most repeat block after the current
2770repetition has completed.
2771@end table
2772
2773@node Bindat Functions
2774@subsection Functions to Unpack and Pack Bytes
2775
2776 In the following documentation, @var{spec} refers to a data layout
2777specification, @code{bindat-raw} to a byte array, and @var{struct} to an
2778alist representing unpacked field data.
2779
2780@defun bindat-unpack spec bindat-raw &optional bindat-idx
2781This function unpacks data from the unibyte string or byte
2782array @code{bindat-raw}
2783according to @var{spec}. Normally this starts unpacking at the
2784beginning of the byte array, but if @var{bindat-idx} is non-@code{nil}, it
2785specifies a zero-based starting position to use instead.
2786
2787The value is an alist or nested alist in which each element describes
2788one unpacked field.
2789@end defun
2790
2791@defun bindat-get-field struct &rest name
2792This function selects a field's data from the nested alist
2793@var{struct}. Usually @var{struct} was returned by
2794@code{bindat-unpack}. If @var{name} corresponds to just one argument,
2795that means to extract a top-level field value. Multiple @var{name}
2796arguments specify repeated lookup of sub-structures. An integer name
2797acts as an array index.
2798
2799For example, if @var{name} is @code{(a b 2 c)}, that means to find
2800field @code{c} in the third element of subfield @code{b} of field
2801@code{a}. (This corresponds to @code{struct.a.b[2].c} in C.)
2802@end defun
2803
2804 Although packing and unpacking operations change the organization of
2805data (in memory), they preserve the data's @dfn{total length}, which is
2806the sum of all the fields' lengths, in bytes. This value is not
2807generally inherent in either the specification or alist alone; instead,
2808both pieces of information contribute to its calculation. Likewise, the
2809length of a string or array being unpacked may be longer than the data's
2810total length as described by the specification.
2811
2812@defun bindat-length spec struct
2813This function returns the total length of the data in @var{struct},
2814according to @var{spec}.
2815@end defun
2816
2817@defun bindat-pack spec struct &optional bindat-raw bindat-idx
2818This function returns a byte array packed according to @var{spec} from
2819the data in the alist @var{struct}. Normally it creates and fills a
2820new byte array starting at the beginning. However, if @var{bindat-raw}
2821is non-@code{nil}, it specifies a pre-allocated unibyte string or vector to
2822pack into. If @var{bindat-idx} is non-@code{nil}, it specifies the starting
2823offset for packing into @code{bindat-raw}.
2824
2825When pre-allocating, you should make sure @code{(length @var{bindat-raw})}
2826meets or exceeds the total length to avoid an out-of-range error.
2827@end defun
2828
2829@defun bindat-ip-to-string ip
2830Convert the Internet address vector @var{ip} to a string in the usual
2831dotted notation.
2832
2833@example
2834(bindat-ip-to-string [127 0 0 1])
2835 @result{} "127.0.0.1"
2836@end example
2837@end defun
2838
2839@node Bindat Examples
2840@subsection Examples of Byte Unpacking and Packing
2841
2842 Here is a complete example of byte unpacking and packing:
2843
2844@lisp
2845(defvar fcookie-index-spec
2846 '((:version u32)
2847 (:count u32)
2848 (:longest u32)
2849 (:shortest u32)
2850 (:flags u32)
2851 (:delim u8)
2852 (:ignored fill 3)
2853 (:offset repeat (:count)
2854 (:foo u32)))
2855 "Description of a fortune cookie index file's contents.")
2856
2857(defun fcookie (cookies &optional index)
2858 "Display a random fortune cookie from file COOKIES.
2859Optional second arg INDEX specifies the associated index
2860filename, which is by default constructed by appending
2861\".dat\" to COOKIES. Display cookie text in possibly
2862new buffer \"*Fortune Cookie: BASENAME*\" where BASENAME
2863is COOKIES without the directory part."
2864 (interactive "fCookies file: ")
2865 (let* ((info (with-temp-buffer
2866 (insert-file-contents-literally
2867 (or index (concat cookies ".dat")))
2868 (bindat-unpack fcookie-index-spec
2869 (buffer-string))))
2870 (sel (random (bindat-get-field info :count)))
2871 (beg (cdar (bindat-get-field info :offset sel)))
2872 (end (or (cdar (bindat-get-field info
2873 :offset (1+ sel)))
2874 (nth 7 (file-attributes cookies)))))
2875 (switch-to-buffer
2876 (get-buffer-create
2877 (format "*Fortune Cookie: %s*"
2878 (file-name-nondirectory cookies))))
2879 (erase-buffer)
2880 (insert-file-contents-literally
2881 cookies nil beg (- end 3))))
2882
2883(defun fcookie-create-index (cookies &optional index delim)
2884 "Scan file COOKIES, and write out its index file.
2885Optional second arg INDEX specifies the index filename,
2886which is by default constructed by appending \".dat\" to
2887COOKIES. Optional third arg DELIM specifies the unibyte
2888character which, when found on a line of its own in
2889COOKIES, indicates the border between entries."
2890 (interactive "fCookies file: ")
2891 (setq delim (or delim ?%))
2892 (let ((delim-line (format "\n%c\n" delim))
2893 (count 0)
2894 (max 0)
2895 min p q len offsets)
2896 (unless (= 3 (string-bytes delim-line))
2897 (error "Delimiter cannot be represented in one byte"))
2898 (with-temp-buffer
2899 (insert-file-contents-literally cookies)
2900 (while (and (setq p (point))
2901 (search-forward delim-line (point-max) t)
2902 (setq len (- (point) 3 p)))
2903 (setq count (1+ count)
2904 max (max max len)
2905 min (min (or min max) len)
2906 offsets (cons (1- p) offsets))))
2907 (with-temp-buffer
2908 (set-buffer-multibyte nil)
2909 (insert
2910 (bindat-pack
2911 fcookie-index-spec
2912 `((:version . 2)
2913 (:count . ,count)
2914 (:longest . ,max)
2915 (:shortest . ,min)
2916 (:flags . 0)
2917 (:delim . ,delim)
2918 (:offset . ,(mapcar (lambda (o)
2919 (list (cons :foo o)))
2920 (nreverse offsets))))))
2921 (let ((coding-system-for-write 'raw-text-unix))
2922 (write-file (or index (concat cookies ".dat")))))))
2923@end lisp
2924
2925Following is an example of defining and unpacking a complex structure.
2926Consider the following C structures:
2927
2928@example
2929struct header @{
2930 unsigned long dest_ip;
2931 unsigned long src_ip;
2932 unsigned short dest_port;
2933 unsigned short src_port;
2934@};
2935
2936struct data @{
2937 unsigned char type;
2938 unsigned char opcode;
2939 unsigned short length; /* In network byte order */
2940 unsigned char id[8]; /* null-terminated string */
2941 unsigned char data[/* (length + 3) & ~3 */];
2942@};
2943
2944struct packet @{
2945 struct header header;
2946 unsigned long counters[2]; /* In little endian order */
2947 unsigned char items;
2948 unsigned char filler[3];
2949 struct data item[/* items */];
2950
2951@};
2952@end example
2953
2954The corresponding data layout specification:
2955
2956@lisp
2957(setq header-spec
2958 '((dest-ip ip)
2959 (src-ip ip)
2960 (dest-port u16)
2961 (src-port u16)))
2962
2963(setq data-spec
2964 '((type u8)
2965 (opcode u8)
2966 (length u16) ;; network byte order
2967 (id strz 8)
2968 (data vec (length))
2969 (align 4)))
2970
2971(setq packet-spec
2972 '((header struct header-spec)
2973 (counters vec 2 u32r) ;; little endian order
2974 (items u8)
2975 (fill 3)
2976 (item repeat (items)
2977 (struct data-spec))))
2978@end lisp
2979
2980A binary data representation:
2981
2982@lisp
2983(setq binary-data
2984 [ 192 168 1 100 192 168 1 101 01 28 21 32
2985 160 134 1 0 5 1 0 0 2 0 0 0
2986 2 3 0 5 ?A ?B ?C ?D ?E ?F 0 0 1 2 3 4 5 0 0 0
2987 1 4 0 7 ?B ?C ?D ?E ?F ?G 0 0 6 7 8 9 10 11 12 0 ])
2988@end lisp
2989
2990The corresponding decoded structure:
2991
2992@lisp
2993(setq decoded (bindat-unpack packet-spec binary-data))
2994 @result{}
2995((header
2996 (dest-ip . [192 168 1 100])
2997 (src-ip . [192 168 1 101])
2998 (dest-port . 284)
2999 (src-port . 5408))
3000 (counters . [100000 261])
3001 (items . 2)
3002 (item ((data . [1 2 3 4 5])
3003 (id . "ABCDEF")
3004 (length . 5)
3005 (opcode . 3)
3006 (type . 2))
3007 ((data . [6 7 8 9 10 11 12])
3008 (id . "BCDEFG")
3009 (length . 7)
3010 (opcode . 4)
3011 (type . 1))))
3012@end lisp
3013
3014Fetching data from this structure:
3015
3016@lisp
3017(bindat-get-field decoded 'item 1 'id)
3018 @result{} "BCDEFG"
3019@end lisp