Import Upstream version 20180207
[hcoop/debian/mlton.git] / doc / guide / localhost / AST
CommitLineData
7f918cf1
CE
1<!DOCTYPE html>\r
2<html lang="en">\r
3<head>\r
4<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">\r
5<meta name="generator" content="AsciiDoc 8.6.9">\r
6<title>AST</title>\r
7<link rel="stylesheet" href="./asciidoc.css" type="text/css">\r
8<link rel="stylesheet" href="./pygments.css" type="text/css">\r
9\r
10\r
11<script type="text/javascript" src="./asciidoc.js"></script>\r
12<script type="text/javascript">\r
13/*<![CDATA[*/\r
14asciidoc.install();\r
15/*]]>*/\r
16</script>\r
17<link rel="stylesheet" href="./mlton.css" type="text/css">\r
18</head>\r
19<body class="article">\r
20<div id="banner">\r
21<div id="banner-home">\r
22<a href="./Home">MLton 20180207</a>\r
23</div>\r
24</div>\r
25<div id="header">\r
26<h1>AST</h1>\r
27</div>\r
28<div id="content">\r
29<div id="preamble">\r
30<div class="sectionbody">\r
31<div class="paragraph"><p><a href="AST">AST</a> is the <a href="IntermediateLanguage">IntermediateLanguage</a> produced by the <a href="FrontEnd">FrontEnd</a>\r
32and translated by <a href="Elaborate">Elaborate</a> to <a href="CoreML">CoreML</a>.</p></div>\r
33</div>\r
34</div>\r
35<div class="sect1">\r
36<h2 id="_description">Description</h2>\r
37<div class="sectionbody">\r
38<div class="paragraph"><p>The abstract syntax tree produced by the <a href="FrontEnd">FrontEnd</a>.</p></div>\r
39</div>\r
40</div>\r
41<div class="sect1">\r
42<h2 id="_implementation">Implementation</h2>\r
43<div class="sectionbody">\r
44<div class="ulist"><ul>\r
45<li>\r
46<p>\r
47<a href="https://github.com/MLton/mlton/blob/master/mlton/ast/ast-programs.sig"><span class="monospaced">ast-programs.sig</span></a>\r
48</p>\r
49</li>\r
50<li>\r
51<p>\r
52<a href="https://github.com/MLton/mlton/blob/master/mlton/ast/ast-programs.fun"><span class="monospaced">ast-programs.fun</span></a>\r
53</p>\r
54</li>\r
55<li>\r
56<p>\r
57<a href="https://github.com/MLton/mlton/blob/master/mlton/ast/ast-modules.sig"><span class="monospaced">ast-modules.sig</span></a>\r
58</p>\r
59</li>\r
60<li>\r
61<p>\r
62<a href="https://github.com/MLton/mlton/blob/master/mlton/ast/ast-modules.fun"><span class="monospaced">ast-modules.fun</span></a>\r
63</p>\r
64</li>\r
65<li>\r
66<p>\r
67<a href="https://github.com/MLton/mlton/blob/master/mlton/ast/ast-core.sig"><span class="monospaced">ast-core.sig</span></a>\r
68</p>\r
69</li>\r
70<li>\r
71<p>\r
72<a href="https://github.com/MLton/mlton/blob/master/mlton/ast/ast-core.fun"><span class="monospaced">ast-core.fun</span></a>\r
73</p>\r
74</li>\r
75<li>\r
76<p>\r
77<a href="https://github.com/MLton/mlton/tree/master/mlton/ast"><span class="monospaced">ast</span></a>\r
78</p>\r
79</li>\r
80</ul></div>\r
81</div>\r
82</div>\r
83<div class="sect1">\r
84<h2 id="_type_checking">Type Checking</h2>\r
85<div class="sectionbody">\r
86<div class="paragraph"><p>The <a href="AST">AST</a> <a href="IntermediateLanguage">IntermediateLanguage</a> has no independent type\r
87checker. Type inference is performed on an AST program as part of\r
88<a href="Elaborate">Elaborate</a>.</p></div>\r
89</div>\r
90</div>\r
91<div class="sect1">\r
92<h2 id="_details_and_notes">Details and Notes</h2>\r
93<div class="sectionbody">\r
94<div class="sect2">\r
95<h3 id="_source_locations">Source locations</h3>\r
96<div class="paragraph"><p>MLton makes use of a relatively clean method for annotating the\r
97abstract syntax tree with source location information. Every source\r
98program phrase is "wrapped" with the <span class="monospaced">WRAPPED</span> interface:</p></div>\r
99<div class="listingblock">\r
100<div class="content"><div class="highlight"><pre><span class="k">signature</span><span class="w"> </span><span class="n">WRAPPED</span><span class="w"> </span><span class="p">=</span><span class="w"></span>\r
101<span class="w"> </span><span class="k">sig</span><span class="w"></span>\r
102<span class="w"> </span><span class="k">type</span><span class="w"> </span><span class="n">node&#39;</span><span class="w"></span>\r
103<span class="w"> </span><span class="k">type</span><span class="w"> </span><span class="n">obj</span><span class="w"></span>\r
104\r
105<span class="w"> </span><span class="k">val</span><span class="w"> </span><span class="n">dest</span><span class="p">:</span><span class="w"> </span><span class="n">obj</span><span class="w"> </span><span class="p">-&gt;</span><span class="w"> </span><span class="n">node&#39;</span><span class="w"> </span><span class="n">*</span><span class="w"> </span><span class="n">Region</span><span class="p">.</span><span class="n">t</span><span class="w"></span>\r
106<span class="w"> </span><span class="k">val</span><span class="w"> </span><span class="n">makeRegion&#39;</span><span class="p">:</span><span class="w"> </span><span class="n">node&#39;</span><span class="w"> </span><span class="n">*</span><span class="w"> </span><span class="n">SourcePos</span><span class="p">.</span><span class="n">t</span><span class="w"> </span><span class="n">*</span><span class="w"> </span><span class="n">SourcePos</span><span class="p">.</span><span class="n">t</span><span class="w"> </span><span class="p">-&gt;</span><span class="w"> </span><span class="n">obj</span><span class="w"></span>\r
107<span class="w"> </span><span class="k">val</span><span class="w"> </span><span class="n">makeRegion</span><span class="p">:</span><span class="w"> </span><span class="n">node&#39;</span><span class="w"> </span><span class="n">*</span><span class="w"> </span><span class="n">Region</span><span class="p">.</span><span class="n">t</span><span class="w"> </span><span class="p">-&gt;</span><span class="w"> </span><span class="n">obj</span><span class="w"></span>\r
108<span class="w"> </span><span class="k">val</span><span class="w"> </span><span class="n">node</span><span class="p">:</span><span class="w"> </span><span class="n">obj</span><span class="w"> </span><span class="p">-&gt;</span><span class="w"> </span><span class="n">node&#39;</span><span class="w"></span>\r
109<span class="w"> </span><span class="k">val</span><span class="w"> </span><span class="n">region</span><span class="p">:</span><span class="w"> </span><span class="n">obj</span><span class="w"> </span><span class="p">-&gt;</span><span class="w"> </span><span class="n">Region</span><span class="p">.</span><span class="n">t</span><span class="w"></span>\r
110<span class="w"> </span><span class="k">end</span><span class="w"></span>\r
111</pre></div></div></div>\r
112<div class="paragraph"><p>The key idea is that <span class="monospaced">node'</span> is the type of an unannotated syntax\r
113phrase and <span class="monospaced">obj</span> is the type of its annotated counterpart. In the\r
114implementation, every <span class="monospaced">node'</span> is annotated with a <span class="monospaced">Region.t</span>\r
115(<a href="https://github.com/MLton/mlton/blob/master/mlton/control/region.sig"><span class="monospaced">region.sig</span></a>,\r
116<a href="https://github.com/MLton/mlton/blob/master/mlton/control/region.sml"><span class="monospaced">region.sml</span></a>), which describes the\r
117syntax phrase&#8217;s left source position and right source position, where\r
118<span class="monospaced">SourcePos.t</span> (<a href="https://github.com/MLton/mlton/blob/master/mlton/control/source-pos.sig"><span class="monospaced">source-pos.sig</span></a>,\r
119<a href="https://github.com/MLton/mlton/blob/master/mlton/control/source-pos.sml"><span class="monospaced">source-pos.sml</span></a>) denotes a\r
120particular file, line, and column. A typical use of the <span class="monospaced">WRAPPED</span>\r
121interface is illustrated by the following code:</p></div>\r
122<div class="listingblock">\r
123<div class="content"><div class="highlight"><pre><span class="w"> </span><span class="k">datatype</span><span class="w"> </span><span class="n">node</span><span class="w"> </span><span class="p">=</span><span class="w"></span>\r
124<span class="w"> </span><span class="n">App</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="n">Longcon</span><span class="p">.</span><span class="n">t</span><span class="w"> </span><span class="n">*</span><span class="w"> </span><span class="n">t</span><span class="w"></span>\r
125<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">Const</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="n">Const</span><span class="p">.</span><span class="n">t</span><span class="w"></span>\r
126<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">Constraint</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="n">t</span><span class="w"> </span><span class="n">*</span><span class="w"> </span><span class="n">Type</span><span class="p">.</span><span class="n">t</span><span class="w"></span>\r
127<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">FlatApp</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="n">t</span><span class="w"> </span><span class="n">vector</span><span class="w"></span>\r
128<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">Layered</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="p">{</span><span class="n">constraint</span><span class="p">:</span><span class="w"> </span><span class="n">Type</span><span class="p">.</span><span class="n">t</span><span class="w"> </span><span class="n">option</span><span class="p">,</span><span class="w"></span>\r
129<span class="w"> </span><span class="n">fixop</span><span class="p">:</span><span class="w"> </span><span class="n">Fixop</span><span class="p">.</span><span class="n">t</span><span class="p">,</span><span class="w"></span>\r
130<span class="w"> </span><span class="n">pat</span><span class="p">:</span><span class="w"> </span><span class="n">t</span><span class="p">,</span><span class="w"></span>\r
131<span class="w"> </span><span class="n">var</span><span class="p">:</span><span class="w"> </span><span class="n">Var</span><span class="p">.</span><span class="n">t</span><span class="p">}</span><span class="w"></span>\r
132<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">List</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="n">t</span><span class="w"> </span><span class="n">vector</span><span class="w"></span>\r
133<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">Paren</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="n">t</span><span class="w"></span>\r
134<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">Or</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="n">t</span><span class="w"> </span><span class="n">vector</span><span class="w"></span>\r
135<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">Record</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="p">{</span><span class="n">flexible</span><span class="p">:</span><span class="w"> </span><span class="n">bool</span><span class="p">,</span><span class="w"></span>\r
136<span class="w"> </span><span class="n">items</span><span class="p">:</span><span class="w"> </span><span class="p">(</span><span class="n">Record</span><span class="p">.</span><span class="n">Field</span><span class="p">.</span><span class="n">t</span><span class="w"> </span><span class="n">*</span><span class="w"> </span><span class="n">Region</span><span class="p">.</span><span class="n">t</span><span class="w"> </span><span class="n">*</span><span class="w"> </span><span class="n">Item</span><span class="p">.</span><span class="n">t</span><span class="p">)</span><span class="w"> </span><span class="n">vector</span><span class="p">}</span><span class="w"></span>\r
137<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">Tuple</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="n">t</span><span class="w"> </span><span class="n">vector</span><span class="w"></span>\r
138<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">Var</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="p">{</span><span class="n">fixop</span><span class="p">:</span><span class="w"> </span><span class="n">Fixop</span><span class="p">.</span><span class="n">t</span><span class="p">,</span><span class="w"></span>\r
139<span class="w"> </span><span class="n">name</span><span class="p">:</span><span class="w"> </span><span class="n">Longvid</span><span class="p">.</span><span class="n">t</span><span class="p">}</span><span class="w"></span>\r
140<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">Vector</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="n">t</span><span class="w"> </span><span class="n">vector</span><span class="w"></span>\r
141<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">Wild</span><span class="w"></span>\r
142</pre></div></div></div>\r
143<div class="paragraph"><p>Thus, AST nodes are cleanly separated from source locations. By way\r
144of contrast, consider the approach taken by <a href="SMLNJ">SML/NJ</a> (and also\r
145by the <a href="CKitLibrary">CKit Library</a>). Each datatype denoting a syntax\r
146phrase dedicates a special constructor for annotating source\r
147locations:</p></div>\r
148<div class="listingblock">\r
149<div class="content"><div class="highlight"><pre><span class="k">datatype</span><span class="w"> </span><span class="n">pat</span><span class="w"> </span><span class="p">=</span><span class="w"> </span><span class="n">WildPat</span><span class="w"> </span><span class="cm">(* empty pattern *)</span><span class="w"></span>\r
150<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">AppPat</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="p">{</span><span class="n">constr</span><span class="p">:</span><span class="n">pat</span><span class="p">,</span><span class="n">argument</span><span class="p">:</span><span class="n">pat</span><span class="p">}</span><span class="w"> </span><span class="cm">(* application *)</span><span class="w"></span>\r
151<span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">MarkPat</span><span class="w"> </span><span class="k">of</span><span class="w"> </span><span class="n">pat</span><span class="w"> </span><span class="n">*</span><span class="w"> </span><span class="n">region</span><span class="w"> </span><span class="cm">(* mark a pattern *)</span><span class="w"></span>\r
152</pre></div></div></div>\r
153<div class="paragraph"><p>The main drawback of this approach is that static type checking is not\r
154sufficient to guarantee that the AST emitted from the front-end is\r
155properly annotated.</p></div>\r
156</div>\r
157</div>\r
158</div>\r
159</div>\r
160<div id="footnotes"><hr></div>\r
161<div id="footer">\r
162<div id="footer-text">\r
163</div>\r
164<div id="footer-badges">\r
165</div>\r
166</div>\r
167</body>\r
168</html>\r