Import Debian changes 20180207-1
[hcoop/debian/mlton.git] / doc / guide / localhost / ProfilingTheStack
1 <!DOCTYPE html>
2 <html lang="en">
3 <head>
4 <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
5 <meta name="generator" content="AsciiDoc 8.6.9">
6 <title>ProfilingTheStack</title>
7 <link rel="stylesheet" href="./asciidoc.css" type="text/css">
8 <link rel="stylesheet" href="./pygments.css" type="text/css">
9
10
11 <script type="text/javascript" src="./asciidoc.js"></script>
12 <script type="text/javascript">
13 /*<![CDATA[*/
14 asciidoc.install();
15 /*]]>*/
16 </script>
17 <link rel="stylesheet" href="./mlton.css" type="text/css">
18 </head>
19 <body class="article">
20 <div id="banner">
21 <div id="banner-home">
22 <a href="./Home">MLton 20180207</a>
23 </div>
24 </div>
25 <div id="header">
26 <h1>ProfilingTheStack</h1>
27 </div>
28 <div id="content">
29 <div id="preamble">
30 <div class="sectionbody">
31 <div class="paragraph"><p>For all forms of <a href="Profiling">Profiling</a>, you can gather counts for all
32 functions on the stack, not just the currently executing function. To
33 do so, compile your program with <span class="monospaced">-profile-stack true</span>. For example,
34 suppose that <span class="monospaced">list-rev.sml</span> contains the following.</p></div>
35 <div class="listingblock">
36 <div class="content"><div class="highlight"><pre><span class="k">fun</span><span class="w"> </span><span class="n">append</span><span class="w"> </span><span class="p">(</span><span class="n">l1</span><span class="p">,</span><span class="w"> </span><span class="n">l2</span><span class="p">)</span><span class="w"> </span><span class="p">=</span><span class="w"></span>
37 <span class="w"> </span><span class="k">case</span><span class="w"> </span><span class="n">l1</span><span class="w"> </span><span class="k">of</span><span class="w"></span>
38 <span class="w"> </span><span class="p">[]</span><span class="w"> </span><span class="p">=&gt;</span><span class="w"> </span><span class="n">l2</span><span class="w"></span>
39 <span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">x</span><span class="w"> </span><span class="n">::</span><span class="w"> </span><span class="n">l1</span><span class="w"> </span><span class="p">=&gt;</span><span class="w"> </span><span class="n">x</span><span class="w"> </span><span class="n">::</span><span class="w"> </span><span class="n">append</span><span class="w"> </span><span class="p">(</span><span class="n">l1</span><span class="p">,</span><span class="w"> </span><span class="n">l2</span><span class="p">)</span><span class="w"></span>
40
41 <span class="k">fun</span><span class="w"> </span><span class="n">rev</span><span class="w"> </span><span class="n">l</span><span class="w"> </span><span class="p">=</span><span class="w"></span>
42 <span class="w"> </span><span class="k">case</span><span class="w"> </span><span class="n">l</span><span class="w"> </span><span class="k">of</span><span class="w"></span>
43 <span class="w"> </span><span class="p">[]</span><span class="w"> </span><span class="p">=&gt;</span><span class="w"> </span><span class="p">[]</span><span class="w"></span>
44 <span class="w"> </span><span class="p">|</span><span class="w"> </span><span class="n">x</span><span class="w"> </span><span class="n">::</span><span class="w"> </span><span class="n">l</span><span class="w"> </span><span class="p">=&gt;</span><span class="w"> </span><span class="n">append</span><span class="w"> </span><span class="p">(</span><span class="n">rev</span><span class="w"> </span><span class="n">l</span><span class="p">,</span><span class="w"> </span><span class="p">[</span><span class="n">x</span><span class="p">])</span><span class="w"></span>
45
46 <span class="k">val</span><span class="w"> </span><span class="n">l</span><span class="w"> </span><span class="p">=</span><span class="w"> </span><span class="n">List</span><span class="p">.</span><span class="n">tabulate</span><span class="w"> </span><span class="p">(</span><span class="mi">1000</span><span class="p">,</span><span class="w"> </span><span class="k">fn</span><span class="w"> </span><span class="n">i</span><span class="w"> </span><span class="p">=&gt;</span><span class="w"> </span><span class="n">i</span><span class="p">)</span><span class="w"></span>
47 <span class="k">val</span><span class="w"> </span><span class="p">_</span><span class="w"> </span><span class="p">=</span><span class="w"> </span><span class="mi">1</span><span class="w"> </span><span class="n">+</span><span class="w"> </span><span class="n">hd</span><span class="w"> </span><span class="p">(</span><span class="n">rev</span><span class="w"> </span><span class="n">l</span><span class="p">)</span><span class="w"></span>
48 </pre></div></div></div>
49 <div class="paragraph"><p>Compile with stack profiling and then run the program.</p></div>
50 <div class="listingblock">
51 <div class="content monospaced">
52 <pre>% mlton -profile alloc -profile-stack true list-rev.sml
53 % ./list-rev</pre>
54 </div></div>
55 <div class="paragraph"><p>Display the profiling data.</p></div>
56 <div class="listingblock">
57 <div class="content monospaced">
58 <pre>% mlprof -show-line true list-rev mlmon.out
59 6,030,136 bytes allocated (108,336 bytes by GC)
60 function cur stack GC
61 ----------------------- ----- ----- ----
62 append list-rev.sml: 1 97.6% 97.6% 1.4%
63 &lt;gc&gt; 1.8% 0.0% 1.8%
64 &lt;main&gt; 0.4% 98.2% 1.8%
65 rev list-rev.sml: 6 0.2% 97.6% 1.8%</pre>
66 </div></div>
67 <div class="paragraph"><p>In the above table, we see that <span class="monospaced">rev</span>, defined on line 6 of
68 <span class="monospaced">list-rev.sml</span>, is only responsible for 0.2% of the allocation, but is
69 on the stack while 97.6% of the allocation is done by the user program
70 and while 1.8% of the allocation is done by the garbage collector.</p></div>
71 <div class="paragraph"><p>The run-time performance impact of <span class="monospaced">-profile-stack true</span> can be
72 noticeable since there is some extra bookkeeping at every nontail call
73 and return.</p></div>
74 </div>
75 </div>
76 </div>
77 <div id="footnotes"><hr></div>
78 <div id="footer">
79 <div id="footer-text">
80 </div>
81 <div id="footer-badges">
82 </div>
83 </div>
84 </body>
85 </html>