diff options
Diffstat (limited to 'lcc/doc/4.html')
| -rwxr-xr-x | lcc/doc/4.html | 754 | 
1 files changed, 754 insertions, 0 deletions
| diff --git a/lcc/doc/4.html b/lcc/doc/4.html new file mode 100755 index 0000000..0158d8b --- /dev/null +++ b/lcc/doc/4.html @@ -0,0 +1,754 @@ +<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
 +<html>
 +
 +<head>
 +<link HREF="mailto:drh@microsoft.com" REV="made" TITLE="David R. Hanson">
 +<title>The lcc 4.1 Code-Generation Interface</title>
 +</head>
 +
 +<body>
 +
 +<h1>The lcc 4.1 Code-Generation Interface</h1>
 +
 +<p ALIGN="LEFT"><strong><a HREF="http://www.research.microsoft.com/~cwfraser/">Christopher
 +W. Fraser</a> and <a HREF="http://www.research.microsoft.com/~drh/">David R. Hanson</a>, <a
 +HREF="http://www.research.microsoft.com/">Microsoft Research</a></strong></p>
 +
 +<h2>Contents</h2>
 +
 +<dir>
 +  <li><a HREF="#intro">Introduction</a> </li>
 +  <li><a HREF="#metrics">5.1 Type Metrics</a></li>
 +  <li><a HREF="#symbols">5.3 Symbols</a> </li>
 +  <li><a HREF="#operators">5.5 Dag Operators</a></li>
 +  <li><a HREF="#flags">5.6 Interface Flags</a></li>
 +  <li><a HREF="#definitions">5.8 Definitions</a></li>
 +  <li><a HREF="#constants">5.9 Constants</a></li>
 +  <li><a HREF="#upcalls">5.12 Upcalls</a></li>
 +</dir>
 +
 +<h2><a NAME="intro">Introduction</a></h2>
 +
 +<p>Version 4.1 is the latest release of <a
 +HREF="http://www.cs.princeton.edu/software/lcc/">lcc</a>, the ANSI C compiler described in
 +our book <cite>A Retargetable C Compiler: Design and Implementation</cite>
 +(Addison-Wesley, 1995, ISBN 0-8053-1670-1). This document summarizes the differences
 +between the 4.1 code-generation interface and the 3.x interface described in Chap. 5 of <cite>A
 +Retargetable C Compiler</cite>.</p>
 +
 +<p>Previous versions of lcc supported only three sizes of integers, two sizes of floats,
 +and insisted that pointers fit in unsigned integers (see Sec. 5.1 of <cite>A Retargetable
 +C Compiler</cite>). These assumptions simplified the compiler, and were suitable for
 +32-bit architectures. But on 64-bit architectures, such as the DEC ALPHA, it's natural to
 +have four sizes of integers and perhaps three sizes of floats, and on 16-bit
 +architectures, 32-bit pointers don't fit in unsigned integers. Also, the 3.x constaints
 +limited the use of lcc's back ends for other languages, such as Java.</p>
 +
 +<p>Version 4.x removes all of these restrictions: It supports any number of sizes for
 +integers and floats, and the size of pointers need not be related to the size of any of
 +the integer types. The major changes in the code-generation interface are: 
 +
 +<ul>
 +  <li>The number of type suffixes has been reduced to 6.</li>
 +  <li>Dag operators are composed of a generic operator, a type suffix, and a size.</li>
 +  <li>Unsigned variants of several operators have been added.</li>
 +  <li>Several interface functions have new signatures.</li>
 +</ul>
 +
 +<p>In addition, version 4.x is written in ANSI C and uses the standard I/O library and
 +other standard C functions.</p>
 +
 +<p>The sections below parallel the subsections of Chap. 5 of <cite>A Retargetable C
 +Compiler</cite> and summarize the differences between the 3.x and 4.x code-generation
 +interface. Unaffected subsections are omitted. Page citations refer to pages in <cite>A
 +Retargetable C Compiler</cite>.</p>
 +
 +<h2><a NAME="metrics">5.1 Type Metrics</a></h2>
 +
 +<p>There are now 10 metrics in an interface record:</p>
 +
 +<pre>Metrics charmetric;
 +Metrics shortmetric;
 +Metrics intmetric;
 +Metrics longmetric;
 +Metrics longlongmetric;
 +Metrics floatmetric;
 +Metrics doublemetric;
 +Metrics longdoublemetric;
 +Metrics ptrmetric;
 +Metrics structmetric;</pre>
 +
 +<p>Each of these specifies the size and alignment of the corresponding type. <code>ptrmetric</code>
 +describes all pointers.</p>
 +
 +<h2><a NAME="symbols">5.3 Symbols</a></h2>
 +
 +<p>The actual value of a constant is stored in the <code>u.c.v</code> field of a symbol,
 +which holds a <code>Value</code>:</p>
 +
 +<pre>typedef union value {
 +	long i;
 +	unsigned long u;
 +	long double d;
 +	void *p;
 +	void (*g)(void);
 +} Value;</pre>
 +
 +<p>The value is stored in the appropriate field according to its type, which is given by
 +the symbol's <code>type</code> field.</p>
 +
 +<h2><a NAME="operators">5.5 Dag Operators</a></h2>
 +
 +<p>The <code>op</code> field a of <code>node</code> structure holds a dag operator, which
 +consists of a generic operator, a type suffix, and a size indicator. The type suffixes
 +are:</p>
 +
 +<pre>enum {
 +	F=FLOAT,
 +	I=INT,
 +	U=UNSIGNED,
 +	P=POINTER,
 +	V=VOID,
 +	B=STRUCT
 +};
 +
 +#define sizeop(n) ((n)<<10)</pre>
 +
 +<p>Given a generic operator <code>o</code>, a type suffix <code>t</code>, and a size <code>s</code>,
 +a type- and size-specific operator is formed by <code>o+t+sizeop(s)</code>. For example, <code>ADD+F+sizeop(4)</code>
 +forms the operator <code>ADDF4</code>, which denotes the sum of two 4-byte floats.
 +Similarly, <code>ADD+F+sizeop(8)</code> forms <code>ADDF8</code>, which denotes 8-byte
 +floating addition. In the 3.x code-generation interface, <code>ADDF</code> and <code>ADDD</code>
 +denoted these operations. There was no size indicator in the 3.x operators because the
 +type suffix supplied both a type and a size.</p>
 +
 +<p>Table 5.1 lists each generic operator, its valid type suffixes, and the number of <code>kids</code>
 +and <code>syms</code> that it uses; multiple values for <code>kids</code> indicate
 +type-specific variants. The notations in the <strong>syms</strong> column give the number
 +of <code>syms</code> values and a one-letter code that suggests their uses: 1V indicates
 +that <code>syms[0]</code> points to a symbol for a variable, 1C indicates that <code>syms[0]</code>
 +is a constant, and 1L indicates that <code>syms[0]</code> is a label. For 1S, <code>syms[0]</code>
 +is a constant whose value is a size in bytes; 2S adds <code>syms[1]</code>, which is a
 +constant whose value is an alignment. For most operators, the type suffix and size
 +indicator denote the type and size of operation to perform and the type and size of the
 +result.</p>
 +
 +<table WIDTH="100%" BORDER="0" CELLPADDING="0" CELLSPACING="0">
 +  <tr>
 +    <td COLSPAN="6" ALIGN="CENTER"><strong>Table 5.1<img SRC="/~drh/resources/dot_clear.gif"
 +    ALT="|" WIDTH="18" HEIGHT="1">Node Operators.</strong></td>
 +  </tr>
 +  <tr>
 +    <td><strong>syms</strong></td>
 +    <td><strong>kids</strong></td>
 +    <td><strong>Operator</strong></td>
 +    <td><strong>Type Suffixes</strong></td>
 +    <td><strong>Sizes</strong></td>
 +    <td><strong>Operation</strong></td>
 +  </tr>
 +  <tr>
 +    <td>1V</td>
 +    <td>0</td>
 +    <td><code>ADDRF</code></td>
 +    <td><code>...P..</code></td>
 +    <td>p</td>
 +    <td>address of a parameter</td>
 +  </tr>
 +  <tr>
 +    <td>1V</td>
 +    <td>0</td>
 +    <td><code>ADDRG</code></td>
 +    <td><code>...P..</code></td>
 +    <td>p</td>
 +    <td>address of a global</td>
 +  </tr>
 +  <tr>
 +    <td>1V</td>
 +    <td>0</td>
 +    <td><code>ADDRL</code></td>
 +    <td><code>...P..</code></td>
 +    <td>p</td>
 +    <td>address of a local</td>
 +  </tr>
 +  <tr>
 +    <td>1C</td>
 +    <td>0</td>
 +    <td><code>CNST</code></td>
 +    <td><code>FIUP..</code></td>
 +    <td>fdx csilh p</td>
 +    <td>constant</td>
 +  </tr>
 +  <tr ALIGN="LEFT" VALIGN="TOP">
 +    <td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="1" HEIGHT="12"></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>1</td>
 +    <td><code>BCOM</code></td>
 +    <td><code>.IU...</code></td>
 +    <td>ilh</td>
 +    <td>bitwise complement</td>
 +  </tr>
 +  <tr>
 +    <td>1S</td>
 +    <td>1</td>
 +    <td><code>CVF</code></td>
 +    <td><code>FI....</code></td>
 +    <td>fdx ilh</td>
 +    <td>convert from float</td>
 +  </tr>
 +  <tr>
 +    <td>1S</td>
 +    <td>1</td>
 +    <td><code>CVI</code></td>
 +    <td><code>FIU...</code></td>
 +    <td>fdx csilh csilhp</td>
 +    <td>convert from signed integer</td>
 +  </tr>
 +  <tr>
 +    <td>1S</td>
 +    <td>1</td>
 +    <td><code>CVP</code></td>
 +    <td><code>..U..</code></td>
 +    <td>p</td>
 +    <td>convert from pointer</td>
 +  </tr>
 +  <tr>
 +    <td>1S</td>
 +    <td>1</td>
 +    <td><code>CVU</code></td>
 +    <td><code>.IUP..</code></td>
 +    <td>csilh p</td>
 +    <td>convert from unsigned integer</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>1</td>
 +    <td><code>INDIR</code></td>
 +    <td><code>FIUP.B</code></td>
 +    <td>fdx csilh p</td>
 +    <td>fetch</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>1</td>
 +    <td><code>NEG</code></td>
 +    <td><code>FI....</code></td>
 +    <td>fdx ilh</td>
 +    <td>negation</td>
 +  </tr>
 +  <tr>
 +    <td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="1" HEIGHT="12"></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>2</td>
 +    <td><code>ADD</code></td>
 +    <td><code>FIUP..</code></td>
 +    <td>fdx ilh ilhp p</td>
 +    <td>addition</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>2</td>
 +    <td><code>BAND</code></td>
 +    <td><code>.IU...</code></td>
 +    <td>ilh</td>
 +    <td>bitwise AND</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>2</td>
 +    <td><code>BOR</code></td>
 +    <td><code>.IU...</code></td>
 +    <td>ilh</td>
 +    <td>bitwise inclusive OR</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>2</td>
 +    <td><code>BXOR</code></td>
 +    <td><code>.IU...</code></td>
 +    <td>ilh</td>
 +    <td>bitwise exclusive OR</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>2</td>
 +    <td><code>DIV</code></td>
 +    <td><code>FIU...</code></td>
 +    <td>fdx ilh</td>
 +    <td>division</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>2</td>
 +    <td><code>LSH</code></td>
 +    <td><code>.IU...</code></td>
 +    <td>ilh</td>
 +    <td>left shift</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>2</td>
 +    <td><code>MOD</code></td>
 +    <td><code>.IU...</code></td>
 +    <td>ilh</td>
 +    <td>modulus</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>2</td>
 +    <td><code>MUL</code></td>
 +    <td><code>FIU...</code></td>
 +    <td>fdx ilh</td>
 +    <td>multiplication</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>2</td>
 +    <td><code>RSH</code></td>
 +    <td><code>.IU...</code></td>
 +    <td>ilh</td>
 +    <td>right shift</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>2</td>
 +    <td><code>SUB</code></td>
 +    <td><code>FIUP..</code></td>
 +    <td>fdx ilh ilhp p</td>
 +    <td>subtraction</td>
 +  </tr>
 +  <tr>
 +    <td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="1" HEIGHT="12"></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +  </tr>
 +  <tr>
 +    <td>2S</td>
 +    <td>2</td>
 +    <td><code>ASGN</code></td>
 +    <td><code>FIUP.B</code></td>
 +    <td>fdx csilh p</td>
 +    <td>assignment</td>
 +  </tr>
 +  <tr>
 +    <td>1L</td>
 +    <td>2</td>
 +    <td><code>EQ</code></td>
 +    <td><code>FIU...</code></td>
 +    <td>fdx ilh ilhp</td>
 +    <td>jump if equal</td>
 +  </tr>
 +  <tr>
 +    <td>1L</td>
 +    <td>2</td>
 +    <td><code>GE</code></td>
 +    <td><code>FIU...</code></td>
 +    <td>fdx ilh ilhp</td>
 +    <td>jump if greater than or equal</td>
 +  </tr>
 +  <tr>
 +    <td>1L</td>
 +    <td>2</td>
 +    <td><code>GT</code></td>
 +    <td><code>FIU...</code></td>
 +    <td>fdx ilh ilhp</td>
 +    <td>jump if greater than</td>
 +  </tr>
 +  <tr>
 +    <td>1L</td>
 +    <td>2</td>
 +    <td><code>LE</code></td>
 +    <td><code>FIU...</code></td>
 +    <td>fdx ilh ilhp</td>
 +    <td>jump if less than or equal</td>
 +  </tr>
 +  <tr>
 +    <td>1L</td>
 +    <td>2</td>
 +    <td><code>LT</code></td>
 +    <td><code>FIU...</code></td>
 +    <td>fdx ilh ilhp</td>
 +    <td>jump if less than</td>
 +  </tr>
 +  <tr>
 +    <td>1L</td>
 +    <td>2</td>
 +    <td><code>NE</code></td>
 +    <td><code>FIU...</code></td>
 +    <td>fdx ilh ilhp</td>
 +    <td>jump if not equal</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +  </tr>
 +  <tr>
 +    <td>2S</td>
 +    <td>1</td>
 +    <td><code>ARG</code></td>
 +    <td><code>FIUP.B</code></td>
 +    <td>fdx ilh p</td>
 +    <td>argument</td>
 +  </tr>
 +  <tr>
 +    <td>1</td>
 +    <td>1 or 2</td>
 +    <td><code>CALL</code></td>
 +    <td><code>FIUPVB</code></td>
 +    <td>fdx ilh p</td>
 +    <td>function call</td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>1</td>
 +    <td><code>RET</code></td>
 +    <td><code>FIUPV.</code></td>
 +    <td>fdx ilh p</td>
 +    <td>return from function</td>
 +  </tr>
 +  <tr>
 +    <td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="1" HEIGHT="12"></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +    <td></td>
 +  </tr>
 +  <tr>
 +    <td></td>
 +    <td>1</td>
 +    <td><code>JUMP</code></td>
 +    <td><code>....V.</code></td>
 +    <td></td>
 +    <td>unconditional jump</td>
 +  </tr>
 +  <tr>
 +    <td>1L</td>
 +    <td>0</td>
 +    <td><code>LABEL</code></td>
 +    <td><code>....V.</code></td>
 +    <td></td>
 +    <td>label definition</td>
 +  </tr>
 +</table>
 +
 +<p>The entries in the <strong>Sizes</strong> column indicate sizes of the operators that
 +back ends must implement. Letters denote the size of float (f), double (d), long double
 +(x), character (c), short integer (s), integer (i), long integer (l), "long
 +long" integer (h) , and pointer (p). These sizes are separated into sets for each
 +type suffix, except that a single set is used for both I and U when the set for I is
 +identical to the set for U.</p>
 +
 +<p>The actual values for the size indicators, fdxcsilhp, depend on the target. A
 +specification like <code>ADDF</code>f denotes the operator <code>ADD+F+sizeop(</code>f<code>)</code>,
 +where "f" is replaced by a target-dependent value, e.g., <code>ADDF4</code> and <code>ADDF8</code>.
 +For example, back ends must implement the following <code>CVI</code> and <code>MUL</code>
 +operators.</p>
 +
 +<blockquote>
 +  <p><code>CVIF</code>f <code>CVIF</code>d <code>CVIF</code>x<br>
 +  <code>CVII</code>c <code>CVII</code>s <code>CVII</code>i <code>CVII</code>l <code>CVII</code>h<br>
 +  <code>CVIU</code>c <code>CVIU</code>s <code>CVIU</code>i <code>CVIU</code>l <code>CVIU</code>h
 +  <code>CVIU</code>p<br>
 +  <br>
 +  <code>MULF</code>f <code>MULF</code>d <code>MULF</code>x<br>
 +  <code>MULI</code>i <code>MULI</code>l <code>MULI</code>h<br>
 +  <code>MULU</code>i <code>MULU</code>l <code>MULU</code>h</p>
 +</blockquote>
 +
 +<p>On most platforms, there are fewer than three sizes of floats and six sizes of
 +integers, and pointers are usually the same size as one of the integers. And lcc doesn't
 +support the "long long" type, so h is not currently used. So the set of
 +platform-specific operators is usually smaller than the list above suggests. For example,
 +the X86, SPARC, and MIPS back ends implement the following <code>CVI</code> and <code>MUL</code>
 +operators.</p>
 +
 +<blockquote>
 +  <p><code>CVIF</code>4 <code>CVIF</code>8<br>
 +  <code>CVII</code>1 <code>CVII</code>2 <code>CVII</code>4<br>
 +  <code>CVIU</code>1 <code>CVIU</code>2 <code>CVIU</code>4 <br>
 +  <br>
 +  <code>MULF</code>4 <code>MULF</code>8<br>
 +  <code>MULI</code>4<br>
 +  <code>MULU</code>4</p>
 +</blockquote>
 +
 +<p>The set of operators is thus target-dependent; for example, <code>ADDI8</code> appears
 +only if the target supports an 8-byte integer type. <a
 +HREF="ftp://ftp.cs.princeton.edu/pub/packages/lcc/contrib/ops.c"><code>ops.c</code></a> is
 +a program that, given a set of sizes, prints the required operators and their values,
 +e.g.,</p>
 +
 +<blockquote>
 +  <pre>% <em>ops c=1 s=2 i=4 l=4 h=4 f=4 d=8 x=8 p=4</em>
 +...
 + CVIF4=4225 CVIF8=8321
 + CVII1=1157 CVII2=2181 CVII4=4229
 + CVIU1=1158 CVIU2=2182 CVIU4=4230
 +...
 + MULF4=4561 MULF8=8657
 + MULI4=4565
 + MULU4=4566
 +...
 +131 operators</pre>
 +</blockquote>
 +
 +<p>The type suffix for a conversion operator denotes the type of the result and the size
 +indicator gives the size of the result. For example, <code>CVUI4</code> converts an
 +unsigned (<code>U</code>) to a 4-byte signed integer (<code>I4</code>). The <code>syms[0]</code>
 +field points to a symbol-table entry for a integer constant that gives the size of the
 +source operand. For example, if <code>syms[0]</code> in a <code>CVUI4</code> points to a
 +symbol-table entry for 2, the conversion widens a 2-byte unsigned integer to a 4-byte
 +signed integer. Conversions that widen unsigned integers zero-extend; those that widen
 +signed integers sign-extend.</p>
 +
 +<p>The front end composes conversions between types <em>T</em><sub>1</sub> and <em>T</em><sub>2</sub>
 +by widening <em>T</em><sub>1</sub> to it's "supertype", if necessary, converting
 +that result to <em>T</em><sub>2</sub>'s supertype, then narrowing the result to <em>T</em><sub>2</sub>,
 +if necessary. The following table lists the supertypes; omitted entries are their own
 +supertypes.</p>
 +
 +<blockquote>
 +  <table BORDER="0" CELLPADDING="0" CELLSPACING="0">
 +    <tr>
 +      <td><strong>Type</strong></td>
 +      <td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="24" HEIGHT="1"></td>
 +      <td><strong>Supertype</strong></td>
 +    </tr>
 +    <tr>
 +      <td>signed char</td>
 +      <td></td>
 +      <td>int</td>
 +    </tr>
 +    <tr>
 +      <td>signed short</td>
 +      <td></td>
 +      <td>int</td>
 +    </tr>
 +    <tr ALIGN="LEFT" VALIGN="TOP">
 +      <td>unsigned char</td>
 +      <td></td>
 +      <td>int, if sizeof (char) < sizeof (int)<br>
 +      unsigned, otherwise</td>
 +    </tr>
 +    <tr ALIGN="LEFT" VALIGN="TOP">
 +      <td>unsigned short</td>
 +      <td></td>
 +      <td>int, if sizeof (short) < sizeof (int)<br>
 +      unsigned, otherwise</td>
 +    </tr>
 +    <tr ALIGN="LEFT" VALIGN="TOP">
 +      <td>void *</td>
 +      <td></td>
 +      <td>an unsigned type as large as a pointer</td>
 +    </tr>
 +  </table>
 +</blockquote>
 +
 +<p>Pointers are converted to an unsigned type of the same size, even when that type is not
 +one of the integer types.</p>
 +
 +<p>For example, the front end converts a signed short to a float by first converting it to
 +an int and then to a float. It converts an unsigned short to an int with a single <code>CVUI</code>i
 +conversion, when shorts are smaller than ints.</p>
 +
 +<p>There are now signed and unsigned variants of <code>ASGN</code>, <code>INDIR</code>, <code>BCOM</code>,
 +<code>BOR</code>, <code>BXOR</code>, <code>BAND</code>, <code>ARG</code>, <code>CALL</code>,
 +and <code>RET</code> to simplify code generation on platforms that use different
 +instructions or register set for signed and unsigned operations. Likewise there are now
 +pointer variants of <code>ASGN</code>, <code>INDIR</code>, <code>ARG</code>, <code>CALL</code>,
 +and <code>RET</code>.</p>
 +
 +<h2><a NAME="flags">5.6 Interface Flags</a></h2>
 +
 +<pre>unsigned unsigned_char:1;</pre>
 +
 +<p>tells the front end whether plain characters are signed or unsigned. If it's zero, char
 +is a signed type; otherwise, char is an unsigned type.</p>
 +
 +<p>All the interface flags can be set by command-line options, e.g., <code>-Wf-unsigned_char=1</code>
 +causes plain characters to be unsigned.</p>
 +
 +<h2><a NAME="definitions">5.8 Definitions</a></h2>
 +
 +<p>The front end announces local variables by calling</p>
 +
 +<pre>void (*local)(Symbol);</pre>
 +
 +<p>It announces temporaries likewise; these have the symbol's <code>temporary</code> flag
 +set, which indicates that the symbol will be used only in the next call to <code>gen</code>.
 +If a temporary's <code>u.t.cse</code> field is nonnull, it points to the node that
 +computes the value assigned to the temporary; see page 346.</p>
 +
 +<p>The front end calls</p>
 +
 +<pre>void (*address)(Symbol p, Symbol q, long n);</pre>
 +
 +<p>to initialize <code>q</code> to a symbol that represents an address of the form <em>x</em>+<code>n</code>,
 +where <em>x</em> is the address represented by <code>p</code> and the long integer <code>n</code>
 +is positive or negative.</p>
 +
 +<h2><a NAME="constants">5.9 Constants</a></h2>
 +
 +<p>The interface function</p>
 +
 +<pre>void (*defconst)(int suffix, int size, Value v);</pre>
 +
 +<p>initializes constants. defconst emits directives to define a cell and initialize it to
 +a constant value. v is the constant value, suffix identifies the type of the value, and
 +size is the size of the value in bytes. The value of suffix indicates which field of v
 +holds the value, as shown in the following table.</p>
 +
 +<blockquote>
 +  <table BORDER="0" CELLPADDING="1" CELLSPACING="1">
 +    <tr>
 +      <td><strong>suffix</strong></td>
 +      <td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="24" HEIGHT="1"></td>
 +      <td><strong>v Field</strong></td>
 +      <td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="24" HEIGHT="1"></td>
 +      <td><strong>size</strong></td>
 +    </tr>
 +    <tr>
 +      <td><code>F</code></td>
 +      <td></td>
 +      <td><code>v.d</code></td>
 +      <td></td>
 +      <td>float, double, long double</td>
 +    </tr>
 +    <tr>
 +      <td><code>I</code></td>
 +      <td></td>
 +      <td><code>v.i</code></td>
 +      <td></td>
 +      <td>signed char, signed short, signed int, signed long</td>
 +    </tr>
 +    <tr>
 +      <td><code>U</code></td>
 +      <td></td>
 +      <td><code>v.u</code></td>
 +      <td></td>
 +      <td>unsigned char, unsigned short, unsigned int, unsigned long</td>
 +    </tr>
 +    <tr>
 +      <td><code>P</code></td>
 +      <td></td>
 +      <td><code>v.p</code></td>
 +      <td></td>
 +      <td>void *</td>
 +    </tr>
 +  </table>
 +</blockquote>
 +
 +<p><code>defconst</code> must narrow <code>v.</code>x when <code>size</code> is less than <code>sizeof</code>
 +<code>v.</code>x; e.g., to emit an unsigned char, <code>defconst</code> should emit <code>(unsigned
 +char)v.i</code>.</p>
 +
 +<h2><a NAME="upcalls">5.12 Upcalls</a></h2>
 +
 +<p>lcc 4.x uses standard I/O and its I/O functions have been changed accordingly. lcc
 +reads input from the standard input, emits code to the standard output, and writes
 +diagnostics to the standard error output. It uses <code>freopen</code> to redirect these
 +streams to explicit files, when necessary.</p>
 +
 +<p><code>bp</code>, <code>outflush</code>, and <code>outs</code> have been eliminated.</p>
 +
 +<pre>extern void fprint(FILE *f, const char *fmt, ...);
 +extern void  print(const char *fmt, ...);</pre>
 +
 +<p>print formatted data to file <code>f</code> (<code>fprint</code>) or the standard
 +output (<code>print</code>). These functions are like standard C's <code>printf</code> and
 +<code>fprintf</code>, but support only some of the standard conversion specifiers and do
 +not support flags, precision, and field-width specifications. They support the following
 +new conversion specifiers in addition to those described on page 99.</p>
 +
 +<blockquote>
 +  <table BORDER="0" CELLPADDING="0" CELLSPACING="0">
 +    <tr>
 +      <td><strong>Specifiers</strong></td>
 +      <td><img SRC="/~drh/resources/dot_clear.gif" ALT="|" WIDTH="24" HEIGHT="1"></td>
 +      <td><strong>Corresponding printf Specifiers</strong></td>
 +    </tr>
 +    <tr>
 +      <td><code>%c</code></td>
 +      <td></td>
 +      <td><code>%c</code></td>
 +    </tr>
 +    <tr>
 +      <td><code>%d %D</code></td>
 +      <td></td>
 +      <td><code>%d %ld</code></td>
 +    </tr>
 +    <tr>
 +      <td><code>%u %U</code></td>
 +      <td></td>
 +      <td><code>%u %lu</code></td>
 +    </tr>
 +    <tr>
 +      <td><code>%x %X</code></td>
 +      <td></td>
 +      <td><code>%x %lx</code></td>
 +    </tr>
 +    <tr>
 +      <td><code>%f %e %g</code></td>
 +      <td></td>
 +      <td><code>%e %f %g</code></td>
 +    </tr>
 +    <tr ALIGN="LEFT" VALIGN="TOP">
 +      <td><code>%p</code></td>
 +      <td></td>
 +      <td>Converts the corresponding void * argument to unsigned long and prints it with the <code>printf</code>
 +      <code>%#x</code> specifier or just <code>%x</code> when the argument is null.</td>
 +    </tr>
 +    <tr ALIGN="LEFT" VALIGN="TOP">
 +      <td><code>%I</code></td>
 +      <td></td>
 +      <td>Prints the number of spaces given by the corresponding argument.</td>
 +    </tr>
 +  </table>
 +</blockquote>
 +
 +<pre>#define generic(op)  ((op)&0x3F0)
 +#define specific(op) ((op)&0x3FF)</pre>
 +
 +<p><code>generic(op)</code> returns the generic variant of <code>op</code>; that is,
 +without its type suffix and size indicator. <code>specific(op)</code> returns the
 +type-specific variant of <code>op</code>; that is, without its size indicator.</p>
 +
 +<p><code>newconst</code> has been replaced by</p>
 +
 +<pre>extern Symbol intconst(int n);</pre>
 +
 +<p>which installs the integer constant <code>n</code> in the symbol table, if necessary,
 +and returns a pointer to the symbol-table entry.</p>
 +
 +<hr>
 +
 +<address>
 +  <a HREF="http://www.research.microsoft.com/~cwfraser/">Chris Fraser</a> / <a
 +  HREF="mailto:cwfraser@microsoft.com">cwfraser@microsoft.com</a><br>
 +  <a HREF="http://www.research.microsoft.com/~drh/">David Hanson</a> / <a
 +  HREF="mailto:drh@microsoft.com">drh@microsoft.com</a><br>
 +  $Revision: 145 $ $Date: 2001-10-17 16:53:10 -0500 (Wed, 17 Oct 2001) $ 
 +</address>
 +</body>
 +</html>
 | 
