CbC/CbC_llvm: llvm/docs/GlobalISel/GMIR.rst annotate

annotate llvm/docs/GlobalISel/GMIR.rst @ 201:a96fbbdf2d0f

...

author	Shinji KONO <kono@ie.u-ryukyu.ac.jp>
date	Fri, 04 Jun 2021 21:07:06 +0900
parents	0572611fdcc8
children	2e18cbf3894f

rev	line source
150 1d019706d866 LLVM10 anatofuz parents: diff changeset	1 .. _gmir:
1d019706d866 LLVM10 anatofuz parents: diff changeset	2
1d019706d866 LLVM10 anatofuz parents: diff changeset	3 Generic Machine IR
1d019706d866 LLVM10 anatofuz parents: diff changeset	4 ==================
1d019706d866 LLVM10 anatofuz parents: diff changeset	5
1d019706d866 LLVM10 anatofuz parents: diff changeset	6 .. contents::
1d019706d866 LLVM10 anatofuz parents: diff changeset	7 :local:
1d019706d866 LLVM10 anatofuz parents: diff changeset	8
1d019706d866 LLVM10 anatofuz parents: diff changeset	9 Generic MIR (gMIR) is an intermediate representation that shares the same data
1d019706d866 LLVM10 anatofuz parents: diff changeset	10 structures as :doc:`MachineIR (MIR) <../MIRLangRef>` but has more relaxed
1d019706d866 LLVM10 anatofuz parents: diff changeset	11 constraints. As the compilation pipeline proceeds, these constraints are
1d019706d866 LLVM10 anatofuz parents: diff changeset	12 gradually tightened until gMIR has become MIR.
1d019706d866 LLVM10 anatofuz parents: diff changeset	13
1d019706d866 LLVM10 anatofuz parents: diff changeset	14 The rest of this document will assume that you are familiar with the concepts
1d019706d866 LLVM10 anatofuz parents: diff changeset	15 in :doc:`MachineIR (MIR) <../MIRLangRef>` and will highlight the differences
1d019706d866 LLVM10 anatofuz parents: diff changeset	16 between MIR and gMIR.
1d019706d866 LLVM10 anatofuz parents: diff changeset	17
1d019706d866 LLVM10 anatofuz parents: diff changeset	18 .. _gmir-instructions:
1d019706d866 LLVM10 anatofuz parents: diff changeset	19
1d019706d866 LLVM10 anatofuz parents: diff changeset	20 Generic Machine Instructions
1d019706d866 LLVM10 anatofuz parents: diff changeset	21 ----------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	22
1d019706d866 LLVM10 anatofuz parents: diff changeset	23 .. note::
1d019706d866 LLVM10 anatofuz parents: diff changeset	24
1d019706d866 LLVM10 anatofuz parents: diff changeset	25 This section expands on :ref:`mir-instructions` from the MIR Language
1d019706d866 LLVM10 anatofuz parents: diff changeset	26 Reference.
1d019706d866 LLVM10 anatofuz parents: diff changeset	27
1d019706d866 LLVM10 anatofuz parents: diff changeset	28 Whereas MIR deals largely in Target Instructions and only has a small set of
1d019706d866 LLVM10 anatofuz parents: diff changeset	29 target independent opcodes such as ``COPY``, ``PHI``, and ``REG_SEQUENCE``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	30 gMIR defines a rich collection of ``Generic Opcodes`` which are target
1d019706d866 LLVM10 anatofuz parents: diff changeset	31 independent and describe operations which are typically supported by targets.
1d019706d866 LLVM10 anatofuz parents: diff changeset	32 One example is ``G_ADD`` which is the generic opcode for an integer addition.
1d019706d866 LLVM10 anatofuz parents: diff changeset	33 More information on each of the generic opcodes can be found at
1d019706d866 LLVM10 anatofuz parents: diff changeset	34 :doc:`GenericOpcode`.
1d019706d866 LLVM10 anatofuz parents: diff changeset	35
1d019706d866 LLVM10 anatofuz parents: diff changeset	36 The ``MachineIRBuilder`` class wraps the ``MachineInstrBuilder`` and provides
1d019706d866 LLVM10 anatofuz parents: diff changeset	37 a convenient way to create these generic instructions.
1d019706d866 LLVM10 anatofuz parents: diff changeset	38
1d019706d866 LLVM10 anatofuz parents: diff changeset	39 .. _gmir-gvregs:
1d019706d866 LLVM10 anatofuz parents: diff changeset	40
1d019706d866 LLVM10 anatofuz parents: diff changeset	41 Generic Virtual Registers
1d019706d866 LLVM10 anatofuz parents: diff changeset	42 -------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	43
1d019706d866 LLVM10 anatofuz parents: diff changeset	44 .. note::
1d019706d866 LLVM10 anatofuz parents: diff changeset	45
1d019706d866 LLVM10 anatofuz parents: diff changeset	46 This section expands on :ref:`mir-registers` from the MIR Language
1d019706d866 LLVM10 anatofuz parents: diff changeset	47 Reference.
1d019706d866 LLVM10 anatofuz parents: diff changeset	48
1d019706d866 LLVM10 anatofuz parents: diff changeset	49 Generic virtual registers are like virtual registers but they are not assigned a
1d019706d866 LLVM10 anatofuz parents: diff changeset	50 Register Class constraint. Instead, generic virtual registers have less strict
1d019706d866 LLVM10 anatofuz parents: diff changeset	51 constraints starting with a :ref:`gmir-llt` and then further constrained to a
1d019706d866 LLVM10 anatofuz parents: diff changeset	52 :ref:`gmir-regbank`. Eventually they will be constrained to a register class
1d019706d866 LLVM10 anatofuz parents: diff changeset	53 at which point they become normal virtual registers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	54
1d019706d866 LLVM10 anatofuz parents: diff changeset	55 Generic virtual registers can be used with all the virtual register API's
1d019706d866 LLVM10 anatofuz parents: diff changeset	56 provided by ``MachineRegisterInfo``. In particular, the def-use chain API's can
1d019706d866 LLVM10 anatofuz parents: diff changeset	57 be used without needing to distinguish them from non-generic virtual registers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	58
1d019706d866 LLVM10 anatofuz parents: diff changeset	59 For simplicity, most generic instructions only accept virtual registers (both
1d019706d866 LLVM10 anatofuz parents: diff changeset	60 generic and non-generic). There are some exceptions to this but in general:
1d019706d866 LLVM10 anatofuz parents: diff changeset	61
1d019706d866 LLVM10 anatofuz parents: diff changeset	62 * instead of immediates, they use a generic virtual register defined by an
1d019706d866 LLVM10 anatofuz parents: diff changeset	63 instruction that materializes the immediate value (see
1d019706d866 LLVM10 anatofuz parents: diff changeset	64 :ref:`irtranslator-constants`). Typically this is a G_CONSTANT or a
1d019706d866 LLVM10 anatofuz parents: diff changeset	65 G_FCONSTANT. One example of an exception to this rule is G_SEXT_INREG where
1d019706d866 LLVM10 anatofuz parents: diff changeset	66 having an immediate is mandatory.
1d019706d866 LLVM10 anatofuz parents: diff changeset	67 * instead of physical register, they use a generic virtual register that is
1d019706d866 LLVM10 anatofuz parents: diff changeset	68 either defined by a ``COPY`` from the physical register or used by a ``COPY``
1d019706d866 LLVM10 anatofuz parents: diff changeset	69 that defines the physical register.
1d019706d866 LLVM10 anatofuz parents: diff changeset	70
1d019706d866 LLVM10 anatofuz parents: diff changeset	71 .. admonition:: Historical Note
1d019706d866 LLVM10 anatofuz parents: diff changeset	72
1d019706d866 LLVM10 anatofuz parents: diff changeset	73 We started with an alternative representation, where MRI tracks a size for
1d019706d866 LLVM10 anatofuz parents: diff changeset	74 each generic virtual register, and instructions have lists of types.
1d019706d866 LLVM10 anatofuz parents: diff changeset	75 That had two flaws: the type and size are redundant, and there was no generic
1d019706d866 LLVM10 anatofuz parents: diff changeset	76 way of getting a given operand's type (as there was no 1:1 mapping between
1d019706d866 LLVM10 anatofuz parents: diff changeset	77 instruction types and operands).
1d019706d866 LLVM10 anatofuz parents: diff changeset	78 We considered putting the type in some variant of MCInstrDesc instead:
173 0572611fdcc8 reorgnization done Shinji KONO <kono@ie.u-ryukyu.ac.jp> parents: 150 diff changeset	79 See `PR26576 <https://llvm.org/PR26576>`_: [GlobalISel] Generic MachineInstrs
150 1d019706d866 LLVM10 anatofuz parents: diff changeset	80 need a type but this increases the memory footprint of the related objects
1d019706d866 LLVM10 anatofuz parents: diff changeset	81
1d019706d866 LLVM10 anatofuz parents: diff changeset	82 .. _gmir-regbank:
1d019706d866 LLVM10 anatofuz parents: diff changeset	83
1d019706d866 LLVM10 anatofuz parents: diff changeset	84 Register Bank
1d019706d866 LLVM10 anatofuz parents: diff changeset	85 -------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	86
1d019706d866 LLVM10 anatofuz parents: diff changeset	87 A Register Bank is a set of register classes defined by the target. This
1d019706d866 LLVM10 anatofuz parents: diff changeset	88 definition is rather loose so let's talk about what they can achieve.
1d019706d866 LLVM10 anatofuz parents: diff changeset	89
1d019706d866 LLVM10 anatofuz parents: diff changeset	90 Suppose we have a processor that has two register files, A and B. These are
1d019706d866 LLVM10 anatofuz parents: diff changeset	91 equal in every way and support the same instructions for the same cost. They're
1d019706d866 LLVM10 anatofuz parents: diff changeset	92 just physically stored apart and each instruction can only access registers from
1d019706d866 LLVM10 anatofuz parents: diff changeset	93 A or B but never a mix of the two. If we want to perform an operation on data
1d019706d866 LLVM10 anatofuz parents: diff changeset	94 that's in split between the two register files, we must first copy all the data
1d019706d866 LLVM10 anatofuz parents: diff changeset	95 into a single register file.
1d019706d866 LLVM10 anatofuz parents: diff changeset	96
1d019706d866 LLVM10 anatofuz parents: diff changeset	97 Given a processor like this, we would benefit from clustering related data
1d019706d866 LLVM10 anatofuz parents: diff changeset	98 together into one register file so that we minimize the cost of copying data
1d019706d866 LLVM10 anatofuz parents: diff changeset	99 back and forth to satisfy the (possibly conflicting) requirements of all the
1d019706d866 LLVM10 anatofuz parents: diff changeset	100 instructions. Register Banks are a means to constrain the register allocator to
1d019706d866 LLVM10 anatofuz parents: diff changeset	101 use a particular register file for a virtual register.
1d019706d866 LLVM10 anatofuz parents: diff changeset	102
1d019706d866 LLVM10 anatofuz parents: diff changeset	103 In practice, register files A and B are rarely equal. They can typically store
1d019706d866 LLVM10 anatofuz parents: diff changeset	104 the same data but there's usually some restrictions on what operations you can
1d019706d866 LLVM10 anatofuz parents: diff changeset	105 do on each register file. A fairly common pattern is for one of them to be
1d019706d866 LLVM10 anatofuz parents: diff changeset	106 accessible to integer operations and the other accessible to floating point
1d019706d866 LLVM10 anatofuz parents: diff changeset	107 operations. To accomodate this, let's rename A and B to GPR (general purpose
1d019706d866 LLVM10 anatofuz parents: diff changeset	108 registers) and FPR (floating point registers).
1d019706d866 LLVM10 anatofuz parents: diff changeset	109
1d019706d866 LLVM10 anatofuz parents: diff changeset	110 We now have some additional constraints that limit us. An operation like G_FMUL
1d019706d866 LLVM10 anatofuz parents: diff changeset	111 has to happen in FPR and G_ADD has to happen in GPR. However, even though this
1d019706d866 LLVM10 anatofuz parents: diff changeset	112 prescribes a lot of the assignments we still have some freedom. A G_LOAD can
1d019706d866 LLVM10 anatofuz parents: diff changeset	113 happen in both GPR and FPR, and which we want depends on who is going to consume
1d019706d866 LLVM10 anatofuz parents: diff changeset	114 the loaded data. Similarly, G_FNEG can happen in both GPR and FPR. If we assign
1d019706d866 LLVM10 anatofuz parents: diff changeset	115 it to FPR, then we'll use floating point negation. However, if we assign it to
1d019706d866 LLVM10 anatofuz parents: diff changeset	116 GPR then we can equivalently G_XOR the sign bit with 1 to invert it.
1d019706d866 LLVM10 anatofuz parents: diff changeset	117
1d019706d866 LLVM10 anatofuz parents: diff changeset	118 In summary, Register Banks are a means of disambiguating between seemingly
1d019706d866 LLVM10 anatofuz parents: diff changeset	119 equivalent choices based on some analysis of the differences when each choice
1d019706d866 LLVM10 anatofuz parents: diff changeset	120 is applied in a given context.
1d019706d866 LLVM10 anatofuz parents: diff changeset	121
1d019706d866 LLVM10 anatofuz parents: diff changeset	122 To give some concrete examples:
1d019706d866 LLVM10 anatofuz parents: diff changeset	123
1d019706d866 LLVM10 anatofuz parents: diff changeset	124 AArch64
1d019706d866 LLVM10 anatofuz parents: diff changeset	125
1d019706d866 LLVM10 anatofuz parents: diff changeset	126 AArch64 has three main banks. GPR for integer operations, FPR for floating
1d019706d866 LLVM10 anatofuz parents: diff changeset	127 point and also for the NEON vector instruction set. The third is CCR and
1d019706d866 LLVM10 anatofuz parents: diff changeset	128 describes the condition code register used for predication.
1d019706d866 LLVM10 anatofuz parents: diff changeset	129
1d019706d866 LLVM10 anatofuz parents: diff changeset	130 MIPS
1d019706d866 LLVM10 anatofuz parents: diff changeset	131
1d019706d866 LLVM10 anatofuz parents: diff changeset	132 MIPS has five main banks of which many programs only really use one or two.
1d019706d866 LLVM10 anatofuz parents: diff changeset	133 GPR is the general purpose bank for integer operations. FGR or CP1 is for
1d019706d866 LLVM10 anatofuz parents: diff changeset	134 the floating point operations as well as the MSA vector instructions and a
1d019706d866 LLVM10 anatofuz parents: diff changeset	135 few other application specific extensions. CP0 is for system registers and
1d019706d866 LLVM10 anatofuz parents: diff changeset	136 few programs will use it. CP2 and CP3 are for any application specific
1d019706d866 LLVM10 anatofuz parents: diff changeset	137 coprocessors that may be present in the chip. Arguably, there is also a sixth
1d019706d866 LLVM10 anatofuz parents: diff changeset	138 for the LO and HI registers but these are only used for the result of a few
1d019706d866 LLVM10 anatofuz parents: diff changeset	139 operations and it's of questionable value to model distinctly from GPR.
1d019706d866 LLVM10 anatofuz parents: diff changeset	140
1d019706d866 LLVM10 anatofuz parents: diff changeset	141 X86
1d019706d866 LLVM10 anatofuz parents: diff changeset	142
1d019706d866 LLVM10 anatofuz parents: diff changeset	143 X86 can be seen as having 3 main banks: general-purpose, x87, and
1d019706d866 LLVM10 anatofuz parents: diff changeset	144 vector (which could be further split into a bank per domain for single vs
1d019706d866 LLVM10 anatofuz parents: diff changeset	145 double precision instructions). It also looks like there's arguably a few
1d019706d866 LLVM10 anatofuz parents: diff changeset	146 more potential banks such as one for the AVX512 Mask Registers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	147
1d019706d866 LLVM10 anatofuz parents: diff changeset	148 Register banks are described by a target-provided API,
1d019706d866 LLVM10 anatofuz parents: diff changeset	149 :ref:`RegisterBankInfo <api-registerbankinfo>`.
1d019706d866 LLVM10 anatofuz parents: diff changeset	150
1d019706d866 LLVM10 anatofuz parents: diff changeset	151 .. _gmir-llt:
1d019706d866 LLVM10 anatofuz parents: diff changeset	152
1d019706d866 LLVM10 anatofuz parents: diff changeset	153 Low Level Type
1d019706d866 LLVM10 anatofuz parents: diff changeset	154 --------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	155
1d019706d866 LLVM10 anatofuz parents: diff changeset	156 Additionally, every generic virtual register has a type, represented by an
1d019706d866 LLVM10 anatofuz parents: diff changeset	157 instance of the ``LLT`` class.
1d019706d866 LLVM10 anatofuz parents: diff changeset	158
1d019706d866 LLVM10 anatofuz parents: diff changeset	159 Like ``EVT``/``MVT``/``Type``, it has no distinction between unsigned and signed
1d019706d866 LLVM10 anatofuz parents: diff changeset	160 integer types. Furthermore, it also has no distinction between integer and
1d019706d866 LLVM10 anatofuz parents: diff changeset	161 floating-point types: it mainly conveys absolutely necessary information, such
1d019706d866 LLVM10 anatofuz parents: diff changeset	162 as size and number of vector lanes:
1d019706d866 LLVM10 anatofuz parents: diff changeset	163
1d019706d866 LLVM10 anatofuz parents: diff changeset	164 * ``sN`` for scalars
1d019706d866 LLVM10 anatofuz parents: diff changeset	165 * ``pN`` for pointers
1d019706d866 LLVM10 anatofuz parents: diff changeset	166 * ``<N x sM>`` for vectors
1d019706d866 LLVM10 anatofuz parents: diff changeset	167
1d019706d866 LLVM10 anatofuz parents: diff changeset	168 ``LLT`` is intended to replace the usage of ``EVT`` in SelectionDAG.
1d019706d866 LLVM10 anatofuz parents: diff changeset	169
1d019706d866 LLVM10 anatofuz parents: diff changeset	170 Here are some LLT examples and their ``EVT`` and ``Type`` equivalents:
1d019706d866 LLVM10 anatofuz parents: diff changeset	171
1d019706d866 LLVM10 anatofuz parents: diff changeset	172 ============= ========= ======================================
1d019706d866 LLVM10 anatofuz parents: diff changeset	173 LLT EVT IR Type
1d019706d866 LLVM10 anatofuz parents: diff changeset	174 ============= ========= ======================================
1d019706d866 LLVM10 anatofuz parents: diff changeset	175 ``s1`` ``i1`` ``i1``
1d019706d866 LLVM10 anatofuz parents: diff changeset	176 ``s8`` ``i8`` ``i8``
1d019706d866 LLVM10 anatofuz parents: diff changeset	177 ``s32`` ``i32`` ``i32``
1d019706d866 LLVM10 anatofuz parents: diff changeset	178 ``s32`` ``f32`` ``float``
1d019706d866 LLVM10 anatofuz parents: diff changeset	179 ``s17`` ``i17`` ``i17``
1d019706d866 LLVM10 anatofuz parents: diff changeset	180 ``s16`` N/A ``{i8, i8}`` [#abi-dependent]_
1d019706d866 LLVM10 anatofuz parents: diff changeset	181 ``s32`` N/A ``[4 x i8]`` [#abi-dependent]_
1d019706d866 LLVM10 anatofuz parents: diff changeset	182 ``p0`` ``iPTR`` ``i8``, ``i32``, ``%opaque*``
1d019706d866 LLVM10 anatofuz parents: diff changeset	183 ``p2`` ``iPTR`` ``i8 addrspace(2)*``
1d019706d866 LLVM10 anatofuz parents: diff changeset	184 ``<4 x s32>`` ``v4f32`` ``<4 x float>``
1d019706d866 LLVM10 anatofuz parents: diff changeset	185 ``s64`` ``v1f64`` ``<1 x double>``
1d019706d866 LLVM10 anatofuz parents: diff changeset	186 ``<3 x s32>`` ``v3i32`` ``<3 x i32>``
1d019706d866 LLVM10 anatofuz parents: diff changeset	187 ============= ========= ======================================
1d019706d866 LLVM10 anatofuz parents: diff changeset	188
1d019706d866 LLVM10 anatofuz parents: diff changeset	189
1d019706d866 LLVM10 anatofuz parents: diff changeset	190 Rationale: instructions already encode a specific interpretation of types
1d019706d866 LLVM10 anatofuz parents: diff changeset	191 (e.g., ``add`` vs. ``fadd``, or ``sdiv`` vs. ``udiv``). Also encoding that
1d019706d866 LLVM10 anatofuz parents: diff changeset	192 information in the type system requires introducing bitcast with no real
1d019706d866 LLVM10 anatofuz parents: diff changeset	193 advantage for the selector.
1d019706d866 LLVM10 anatofuz parents: diff changeset	194
1d019706d866 LLVM10 anatofuz parents: diff changeset	195 Pointer types are distinguished by address space. This matches IR, as opposed
1d019706d866 LLVM10 anatofuz parents: diff changeset	196 to SelectionDAG where address space is an attribute on operations.
1d019706d866 LLVM10 anatofuz parents: diff changeset	197 This representation better supports pointers having different sizes depending
1d019706d866 LLVM10 anatofuz parents: diff changeset	198 on their addressspace.
1d019706d866 LLVM10 anatofuz parents: diff changeset	199
1d019706d866 LLVM10 anatofuz parents: diff changeset	200 .. note::
1d019706d866 LLVM10 anatofuz parents: diff changeset	201
1d019706d866 LLVM10 anatofuz parents: diff changeset	202 .. caution::
1d019706d866 LLVM10 anatofuz parents: diff changeset	203
1d019706d866 LLVM10 anatofuz parents: diff changeset	204 Is this still true? I thought we'd removed the 1-element vector concept.
1d019706d866 LLVM10 anatofuz parents: diff changeset	205 Hypothetically, it could be distinct from a scalar but I think we failed to
1d019706d866 LLVM10 anatofuz parents: diff changeset	206 find a real occurrence.
1d019706d866 LLVM10 anatofuz parents: diff changeset	207
1d019706d866 LLVM10 anatofuz parents: diff changeset	208 Currently, LLT requires at least 2 elements in vectors, but some targets have
1d019706d866 LLVM10 anatofuz parents: diff changeset	209 the concept of a '1-element vector'. Representing them as their underlying
1d019706d866 LLVM10 anatofuz parents: diff changeset	210 scalar type is a nice simplification.
1d019706d866 LLVM10 anatofuz parents: diff changeset	211
1d019706d866 LLVM10 anatofuz parents: diff changeset	212 .. rubric:: Footnotes
1d019706d866 LLVM10 anatofuz parents: diff changeset	213
1d019706d866 LLVM10 anatofuz parents: diff changeset	214 .. [#abi-dependent] This mapping is ABI dependent. Here we've assumed no additional padding is required.
1d019706d866 LLVM10 anatofuz parents: diff changeset	215
1d019706d866 LLVM10 anatofuz parents: diff changeset	216 Generic Opcode Reference
1d019706d866 LLVM10 anatofuz parents: diff changeset	217 ------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	218
1d019706d866 LLVM10 anatofuz parents: diff changeset	219 The Generic Opcodes that are available are described at :doc:`GenericOpcode`.

Mercurial > hg > CbC > CbC_llvm

annotate llvm/docs/GlobalISel/GMIR.rst @ 201:a96fbbdf2d0f