CbC/CbC_llvm: llvm/docs/WritingAnLLVMBackend.rst annotate

annotate llvm/docs/WritingAnLLVMBackend.rst @ 164:fdfabb438fbf

...

author	anatofuz
date	Thu, 19 Mar 2020 17:02:53 +0900
parents	1d019706d866
children	c4bab56944e8

rev	line source
150 1d019706d866 LLVM10 anatofuz parents: diff changeset	1 =======================
1d019706d866 LLVM10 anatofuz parents: diff changeset	2 Writing an LLVM Backend
1d019706d866 LLVM10 anatofuz parents: diff changeset	3 =======================
1d019706d866 LLVM10 anatofuz parents: diff changeset	4
1d019706d866 LLVM10 anatofuz parents: diff changeset	5 .. toctree::
1d019706d866 LLVM10 anatofuz parents: diff changeset	6 :hidden:
1d019706d866 LLVM10 anatofuz parents: diff changeset	7
1d019706d866 LLVM10 anatofuz parents: diff changeset	8 HowToUseInstrMappings
1d019706d866 LLVM10 anatofuz parents: diff changeset	9
1d019706d866 LLVM10 anatofuz parents: diff changeset	10 .. contents::
1d019706d866 LLVM10 anatofuz parents: diff changeset	11 :local:
1d019706d866 LLVM10 anatofuz parents: diff changeset	12
1d019706d866 LLVM10 anatofuz parents: diff changeset	13 Introduction
1d019706d866 LLVM10 anatofuz parents: diff changeset	14 ============
1d019706d866 LLVM10 anatofuz parents: diff changeset	15
1d019706d866 LLVM10 anatofuz parents: diff changeset	16 This document describes techniques for writing compiler backends that convert
1d019706d866 LLVM10 anatofuz parents: diff changeset	17 the LLVM Intermediate Representation (IR) to code for a specified machine or
1d019706d866 LLVM10 anatofuz parents: diff changeset	18 other languages. Code intended for a specific machine can take the form of
1d019706d866 LLVM10 anatofuz parents: diff changeset	19 either assembly code or binary code (usable for a JIT compiler).
1d019706d866 LLVM10 anatofuz parents: diff changeset	20
1d019706d866 LLVM10 anatofuz parents: diff changeset	21 The backend of LLVM features a target-independent code generator that may
1d019706d866 LLVM10 anatofuz parents: diff changeset	22 create output for several types of target CPUs --- including X86, PowerPC,
1d019706d866 LLVM10 anatofuz parents: diff changeset	23 ARM, and SPARC. The backend may also be used to generate code targeted at SPUs
1d019706d866 LLVM10 anatofuz parents: diff changeset	24 of the Cell processor or GPUs to support the execution of compute kernels.
1d019706d866 LLVM10 anatofuz parents: diff changeset	25
1d019706d866 LLVM10 anatofuz parents: diff changeset	26 The document focuses on existing examples found in subdirectories of
1d019706d866 LLVM10 anatofuz parents: diff changeset	27 ``llvm/lib/Target`` in a downloaded LLVM release. In particular, this document
1d019706d866 LLVM10 anatofuz parents: diff changeset	28 focuses on the example of creating a static compiler (one that emits text
1d019706d866 LLVM10 anatofuz parents: diff changeset	29 assembly) for a SPARC target, because SPARC has fairly standard
1d019706d866 LLVM10 anatofuz parents: diff changeset	30 characteristics, such as a RISC instruction set and straightforward calling
1d019706d866 LLVM10 anatofuz parents: diff changeset	31 conventions.
1d019706d866 LLVM10 anatofuz parents: diff changeset	32
1d019706d866 LLVM10 anatofuz parents: diff changeset	33 Audience
1d019706d866 LLVM10 anatofuz parents: diff changeset	34 --------
1d019706d866 LLVM10 anatofuz parents: diff changeset	35
1d019706d866 LLVM10 anatofuz parents: diff changeset	36 The audience for this document is anyone who needs to write an LLVM backend to
1d019706d866 LLVM10 anatofuz parents: diff changeset	37 generate code for a specific hardware or software target.
1d019706d866 LLVM10 anatofuz parents: diff changeset	38
1d019706d866 LLVM10 anatofuz parents: diff changeset	39 Prerequisite Reading
1d019706d866 LLVM10 anatofuz parents: diff changeset	40 --------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	41
1d019706d866 LLVM10 anatofuz parents: diff changeset	42 These essential documents must be read before reading this document:
1d019706d866 LLVM10 anatofuz parents: diff changeset	43
1d019706d866 LLVM10 anatofuz parents: diff changeset	44 * `LLVM Language Reference Manual <LangRef.html>`_ --- a reference manual for
1d019706d866 LLVM10 anatofuz parents: diff changeset	45 the LLVM assembly language.
1d019706d866 LLVM10 anatofuz parents: diff changeset	46
1d019706d866 LLVM10 anatofuz parents: diff changeset	47 * :doc:`CodeGenerator` --- a guide to the components (classes and code
1d019706d866 LLVM10 anatofuz parents: diff changeset	48 generation algorithms) for translating the LLVM internal representation into
1d019706d866 LLVM10 anatofuz parents: diff changeset	49 machine code for a specified target. Pay particular attention to the
1d019706d866 LLVM10 anatofuz parents: diff changeset	50 descriptions of code generation stages: Instruction Selection, Scheduling and
1d019706d866 LLVM10 anatofuz parents: diff changeset	51 Formation, SSA-based Optimization, Register Allocation, Prolog/Epilog Code
1d019706d866 LLVM10 anatofuz parents: diff changeset	52 Insertion, Late Machine Code Optimizations, and Code Emission.
1d019706d866 LLVM10 anatofuz parents: diff changeset	53
1d019706d866 LLVM10 anatofuz parents: diff changeset	54 * :doc:`TableGen/index` --- a document that describes the TableGen
1d019706d866 LLVM10 anatofuz parents: diff changeset	55 (``tblgen``) application that manages domain-specific information to support
1d019706d866 LLVM10 anatofuz parents: diff changeset	56 LLVM code generation. TableGen processes input from a target description
1d019706d866 LLVM10 anatofuz parents: diff changeset	57 file (``.td`` suffix) and generates C++ code that can be used for code
1d019706d866 LLVM10 anatofuz parents: diff changeset	58 generation.
1d019706d866 LLVM10 anatofuz parents: diff changeset	59
1d019706d866 LLVM10 anatofuz parents: diff changeset	60 * :doc:`WritingAnLLVMPass` --- The assembly printer is a ``FunctionPass``, as
1d019706d866 LLVM10 anatofuz parents: diff changeset	61 are several ``SelectionDAG`` processing steps.
1d019706d866 LLVM10 anatofuz parents: diff changeset	62
1d019706d866 LLVM10 anatofuz parents: diff changeset	63 To follow the SPARC examples in this document, have a copy of `The SPARC
1d019706d866 LLVM10 anatofuz parents: diff changeset	64 Architecture Manual, Version 8 <http://www.sparc.org/standards/V8.pdf>`_ for
1d019706d866 LLVM10 anatofuz parents: diff changeset	65 reference. For details about the ARM instruction set, refer to the `ARM
1d019706d866 LLVM10 anatofuz parents: diff changeset	66 Architecture Reference Manual <http://infocenter.arm.com/>`_. For more about
1d019706d866 LLVM10 anatofuz parents: diff changeset	67 the GNU Assembler format (``GAS``), see `Using As
1d019706d866 LLVM10 anatofuz parents: diff changeset	68 <http://sourceware.org/binutils/docs/as/index.html>`_, especially for the
1d019706d866 LLVM10 anatofuz parents: diff changeset	69 assembly printer. "Using As" contains a list of target machine dependent
1d019706d866 LLVM10 anatofuz parents: diff changeset	70 features.
1d019706d866 LLVM10 anatofuz parents: diff changeset	71
1d019706d866 LLVM10 anatofuz parents: diff changeset	72 Basic Steps
1d019706d866 LLVM10 anatofuz parents: diff changeset	73 -----------
1d019706d866 LLVM10 anatofuz parents: diff changeset	74
1d019706d866 LLVM10 anatofuz parents: diff changeset	75 To write a compiler backend for LLVM that converts the LLVM IR to code for a
1d019706d866 LLVM10 anatofuz parents: diff changeset	76 specified target (machine or other language), follow these steps:
1d019706d866 LLVM10 anatofuz parents: diff changeset	77
1d019706d866 LLVM10 anatofuz parents: diff changeset	78 * Create a subclass of the ``TargetMachine`` class that describes
1d019706d866 LLVM10 anatofuz parents: diff changeset	79 characteristics of your target machine. Copy existing examples of specific
1d019706d866 LLVM10 anatofuz parents: diff changeset	80 ``TargetMachine`` class and header files; for example, start with
1d019706d866 LLVM10 anatofuz parents: diff changeset	81 ``SparcTargetMachine.cpp`` and ``SparcTargetMachine.h``, but change the file
1d019706d866 LLVM10 anatofuz parents: diff changeset	82 names for your target. Similarly, change code that references "``Sparc``" to
1d019706d866 LLVM10 anatofuz parents: diff changeset	83 reference your target.
1d019706d866 LLVM10 anatofuz parents: diff changeset	84
1d019706d866 LLVM10 anatofuz parents: diff changeset	85 * Describe the register set of the target. Use TableGen to generate code for
1d019706d866 LLVM10 anatofuz parents: diff changeset	86 register definition, register aliases, and register classes from a
1d019706d866 LLVM10 anatofuz parents: diff changeset	87 target-specific ``RegisterInfo.td`` input file. You should also write
1d019706d866 LLVM10 anatofuz parents: diff changeset	88 additional code for a subclass of the ``TargetRegisterInfo`` class that
1d019706d866 LLVM10 anatofuz parents: diff changeset	89 represents the class register file data used for register allocation and also
1d019706d866 LLVM10 anatofuz parents: diff changeset	90 describes the interactions between registers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	91
1d019706d866 LLVM10 anatofuz parents: diff changeset	92 * Describe the instruction set of the target. Use TableGen to generate code
1d019706d866 LLVM10 anatofuz parents: diff changeset	93 for target-specific instructions from target-specific versions of
1d019706d866 LLVM10 anatofuz parents: diff changeset	94 ``TargetInstrFormats.td`` and ``TargetInstrInfo.td``. You should write
1d019706d866 LLVM10 anatofuz parents: diff changeset	95 additional code for a subclass of the ``TargetInstrInfo`` class to represent
1d019706d866 LLVM10 anatofuz parents: diff changeset	96 machine instructions supported by the target machine.
1d019706d866 LLVM10 anatofuz parents: diff changeset	97
1d019706d866 LLVM10 anatofuz parents: diff changeset	98 * Describe the selection and conversion of the LLVM IR from a Directed Acyclic
1d019706d866 LLVM10 anatofuz parents: diff changeset	99 Graph (DAG) representation of instructions to native target-specific
1d019706d866 LLVM10 anatofuz parents: diff changeset	100 instructions. Use TableGen to generate code that matches patterns and
1d019706d866 LLVM10 anatofuz parents: diff changeset	101 selects instructions based on additional information in a target-specific
1d019706d866 LLVM10 anatofuz parents: diff changeset	102 version of ``TargetInstrInfo.td``. Write code for ``XXXISelDAGToDAG.cpp``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	103 where ``XXX`` identifies the specific target, to perform pattern matching and
1d019706d866 LLVM10 anatofuz parents: diff changeset	104 DAG-to-DAG instruction selection. Also write code in ``XXXISelLowering.cpp``
1d019706d866 LLVM10 anatofuz parents: diff changeset	105 to replace or remove operations and data types that are not supported
1d019706d866 LLVM10 anatofuz parents: diff changeset	106 natively in a SelectionDAG.
1d019706d866 LLVM10 anatofuz parents: diff changeset	107
1d019706d866 LLVM10 anatofuz parents: diff changeset	108 * Write code for an assembly printer that converts LLVM IR to a GAS format for
1d019706d866 LLVM10 anatofuz parents: diff changeset	109 your target machine. You should add assembly strings to the instructions
1d019706d866 LLVM10 anatofuz parents: diff changeset	110 defined in your target-specific version of ``TargetInstrInfo.td``. You
1d019706d866 LLVM10 anatofuz parents: diff changeset	111 should also write code for a subclass of ``AsmPrinter`` that performs the
1d019706d866 LLVM10 anatofuz parents: diff changeset	112 LLVM-to-assembly conversion and a trivial subclass of ``TargetAsmInfo``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	113
1d019706d866 LLVM10 anatofuz parents: diff changeset	114 * Optionally, add support for subtargets (i.e., variants with different
1d019706d866 LLVM10 anatofuz parents: diff changeset	115 capabilities). You should also write code for a subclass of the
1d019706d866 LLVM10 anatofuz parents: diff changeset	116 ``TargetSubtarget`` class, which allows you to use the ``-mcpu=`` and
1d019706d866 LLVM10 anatofuz parents: diff changeset	117 ``-mattr=`` command-line options.
1d019706d866 LLVM10 anatofuz parents: diff changeset	118
1d019706d866 LLVM10 anatofuz parents: diff changeset	119 * Optionally, add JIT support and create a machine code emitter (subclass of
1d019706d866 LLVM10 anatofuz parents: diff changeset	120 ``TargetJITInfo``) that is used to emit binary code directly into memory.
1d019706d866 LLVM10 anatofuz parents: diff changeset	121
1d019706d866 LLVM10 anatofuz parents: diff changeset	122 In the ``.cpp`` and ``.h``. files, initially stub up these methods and then
1d019706d866 LLVM10 anatofuz parents: diff changeset	123 implement them later. Initially, you may not know which private members that
1d019706d866 LLVM10 anatofuz parents: diff changeset	124 the class will need and which components will need to be subclassed.
1d019706d866 LLVM10 anatofuz parents: diff changeset	125
1d019706d866 LLVM10 anatofuz parents: diff changeset	126 Preliminaries
1d019706d866 LLVM10 anatofuz parents: diff changeset	127 -------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	128
1d019706d866 LLVM10 anatofuz parents: diff changeset	129 To actually create your compiler backend, you need to create and modify a few
1d019706d866 LLVM10 anatofuz parents: diff changeset	130 files. The absolute minimum is discussed here. But to actually use the LLVM
1d019706d866 LLVM10 anatofuz parents: diff changeset	131 target-independent code generator, you must perform the steps described in the
1d019706d866 LLVM10 anatofuz parents: diff changeset	132 :doc:`LLVM Target-Independent Code Generator <CodeGenerator>` document.
1d019706d866 LLVM10 anatofuz parents: diff changeset	133
1d019706d866 LLVM10 anatofuz parents: diff changeset	134 First, you should create a subdirectory under ``lib/Target`` to hold all the
1d019706d866 LLVM10 anatofuz parents: diff changeset	135 files related to your target. If your target is called "Dummy", create the
1d019706d866 LLVM10 anatofuz parents: diff changeset	136 directory ``lib/Target/Dummy``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	137
1d019706d866 LLVM10 anatofuz parents: diff changeset	138 In this new directory, create a ``CMakeLists.txt``. It is easiest to copy a
1d019706d866 LLVM10 anatofuz parents: diff changeset	139 ``CMakeLists.txt`` of another target and modify it. It should at least contain
1d019706d866 LLVM10 anatofuz parents: diff changeset	140 the ``LLVM_TARGET_DEFINITIONS`` variable. The library can be named ``LLVMDummy``
1d019706d866 LLVM10 anatofuz parents: diff changeset	141 (for example, see the MIPS target). Alternatively, you can split the library
1d019706d866 LLVM10 anatofuz parents: diff changeset	142 into ``LLVMDummyCodeGen`` and ``LLVMDummyAsmPrinter``, the latter of which
1d019706d866 LLVM10 anatofuz parents: diff changeset	143 should be implemented in a subdirectory below ``lib/Target/Dummy`` (for example,
1d019706d866 LLVM10 anatofuz parents: diff changeset	144 see the PowerPC target).
1d019706d866 LLVM10 anatofuz parents: diff changeset	145
1d019706d866 LLVM10 anatofuz parents: diff changeset	146 Note that these two naming schemes are hardcoded into ``llvm-config``. Using
1d019706d866 LLVM10 anatofuz parents: diff changeset	147 any other naming scheme will confuse ``llvm-config`` and produce a lot of
1d019706d866 LLVM10 anatofuz parents: diff changeset	148 (seemingly unrelated) linker errors when linking ``llc``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	149
1d019706d866 LLVM10 anatofuz parents: diff changeset	150 To make your target actually do something, you need to implement a subclass of
1d019706d866 LLVM10 anatofuz parents: diff changeset	151 ``TargetMachine``. This implementation should typically be in the file
1d019706d866 LLVM10 anatofuz parents: diff changeset	152 ``lib/Target/DummyTargetMachine.cpp``, but any file in the ``lib/Target``
1d019706d866 LLVM10 anatofuz parents: diff changeset	153 directory will be built and should work. To use LLVM's target independent code
1d019706d866 LLVM10 anatofuz parents: diff changeset	154 generator, you should do what all current machine backends do: create a
1d019706d866 LLVM10 anatofuz parents: diff changeset	155 subclass of ``LLVMTargetMachine``. (To create a target from scratch, create a
1d019706d866 LLVM10 anatofuz parents: diff changeset	156 subclass of ``TargetMachine``.)
1d019706d866 LLVM10 anatofuz parents: diff changeset	157
1d019706d866 LLVM10 anatofuz parents: diff changeset	158 To get LLVM to actually build and link your target, you need to run ``cmake``
1d019706d866 LLVM10 anatofuz parents: diff changeset	159 with ``-DLLVM_EXPERIMENTAL_TARGETS_TO_BUILD=Dummy``. This will build your
1d019706d866 LLVM10 anatofuz parents: diff changeset	160 target without needing to add it to the list of all the targets.
1d019706d866 LLVM10 anatofuz parents: diff changeset	161
1d019706d866 LLVM10 anatofuz parents: diff changeset	162 Once your target is stable, you can add it to the ``LLVM_ALL_TARGETS`` variable
1d019706d866 LLVM10 anatofuz parents: diff changeset	163 located in the main ``CMakeLists.txt``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	164
1d019706d866 LLVM10 anatofuz parents: diff changeset	165 Target Machine
1d019706d866 LLVM10 anatofuz parents: diff changeset	166 ==============
1d019706d866 LLVM10 anatofuz parents: diff changeset	167
1d019706d866 LLVM10 anatofuz parents: diff changeset	168 ``LLVMTargetMachine`` is designed as a base class for targets implemented with
1d019706d866 LLVM10 anatofuz parents: diff changeset	169 the LLVM target-independent code generator. The ``LLVMTargetMachine`` class
1d019706d866 LLVM10 anatofuz parents: diff changeset	170 should be specialized by a concrete target class that implements the various
1d019706d866 LLVM10 anatofuz parents: diff changeset	171 virtual methods. ``LLVMTargetMachine`` is defined as a subclass of
1d019706d866 LLVM10 anatofuz parents: diff changeset	172 ``TargetMachine`` in ``include/llvm/Target/TargetMachine.h``. The
1d019706d866 LLVM10 anatofuz parents: diff changeset	173 ``TargetMachine`` class implementation (``TargetMachine.cpp``) also processes
1d019706d866 LLVM10 anatofuz parents: diff changeset	174 numerous command-line options.
1d019706d866 LLVM10 anatofuz parents: diff changeset	175
1d019706d866 LLVM10 anatofuz parents: diff changeset	176 To create a concrete target-specific subclass of ``LLVMTargetMachine``, start
1d019706d866 LLVM10 anatofuz parents: diff changeset	177 by copying an existing ``TargetMachine`` class and header. You should name the
1d019706d866 LLVM10 anatofuz parents: diff changeset	178 files that you create to reflect your specific target. For instance, for the
1d019706d866 LLVM10 anatofuz parents: diff changeset	179 SPARC target, name the files ``SparcTargetMachine.h`` and
1d019706d866 LLVM10 anatofuz parents: diff changeset	180 ``SparcTargetMachine.cpp``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	181
1d019706d866 LLVM10 anatofuz parents: diff changeset	182 For a target machine ``XXX``, the implementation of ``XXXTargetMachine`` must
1d019706d866 LLVM10 anatofuz parents: diff changeset	183 have access methods to obtain objects that represent target components. These
1d019706d866 LLVM10 anatofuz parents: diff changeset	184 methods are named ``get*Info``, and are intended to obtain the instruction set
1d019706d866 LLVM10 anatofuz parents: diff changeset	185 (``getInstrInfo``), register set (``getRegisterInfo``), stack frame layout
1d019706d866 LLVM10 anatofuz parents: diff changeset	186 (``getFrameInfo``), and similar information. ``XXXTargetMachine`` must also
1d019706d866 LLVM10 anatofuz parents: diff changeset	187 implement the ``getDataLayout`` method to access an object with target-specific
1d019706d866 LLVM10 anatofuz parents: diff changeset	188 data characteristics, such as data type size and alignment requirements.
1d019706d866 LLVM10 anatofuz parents: diff changeset	189
1d019706d866 LLVM10 anatofuz parents: diff changeset	190 For instance, for the SPARC target, the header file ``SparcTargetMachine.h``
1d019706d866 LLVM10 anatofuz parents: diff changeset	191 declares prototypes for several ``get*Info`` and ``getDataLayout`` methods that
1d019706d866 LLVM10 anatofuz parents: diff changeset	192 simply return a class member.
1d019706d866 LLVM10 anatofuz parents: diff changeset	193
1d019706d866 LLVM10 anatofuz parents: diff changeset	194 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	195
1d019706d866 LLVM10 anatofuz parents: diff changeset	196 namespace llvm {
1d019706d866 LLVM10 anatofuz parents: diff changeset	197
1d019706d866 LLVM10 anatofuz parents: diff changeset	198 class Module;
1d019706d866 LLVM10 anatofuz parents: diff changeset	199
1d019706d866 LLVM10 anatofuz parents: diff changeset	200 class SparcTargetMachine : public LLVMTargetMachine {
1d019706d866 LLVM10 anatofuz parents: diff changeset	201 const DataLayout DataLayout; // Calculates type size & alignment
1d019706d866 LLVM10 anatofuz parents: diff changeset	202 SparcSubtarget Subtarget;
1d019706d866 LLVM10 anatofuz parents: diff changeset	203 SparcInstrInfo InstrInfo;
1d019706d866 LLVM10 anatofuz parents: diff changeset	204 TargetFrameInfo FrameInfo;
1d019706d866 LLVM10 anatofuz parents: diff changeset	205
1d019706d866 LLVM10 anatofuz parents: diff changeset	206 protected:
1d019706d866 LLVM10 anatofuz parents: diff changeset	207 virtual const TargetAsmInfo *createTargetAsmInfo() const;
1d019706d866 LLVM10 anatofuz parents: diff changeset	208
1d019706d866 LLVM10 anatofuz parents: diff changeset	209 public:
1d019706d866 LLVM10 anatofuz parents: diff changeset	210 SparcTargetMachine(const Module &M, const std::string &FS);
1d019706d866 LLVM10 anatofuz parents: diff changeset	211
1d019706d866 LLVM10 anatofuz parents: diff changeset	212 virtual const SparcInstrInfo *getInstrInfo() const {return &InstrInfo; }
1d019706d866 LLVM10 anatofuz parents: diff changeset	213 virtual const TargetFrameInfo *getFrameInfo() const {return &FrameInfo; }
1d019706d866 LLVM10 anatofuz parents: diff changeset	214 virtual const TargetSubtarget *getSubtargetImpl() const{return &Subtarget; }
1d019706d866 LLVM10 anatofuz parents: diff changeset	215 virtual const TargetRegisterInfo *getRegisterInfo() const {
1d019706d866 LLVM10 anatofuz parents: diff changeset	216 return &InstrInfo.getRegisterInfo();
1d019706d866 LLVM10 anatofuz parents: diff changeset	217 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	218 virtual const DataLayout *getDataLayout() const { return &DataLayout; }
1d019706d866 LLVM10 anatofuz parents: diff changeset	219 static unsigned getModuleMatchQuality(const Module &M);
1d019706d866 LLVM10 anatofuz parents: diff changeset	220
1d019706d866 LLVM10 anatofuz parents: diff changeset	221 // Pass Pipeline Configuration
1d019706d866 LLVM10 anatofuz parents: diff changeset	222 virtual bool addInstSelector(PassManagerBase &PM, bool Fast);
1d019706d866 LLVM10 anatofuz parents: diff changeset	223 virtual bool addPreEmitPass(PassManagerBase &PM, bool Fast);
1d019706d866 LLVM10 anatofuz parents: diff changeset	224 };
1d019706d866 LLVM10 anatofuz parents: diff changeset	225
1d019706d866 LLVM10 anatofuz parents: diff changeset	226 } // end namespace llvm
1d019706d866 LLVM10 anatofuz parents: diff changeset	227
1d019706d866 LLVM10 anatofuz parents: diff changeset	228 * ``getInstrInfo()``
1d019706d866 LLVM10 anatofuz parents: diff changeset	229 * ``getRegisterInfo()``
1d019706d866 LLVM10 anatofuz parents: diff changeset	230 * ``getFrameInfo()``
1d019706d866 LLVM10 anatofuz parents: diff changeset	231 * ``getDataLayout()``
1d019706d866 LLVM10 anatofuz parents: diff changeset	232 * ``getSubtargetImpl()``
1d019706d866 LLVM10 anatofuz parents: diff changeset	233
1d019706d866 LLVM10 anatofuz parents: diff changeset	234 For some targets, you also need to support the following methods:
1d019706d866 LLVM10 anatofuz parents: diff changeset	235
1d019706d866 LLVM10 anatofuz parents: diff changeset	236 * ``getTargetLowering()``
1d019706d866 LLVM10 anatofuz parents: diff changeset	237 * ``getJITInfo()``
1d019706d866 LLVM10 anatofuz parents: diff changeset	238
1d019706d866 LLVM10 anatofuz parents: diff changeset	239 Some architectures, such as GPUs, do not support jumping to an arbitrary
1d019706d866 LLVM10 anatofuz parents: diff changeset	240 program location and implement branching using masked execution and loop using
1d019706d866 LLVM10 anatofuz parents: diff changeset	241 special instructions around the loop body. In order to avoid CFG modifications
1d019706d866 LLVM10 anatofuz parents: diff changeset	242 that introduce irreducible control flow not handled by such hardware, a target
1d019706d866 LLVM10 anatofuz parents: diff changeset	243 must call `setRequiresStructuredCFG(true)` when being initialized.
1d019706d866 LLVM10 anatofuz parents: diff changeset	244
1d019706d866 LLVM10 anatofuz parents: diff changeset	245 In addition, the ``XXXTargetMachine`` constructor should specify a
1d019706d866 LLVM10 anatofuz parents: diff changeset	246 ``TargetDescription`` string that determines the data layout for the target
1d019706d866 LLVM10 anatofuz parents: diff changeset	247 machine, including characteristics such as pointer size, alignment, and
1d019706d866 LLVM10 anatofuz parents: diff changeset	248 endianness. For example, the constructor for ``SparcTargetMachine`` contains
1d019706d866 LLVM10 anatofuz parents: diff changeset	249 the following:
1d019706d866 LLVM10 anatofuz parents: diff changeset	250
1d019706d866 LLVM10 anatofuz parents: diff changeset	251 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	252
1d019706d866 LLVM10 anatofuz parents: diff changeset	253 SparcTargetMachine::SparcTargetMachine(const Module &M, const std::string &FS)
1d019706d866 LLVM10 anatofuz parents: diff changeset	254 : DataLayout("E-p:32:32-f128:128:128"),
1d019706d866 LLVM10 anatofuz parents: diff changeset	255 Subtarget(M, FS), InstrInfo(Subtarget),
1d019706d866 LLVM10 anatofuz parents: diff changeset	256 FrameInfo(TargetFrameInfo::StackGrowsDown, 8, 0) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	257 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	258
1d019706d866 LLVM10 anatofuz parents: diff changeset	259 Hyphens separate portions of the ``TargetDescription`` string.
1d019706d866 LLVM10 anatofuz parents: diff changeset	260
1d019706d866 LLVM10 anatofuz parents: diff changeset	261 * An upper-case "``E``" in the string indicates a big-endian target data model.
1d019706d866 LLVM10 anatofuz parents: diff changeset	262 A lower-case "``e``" indicates little-endian.
1d019706d866 LLVM10 anatofuz parents: diff changeset	263
1d019706d866 LLVM10 anatofuz parents: diff changeset	264 * "``p:``" is followed by pointer information: size, ABI alignment, and
1d019706d866 LLVM10 anatofuz parents: diff changeset	265 preferred alignment. If only two figures follow "``p:``", then the first
1d019706d866 LLVM10 anatofuz parents: diff changeset	266 value is pointer size, and the second value is both ABI and preferred
1d019706d866 LLVM10 anatofuz parents: diff changeset	267 alignment.
1d019706d866 LLVM10 anatofuz parents: diff changeset	268
1d019706d866 LLVM10 anatofuz parents: diff changeset	269 * Then a letter for numeric type alignment: "``i``", "``f``", "``v``", or
1d019706d866 LLVM10 anatofuz parents: diff changeset	270 "``a``" (corresponding to integer, floating point, vector, or aggregate).
1d019706d866 LLVM10 anatofuz parents: diff changeset	271 "``i``", "``v``", or "``a``" are followed by ABI alignment and preferred
1d019706d866 LLVM10 anatofuz parents: diff changeset	272 alignment. "``f``" is followed by three values: the first indicates the size
1d019706d866 LLVM10 anatofuz parents: diff changeset	273 of a long double, then ABI alignment, and then ABI preferred alignment.
1d019706d866 LLVM10 anatofuz parents: diff changeset	274
1d019706d866 LLVM10 anatofuz parents: diff changeset	275 Target Registration
1d019706d866 LLVM10 anatofuz parents: diff changeset	276 ===================
1d019706d866 LLVM10 anatofuz parents: diff changeset	277
1d019706d866 LLVM10 anatofuz parents: diff changeset	278 You must also register your target with the ``TargetRegistry``, which is what
1d019706d866 LLVM10 anatofuz parents: diff changeset	279 other LLVM tools use to be able to lookup and use your target at runtime. The
1d019706d866 LLVM10 anatofuz parents: diff changeset	280 ``TargetRegistry`` can be used directly, but for most targets there are helper
1d019706d866 LLVM10 anatofuz parents: diff changeset	281 templates which should take care of the work for you.
1d019706d866 LLVM10 anatofuz parents: diff changeset	282
1d019706d866 LLVM10 anatofuz parents: diff changeset	283 All targets should declare a global ``Target`` object which is used to
1d019706d866 LLVM10 anatofuz parents: diff changeset	284 represent the target during registration. Then, in the target's ``TargetInfo``
1d019706d866 LLVM10 anatofuz parents: diff changeset	285 library, the target should define that object and use the ``RegisterTarget``
1d019706d866 LLVM10 anatofuz parents: diff changeset	286 template to register the target. For example, the Sparc registration code
1d019706d866 LLVM10 anatofuz parents: diff changeset	287 looks like this:
1d019706d866 LLVM10 anatofuz parents: diff changeset	288
1d019706d866 LLVM10 anatofuz parents: diff changeset	289 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	290
1d019706d866 LLVM10 anatofuz parents: diff changeset	291 Target llvm::getTheSparcTarget();
1d019706d866 LLVM10 anatofuz parents: diff changeset	292
1d019706d866 LLVM10 anatofuz parents: diff changeset	293 extern "C" void LLVMInitializeSparcTargetInfo() {
1d019706d866 LLVM10 anatofuz parents: diff changeset	294 RegisterTarget<Triple::sparc, /HasJIT=/false>
1d019706d866 LLVM10 anatofuz parents: diff changeset	295 X(getTheSparcTarget(), "sparc", "Sparc");
1d019706d866 LLVM10 anatofuz parents: diff changeset	296 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	297
1d019706d866 LLVM10 anatofuz parents: diff changeset	298 This allows the ``TargetRegistry`` to look up the target by name or by target
1d019706d866 LLVM10 anatofuz parents: diff changeset	299 triple. In addition, most targets will also register additional features which
1d019706d866 LLVM10 anatofuz parents: diff changeset	300 are available in separate libraries. These registration steps are separate,
1d019706d866 LLVM10 anatofuz parents: diff changeset	301 because some clients may wish to only link in some parts of the target --- the
1d019706d866 LLVM10 anatofuz parents: diff changeset	302 JIT code generator does not require the use of the assembler printer, for
1d019706d866 LLVM10 anatofuz parents: diff changeset	303 example. Here is an example of registering the Sparc assembly printer:
1d019706d866 LLVM10 anatofuz parents: diff changeset	304
1d019706d866 LLVM10 anatofuz parents: diff changeset	305 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	306
1d019706d866 LLVM10 anatofuz parents: diff changeset	307 extern "C" void LLVMInitializeSparcAsmPrinter() {
1d019706d866 LLVM10 anatofuz parents: diff changeset	308 RegisterAsmPrinter<SparcAsmPrinter> X(getTheSparcTarget());
1d019706d866 LLVM10 anatofuz parents: diff changeset	309 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	310
1d019706d866 LLVM10 anatofuz parents: diff changeset	311 For more information, see "`llvm/Target/TargetRegistry.h
1d019706d866 LLVM10 anatofuz parents: diff changeset	312 </doxygen/TargetRegistry_8h-source.html>`_".
1d019706d866 LLVM10 anatofuz parents: diff changeset	313
1d019706d866 LLVM10 anatofuz parents: diff changeset	314 Register Set and Register Classes
1d019706d866 LLVM10 anatofuz parents: diff changeset	315 =================================
1d019706d866 LLVM10 anatofuz parents: diff changeset	316
1d019706d866 LLVM10 anatofuz parents: diff changeset	317 You should describe a concrete target-specific class that represents the
1d019706d866 LLVM10 anatofuz parents: diff changeset	318 register file of a target machine. This class is called ``XXXRegisterInfo``
1d019706d866 LLVM10 anatofuz parents: diff changeset	319 (where ``XXX`` identifies the target) and represents the class register file
1d019706d866 LLVM10 anatofuz parents: diff changeset	320 data that is used for register allocation. It also describes the interactions
1d019706d866 LLVM10 anatofuz parents: diff changeset	321 between registers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	322
1d019706d866 LLVM10 anatofuz parents: diff changeset	323 You also need to define register classes to categorize related registers. A
1d019706d866 LLVM10 anatofuz parents: diff changeset	324 register class should be added for groups of registers that are all treated the
1d019706d866 LLVM10 anatofuz parents: diff changeset	325 same way for some instruction. Typical examples are register classes for
1d019706d866 LLVM10 anatofuz parents: diff changeset	326 integer, floating-point, or vector registers. A register allocator allows an
1d019706d866 LLVM10 anatofuz parents: diff changeset	327 instruction to use any register in a specified register class to perform the
1d019706d866 LLVM10 anatofuz parents: diff changeset	328 instruction in a similar manner. Register classes allocate virtual registers
1d019706d866 LLVM10 anatofuz parents: diff changeset	329 to instructions from these sets, and register classes let the
1d019706d866 LLVM10 anatofuz parents: diff changeset	330 target-independent register allocator automatically choose the actual
1d019706d866 LLVM10 anatofuz parents: diff changeset	331 registers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	332
1d019706d866 LLVM10 anatofuz parents: diff changeset	333 Much of the code for registers, including register definition, register
1d019706d866 LLVM10 anatofuz parents: diff changeset	334 aliases, and register classes, is generated by TableGen from
1d019706d866 LLVM10 anatofuz parents: diff changeset	335 ``XXXRegisterInfo.td`` input files and placed in ``XXXGenRegisterInfo.h.inc``
1d019706d866 LLVM10 anatofuz parents: diff changeset	336 and ``XXXGenRegisterInfo.inc`` output files. Some of the code in the
1d019706d866 LLVM10 anatofuz parents: diff changeset	337 implementation of ``XXXRegisterInfo`` requires hand-coding.
1d019706d866 LLVM10 anatofuz parents: diff changeset	338
1d019706d866 LLVM10 anatofuz parents: diff changeset	339 Defining a Register
1d019706d866 LLVM10 anatofuz parents: diff changeset	340 -------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	341
1d019706d866 LLVM10 anatofuz parents: diff changeset	342 The ``XXXRegisterInfo.td`` file typically starts with register definitions for
1d019706d866 LLVM10 anatofuz parents: diff changeset	343 a target machine. The ``Register`` class (specified in ``Target.td``) is used
1d019706d866 LLVM10 anatofuz parents: diff changeset	344 to define an object for each register. The specified string ``n`` becomes the
1d019706d866 LLVM10 anatofuz parents: diff changeset	345 ``Name`` of the register. The basic ``Register`` object does not have any
1d019706d866 LLVM10 anatofuz parents: diff changeset	346 subregisters and does not specify any aliases.
1d019706d866 LLVM10 anatofuz parents: diff changeset	347
1d019706d866 LLVM10 anatofuz parents: diff changeset	348 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	349
1d019706d866 LLVM10 anatofuz parents: diff changeset	350 class Register<string n> {
1d019706d866 LLVM10 anatofuz parents: diff changeset	351 string Namespace = "";
1d019706d866 LLVM10 anatofuz parents: diff changeset	352 string AsmName = n;
1d019706d866 LLVM10 anatofuz parents: diff changeset	353 string Name = n;
1d019706d866 LLVM10 anatofuz parents: diff changeset	354 int SpillSize = 0;
1d019706d866 LLVM10 anatofuz parents: diff changeset	355 int SpillAlignment = 0;
1d019706d866 LLVM10 anatofuz parents: diff changeset	356 list<Register> Aliases = [];
1d019706d866 LLVM10 anatofuz parents: diff changeset	357 list<Register> SubRegs = [];
1d019706d866 LLVM10 anatofuz parents: diff changeset	358 list<int> DwarfNumbers = [];
1d019706d866 LLVM10 anatofuz parents: diff changeset	359 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	360
1d019706d866 LLVM10 anatofuz parents: diff changeset	361 For example, in the ``X86RegisterInfo.td`` file, there are register definitions
1d019706d866 LLVM10 anatofuz parents: diff changeset	362 that utilize the ``Register`` class, such as:
1d019706d866 LLVM10 anatofuz parents: diff changeset	363
1d019706d866 LLVM10 anatofuz parents: diff changeset	364 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	365
1d019706d866 LLVM10 anatofuz parents: diff changeset	366 def AL : Register<"AL">, DwarfRegNum<[0, 0, 0]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	367
1d019706d866 LLVM10 anatofuz parents: diff changeset	368 This defines the register ``AL`` and assigns it values (with ``DwarfRegNum``)
1d019706d866 LLVM10 anatofuz parents: diff changeset	369 that are used by ``gcc``, ``gdb``, or a debug information writer to identify a
1d019706d866 LLVM10 anatofuz parents: diff changeset	370 register. For register ``AL``, ``DwarfRegNum`` takes an array of 3 values
1d019706d866 LLVM10 anatofuz parents: diff changeset	371 representing 3 different modes: the first element is for X86-64, the second for
1d019706d866 LLVM10 anatofuz parents: diff changeset	372 exception handling (EH) on X86-32, and the third is generic. -1 is a special
1d019706d866 LLVM10 anatofuz parents: diff changeset	373 Dwarf number that indicates the gcc number is undefined, and -2 indicates the
1d019706d866 LLVM10 anatofuz parents: diff changeset	374 register number is invalid for this mode.
1d019706d866 LLVM10 anatofuz parents: diff changeset	375
1d019706d866 LLVM10 anatofuz parents: diff changeset	376 From the previously described line in the ``X86RegisterInfo.td`` file, TableGen
1d019706d866 LLVM10 anatofuz parents: diff changeset	377 generates this code in the ``X86GenRegisterInfo.inc`` file:
1d019706d866 LLVM10 anatofuz parents: diff changeset	378
1d019706d866 LLVM10 anatofuz parents: diff changeset	379 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	380
1d019706d866 LLVM10 anatofuz parents: diff changeset	381 static const unsigned GR8[] = { X86::AL, ... };
1d019706d866 LLVM10 anatofuz parents: diff changeset	382
1d019706d866 LLVM10 anatofuz parents: diff changeset	383 const unsigned AL_AliasSet[] = { X86::AX, X86::EAX, X86::RAX, 0 };
1d019706d866 LLVM10 anatofuz parents: diff changeset	384
1d019706d866 LLVM10 anatofuz parents: diff changeset	385 const TargetRegisterDesc RegisterDescriptors[] = {
1d019706d866 LLVM10 anatofuz parents: diff changeset	386 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	387 { "AL", "AL", AL_AliasSet, Empty_SubRegsSet, Empty_SubRegsSet, AL_SuperRegsSet }, ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	388
1d019706d866 LLVM10 anatofuz parents: diff changeset	389 From the register info file, TableGen generates a ``TargetRegisterDesc`` object
1d019706d866 LLVM10 anatofuz parents: diff changeset	390 for each register. ``TargetRegisterDesc`` is defined in
1d019706d866 LLVM10 anatofuz parents: diff changeset	391 ``include/llvm/Target/TargetRegisterInfo.h`` with the following fields:
1d019706d866 LLVM10 anatofuz parents: diff changeset	392
1d019706d866 LLVM10 anatofuz parents: diff changeset	393 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	394
1d019706d866 LLVM10 anatofuz parents: diff changeset	395 struct TargetRegisterDesc {
1d019706d866 LLVM10 anatofuz parents: diff changeset	396 const char *AsmName; // Assembly language name for the register
1d019706d866 LLVM10 anatofuz parents: diff changeset	397 const char *Name; // Printable name for the reg (for debugging)
1d019706d866 LLVM10 anatofuz parents: diff changeset	398 const unsigned *AliasSet; // Register Alias Set
1d019706d866 LLVM10 anatofuz parents: diff changeset	399 const unsigned *SubRegs; // Sub-register set
1d019706d866 LLVM10 anatofuz parents: diff changeset	400 const unsigned *ImmSubRegs; // Immediate sub-register set
1d019706d866 LLVM10 anatofuz parents: diff changeset	401 const unsigned *SuperRegs; // Super-register set
1d019706d866 LLVM10 anatofuz parents: diff changeset	402 };
1d019706d866 LLVM10 anatofuz parents: diff changeset	403
1d019706d866 LLVM10 anatofuz parents: diff changeset	404 TableGen uses the entire target description file (``.td``) to determine text
1d019706d866 LLVM10 anatofuz parents: diff changeset	405 names for the register (in the ``AsmName`` and ``Name`` fields of
1d019706d866 LLVM10 anatofuz parents: diff changeset	406 ``TargetRegisterDesc``) and the relationships of other registers to the defined
1d019706d866 LLVM10 anatofuz parents: diff changeset	407 register (in the other ``TargetRegisterDesc`` fields). In this example, other
1d019706d866 LLVM10 anatofuz parents: diff changeset	408 definitions establish the registers "``AX``", "``EAX``", and "``RAX``" as
1d019706d866 LLVM10 anatofuz parents: diff changeset	409 aliases for one another, so TableGen generates a null-terminated array
1d019706d866 LLVM10 anatofuz parents: diff changeset	410 (``AL_AliasSet``) for this register alias set.
1d019706d866 LLVM10 anatofuz parents: diff changeset	411
1d019706d866 LLVM10 anatofuz parents: diff changeset	412 The ``Register`` class is commonly used as a base class for more complex
1d019706d866 LLVM10 anatofuz parents: diff changeset	413 classes. In ``Target.td``, the ``Register`` class is the base for the
1d019706d866 LLVM10 anatofuz parents: diff changeset	414 ``RegisterWithSubRegs`` class that is used to define registers that need to
1d019706d866 LLVM10 anatofuz parents: diff changeset	415 specify subregisters in the ``SubRegs`` list, as shown here:
1d019706d866 LLVM10 anatofuz parents: diff changeset	416
1d019706d866 LLVM10 anatofuz parents: diff changeset	417 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	418
1d019706d866 LLVM10 anatofuz parents: diff changeset	419 class RegisterWithSubRegs<string n, list<Register> subregs> : Register<n> {
1d019706d866 LLVM10 anatofuz parents: diff changeset	420 let SubRegs = subregs;
1d019706d866 LLVM10 anatofuz parents: diff changeset	421 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	422
1d019706d866 LLVM10 anatofuz parents: diff changeset	423 In ``SparcRegisterInfo.td``, additional register classes are defined for SPARC:
1d019706d866 LLVM10 anatofuz parents: diff changeset	424 a ``Register`` subclass, ``SparcReg``, and further subclasses: ``Ri``, ``Rf``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	425 and ``Rd``. SPARC registers are identified by 5-bit ID numbers, which is a
1d019706d866 LLVM10 anatofuz parents: diff changeset	426 feature common to these subclasses. Note the use of "``let``" expressions to
1d019706d866 LLVM10 anatofuz parents: diff changeset	427 override values that are initially defined in a superclass (such as ``SubRegs``
1d019706d866 LLVM10 anatofuz parents: diff changeset	428 field in the ``Rd`` class).
1d019706d866 LLVM10 anatofuz parents: diff changeset	429
1d019706d866 LLVM10 anatofuz parents: diff changeset	430 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	431
1d019706d866 LLVM10 anatofuz parents: diff changeset	432 class SparcReg<string n> : Register<n> {
1d019706d866 LLVM10 anatofuz parents: diff changeset	433 field bits<5> Num;
1d019706d866 LLVM10 anatofuz parents: diff changeset	434 let Namespace = "SP";
1d019706d866 LLVM10 anatofuz parents: diff changeset	435 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	436 // Ri - 32-bit integer registers
1d019706d866 LLVM10 anatofuz parents: diff changeset	437 class Ri<bits<5> num, string n> :
1d019706d866 LLVM10 anatofuz parents: diff changeset	438 SparcReg<n> {
1d019706d866 LLVM10 anatofuz parents: diff changeset	439 let Num = num;
1d019706d866 LLVM10 anatofuz parents: diff changeset	440 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	441 // Rf - 32-bit floating-point registers
1d019706d866 LLVM10 anatofuz parents: diff changeset	442 class Rf<bits<5> num, string n> :
1d019706d866 LLVM10 anatofuz parents: diff changeset	443 SparcReg<n> {
1d019706d866 LLVM10 anatofuz parents: diff changeset	444 let Num = num;
1d019706d866 LLVM10 anatofuz parents: diff changeset	445 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	446 // Rd - Slots in the FP register file for 64-bit floating-point values.
1d019706d866 LLVM10 anatofuz parents: diff changeset	447 class Rd<bits<5> num, string n, list<Register> subregs> : SparcReg<n> {
1d019706d866 LLVM10 anatofuz parents: diff changeset	448 let Num = num;
1d019706d866 LLVM10 anatofuz parents: diff changeset	449 let SubRegs = subregs;
1d019706d866 LLVM10 anatofuz parents: diff changeset	450 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	451
1d019706d866 LLVM10 anatofuz parents: diff changeset	452 In the ``SparcRegisterInfo.td`` file, there are register definitions that
1d019706d866 LLVM10 anatofuz parents: diff changeset	453 utilize these subclasses of ``Register``, such as:
1d019706d866 LLVM10 anatofuz parents: diff changeset	454
1d019706d866 LLVM10 anatofuz parents: diff changeset	455 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	456
1d019706d866 LLVM10 anatofuz parents: diff changeset	457 def G0 : Ri< 0, "G0">, DwarfRegNum<[0]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	458 def G1 : Ri< 1, "G1">, DwarfRegNum<[1]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	459 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	460 def F0 : Rf< 0, "F0">, DwarfRegNum<[32]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	461 def F1 : Rf< 1, "F1">, DwarfRegNum<[33]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	462 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	463 def D0 : Rd< 0, "F0", [F0, F1]>, DwarfRegNum<[32]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	464 def D1 : Rd< 2, "F2", [F2, F3]>, DwarfRegNum<[34]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	465
1d019706d866 LLVM10 anatofuz parents: diff changeset	466 The last two registers shown above (``D0`` and ``D1``) are double-precision
1d019706d866 LLVM10 anatofuz parents: diff changeset	467 floating-point registers that are aliases for pairs of single-precision
1d019706d866 LLVM10 anatofuz parents: diff changeset	468 floating-point sub-registers. In addition to aliases, the sub-register and
1d019706d866 LLVM10 anatofuz parents: diff changeset	469 super-register relationships of the defined register are in fields of a
1d019706d866 LLVM10 anatofuz parents: diff changeset	470 register's ``TargetRegisterDesc``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	471
1d019706d866 LLVM10 anatofuz parents: diff changeset	472 Defining a Register Class
1d019706d866 LLVM10 anatofuz parents: diff changeset	473 -------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	474
1d019706d866 LLVM10 anatofuz parents: diff changeset	475 The ``RegisterClass`` class (specified in ``Target.td``) is used to define an
1d019706d866 LLVM10 anatofuz parents: diff changeset	476 object that represents a group of related registers and also defines the
1d019706d866 LLVM10 anatofuz parents: diff changeset	477 default allocation order of the registers. A target description file
1d019706d866 LLVM10 anatofuz parents: diff changeset	478 ``XXXRegisterInfo.td`` that uses ``Target.td`` can construct register classes
1d019706d866 LLVM10 anatofuz parents: diff changeset	479 using the following class:
1d019706d866 LLVM10 anatofuz parents: diff changeset	480
1d019706d866 LLVM10 anatofuz parents: diff changeset	481 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	482
1d019706d866 LLVM10 anatofuz parents: diff changeset	483 class RegisterClass<string namespace,
1d019706d866 LLVM10 anatofuz parents: diff changeset	484 list<ValueType> regTypes, int alignment, dag regList> {
1d019706d866 LLVM10 anatofuz parents: diff changeset	485 string Namespace = namespace;
1d019706d866 LLVM10 anatofuz parents: diff changeset	486 list<ValueType> RegTypes = regTypes;
1d019706d866 LLVM10 anatofuz parents: diff changeset	487 int Size = 0; // spill size, in bits; zero lets tblgen pick the size
1d019706d866 LLVM10 anatofuz parents: diff changeset	488 int Alignment = alignment;
1d019706d866 LLVM10 anatofuz parents: diff changeset	489
1d019706d866 LLVM10 anatofuz parents: diff changeset	490 // CopyCost is the cost of copying a value between two registers
1d019706d866 LLVM10 anatofuz parents: diff changeset	491 // default value 1 means a single instruction
1d019706d866 LLVM10 anatofuz parents: diff changeset	492 // A negative value means copying is extremely expensive or impossible
1d019706d866 LLVM10 anatofuz parents: diff changeset	493 int CopyCost = 1;
1d019706d866 LLVM10 anatofuz parents: diff changeset	494 dag MemberList = regList;
1d019706d866 LLVM10 anatofuz parents: diff changeset	495
1d019706d866 LLVM10 anatofuz parents: diff changeset	496 // for register classes that are subregisters of this class
1d019706d866 LLVM10 anatofuz parents: diff changeset	497 list<RegisterClass> SubRegClassList = [];
1d019706d866 LLVM10 anatofuz parents: diff changeset	498
1d019706d866 LLVM10 anatofuz parents: diff changeset	499 code MethodProtos = [{}]; // to insert arbitrary code
1d019706d866 LLVM10 anatofuz parents: diff changeset	500 code MethodBodies = [{}];
1d019706d866 LLVM10 anatofuz parents: diff changeset	501 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	502
1d019706d866 LLVM10 anatofuz parents: diff changeset	503 To define a ``RegisterClass``, use the following 4 arguments:
1d019706d866 LLVM10 anatofuz parents: diff changeset	504
1d019706d866 LLVM10 anatofuz parents: diff changeset	505 * The first argument of the definition is the name of the namespace.
1d019706d866 LLVM10 anatofuz parents: diff changeset	506
1d019706d866 LLVM10 anatofuz parents: diff changeset	507 * The second argument is a list of ``ValueType`` register type values that are
1d019706d866 LLVM10 anatofuz parents: diff changeset	508 defined in ``include/llvm/CodeGen/ValueTypes.td``. Defined values include
1d019706d866 LLVM10 anatofuz parents: diff changeset	509 integer types (such as ``i16``, ``i32``, and ``i1`` for Boolean),
1d019706d866 LLVM10 anatofuz parents: diff changeset	510 floating-point types (``f32``, ``f64``), and vector types (for example,
1d019706d866 LLVM10 anatofuz parents: diff changeset	511 ``v8i16`` for an ``8 x i16`` vector). All registers in a ``RegisterClass``
1d019706d866 LLVM10 anatofuz parents: diff changeset	512 must have the same ``ValueType``, but some registers may store vector data in
1d019706d866 LLVM10 anatofuz parents: diff changeset	513 different configurations. For example a register that can process a 128-bit
1d019706d866 LLVM10 anatofuz parents: diff changeset	514 vector may be able to handle 16 8-bit integer elements, 8 16-bit integers, 4
1d019706d866 LLVM10 anatofuz parents: diff changeset	515 32-bit integers, and so on.
1d019706d866 LLVM10 anatofuz parents: diff changeset	516
1d019706d866 LLVM10 anatofuz parents: diff changeset	517 * The third argument of the ``RegisterClass`` definition specifies the
1d019706d866 LLVM10 anatofuz parents: diff changeset	518 alignment required of the registers when they are stored or loaded to
1d019706d866 LLVM10 anatofuz parents: diff changeset	519 memory.
1d019706d866 LLVM10 anatofuz parents: diff changeset	520
1d019706d866 LLVM10 anatofuz parents: diff changeset	521 * The final argument, ``regList``, specifies which registers are in this class.
1d019706d866 LLVM10 anatofuz parents: diff changeset	522 If an alternative allocation order method is not specified, then ``regList``
1d019706d866 LLVM10 anatofuz parents: diff changeset	523 also defines the order of allocation used by the register allocator. Besides
1d019706d866 LLVM10 anatofuz parents: diff changeset	524 simply listing registers with ``(add R0, R1, ...)``, more advanced set
1d019706d866 LLVM10 anatofuz parents: diff changeset	525 operators are available. See ``include/llvm/Target/Target.td`` for more
1d019706d866 LLVM10 anatofuz parents: diff changeset	526 information.
1d019706d866 LLVM10 anatofuz parents: diff changeset	527
1d019706d866 LLVM10 anatofuz parents: diff changeset	528 In ``SparcRegisterInfo.td``, three ``RegisterClass`` objects are defined:
1d019706d866 LLVM10 anatofuz parents: diff changeset	529 ``FPRegs``, ``DFPRegs``, and ``IntRegs``. For all three register classes, the
1d019706d866 LLVM10 anatofuz parents: diff changeset	530 first argument defines the namespace with the string "``SP``". ``FPRegs``
1d019706d866 LLVM10 anatofuz parents: diff changeset	531 defines a group of 32 single-precision floating-point registers (``F0`` to
1d019706d866 LLVM10 anatofuz parents: diff changeset	532 ``F31``); ``DFPRegs`` defines a group of 16 double-precision registers
1d019706d866 LLVM10 anatofuz parents: diff changeset	533 (``D0-D15``).
1d019706d866 LLVM10 anatofuz parents: diff changeset	534
1d019706d866 LLVM10 anatofuz parents: diff changeset	535 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	536
1d019706d866 LLVM10 anatofuz parents: diff changeset	537 // F0, F1, F2, ..., F31
1d019706d866 LLVM10 anatofuz parents: diff changeset	538 def FPRegs : RegisterClass<"SP", [f32], 32, (sequence "F%u", 0, 31)>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	539
1d019706d866 LLVM10 anatofuz parents: diff changeset	540 def DFPRegs : RegisterClass<"SP", [f64], 64,
1d019706d866 LLVM10 anatofuz parents: diff changeset	541 (add D0, D1, D2, D3, D4, D5, D6, D7, D8,
1d019706d866 LLVM10 anatofuz parents: diff changeset	542 D9, D10, D11, D12, D13, D14, D15)>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	543
1d019706d866 LLVM10 anatofuz parents: diff changeset	544 def IntRegs : RegisterClass<"SP", [i32], 32,
1d019706d866 LLVM10 anatofuz parents: diff changeset	545 (add L0, L1, L2, L3, L4, L5, L6, L7,
1d019706d866 LLVM10 anatofuz parents: diff changeset	546 I0, I1, I2, I3, I4, I5,
1d019706d866 LLVM10 anatofuz parents: diff changeset	547 O0, O1, O2, O3, O4, O5, O7,
1d019706d866 LLVM10 anatofuz parents: diff changeset	548 G1,
1d019706d866 LLVM10 anatofuz parents: diff changeset	549 // Non-allocatable regs:
1d019706d866 LLVM10 anatofuz parents: diff changeset	550 G2, G3, G4,
1d019706d866 LLVM10 anatofuz parents: diff changeset	551 O6, // stack ptr
1d019706d866 LLVM10 anatofuz parents: diff changeset	552 I6, // frame ptr
1d019706d866 LLVM10 anatofuz parents: diff changeset	553 I7, // return address
1d019706d866 LLVM10 anatofuz parents: diff changeset	554 G0, // constant zero
1d019706d866 LLVM10 anatofuz parents: diff changeset	555 G5, G6, G7 // reserved for kernel
1d019706d866 LLVM10 anatofuz parents: diff changeset	556 )>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	557
1d019706d866 LLVM10 anatofuz parents: diff changeset	558 Using ``SparcRegisterInfo.td`` with TableGen generates several output files
1d019706d866 LLVM10 anatofuz parents: diff changeset	559 that are intended for inclusion in other source code that you write.
1d019706d866 LLVM10 anatofuz parents: diff changeset	560 ``SparcRegisterInfo.td`` generates ``SparcGenRegisterInfo.h.inc``, which should
1d019706d866 LLVM10 anatofuz parents: diff changeset	561 be included in the header file for the implementation of the SPARC register
1d019706d866 LLVM10 anatofuz parents: diff changeset	562 implementation that you write (``SparcRegisterInfo.h``). In
1d019706d866 LLVM10 anatofuz parents: diff changeset	563 ``SparcGenRegisterInfo.h.inc`` a new structure is defined called
1d019706d866 LLVM10 anatofuz parents: diff changeset	564 ``SparcGenRegisterInfo`` that uses ``TargetRegisterInfo`` as its base. It also
1d019706d866 LLVM10 anatofuz parents: diff changeset	565 specifies types, based upon the defined register classes: ``DFPRegsClass``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	566 ``FPRegsClass``, and ``IntRegsClass``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	567
1d019706d866 LLVM10 anatofuz parents: diff changeset	568 ``SparcRegisterInfo.td`` also generates ``SparcGenRegisterInfo.inc``, which is
1d019706d866 LLVM10 anatofuz parents: diff changeset	569 included at the bottom of ``SparcRegisterInfo.cpp``, the SPARC register
1d019706d866 LLVM10 anatofuz parents: diff changeset	570 implementation. The code below shows only the generated integer registers and
1d019706d866 LLVM10 anatofuz parents: diff changeset	571 associated register classes. The order of registers in ``IntRegs`` reflects
1d019706d866 LLVM10 anatofuz parents: diff changeset	572 the order in the definition of ``IntRegs`` in the target description file.
1d019706d866 LLVM10 anatofuz parents: diff changeset	573
1d019706d866 LLVM10 anatofuz parents: diff changeset	574 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	575
1d019706d866 LLVM10 anatofuz parents: diff changeset	576 // IntRegs Register Class...
1d019706d866 LLVM10 anatofuz parents: diff changeset	577 static const unsigned IntRegs[] = {
1d019706d866 LLVM10 anatofuz parents: diff changeset	578 SP::L0, SP::L1, SP::L2, SP::L3, SP::L4, SP::L5,
1d019706d866 LLVM10 anatofuz parents: diff changeset	579 SP::L6, SP::L7, SP::I0, SP::I1, SP::I2, SP::I3,
1d019706d866 LLVM10 anatofuz parents: diff changeset	580 SP::I4, SP::I5, SP::O0, SP::O1, SP::O2, SP::O3,
1d019706d866 LLVM10 anatofuz parents: diff changeset	581 SP::O4, SP::O5, SP::O7, SP::G1, SP::G2, SP::G3,
1d019706d866 LLVM10 anatofuz parents: diff changeset	582 SP::G4, SP::O6, SP::I6, SP::I7, SP::G0, SP::G5,
1d019706d866 LLVM10 anatofuz parents: diff changeset	583 SP::G6, SP::G7,
1d019706d866 LLVM10 anatofuz parents: diff changeset	584 };
1d019706d866 LLVM10 anatofuz parents: diff changeset	585
1d019706d866 LLVM10 anatofuz parents: diff changeset	586 // IntRegsVTs Register Class Value Types...
1d019706d866 LLVM10 anatofuz parents: diff changeset	587 static const MVT::ValueType IntRegsVTs[] = {
1d019706d866 LLVM10 anatofuz parents: diff changeset	588 MVT::i32, MVT::Other
1d019706d866 LLVM10 anatofuz parents: diff changeset	589 };
1d019706d866 LLVM10 anatofuz parents: diff changeset	590
1d019706d866 LLVM10 anatofuz parents: diff changeset	591 namespace SP { // Register class instances
1d019706d866 LLVM10 anatofuz parents: diff changeset	592 DFPRegsClass DFPRegsRegClass;
1d019706d866 LLVM10 anatofuz parents: diff changeset	593 FPRegsClass FPRegsRegClass;
1d019706d866 LLVM10 anatofuz parents: diff changeset	594 IntRegsClass IntRegsRegClass;
1d019706d866 LLVM10 anatofuz parents: diff changeset	595 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	596 // IntRegs Sub-register Classes...
1d019706d866 LLVM10 anatofuz parents: diff changeset	597 static const TargetRegisterClass* const IntRegsSubRegClasses [] = {
1d019706d866 LLVM10 anatofuz parents: diff changeset	598 NULL
1d019706d866 LLVM10 anatofuz parents: diff changeset	599 };
1d019706d866 LLVM10 anatofuz parents: diff changeset	600 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	601 // IntRegs Super-register Classes..
1d019706d866 LLVM10 anatofuz parents: diff changeset	602 static const TargetRegisterClass* const IntRegsSuperRegClasses [] = {
1d019706d866 LLVM10 anatofuz parents: diff changeset	603 NULL
1d019706d866 LLVM10 anatofuz parents: diff changeset	604 };
1d019706d866 LLVM10 anatofuz parents: diff changeset	605 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	606 // IntRegs Register Class sub-classes...
1d019706d866 LLVM10 anatofuz parents: diff changeset	607 static const TargetRegisterClass* const IntRegsSubclasses [] = {
1d019706d866 LLVM10 anatofuz parents: diff changeset	608 NULL
1d019706d866 LLVM10 anatofuz parents: diff changeset	609 };
1d019706d866 LLVM10 anatofuz parents: diff changeset	610 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	611 // IntRegs Register Class super-classes...
1d019706d866 LLVM10 anatofuz parents: diff changeset	612 static const TargetRegisterClass* const IntRegsSuperclasses [] = {
1d019706d866 LLVM10 anatofuz parents: diff changeset	613 NULL
1d019706d866 LLVM10 anatofuz parents: diff changeset	614 };
1d019706d866 LLVM10 anatofuz parents: diff changeset	615
1d019706d866 LLVM10 anatofuz parents: diff changeset	616 IntRegsClass::IntRegsClass() : TargetRegisterClass(IntRegsRegClassID,
1d019706d866 LLVM10 anatofuz parents: diff changeset	617 IntRegsVTs, IntRegsSubclasses, IntRegsSuperclasses, IntRegsSubRegClasses,
1d019706d866 LLVM10 anatofuz parents: diff changeset	618 IntRegsSuperRegClasses, 4, 4, 1, IntRegs, IntRegs + 32) {}
1d019706d866 LLVM10 anatofuz parents: diff changeset	619 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	620
1d019706d866 LLVM10 anatofuz parents: diff changeset	621 The register allocators will avoid using reserved registers, and callee saved
1d019706d866 LLVM10 anatofuz parents: diff changeset	622 registers are not used until all the volatile registers have been used. That
1d019706d866 LLVM10 anatofuz parents: diff changeset	623 is usually good enough, but in some cases it may be necessary to provide custom
1d019706d866 LLVM10 anatofuz parents: diff changeset	624 allocation orders.
1d019706d866 LLVM10 anatofuz parents: diff changeset	625
1d019706d866 LLVM10 anatofuz parents: diff changeset	626 Implement a subclass of ``TargetRegisterInfo``
1d019706d866 LLVM10 anatofuz parents: diff changeset	627 ----------------------------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	628
1d019706d866 LLVM10 anatofuz parents: diff changeset	629 The final step is to hand code portions of ``XXXRegisterInfo``, which
1d019706d866 LLVM10 anatofuz parents: diff changeset	630 implements the interface described in ``TargetRegisterInfo.h`` (see
1d019706d866 LLVM10 anatofuz parents: diff changeset	631 :ref:`TargetRegisterInfo`). These functions return ``0``, ``NULL``, or
1d019706d866 LLVM10 anatofuz parents: diff changeset	632 ``false``, unless overridden. Here is a list of functions that are overridden
1d019706d866 LLVM10 anatofuz parents: diff changeset	633 for the SPARC implementation in ``SparcRegisterInfo.cpp``:
1d019706d866 LLVM10 anatofuz parents: diff changeset	634
1d019706d866 LLVM10 anatofuz parents: diff changeset	635 * ``getCalleeSavedRegs`` --- Returns a list of callee-saved registers in the
1d019706d866 LLVM10 anatofuz parents: diff changeset	636 order of the desired callee-save stack frame offset.
1d019706d866 LLVM10 anatofuz parents: diff changeset	637
1d019706d866 LLVM10 anatofuz parents: diff changeset	638 * ``getReservedRegs`` --- Returns a bitset indexed by physical register
1d019706d866 LLVM10 anatofuz parents: diff changeset	639 numbers, indicating if a particular register is unavailable.
1d019706d866 LLVM10 anatofuz parents: diff changeset	640
1d019706d866 LLVM10 anatofuz parents: diff changeset	641 * ``hasFP`` --- Return a Boolean indicating if a function should have a
1d019706d866 LLVM10 anatofuz parents: diff changeset	642 dedicated frame pointer register.
1d019706d866 LLVM10 anatofuz parents: diff changeset	643
1d019706d866 LLVM10 anatofuz parents: diff changeset	644 * ``eliminateCallFramePseudoInstr`` --- If call frame setup or destroy pseudo
1d019706d866 LLVM10 anatofuz parents: diff changeset	645 instructions are used, this can be called to eliminate them.
1d019706d866 LLVM10 anatofuz parents: diff changeset	646
1d019706d866 LLVM10 anatofuz parents: diff changeset	647 * ``eliminateFrameIndex`` --- Eliminate abstract frame indices from
1d019706d866 LLVM10 anatofuz parents: diff changeset	648 instructions that may use them.
1d019706d866 LLVM10 anatofuz parents: diff changeset	649
1d019706d866 LLVM10 anatofuz parents: diff changeset	650 * ``emitPrologue`` --- Insert prologue code into the function.
1d019706d866 LLVM10 anatofuz parents: diff changeset	651
1d019706d866 LLVM10 anatofuz parents: diff changeset	652 * ``emitEpilogue`` --- Insert epilogue code into the function.
1d019706d866 LLVM10 anatofuz parents: diff changeset	653
1d019706d866 LLVM10 anatofuz parents: diff changeset	654 .. _instruction-set:
1d019706d866 LLVM10 anatofuz parents: diff changeset	655
1d019706d866 LLVM10 anatofuz parents: diff changeset	656 Instruction Set
1d019706d866 LLVM10 anatofuz parents: diff changeset	657 ===============
1d019706d866 LLVM10 anatofuz parents: diff changeset	658
1d019706d866 LLVM10 anatofuz parents: diff changeset	659 During the early stages of code generation, the LLVM IR code is converted to a
1d019706d866 LLVM10 anatofuz parents: diff changeset	660 ``SelectionDAG`` with nodes that are instances of the ``SDNode`` class
1d019706d866 LLVM10 anatofuz parents: diff changeset	661 containing target instructions. An ``SDNode`` has an opcode, operands, type
1d019706d866 LLVM10 anatofuz parents: diff changeset	662 requirements, and operation properties. For example, is an operation
1d019706d866 LLVM10 anatofuz parents: diff changeset	663 commutative, does an operation load from memory. The various operation node
1d019706d866 LLVM10 anatofuz parents: diff changeset	664 types are described in the ``include/llvm/CodeGen/SelectionDAGNodes.h`` file
1d019706d866 LLVM10 anatofuz parents: diff changeset	665 (values of the ``NodeType`` enum in the ``ISD`` namespace).
1d019706d866 LLVM10 anatofuz parents: diff changeset	666
1d019706d866 LLVM10 anatofuz parents: diff changeset	667 TableGen uses the following target description (``.td``) input files to
1d019706d866 LLVM10 anatofuz parents: diff changeset	668 generate much of the code for instruction definition:
1d019706d866 LLVM10 anatofuz parents: diff changeset	669
1d019706d866 LLVM10 anatofuz parents: diff changeset	670 * ``Target.td`` --- Where the ``Instruction``, ``Operand``, ``InstrInfo``, and
1d019706d866 LLVM10 anatofuz parents: diff changeset	671 other fundamental classes are defined.
1d019706d866 LLVM10 anatofuz parents: diff changeset	672
1d019706d866 LLVM10 anatofuz parents: diff changeset	673 * ``TargetSelectionDAG.td`` --- Used by ``SelectionDAG`` instruction selection
1d019706d866 LLVM10 anatofuz parents: diff changeset	674 generators, contains ``SDTC*`` classes (selection DAG type constraint),
1d019706d866 LLVM10 anatofuz parents: diff changeset	675 definitions of ``SelectionDAG`` nodes (such as ``imm``, ``cond``, ``bb``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	676 ``add``, ``fadd``, ``sub``), and pattern support (``Pattern``, ``Pat``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	677 ``PatFrag``, ``PatLeaf``, ``ComplexPattern``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	678
1d019706d866 LLVM10 anatofuz parents: diff changeset	679 * ``XXXInstrFormats.td`` --- Patterns for definitions of target-specific
1d019706d866 LLVM10 anatofuz parents: diff changeset	680 instructions.
1d019706d866 LLVM10 anatofuz parents: diff changeset	681
1d019706d866 LLVM10 anatofuz parents: diff changeset	682 * ``XXXInstrInfo.td`` --- Target-specific definitions of instruction templates,
1d019706d866 LLVM10 anatofuz parents: diff changeset	683 condition codes, and instructions of an instruction set. For architecture
1d019706d866 LLVM10 anatofuz parents: diff changeset	684 modifications, a different file name may be used. For example, for Pentium
1d019706d866 LLVM10 anatofuz parents: diff changeset	685 with SSE instruction, this file is ``X86InstrSSE.td``, and for Pentium with
1d019706d866 LLVM10 anatofuz parents: diff changeset	686 MMX, this file is ``X86InstrMMX.td``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	687
1d019706d866 LLVM10 anatofuz parents: diff changeset	688 There is also a target-specific ``XXX.td`` file, where ``XXX`` is the name of
1d019706d866 LLVM10 anatofuz parents: diff changeset	689 the target. The ``XXX.td`` file includes the other ``.td`` input files, but
1d019706d866 LLVM10 anatofuz parents: diff changeset	690 its contents are only directly important for subtargets.
1d019706d866 LLVM10 anatofuz parents: diff changeset	691
1d019706d866 LLVM10 anatofuz parents: diff changeset	692 You should describe a concrete target-specific class ``XXXInstrInfo`` that
1d019706d866 LLVM10 anatofuz parents: diff changeset	693 represents machine instructions supported by a target machine.
1d019706d866 LLVM10 anatofuz parents: diff changeset	694 ``XXXInstrInfo`` contains an array of ``XXXInstrDescriptor`` objects, each of
1d019706d866 LLVM10 anatofuz parents: diff changeset	695 which describes one instruction. An instruction descriptor defines:
1d019706d866 LLVM10 anatofuz parents: diff changeset	696
1d019706d866 LLVM10 anatofuz parents: diff changeset	697 * Opcode mnemonic
1d019706d866 LLVM10 anatofuz parents: diff changeset	698 * Number of operands
1d019706d866 LLVM10 anatofuz parents: diff changeset	699 * List of implicit register definitions and uses
1d019706d866 LLVM10 anatofuz parents: diff changeset	700 * Target-independent properties (such as memory access, is commutable)
1d019706d866 LLVM10 anatofuz parents: diff changeset	701 * Target-specific flags
1d019706d866 LLVM10 anatofuz parents: diff changeset	702
1d019706d866 LLVM10 anatofuz parents: diff changeset	703 The Instruction class (defined in ``Target.td``) is mostly used as a base for
1d019706d866 LLVM10 anatofuz parents: diff changeset	704 more complex instruction classes.
1d019706d866 LLVM10 anatofuz parents: diff changeset	705
1d019706d866 LLVM10 anatofuz parents: diff changeset	706 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	707
1d019706d866 LLVM10 anatofuz parents: diff changeset	708 class Instruction {
1d019706d866 LLVM10 anatofuz parents: diff changeset	709 string Namespace = "";
1d019706d866 LLVM10 anatofuz parents: diff changeset	710 dag OutOperandList; // A dag containing the MI def operand list.
1d019706d866 LLVM10 anatofuz parents: diff changeset	711 dag InOperandList; // A dag containing the MI use operand list.
1d019706d866 LLVM10 anatofuz parents: diff changeset	712 string AsmString = ""; // The .s format to print the instruction with.
1d019706d866 LLVM10 anatofuz parents: diff changeset	713 list<dag> Pattern; // Set to the DAG pattern for this instruction.
1d019706d866 LLVM10 anatofuz parents: diff changeset	714 list<Register> Uses = [];
1d019706d866 LLVM10 anatofuz parents: diff changeset	715 list<Register> Defs = [];
1d019706d866 LLVM10 anatofuz parents: diff changeset	716 list<Predicate> Predicates = []; // predicates turned into isel match code
1d019706d866 LLVM10 anatofuz parents: diff changeset	717 ... remainder not shown for space ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	718 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	719
1d019706d866 LLVM10 anatofuz parents: diff changeset	720 A ``SelectionDAG`` node (``SDNode``) should contain an object representing a
1d019706d866 LLVM10 anatofuz parents: diff changeset	721 target-specific instruction that is defined in ``XXXInstrInfo.td``. The
1d019706d866 LLVM10 anatofuz parents: diff changeset	722 instruction objects should represent instructions from the architecture manual
1d019706d866 LLVM10 anatofuz parents: diff changeset	723 of the target machine (such as the SPARC Architecture Manual for the SPARC
1d019706d866 LLVM10 anatofuz parents: diff changeset	724 target).
1d019706d866 LLVM10 anatofuz parents: diff changeset	725
1d019706d866 LLVM10 anatofuz parents: diff changeset	726 A single instruction from the architecture manual is often modeled as multiple
1d019706d866 LLVM10 anatofuz parents: diff changeset	727 target instructions, depending upon its operands. For example, a manual might
1d019706d866 LLVM10 anatofuz parents: diff changeset	728 describe an add instruction that takes a register or an immediate operand. An
1d019706d866 LLVM10 anatofuz parents: diff changeset	729 LLVM target could model this with two instructions named ``ADDri`` and
1d019706d866 LLVM10 anatofuz parents: diff changeset	730 ``ADDrr``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	731
1d019706d866 LLVM10 anatofuz parents: diff changeset	732 You should define a class for each instruction category and define each opcode
1d019706d866 LLVM10 anatofuz parents: diff changeset	733 as a subclass of the category with appropriate parameters such as the fixed
1d019706d866 LLVM10 anatofuz parents: diff changeset	734 binary encoding of opcodes and extended opcodes. You should map the register
1d019706d866 LLVM10 anatofuz parents: diff changeset	735 bits to the bits of the instruction in which they are encoded (for the JIT).
1d019706d866 LLVM10 anatofuz parents: diff changeset	736 Also you should specify how the instruction should be printed when the
1d019706d866 LLVM10 anatofuz parents: diff changeset	737 automatic assembly printer is used.
1d019706d866 LLVM10 anatofuz parents: diff changeset	738
1d019706d866 LLVM10 anatofuz parents: diff changeset	739 As is described in the SPARC Architecture Manual, Version 8, there are three
1d019706d866 LLVM10 anatofuz parents: diff changeset	740 major 32-bit formats for instructions. Format 1 is only for the ``CALL``
1d019706d866 LLVM10 anatofuz parents: diff changeset	741 instruction. Format 2 is for branch on condition codes and ``SETHI`` (set high
1d019706d866 LLVM10 anatofuz parents: diff changeset	742 bits of a register) instructions. Format 3 is for other instructions.
1d019706d866 LLVM10 anatofuz parents: diff changeset	743
1d019706d866 LLVM10 anatofuz parents: diff changeset	744 Each of these formats has corresponding classes in ``SparcInstrFormat.td``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	745 ``InstSP`` is a base class for other instruction classes. Additional base
1d019706d866 LLVM10 anatofuz parents: diff changeset	746 classes are specified for more precise formats: for example in
1d019706d866 LLVM10 anatofuz parents: diff changeset	747 ``SparcInstrFormat.td``, ``F2_1`` is for ``SETHI``, and ``F2_2`` is for
1d019706d866 LLVM10 anatofuz parents: diff changeset	748 branches. There are three other base classes: ``F3_1`` for register/register
1d019706d866 LLVM10 anatofuz parents: diff changeset	749 operations, ``F3_2`` for register/immediate operations, and ``F3_3`` for
1d019706d866 LLVM10 anatofuz parents: diff changeset	750 floating-point operations. ``SparcInstrInfo.td`` also adds the base class
1d019706d866 LLVM10 anatofuz parents: diff changeset	751 ``Pseudo`` for synthetic SPARC instructions.
1d019706d866 LLVM10 anatofuz parents: diff changeset	752
1d019706d866 LLVM10 anatofuz parents: diff changeset	753 ``SparcInstrInfo.td`` largely consists of operand and instruction definitions
1d019706d866 LLVM10 anatofuz parents: diff changeset	754 for the SPARC target. In ``SparcInstrInfo.td``, the following target
1d019706d866 LLVM10 anatofuz parents: diff changeset	755 description file entry, ``LDrr``, defines the Load Integer instruction for a
1d019706d866 LLVM10 anatofuz parents: diff changeset	756 Word (the ``LD`` SPARC opcode) from a memory address to a register. The first
1d019706d866 LLVM10 anatofuz parents: diff changeset	757 parameter, the value 3 (``11``\ :sub:`2`), is the operation value for this
1d019706d866 LLVM10 anatofuz parents: diff changeset	758 category of operation. The second parameter (``000000``\ :sub:`2`) is the
1d019706d866 LLVM10 anatofuz parents: diff changeset	759 specific operation value for ``LD``/Load Word. The third parameter is the
1d019706d866 LLVM10 anatofuz parents: diff changeset	760 output destination, which is a register operand and defined in the ``Register``
1d019706d866 LLVM10 anatofuz parents: diff changeset	761 target description file (``IntRegs``).
1d019706d866 LLVM10 anatofuz parents: diff changeset	762
1d019706d866 LLVM10 anatofuz parents: diff changeset	763 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	764
1d019706d866 LLVM10 anatofuz parents: diff changeset	765 def LDrr : F3_1 <3, 0b000000, (outs IntRegs:$dst), (ins MEMrr:$addr),
1d019706d866 LLVM10 anatofuz parents: diff changeset	766 "ld [$addr], $dst",
1d019706d866 LLVM10 anatofuz parents: diff changeset	767 [(set i32:$dst, (load ADDRrr:$addr))]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	768
1d019706d866 LLVM10 anatofuz parents: diff changeset	769 The fourth parameter is the input source, which uses the address operand
1d019706d866 LLVM10 anatofuz parents: diff changeset	770 ``MEMrr`` that is defined earlier in ``SparcInstrInfo.td``:
1d019706d866 LLVM10 anatofuz parents: diff changeset	771
1d019706d866 LLVM10 anatofuz parents: diff changeset	772 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	773
1d019706d866 LLVM10 anatofuz parents: diff changeset	774 def MEMrr : Operand<i32> {
1d019706d866 LLVM10 anatofuz parents: diff changeset	775 let PrintMethod = "printMemOperand";
1d019706d866 LLVM10 anatofuz parents: diff changeset	776 let MIOperandInfo = (ops IntRegs, IntRegs);
1d019706d866 LLVM10 anatofuz parents: diff changeset	777 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	778
1d019706d866 LLVM10 anatofuz parents: diff changeset	779 The fifth parameter is a string that is used by the assembly printer and can be
1d019706d866 LLVM10 anatofuz parents: diff changeset	780 left as an empty string until the assembly printer interface is implemented.
1d019706d866 LLVM10 anatofuz parents: diff changeset	781 The sixth and final parameter is the pattern used to match the instruction
1d019706d866 LLVM10 anatofuz parents: diff changeset	782 during the SelectionDAG Select Phase described in :doc:`CodeGenerator`.
1d019706d866 LLVM10 anatofuz parents: diff changeset	783 This parameter is detailed in the next section, :ref:`instruction-selector`.
1d019706d866 LLVM10 anatofuz parents: diff changeset	784
1d019706d866 LLVM10 anatofuz parents: diff changeset	785 Instruction class definitions are not overloaded for different operand types,
1d019706d866 LLVM10 anatofuz parents: diff changeset	786 so separate versions of instructions are needed for register, memory, or
1d019706d866 LLVM10 anatofuz parents: diff changeset	787 immediate value operands. For example, to perform a Load Integer instruction
1d019706d866 LLVM10 anatofuz parents: diff changeset	788 for a Word from an immediate operand to a register, the following instruction
1d019706d866 LLVM10 anatofuz parents: diff changeset	789 class is defined:
1d019706d866 LLVM10 anatofuz parents: diff changeset	790
1d019706d866 LLVM10 anatofuz parents: diff changeset	791 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	792
1d019706d866 LLVM10 anatofuz parents: diff changeset	793 def LDri : F3_2 <3, 0b000000, (outs IntRegs:$dst), (ins MEMri:$addr),
1d019706d866 LLVM10 anatofuz parents: diff changeset	794 "ld [$addr], $dst",
1d019706d866 LLVM10 anatofuz parents: diff changeset	795 [(set i32:$dst, (load ADDRri:$addr))]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	796
1d019706d866 LLVM10 anatofuz parents: diff changeset	797 Writing these definitions for so many similar instructions can involve a lot of
1d019706d866 LLVM10 anatofuz parents: diff changeset	798 cut and paste. In ``.td`` files, the ``multiclass`` directive enables the
1d019706d866 LLVM10 anatofuz parents: diff changeset	799 creation of templates to define several instruction classes at once (using the
1d019706d866 LLVM10 anatofuz parents: diff changeset	800 ``defm`` directive). For example in ``SparcInstrInfo.td``, the ``multiclass``
1d019706d866 LLVM10 anatofuz parents: diff changeset	801 pattern ``F3_12`` is defined to create 2 instruction classes each time
1d019706d866 LLVM10 anatofuz parents: diff changeset	802 ``F3_12`` is invoked:
1d019706d866 LLVM10 anatofuz parents: diff changeset	803
1d019706d866 LLVM10 anatofuz parents: diff changeset	804 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	805
1d019706d866 LLVM10 anatofuz parents: diff changeset	806 multiclass F3_12 <string OpcStr, bits<6> Op3Val, SDNode OpNode> {
1d019706d866 LLVM10 anatofuz parents: diff changeset	807 def rr : F3_1 <2, Op3Val,
1d019706d866 LLVM10 anatofuz parents: diff changeset	808 (outs IntRegs:$dst), (ins IntRegs:$b, IntRegs:$c),
1d019706d866 LLVM10 anatofuz parents: diff changeset	809 !strconcat(OpcStr, " $b, $c, $dst"),
1d019706d866 LLVM10 anatofuz parents: diff changeset	810 [(set i32:$dst, (OpNode i32:$b, i32:$c))]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	811 def ri : F3_2 <2, Op3Val,
1d019706d866 LLVM10 anatofuz parents: diff changeset	812 (outs IntRegs:$dst), (ins IntRegs:$b, i32imm:$c),
1d019706d866 LLVM10 anatofuz parents: diff changeset	813 !strconcat(OpcStr, " $b, $c, $dst"),
1d019706d866 LLVM10 anatofuz parents: diff changeset	814 [(set i32:$dst, (OpNode i32:$b, simm13:$c))]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	815 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	816
1d019706d866 LLVM10 anatofuz parents: diff changeset	817 So when the ``defm`` directive is used for the ``XOR`` and ``ADD``
1d019706d866 LLVM10 anatofuz parents: diff changeset	818 instructions, as seen below, it creates four instruction objects: ``XORrr``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	819 ``XORri``, ``ADDrr``, and ``ADDri``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	820
1d019706d866 LLVM10 anatofuz parents: diff changeset	821 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	822
1d019706d866 LLVM10 anatofuz parents: diff changeset	823 defm XOR : F3_12<"xor", 0b000011, xor>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	824 defm ADD : F3_12<"add", 0b000000, add>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	825
1d019706d866 LLVM10 anatofuz parents: diff changeset	826 ``SparcInstrInfo.td`` also includes definitions for condition codes that are
1d019706d866 LLVM10 anatofuz parents: diff changeset	827 referenced by branch instructions. The following definitions in
1d019706d866 LLVM10 anatofuz parents: diff changeset	828 ``SparcInstrInfo.td`` indicate the bit location of the SPARC condition code.
1d019706d866 LLVM10 anatofuz parents: diff changeset	829 For example, the 10\ :sup:`th` bit represents the "greater than" condition for
1d019706d866 LLVM10 anatofuz parents: diff changeset	830 integers, and the 22\ :sup:`nd` bit represents the "greater than" condition for
1d019706d866 LLVM10 anatofuz parents: diff changeset	831 floats.
1d019706d866 LLVM10 anatofuz parents: diff changeset	832
1d019706d866 LLVM10 anatofuz parents: diff changeset	833 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	834
1d019706d866 LLVM10 anatofuz parents: diff changeset	835 def ICC_NE : ICC_VAL< 9>; // Not Equal
1d019706d866 LLVM10 anatofuz parents: diff changeset	836 def ICC_E : ICC_VAL< 1>; // Equal
1d019706d866 LLVM10 anatofuz parents: diff changeset	837 def ICC_G : ICC_VAL<10>; // Greater
1d019706d866 LLVM10 anatofuz parents: diff changeset	838 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	839 def FCC_U : FCC_VAL<23>; // Unordered
1d019706d866 LLVM10 anatofuz parents: diff changeset	840 def FCC_G : FCC_VAL<22>; // Greater
1d019706d866 LLVM10 anatofuz parents: diff changeset	841 def FCC_UG : FCC_VAL<21>; // Unordered or Greater
1d019706d866 LLVM10 anatofuz parents: diff changeset	842 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	843
1d019706d866 LLVM10 anatofuz parents: diff changeset	844 (Note that ``Sparc.h`` also defines enums that correspond to the same SPARC
1d019706d866 LLVM10 anatofuz parents: diff changeset	845 condition codes. Care must be taken to ensure the values in ``Sparc.h``
1d019706d866 LLVM10 anatofuz parents: diff changeset	846 correspond to the values in ``SparcInstrInfo.td``. I.e., ``SPCC::ICC_NE = 9``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	847 ``SPCC::FCC_U = 23`` and so on.)
1d019706d866 LLVM10 anatofuz parents: diff changeset	848
1d019706d866 LLVM10 anatofuz parents: diff changeset	849 Instruction Operand Mapping
1d019706d866 LLVM10 anatofuz parents: diff changeset	850 ---------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	851
1d019706d866 LLVM10 anatofuz parents: diff changeset	852 The code generator backend maps instruction operands to fields in the
1d019706d866 LLVM10 anatofuz parents: diff changeset	853 instruction. Operands are assigned to unbound fields in the instruction in the
1d019706d866 LLVM10 anatofuz parents: diff changeset	854 order they are defined. Fields are bound when they are assigned a value. For
1d019706d866 LLVM10 anatofuz parents: diff changeset	855 example, the Sparc target defines the ``XNORrr`` instruction as a ``F3_1``
1d019706d866 LLVM10 anatofuz parents: diff changeset	856 format instruction having three operands.
1d019706d866 LLVM10 anatofuz parents: diff changeset	857
1d019706d866 LLVM10 anatofuz parents: diff changeset	858 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	859
1d019706d866 LLVM10 anatofuz parents: diff changeset	860 def XNORrr : F3_1<2, 0b000111,
1d019706d866 LLVM10 anatofuz parents: diff changeset	861 (outs IntRegs:$dst), (ins IntRegs:$b, IntRegs:$c),
1d019706d866 LLVM10 anatofuz parents: diff changeset	862 "xnor $b, $c, $dst",
1d019706d866 LLVM10 anatofuz parents: diff changeset	863 [(set i32:$dst, (not (xor i32:$b, i32:$c)))]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	864
1d019706d866 LLVM10 anatofuz parents: diff changeset	865 The instruction templates in ``SparcInstrFormats.td`` show the base class for
1d019706d866 LLVM10 anatofuz parents: diff changeset	866 ``F3_1`` is ``InstSP``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	867
1d019706d866 LLVM10 anatofuz parents: diff changeset	868 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	869
1d019706d866 LLVM10 anatofuz parents: diff changeset	870 class InstSP<dag outs, dag ins, string asmstr, list<dag> pattern> : Instruction {
1d019706d866 LLVM10 anatofuz parents: diff changeset	871 field bits<32> Inst;
1d019706d866 LLVM10 anatofuz parents: diff changeset	872 let Namespace = "SP";
1d019706d866 LLVM10 anatofuz parents: diff changeset	873 bits<2> op;
1d019706d866 LLVM10 anatofuz parents: diff changeset	874 let Inst{31-30} = op;
1d019706d866 LLVM10 anatofuz parents: diff changeset	875 dag OutOperandList = outs;
1d019706d866 LLVM10 anatofuz parents: diff changeset	876 dag InOperandList = ins;
1d019706d866 LLVM10 anatofuz parents: diff changeset	877 let AsmString = asmstr;
1d019706d866 LLVM10 anatofuz parents: diff changeset	878 let Pattern = pattern;
1d019706d866 LLVM10 anatofuz parents: diff changeset	879 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	880
1d019706d866 LLVM10 anatofuz parents: diff changeset	881 ``InstSP`` leaves the ``op`` field unbound.
1d019706d866 LLVM10 anatofuz parents: diff changeset	882
1d019706d866 LLVM10 anatofuz parents: diff changeset	883 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	884
1d019706d866 LLVM10 anatofuz parents: diff changeset	885 class F3<dag outs, dag ins, string asmstr, list<dag> pattern>
1d019706d866 LLVM10 anatofuz parents: diff changeset	886 : InstSP<outs, ins, asmstr, pattern> {
1d019706d866 LLVM10 anatofuz parents: diff changeset	887 bits<5> rd;
1d019706d866 LLVM10 anatofuz parents: diff changeset	888 bits<6> op3;
1d019706d866 LLVM10 anatofuz parents: diff changeset	889 bits<5> rs1;
1d019706d866 LLVM10 anatofuz parents: diff changeset	890 let op{1} = 1; // Op = 2 or 3
1d019706d866 LLVM10 anatofuz parents: diff changeset	891 let Inst{29-25} = rd;
1d019706d866 LLVM10 anatofuz parents: diff changeset	892 let Inst{24-19} = op3;
1d019706d866 LLVM10 anatofuz parents: diff changeset	893 let Inst{18-14} = rs1;
1d019706d866 LLVM10 anatofuz parents: diff changeset	894 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	895
1d019706d866 LLVM10 anatofuz parents: diff changeset	896 ``F3`` binds the ``op`` field and defines the ``rd``, ``op3``, and ``rs1``
1d019706d866 LLVM10 anatofuz parents: diff changeset	897 fields. ``F3`` format instructions will bind the operands ``rd``, ``op3``, and
1d019706d866 LLVM10 anatofuz parents: diff changeset	898 ``rs1`` fields.
1d019706d866 LLVM10 anatofuz parents: diff changeset	899
1d019706d866 LLVM10 anatofuz parents: diff changeset	900 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	901
1d019706d866 LLVM10 anatofuz parents: diff changeset	902 class F3_1<bits<2> opVal, bits<6> op3val, dag outs, dag ins,
1d019706d866 LLVM10 anatofuz parents: diff changeset	903 string asmstr, list<dag> pattern> : F3<outs, ins, asmstr, pattern> {
1d019706d866 LLVM10 anatofuz parents: diff changeset	904 bits<8> asi = 0; // asi not currently used
1d019706d866 LLVM10 anatofuz parents: diff changeset	905 bits<5> rs2;
1d019706d866 LLVM10 anatofuz parents: diff changeset	906 let op = opVal;
1d019706d866 LLVM10 anatofuz parents: diff changeset	907 let op3 = op3val;
1d019706d866 LLVM10 anatofuz parents: diff changeset	908 let Inst{13} = 0; // i field = 0
1d019706d866 LLVM10 anatofuz parents: diff changeset	909 let Inst{12-5} = asi; // address space identifier
1d019706d866 LLVM10 anatofuz parents: diff changeset	910 let Inst{4-0} = rs2;
1d019706d866 LLVM10 anatofuz parents: diff changeset	911 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	912
1d019706d866 LLVM10 anatofuz parents: diff changeset	913 ``F3_1`` binds the ``op3`` field and defines the ``rs2`` fields. ``F3_1``
1d019706d866 LLVM10 anatofuz parents: diff changeset	914 format instructions will bind the operands to the ``rd``, ``rs1``, and ``rs2``
1d019706d866 LLVM10 anatofuz parents: diff changeset	915 fields. This results in the ``XNORrr`` instruction binding ``$dst``, ``$b``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	916 and ``$c`` operands to the ``rd``, ``rs1``, and ``rs2`` fields respectively.
1d019706d866 LLVM10 anatofuz parents: diff changeset	917
1d019706d866 LLVM10 anatofuz parents: diff changeset	918 Instruction Operand Name Mapping
1d019706d866 LLVM10 anatofuz parents: diff changeset	919 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
1d019706d866 LLVM10 anatofuz parents: diff changeset	920
1d019706d866 LLVM10 anatofuz parents: diff changeset	921 TableGen will also generate a function called getNamedOperandIdx() which
1d019706d866 LLVM10 anatofuz parents: diff changeset	922 can be used to look up an operand's index in a MachineInstr based on its
1d019706d866 LLVM10 anatofuz parents: diff changeset	923 TableGen name. Setting the UseNamedOperandTable bit in an instruction's
1d019706d866 LLVM10 anatofuz parents: diff changeset	924 TableGen definition will add all of its operands to an enumeration in the
1d019706d866 LLVM10 anatofuz parents: diff changeset	925 llvm::XXX:OpName namespace and also add an entry for it into the OperandMap
1d019706d866 LLVM10 anatofuz parents: diff changeset	926 table, which can be queried using getNamedOperandIdx()
1d019706d866 LLVM10 anatofuz parents: diff changeset	927
1d019706d866 LLVM10 anatofuz parents: diff changeset	928 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	929
1d019706d866 LLVM10 anatofuz parents: diff changeset	930 int DstIndex = SP::getNamedOperandIdx(SP::XNORrr, SP::OpName::dst); // => 0
1d019706d866 LLVM10 anatofuz parents: diff changeset	931 int BIndex = SP::getNamedOperandIdx(SP::XNORrr, SP::OpName::b); // => 1
1d019706d866 LLVM10 anatofuz parents: diff changeset	932 int CIndex = SP::getNamedOperandIdx(SP::XNORrr, SP::OpName::c); // => 2
1d019706d866 LLVM10 anatofuz parents: diff changeset	933 int DIndex = SP::getNamedOperandIdx(SP::XNORrr, SP::OpName::d); // => -1
1d019706d866 LLVM10 anatofuz parents: diff changeset	934
1d019706d866 LLVM10 anatofuz parents: diff changeset	935 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	936
1d019706d866 LLVM10 anatofuz parents: diff changeset	937 The entries in the OpName enum are taken verbatim from the TableGen definitions,
1d019706d866 LLVM10 anatofuz parents: diff changeset	938 so operands with lowercase names will have lower case entries in the enum.
1d019706d866 LLVM10 anatofuz parents: diff changeset	939
1d019706d866 LLVM10 anatofuz parents: diff changeset	940 To include the getNamedOperandIdx() function in your backend, you will need
1d019706d866 LLVM10 anatofuz parents: diff changeset	941 to define a few preprocessor macros in XXXInstrInfo.cpp and XXXInstrInfo.h.
1d019706d866 LLVM10 anatofuz parents: diff changeset	942 For example:
1d019706d866 LLVM10 anatofuz parents: diff changeset	943
1d019706d866 LLVM10 anatofuz parents: diff changeset	944 XXXInstrInfo.cpp:
1d019706d866 LLVM10 anatofuz parents: diff changeset	945
1d019706d866 LLVM10 anatofuz parents: diff changeset	946 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	947
1d019706d866 LLVM10 anatofuz parents: diff changeset	948 #define GET_INSTRINFO_NAMED_OPS // For getNamedOperandIdx() function
1d019706d866 LLVM10 anatofuz parents: diff changeset	949 #include "XXXGenInstrInfo.inc"
1d019706d866 LLVM10 anatofuz parents: diff changeset	950
1d019706d866 LLVM10 anatofuz parents: diff changeset	951 XXXInstrInfo.h:
1d019706d866 LLVM10 anatofuz parents: diff changeset	952
1d019706d866 LLVM10 anatofuz parents: diff changeset	953 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	954
1d019706d866 LLVM10 anatofuz parents: diff changeset	955 #define GET_INSTRINFO_OPERAND_ENUM // For OpName enum
1d019706d866 LLVM10 anatofuz parents: diff changeset	956 #include "XXXGenInstrInfo.inc"
1d019706d866 LLVM10 anatofuz parents: diff changeset	957
1d019706d866 LLVM10 anatofuz parents: diff changeset	958 namespace XXX {
1d019706d866 LLVM10 anatofuz parents: diff changeset	959 int16_t getNamedOperandIdx(uint16_t Opcode, uint16_t NamedIndex);
1d019706d866 LLVM10 anatofuz parents: diff changeset	960 } // End namespace XXX
1d019706d866 LLVM10 anatofuz parents: diff changeset	961
1d019706d866 LLVM10 anatofuz parents: diff changeset	962 Instruction Operand Types
1d019706d866 LLVM10 anatofuz parents: diff changeset	963 ^^^^^^^^^^^^^^^^^^^^^^^^^
1d019706d866 LLVM10 anatofuz parents: diff changeset	964
1d019706d866 LLVM10 anatofuz parents: diff changeset	965 TableGen will also generate an enumeration consisting of all named Operand
1d019706d866 LLVM10 anatofuz parents: diff changeset	966 types defined in the backend, in the llvm::XXX::OpTypes namespace.
1d019706d866 LLVM10 anatofuz parents: diff changeset	967 Some common immediate Operand types (for instance i8, i32, i64, f32, f64)
1d019706d866 LLVM10 anatofuz parents: diff changeset	968 are defined for all targets in ``include/llvm/Target/Target.td``, and are
1d019706d866 LLVM10 anatofuz parents: diff changeset	969 available in each Target's OpTypes enum. Also, only named Operand types appear
1d019706d866 LLVM10 anatofuz parents: diff changeset	970 in the enumeration: anonymous types are ignored.
1d019706d866 LLVM10 anatofuz parents: diff changeset	971 For example, the X86 backend defines ``brtarget`` and ``brtarget8``, both
1d019706d866 LLVM10 anatofuz parents: diff changeset	972 instances of the TableGen ``Operand`` class, which represent branch target
1d019706d866 LLVM10 anatofuz parents: diff changeset	973 operands:
1d019706d866 LLVM10 anatofuz parents: diff changeset	974
1d019706d866 LLVM10 anatofuz parents: diff changeset	975 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	976
1d019706d866 LLVM10 anatofuz parents: diff changeset	977 def brtarget : Operand<OtherVT>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	978 def brtarget8 : Operand<OtherVT>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	979
1d019706d866 LLVM10 anatofuz parents: diff changeset	980 This results in:
1d019706d866 LLVM10 anatofuz parents: diff changeset	981
1d019706d866 LLVM10 anatofuz parents: diff changeset	982 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	983
1d019706d866 LLVM10 anatofuz parents: diff changeset	984 namespace X86 {
1d019706d866 LLVM10 anatofuz parents: diff changeset	985 namespace OpTypes {
1d019706d866 LLVM10 anatofuz parents: diff changeset	986 enum OperandType {
1d019706d866 LLVM10 anatofuz parents: diff changeset	987 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	988 brtarget,
1d019706d866 LLVM10 anatofuz parents: diff changeset	989 brtarget8,
1d019706d866 LLVM10 anatofuz parents: diff changeset	990 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	991 i32imm,
1d019706d866 LLVM10 anatofuz parents: diff changeset	992 i64imm,
1d019706d866 LLVM10 anatofuz parents: diff changeset	993 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	994 OPERAND_TYPE_LIST_END
1d019706d866 LLVM10 anatofuz parents: diff changeset	995 } // End namespace OpTypes
1d019706d866 LLVM10 anatofuz parents: diff changeset	996 } // End namespace X86
1d019706d866 LLVM10 anatofuz parents: diff changeset	997
1d019706d866 LLVM10 anatofuz parents: diff changeset	998 In typical TableGen fashion, to use the enum, you will need to define a
1d019706d866 LLVM10 anatofuz parents: diff changeset	999 preprocessor macro:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1000
1d019706d866 LLVM10 anatofuz parents: diff changeset	1001 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1002
1d019706d866 LLVM10 anatofuz parents: diff changeset	1003 #define GET_INSTRINFO_OPERAND_TYPES_ENUM // For OpTypes enum
1d019706d866 LLVM10 anatofuz parents: diff changeset	1004 #include "XXXGenInstrInfo.inc"
1d019706d866 LLVM10 anatofuz parents: diff changeset	1005
1d019706d866 LLVM10 anatofuz parents: diff changeset	1006
1d019706d866 LLVM10 anatofuz parents: diff changeset	1007 Instruction Scheduling
1d019706d866 LLVM10 anatofuz parents: diff changeset	1008 ----------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	1009
1d019706d866 LLVM10 anatofuz parents: diff changeset	1010 Instruction itineraries can be queried using MCDesc::getSchedClass(). The
1d019706d866 LLVM10 anatofuz parents: diff changeset	1011 value can be named by an enumeration in llvm::XXX::Sched namespace generated
1d019706d866 LLVM10 anatofuz parents: diff changeset	1012 by TableGen in XXXGenInstrInfo.inc. The name of the schedule classes are
1d019706d866 LLVM10 anatofuz parents: diff changeset	1013 the same as provided in XXXSchedule.td plus a default NoItinerary class.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1014
1d019706d866 LLVM10 anatofuz parents: diff changeset	1015 The schedule models are generated by TableGen by the SubtargetEmitter,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1016 using the ``CodeGenSchedModels`` class. This is distinct from the itinerary
1d019706d866 LLVM10 anatofuz parents: diff changeset	1017 method of specifying machine resource use. The tool ``utils/schedcover.py``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1018 can be used to determine which instructions have been covered by the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1019 schedule model description and which haven't. The first step is to use the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1020 instructions below to create an output file. Then run ``schedcover.py`` on the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1021 output file:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1022
1d019706d866 LLVM10 anatofuz parents: diff changeset	1023 .. code-block:: shell
1d019706d866 LLVM10 anatofuz parents: diff changeset	1024
1d019706d866 LLVM10 anatofuz parents: diff changeset	1025 $ <src>/utils/schedcover.py <build>/lib/Target/AArch64/tblGenSubtarget.with
1d019706d866 LLVM10 anatofuz parents: diff changeset	1026 instruction, default, CortexA53Model, CortexA57Model, CycloneModel, ExynosM3Model, FalkorModel, KryoModel, ThunderX2T99Model, ThunderXT8XModel
1d019706d866 LLVM10 anatofuz parents: diff changeset	1027 ABSv16i8, WriteV, , , CyWriteV3, M3WriteNMISC1, FalkorWr_2VXVY_2cyc, KryoWrite_2cyc_XY_XY_150ln, ,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1028 ABSv1i64, WriteV, , , CyWriteV3, M3WriteNMISC1, FalkorWr_1VXVY_2cyc, KryoWrite_2cyc_XY_noRSV_67ln, ,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1029 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	1030
1d019706d866 LLVM10 anatofuz parents: diff changeset	1031 To capture the debug output from generating a schedule model, change to the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1032 appropriate target directory and use the following command:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1033 command with the ``subtarget-emitter`` debug option:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1034
1d019706d866 LLVM10 anatofuz parents: diff changeset	1035 .. code-block:: shell
1d019706d866 LLVM10 anatofuz parents: diff changeset	1036
1d019706d866 LLVM10 anatofuz parents: diff changeset	1037 $ <build>/bin/llvm-tblgen -debug-only=subtarget-emitter -gen-subtarget \
1d019706d866 LLVM10 anatofuz parents: diff changeset	1038 -I <src>/lib/Target/<target> -I <src>/include \
1d019706d866 LLVM10 anatofuz parents: diff changeset	1039 -I <src>/lib/Target <src>/lib/Target/<target>/<target>.td \
1d019706d866 LLVM10 anatofuz parents: diff changeset	1040 -o <build>/lib/Target/<target>/<target>GenSubtargetInfo.inc.tmp \
1d019706d866 LLVM10 anatofuz parents: diff changeset	1041 > tblGenSubtarget.dbg 2>&1
1d019706d866 LLVM10 anatofuz parents: diff changeset	1042
1d019706d866 LLVM10 anatofuz parents: diff changeset	1043 Where ``<build>`` is the build directory, ``src`` is the source directory,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1044 and ``<target>`` is the name of the target.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1045 To double check that the above command is what is needed, one can capture the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1046 exact TableGen command from a build by using:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1047
1d019706d866 LLVM10 anatofuz parents: diff changeset	1048 .. code-block:: shell
1d019706d866 LLVM10 anatofuz parents: diff changeset	1049
1d019706d866 LLVM10 anatofuz parents: diff changeset	1050 $ VERBOSE=1 make ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	1051
1d019706d866 LLVM10 anatofuz parents: diff changeset	1052 and search for ``llvm-tblgen`` commands in the output.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1053
1d019706d866 LLVM10 anatofuz parents: diff changeset	1054
1d019706d866 LLVM10 anatofuz parents: diff changeset	1055 Instruction Relation Mapping
1d019706d866 LLVM10 anatofuz parents: diff changeset	1056 ----------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	1057
1d019706d866 LLVM10 anatofuz parents: diff changeset	1058 This TableGen feature is used to relate instructions with each other. It is
1d019706d866 LLVM10 anatofuz parents: diff changeset	1059 particularly useful when you have multiple instruction formats and need to
1d019706d866 LLVM10 anatofuz parents: diff changeset	1060 switch between them after instruction selection. This entire feature is driven
1d019706d866 LLVM10 anatofuz parents: diff changeset	1061 by relation models which can be defined in ``XXXInstrInfo.td`` files
1d019706d866 LLVM10 anatofuz parents: diff changeset	1062 according to the target-specific instruction set. Relation models are defined
1d019706d866 LLVM10 anatofuz parents: diff changeset	1063 using ``InstrMapping`` class as a base. TableGen parses all the models
1d019706d866 LLVM10 anatofuz parents: diff changeset	1064 and generates instruction relation maps using the specified information.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1065 Relation maps are emitted as tables in the ``XXXGenInstrInfo.inc`` file
1d019706d866 LLVM10 anatofuz parents: diff changeset	1066 along with the functions to query them. For the detailed information on how to
1d019706d866 LLVM10 anatofuz parents: diff changeset	1067 use this feature, please refer to :doc:`HowToUseInstrMappings`.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1068
1d019706d866 LLVM10 anatofuz parents: diff changeset	1069 Implement a subclass of ``TargetInstrInfo``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1070 -------------------------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	1071
1d019706d866 LLVM10 anatofuz parents: diff changeset	1072 The final step is to hand code portions of ``XXXInstrInfo``, which implements
1d019706d866 LLVM10 anatofuz parents: diff changeset	1073 the interface described in ``TargetInstrInfo.h`` (see :ref:`TargetInstrInfo`).
1d019706d866 LLVM10 anatofuz parents: diff changeset	1074 These functions return ``0`` or a Boolean or they assert, unless overridden.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1075 Here's a list of functions that are overridden for the SPARC implementation in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1076 ``SparcInstrInfo.cpp``:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1077
1d019706d866 LLVM10 anatofuz parents: diff changeset	1078 * ``isLoadFromStackSlot`` --- If the specified machine instruction is a direct
1d019706d866 LLVM10 anatofuz parents: diff changeset	1079 load from a stack slot, return the register number of the destination and the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1080 ``FrameIndex`` of the stack slot.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1081
1d019706d866 LLVM10 anatofuz parents: diff changeset	1082 * ``isStoreToStackSlot`` --- If the specified machine instruction is a direct
1d019706d866 LLVM10 anatofuz parents: diff changeset	1083 store to a stack slot, return the register number of the destination and the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1084 ``FrameIndex`` of the stack slot.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1085
1d019706d866 LLVM10 anatofuz parents: diff changeset	1086 * ``copyPhysReg`` --- Copy values between a pair of physical registers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1087
1d019706d866 LLVM10 anatofuz parents: diff changeset	1088 * ``storeRegToStackSlot`` --- Store a register value to a stack slot.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1089
1d019706d866 LLVM10 anatofuz parents: diff changeset	1090 * ``loadRegFromStackSlot`` --- Load a register value from a stack slot.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1091
1d019706d866 LLVM10 anatofuz parents: diff changeset	1092 * ``storeRegToAddr`` --- Store a register value to memory.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1093
1d019706d866 LLVM10 anatofuz parents: diff changeset	1094 * ``loadRegFromAddr`` --- Load a register value from memory.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1095
1d019706d866 LLVM10 anatofuz parents: diff changeset	1096 * ``foldMemoryOperand`` --- Attempt to combine instructions of any load or
1d019706d866 LLVM10 anatofuz parents: diff changeset	1097 store instruction for the specified operand(s).
1d019706d866 LLVM10 anatofuz parents: diff changeset	1098
1d019706d866 LLVM10 anatofuz parents: diff changeset	1099 Branch Folding and If Conversion
1d019706d866 LLVM10 anatofuz parents: diff changeset	1100 --------------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	1101
1d019706d866 LLVM10 anatofuz parents: diff changeset	1102 Performance can be improved by combining instructions or by eliminating
1d019706d866 LLVM10 anatofuz parents: diff changeset	1103 instructions that are never reached. The ``analyzeBranch`` method in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1104 ``XXXInstrInfo`` may be implemented to examine conditional instructions and
1d019706d866 LLVM10 anatofuz parents: diff changeset	1105 remove unnecessary instructions. ``analyzeBranch`` looks at the end of a
1d019706d866 LLVM10 anatofuz parents: diff changeset	1106 machine basic block (MBB) for opportunities for improvement, such as branch
1d019706d866 LLVM10 anatofuz parents: diff changeset	1107 folding and if conversion. The ``BranchFolder`` and ``IfConverter`` machine
1d019706d866 LLVM10 anatofuz parents: diff changeset	1108 function passes (see the source files ``BranchFolding.cpp`` and
1d019706d866 LLVM10 anatofuz parents: diff changeset	1109 ``IfConversion.cpp`` in the ``lib/CodeGen`` directory) call ``analyzeBranch``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1110 to improve the control flow graph that represents the instructions.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1111
1d019706d866 LLVM10 anatofuz parents: diff changeset	1112 Several implementations of ``analyzeBranch`` (for ARM, Alpha, and X86) can be
1d019706d866 LLVM10 anatofuz parents: diff changeset	1113 examined as models for your own ``analyzeBranch`` implementation. Since SPARC
1d019706d866 LLVM10 anatofuz parents: diff changeset	1114 does not implement a useful ``analyzeBranch``, the ARM target implementation is
1d019706d866 LLVM10 anatofuz parents: diff changeset	1115 shown below.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1116
1d019706d866 LLVM10 anatofuz parents: diff changeset	1117 ``analyzeBranch`` returns a Boolean value and takes four parameters:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1118
1d019706d866 LLVM10 anatofuz parents: diff changeset	1119 * ``MachineBasicBlock &MBB`` --- The incoming block to be examined.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1120
1d019706d866 LLVM10 anatofuz parents: diff changeset	1121 * ``MachineBasicBlock *&TBB`` --- A destination block that is returned. For a
1d019706d866 LLVM10 anatofuz parents: diff changeset	1122 conditional branch that evaluates to true, ``TBB`` is the destination.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1123
1d019706d866 LLVM10 anatofuz parents: diff changeset	1124 * ``MachineBasicBlock *&FBB`` --- For a conditional branch that evaluates to
1d019706d866 LLVM10 anatofuz parents: diff changeset	1125 false, ``FBB`` is returned as the destination.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1126
1d019706d866 LLVM10 anatofuz parents: diff changeset	1127 * ``std::vector<MachineOperand> &Cond`` --- List of operands to evaluate a
1d019706d866 LLVM10 anatofuz parents: diff changeset	1128 condition for a conditional branch.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1129
1d019706d866 LLVM10 anatofuz parents: diff changeset	1130 In the simplest case, if a block ends without a branch, then it falls through
1d019706d866 LLVM10 anatofuz parents: diff changeset	1131 to the successor block. No destination blocks are specified for either ``TBB``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1132 or ``FBB``, so both parameters return ``NULL``. The start of the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1133 ``analyzeBranch`` (see code below for the ARM target) shows the function
1d019706d866 LLVM10 anatofuz parents: diff changeset	1134 parameters and the code for the simplest case.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1135
1d019706d866 LLVM10 anatofuz parents: diff changeset	1136 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1137
1d019706d866 LLVM10 anatofuz parents: diff changeset	1138 bool ARMInstrInfo::analyzeBranch(MachineBasicBlock &MBB,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1139 MachineBasicBlock *&TBB,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1140 MachineBasicBlock *&FBB,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1141 std::vector<MachineOperand> &Cond) const
1d019706d866 LLVM10 anatofuz parents: diff changeset	1142 {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1143 MachineBasicBlock::iterator I = MBB.end();
1d019706d866 LLVM10 anatofuz parents: diff changeset	1144 if (I == MBB.begin() \|\| !isUnpredicatedTerminator(--I))
1d019706d866 LLVM10 anatofuz parents: diff changeset	1145 return false;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1146
1d019706d866 LLVM10 anatofuz parents: diff changeset	1147 If a block ends with a single unconditional branch instruction, then
1d019706d866 LLVM10 anatofuz parents: diff changeset	1148 ``analyzeBranch`` (shown below) should return the destination of that branch in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1149 the ``TBB`` parameter.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1150
1d019706d866 LLVM10 anatofuz parents: diff changeset	1151 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1152
1d019706d866 LLVM10 anatofuz parents: diff changeset	1153 if (LastOpc == ARM::B \|\| LastOpc == ARM::tB) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1154 TBB = LastInst->getOperand(0).getMBB();
1d019706d866 LLVM10 anatofuz parents: diff changeset	1155 return false;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1156 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1157
1d019706d866 LLVM10 anatofuz parents: diff changeset	1158 If a block ends with two unconditional branches, then the second branch is
1d019706d866 LLVM10 anatofuz parents: diff changeset	1159 never reached. In that situation, as shown below, remove the last branch
1d019706d866 LLVM10 anatofuz parents: diff changeset	1160 instruction and return the penultimate branch in the ``TBB`` parameter.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1161
1d019706d866 LLVM10 anatofuz parents: diff changeset	1162 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1163
1d019706d866 LLVM10 anatofuz parents: diff changeset	1164 if ((SecondLastOpc == ARM::B \|\| SecondLastOpc == ARM::tB) &&
1d019706d866 LLVM10 anatofuz parents: diff changeset	1165 (LastOpc == ARM::B \|\| LastOpc == ARM::tB)) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1166 TBB = SecondLastInst->getOperand(0).getMBB();
1d019706d866 LLVM10 anatofuz parents: diff changeset	1167 I = LastInst;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1168 I->eraseFromParent();
1d019706d866 LLVM10 anatofuz parents: diff changeset	1169 return false;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1170 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1171
1d019706d866 LLVM10 anatofuz parents: diff changeset	1172 A block may end with a single conditional branch instruction that falls through
1d019706d866 LLVM10 anatofuz parents: diff changeset	1173 to successor block if the condition evaluates to false. In that case,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1174 ``analyzeBranch`` (shown below) should return the destination of that
1d019706d866 LLVM10 anatofuz parents: diff changeset	1175 conditional branch in the ``TBB`` parameter and a list of operands in the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1176 ``Cond`` parameter to evaluate the condition.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1177
1d019706d866 LLVM10 anatofuz parents: diff changeset	1178 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1179
1d019706d866 LLVM10 anatofuz parents: diff changeset	1180 if (LastOpc == ARM::Bcc \|\| LastOpc == ARM::tBcc) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1181 // Block ends with fall-through condbranch.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1182 TBB = LastInst->getOperand(0).getMBB();
1d019706d866 LLVM10 anatofuz parents: diff changeset	1183 Cond.push_back(LastInst->getOperand(1));
1d019706d866 LLVM10 anatofuz parents: diff changeset	1184 Cond.push_back(LastInst->getOperand(2));
1d019706d866 LLVM10 anatofuz parents: diff changeset	1185 return false;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1186 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1187
1d019706d866 LLVM10 anatofuz parents: diff changeset	1188 If a block ends with both a conditional branch and an ensuing unconditional
1d019706d866 LLVM10 anatofuz parents: diff changeset	1189 branch, then ``analyzeBranch`` (shown below) should return the conditional
1d019706d866 LLVM10 anatofuz parents: diff changeset	1190 branch destination (assuming it corresponds to a conditional evaluation of
1d019706d866 LLVM10 anatofuz parents: diff changeset	1191 "``true``") in the ``TBB`` parameter and the unconditional branch destination
1d019706d866 LLVM10 anatofuz parents: diff changeset	1192 in the ``FBB`` (corresponding to a conditional evaluation of "``false``"). A
1d019706d866 LLVM10 anatofuz parents: diff changeset	1193 list of operands to evaluate the condition should be returned in the ``Cond``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1194 parameter.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1195
1d019706d866 LLVM10 anatofuz parents: diff changeset	1196 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1197
1d019706d866 LLVM10 anatofuz parents: diff changeset	1198 unsigned SecondLastOpc = SecondLastInst->getOpcode();
1d019706d866 LLVM10 anatofuz parents: diff changeset	1199
1d019706d866 LLVM10 anatofuz parents: diff changeset	1200 if ((SecondLastOpc == ARM::Bcc && LastOpc == ARM::B) \|\|
1d019706d866 LLVM10 anatofuz parents: diff changeset	1201 (SecondLastOpc == ARM::tBcc && LastOpc == ARM::tB)) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1202 TBB = SecondLastInst->getOperand(0).getMBB();
1d019706d866 LLVM10 anatofuz parents: diff changeset	1203 Cond.push_back(SecondLastInst->getOperand(1));
1d019706d866 LLVM10 anatofuz parents: diff changeset	1204 Cond.push_back(SecondLastInst->getOperand(2));
1d019706d866 LLVM10 anatofuz parents: diff changeset	1205 FBB = LastInst->getOperand(0).getMBB();
1d019706d866 LLVM10 anatofuz parents: diff changeset	1206 return false;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1207 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1208
1d019706d866 LLVM10 anatofuz parents: diff changeset	1209 For the last two cases (ending with a single conditional branch or ending with
1d019706d866 LLVM10 anatofuz parents: diff changeset	1210 one conditional and one unconditional branch), the operands returned in the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1211 ``Cond`` parameter can be passed to methods of other instructions to create new
1d019706d866 LLVM10 anatofuz parents: diff changeset	1212 branches or perform other operations. An implementation of ``analyzeBranch``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1213 requires the helper methods ``removeBranch`` and ``insertBranch`` to manage
1d019706d866 LLVM10 anatofuz parents: diff changeset	1214 subsequent operations.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1215
1d019706d866 LLVM10 anatofuz parents: diff changeset	1216 ``analyzeBranch`` should return false indicating success in most circumstances.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1217 ``analyzeBranch`` should only return true when the method is stumped about what
1d019706d866 LLVM10 anatofuz parents: diff changeset	1218 to do, for example, if a block has three terminating branches.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1219 ``analyzeBranch`` may return true if it encounters a terminator it cannot
1d019706d866 LLVM10 anatofuz parents: diff changeset	1220 handle, such as an indirect branch.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1221
1d019706d866 LLVM10 anatofuz parents: diff changeset	1222 .. _instruction-selector:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1223
1d019706d866 LLVM10 anatofuz parents: diff changeset	1224 Instruction Selector
1d019706d866 LLVM10 anatofuz parents: diff changeset	1225 ====================
1d019706d866 LLVM10 anatofuz parents: diff changeset	1226
1d019706d866 LLVM10 anatofuz parents: diff changeset	1227 LLVM uses a ``SelectionDAG`` to represent LLVM IR instructions, and nodes of
1d019706d866 LLVM10 anatofuz parents: diff changeset	1228 the ``SelectionDAG`` ideally represent native target instructions. During code
1d019706d866 LLVM10 anatofuz parents: diff changeset	1229 generation, instruction selection passes are performed to convert non-native
1d019706d866 LLVM10 anatofuz parents: diff changeset	1230 DAG instructions into native target-specific instructions. The pass described
1d019706d866 LLVM10 anatofuz parents: diff changeset	1231 in ``XXXISelDAGToDAG.cpp`` is used to match patterns and perform DAG-to-DAG
1d019706d866 LLVM10 anatofuz parents: diff changeset	1232 instruction selection. Optionally, a pass may be defined (in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1233 ``XXXBranchSelector.cpp``) to perform similar DAG-to-DAG operations for branch
1d019706d866 LLVM10 anatofuz parents: diff changeset	1234 instructions. Later, the code in ``XXXISelLowering.cpp`` replaces or removes
1d019706d866 LLVM10 anatofuz parents: diff changeset	1235 operations and data types not supported natively (legalizes) in a
1d019706d866 LLVM10 anatofuz parents: diff changeset	1236 ``SelectionDAG``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1237
1d019706d866 LLVM10 anatofuz parents: diff changeset	1238 TableGen generates code for instruction selection using the following target
1d019706d866 LLVM10 anatofuz parents: diff changeset	1239 description input files:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1240
1d019706d866 LLVM10 anatofuz parents: diff changeset	1241 * ``XXXInstrInfo.td`` --- Contains definitions of instructions in a
1d019706d866 LLVM10 anatofuz parents: diff changeset	1242 target-specific instruction set, generates ``XXXGenDAGISel.inc``, which is
1d019706d866 LLVM10 anatofuz parents: diff changeset	1243 included in ``XXXISelDAGToDAG.cpp``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1244
1d019706d866 LLVM10 anatofuz parents: diff changeset	1245 * ``XXXCallingConv.td`` --- Contains the calling and return value conventions
1d019706d866 LLVM10 anatofuz parents: diff changeset	1246 for the target architecture, and it generates ``XXXGenCallingConv.inc``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1247 which is included in ``XXXISelLowering.cpp``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1248
1d019706d866 LLVM10 anatofuz parents: diff changeset	1249 The implementation of an instruction selection pass must include a header that
1d019706d866 LLVM10 anatofuz parents: diff changeset	1250 declares the ``FunctionPass`` class or a subclass of ``FunctionPass``. In
1d019706d866 LLVM10 anatofuz parents: diff changeset	1251 ``XXXTargetMachine.cpp``, a Pass Manager (PM) should add each instruction
1d019706d866 LLVM10 anatofuz parents: diff changeset	1252 selection pass into the queue of passes to run.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1253
1d019706d866 LLVM10 anatofuz parents: diff changeset	1254 The LLVM static compiler (``llc``) is an excellent tool for visualizing the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1255 contents of DAGs. To display the ``SelectionDAG`` before or after specific
1d019706d866 LLVM10 anatofuz parents: diff changeset	1256 processing phases, use the command line options for ``llc``, described at
1d019706d866 LLVM10 anatofuz parents: diff changeset	1257 :ref:`SelectionDAG-Process`.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1258
1d019706d866 LLVM10 anatofuz parents: diff changeset	1259 To describe instruction selector behavior, you should add patterns for lowering
1d019706d866 LLVM10 anatofuz parents: diff changeset	1260 LLVM code into a ``SelectionDAG`` as the last parameter of the instruction
1d019706d866 LLVM10 anatofuz parents: diff changeset	1261 definitions in ``XXXInstrInfo.td``. For example, in ``SparcInstrInfo.td``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1262 this entry defines a register store operation, and the last parameter describes
1d019706d866 LLVM10 anatofuz parents: diff changeset	1263 a pattern with the store DAG operator.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1264
1d019706d866 LLVM10 anatofuz parents: diff changeset	1265 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	1266
1d019706d866 LLVM10 anatofuz parents: diff changeset	1267 def STrr : F3_1< 3, 0b000100, (outs), (ins MEMrr:$addr, IntRegs:$src),
1d019706d866 LLVM10 anatofuz parents: diff changeset	1268 "st $src, [$addr]", [(store i32:$src, ADDRrr:$addr)]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1269
1d019706d866 LLVM10 anatofuz parents: diff changeset	1270 ``ADDRrr`` is a memory mode that is also defined in ``SparcInstrInfo.td``:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1271
1d019706d866 LLVM10 anatofuz parents: diff changeset	1272 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	1273
1d019706d866 LLVM10 anatofuz parents: diff changeset	1274 def ADDRrr : ComplexPattern<i32, 2, "SelectADDRrr", [], []>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1275
1d019706d866 LLVM10 anatofuz parents: diff changeset	1276 The definition of ``ADDRrr`` refers to ``SelectADDRrr``, which is a function
1d019706d866 LLVM10 anatofuz parents: diff changeset	1277 defined in an implementation of the Instructor Selector (such as
1d019706d866 LLVM10 anatofuz parents: diff changeset	1278 ``SparcISelDAGToDAG.cpp``).
1d019706d866 LLVM10 anatofuz parents: diff changeset	1279
1d019706d866 LLVM10 anatofuz parents: diff changeset	1280 In ``lib/Target/TargetSelectionDAG.td``, the DAG operator for store is defined
1d019706d866 LLVM10 anatofuz parents: diff changeset	1281 below:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1282
1d019706d866 LLVM10 anatofuz parents: diff changeset	1283 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	1284
1d019706d866 LLVM10 anatofuz parents: diff changeset	1285 def store : PatFrag<(ops node:$val, node:$ptr),
1d019706d866 LLVM10 anatofuz parents: diff changeset	1286 (st node:$val, node:$ptr), [{
1d019706d866 LLVM10 anatofuz parents: diff changeset	1287 if (StoreSDNode *ST = dyn_cast<StoreSDNode>(N))
1d019706d866 LLVM10 anatofuz parents: diff changeset	1288 return !ST->isTruncatingStore() &&
1d019706d866 LLVM10 anatofuz parents: diff changeset	1289 ST->getAddressingMode() == ISD::UNINDEXED;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1290 return false;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1291 }]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1292
1d019706d866 LLVM10 anatofuz parents: diff changeset	1293 ``XXXInstrInfo.td`` also generates (in ``XXXGenDAGISel.inc``) the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1294 ``SelectCode`` method that is used to call the appropriate processing method
1d019706d866 LLVM10 anatofuz parents: diff changeset	1295 for an instruction. In this example, ``SelectCode`` calls ``Select_ISD_STORE``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1296 for the ``ISD::STORE`` opcode.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1297
1d019706d866 LLVM10 anatofuz parents: diff changeset	1298 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1299
1d019706d866 LLVM10 anatofuz parents: diff changeset	1300 SDNode *SelectCode(SDValue N) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1301 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	1302 MVT::ValueType NVT = N.getNode()->getValueType(0);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1303 switch (N.getOpcode()) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1304 case ISD::STORE: {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1305 switch (NVT) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1306 default:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1307 return Select_ISD_STORE(N);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1308 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1309 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1310 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1311 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1312 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	1313
1d019706d866 LLVM10 anatofuz parents: diff changeset	1314 The pattern for ``STrr`` is matched, so elsewhere in ``XXXGenDAGISel.inc``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1315 code for ``STrr`` is created for ``Select_ISD_STORE``. The ``Emit_22`` method
1d019706d866 LLVM10 anatofuz parents: diff changeset	1316 is also generated in ``XXXGenDAGISel.inc`` to complete the processing of this
1d019706d866 LLVM10 anatofuz parents: diff changeset	1317 instruction.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1318
1d019706d866 LLVM10 anatofuz parents: diff changeset	1319 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1320
1d019706d866 LLVM10 anatofuz parents: diff changeset	1321 SDNode *Select_ISD_STORE(const SDValue &N) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1322 SDValue Chain = N.getOperand(0);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1323 if (Predicate_store(N.getNode())) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1324 SDValue N1 = N.getOperand(1);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1325 SDValue N2 = N.getOperand(2);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1326 SDValue CPTmp0;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1327 SDValue CPTmp1;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1328
1d019706d866 LLVM10 anatofuz parents: diff changeset	1329 // Pattern: (st:void i32:i32:$src,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1330 // ADDRrr:i32:$addr)<<P:Predicate_store>>
1d019706d866 LLVM10 anatofuz parents: diff changeset	1331 // Emits: (STrr:void ADDRrr:i32:$addr, IntRegs:i32:$src)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1332 // Pattern complexity = 13 cost = 1 size = 0
1d019706d866 LLVM10 anatofuz parents: diff changeset	1333 if (SelectADDRrr(N, N2, CPTmp0, CPTmp1) &&
1d019706d866 LLVM10 anatofuz parents: diff changeset	1334 N1.getNode()->getValueType(0) == MVT::i32 &&
1d019706d866 LLVM10 anatofuz parents: diff changeset	1335 N2.getNode()->getValueType(0) == MVT::i32) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1336 return Emit_22(N, SP::STrr, CPTmp0, CPTmp1);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1337 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1338 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	1339
1d019706d866 LLVM10 anatofuz parents: diff changeset	1340 The SelectionDAG Legalize Phase
1d019706d866 LLVM10 anatofuz parents: diff changeset	1341 -------------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	1342
1d019706d866 LLVM10 anatofuz parents: diff changeset	1343 The Legalize phase converts a DAG to use types and operations that are natively
1d019706d866 LLVM10 anatofuz parents: diff changeset	1344 supported by the target. For natively unsupported types and operations, you
1d019706d866 LLVM10 anatofuz parents: diff changeset	1345 need to add code to the target-specific ``XXXTargetLowering`` implementation to
1d019706d866 LLVM10 anatofuz parents: diff changeset	1346 convert unsupported types and operations to supported ones.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1347
1d019706d866 LLVM10 anatofuz parents: diff changeset	1348 In the constructor for the ``XXXTargetLowering`` class, first use the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1349 ``addRegisterClass`` method to specify which types are supported and which
1d019706d866 LLVM10 anatofuz parents: diff changeset	1350 register classes are associated with them. The code for the register classes
1d019706d866 LLVM10 anatofuz parents: diff changeset	1351 are generated by TableGen from ``XXXRegisterInfo.td`` and placed in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1352 ``XXXGenRegisterInfo.h.inc``. For example, the implementation of the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1353 constructor for the SparcTargetLowering class (in ``SparcISelLowering.cpp``)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1354 starts with the following code:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1355
1d019706d866 LLVM10 anatofuz parents: diff changeset	1356 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1357
1d019706d866 LLVM10 anatofuz parents: diff changeset	1358 addRegisterClass(MVT::i32, SP::IntRegsRegisterClass);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1359 addRegisterClass(MVT::f32, SP::FPRegsRegisterClass);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1360 addRegisterClass(MVT::f64, SP::DFPRegsRegisterClass);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1361
1d019706d866 LLVM10 anatofuz parents: diff changeset	1362 You should examine the node types in the ``ISD`` namespace
1d019706d866 LLVM10 anatofuz parents: diff changeset	1363 (``include/llvm/CodeGen/SelectionDAGNodes.h``) and determine which operations
1d019706d866 LLVM10 anatofuz parents: diff changeset	1364 the target natively supports. For operations that do not have native
1d019706d866 LLVM10 anatofuz parents: diff changeset	1365 support, add a callback to the constructor for the ``XXXTargetLowering`` class,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1366 so the instruction selection process knows what to do. The ``TargetLowering``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1367 class callback methods (declared in ``llvm/Target/TargetLowering.h``) are:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1368
1d019706d866 LLVM10 anatofuz parents: diff changeset	1369 * ``setOperationAction`` --- General operation.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1370 * ``setLoadExtAction`` --- Load with extension.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1371 * ``setTruncStoreAction`` --- Truncating store.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1372 * ``setIndexedLoadAction`` --- Indexed load.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1373 * ``setIndexedStoreAction`` --- Indexed store.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1374 * ``setConvertAction`` --- Type conversion.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1375 * ``setCondCodeAction`` --- Support for a given condition code.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1376
1d019706d866 LLVM10 anatofuz parents: diff changeset	1377 Note: on older releases, ``setLoadXAction`` is used instead of
1d019706d866 LLVM10 anatofuz parents: diff changeset	1378 ``setLoadExtAction``. Also, on older releases, ``setCondCodeAction`` may not
1d019706d866 LLVM10 anatofuz parents: diff changeset	1379 be supported. Examine your release to see what methods are specifically
1d019706d866 LLVM10 anatofuz parents: diff changeset	1380 supported.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1381
1d019706d866 LLVM10 anatofuz parents: diff changeset	1382 These callbacks are used to determine that an operation does or does not work
1d019706d866 LLVM10 anatofuz parents: diff changeset	1383 with a specified type (or types). And in all cases, the third parameter is a
1d019706d866 LLVM10 anatofuz parents: diff changeset	1384 ``LegalAction`` type enum value: ``Promote``, ``Expand``, ``Custom``, or
1d019706d866 LLVM10 anatofuz parents: diff changeset	1385 ``Legal``. ``SparcISelLowering.cpp`` contains examples of all four
1d019706d866 LLVM10 anatofuz parents: diff changeset	1386 ``LegalAction`` values.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1387
1d019706d866 LLVM10 anatofuz parents: diff changeset	1388 Promote
1d019706d866 LLVM10 anatofuz parents: diff changeset	1389 ^^^^^^^
1d019706d866 LLVM10 anatofuz parents: diff changeset	1390
1d019706d866 LLVM10 anatofuz parents: diff changeset	1391 For an operation without native support for a given type, the specified type
1d019706d866 LLVM10 anatofuz parents: diff changeset	1392 may be promoted to a larger type that is supported. For example, SPARC does
1d019706d866 LLVM10 anatofuz parents: diff changeset	1393 not support a sign-extending load for Boolean values (``i1`` type), so in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1394 ``SparcISelLowering.cpp`` the third parameter below, ``Promote``, changes
1d019706d866 LLVM10 anatofuz parents: diff changeset	1395 ``i1`` type values to a large type before loading.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1396
1d019706d866 LLVM10 anatofuz parents: diff changeset	1397 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1398
1d019706d866 LLVM10 anatofuz parents: diff changeset	1399 setLoadExtAction(ISD::SEXTLOAD, MVT::i1, Promote);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1400
1d019706d866 LLVM10 anatofuz parents: diff changeset	1401 Expand
1d019706d866 LLVM10 anatofuz parents: diff changeset	1402 ^^^^^^
1d019706d866 LLVM10 anatofuz parents: diff changeset	1403
1d019706d866 LLVM10 anatofuz parents: diff changeset	1404 For a type without native support, a value may need to be broken down further,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1405 rather than promoted. For an operation without native support, a combination
1d019706d866 LLVM10 anatofuz parents: diff changeset	1406 of other operations may be used to similar effect. In SPARC, the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1407 floating-point sine and cosine trig operations are supported by expansion to
1d019706d866 LLVM10 anatofuz parents: diff changeset	1408 other operations, as indicated by the third parameter, ``Expand``, to
1d019706d866 LLVM10 anatofuz parents: diff changeset	1409 ``setOperationAction``:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1410
1d019706d866 LLVM10 anatofuz parents: diff changeset	1411 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1412
1d019706d866 LLVM10 anatofuz parents: diff changeset	1413 setOperationAction(ISD::FSIN, MVT::f32, Expand);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1414 setOperationAction(ISD::FCOS, MVT::f32, Expand);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1415
1d019706d866 LLVM10 anatofuz parents: diff changeset	1416 Custom
1d019706d866 LLVM10 anatofuz parents: diff changeset	1417 ^^^^^^
1d019706d866 LLVM10 anatofuz parents: diff changeset	1418
1d019706d866 LLVM10 anatofuz parents: diff changeset	1419 For some operations, simple type promotion or operation expansion may be
1d019706d866 LLVM10 anatofuz parents: diff changeset	1420 insufficient. In some cases, a special intrinsic function must be implemented.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1421
1d019706d866 LLVM10 anatofuz parents: diff changeset	1422 For example, a constant value may require special treatment, or an operation
1d019706d866 LLVM10 anatofuz parents: diff changeset	1423 may require spilling and restoring registers in the stack and working with
1d019706d866 LLVM10 anatofuz parents: diff changeset	1424 register allocators.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1425
1d019706d866 LLVM10 anatofuz parents: diff changeset	1426 As seen in ``SparcISelLowering.cpp`` code below, to perform a type conversion
1d019706d866 LLVM10 anatofuz parents: diff changeset	1427 from a floating point value to a signed integer, first the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1428 ``setOperationAction`` should be called with ``Custom`` as the third parameter:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1429
1d019706d866 LLVM10 anatofuz parents: diff changeset	1430 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1431
1d019706d866 LLVM10 anatofuz parents: diff changeset	1432 setOperationAction(ISD::FP_TO_SINT, MVT::i32, Custom);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1433
1d019706d866 LLVM10 anatofuz parents: diff changeset	1434 In the ``LowerOperation`` method, for each ``Custom`` operation, a case
1d019706d866 LLVM10 anatofuz parents: diff changeset	1435 statement should be added to indicate what function to call. In the following
1d019706d866 LLVM10 anatofuz parents: diff changeset	1436 code, an ``FP_TO_SINT`` opcode will call the ``LowerFP_TO_SINT`` method:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1437
1d019706d866 LLVM10 anatofuz parents: diff changeset	1438 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1439
1d019706d866 LLVM10 anatofuz parents: diff changeset	1440 SDValue SparcTargetLowering::LowerOperation(SDValue Op, SelectionDAG &DAG) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1441 switch (Op.getOpcode()) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1442 case ISD::FP_TO_SINT: return LowerFP_TO_SINT(Op, DAG);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1443 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	1444 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1445 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1446
1d019706d866 LLVM10 anatofuz parents: diff changeset	1447 Finally, the ``LowerFP_TO_SINT`` method is implemented, using an FP register to
1d019706d866 LLVM10 anatofuz parents: diff changeset	1448 convert the floating-point value to an integer.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1449
1d019706d866 LLVM10 anatofuz parents: diff changeset	1450 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1451
1d019706d866 LLVM10 anatofuz parents: diff changeset	1452 static SDValue LowerFP_TO_SINT(SDValue Op, SelectionDAG &DAG) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1453 assert(Op.getValueType() == MVT::i32);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1454 Op = DAG.getNode(SPISD::FTOI, MVT::f32, Op.getOperand(0));
1d019706d866 LLVM10 anatofuz parents: diff changeset	1455 return DAG.getNode(ISD::BITCAST, MVT::i32, Op);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1456 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1457
1d019706d866 LLVM10 anatofuz parents: diff changeset	1458 Legal
1d019706d866 LLVM10 anatofuz parents: diff changeset	1459 ^^^^^
1d019706d866 LLVM10 anatofuz parents: diff changeset	1460
1d019706d866 LLVM10 anatofuz parents: diff changeset	1461 The ``Legal`` ``LegalizeAction`` enum value simply indicates that an operation
1d019706d866 LLVM10 anatofuz parents: diff changeset	1462 is natively supported. ``Legal`` represents the default condition, so it
1d019706d866 LLVM10 anatofuz parents: diff changeset	1463 is rarely used. In ``SparcISelLowering.cpp``, the action for ``CTPOP`` (an
1d019706d866 LLVM10 anatofuz parents: diff changeset	1464 operation to count the bits set in an integer) is natively supported only for
1d019706d866 LLVM10 anatofuz parents: diff changeset	1465 SPARC v9. The following code enables the ``Expand`` conversion technique for
1d019706d866 LLVM10 anatofuz parents: diff changeset	1466 non-v9 SPARC implementations.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1467
1d019706d866 LLVM10 anatofuz parents: diff changeset	1468 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1469
1d019706d866 LLVM10 anatofuz parents: diff changeset	1470 setOperationAction(ISD::CTPOP, MVT::i32, Expand);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1471 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	1472 if (TM.getSubtarget<SparcSubtarget>().isV9())
1d019706d866 LLVM10 anatofuz parents: diff changeset	1473 setOperationAction(ISD::CTPOP, MVT::i32, Legal);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1474
1d019706d866 LLVM10 anatofuz parents: diff changeset	1475 Calling Conventions
1d019706d866 LLVM10 anatofuz parents: diff changeset	1476 -------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	1477
1d019706d866 LLVM10 anatofuz parents: diff changeset	1478 To support target-specific calling conventions, ``XXXGenCallingConv.td`` uses
1d019706d866 LLVM10 anatofuz parents: diff changeset	1479 interfaces (such as ``CCIfType`` and ``CCAssignToReg``) that are defined in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1480 ``lib/Target/TargetCallingConv.td``. TableGen can take the target descriptor
1d019706d866 LLVM10 anatofuz parents: diff changeset	1481 file ``XXXGenCallingConv.td`` and generate the header file
1d019706d866 LLVM10 anatofuz parents: diff changeset	1482 ``XXXGenCallingConv.inc``, which is typically included in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1483 ``XXXISelLowering.cpp``. You can use the interfaces in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1484 ``TargetCallingConv.td`` to specify:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1485
1d019706d866 LLVM10 anatofuz parents: diff changeset	1486 * The order of parameter allocation.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1487
1d019706d866 LLVM10 anatofuz parents: diff changeset	1488 * Where parameters and return values are placed (that is, on the stack or in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1489 registers).
1d019706d866 LLVM10 anatofuz parents: diff changeset	1490
1d019706d866 LLVM10 anatofuz parents: diff changeset	1491 * Which registers may be used.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1492
1d019706d866 LLVM10 anatofuz parents: diff changeset	1493 * Whether the caller or callee unwinds the stack.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1494
1d019706d866 LLVM10 anatofuz parents: diff changeset	1495 The following example demonstrates the use of the ``CCIfType`` and
1d019706d866 LLVM10 anatofuz parents: diff changeset	1496 ``CCAssignToReg`` interfaces. If the ``CCIfType`` predicate is true (that is,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1497 if the current argument is of type ``f32`` or ``f64``), then the action is
1d019706d866 LLVM10 anatofuz parents: diff changeset	1498 performed. In this case, the ``CCAssignToReg`` action assigns the argument
1d019706d866 LLVM10 anatofuz parents: diff changeset	1499 value to the first available register: either ``R0`` or ``R1``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1500
1d019706d866 LLVM10 anatofuz parents: diff changeset	1501 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	1502
1d019706d866 LLVM10 anatofuz parents: diff changeset	1503 CCIfType<[f32,f64], CCAssignToReg<[R0, R1]>>
1d019706d866 LLVM10 anatofuz parents: diff changeset	1504
1d019706d866 LLVM10 anatofuz parents: diff changeset	1505 ``SparcCallingConv.td`` contains definitions for a target-specific return-value
1d019706d866 LLVM10 anatofuz parents: diff changeset	1506 calling convention (``RetCC_Sparc32``) and a basic 32-bit C calling convention
1d019706d866 LLVM10 anatofuz parents: diff changeset	1507 (``CC_Sparc32``). The definition of ``RetCC_Sparc32`` (shown below) indicates
1d019706d866 LLVM10 anatofuz parents: diff changeset	1508 which registers are used for specified scalar return types. A single-precision
1d019706d866 LLVM10 anatofuz parents: diff changeset	1509 float is returned to register ``F0``, and a double-precision float goes to
1d019706d866 LLVM10 anatofuz parents: diff changeset	1510 register ``D0``. A 32-bit integer is returned in register ``I0`` or ``I1``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1511
1d019706d866 LLVM10 anatofuz parents: diff changeset	1512 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	1513
1d019706d866 LLVM10 anatofuz parents: diff changeset	1514 def RetCC_Sparc32 : CallingConv<[
1d019706d866 LLVM10 anatofuz parents: diff changeset	1515 CCIfType<[i32], CCAssignToReg<[I0, I1]>>,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1516 CCIfType<[f32], CCAssignToReg<[F0]>>,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1517 CCIfType<[f64], CCAssignToReg<[D0]>>
1d019706d866 LLVM10 anatofuz parents: diff changeset	1518 ]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1519
1d019706d866 LLVM10 anatofuz parents: diff changeset	1520 The definition of ``CC_Sparc32`` in ``SparcCallingConv.td`` introduces
1d019706d866 LLVM10 anatofuz parents: diff changeset	1521 ``CCAssignToStack``, which assigns the value to a stack slot with the specified
1d019706d866 LLVM10 anatofuz parents: diff changeset	1522 size and alignment. In the example below, the first parameter, 4, indicates
1d019706d866 LLVM10 anatofuz parents: diff changeset	1523 the size of the slot, and the second parameter, also 4, indicates the stack
1d019706d866 LLVM10 anatofuz parents: diff changeset	1524 alignment along 4-byte units. (Special cases: if size is zero, then the ABI
1d019706d866 LLVM10 anatofuz parents: diff changeset	1525 size is used; if alignment is zero, then the ABI alignment is used.)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1526
1d019706d866 LLVM10 anatofuz parents: diff changeset	1527 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	1528
1d019706d866 LLVM10 anatofuz parents: diff changeset	1529 def CC_Sparc32 : CallingConv<[
1d019706d866 LLVM10 anatofuz parents: diff changeset	1530 // All arguments get passed in integer registers if there is space.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1531 CCIfType<[i32, f32, f64], CCAssignToReg<[I0, I1, I2, I3, I4, I5]>>,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1532 CCAssignToStack<4, 4>
1d019706d866 LLVM10 anatofuz parents: diff changeset	1533 ]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1534
1d019706d866 LLVM10 anatofuz parents: diff changeset	1535 ``CCDelegateTo`` is another commonly used interface, which tries to find a
1d019706d866 LLVM10 anatofuz parents: diff changeset	1536 specified sub-calling convention, and, if a match is found, it is invoked. In
1d019706d866 LLVM10 anatofuz parents: diff changeset	1537 the following example (in ``X86CallingConv.td``), the definition of
1d019706d866 LLVM10 anatofuz parents: diff changeset	1538 ``RetCC_X86_32_C`` ends with ``CCDelegateTo``. After the current value is
1d019706d866 LLVM10 anatofuz parents: diff changeset	1539 assigned to the register ``ST0`` or ``ST1``, the ``RetCC_X86Common`` is
1d019706d866 LLVM10 anatofuz parents: diff changeset	1540 invoked.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1541
1d019706d866 LLVM10 anatofuz parents: diff changeset	1542 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	1543
1d019706d866 LLVM10 anatofuz parents: diff changeset	1544 def RetCC_X86_32_C : CallingConv<[
1d019706d866 LLVM10 anatofuz parents: diff changeset	1545 CCIfType<[f32], CCAssignToReg<[ST0, ST1]>>,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1546 CCIfType<[f64], CCAssignToReg<[ST0, ST1]>>,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1547 CCDelegateTo<RetCC_X86Common>
1d019706d866 LLVM10 anatofuz parents: diff changeset	1548 ]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1549
1d019706d866 LLVM10 anatofuz parents: diff changeset	1550 ``CCIfCC`` is an interface that attempts to match the given name to the current
1d019706d866 LLVM10 anatofuz parents: diff changeset	1551 calling convention. If the name identifies the current calling convention,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1552 then a specified action is invoked. In the following example (in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1553 ``X86CallingConv.td``), if the ``Fast`` calling convention is in use, then
1d019706d866 LLVM10 anatofuz parents: diff changeset	1554 ``RetCC_X86_32_Fast`` is invoked. If the ``SSECall`` calling convention is in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1555 use, then ``RetCC_X86_32_SSE`` is invoked.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1556
1d019706d866 LLVM10 anatofuz parents: diff changeset	1557 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	1558
1d019706d866 LLVM10 anatofuz parents: diff changeset	1559 def RetCC_X86_32 : CallingConv<[
1d019706d866 LLVM10 anatofuz parents: diff changeset	1560 CCIfCC<"CallingConv::Fast", CCDelegateTo<RetCC_X86_32_Fast>>,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1561 CCIfCC<"CallingConv::X86_SSECall", CCDelegateTo<RetCC_X86_32_SSE>>,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1562 CCDelegateTo<RetCC_X86_32_C>
1d019706d866 LLVM10 anatofuz parents: diff changeset	1563 ]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1564
1d019706d866 LLVM10 anatofuz parents: diff changeset	1565 Other calling convention interfaces include:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1566
1d019706d866 LLVM10 anatofuz parents: diff changeset	1567 * ``CCIf <predicate, action>`` --- If the predicate matches, apply the action.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1568
1d019706d866 LLVM10 anatofuz parents: diff changeset	1569 * ``CCIfInReg <action>`` --- If the argument is marked with the "``inreg``"
1d019706d866 LLVM10 anatofuz parents: diff changeset	1570 attribute, then apply the action.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1571
1d019706d866 LLVM10 anatofuz parents: diff changeset	1572 * ``CCIfNest <action>`` --- If the argument is marked with the "``nest``"
1d019706d866 LLVM10 anatofuz parents: diff changeset	1573 attribute, then apply the action.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1574
1d019706d866 LLVM10 anatofuz parents: diff changeset	1575 * ``CCIfNotVarArg <action>`` --- If the current function does not take a
1d019706d866 LLVM10 anatofuz parents: diff changeset	1576 variable number of arguments, apply the action.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1577
1d019706d866 LLVM10 anatofuz parents: diff changeset	1578 * ``CCAssignToRegWithShadow <registerList, shadowList>`` --- similar to
1d019706d866 LLVM10 anatofuz parents: diff changeset	1579 ``CCAssignToReg``, but with a shadow list of registers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1580
1d019706d866 LLVM10 anatofuz parents: diff changeset	1581 * ``CCPassByVal <size, align>`` --- Assign value to a stack slot with the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1582 minimum specified size and alignment.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1583
1d019706d866 LLVM10 anatofuz parents: diff changeset	1584 * ``CCPromoteToType <type>`` --- Promote the current value to the specified
1d019706d866 LLVM10 anatofuz parents: diff changeset	1585 type.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1586
1d019706d866 LLVM10 anatofuz parents: diff changeset	1587 * ``CallingConv <[actions]>`` --- Define each calling convention that is
1d019706d866 LLVM10 anatofuz parents: diff changeset	1588 supported.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1589
1d019706d866 LLVM10 anatofuz parents: diff changeset	1590 Assembly Printer
1d019706d866 LLVM10 anatofuz parents: diff changeset	1591 ================
1d019706d866 LLVM10 anatofuz parents: diff changeset	1592
1d019706d866 LLVM10 anatofuz parents: diff changeset	1593 During the code emission stage, the code generator may utilize an LLVM pass to
1d019706d866 LLVM10 anatofuz parents: diff changeset	1594 produce assembly output. To do this, you want to implement the code for a
1d019706d866 LLVM10 anatofuz parents: diff changeset	1595 printer that converts LLVM IR to a GAS-format assembly language for your target
1d019706d866 LLVM10 anatofuz parents: diff changeset	1596 machine, using the following steps:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1597
1d019706d866 LLVM10 anatofuz parents: diff changeset	1598 * Define all the assembly strings for your target, adding them to the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1599 instructions defined in the ``XXXInstrInfo.td`` file. (See
1d019706d866 LLVM10 anatofuz parents: diff changeset	1600 :ref:`instruction-set`.) TableGen will produce an output file
1d019706d866 LLVM10 anatofuz parents: diff changeset	1601 (``XXXGenAsmWriter.inc``) with an implementation of the ``printInstruction``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1602 method for the ``XXXAsmPrinter`` class.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1603
1d019706d866 LLVM10 anatofuz parents: diff changeset	1604 * Write ``XXXTargetAsmInfo.h``, which contains the bare-bones declaration of
1d019706d866 LLVM10 anatofuz parents: diff changeset	1605 the ``XXXTargetAsmInfo`` class (a subclass of ``TargetAsmInfo``).
1d019706d866 LLVM10 anatofuz parents: diff changeset	1606
1d019706d866 LLVM10 anatofuz parents: diff changeset	1607 * Write ``XXXTargetAsmInfo.cpp``, which contains target-specific values for
1d019706d866 LLVM10 anatofuz parents: diff changeset	1608 ``TargetAsmInfo`` properties and sometimes new implementations for methods.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1609
1d019706d866 LLVM10 anatofuz parents: diff changeset	1610 * Write ``XXXAsmPrinter.cpp``, which implements the ``AsmPrinter`` class that
1d019706d866 LLVM10 anatofuz parents: diff changeset	1611 performs the LLVM-to-assembly conversion.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1612
1d019706d866 LLVM10 anatofuz parents: diff changeset	1613 The code in ``XXXTargetAsmInfo.h`` is usually a trivial declaration of the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1614 ``XXXTargetAsmInfo`` class for use in ``XXXTargetAsmInfo.cpp``. Similarly,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1615 ``XXXTargetAsmInfo.cpp`` usually has a few declarations of ``XXXTargetAsmInfo``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1616 replacement values that override the default values in ``TargetAsmInfo.cpp``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1617 For example in ``SparcTargetAsmInfo.cpp``:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1618
1d019706d866 LLVM10 anatofuz parents: diff changeset	1619 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1620
1d019706d866 LLVM10 anatofuz parents: diff changeset	1621 SparcTargetAsmInfo::SparcTargetAsmInfo(const SparcTargetMachine &TM) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1622 Data16bitsDirective = "\t.half\t";
1d019706d866 LLVM10 anatofuz parents: diff changeset	1623 Data32bitsDirective = "\t.word\t";
1d019706d866 LLVM10 anatofuz parents: diff changeset	1624 Data64bitsDirective = 0; // .xword is only supported by V9.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1625 ZeroDirective = "\t.skip\t";
1d019706d866 LLVM10 anatofuz parents: diff changeset	1626 CommentString = "!";
1d019706d866 LLVM10 anatofuz parents: diff changeset	1627 ConstantPoolSection = "\t.section \".rodata\",#alloc\n";
1d019706d866 LLVM10 anatofuz parents: diff changeset	1628 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1629
1d019706d866 LLVM10 anatofuz parents: diff changeset	1630 The X86 assembly printer implementation (``X86TargetAsmInfo``) is an example
1d019706d866 LLVM10 anatofuz parents: diff changeset	1631 where the target specific ``TargetAsmInfo`` class uses an overridden methods:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1632 ``ExpandInlineAsm``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1633
1d019706d866 LLVM10 anatofuz parents: diff changeset	1634 A target-specific implementation of ``AsmPrinter`` is written in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1635 ``XXXAsmPrinter.cpp``, which implements the ``AsmPrinter`` class that converts
1d019706d866 LLVM10 anatofuz parents: diff changeset	1636 the LLVM to printable assembly. The implementation must include the following
1d019706d866 LLVM10 anatofuz parents: diff changeset	1637 headers that have declarations for the ``AsmPrinter`` and
1d019706d866 LLVM10 anatofuz parents: diff changeset	1638 ``MachineFunctionPass`` classes. The ``MachineFunctionPass`` is a subclass of
1d019706d866 LLVM10 anatofuz parents: diff changeset	1639 ``FunctionPass``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1640
1d019706d866 LLVM10 anatofuz parents: diff changeset	1641 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1642
1d019706d866 LLVM10 anatofuz parents: diff changeset	1643 #include "llvm/CodeGen/AsmPrinter.h"
1d019706d866 LLVM10 anatofuz parents: diff changeset	1644 #include "llvm/CodeGen/MachineFunctionPass.h"
1d019706d866 LLVM10 anatofuz parents: diff changeset	1645
1d019706d866 LLVM10 anatofuz parents: diff changeset	1646 As a ``FunctionPass``, ``AsmPrinter`` first calls ``doInitialization`` to set
1d019706d866 LLVM10 anatofuz parents: diff changeset	1647 up the ``AsmPrinter``. In ``SparcAsmPrinter``, a ``Mangler`` object is
1d019706d866 LLVM10 anatofuz parents: diff changeset	1648 instantiated to process variable names.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1649
1d019706d866 LLVM10 anatofuz parents: diff changeset	1650 In ``XXXAsmPrinter.cpp``, the ``runOnMachineFunction`` method (declared in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1651 ``MachineFunctionPass``) must be implemented for ``XXXAsmPrinter``. In
1d019706d866 LLVM10 anatofuz parents: diff changeset	1652 ``MachineFunctionPass``, the ``runOnFunction`` method invokes
1d019706d866 LLVM10 anatofuz parents: diff changeset	1653 ``runOnMachineFunction``. Target-specific implementations of
1d019706d866 LLVM10 anatofuz parents: diff changeset	1654 ``runOnMachineFunction`` differ, but generally do the following to process each
1d019706d866 LLVM10 anatofuz parents: diff changeset	1655 machine function:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1656
1d019706d866 LLVM10 anatofuz parents: diff changeset	1657 * Call ``SetupMachineFunction`` to perform initialization.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1658
1d019706d866 LLVM10 anatofuz parents: diff changeset	1659 * Call ``EmitConstantPool`` to print out (to the output stream) constants which
1d019706d866 LLVM10 anatofuz parents: diff changeset	1660 have been spilled to memory.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1661
1d019706d866 LLVM10 anatofuz parents: diff changeset	1662 * Call ``EmitJumpTableInfo`` to print out jump tables used by the current
1d019706d866 LLVM10 anatofuz parents: diff changeset	1663 function.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1664
1d019706d866 LLVM10 anatofuz parents: diff changeset	1665 * Print out the label for the current function.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1666
1d019706d866 LLVM10 anatofuz parents: diff changeset	1667 * Print out the code for the function, including basic block labels and the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1668 assembly for the instruction (using ``printInstruction``)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1669
1d019706d866 LLVM10 anatofuz parents: diff changeset	1670 The ``XXXAsmPrinter`` implementation must also include the code generated by
1d019706d866 LLVM10 anatofuz parents: diff changeset	1671 TableGen that is output in the ``XXXGenAsmWriter.inc`` file. The code in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1672 ``XXXGenAsmWriter.inc`` contains an implementation of the ``printInstruction``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1673 method that may call these methods:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1674
1d019706d866 LLVM10 anatofuz parents: diff changeset	1675 * ``printOperand``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1676 * ``printMemOperand``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1677 * ``printCCOperand`` (for conditional statements)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1678 * ``printDataDirective``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1679 * ``printDeclare``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1680 * ``printImplicitDef``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1681 * ``printInlineAsm``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1682
1d019706d866 LLVM10 anatofuz parents: diff changeset	1683 The implementations of ``printDeclare``, ``printImplicitDef``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1684 ``printInlineAsm``, and ``printLabel`` in ``AsmPrinter.cpp`` are generally
1d019706d866 LLVM10 anatofuz parents: diff changeset	1685 adequate for printing assembly and do not need to be overridden.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1686
1d019706d866 LLVM10 anatofuz parents: diff changeset	1687 The ``printOperand`` method is implemented with a long ``switch``/``case``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1688 statement for the type of operand: register, immediate, basic block, external
1d019706d866 LLVM10 anatofuz parents: diff changeset	1689 symbol, global address, constant pool index, or jump table index. For an
1d019706d866 LLVM10 anatofuz parents: diff changeset	1690 instruction with a memory address operand, the ``printMemOperand`` method
1d019706d866 LLVM10 anatofuz parents: diff changeset	1691 should be implemented to generate the proper output. Similarly,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1692 ``printCCOperand`` should be used to print a conditional operand.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1693
1d019706d866 LLVM10 anatofuz parents: diff changeset	1694 ``doFinalization`` should be overridden in ``XXXAsmPrinter``, and it should be
1d019706d866 LLVM10 anatofuz parents: diff changeset	1695 called to shut down the assembly printer. During ``doFinalization``, global
1d019706d866 LLVM10 anatofuz parents: diff changeset	1696 variables and constants are printed to output.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1697
1d019706d866 LLVM10 anatofuz parents: diff changeset	1698 Subtarget Support
1d019706d866 LLVM10 anatofuz parents: diff changeset	1699 =================
1d019706d866 LLVM10 anatofuz parents: diff changeset	1700
1d019706d866 LLVM10 anatofuz parents: diff changeset	1701 Subtarget support is used to inform the code generation process of instruction
1d019706d866 LLVM10 anatofuz parents: diff changeset	1702 set variations for a given chip set. For example, the LLVM SPARC
1d019706d866 LLVM10 anatofuz parents: diff changeset	1703 implementation provided covers three major versions of the SPARC microprocessor
1d019706d866 LLVM10 anatofuz parents: diff changeset	1704 architecture: Version 8 (V8, which is a 32-bit architecture), Version 9 (V9, a
1d019706d866 LLVM10 anatofuz parents: diff changeset	1705 64-bit architecture), and the UltraSPARC architecture. V8 has 16
1d019706d866 LLVM10 anatofuz parents: diff changeset	1706 double-precision floating-point registers that are also usable as either 32
1d019706d866 LLVM10 anatofuz parents: diff changeset	1707 single-precision or 8 quad-precision registers. V8 is also purely big-endian.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1708 V9 has 32 double-precision floating-point registers that are also usable as 16
1d019706d866 LLVM10 anatofuz parents: diff changeset	1709 quad-precision registers, but cannot be used as single-precision registers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1710 The UltraSPARC architecture combines V9 with UltraSPARC Visual Instruction Set
1d019706d866 LLVM10 anatofuz parents: diff changeset	1711 extensions.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1712
1d019706d866 LLVM10 anatofuz parents: diff changeset	1713 If subtarget support is needed, you should implement a target-specific
1d019706d866 LLVM10 anatofuz parents: diff changeset	1714 ``XXXSubtarget`` class for your architecture. This class should process the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1715 command-line options ``-mcpu=`` and ``-mattr=``.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1716
1d019706d866 LLVM10 anatofuz parents: diff changeset	1717 TableGen uses definitions in the ``Target.td`` and ``Sparc.td`` files to
1d019706d866 LLVM10 anatofuz parents: diff changeset	1718 generate code in ``SparcGenSubtarget.inc``. In ``Target.td``, shown below, the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1719 ``SubtargetFeature`` interface is defined. The first 4 string parameters of
1d019706d866 LLVM10 anatofuz parents: diff changeset	1720 the ``SubtargetFeature`` interface are a feature name, an attribute set by the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1721 feature, the value of the attribute, and a description of the feature. (The
1d019706d866 LLVM10 anatofuz parents: diff changeset	1722 fifth parameter is a list of features whose presence is implied, and its
1d019706d866 LLVM10 anatofuz parents: diff changeset	1723 default value is an empty array.)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1724
1d019706d866 LLVM10 anatofuz parents: diff changeset	1725 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	1726
1d019706d866 LLVM10 anatofuz parents: diff changeset	1727 class SubtargetFeature<string n, string a, string v, string d,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1728 list<SubtargetFeature> i = []> {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1729 string Name = n;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1730 string Attribute = a;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1731 string Value = v;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1732 string Desc = d;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1733 list<SubtargetFeature> Implies = i;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1734 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1735
1d019706d866 LLVM10 anatofuz parents: diff changeset	1736 In the ``Sparc.td`` file, the ``SubtargetFeature`` is used to define the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1737 following features.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1738
1d019706d866 LLVM10 anatofuz parents: diff changeset	1739 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	1740
1d019706d866 LLVM10 anatofuz parents: diff changeset	1741 def FeatureV9 : SubtargetFeature<"v9", "IsV9", "true",
1d019706d866 LLVM10 anatofuz parents: diff changeset	1742 "Enable SPARC-V9 instructions">;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1743 def FeatureV8Deprecated : SubtargetFeature<"deprecated-v8",
1d019706d866 LLVM10 anatofuz parents: diff changeset	1744 "V8DeprecatedInsts", "true",
1d019706d866 LLVM10 anatofuz parents: diff changeset	1745 "Enable deprecated V8 instructions in V9 mode">;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1746 def FeatureVIS : SubtargetFeature<"vis", "IsVIS", "true",
1d019706d866 LLVM10 anatofuz parents: diff changeset	1747 "Enable UltraSPARC Visual Instruction Set extensions">;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1748
1d019706d866 LLVM10 anatofuz parents: diff changeset	1749 Elsewhere in ``Sparc.td``, the ``Proc`` class is defined and then is used to
1d019706d866 LLVM10 anatofuz parents: diff changeset	1750 define particular SPARC processor subtypes that may have the previously
1d019706d866 LLVM10 anatofuz parents: diff changeset	1751 described features.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1752
1d019706d866 LLVM10 anatofuz parents: diff changeset	1753 .. code-block:: text
1d019706d866 LLVM10 anatofuz parents: diff changeset	1754
1d019706d866 LLVM10 anatofuz parents: diff changeset	1755 class Proc<string Name, list<SubtargetFeature> Features>
1d019706d866 LLVM10 anatofuz parents: diff changeset	1756 : Processor<Name, NoItineraries, Features>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1757
1d019706d866 LLVM10 anatofuz parents: diff changeset	1758 def : Proc<"generic", []>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1759 def : Proc<"v8", []>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1760 def : Proc<"supersparc", []>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1761 def : Proc<"sparclite", []>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1762 def : Proc<"f934", []>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1763 def : Proc<"hypersparc", []>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1764 def : Proc<"sparclite86x", []>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1765 def : Proc<"sparclet", []>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1766 def : Proc<"tsc701", []>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1767 def : Proc<"v9", [FeatureV9]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1768 def : Proc<"ultrasparc", [FeatureV9, FeatureV8Deprecated]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1769 def : Proc<"ultrasparc3", [FeatureV9, FeatureV8Deprecated]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1770 def : Proc<"ultrasparc3-vis", [FeatureV9, FeatureV8Deprecated, FeatureVIS]>;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1771
1d019706d866 LLVM10 anatofuz parents: diff changeset	1772 From ``Target.td`` and ``Sparc.td`` files, the resulting
1d019706d866 LLVM10 anatofuz parents: diff changeset	1773 ``SparcGenSubtarget.inc`` specifies enum values to identify the features,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1774 arrays of constants to represent the CPU features and CPU subtypes, and the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1775 ``ParseSubtargetFeatures`` method that parses the features string that sets
1d019706d866 LLVM10 anatofuz parents: diff changeset	1776 specified subtarget options. The generated ``SparcGenSubtarget.inc`` file
1d019706d866 LLVM10 anatofuz parents: diff changeset	1777 should be included in the ``SparcSubtarget.cpp``. The target-specific
1d019706d866 LLVM10 anatofuz parents: diff changeset	1778 implementation of the ``XXXSubtarget`` method should follow this pseudocode:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1779
1d019706d866 LLVM10 anatofuz parents: diff changeset	1780 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1781
1d019706d866 LLVM10 anatofuz parents: diff changeset	1782 XXXSubtarget::XXXSubtarget(const Module &M, const std::string &FS) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1783 // Set the default features
1d019706d866 LLVM10 anatofuz parents: diff changeset	1784 // Determine default and user specified characteristics of the CPU
1d019706d866 LLVM10 anatofuz parents: diff changeset	1785 // Call ParseSubtargetFeatures(FS, CPU) to parse the features string
1d019706d866 LLVM10 anatofuz parents: diff changeset	1786 // Perform any additional operations
1d019706d866 LLVM10 anatofuz parents: diff changeset	1787 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1788
1d019706d866 LLVM10 anatofuz parents: diff changeset	1789 JIT Support
1d019706d866 LLVM10 anatofuz parents: diff changeset	1790 ===========
1d019706d866 LLVM10 anatofuz parents: diff changeset	1791
1d019706d866 LLVM10 anatofuz parents: diff changeset	1792 The implementation of a target machine optionally includes a Just-In-Time (JIT)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1793 code generator that emits machine code and auxiliary structures as binary
1d019706d866 LLVM10 anatofuz parents: diff changeset	1794 output that can be written directly to memory. To do this, implement JIT code
1d019706d866 LLVM10 anatofuz parents: diff changeset	1795 generation by performing the following steps:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1796
1d019706d866 LLVM10 anatofuz parents: diff changeset	1797 * Write an ``XXXCodeEmitter.cpp`` file that contains a machine function pass
1d019706d866 LLVM10 anatofuz parents: diff changeset	1798 that transforms target-machine instructions into relocatable machine
1d019706d866 LLVM10 anatofuz parents: diff changeset	1799 code.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1800
1d019706d866 LLVM10 anatofuz parents: diff changeset	1801 * Write an ``XXXJITInfo.cpp`` file that implements the JIT interfaces for
1d019706d866 LLVM10 anatofuz parents: diff changeset	1802 target-specific code-generation activities, such as emitting machine code and
1d019706d866 LLVM10 anatofuz parents: diff changeset	1803 stubs.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1804
1d019706d866 LLVM10 anatofuz parents: diff changeset	1805 * Modify ``XXXTargetMachine`` so that it provides a ``TargetJITInfo`` object
1d019706d866 LLVM10 anatofuz parents: diff changeset	1806 through its ``getJITInfo`` method.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1807
1d019706d866 LLVM10 anatofuz parents: diff changeset	1808 There are several different approaches to writing the JIT support code. For
1d019706d866 LLVM10 anatofuz parents: diff changeset	1809 instance, TableGen and target descriptor files may be used for creating a JIT
1d019706d866 LLVM10 anatofuz parents: diff changeset	1810 code generator, but are not mandatory. For the Alpha and PowerPC target
1d019706d866 LLVM10 anatofuz parents: diff changeset	1811 machines, TableGen is used to generate ``XXXGenCodeEmitter.inc``, which
1d019706d866 LLVM10 anatofuz parents: diff changeset	1812 contains the binary coding of machine instructions and the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1813 ``getBinaryCodeForInstr`` method to access those codes. Other JIT
1d019706d866 LLVM10 anatofuz parents: diff changeset	1814 implementations do not.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1815
1d019706d866 LLVM10 anatofuz parents: diff changeset	1816 Both ``XXXJITInfo.cpp`` and ``XXXCodeEmitter.cpp`` must include the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1817 ``llvm/CodeGen/MachineCodeEmitter.h`` header file that defines the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1818 ``MachineCodeEmitter`` class containing code for several callback functions
1d019706d866 LLVM10 anatofuz parents: diff changeset	1819 that write data (in bytes, words, strings, etc.) to the output stream.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1820
1d019706d866 LLVM10 anatofuz parents: diff changeset	1821 Machine Code Emitter
1d019706d866 LLVM10 anatofuz parents: diff changeset	1822 --------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	1823
1d019706d866 LLVM10 anatofuz parents: diff changeset	1824 In ``XXXCodeEmitter.cpp``, a target-specific of the ``Emitter`` class is
1d019706d866 LLVM10 anatofuz parents: diff changeset	1825 implemented as a function pass (subclass of ``MachineFunctionPass``). The
1d019706d866 LLVM10 anatofuz parents: diff changeset	1826 target-specific implementation of ``runOnMachineFunction`` (invoked by
1d019706d866 LLVM10 anatofuz parents: diff changeset	1827 ``runOnFunction`` in ``MachineFunctionPass``) iterates through the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1828 ``MachineBasicBlock`` calls ``emitInstruction`` to process each instruction and
1d019706d866 LLVM10 anatofuz parents: diff changeset	1829 emit binary code. ``emitInstruction`` is largely implemented with case
1d019706d866 LLVM10 anatofuz parents: diff changeset	1830 statements on the instruction types defined in ``XXXInstrInfo.h``. For
1d019706d866 LLVM10 anatofuz parents: diff changeset	1831 example, in ``X86CodeEmitter.cpp``, the ``emitInstruction`` method is built
1d019706d866 LLVM10 anatofuz parents: diff changeset	1832 around the following ``switch``/``case`` statements:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1833
1d019706d866 LLVM10 anatofuz parents: diff changeset	1834 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1835
1d019706d866 LLVM10 anatofuz parents: diff changeset	1836 switch (Desc->TSFlags & X86::FormMask) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1837 case X86II::Pseudo: // for not yet implemented instructions
1d019706d866 LLVM10 anatofuz parents: diff changeset	1838 ... // or pseudo-instructions
1d019706d866 LLVM10 anatofuz parents: diff changeset	1839 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1840 case X86II::RawFrm: // for instructions with a fixed opcode value
1d019706d866 LLVM10 anatofuz parents: diff changeset	1841 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	1842 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1843 case X86II::AddRegFrm: // for instructions that have one register operand
1d019706d866 LLVM10 anatofuz parents: diff changeset	1844 ... // added to their opcode
1d019706d866 LLVM10 anatofuz parents: diff changeset	1845 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1846 case X86II::MRMDestReg:// for instructions that use the Mod/RM byte
1d019706d866 LLVM10 anatofuz parents: diff changeset	1847 ... // to specify a destination (register)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1848 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1849 case X86II::MRMDestMem:// for instructions that use the Mod/RM byte
1d019706d866 LLVM10 anatofuz parents: diff changeset	1850 ... // to specify a destination (memory)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1851 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1852 case X86II::MRMSrcReg: // for instructions that use the Mod/RM byte
1d019706d866 LLVM10 anatofuz parents: diff changeset	1853 ... // to specify a source (register)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1854 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1855 case X86II::MRMSrcMem: // for instructions that use the Mod/RM byte
1d019706d866 LLVM10 anatofuz parents: diff changeset	1856 ... // to specify a source (memory)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1857 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1858 case X86II::MRM0r: case X86II::MRM1r: // for instructions that operate on
1d019706d866 LLVM10 anatofuz parents: diff changeset	1859 case X86II::MRM2r: case X86II::MRM3r: // a REGISTER r/m operand and
1d019706d866 LLVM10 anatofuz parents: diff changeset	1860 case X86II::MRM4r: case X86II::MRM5r: // use the Mod/RM byte and a field
1d019706d866 LLVM10 anatofuz parents: diff changeset	1861 case X86II::MRM6r: case X86II::MRM7r: // to hold extended opcode data
1d019706d866 LLVM10 anatofuz parents: diff changeset	1862 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	1863 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1864 case X86II::MRM0m: case X86II::MRM1m: // for instructions that operate on
1d019706d866 LLVM10 anatofuz parents: diff changeset	1865 case X86II::MRM2m: case X86II::MRM3m: // a MEMORY r/m operand and
1d019706d866 LLVM10 anatofuz parents: diff changeset	1866 case X86II::MRM4m: case X86II::MRM5m: // use the Mod/RM byte and a field
1d019706d866 LLVM10 anatofuz parents: diff changeset	1867 case X86II::MRM6m: case X86II::MRM7m: // to hold extended opcode data
1d019706d866 LLVM10 anatofuz parents: diff changeset	1868 ...
1d019706d866 LLVM10 anatofuz parents: diff changeset	1869 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1870 case X86II::MRMInitReg: // for instructions whose source and
1d019706d866 LLVM10 anatofuz parents: diff changeset	1871 ... // destination are the same register
1d019706d866 LLVM10 anatofuz parents: diff changeset	1872 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1873 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1874
1d019706d866 LLVM10 anatofuz parents: diff changeset	1875 The implementations of these case statements often first emit the opcode and
1d019706d866 LLVM10 anatofuz parents: diff changeset	1876 then get the operand(s). Then depending upon the operand, helper methods may
1d019706d866 LLVM10 anatofuz parents: diff changeset	1877 be called to process the operand(s). For example, in ``X86CodeEmitter.cpp``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1878 for the ``X86II::AddRegFrm`` case, the first data emitted (by ``emitByte``) is
1d019706d866 LLVM10 anatofuz parents: diff changeset	1879 the opcode added to the register operand. Then an object representing the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1880 machine operand, ``MO1``, is extracted. The helper methods such as
1d019706d866 LLVM10 anatofuz parents: diff changeset	1881 ``isImmediate``, ``isGlobalAddress``, ``isExternalSymbol``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1882 ``isConstantPoolIndex``, and ``isJumpTableIndex`` determine the operand type.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1883 (``X86CodeEmitter.cpp`` also has private methods such as ``emitConstant``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1884 ``emitGlobalAddress``, ``emitExternalSymbolAddress``, ``emitConstPoolAddress``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1885 and ``emitJumpTableAddress`` that emit the data into the output stream.)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1886
1d019706d866 LLVM10 anatofuz parents: diff changeset	1887 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1888
1d019706d866 LLVM10 anatofuz parents: diff changeset	1889 case X86II::AddRegFrm:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1890 MCE.emitByte(BaseOpcode + getX86RegNum(MI.getOperand(CurOp++).getReg()));
1d019706d866 LLVM10 anatofuz parents: diff changeset	1891
1d019706d866 LLVM10 anatofuz parents: diff changeset	1892 if (CurOp != NumOps) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1893 const MachineOperand &MO1 = MI.getOperand(CurOp++);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1894 unsigned Size = X86InstrInfo::sizeOfImm(Desc);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1895 if (MO1.isImmediate())
1d019706d866 LLVM10 anatofuz parents: diff changeset	1896 emitConstant(MO1.getImm(), Size);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1897 else {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1898 unsigned rt = Is64BitMode ? X86::reloc_pcrel_word
1d019706d866 LLVM10 anatofuz parents: diff changeset	1899 : (IsPIC ? X86::reloc_picrel_word : X86::reloc_absolute_word);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1900 if (Opcode == X86::MOV64ri)
1d019706d866 LLVM10 anatofuz parents: diff changeset	1901 rt = X86::reloc_absolute_dword; // FIXME: add X86II flag?
1d019706d866 LLVM10 anatofuz parents: diff changeset	1902 if (MO1.isGlobalAddress()) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1903 bool NeedStub = isa<Function>(MO1.getGlobal());
1d019706d866 LLVM10 anatofuz parents: diff changeset	1904 bool isLazy = gvNeedsLazyPtr(MO1.getGlobal());
1d019706d866 LLVM10 anatofuz parents: diff changeset	1905 emitGlobalAddress(MO1.getGlobal(), rt, MO1.getOffset(), 0,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1906 NeedStub, isLazy);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1907 } else if (MO1.isExternalSymbol())
1d019706d866 LLVM10 anatofuz parents: diff changeset	1908 emitExternalSymbolAddress(MO1.getSymbolName(), rt);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1909 else if (MO1.isConstantPoolIndex())
1d019706d866 LLVM10 anatofuz parents: diff changeset	1910 emitConstPoolAddress(MO1.getIndex(), rt);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1911 else if (MO1.isJumpTableIndex())
1d019706d866 LLVM10 anatofuz parents: diff changeset	1912 emitJumpTableAddress(MO1.getIndex(), rt);
1d019706d866 LLVM10 anatofuz parents: diff changeset	1913 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1914 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1915 break;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1916
1d019706d866 LLVM10 anatofuz parents: diff changeset	1917 In the previous example, ``XXXCodeEmitter.cpp`` uses the variable ``rt``, which
1d019706d866 LLVM10 anatofuz parents: diff changeset	1918 is a ``RelocationType`` enum that may be used to relocate addresses (for
1d019706d866 LLVM10 anatofuz parents: diff changeset	1919 example, a global address with a PIC base offset). The ``RelocationType`` enum
1d019706d866 LLVM10 anatofuz parents: diff changeset	1920 for that target is defined in the short target-specific ``XXXRelocations.h``
1d019706d866 LLVM10 anatofuz parents: diff changeset	1921 file. The ``RelocationType`` is used by the ``relocate`` method defined in
1d019706d866 LLVM10 anatofuz parents: diff changeset	1922 ``XXXJITInfo.cpp`` to rewrite addresses for referenced global symbols.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1923
1d019706d866 LLVM10 anatofuz parents: diff changeset	1924 For example, ``X86Relocations.h`` specifies the following relocation types for
1d019706d866 LLVM10 anatofuz parents: diff changeset	1925 the X86 addresses. In all four cases, the relocated value is added to the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1926 value already in memory. For ``reloc_pcrel_word`` and ``reloc_picrel_word``,
1d019706d866 LLVM10 anatofuz parents: diff changeset	1927 there is an additional initial adjustment.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1928
1d019706d866 LLVM10 anatofuz parents: diff changeset	1929 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1930
1d019706d866 LLVM10 anatofuz parents: diff changeset	1931 enum RelocationType {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1932 reloc_pcrel_word = 0, // add reloc value after adjusting for the PC loc
1d019706d866 LLVM10 anatofuz parents: diff changeset	1933 reloc_picrel_word = 1, // add reloc value after adjusting for the PIC base
1d019706d866 LLVM10 anatofuz parents: diff changeset	1934 reloc_absolute_word = 2, // absolute relocation; no additional adjustment
1d019706d866 LLVM10 anatofuz parents: diff changeset	1935 reloc_absolute_dword = 3 // absolute relocation; no additional adjustment
1d019706d866 LLVM10 anatofuz parents: diff changeset	1936 };
1d019706d866 LLVM10 anatofuz parents: diff changeset	1937
1d019706d866 LLVM10 anatofuz parents: diff changeset	1938 Target JIT Info
1d019706d866 LLVM10 anatofuz parents: diff changeset	1939 ---------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	1940
1d019706d866 LLVM10 anatofuz parents: diff changeset	1941 ``XXXJITInfo.cpp`` implements the JIT interfaces for target-specific
1d019706d866 LLVM10 anatofuz parents: diff changeset	1942 code-generation activities, such as emitting machine code and stubs. At
1d019706d866 LLVM10 anatofuz parents: diff changeset	1943 minimum, a target-specific version of ``XXXJITInfo`` implements the following:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1944
1d019706d866 LLVM10 anatofuz parents: diff changeset	1945 * ``getLazyResolverFunction`` --- Initializes the JIT, gives the target a
1d019706d866 LLVM10 anatofuz parents: diff changeset	1946 function that is used for compilation.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1947
1d019706d866 LLVM10 anatofuz parents: diff changeset	1948 * ``emitFunctionStub`` --- Returns a native function with a specified address
1d019706d866 LLVM10 anatofuz parents: diff changeset	1949 for a callback function.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1950
1d019706d866 LLVM10 anatofuz parents: diff changeset	1951 * ``relocate`` --- Changes the addresses of referenced globals, based on
1d019706d866 LLVM10 anatofuz parents: diff changeset	1952 relocation types.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1953
1d019706d866 LLVM10 anatofuz parents: diff changeset	1954 * Callback function that are wrappers to a function stub that is used when the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1955 real target is not initially known.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1956
1d019706d866 LLVM10 anatofuz parents: diff changeset	1957 ``getLazyResolverFunction`` is generally trivial to implement. It makes the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1958 incoming parameter as the global ``JITCompilerFunction`` and returns the
1d019706d866 LLVM10 anatofuz parents: diff changeset	1959 callback function that will be used a function wrapper. For the Alpha target
1d019706d866 LLVM10 anatofuz parents: diff changeset	1960 (in ``AlphaJITInfo.cpp``), the ``getLazyResolverFunction`` implementation is
1d019706d866 LLVM10 anatofuz parents: diff changeset	1961 simply:
1d019706d866 LLVM10 anatofuz parents: diff changeset	1962
1d019706d866 LLVM10 anatofuz parents: diff changeset	1963 .. code-block:: c++
1d019706d866 LLVM10 anatofuz parents: diff changeset	1964
1d019706d866 LLVM10 anatofuz parents: diff changeset	1965 TargetJITInfo::LazyResolverFn AlphaJITInfo::getLazyResolverFunction(
1d019706d866 LLVM10 anatofuz parents: diff changeset	1966 JITCompilerFn F) {
1d019706d866 LLVM10 anatofuz parents: diff changeset	1967 JITCompilerFunction = F;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1968 return AlphaCompilationCallback;
1d019706d866 LLVM10 anatofuz parents: diff changeset	1969 }
1d019706d866 LLVM10 anatofuz parents: diff changeset	1970
1d019706d866 LLVM10 anatofuz parents: diff changeset	1971 For the X86 target, the ``getLazyResolverFunction`` implementation is a little
1d019706d866 LLVM10 anatofuz parents: diff changeset	1972 more complicated, because it returns a different callback function for
1d019706d866 LLVM10 anatofuz parents: diff changeset	1973 processors with SSE instructions and XMM registers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1974
1d019706d866 LLVM10 anatofuz parents: diff changeset	1975 The callback function initially saves and later restores the callee register
1d019706d866 LLVM10 anatofuz parents: diff changeset	1976 values, incoming arguments, and frame and return address. The callback
1d019706d866 LLVM10 anatofuz parents: diff changeset	1977 function needs low-level access to the registers or stack, so it is typically
1d019706d866 LLVM10 anatofuz parents: diff changeset	1978 implemented with assembler.
1d019706d866 LLVM10 anatofuz parents: diff changeset	1979

Mercurial > hg > CbC > CbC_llvm

annotate llvm/docs/WritingAnLLVMBackend.rst @ 164:fdfabb438fbf