CbC/CbC_llvm: lld/docs/NewLLD.rst annotate

annotate lld/docs/NewLLD.rst @ 154:f7e988d3e4cc

fix def file

author	anatofuz
date	Wed, 11 Mar 2020 19:23:03 +0900
parents	1d019706d866
children	0572611fdcc8

rev	line source
150 1d019706d866 LLVM10 anatofuz parents: diff changeset	1 The ELF, COFF and Wasm Linkers
1d019706d866 LLVM10 anatofuz parents: diff changeset	2 ==============================
1d019706d866 LLVM10 anatofuz parents: diff changeset	3
1d019706d866 LLVM10 anatofuz parents: diff changeset	4 The ELF Linker as a Library
1d019706d866 LLVM10 anatofuz parents: diff changeset	5 ---------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	6
1d019706d866 LLVM10 anatofuz parents: diff changeset	7 You can embed LLD in your program by linking against it and calling the linker's
1d019706d866 LLVM10 anatofuz parents: diff changeset	8 entry point function lld::elf::link.
1d019706d866 LLVM10 anatofuz parents: diff changeset	9
1d019706d866 LLVM10 anatofuz parents: diff changeset	10 The current policy is that it is your responsibility to give trustworthy object
1d019706d866 LLVM10 anatofuz parents: diff changeset	11 files. The function is guaranteed to return as long as you do not pass corrupted
1d019706d866 LLVM10 anatofuz parents: diff changeset	12 or malicious object files. A corrupted file could cause a fatal error or SEGV.
1d019706d866 LLVM10 anatofuz parents: diff changeset	13 That being said, you don't need to worry too much about it if you create object
1d019706d866 LLVM10 anatofuz parents: diff changeset	14 files in the usual way and give them to the linker. It is naturally expected to
1d019706d866 LLVM10 anatofuz parents: diff changeset	15 work, or otherwise it's a linker's bug.
1d019706d866 LLVM10 anatofuz parents: diff changeset	16
1d019706d866 LLVM10 anatofuz parents: diff changeset	17 Design
1d019706d866 LLVM10 anatofuz parents: diff changeset	18 ======
1d019706d866 LLVM10 anatofuz parents: diff changeset	19
1d019706d866 LLVM10 anatofuz parents: diff changeset	20 We will describe the design of the linkers in the rest of the document.
1d019706d866 LLVM10 anatofuz parents: diff changeset	21
1d019706d866 LLVM10 anatofuz parents: diff changeset	22 Key Concepts
1d019706d866 LLVM10 anatofuz parents: diff changeset	23 ------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	24
1d019706d866 LLVM10 anatofuz parents: diff changeset	25 Linkers are fairly large pieces of software.
1d019706d866 LLVM10 anatofuz parents: diff changeset	26 There are many design choices you have to make to create a complete linker.
1d019706d866 LLVM10 anatofuz parents: diff changeset	27
1d019706d866 LLVM10 anatofuz parents: diff changeset	28 This is a list of design choices we've made for ELF and COFF LLD.
1d019706d866 LLVM10 anatofuz parents: diff changeset	29 We believe that these high-level design choices achieve the right balance
1d019706d866 LLVM10 anatofuz parents: diff changeset	30 between speed, simplicity and extensibility.
1d019706d866 LLVM10 anatofuz parents: diff changeset	31
1d019706d866 LLVM10 anatofuz parents: diff changeset	32 * Implement as native linkers
1d019706d866 LLVM10 anatofuz parents: diff changeset	33
1d019706d866 LLVM10 anatofuz parents: diff changeset	34 We implemented the linkers as native linkers for each file format.
1d019706d866 LLVM10 anatofuz parents: diff changeset	35
1d019706d866 LLVM10 anatofuz parents: diff changeset	36 The linkers share the same design but share very little code.
1d019706d866 LLVM10 anatofuz parents: diff changeset	37 Sharing code makes sense if the benefit is worth its cost.
1d019706d866 LLVM10 anatofuz parents: diff changeset	38 In our case, the object formats are different enough that we thought the layer
1d019706d866 LLVM10 anatofuz parents: diff changeset	39 to abstract the differences wouldn't be worth its complexity and run-time
1d019706d866 LLVM10 anatofuz parents: diff changeset	40 cost. Elimination of the abstract layer has greatly simplified the
1d019706d866 LLVM10 anatofuz parents: diff changeset	41 implementation.
1d019706d866 LLVM10 anatofuz parents: diff changeset	42
1d019706d866 LLVM10 anatofuz parents: diff changeset	43 * Speed by design
1d019706d866 LLVM10 anatofuz parents: diff changeset	44
1d019706d866 LLVM10 anatofuz parents: diff changeset	45 One of the most important things in achieving high performance is to
1d019706d866 LLVM10 anatofuz parents: diff changeset	46 do less rather than do it efficiently.
1d019706d866 LLVM10 anatofuz parents: diff changeset	47 Therefore, the high-level design matters more than local optimizations.
1d019706d866 LLVM10 anatofuz parents: diff changeset	48 Since we are trying to create a high-performance linker,
1d019706d866 LLVM10 anatofuz parents: diff changeset	49 it is very important to keep the design as efficient as possible.
1d019706d866 LLVM10 anatofuz parents: diff changeset	50
1d019706d866 LLVM10 anatofuz parents: diff changeset	51 Broadly speaking, we do not do anything until we have to do it.
1d019706d866 LLVM10 anatofuz parents: diff changeset	52 For example, we do not read section contents or relocations
1d019706d866 LLVM10 anatofuz parents: diff changeset	53 until we need them to continue linking.
1d019706d866 LLVM10 anatofuz parents: diff changeset	54 When we need to do some costly operation (such as looking up
1d019706d866 LLVM10 anatofuz parents: diff changeset	55 a hash table for each symbol), we do it only once.
1d019706d866 LLVM10 anatofuz parents: diff changeset	56 We obtain a handle (which is typically just a pointer to actual data)
1d019706d866 LLVM10 anatofuz parents: diff changeset	57 on the first operation and use it throughout the process.
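
The "look it up once, then keep a handle" idea can be sketched as follows. This is a minimal illustration, not LLD's code: the ``Symbol``, ``SymbolTable`` and ``Relocation`` types and the ``hashLookups`` counter are hypothetical names invented for the example.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>

// Hypothetical, simplified symbol record; not LLD's actual Symbol class.
struct Symbol {
  std::string name;
  bool isDefined;
};

class SymbolTable {
public:
  // The only place where a name is hashed. The returned pointer is the
  // "handle" that the rest of the link uses instead of the name.
  Symbol *intern(const std::string &name) {
    assert(!name.empty() && "symbol names are expected to be non-empty");
    ++hashLookups;
    auto result = map.emplace(name, Symbol{name, false});
    return &result.first->second;
  }

  unsigned hashLookups = 0; // counts how often the table was consulted

private:
  // std::unordered_map keeps pointers to its elements valid across
  // rehashing, so handles stay usable while more symbols are added.
  std::unordered_map<std::string, Symbol> map;
};

// Hypothetical relocation that refers to its target by handle, not by name.
struct Relocation {
  Symbol *target;
};
```

Every relocation that stores the ``Symbol *`` can later read the resolution result with a plain pointer dereference, so the cost of hashing a long mangled name is paid once per symbol rather than once per use.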
1d019706d866 LLVM10 anatofuz parents: diff changeset	58
1d019706d866 LLVM10 anatofuz parents: diff changeset	59 * Efficient archive file handling
1d019706d866 LLVM10 anatofuz parents: diff changeset	60
1d019706d866 LLVM10 anatofuz parents: diff changeset	61 LLD's handling of archive files (the files with ".a" file extension) is
1d019706d866 LLVM10 anatofuz parents: diff changeset	62 different from the traditional Unix linkers and similar to Windows linkers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	63 We'll describe how the traditional Unix linker handles archive files, what the
1d019706d866 LLVM10 anatofuz parents: diff changeset	64 problem is, and how LLD approached the problem.
1d019706d866 LLVM10 anatofuz parents: diff changeset	65
1d019706d866 LLVM10 anatofuz parents: diff changeset	66 The traditional Unix linker maintains a set of undefined symbols during
1d019706d866 LLVM10 anatofuz parents: diff changeset	67 linking. The linker visits each file in the order they appear on the
1d019706d866 LLVM10 anatofuz parents: diff changeset	68 command line until the set becomes empty. What the linker does depends on the
1d019706d866 LLVM10 anatofuz parents: diff changeset	69 file type.
1d019706d866 LLVM10 anatofuz parents: diff changeset	70
1d019706d866 LLVM10 anatofuz parents: diff changeset	71 - If the linker visits an object file, it links the object file into the
1d019706d866 LLVM10 anatofuz parents: diff changeset	72 result, and undefined symbols in the object file are added to the set.
1d019706d866 LLVM10 anatofuz parents: diff changeset	73
1d019706d866 LLVM10 anatofuz parents: diff changeset	74 - If the linker visits an archive file, it checks the archive file's
1d019706d866 LLVM10 anatofuz parents: diff changeset	75 symbol table and extracts all object files that have definitions for any
1d019706d866 LLVM10 anatofuz parents: diff changeset	76 symbols in the set.
1d019706d866 LLVM10 anatofuz parents: diff changeset	77
1d019706d866 LLVM10 anatofuz parents: diff changeset	78 This algorithm sometimes leads to counter-intuitive behavior. If you give
1d019706d866 LLVM10 anatofuz parents: diff changeset	79 archive files before object files, nothing will happen because when the linker
1d019706d866 LLVM10 anatofuz parents: diff changeset	80 visits archives, there are no undefined symbols in the set. As a result, no
1d019706d866 LLVM10 anatofuz parents: diff changeset	81 files are extracted from the first archive file, and the link is done at that
1d019706d866 LLVM10 anatofuz parents: diff changeset	82 point because the set is empty after it visits one file.
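
The order dependence described above can be reproduced with a small model of the traditional algorithm. This is only a sketch under simplifying assumptions: ``ObjectFile``, ``Input`` and ``traditionalLink`` are hypothetical names, symbols are plain strings, and the model always visits every input instead of stopping early.

```cpp
#include <cassert>
#include <set>
#include <string>
#include <vector>

// Hypothetical, simplified model of link inputs.
struct ObjectFile {
  std::string name;
  std::set<std::string> defined;
  std::set<std::string> undefined;
};

struct Input {
  bool isArchive;
  std::vector<ObjectFile> members; // exactly one member for a plain object
};

// Returns the names of the object files that end up in the link, in order.
std::vector<std::string> traditionalLink(const std::vector<Input> &inputs) {
  std::set<std::string> definedSoFar, undefinedSet;
  std::vector<std::string> linked;

  auto addObject = [&](const ObjectFile &obj) {
    linked.push_back(obj.name);
    for (const std::string &s : obj.defined) {
      definedSoFar.insert(s);
      undefinedSet.erase(s);
    }
    for (const std::string &s : obj.undefined)
      if (!definedSoFar.count(s))
        undefinedSet.insert(s);
  };

  for (const Input &in : inputs) {
    if (!in.isArchive) {
      assert(in.members.size() == 1);
      addObject(in.members.front());
      continue;
    }
    // Archive: extract only members that define a symbol that is undefined
    // right now. Repeat, because an extracted member may need more symbols.
    std::set<std::string> taken;
    bool extractedAny = true;
    while (extractedAny) {
      extractedAny = false;
      for (const ObjectFile &m : in.members) {
        if (taken.count(m.name))
          continue;
        bool needed = false;
        for (const std::string &s : m.defined)
          if (undefinedSet.count(s))
            needed = true;
        if (needed) {
          taken.insert(m.name);
          addObject(m);
          extractedAny = true;
        }
      }
    }
  }
  return linked;
}
```

With ``main.o`` (which needs ``foo``) and an archive containing ``foo.o``, passing the object first links both files, while passing the archive first links only ``main.o`` and leaves ``foo`` undefined.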
1d019706d866 LLVM10 anatofuz parents: diff changeset	83
1d019706d866 LLVM10 anatofuz parents: diff changeset	84 You can fix the problem by reordering the files,
1d019706d866 LLVM10 anatofuz parents: diff changeset	85 but that cannot fix the issue of mutually-dependent archive files.
1d019706d866 LLVM10 anatofuz parents: diff changeset	86
1d019706d866 LLVM10 anatofuz parents: diff changeset	87 Linking mutually-dependent archive files is tricky. You may specify the same
1d019706d866 LLVM10 anatofuz parents: diff changeset	88 archive file multiple times to let the linker visit it more than once. Or,
1d019706d866 LLVM10 anatofuz parents: diff changeset	89 you may use the special command line options, `--start-group` and
1d019706d866 LLVM10 anatofuz parents: diff changeset	90 `--end-group`, to let the linker loop over the files between the options until
1d019706d866 LLVM10 anatofuz parents: diff changeset	91 no new symbols are added to the set.
1d019706d866 LLVM10 anatofuz parents: diff changeset	92
1d019706d866 LLVM10 anatofuz parents: diff changeset	93 Visiting the same archive files multiple times makes the linker slower.
1d019706d866 LLVM10 anatofuz parents: diff changeset	94
1d019706d866 LLVM10 anatofuz parents: diff changeset	95 Here is how LLD approaches the problem. Instead of remembering only undefined
1d019706d866 LLVM10 anatofuz parents: diff changeset	96 symbols, we program LLD so that it remembers all symbols. When it sees an
1d019706d866 LLVM10 anatofuz parents: diff changeset	97 undefined symbol that can be resolved by extracting an object file from an
1d019706d866 LLVM10 anatofuz parents: diff changeset	98 archive file it previously visited, it immediately extracts the file and links
1d019706d866 LLVM10 anatofuz parents: diff changeset	99 it. It is doable because LLD does not forget symbols it has seen in archive
1d019706d866 LLVM10 anatofuz parents: diff changeset	100 files.
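
The approach described above can be modeled in a few lines. The sketch below is an illustration under stated assumptions, not LLD's implementation: ``Linker``, ``Entry`` and the other names are hypothetical, symbols are plain strings, and the inputs must outlive the ``Linker`` because it stores pointers to archive members.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical, simplified model of link inputs.
struct ObjectFile {
  std::string name;
  std::set<std::string> defined;
  std::set<std::string> undefined;
};

struct Input {
  bool isArchive;
  std::vector<ObjectFile> members; // exactly one member for a plain object
};

class Linker {
public:
  std::vector<std::string> linked; // object files in the order they were linked

  void addInput(const Input &in) {
    if (!in.isArchive) {
      assert(in.members.size() == 1);
      addObject(in.members.front());
      return;
    }
    // Remember every symbol the archive offers, without extracting anything
    // unless the symbol is already known to be undefined.
    for (const ObjectFile &m : in.members)
      for (const std::string &s : m.defined)
        addLazy(s, &m);
  }

  bool hasUndefined() const {
    for (const auto &kv : table)
      if (kv.second.kind == Undefined)
        return true;
    return false;
  }

private:
  enum Kind { Undefined, Lazy, Defined };
  struct Entry {
    Kind kind;
    const ObjectFile *member; // archive member to extract; used only for Lazy
  };
  std::map<std::string, Entry> table;
  std::set<const ObjectFile *> extracted;

  void addObject(const ObjectFile &obj) {
    if (!extracted.insert(&obj).second)
      return; // already linked
    linked.push_back(obj.name);
    for (const std::string &s : obj.defined)
      table[s] = Entry{Defined, nullptr};
    for (const std::string &s : obj.undefined) {
      auto it = table.emplace(s, Entry{Undefined, nullptr}).first;
      if (it->second.kind == Lazy)
        addObject(*it->second.member); // extract the remembered member now
    }
  }

  void addLazy(const std::string &s, const ObjectFile *m) {
    auto it = table.find(s);
    if (it == table.end())
      table.emplace(s, Entry{Lazy, m});
    else if (it->second.kind == Undefined)
      addObject(*m);
  }
};
```

In this model, giving an archive before the object file that needs it still links both files, because the archive's symbols are remembered as lazy entries and the later undefined reference triggers the extraction.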
1d019706d866 LLVM10 anatofuz parents: diff changeset	101
1d019706d866 LLVM10 anatofuz parents: diff changeset	102 We believe that LLD's way is efficient and easy to justify.
1d019706d866 LLVM10 anatofuz parents: diff changeset	103
1d019706d866 LLVM10 anatofuz parents: diff changeset	104 The semantics of LLD's archive handling differ from those of traditional
1d019706d866 LLVM10 anatofuz parents: diff changeset	105 Unix linkers. You can observe the difference if you carefully craft archive
1d019706d866 LLVM10 anatofuz parents: diff changeset	106 files to exploit it. In practice, however, we do not know of any program that
1d019706d866 LLVM10 anatofuz parents: diff changeset	107 fails to link with our algorithm, so it is unlikely to cause trouble.
1d019706d866 LLVM10 anatofuz parents: diff changeset	108
1d019706d866 LLVM10 anatofuz parents: diff changeset	109 Numbers You Want to Know
1d019706d866 LLVM10 anatofuz parents: diff changeset	110 ------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	111
1d019706d866 LLVM10 anatofuz parents: diff changeset	112 To give you intuition about what kinds of data the linker is mainly working on,
1d019706d866 LLVM10 anatofuz parents: diff changeset	113 I'll give you the list of objects and their numbers LLD has to read and process
1d019706d866 LLVM10 anatofuz parents: diff changeset	114 in order to link a very large executable. In order to link Chrome with debug
1d019706d866 LLVM10 anatofuz parents: diff changeset	115 info, which is roughly 2 GB in output size, LLD reads
1d019706d866 LLVM10 anatofuz parents: diff changeset	116
1d019706d866 LLVM10 anatofuz parents: diff changeset	117 - 17,000 files,
1d019706d866 LLVM10 anatofuz parents: diff changeset	118 - 1,800,000 sections,
1d019706d866 LLVM10 anatofuz parents: diff changeset	119 - 6,300,000 symbols, and
1d019706d866 LLVM10 anatofuz parents: diff changeset	120 - 13,000,000 relocations.
1d019706d866 LLVM10 anatofuz parents: diff changeset	121
1d019706d866 LLVM10 anatofuz parents: diff changeset	122 LLD produces the 2 GB executable in 15 seconds.
1d019706d866 LLVM10 anatofuz parents: diff changeset	123
1d019706d866 LLVM10 anatofuz parents: diff changeset	124 These numbers vary depending on your program, but in general,
1d019706d866 LLVM10 anatofuz parents: diff changeset	125 you have a lot of relocations and symbols for each file.
1d019706d866 LLVM10 anatofuz parents: diff changeset	126 If your program is written in C++, symbol names are likely to be
1d019706d866 LLVM10 anatofuz parents: diff changeset	127 pretty long because of name mangling.
1d019706d866 LLVM10 anatofuz parents: diff changeset	128
1d019706d866 LLVM10 anatofuz parents: diff changeset	129 It is important to not waste time on relocations and symbols.
1d019706d866 LLVM10 anatofuz parents: diff changeset	130
1d019706d866 LLVM10 anatofuz parents: diff changeset	131 In the above case, the total amount of symbol strings is 450 MB,
1d019706d866 LLVM10 anatofuz parents: diff changeset	132 and inserting all of them to a hash table takes 1.5 seconds.
1d019706d866 LLVM10 anatofuz parents: diff changeset	133 Therefore, if you casually add a hash table lookup for each symbol,
1d019706d866 LLVM10 anatofuz parents: diff changeset	134 it would slow down the linker by 10%. So, don't do that.
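
The ratios behind these figures are easy to work out. The sketch below only redoes the arithmetic on the numbers quoted above; the function names are made up for the example, and it assumes 1 MB means 10^6 bytes.

```cpp
#include <cassert>

// Figures quoted in the text for the Chrome link.
const double kFiles = 17000;
const double kSymbols = 6300000;
const double kRelocations = 13000000;
const double kSymbolStringBytes = 450e6; // assumes 1 MB = 10^6 bytes
const double kTotalLinkSeconds = 15.0;
const double kHashInsertSeconds = 1.5;

// Divides two measurements; the divisor must be positive.
double ratio(double part, double whole) {
  assert(whole > 0);
  return part / whole;
}

double avgSymbolNameBytes() { return ratio(kSymbolStringBytes, kSymbols); }
double symbolsPerFile() { return ratio(kSymbols, kFiles); }
double relocationsPerFile() { return ratio(kRelocations, kFiles); }
double extraLookupOverhead() {
  return ratio(kHashInsertSeconds, kTotalLinkSeconds);
}
```

Under that assumption an average symbol name is about 71 bytes long, each file carries several hundred symbols and relocations, and one extra pass of hash insertions costs 1.5 s out of 15 s, which is where the 10% figure comes from.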
1d019706d866 LLVM10 anatofuz parents: diff changeset	135
1d019706d866 LLVM10 anatofuz parents: diff changeset	136 On the other hand, you don't have to pursue efficiency
1d019706d866 LLVM10 anatofuz parents: diff changeset	137 when handling files.
1d019706d866 LLVM10 anatofuz parents: diff changeset	138
1d019706d866 LLVM10 anatofuz parents: diff changeset	139 Important Data Structures
1d019706d866 LLVM10 anatofuz parents: diff changeset	140 -------------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	141
1d019706d866 LLVM10 anatofuz parents: diff changeset	142 We will describe the key data structures in LLD in this section. The linker can
1d019706d866 LLVM10 anatofuz parents: diff changeset	143 be understood as the interactions between them. Once you understand their
1d019706d866 LLVM10 anatofuz parents: diff changeset	144 functions, the code of the linker should look obvious to you.
1d019706d866 LLVM10 anatofuz parents: diff changeset	145
1d019706d866 LLVM10 anatofuz parents: diff changeset	146 * Symbol
1d019706d866 LLVM10 anatofuz parents: diff changeset	147
1d019706d866 LLVM10 anatofuz parents: diff changeset	148 This class represents a symbol.
1d019706d866 LLVM10 anatofuz parents: diff changeset	149 Symbols are created for symbols in object files or archive files.
1d019706d866 LLVM10 anatofuz parents: diff changeset	150 The linker creates linker-defined symbols as well.
1d019706d866 LLVM10 anatofuz parents: diff changeset	151
1d019706d866 LLVM10 anatofuz parents: diff changeset	152 There are basically three types of Symbols: Defined, Undefined, or Lazy.
1d019706d866 LLVM10 anatofuz parents: diff changeset	153
1d019706d866 LLVM10 anatofuz parents: diff changeset	154 - Defined symbols are for all symbols that are considered as "resolved",
1d019706d866 LLVM10 anatofuz parents: diff changeset	155 including real defined symbols, COMDAT symbols, common symbols,
1d019706d866 LLVM10 anatofuz parents: diff changeset	156 absolute symbols, linker-created symbols, etc.
1d019706d866 LLVM10 anatofuz parents: diff changeset	157 - Undefined symbols represent undefined symbols, which need to be replaced by
1d019706d866 LLVM10 anatofuz parents: diff changeset	158 Defined symbols by the resolver before the link is complete.
1d019706d866 LLVM10 anatofuz parents: diff changeset	159 - Lazy symbols represent symbols we found in archive file headers
1d019706d866 LLVM10 anatofuz parents: diff changeset	160 which can turn into Defined if we read archive members.
1d019706d866 LLVM10 anatofuz parents: diff changeset	161
1d019706d866 LLVM10 anatofuz parents: diff changeset	162 There's only one Symbol instance for each unique symbol name. This uniqueness
1d019706d866 LLVM10 anatofuz parents: diff changeset	163 is guaranteed by the symbol table. As the resolver reads symbols from input
1d019706d866 LLVM10 anatofuz parents: diff changeset	164 files, it replaces an existing Symbol with the "best" Symbol for its symbol
1d019706d866 LLVM10 anatofuz parents: diff changeset	165 name using the placement new.
1d019706d866 LLVM10 anatofuz parents: diff changeset	166
1d019706d866 LLVM10 anatofuz parents: diff changeset	167 The above mechanism allows you to use pointers to Symbols as a very cheap way
1d019706d866 LLVM10 anatofuz parents: diff changeset	168 to access name resolution results. Assume for example that you have a pointer
1d019706d866 LLVM10 anatofuz parents: diff changeset	169 to an undefined symbol before name resolution. If the symbol is resolved to a
1d019706d866 LLVM10 anatofuz parents: diff changeset	170 defined symbol by the resolver, the pointer will "automatically" point to the
1d019706d866 LLVM10 anatofuz parents: diff changeset	171 defined symbol, because the undefined symbol the pointer pointed to will have
1d019706d866 LLVM10 anatofuz parents: diff changeset	172 been replaced by the defined symbol in-place.
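
The in-place replacement can be illustrated with a deliberately simplified sketch. It is an assumption-laden model rather than LLD's code: the single ``Symbol`` struct with a ``kind`` tag and the ``replaceSymbol`` helper are hypothetical, and one type is used for all kinds so that reusing the storage stays within well-defined C++.

```cpp
#include <cassert>
#include <new>

// Hypothetical, simplified symbol; LLD's real Symbol hierarchy is richer.
struct Symbol {
  enum Kind { Undefined, Lazy, Defined };
  Kind kind;
  const char *name;
  unsigned long long value; // meaningful only when kind == Defined
};

// Overwrites *sym with a new Symbol constructed in the same storage.
// Because the address and the type are unchanged, every Symbol* that was
// handed out earlier now refers to the new object.
void replaceSymbol(Symbol *sym, Symbol::Kind kind, unsigned long long value) {
  assert(sym != nullptr);
  const char *name = sym->name; // copy before the old object is destroyed
  sym->~Symbol();
  new (sym) Symbol{kind, name, value};
}
```

A caller that saved a pointer to the undefined ``foo`` before resolution sees ``kind == Defined`` through the same pointer afterwards, without any second lookup by name.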
1d019706d866 LLVM10 anatofuz parents: diff changeset	173
1d019706d866 LLVM10 anatofuz parents: diff changeset	174 * SymbolTable
1d019706d866 LLVM10 anatofuz parents: diff changeset	175
1d019706d866 LLVM10 anatofuz parents: diff changeset	176 SymbolTable is basically a hash table from strings to Symbols
1d019706d866 LLVM10 anatofuz parents: diff changeset	177 with logic to resolve symbol conflicts. It resolves conflicts by symbol type.
1d019706d866 LLVM10 anatofuz parents: diff changeset	178
1d019706d866 LLVM10 anatofuz parents: diff changeset	179 - If we add Defined and Undefined symbols, the symbol table will keep the
1d019706d866 LLVM10 anatofuz parents: diff changeset	180 former.
1d019706d866 LLVM10 anatofuz parents: diff changeset	181 - If we add Defined and Lazy symbols, it will keep the former.
1d019706d866 LLVM10 anatofuz parents: diff changeset	182 - If we add Lazy and Undefined, it will keep the former,
1d019706d866 LLVM10 anatofuz parents: diff changeset	183 but it will also trigger the Lazy symbol to load the archive member
1d019706d866 LLVM10 anatofuz parents: diff changeset	184 to actually resolve the symbol.
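
The three rules above can be written as a small pure function. This is a sketch: ``Kind``, ``Resolution`` and ``resolve`` are hypothetical names, and the case of two Defined symbols (a duplicate definition), which the rules above do not cover, is simply treated as "keep Defined".

```cpp
#include <cassert>

enum class Kind { Undefined, Lazy, Defined };

struct Resolution {
  Kind kept;              // the kind that stays in the symbol table
  bool loadArchiveMember; // true if the Lazy symbol's member must be read
};

// Decides what survives when a symbol of kind `incoming` is added for a
// name that already maps to a symbol of kind `existing`. The result does
// not depend on which of the two was seen first.
Resolution resolve(Kind existing, Kind incoming) {
  auto has = [&](Kind k) { return existing == k || incoming == k; };
  if (has(Kind::Defined))
    return {Kind::Defined, false};
  if (has(Kind::Lazy))
    return {Kind::Lazy, has(Kind::Undefined)};
  assert(existing == Kind::Undefined && incoming == Kind::Undefined);
  return {Kind::Undefined, false};
}
```

Loading the archive member in the Lazy plus Undefined case will normally add a Defined symbol for the same name, which then wins by the first rule.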
1d019706d866 LLVM10 anatofuz parents: diff changeset	185
1d019706d866 LLVM10 anatofuz parents: diff changeset	186 * Chunk (COFF specific)
1d019706d866 LLVM10 anatofuz parents: diff changeset	187
1d019706d866 LLVM10 anatofuz parents: diff changeset	188 Chunk represents a chunk of data that will occupy space in an output.
1d019706d866 LLVM10 anatofuz parents: diff changeset	189 Each regular section becomes a chunk.
1d019706d866 LLVM10 anatofuz parents: diff changeset	190 Chunks created for common or BSS symbols are not backed by sections.
1d019706d866 LLVM10 anatofuz parents: diff changeset	191 The linker may create chunks to append additional data to an output as well.
1d019706d866 LLVM10 anatofuz parents: diff changeset	192
1d019706d866 LLVM10 anatofuz parents: diff changeset	193 Chunks know about their size, how to copy their data to mmap'ed outputs,
1d019706d866 LLVM10 anatofuz parents: diff changeset	194 and how to apply relocations to them.
1d019706d866 LLVM10 anatofuz parents: diff changeset	195 Specifically, section-based chunks know how to read relocation tables
1d019706d866 LLVM10 anatofuz parents: diff changeset	196 and how to apply them.
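
A possible shape for such a class hierarchy is sketched below. It is not the actual LLD interface: the member names, the ``Reloc`` record and the fixed 32-bit relocation width are assumptions chosen to keep the example short.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <utility>
#include <vector>

// Hypothetical interface modeled on the description above.
class Chunk {
public:
  virtual ~Chunk() = default;
  virtual std::size_t getSize() const = 0;
  // Copies this chunk's bytes to outputBase + fileOffset and applies
  // any relocations to the copy.
  virtual void writeTo(std::uint8_t *outputBase) const = 0;
  std::size_t fileOffset = 0; // assigned later by the writer
};

// Hypothetical relocation: store the 32-bit `value` at `offset` in the chunk.
struct Reloc {
  std::size_t offset;
  std::uint32_t value;
};

// Chunk backed by the contents of an input section.
class SectionChunk : public Chunk {
public:
  SectionChunk(std::vector<std::uint8_t> data, std::vector<Reloc> relocs)
      : data(std::move(data)), relocs(std::move(relocs)) {}
  std::size_t getSize() const override { return data.size(); }
  void writeTo(std::uint8_t *outputBase) const override {
    std::uint8_t *dst = outputBase + fileOffset;
    std::copy(data.begin(), data.end(), dst);
    for (const Reloc &r : relocs) {
      assert(r.offset + sizeof(r.value) <= data.size());
      std::memcpy(dst + r.offset, &r.value, sizeof(r.value));
    }
  }

private:
  std::vector<std::uint8_t> data;
  std::vector<Reloc> relocs;
};

// Zero-filled chunk that is not backed by any input section.
class CommonChunk : public Chunk {
public:
  explicit CommonChunk(std::size_t size) : size(size) {}
  std::size_t getSize() const override { return size; }
  void writeTo(std::uint8_t *outputBase) const override {
    std::memset(outputBase + fileOffset, 0, size);
  }

private:
  std::size_t size;
};
```

The writer only needs the ``Chunk`` interface: it asks each chunk for its size, assigns ``fileOffset``, and then calls ``writeTo`` with the start of the output buffer.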
1d019706d866 LLVM10 anatofuz parents: diff changeset	197
1d019706d866 LLVM10 anatofuz parents: diff changeset	198 * InputSection (ELF specific)
1d019706d866 LLVM10 anatofuz parents: diff changeset	199
1d019706d866 LLVM10 anatofuz parents: diff changeset	200 Since we have less synthesized data for ELF, we don't abstract slices of
1d019706d866 LLVM10 anatofuz parents: diff changeset	201 input files as Chunks for ELF. Instead, we directly use the input section
1d019706d866 LLVM10 anatofuz parents: diff changeset	202 as an internal data type.
1d019706d866 LLVM10 anatofuz parents: diff changeset	203
1d019706d866 LLVM10 anatofuz parents: diff changeset	204 InputSections know their size and how to copy themselves to
1d019706d866 LLVM10 anatofuz parents: diff changeset	205 mmap'ed outputs, just like COFF Chunks.
1d019706d866 LLVM10 anatofuz parents: diff changeset	206
1d019706d866 LLVM10 anatofuz parents: diff changeset	207 * OutputSection
1d019706d866 LLVM10 anatofuz parents: diff changeset	208
1d019706d866 LLVM10 anatofuz parents: diff changeset	209 OutputSection is a container of InputSections (ELF) or Chunks (COFF).
1d019706d866 LLVM10 anatofuz parents: diff changeset	210 An InputSection or Chunk belongs to at most one OutputSection.
1d019706d866 LLVM10 anatofuz parents: diff changeset	211
1d019706d866 LLVM10 anatofuz parents: diff changeset	212 There are mainly three actors in this linker.
1d019706d866 LLVM10 anatofuz parents: diff changeset	213
1d019706d866 LLVM10 anatofuz parents: diff changeset	214 * InputFile
1d019706d866 LLVM10 anatofuz parents: diff changeset	215
1d019706d866 LLVM10 anatofuz parents: diff changeset	216 InputFile is a superclass of file readers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	217 We have a different subclass for each input file type,
1d019706d866 LLVM10 anatofuz parents: diff changeset	218 such as regular object file, archive file, etc.
1d019706d866 LLVM10 anatofuz parents: diff changeset	219 They are responsible for creating and owning Symbols and InputSections/Chunks.
1d019706d866 LLVM10 anatofuz parents: diff changeset	220
1d019706d866 LLVM10 anatofuz parents: diff changeset	221 * Writer
1d019706d866 LLVM10 anatofuz parents: diff changeset	222
1d019706d866 LLVM10 anatofuz parents: diff changeset	223 The writer is responsible for writing file headers and InputSections/Chunks to
1d019706d866 LLVM10 anatofuz parents: diff changeset	224 a file. It creates OutputSections, puts all InputSections/Chunks into them,
1d019706d866 LLVM10 anatofuz parents: diff changeset	225 assigns unique, non-overlapping addresses and file offsets to them, and then
1d019706d866 LLVM10 anatofuz parents: diff changeset	226 writes them down to a file.
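
The address assignment step can be sketched as a single pass over the output sections. The code below is a simplified model, not the writer's real logic: all names are hypothetical, and it assumes that memory and file layout use the same section alignment and that chunk alignments are powers of two no larger than that alignment.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical, simplified stand-ins for InputSection/Chunk and OutputSection.
struct Chunk {
  std::uint64_t size;
  std::uint64_t align;
  std::uint64_t address;
  std::uint64_t fileOffset;
};

struct OutputSection {
  std::vector<Chunk *> chunks;
  std::uint64_t address;
  std::uint64_t fileOffset;
  std::uint64_t size;
};

// Rounds v up to the next multiple of align (a power of two).
std::uint64_t alignTo(std::uint64_t v, std::uint64_t align) {
  assert(align != 0 && (align & (align - 1)) == 0);
  return (v + align - 1) & ~(align - 1);
}

// Gives every section and chunk a unique, non-overlapping address and
// file offset, placing the first section after the file headers.
void assignAddresses(std::vector<OutputSection> &sections,
                     std::uint64_t baseAddress, std::uint64_t headerSize,
                     std::uint64_t sectionAlign) {
  std::uint64_t addr = alignTo(baseAddress + headerSize, sectionAlign);
  std::uint64_t off = alignTo(headerSize, sectionAlign);
  for (OutputSection &sec : sections) {
    sec.address = addr;
    sec.fileOffset = off;
    std::uint64_t pos = 0; // running offset inside this section
    for (Chunk *c : sec.chunks) {
      pos = alignTo(pos, c->align);
      c->address = sec.address + pos;
      c->fileOffset = sec.fileOffset + pos;
      pos += c->size;
    }
    sec.size = pos;
    addr = alignTo(addr + pos, sectionAlign);
    off = alignTo(off + pos, sectionAlign);
  }
}
```

Once this pass has run, writing the file reduces to copying each chunk to its ``fileOffset`` in the output buffer.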
1d019706d866 LLVM10 anatofuz parents: diff changeset	227
1d019706d866 LLVM10 anatofuz parents: diff changeset	228 * Driver
1d019706d866 LLVM10 anatofuz parents: diff changeset	229
1d019706d866 LLVM10 anatofuz parents: diff changeset	230 The linking process is driven by the driver. The driver:
1d019706d866 LLVM10 anatofuz parents: diff changeset	231
1d019706d866 LLVM10 anatofuz parents: diff changeset	232 - processes command line options,
1d019706d866 LLVM10 anatofuz parents: diff changeset	233 - creates a symbol table,
1d019706d866 LLVM10 anatofuz parents: diff changeset	234 - creates an InputFile for each input file and puts all symbols within into
1d019706d866 LLVM10 anatofuz parents: diff changeset	235 the symbol table,
1d019706d866 LLVM10 anatofuz parents: diff changeset	236 - checks that no undefined symbols remain,
1d019706d866 LLVM10 anatofuz parents: diff changeset	237 - creates a writer,
1d019706d866 LLVM10 anatofuz parents: diff changeset	238 - and passes the symbol table to the writer to write the result to a file.
1d019706d866 LLVM10 anatofuz parents: diff changeset	239
1d019706d866 LLVM10 anatofuz parents: diff changeset	240 Link-Time Optimization
1d019706d866 LLVM10 anatofuz parents: diff changeset	241 ----------------------
1d019706d866 LLVM10 anatofuz parents: diff changeset	242
1d019706d866 LLVM10 anatofuz parents: diff changeset	243 LTO is implemented by handling LLVM bitcode files as object files.
1d019706d866 LLVM10 anatofuz parents: diff changeset	244 The linker resolves symbols in bitcode files normally. If all symbols
1d019706d866 LLVM10 anatofuz parents: diff changeset	245 are successfully resolved, it then runs LLVM passes
1d019706d866 LLVM10 anatofuz parents: diff changeset	246 with all bitcode files to convert them to one big regular ELF/COFF file.
1d019706d866 LLVM10 anatofuz parents: diff changeset	247 Finally, the linker replaces bitcode symbols with ELF/COFF symbols,
1d019706d866 LLVM10 anatofuz parents: diff changeset	248 so that they are linked as if they were in the native format from the beginning.
1d019706d866 LLVM10 anatofuz parents: diff changeset	249
1d019706d866 LLVM10 anatofuz parents: diff changeset	250 The details are described in this document.
1d019706d866 LLVM10 anatofuz parents: diff changeset	251 http://llvm.org/docs/LinkTimeOptimization.html
1d019706d866 LLVM10 anatofuz parents: diff changeset	252
1d019706d866 LLVM10 anatofuz parents: diff changeset	253 Glossary
1d019706d866 LLVM10 anatofuz parents: diff changeset	254 --------
1d019706d866 LLVM10 anatofuz parents: diff changeset	255
1d019706d866 LLVM10 anatofuz parents: diff changeset	256 * RVA (COFF)
1d019706d866 LLVM10 anatofuz parents: diff changeset	257
1d019706d866 LLVM10 anatofuz parents: diff changeset	258 Short for Relative Virtual Address.
1d019706d866 LLVM10 anatofuz parents: diff changeset	259
1d019706d866 LLVM10 anatofuz parents: diff changeset	260 Windows executables or DLLs are not position-independent; they are
1d019706d866 LLVM10 anatofuz parents: diff changeset	261 linked against a fixed address called an image base. RVAs are
1d019706d866 LLVM10 anatofuz parents: diff changeset	262 offsets from an image base.
1d019706d866 LLVM10 anatofuz parents: diff changeset	263
1d019706d866 LLVM10 anatofuz parents: diff changeset	264 Default image bases are 0x140000000 for executables and 0x180000000
1d019706d866 LLVM10 anatofuz parents: diff changeset	265 for DLLs. For example, when we are creating an executable, we assume
1d019706d866 LLVM10 anatofuz parents: diff changeset	266 that the executable will be loaded at address 0x140000000 by the
1d019706d866 LLVM10 anatofuz parents: diff changeset	267 loader, so we apply relocations accordingly. The resulting text and data
1d019706d866 LLVM10 anatofuz parents: diff changeset	268 will contain raw absolute addresses.
1d019706d866 LLVM10 anatofuz parents: diff changeset	269
1d019706d866 LLVM10 anatofuz parents: diff changeset	270 * VA
1d019706d866 LLVM10 anatofuz parents: diff changeset	271
1d019706d866 LLVM10 anatofuz parents: diff changeset	272 Short for Virtual Address. For COFF, it is equivalent to RVA + image base.
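
The relation between the two is a single addition. The helpers below are a small sketch with made-up names; they use the default executable image base mentioned above and rely on RVAs being 32-bit values in PE/COFF.

```cpp
#include <cassert>
#include <cstdint>

// Default image base for executables, as quoted in the text.
const std::uint64_t kDefaultExeImageBase = 0x140000000ULL;

// VA = image base + RVA.
std::uint64_t rvaToVA(std::uint64_t imageBase, std::uint32_t rva) {
  return imageBase + rva;
}

// RVA = VA - image base; the result must fit in 32 bits.
std::uint32_t vaToRVA(std::uint64_t imageBase, std::uint64_t va) {
  assert(va >= imageBase && va - imageBase <= UINT32_MAX);
  return static_cast<std::uint32_t>(va - imageBase);
}
```

For example, an RVA of 0x1000 in an executable linked at the default base corresponds to the VA 0x140001000.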
1d019706d866 LLVM10 anatofuz parents: diff changeset	273
1d019706d866 LLVM10 anatofuz parents: diff changeset	274 * Base relocations (COFF)
1d019706d866 LLVM10 anatofuz parents: diff changeset	275
1d019706d866 LLVM10 anatofuz parents: diff changeset	276 Relocation information for the loader. If the loader decides to map
1d019706d866 LLVM10 anatofuz parents: diff changeset	277 an executable or a DLL to a different address than their image
1d019706d866 LLVM10 anatofuz parents: diff changeset	278 bases, it fixes up binaries using information contained in the base
1d019706d866 LLVM10 anatofuz parents: diff changeset	279 relocation table. A base relocation table consists of a list of
1d019706d866 LLVM10 anatofuz parents: diff changeset	280 locations containing addresses. The loader adds the difference between
1d019706d866 LLVM10 anatofuz parents: diff changeset	281 the actual load address and the image base to all locations listed there.
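
What the loader does with such a table can be sketched as follows. The example is a simplification with hypothetical names: it treats the table as a flat list of RVAs of 64-bit address slots, whereas the real PE base relocation table groups entries into per-page blocks with a type field, and it takes the adjustment to be the difference between the actual load address and the preferred image base.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <vector>

// Adjusts every listed 64-bit address slot in `image` for a load at
// `actualBase` instead of `preferredBase`.
void applyBaseRelocs(std::vector<std::uint8_t> &image,
                     const std::vector<std::uint32_t> &slotRVAs,
                     std::uint64_t preferredBase, std::uint64_t actualBase) {
  // Unsigned wrap-around makes this correct for lower load addresses too.
  const std::uint64_t delta = actualBase - preferredBase;
  for (std::uint32_t rva : slotRVAs) {
    assert(rva + sizeof(std::uint64_t) <= image.size());
    std::uint64_t value;
    std::memcpy(&value, image.data() + rva, sizeof(value));
    value += delta;
    std::memcpy(image.data() + rva, &value, sizeof(value));
  }
}
```

Only the listed slots change; every other byte of the image is left alone, which is why the whole image can be moved with one pass over the table.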
1d019706d866 LLVM10 anatofuz parents: diff changeset	282
1d019706d866 LLVM10 anatofuz parents: diff changeset	283 Note that this run-time relocation mechanism is much simpler than ELF.
1d019706d866 LLVM10 anatofuz parents: diff changeset	284 There's no PLT or GOT. Images are relocated as a whole just
1d019706d866 LLVM10 anatofuz parents: diff changeset	285 by shifting entire images in memory by some offsets. Although doing
1d019706d866 LLVM10 anatofuz parents: diff changeset	286 this breaks text sharing, I think this mechanism is not actually bad
1d019706d866 LLVM10 anatofuz parents: diff changeset	287 on today's computers.
1d019706d866 LLVM10 anatofuz parents: diff changeset	288
1d019706d866 LLVM10 anatofuz parents: diff changeset	289 * ICF
1d019706d866 LLVM10 anatofuz parents: diff changeset	290
1d019706d866 LLVM10 anatofuz parents: diff changeset	291 Short for Identical COMDAT Folding (COFF) or Identical Code Folding (ELF).
1d019706d866 LLVM10 anatofuz parents: diff changeset	292
1d019706d866 LLVM10 anatofuz parents: diff changeset	293 ICF is an optimization to reduce output size by merging read-only sections
1d019706d866 LLVM10 anatofuz parents: diff changeset	294 not only by their names but also by their contents. If two read-only sections
1d019706d866 LLVM10 anatofuz parents: diff changeset	295 happen to have the same metadata, actual contents and relocations,
1d019706d866 LLVM10 anatofuz parents: diff changeset	296 they are merged by ICF. It is known as an effective technique,
1d019706d866 LLVM10 anatofuz parents: diff changeset	297 and it usually reduces the size of C++ programs by a few percent or more.
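
The core of the idea, grouping sections by metadata, contents and relocations, can be sketched in one pass. This is not the algorithm LLD uses: the ``Section`` record and ``foldIdentical`` are hypothetical, relocation targets are treated as opaque numbers, and the sketch does not iterate to discover sections that become identical only after their targets have been folded.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <tuple>
#include <vector>

// Hypothetical, simplified section record.
struct Section {
  std::uint32_t flags;                     // stands in for section metadata
  std::vector<std::uint8_t> contents;
  std::vector<std::uint64_t> relocTargets; // opaque ids of relocation targets
  bool readOnly;
};

// For each section, returns the index of the section it is folded into
// (its own index if it is kept). Only read-only sections are candidates.
std::vector<std::size_t> foldIdentical(const std::vector<Section> &sections) {
  using Key = std::tuple<std::uint32_t, std::vector<std::uint8_t>,
                         std::vector<std::uint64_t>>;
  std::map<Key, std::size_t> firstSeen;
  std::vector<std::size_t> keptAs(sections.size());
  for (std::size_t i = 0; i < sections.size(); ++i) {
    const Section &s = sections[i];
    keptAs[i] = i;
    if (!s.readOnly)
      continue;
    // emplace leaves an existing entry untouched, so the first section
    // with a given key becomes the representative of its group.
    auto it = firstSeen.emplace(Key{s.flags, s.contents, s.relocTargets}, i);
    keptAs[i] = it.first->second;
    assert(keptAs[i] <= i);
  }
  return keptAs;
}
```

Sections that map to an earlier index can be dropped from the output, with all references redirected to the representative.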
1d019706d866 LLVM10 anatofuz parents: diff changeset	298
1d019706d866 LLVM10 anatofuz parents: diff changeset	299 Note that this is not an entirely sound optimization. C/C++ require that
1d019706d866 LLVM10 anatofuz parents: diff changeset	300 different functions have different addresses. If a program depends on
1d019706d866 LLVM10 anatofuz parents: diff changeset	301 that property, it would fail at runtime.
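
A program that depends on this property might look like the sketch below. The handler names are made up for illustration; the point is only that ``handlerName`` tells two functions apart by address even though their bodies are identical.

```cpp
#include <cassert>

// Two functions with identical bodies. A folding linker may merge their
// code so that both names end up at the same address.
int onOpen(int x) { return x + 1; }
int onClose(int x) { return x + 1; }

using Handler = int (*)(int);

// Distinguishes handlers by address, relying on the language rule that
// different functions have different addresses.
const char *handlerName(Handler h) {
  assert(h != nullptr);
  if (h == onOpen)
    return "open";
  if (h == onClose)
    return "close";
  return "unknown";
}
```

Without folding, ``handlerName(onClose)`` yields ``"close"``. If the two functions were folded into one, the first comparison would already match and the call would yield ``"open"`` instead.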
1d019706d866 LLVM10 anatofuz parents: diff changeset	302
1d019706d866 LLVM10 anatofuz parents: diff changeset	303 On Windows, that's not really an issue because MSVC link.exe enables
1d019706d866 LLVM10 anatofuz parents: diff changeset	304 the optimization by default. As long as your program works
1d019706d866 LLVM10 anatofuz parents: diff changeset	305 with the linker's default settings, your program should be safe with ICF.
1d019706d866 LLVM10 anatofuz parents: diff changeset	306
1d019706d866 LLVM10 anatofuz parents: diff changeset	307 On Unix, your program is generally not guaranteed to be safe with ICF,
1d019706d866 LLVM10 anatofuz parents: diff changeset	308 although large programs happen to work correctly.
1d019706d866 LLVM10 anatofuz parents: diff changeset	309 LLD works fine with ICF for example.
1d019706d866 LLVM10 anatofuz parents: diff changeset	310
1d019706d866 LLVM10 anatofuz parents: diff changeset	311 Other Info
1d019706d866 LLVM10 anatofuz parents: diff changeset	312 ----------
1d019706d866 LLVM10 anatofuz parents: diff changeset	313
1d019706d866 LLVM10 anatofuz parents: diff changeset	314 .. toctree::
1d019706d866 LLVM10 anatofuz parents: diff changeset	315    :maxdepth: 1
1d019706d866 LLVM10 anatofuz parents: diff changeset	316
1d019706d866 LLVM10 anatofuz parents: diff changeset	317    missingkeyfunction
