登录查看更多内容

Optimizing LLVM Back-End: Global Instruction Selection

abhinav Ashok kumar

Curating Insights & Innovating in GPU Compiler | Performance Analyst at Qualcomm | LLVM Contributor | Maintain News Letter | AI/ML in Compiler

发布日期: 2024年3月31日

The LLVM compiler framework, popular for its versatility and optimization capabilities, constantly evolves to meet the demands of modern software development. One of the important phases in LLVM's backend is, instruction selection, which has seen significant advancements over the years.

This article explains Global Instruction Selection in LLVM, what problems it solves, and how it helps make compiling faster.

Introduction:

Instruction selection is a critical phase in the LLVM compiler's backend, responsible for translating LLVM-IR into lower-level Machine IR (MIR) suitable for the target architecture. Traditionally, LLVM implemented SelectionDAG and FastISel for this task, each with its strengths and limitations. However, conventional instruction selection techniques often operate within narrow scopes, focusing on individual statements or basic blocks that only achieve local code optimization. In contrast, global instruction selection, implemented in LLVM, takes a broader approach by optimizing entire functions in the form of Static Single Assignment (SSA) representation.

Instruction Selection Challenges: The traditional instruction selection methods in LLVM, namely SelectionDAG and FastISel, exhibit effectiveness alongside face performance bottlenecks and optimization constraints. Unfortunately, SelectionDAG, renowned for its power, consumes a significant portion of compilation time due to the complexity of legalizing operations and combining DAG nodes.

Conversely, FastISel prioritizes speed over optimization ability, limiting support to common data types that fit within a single machine register. FastISel does not handle 128-bit integers or structures and supports only a subset of frequently used LLVM-IR operations and intrinsics. When an unsupported LLVM-IR instruction is encountered, FastISel resorts to SelectionDAG for the remaining block. However, for function calls involving unsupported data types, parameters passed on the stack, or unimplemented intrinsics, only the impacted call instruction is processed by SelectionDAG

LLVM's Global Instruction Selection

The global instruction selection framework within LLVM offers a powerful set of tools and optimization modules to improve code generation efficiency. This framework directly translates LLVM-IR into machine instructions suitable for the target platform, bypassing the need for intermediate representations like the DAG used in SelectionDAGISel.

The primary objectives of global instruction selection are to overcome the limitations of DAG-based and fast instruction selection methods while addressing three main issues:

Compilation Performance: Traditional methods like SelectionDAGISel introduce additional overhead by using a new DAG intermediate representation, leading to slower compilation times. GlobalISel streamlines this process by directly generating target machine instructions from LLVM-IR operations, thus improving compilation speed.
Granularity Optimization: Unlike other methods that may miss out on optimization opportunities within basic blocks, GlobalISel operates on entire functions, retaining complete LLVM-IR information. This holistic approach allows for more thorough exploration of optimization possibilities across the entire codebase.
Modularity Enhancement: Existing methods like SelectionDAGISel and FastISel are standalone modules with limited code reuse and modularity. In contrast, GlobalISel is designed for code reuse, leveraging LLVM's compilation pipeline. It allows for better customization to meet the needs of different target platforms, fostering collaboration and flexibility.

领英推荐

Mastering Concurrency: Task

Evangelist Apps 1 年前

Angular v15 is available now!

Newel Technologies Pvt. Ltd. 2 年前

How to leverage Dart context-aware safety for clean…

NonStop io Technologies 6 个月前

The architecture of global instruction selection consists of four key modules:

IRTranslator: This module converts LLVM-IR into Generic Machine IR (GMIR), a machine-independent representation.
Legalizer: Responsible for ensuring compatibility with the target platform, the Legalizer converts operand types and operations to match the supported instructions.
RegisterBankSelector: Allocates register banks for operands, optimizing register usage.
InstructionSelector: Converts mixed machine instructions into target-specific machine instructions, completing the code generation process.

Despite these advancements, the quality of generated code using the basic instruction selection framework may still be suboptimal. To address this, global instruction selection offers a flexible combination of instructions that can be configured for specific target platforms, ensuring optional and cust omizable optimization.

Overall, the global instruction selection framework within LLVM represents a significant leap forward in code generation efficiency and optimization, promising better performance and adaptability for diverse target platforms.

Conclusion

The introduction of global-based instruction selection marks a significant leap forward in the LLVM compiler's capabilities. This method addresses key limitations seen in previous selection techniques. By directly supporting a wider range of data types and operations, it eliminates the constraints of fast instruction selection. Additionally, it effectively tackles the loss of global optimization opportunities that DAG-based selection methods often face, while also reducing the time it takes to compile.

Implemented on the Shenwei platform, the optimization brought about by LLVM's global instruction selection noticeably improves the efficiency of code generation processes. Though there's a slight increase in compilation time, the overall compilation cycle becomes more effective. With code quality being the same, there's a notable 8% boost in the quality of generated code.

However, it's important to note that the code quality still falls short compared to DAG-based selection methods. To address this, further exploration into how DAG-based instruction merging compares with global instruction selection is needed. By examining more examples of instruction merging, we hope to refine the compiler's performance and narrow the gap between these two methods.

In sum, global instruction selection within LLVM represents a significant step forward in optimizing code generation. While there's room for improvement, its potential to enhance both code quality and performance is clear. With ongoing research and tweaks, global instruction selection promises further advancements in compiler technology, making software development more efficient and accessible.

References:

LLVM Project, “LLVM 17 documentation: Global instruction selection,”
Research Paper Link

Michael Olea

Composer | Programmer | Geologist

8 个月

https://www.amazon.com/SSA-based-Compiler-Design-Fabrice-Rastello-ebook/dp/B0BPLHMCXM/ref=tmm_kin_swatch_0?_encoding=UTF8&dib_tag=se&dib=eyJ2IjoiMSJ9.IrWSj4XDCiSnwY-mymoMEHcBuw_UHmT9cFgI4HXD5k7GjHj071QN20LucGBJIEps.0AxVY0X6ra2OMZyZfh6jguj4OIKZBtXMkZJbpaUWtbY&qid=1720891898&sr=1-1

要查看或添加评论，请登录

abhinav Ashok kumar的更多文章

Parallel programming Evolution

2025年3月12日

Parallel programming Evolution

Parallel programming has revolutionized how we leverage modern computing power! From instruction-level parallelism…
LLVM vs. GCC: A Comprehensive Comparison

2025年3月7日

LLVM vs. GCC: A Comprehensive Comparison

When it comes to compiling C, C++, and other languages, LLVM and GCC are two of the most widely used compiler…
Exploring TVM for Beginners: A Must-Read Guide for Compiler Enthusiasts

2025年3月4日

Exploring TVM for Beginners: A Must-Read Guide for Compiler Enthusiasts

For those diving into machine learning compilers, TVM is a powerful tool that optimizes deep learning models for…
Optimizing LLVM Passes: Understanding Pass Execution Time

2025年2月25日

Optimizing LLVM Passes: Understanding Pass Execution Time

Optimizing LLVM passes is crucial for improving performance and efficiency for compiler engineers. A key aspect of this…
CPP MCQ Stack

2025年2月17日

CPP MCQ Stack

Welcome to Compiler Sutra — the place to be if you want to improve at C++ and compilers! Link :…

1 条评论
Disabling LLVM Pass

2025年2月11日

Disabling LLVM Pass

?? Disabling an LLVM Pass for Custom Compiler Modifications ?? LLVM is at the core of many modern compilers, and its…

1 条评论
How LLVM Solve Traditional Compiler Problem m*n

2025年1月27日

How LLVM Solve Traditional Compiler Problem m*n

LLVM (Low-Level Virtual Machine) is a compiler framework that helps compiler developers to transform and build…
Pass In LLVM To Count the Number of Instructions in It

2024年12月22日

Pass In LLVM To Count the Number of Instructions in It

You can read the full tutorial here: Read the Full Tutorial This tutorial explores FunctionCount.cpp, a practical…
Unlocking C++11 part 2

2024年11月10日

Unlocking C++11 part 2

Hello, Tech Enthusiasts Here is the link for the Unlocking C++11 Part 1 The C++11 standard has transformed how we write…

1 条评论
Unlocking C++11

2024年11月3日

Unlocking C++11

Hello, Tech Enthusiasts! The C++11 standard has transformed how we write C++ code by introducing new features to make…

See all articles

Optimizing LLVM Back-End: Global Instruction Selection

abhinav Ashok kumar

Curating Insights & Innovating in GPU Compiler | Performance Analyst at Qualcomm | LLVM Contributor | Maintain News Letter | AI/ML in Compiler

Introduction:

LLVM's Global Instruction Selection

领英推荐

Conclusion

abhinav Ashok kumar的更多文章

社区洞察

其他会员也浏览了

Code Blocks

Under the Hood: Exploring the Inner Workings of Jetpack Compose

Exploring and Innovating with the GNU Compiler on AArch64: Part 2 (Nov 3, 2024)

Compiler Magic and Memory Mischief

Understanding the Go Concurrency Model

React has some new updates: My Concise Overview of React 19's Updates—What Every Developer Ought to Know

Understanding Qualifiers in C++: A Complete Guide

Exploring and Innovating with the GNU Compiler on AArch64: Part 3.1 Dec 5, 2024

Mastering Solidity Pragma: 6 Ways to Specify Compiler Versions

When to Use volatile?

Introduction:

LLVM's Global Instruction Selection

领英推荐

Conclusion

abhinav Ashok kumar的更多文章

Parallel programming Evolution

LLVM vs. GCC: A Comprehensive Comparison

Exploring TVM for Beginners: A Must-Read Guide for Compiler Enthusiasts

Optimizing LLVM Passes: Understanding Pass Execution Time

CPP MCQ Stack

Disabling LLVM Pass

How LLVM Solve Traditional Compiler Problem m*n

Pass In LLVM To Count the Number of Instructions in It

Unlocking C++11 part 2

Unlocking C++11

社区洞察

其他会员也浏览了

Code Blocks

Under the Hood: Exploring the Inner Workings of Jetpack Compose

Exploring and Innovating with the GNU Compiler on AArch64: Part 2 (Nov 3, 2024)

Compiler Magic and Memory Mischief

Understanding the Go Concurrency Model

React has some new updates: My Concise Overview of React 19's Updates—What Every Developer Ought to Know

Understanding Qualifiers in C++: A Complete Guide

Exploring and Innovating with the GNU Compiler on AArch64: Part 3.1 Dec 5, 2024

Mastering Solidity Pragma: 6 Ways to Specify Compiler Versions

When to Use volatile?