Passes represent the basic infrastructure for transformation and optimization. This document provides an overview of the pass infrastructure in MLIR and how to use it.
See MLIR specification for more information about MLIR and its core aspects, such as the IR structure and operations.
See MLIR Rewrites for a quick start on graph rewriting in MLIR. If a transformation involves pattern matching operation DAGs, this is a great place to start.
In MLIR, the main unit of abstraction and transformation is an operation. As such, the pass manager is designed to work on instances of operations at different levels of nesting. In the following paragraphs, we refer to the operation that a pass operates on as the “current operation”.
The structure of the pass manager, and the concept of nesting, is detailed further below. All passes in MLIR derive from OperationPass
and adhere to the following restrictions; any noncompliance will lead to problematic behavior in multithreaded and other advanced scenarios:
runOnOperation
. A pass may be run on many different operations with no guarantee of execution order.By default, an operation pass is op-agnostic
, meaning that it operates on the operation type of the pass manager that it is added to. This means a pass may operate on many different types of operations. Agnostic passes should be written such that they do not make assumptions on the operation they run on. Examples of this type of pass are Canonicalization and Common Sub-Expression Elimination.
To create an agnostic operation pass, a derived class must adhere to the following:
OperationPass
.void runOnOperation()
method.A simple pass may look like:
/// Here we utilize the CRTP `PassWrapper` utility class to provide some /// necessary utility hooks. This is only necessary for passes defined directly /// in C++. Passes defined declaratively use a cleaner mechanism for providing /// these utilities. struct MyOperationPass : public PassWrapper<MyOperationPass, OperationPass<>> { void runOnOperation() override { // Get the current operation being operated on. Operation *op = getOperation(); ... } };
If a pass needs to constrain its execution to specific types or classes of operations, additional filtering may be applied on top. This transforms a once agnostic
pass into one more specific to a certain context. There are various ways in which to filter the execution of a pass, and different contexts in which filtering may apply:
Static filtering allows for applying additional constraints on the operation types a pass may be scheduled on. This type of filtering generally allows for building more constrained passes that can only be scheduled on operations that satisfy the necessary constraints. For example, this allows for specifying passes that only run on operations of a certain, those that provide a certain interface, trait, or some other constraint that applies to all instances of that operation type. Below is an example of a pass that only permits scheduling on operations that implement FunctionOpInterface
:
struct MyFunctionPass : ... { /// This method is used to provide additional static filtering, and returns if the /// pass may be scheduled on the given operation type. bool canScheduleOn(RegisteredOperationName opInfo) const override { return opInfo.hasInterface<FunctionOpInterface>(); } void runOnOperation() { // Here we can freely cast to FunctionOpInterface, because our `canScheduleOn` ensures // that our pass is only executed on operations implementing that interface. FunctionOpInterface op = cast<FunctionOpInterface>(getOperation()); } };
When a pass with static filtering is added to an op-specific
pass manager, it asserts that the operation type of the pass manager satisfies the static constraints of the pass. When added to an op-agnostic
pass manager, that pass manager, and all passes contained within, inherits the static constraints of the pass. For example, if the pass filters on FunctionOpInterface
, as in the MyFunctionPass
example above, only operations that implement FunctionOpInterface
will be considered when executing any passes within the pass manager. This invariant is important to keep in mind, as each pass added to an op-agnostic
pass manager further constrains the operations that may be scheduled on it. Consider the following example:
func.func @foo() { // ... return } module @someModule { // ... }
If we were to apply the op-agnostic pipeline, any(cse,my-function-pass)
, to the above MLIR snippet it would only run on the foo
function operation. This is because the my-function-pass
has a static filtering constraint to only schedule on operations implementing FunctionOpInterface
. Remember that this constraint is inherited by the entire pass manager, so we never consider someModule
for any of the passes, including cse
which normally can be scheduled on any operation.
In the above section, we detailed a general mechanism for statically filtering the types of operations that a pass may be scheduled on. Sugar is provided on top of that mechanism to simplify the definition of passes that are restricted to scheduling on a single operation type. In these cases, a pass simply needs to provide the type of operation to the OperationPass
base class. This will automatically instill filtering on that operation type:
/// Here we utilize the CRTP `PassWrapper` utility class to provide some /// necessary utility hooks. This is only necessary for passes defined directly /// in C++. Passes defined declaratively use a cleaner mechanism for providing /// these utilities. struct MyFunctionPass : public PassWrapper<MyOperationPass, OperationPass<func::FuncOp>> { void runOnOperation() { // Get the current operation being operated on. func::FuncOp op = getOperation(); } };
In the above section, we detailed a general mechanism for statically filtering the types of operations that a pass may be scheduled on. Sugar is provided on top of that mechanism to simplify the definition of passes that are restricted to scheduling on a specific operation interface. In these cases, a pass simply needs to inherit from the InterfacePass
base class. This class is similar to OperationPass
, but expects the type of interface to operate on. This will automatically instill filtering on that interface type:
/// Here we utilize the CRTP `PassWrapper` utility class to provide some /// necessary utility hooks. This is only necessary for passes defined directly /// in C++. Passes defined declaratively use a cleaner mechanism for providing /// these utilities. struct MyFunctionPass : public PassWrapper<MyOperationPass, InterfacePass<FunctionOpInterface>> { void runOnOperation() { // Get the current operation being operated on. FunctionOpInterface op = getOperation(); } };
Dialects must be loaded in the MLIRContext before entities from these dialects (operations, types, attributes, ...) can be created. Dialects must also be loaded before starting the execution of a multi-threaded pass pipeline. To this end, a pass that may create an entity from a dialect that isn't guaranteed to already be loaded must express this by overriding the getDependentDialects()
method and declare this list of Dialects explicitly. See also the dependentDialects
field in the TableGen Specification.
In certain situations, a Pass may contain state that is constructed dynamically, but is potentially expensive to recompute in successive runs of the Pass. One such example is when using PDL
-based patterns, which are compiled into a bytecode during runtime. In these situations, a pass may override the following hook to initialize this heavy state:
LogicalResult initialize(MLIRContext *context)
This hook is executed once per run of a full pass pipeline, meaning that it does not have access to the state available during a runOnOperation
call. More concretely, all necessary accesses to an MLIRContext
should be driven via the provided context
parameter, and methods that utilize “per-run” state such as getContext
/getOperation
/getAnalysis
/etc. must not be used. In case of an error during initialization, the pass is expected to emit an error diagnostic and return a failure()
which will abort the pass pipeline execution.
An important concept, along with transformation passes, are analyses. These are conceptually similar to transformation passes, except that they compute information on a specific operation without modifying it. In MLIR, analyses are not passes but free-standing classes that are computed lazily on-demand and cached to avoid unnecessary recomputation. An analysis in MLIR must adhere to the following:
Operation*
or Operation*
and AnalysisManager &
.AnalysisManager &
should be used to query any necessary analysis dependencies.An analysis may provide additional hooks to control various behavior:
bool isInvalidated(const AnalysisManager::PreservedAnalyses &)
Given a preserved analysis set, the analysis returns true if it should truly be invalidated. This allows for more fine-tuned invalidation in cases where an analysis wasn't explicitly marked preserved, but may be preserved (or invalidated) based upon other properties such as analyses sets. If the analysis uses any other analysis as a dependency, it must also check if the dependency was invalidated.
The base OperationPass
class provides utilities for querying and preserving analyses for the current operation being processed.
getAnalysis<>
getCachedAnalysis<>
getCachedParentAnalysis<>
getCachedChildAnalysis<>
getChildAnalysis<>
Using the example passes defined above, let's see some examples:
/// An interesting analysis. struct MyOperationAnalysis { // Compute this analysis with the provided operation. MyOperationAnalysis(Operation *op); }; struct MyOperationAnalysisWithDependency { MyOperationAnalysisWithDependency(Operation *op, AnalysisManager &am) { // Request other analysis as dependency MyOperationAnalysis &otherAnalysis = am.getAnalysis<MyOperationAnalysis>(); ... } bool isInvalidated(const AnalysisManager::PreservedAnalyses &pa) { // Check if analysis or its dependency were invalidated return !pa.isPreserved<MyOperationAnalysisWithDependency>() || !pa.isPreserved<MyOperationAnalysis>(); } }; void MyOperationPass::runOnOperation() { // Query MyOperationAnalysis for the current operation. MyOperationAnalysis &myAnalysis = getAnalysis<MyOperationAnalysis>(); // Query a cached instance of MyOperationAnalysis for the current operation. // It will not be computed if it doesn't exist. auto optionalAnalysis = getCachedAnalysis<MyOperationAnalysis>(); if (optionalAnalysis) ... // Query a cached instance of MyOperationAnalysis for the parent operation of // the current operation. It will not be computed if it doesn't exist. auto optionalAnalysis = getCachedParentAnalysis<MyOperationAnalysis>(); if (optionalAnalysis) ... }
Analyses that are constructed after being queried by a pass are cached to avoid unnecessary computation if they are requested again later. To avoid stale analyses, all analyses are assumed to be invalidated by a pass. To avoid invalidation, a pass must specifically mark analyses that are known to be preserved.
markAllAnalysesPreserved
markAnalysesPreserved<>
void MyOperationPass::runOnOperation() { // Mark all analyses as preserved. This is useful if a pass can guarantee // that no transformation was performed. markAllAnalysesPreserved(); // Mark specific analyses as preserved. This is used if some transformation // was performed, but some analyses were either unaffected or explicitly // preserved. markAnalysesPreserved<MyAnalysis, MyAnalyses...>(); }
Passes in MLIR are allowed to gracefully fail. This may happen if some invariant of the pass was broken, potentially leaving the IR in some invalid state. If such a situation occurs, the pass can directly signal a failure to the pass manager via the signalPassFailure
method. If a pass signaled a failure when executing, no other passes in the pipeline will execute and the top-level call to PassManager::run
will return failure
.
void MyOperationPass::runOnOperation() { // Signal failure on a broken invariant. if (some_broken_invariant) return signalPassFailure(); }
The above sections introduced the different types of passes and their invariants. This section introduces the concept of a PassManager, and how it can be used to configure and schedule a pass pipeline. There are two main classes related to pass management, the PassManager
and the OpPassManager
. The PassManager
class acts as the top-level entry point, and contains various configurations used for the entire pass pipeline. The OpPassManager
class is used to schedule passes to run at a specific level of nesting. The top-level PassManager
also functions as an OpPassManager
.
An OpPassManager
is essentially a collection of passes anchored to execute on operations at a given level of nesting. A pass manager may be op-specific
(anchored on a specific operation type), or op-agnostic
(not restricted to any specific operation, and executed on any viable operation type). Operation types that anchor pass managers must adhere to the following requirement:
Must be registered and marked IsolatedFromAbove
.
Passes can be added to a pass manager via addPass
.
An OpPassManager
is generally created by explicitly nesting a pipeline within another existing OpPassManager
via the nest<OpT>
or nestAny
methods. The former method takes the operation type that the nested pass manager will operate on. The latter method nests an op-agnostic
pass manager, that may run on any viable operation type. Nesting in this sense, corresponds to the structural nesting within Regions of the IR.
For example, the following .mlir
:
module { spirv.module "Logical" "GLSL450" { func @foo() { ... } } }
Has the nesting structure of:
`builtin.module` `spirv.module` `spirv.func`
Below is an example of constructing a pipeline that operates on the above structure:
// Create a top-level `PassManager` class. auto pm = PassManager::on<ModuleOp>(ctx); // Add a pass on the top-level module operation. pm.addPass(std::make_unique<MyModulePass>()); // Nest a pass manager that operates on `spirv.module` operations nested // directly under the top-level module. OpPassManager &nestedModulePM = pm.nest<spirv::ModuleOp>(); nestedModulePM.addPass(std::make_unique<MySPIRVModulePass>()); // Nest a pass manager that operates on functions within the nested SPIRV // module. OpPassManager &nestedFunctionPM = nestedModulePM.nest<func::FuncOp>(); nestedFunctionPM.addPass(std::make_unique<MyFunctionPass>()); // Nest an op-agnostic pass manager. This will operate on any viable // operation, e.g. func.func, spirv.func, spirv.module, builtin.module, etc. OpPassManager &nestedAnyPM = nestedModulePM.nestAny(); nestedAnyPM.addPass(createCanonicalizePass()); nestedAnyPM.addPass(createCSEPass()); // Run the pass manager on the top-level module. ModuleOp m = ...; if (failed(pm.run(m))) ... // One of the passes signaled a failure.
The above pass manager contains the following pipeline structure:
OpPassManager<ModuleOp> MyModulePass OpPassManager<spirv::ModuleOp> MySPIRVModulePass OpPassManager<func::FuncOp> MyFunctionPass OpPassManager<> Canonicalizer CSE
These pipelines are then run over a single operation at a time. This means that, for example, given a series of consecutive passes on func::FuncOp, it will execute all on the first function, then all on the second function, etc. until the entire program has been run through the passes. This provides several benefits:
In some situations it may be useful to run a pass pipeline within another pass, to allow configuring or filtering based on some invariants of the current operation being operated on. For example, the Inliner Pass may want to run intraprocedural simplification passes while it is inlining to produce a better cost model, and provide more optimal inlining. To enable this, passes may run an arbitrary OpPassManager
on the current operation being operated on or any operation nested within the current operation via the LogicalResult Pass::runPipeline(OpPassManager &, Operation *)
method. This method returns whether the dynamic pipeline succeeded or failed, similarly to the result of the top-level PassManager::run
method. A simple example is shown below:
void MyModulePass::runOnOperation() { ModuleOp module = getOperation(); if (hasSomeSpecificProperty(module)) { OpPassManager dynamicPM("builtin.module"); ...; // Build the dynamic pipeline. if (failed(runPipeline(dynamicPM, module))) return signalPassFailure(); } }
Note: though above the dynamic pipeline was constructed within the runOnOperation
method, this is not necessary and pipelines should be cached when possible as the OpPassManager
class can be safely copy constructed.
The mechanism described in this section should be used whenever a pass pipeline should run in a nested fashion, i.e. when the nested pipeline cannot be scheduled statically along with the rest of the main pass pipeline. More specifically, a PassManager
should generally never need to be constructed within a Pass
. Using runPipeline
also ensures that all analyses, instrumentations, and other pass manager related components are integrated with the dynamic pipeline being executed.
MLIR provides a builtin mechanism for passes to specify options that configure its behavior. These options are parsed at pass construction time independently for each instance of the pass. Options are defined using the Option<>
and ListOption<>
classes, and generally follow the LLVM command line flag definition rules. One major distinction from the LLVM command line functionality is that all ListOption
s are comma-separated, and delimited sub-ranges within individual elements of the list may contain commas that are not treated as separators for the top-level list.
struct MyPass ... { /// Make sure that we have a valid default constructor and copy constructor to /// ensure that the options are initialized properly. MyPass() = default; MyPass(const MyPass& pass) {} /// Any parameters after the description are forwarded to llvm::cl::list and /// llvm::cl::opt respectively. Option<int> exampleOption{*this, "flag-name", llvm::cl::desc("...")}; ListOption<int> exampleListOption{*this, "list-flag-name", llvm::cl::desc("...")}; };
For pass pipelines, the PassPipelineRegistration
templates take an additional template parameter for an optional Option
struct definition. This struct should inherit from mlir::PassPipelineOptions
and contain the desired pipeline options. When using PassPipelineRegistration
, the constructor now takes a function with the signature void (OpPassManager &pm, const MyPipelineOptions&)
which should construct the passes from the options and pass them to the pm:
struct MyPipelineOptions : public PassPipelineOptions { // The structure of these options is the same as those for pass options. Option<int> exampleOption{*this, "flag-name", llvm::cl::desc("...")}; ListOption<int> exampleListOption{*this, "list-flag-name", llvm::cl::desc("...")}; }; void registerMyPasses() { PassPipelineRegistration<MyPipelineOptions>( "example-pipeline", "Run an example pipeline.", [](OpPassManager &pm, const MyPipelineOptions &pipelineOptions) { // Initialize the pass manager. }); }
Statistics are a way to keep track of what the compiler is doing and how effective various transformations are. It is often useful to see what effect specific transformations have on a particular input, and how often they trigger. Pass statistics are specific to each pass instance, which allow for seeing the effect of placing a particular transformation at specific places within the pass pipeline. For example, they help answer questions like “What happens if I run CSE again here?”.
Statistics can be added to a pass by using the ‘Pass::Statistic’ class. This class takes as a constructor arguments: the parent pass, a name, and a description. This class acts like an atomic unsigned integer, and may be incremented and updated accordingly. These statistics rely on the same infrastructure as llvm::Statistic
and thus have similar usage constraints. Collected statistics can be dumped by the pass manager programmatically via PassManager::enableStatistics
; or via -mlir-pass-statistics
and -mlir-pass-statistics-display
on the command line.
An example is shown below:
struct MyPass ... { /// Make sure that we have a valid default constructor and copy constructor to /// ensure that the options are initialized properly. MyPass() = default; MyPass(const MyPass& pass) {} StringRef getArgument() const final { // This is the argument used to refer to the pass in // the textual format (on the commandline for example). return "argument"; } StringRef getDescription() const final { // This is a brief description of the pass. return "description"; } /// Define the statistic to track during the execution of MyPass. Statistic exampleStat{this, "exampleStat", "An example statistic"}; void runOnOperation() { ... // Update the statistic after some invariant was hit. ++exampleStat; ... } };
The collected statistics may be aggregated in two types of views:
A pipeline view that models the structure of the pass manager, this is the default view:
$ mlir-opt -pass-pipeline='any(func.func(my-pass,my-pass))' foo.mlir -mlir-pass-statistics ===-------------------------------------------------------------------------=== ... Pass statistics report ... ===-------------------------------------------------------------------------=== 'func.func' Pipeline MyPass (S) 15 exampleStat - An example statistic VerifierPass MyPass (S) 6 exampleStat - An example statistic VerifierPass VerifierPass
A list view that aggregates the statistics of all instances of a specific pass together:
$ mlir-opt -pass-pipeline='any(func.func(my-pass,my-pass))' foo.mlir -mlir-pass-statistics -mlir-pass-statistics-display=list ===-------------------------------------------------------------------------=== ... Pass statistics report ... ===-------------------------------------------------------------------------=== MyPass (S) 21 exampleStat - An example statistic
Briefly shown in the example definitions of the various pass types is the PassRegistration
class. This mechanism allows for registering pass classes so that they may be created within a textual pass pipeline description. An example registration is shown below:
void registerMyPass() { PassRegistration<MyPass>(); }
MyPass
is the name of the derived pass class.getArgument()
method is used to get the identifier that will be used to refer to the pass.getDescription()
method provides a short summary describing the pass.For passes that cannot be default-constructed, PassRegistration
accepts an optional argument that takes a callback to create the pass:
void registerMyPass() { PassRegistration<MyParametricPass>( []() -> std::unique_ptr<Pass> { std::unique_ptr<Pass> p = std::make_unique<MyParametricPass>(/*options*/); /*... non-trivial-logic to configure the pass ...*/; return p; }); }
This variant of registration can be used, for example, to accept the configuration of a pass from command-line arguments and pass it to the pass constructor.
Note: Make sure that the pass is copy-constructible in a way that does not share data as the pass manager may create copies of the pass to run in parallel.
Described above is the mechanism used for registering a specific derived pass class. On top of that, MLIR allows for registering custom pass pipelines in a similar fashion. This allows for custom pipelines to be available to tools like mlir-opt in the same way that passes are, which is useful for encapsulating common pipelines like the “-O1” series of passes. Pipelines are registered via a similar mechanism to passes in the form of PassPipelineRegistration
. Compared to PassRegistration
, this class takes an additional parameter in the form of a pipeline builder that modifies a provided OpPassManager
.
void pipelineBuilder(OpPassManager &pm) { pm.addPass(std::make_unique<MyPass>()); pm.addPass(std::make_unique<MyOtherPass>()); } void registerMyPasses() { // Register an existing pipeline builder function. PassPipelineRegistration<>( "argument", "description", pipelineBuilder); // Register an inline pipeline builder. PassPipelineRegistration<>( "argument", "description", [](OpPassManager &pm) { pm.addPass(std::make_unique<MyPass>()); pm.addPass(std::make_unique<MyOtherPass>()); }); }
The previous sections detailed how to register passes and pass pipelines with a specific argument and description. Once registered, these can be used to configure a pass manager from a string description. This is especially useful for tools like mlir-opt
, that configure pass managers from the command line, or as options to passes that utilize dynamic pass pipelines.
To support the ability to describe the full structure of pass pipelines, MLIR supports a custom textual description of pass pipelines. The textual description includes the nesting structure, the arguments of the passes and pass pipelines to run, and any options for those passes and pipelines. A textual pipeline is defined as a series of names, each of which may in itself recursively contain a nested pipeline description. The syntax for this specification is as follows:
pipeline ::= op-anchor `(` pipeline-element (`,` pipeline-element)* `)` pipeline-element ::= pipeline | (pass-name | pass-pipeline-name) options? options ::= '{' (key ('=' value)?)+ '}'
op-anchor
func.func
or builtin.module
, or any
, for op-agnostic pass managers that execute on any viable operation (i.e. any operation that can be used to anchor a pass manager).pass-name
| pass-pipeline-name
cse
or canonicalize
.options
For example, the following pipeline:
$ mlir-opt foo.mlir -cse -canonicalize -convert-func-to-llvm='use-bare-ptr-memref-call-conv=1'
Can also be specified as (via the -pass-pipeline
flag):
# Anchor the cse and canonicalize passes on the `func.func` operation. $ mlir-opt foo.mlir -pass-pipeline='builtin.module(func.func(cse,canonicalize),convert-func-to-llvm{use-bare-ptr-memref-call-conv=1})' # Anchor the cse and canonicalize passes on "any" viable root operation. $ mlir-opt foo.mlir -pass-pipeline='builtin.module(any(cse,canonicalize),convert-func-to-llvm{use-bare-ptr-memref-call-conv=1})'
In order to support round-tripping a pass to the textual representation using OpPassManager::printAsTextualPipeline(raw_ostream&)
, override StringRef Pass::getArgument()
to specify the argument used when registering a pass.
Some aspects of a Pass may be specified declaratively, in a form similar to operations. This specification simplifies several mechanisms used when defining passes. It can be used for generating pass registration calls, defining boilerplate pass utilities, and generating pass documentation.
Consider the following pass specified in C++:
struct MyPass : PassWrapper<MyPass, OperationPass<ModuleOp>> { MyPass() = default; MyPass(const MyPass &) {} ... // Specify any options. Option<bool> option{ *this, "example-option", llvm::cl::desc("An example option"), llvm::cl::init(true)}; ListOption<int64_t> listOption{ *this, "example-list", llvm::cl::desc("An example list option")}; // Specify any statistics. Statistic statistic{this, "example-statistic", "An example statistic"}; }; /// Expose this pass to the outside world. std::unique_ptr<Pass> foo::createMyPass() { return std::make_unique<MyPass>(); } /// Register this pass. void foo::registerMyPass() { PassRegistration<MyPass>(); }
This pass may be specified declaratively as so:
def MyPass : Pass<"my-pass", "ModuleOp"> { let summary = "My Pass Summary"; let description = [{ Here we can now give a much larger description of `MyPass`, including all of its various constraints and behavior. }]; // Specify any options. let options = [ Option<"option", "example-option", "bool", /*default=*/"true", "An example option">, ListOption<"listOption", "example-list", "int64_t", "An example list option"> ]; // Specify any statistics. let statistics = [ Statistic<"statistic", "example-statistic", "An example statistic"> ]; }
Using the gen-pass-decls
generator, we can generate most of the boilerplate above automatically. This generator takes as an input a -name
parameter, that provides a tag for the group of passes that are being generated. This generator produces code with multiple purposes:
The first is to register the declared passes with the global registry. For each pass, the generator produces a registerPassName
where PassName
is the name of the definition specified in tablegen. It also generates a registerGroupPasses
, where Group
is the tag provided via the -name
input parameter, that registers all of the passes present.
// Tablegen options: -gen-pass-decls -name="Example" // Passes.h namespace foo { #define GEN_PASS_REGISTRATION #include "Passes.h.inc" } // namespace foo void registerMyPasses() { // Register all of the passes. foo::registerExamplePasses(); // Or // Register `MyPass` specifically. foo::registerMyPass(); }
The second is to provide a way to configure the pass options. These classes are named in the form of MyPassOptions
, where MyPass
is the name of the pass definition in tablegen. The configurable parameters reflect the options declared in the tablegen file. These declarations can be enabled for the whole group of passes by defining the GEN_PASS_DECL
macro, or on a per-pass basis by defining GEN_PASS_DECL_PASSNAME
where PASSNAME
is the uppercase version of the name specified in tablegen.
// .h.inc #ifdef GEN_PASS_DECL_MYPASS struct MyPassOptions { bool option = true; ::llvm::ArrayRef<int64_t> listOption; }; #undef GEN_PASS_DECL_MYPASS #endif // GEN_PASS_DECL_MYPASS
The utogenerated file will also contain the declarations of the default constructors.
// .h.inc #ifdef GEN_PASS_DECL_MYPASS ... std::unique_ptr<::mlir::Pass> createMyPass(); std::unique_ptr<::mlir::Pass> createMyPass(const MyPassOptions &options); #undef GEN_PASS_DECL_MYPASS #endif // GEN_PASS_DECL_MYPASS
The last purpose of this generator is to emit a base class for each of the passes, containing most of the boiler plate related to pass definitions. These classes are named in the form of MyPassBase
and are declared inside the impl
namespace, where MyPass
is the name of the pass definition in tablegen. We can update the original C++ pass definition as so:
// MyPass.cpp /// Include the generated base pass class definitions. namespace foo { #define GEN_PASS_DEF_MYPASS #include "Passes.h.inc" } /// Define the main class as deriving from the generated base class. struct MyPass : foo::impl::MyPassBase<MyPass> { using MyPassBase::MyPassBase; /// The definitions of the options and statistics are now generated within /// the base class, but are accessible in the same way. };
These definitions can be enabled on a per-pass basis by defining the appropriate preprocessor GEN_PASS_DEF_PASSNAME
macro, with PASSNAME
equal to the uppercase version of the name of the pass definition in tablegen. The default constructors are also defined and expect the name of the actual pass class to be equal to the name defined in tablegen.
Using the gen-pass-doc
generator, markdown documentation for each of the passes can be generated. See Passes.md for example output of real MLIR passes.
The Pass
class is used to begin a new pass definition. This class takes as an argument the registry argument to attribute to the pass, as well as an optional string corresponding to the operation type that the pass operates on. The class contains the following fields:
summary
description
dependentDialects
Dialect
classes this pass may introduce entities, Attributes/Operations/Types/etc., of.options
statistics
constructor
Options may be specified via the Option
and ListOption
classes. The Option
class takes the following template parameters:
def MyPass : Pass<"my-pass"> { let options = [ Option<"option", "example-option", "bool", /*default=*/"true", "An example option">, ]; }
The ListOption
class takes the following fields:
def MyPass : Pass<"my-pass"> { let options = [ ListOption<"listOption", "example-list", "int64_t", "An example list option"> ]; }
Statistics may be specified via the Statistic
, which takes the following template parameters:
def MyPass : Pass<"my-pass"> { let statistics = [ Statistic<"statistic", "example-statistic", "An example statistic"> ]; }
MLIR provides a customizable framework to instrument pass execution and analysis computation, via the PassInstrumentation
class. This class provides hooks into the PassManager that observe various events:
runBeforePipeline
runAfterPipeline
runBeforePass
runAfterPass
runAfterPassFailed
will not be.runAfterPassFailed
runAfterPass
will not be.runBeforeAnalysis
runBeforeAnalysis
/runAfterAnalysis
pair for the dependency can be called from inside of the current runBeforeAnalysis
/runAfterAnalysis
pair.runAfterAnalysis
PassInstrumentation instances may be registered directly with a PassManager instance via the addInstrumentation
method. Instrumentations added to the PassManager are run in a stack like fashion, i.e. the last instrumentation to execute a runBefore*
hook will be the first to execute the respective runAfter*
hook. The hooks of a PassInstrumentation
class are guaranteed to be executed in a thread-safe fashion, so additional synchronization is not necessary. Below in an example instrumentation that counts the number of times the DominanceInfo
analysis is computed:
struct DominanceCounterInstrumentation : public PassInstrumentation { /// The cumulative count of how many times dominance has been calculated. unsigned &count; DominanceCounterInstrumentation(unsigned &count) : count(count) {} void runAfterAnalysis(llvm::StringRef, TypeID id, Operation *) override { if (id == TypeID::get<DominanceInfo>()) ++count; } }; MLIRContext *ctx = ...; PassManager pm(ctx); // Add the instrumentation to the pass manager. unsigned domInfoCount; pm.addInstrumentation( std::make_unique<DominanceCounterInstrumentation>(domInfoCount)); // Run the pass manager on a module operation. ModuleOp m = ...; if (failed(pm.run(m))) ... llvm::errs() << "DominanceInfo was computed " << domInfoCount << " times!\n";
MLIR utilizes the pass instrumentation framework to provide a few useful developer tools and utilities. Each of these instrumentations are directly available to all users of the MLIR pass framework.
The PassTiming instrumentation provides timing information about the execution of passes and computation of analyses. This provides a quick glimpse into what passes are taking the most time to execute, as well as how much of an effect a pass has on the total execution time of the pipeline. Users can enable this instrumentation directly on the PassManager via enableTiming
. This instrumentation is also made available in mlir-opt via the -mlir-timing
flag. The PassTiming instrumentation provides several different display modes for the timing results, each of which is described below:
In this mode, the results are displayed in a list sorted by total time with each pass/analysis instance aggregated into one unique result. This view is useful for getting an overview of what analyses/passes are taking the most time in a pipeline. This display mode is available in mlir-opt via -mlir-timing-display=list
.
$ mlir-opt foo.mlir -mlir-disable-threading -pass-pipeline='builtin.module(func.func(cse,canonicalize),convert-func-to-llvm)' -mlir-timing -mlir-timing-display=list ===-------------------------------------------------------------------------=== ... Execution time report ... ===-------------------------------------------------------------------------=== Total Execution Time: 0.0135 seconds ----Wall Time---- ----Name---- 0.0135 (100.0%) root 0.0041 ( 30.1%) Parser 0.0018 ( 13.3%) ConvertFuncToLLVMPass 0.0011 ( 8.2%) Output 0.0007 ( 5.2%) Pipeline Collection : ['func.func'] 0.0006 ( 4.6%) 'func.func' Pipeline 0.0005 ( 3.5%) Canonicalizer 0.0001 ( 0.9%) CSE 0.0001 ( 0.5%) (A) DataLayoutAnalysis 0.0000 ( 0.1%) (A) DominanceInfo 0.0058 ( 43.2%) Rest 0.0135 (100.0%) Total
The results can be displayed in JSON format via -mlir-output-format=json
.
$ mlir-opt foo.mlir -mlir-disable-threading -pass-pipeline='builtin.module(func.func(cse,canonicalize),convert-func-to-llvm)' -mlir-timing -mlir-timing-display=list -mlir-output-format=json [ {"wall": {"duration": 0.0135, "percentage": 100.0}, "name": "root"}, {"wall": {"duration": 0.0041, "percentage": 30.1}, "name": "Parser"}, {"wall": {"duration": 0.0018, "percentage": 13.3}, "name": "ConvertFuncToLLVMPass"}, {"wall": {"duration": 0.0011, "percentage": 8.2}, "name": "Output"}, {"wall": {"duration": 0.0007, "percentage": 5.2}, "name": "Pipeline Collection : ['func.func']"}, {"wall": {"duration": 0.0006, "percentage": 4.6}, "name": "'func.func' Pipeline"}, {"wall": {"duration": 0.0005, "percentage": 3.5}, "name": "Canonicalizer"}, {"wall": {"duration": 0.0001, "percentage": 0.9}, "name": "CSE"}, {"wall": {"duration": 0.0001, "percentage": 0.5}, "name": "(A) DataLayoutAnalysis"}, {"wall": {"duration": 0.0000, "percentage": 0.1}, "name": "(A) DominanceInfo"}, {"wall": {"duration": 0.0058, "percentage": 43.2}, "name": "Rest"}, {"wall": {"duration": 0.0135, "percentage": 100.0}, "name": "Total"} ]
In this mode, the results are displayed in a nested pipeline view that mirrors the internal pass pipeline that is being executed in the pass manager. This view is useful for understanding specifically which parts of the pipeline are taking the most time, and can also be used to identify when analyses are being invalidated and recomputed. This is the default display mode.
$ mlir-opt foo.mlir -mlir-disable-threading -pass-pipeline='builtin.module(func.func(cse,canonicalize),convert-func-to-llvm)' -mlir-timing ===-------------------------------------------------------------------------=== ... Execution time report ... ===-------------------------------------------------------------------------=== Total Execution Time: 0.0127 seconds ----Wall Time---- ----Name---- 0.0038 ( 30.2%) Parser 0.0006 ( 4.8%) 'func.func' Pipeline 0.0001 ( 0.9%) CSE 0.0000 ( 0.1%) (A) DominanceInfo 0.0005 ( 3.7%) Canonicalizer 0.0017 ( 13.7%) ConvertFuncToLLVMPass 0.0001 ( 0.6%) (A) DataLayoutAnalysis 0.0010 ( 8.2%) Output 0.0054 ( 42.5%) Rest 0.0127 (100.0%) Total
The results can be displayed in JSON format via -mlir-output-format=json
.
$ mlir-opt foo.mlir -mlir-disable-threading -pass-pipeline='builtin.module(func.func(cse,canonicalize),convert-func-to-llvm)' -mlir-timing -mlir-output-format=json [ {"wall": {"duration": 0.0038, "percentage": 30.2}, "name": "Parser", "passes": [ {}]}, {"wall": {"duration": 0.0006, "percentage": 4.8}, "name": "'func.func' Pipeline", "passes": [ {"wall": {"duration": 0.0001, "percentage": 0.9}, "name": "CSE", "passes": [ {"wall": {"duration": 0.0000, "percentage": 0.1}, "name": "(A) DominanceInfo", "passes": [ {}]}, {}]}, {"wall": {"duration": 0.0005, "percentage": 3.7}, "name": "Canonicalizer", "passes": [ {}]}, {}]}, {"wall": {"duration": 0.0017, "percentage": 13.7}, "name": "ConvertFuncToLLVMPass", "passes": [ {"wall": {"duration": 0.0001, "percentage": 0.6}, "name": "(A) DataLayoutAnalysis", "passes": [ {}]}, {}]}, {"wall": {"duration": 0.0010, "percentage": 8.2}, "name": "Output", "passes": [ {}]}, {"wall": {"duration": 0.0054, "percentage": 42.5}, "name": "Rest"}, {"wall": {"duration": 0.0127, "percentage": 100.0}, "name": "Total"} ]
When multi-threading is enabled in the pass manager the meaning of the display slightly changes. First, a new timing column is added, User Time
, that displays the total time spent across all threads. Secondly, the Wall Time
column displays the longest individual time spent amongst all of the threads. This means that the Wall Time
column will continue to give an indicator on the perceived time, or clock time, whereas the User Time
will display the total cpu time.
$ mlir-opt foo.mlir -pass-pipeline='builtin.module(func.func(cse,canonicalize),convert-func-to-llvm)' -mlir-timing ===-------------------------------------------------------------------------=== ... Pass execution timing report ... ===-------------------------------------------------------------------------=== Total Execution Time: 0.0078 seconds ---User Time--- ---Wall Time--- --- Name --- 0.0177 ( 88.5%) 0.0057 ( 71.3%) 'func.func' Pipeline 0.0044 ( 22.0%) 0.0015 ( 18.9%) CSE 0.0029 ( 14.5%) 0.0012 ( 15.2%) (A) DominanceInfo 0.0038 ( 18.9%) 0.0015 ( 18.7%) VerifierPass 0.0089 ( 44.6%) 0.0025 ( 31.1%) Canonicalizer 0.0006 ( 3.0%) 0.0002 ( 2.6%) VerifierPass 0.0004 ( 2.2%) 0.0004 ( 5.4%) VerifierPass 0.0013 ( 6.5%) 0.0013 ( 16.3%) LLVMLoweringPass 0.0006 ( 2.8%) 0.0006 ( 7.0%) VerifierPass 0.0200 (100.0%) 0.0081 (100.0%) Total
When debugging it is often useful to dump the IR at various stages of a pass pipeline. This is where the IR printing instrumentation comes into play. This instrumentation allows for conditionally printing the IR before and after pass execution by optionally filtering on the pass being executed. This instrumentation can be added directly to the PassManager via the enableIRPrinting
method. mlir-opt
provides a few useful flags for utilizing this instrumentation:
mlir-print-ir-before=(comma-separated-pass-list)
mlir-print-ir-before-all
$ mlir-opt foo.mlir -pass-pipeline='func.func(cse)' -mlir-print-ir-before=cse *** IR Dump Before CSE *** func.func @simple_constant() -> (i32, i32) { %c1_i32 = arith.constant 1 : i32 %c1_i32_0 = arith.constant 1 : i32 return %c1_i32, %c1_i32_0 : i32, i32 }
mlir-print-ir-after=(comma-separated-pass-list)
mlir-print-ir-after-all
$ mlir-opt foo.mlir -pass-pipeline='func.func(cse)' -mlir-print-ir-after=cse *** IR Dump After CSE *** func.func @simple_constant() -> (i32, i32) { %c1_i32 = arith.constant 1 : i32 return %c1_i32, %c1_i32 : i32, i32 }
mlir-print-ir-after-change
$ mlir-opt foo.mlir -pass-pipeline='func.func(cse,cse)' -mlir-print-ir-after=cse -mlir-print-ir-after-change *** IR Dump After CSE *** func.func @simple_constant() -> (i32, i32) { %c1_i32 = arith.constant 1 : i32 return %c1_i32, %c1_i32 : i32, i32 }
mlir-print-ir-after-failure
mlir-print-ir-after
flags above.$ mlir-opt foo.mlir -pass-pipeline='func.func(cse,bad-pass)' -mlir-print-ir-after-failure *** IR Dump After BadPass Failed *** func.func @simple_constant() -> (i32, i32) { %c1_i32 = arith.constant 1 : i32 return %c1_i32, %c1_i32 : i32, i32 }
mlir-print-ir-module-scope
-mlir-disable-threading
)$ mlir-opt foo.mlir -mlir-disable-threading -pass-pipeline='func.func(cse)' -mlir-print-ir-after=cse -mlir-print-ir-module-scope *** IR Dump After CSE *** ('func.func' operation: @bar) func.func @bar(%arg0: f32, %arg1: f32) -> f32 { ... } func.func @simple_constant() -> (i32, i32) { %c1_i32 = arith.constant 1 : i32 %c1_i32_0 = arith.constant 1 : i32 return %c1_i32, %c1_i32_0 : i32, i32 } *** IR Dump After CSE *** ('func.func' operation: @simple_constant) func.func @bar(%arg0: f32, %arg1: f32) -> f32 { ... } func.func @simple_constant() -> (i32, i32) { %c1_i32 = arith.constant 1 : i32 return %c1_i32, %c1_i32 : i32, i32 }
mlir-print-ir-tree-dir=(directory path)
stderr
. If you provide a directory using this option, the output corresponding to each pass will be printed to a file in the directory tree rooted at (directory path)
. The path created for each pass reflects the nesting structure of the IR and the pass pipeline.func.func
located within two nested builtin.module
ops.1_1_pass4.mlir
files, the first 1 refers to the counter for the parent op, and the second refers to the counter for the respective function.$ pipeline="builtin.module(pass1,pass2,func.func(pass3,pass4),pass5)" $ mlir-opt foo.mlir -pass-pipeline="$pipeline" -mlir-print-ir-tree-dir=/tmp/pipeline_output $ tree /tmp/pipeline_output /tmp/pass_output ├── builtin_module_the_symbol_name │ ├── 0_pass1.mlir │ ├── 1_pass2.mlir │ ├── 2_pass5.mlir │ ├── func_func_my_func_name │ │ ├── 1_0_pass3.mlir │ │ ├── 1_1_pass4.mlir │ ├── func_func_my_other_func_name │ │ ├── 1_0_pass3.mlir │ │ ├── 1_1_pass4.mlir
mlir-use-nameloc-as-prefix
If your source IR has named locations (loc("named_location")"
) then passing this flag will use those names (named_location
) to prefix the corresponding SSA identifiers:
%1 = memref.load %0[] : memref<i32> loc("alice") %2 = memref.load %0[] : memref<i32> loc("bob") %3 = memref.load %0[] : memref<i32> loc("bob")
will print
%alice = memref.load %0[] : memref<i32> %bob = memref.load %0[] : memref<i32> %bob_0 = memref.load %0[] : memref<i32>
These names will also be preserved through passes to newly created operations if using the appropriate location.
The pass manager in MLIR contains a builtin mechanism to generate reproducibles in the event of a crash, or a pass failure. This functionality can be enabled via PassManager::enableCrashReproducerGeneration
or via the command line flag mlir-pass-pipeline-crash-reproducer
. In either case, an argument is provided that corresponds to the output .mlir
file name that the reproducible should be written to. The reproducible contains the configuration of the pass manager that was executing, as well as the initial IR before any passes were run. The reproducer is stored within the assembly format as an external resource. A potential reproducible may have the form:
module { func.func @foo() { ... } } {-# external_resources: { mlir_reproducer: { pipeline: "builtin.module(func.func(cse,canonicalize),inline)", disable_threading: true, verify_each: true } } #-}
The configuration dumped can be passed to mlir-opt
by specifying -run-reproducer
flag. This will result in parsing the configuration of the reproducer and adjusting the necessary opt state, e.g. configuring the pass manager, context, etc.
Beyond specifying a filename, one can also register a ReproducerStreamFactory
function that would be invoked in the case of a crash and the reproducer written to its stream.
An additional flag may be passed to PassManager::enableCrashReproducerGeneration
, and specified via mlir-pass-pipeline-local-reproducer
on the command line, that signals that the pass manager should attempt to generate a “local” reproducer. This will attempt to generate a reproducer containing IR right before the pass that fails. This is useful for situations where the crash is known to be within a specific pass, or when the original input relies on components (like dialects or passes) that may not always be available.
Note: Local reproducer generation requires that multi-threading is disabled(-mlir-disable-threading
)
For example, if the failure in the previous example came from the canonicalize
pass, the following reproducer would be generated:
module { func.func @foo() { ... } } {-# external_resources: { mlir_reproducer: { pipeline: "builtin.module(func.func(canonicalize))", disable_threading: true, verify_each: true } } #-}