src/solve/canonicalization.md - rust-lang/rustc-dev-guide - Git at Google

 # Canonicalization

 Canonicalization is the process of *isolating* a value from its context and is necessary
 for global caching of goals which include inference variables.

 The idea is that given the goals `u32: Trait<?x>` and `u32: Trait<?y>`, where `?x` and `?y`
 are two different currently unconstrained inference variables, we should get the same result
 for both goals. We can therefore prove *the canonical query* `exists<T> u32: Trait<T>` once
 and reuse the result.

 Let's first go over the way canonical queries work and then dive into the specifics of
 how canonicalization works.

 ## A walkthrough of canonical queries

 To make this a bit easier, let's use the trait goal `u32: Trait<?x>` as an example with the
 assumption that the only relevant impl is `impl<T> Trait<Vec<T>> for u32`.

 ### Canonicalizing the input

 We start by *canonicalizing* the goal, replacing inference variables with existential and
 placeholders with universal bound variables. This would result in the *canonical goal*
 `exists<T> u32: Trait<T>`.

 We remember the original values of all bound variables in the original context. Here this would
 map `T` back to `?x`. These original values are used later on when dealing with the query
 response.

 We now call the canonical query with the canonical goal.

 ### Instantiating the canonical goal inside of the query

 To actually try to prove the canonical goal we start by instantiating the bound variables with
 inference variables and placeholders again.

 This happens inside of the query in a completely separate `InferCtxt`. Inside of the query we
 now have a goal `u32: Trait<?0>`. We also remember which value we've used to instantiate the bound
 variables in the canonical goal, which maps `T` to `?0`.

 We now compute the goal `u32: Trait<?0>` and figure out that this holds, but we've constrained
 `?0` to `Vec<?1>`. We finally convert this result to something useful to the caller.

 ### Canonicalizing the query response

 We have to return to the caller both whether the goal holds, and the inference constraints
 from inside of the query.

 To return the inference results to the caller we canonicalize the mapping from bound variables
 to the instantiated values in the query. This means that the query response is `Certainty::Yes`
 and a mapping from `T` to `exists<U> Vec<U>`.

 ### Instantiating the query response

 The caller now has to apply the constraints returned by the query. For this they first
 instantiate the bound variables of the canonical response with inference variables and
 placeholders again, so the mapping in the response is now from `T` to `Vec<?z>`.

 It now equates the original value of `T` (`?x`) with the value for `T` in the
 response (`Vec<?z>`), which correctly constrains `?x` to `Vec<?z>`.

 ## `ExternalConstraints`

 Computing a trait goal may not only constrain inference variables, it can also add region
 obligations, e.g. given a goal `(): AOutlivesB<'a, 'b>` we would like to return the fact that
 `'a: 'b` has to hold.

 This is done by not only returning the mapping from bound variables to the instantiated values
 from the query but also extracting additional `ExternalConstraints` from the `InferCtxt` context
 while building the response.

 ## How exactly does canonicalization work

 TODO: link to code once the PR lands and elaborate

 - types and consts: infer to existentially bound var, placeholder to universally bound var,
     considering universes
 - generic parameters in the input get treated as placeholders in the root universe
 - all regions in the input get all mapped to existentially bound vars and we "uniquify" them.
     `&'a (): Trait<'a>` gets canonicalized to `exists<'0, '1> &'0 (): Trait<'1>`. We do not care
     about their universes and simply put all regions into the highest universe of the input.
 - in the output everything in a universe of the caller gets put into the root universe and only
     gets its correct universe when we unify the var values with the orig values of the caller
 - we do not uniquify regions in the response and don't canonicalize `'static`
	# Canonicalization

	Canonicalization is the process of isolating a value from its context and is necessary
	for global caching of goals which include inference variables.

	The idea is that given the goals `u32: Trait<?x>` and `u32: Trait<?y>`, where `?x` and `?y`
	are two different currently unconstrained inference variables, we should get the same result
	for both goals. We can therefore prove the canonical query `exists<T> u32: Trait<T>` once
	and reuse the result.

	Let's first go over the way canonical queries work and then dive into the specifics of
	how canonicalization works.

	## A walkthrough of canonical queries

	To make this a bit easier, let's use the trait goal `u32: Trait<?x>` as an example with the
	assumption that the only relevant impl is `impl<T> Trait<Vec<T>> for u32`.

	### Canonicalizing the input

	We start by canonicalizing the goal, replacing inference variables with existential and
	placeholders with universal bound variables. This would result in the canonical goal
	`exists<T> u32: Trait<T>`.

	We remember the original values of all bound variables in the original context. Here this would
	map `T` back to `?x`. These original values are used later on when dealing with the query
	response.

	We now call the canonical query with the canonical goal.

	### Instantiating the canonical goal inside of the query

	To actually try to prove the canonical goal we start by instantiating the bound variables with
	inference variables and placeholders again.

	This happens inside of the query in a completely separate `InferCtxt`. Inside of the query we
	now have a goal `u32: Trait<?0>`. We also remember which value we've used to instantiate the bound
	variables in the canonical goal, which maps `T` to `?0`.

	We now compute the goal `u32: Trait<?0>` and figure out that this holds, but we've constrained
	`?0` to `Vec<?1>`. We finally convert this result to something useful to the caller.

	### Canonicalizing the query response

	We have to return to the caller both whether the goal holds, and the inference constraints
	from inside of the query.

	To return the inference results to the caller we canonicalize the mapping from bound variables
	to the instantiated values in the query. This means that the query response is `Certainty::Yes`
	and a mapping from `T` to `exists<U> Vec<U>`.

	### Instantiating the query response

	The caller now has to apply the constraints returned by the query. For this they first
	instantiate the bound variables of the canonical response with inference variables and
	placeholders again, so the mapping in the response is now from `T` to `Vec<?z>`.

	It now equates the original value of `T` (`?x`) with the value for `T` in the
	response (`Vec<?z>`), which correctly constrains `?x` to `Vec<?z>`.

	## `ExternalConstraints`

	Computing a trait goal may not only constrain inference variables, it can also add region
	obligations, e.g. given a goal `(): AOutlivesB<'a, 'b>` we would like to return the fact that
	`'a: 'b` has to hold.

	This is done by not only returning the mapping from bound variables to the instantiated values
	from the query but also extracting additional `ExternalConstraints` from the `InferCtxt` context
	while building the response.

	## How exactly does canonicalization work

	TODO: link to code once the PR lands and elaborate

	- types and consts: infer to existentially bound var, placeholder to universally bound var,
	considering universes
	- generic parameters in the input get treated as placeholders in the root universe
	- all regions in the input get all mapped to existentially bound vars and we "uniquify" them.
	`&'a (): Trait<'a>` gets canonicalized to `exists<'0, '1> &'0 (): Trait<'1>`. We do not care
	about their universes and simply put all regions into the highest universe of the input.
	- in the output everything in a universe of the caller gets put into the root universe and only
	gets its correct universe when we unify the var values with the orig values of the caller
	- we do not uniquify regions in the response and don't canonicalize `'static`