9. Using Lean as a Proof Assistant
This chapter marks three shifts in the way we are using Lean. First, up to this point we have been using Lean as a programming language, whereas here we will begin to use Lean as a proof assistant. In the theory-implementation-application trichotomy that we used to describe this course, this marks a shift from using Lean to implement logical tools to using Lean as an important application of logic, one that is used to verify hardware, software, and complex systems as well as mathematical theorems. Finally, whereas up to now our focus has been on representing logical languages in Lean, now we are using Lean as a logical language itself. This can be viewed as a shift from treating logic as an object language to treating logic as a metalanguage, a distinction we will now explain.
Imagine implementing one programming language, such as Lisp, in another programming language, like C++. In this scenario, we can characterize Lisp as being the object language, that is, the one that is being implemented, and C++ as the metalanguage, the one that is carrying out the implementation. What we did in Chapter 5 is similar: we are used one logical system, Lean, to implement another one, propositional logic. Whether we care about logical languages or programming languages (or a combination of both), it’s often the case that we use one language to implement or describe another. If we use Lean to implement a decision procedure for propositional logic, propositional logic is the object language, the target of our implementation. If we use propositional connectives (and more) to prove the correctness of that procedure in Lean, then in that case we are using propositional logic as part of the metalanguage, the thing we are using to achieve our goal.
One goal of this chapter is to clarify the sense in which Lean itself is a logical system, which is to say, its language can be used to state mathematical theorems and prove them. Using a logical foundation like Lean’s as both a programming language and a mathematical language brings a number of benefits:
It allows us to specify the behavior of computer programs in the same language that we write them.
It allows us to prove, rigorously, that our computer programs are correct, which is to say, that they meet their specifications.
It allows us to enforce preconditions on our programs. For example, we can write functions whose input is required to be a positive integer, a requirement that is enforced statically, at compile time. Compiler optimizations can make use of this knowledge.
It allows us to compute with objects in our mathematical libraries.
It gives us ways of using computation reliably in mathematical proofs.
Although we will not discuss it in this course, Lean also serves as its own metaprogramming language, which means that we can use Lean to develop automation that can help us construct programs and proofs. In that way, Lean becomes a self-extending system, meaning that we can improve its support for programming and theorem proving using the system itself.
Lean’s logical foundation is powerful and expressive enough to carry out any mathematical argument, and proof assistants are now commonly used for hardware and software verification. This chapter, we introduce the basics of using Lean as a proof assistant by focusing on the treatment of propositional logic. Chapter 13 will extend this to equational reasoning and proof by induction, but even that only scratches the surface. You can learn more about the use of Lean as a proof assistant in either Mathematics in Lean or Theorem Proving in Lean 4. You may also enjoy interactive tutorials, like the Natural Number Game, found on the Lean Game Server.
For the examples in this chapter, we use the following imports:
import Mathlib.Data.Real.Basic
import Mathlib.Tactic
Since importing a file also imports everything it depends on, these serve to make available to us a modest portion of Lean’s mathematical library, Mathlib.
9.1. Propositional logic
In Lean, we can declare variables that range over propositions and then use them to build more complicated propositions.
variable (P Q R S : Prop)
#check True
#check False
#check P ∧ Q
#check P ∨ Q
#check P → Q
#check P ↔ Q
#check ¬ P
Hovering over the symbols will give you options for typing them. Using \and
, \or
, \to
,
\iff
, and \not
will work.
In Lean, you state a theorem with the keyword theorem
.
theorem easy : P → P := by
intro h
apply h
What follows the keyword by
is what is known as a tactic proof, that is,
a list of commands that tells Lean how to construct a proof of the theorem.
In the next section, we will say more about how proofs are represented in Lean
and what tactics do underneath the hood.
Here, instead, we will focus on how to use them.
Stating a theorem creates a goal, namely, the theorem to be proved.
If you put your cursor at the beginning of the line intro h
,
the infoview window shows the following goal:
P Q R S: Prop
⊢ P → P
If you put your cursor at the end of the line, you will see that the goal has changed to:
P Q R S: Prop
h : P
⊢ P
When the cursor is at the beginning of the line intro h
,
the highlighting in the infoview window helpfully shows what is about to change,
and when you move the cursor to the end of the line,
the highlighting shows what has just changed.
The notation should remind you of a sequent in natural deduction: the second goal says that we
are given propositions P
, Q
, R
, and S
together with the assumption
that P
holds, labelled as h
, and we are to prove P
.
The command apply P
finishes it off, leaving you with the happy message, “no goals.”
The label h
is arbitrary, and you can use any valid identifier instead.
It’s conventional to use the letter h
for hypotheses.
The information before the turnstile is called the context.
The right side of the sequent is also sometimes confusingly called the goal, but
we will call it the conclusion of the sequent and reserve the term “goal”
for the entire sequent.
You can use the example
keyword to prove a theorem without naming it:
example : P → P := by
intro h
apply h
When an apply
command finishes off the goal exactly, it is conventional to use the
exact
tactic instead:
example : P → P := by
intro h
exact h
You can also end a proof with the done
tactic,
which does nothing but declare that there are no more goals.
example : P → P := by
intro h
exact h
done
It is often useful to put a done
at the end of a proof while you are still
writing it.
Lean gives you an error message if the proof isn’t over, and the error message tells you
the goals that remain to be proved.
It can be convenient to force Lean to put the error message on the done
command rather than somewhere lower down in the file. You can then delete the done
when you are finished with the proof.
Notice, by the way, that you can learn more about a tactic by hovering over it in VS Code.
The apply
tactic can also be used to apply an implication.
example (h1 : P → Q) (h2 : P) : Q := by
apply h1
exact h2
In this example, applying the assumption P → Q
to the goal reduces the task of
proving Q
to the task of proving P
.
To prove a conjunction, we can use the constructor
tactic:
example : P → Q → P ∧ Q := by
intro hP hQ
constructor
. exact hP
. exact hQ
The command intro hP hQ
is shorthand for writing intro hP
followed by intro hQ
.
The constructor
tactic tells Lean we intend to prove P ∧ Q
by proving
each conjunct in turn.
As a result, after that line, there are two goals to prove, one for each conjunct.
The first period focuses on the first goal, which is solved using hP
,
and the second period focuses on the second goal, which is using hQ
.
Lean is whitespace sensitive, and once you focus on a goal, Lean expects you to maintain
the indentation until the goal is solved.
You can check that the proof still works if you delete the periods.
Structuring a proof in this way, however, tends to make it more readable and robust.
At this point, there is a trick that is worth mentioning.
At any point in a proof, you can solve the current goal with the sorry
tactic.
You can prove any theorem at all using sorry
.
It’s cheating, and the squiggly line the editor puts underneath the name of the theorem tells
you as much.
But it is often a useful device when writing proofs, because it means you can temporarily
close a goal to work on others, and then come back to it.
The natural way to use a conjunction h : P ∧ Q
in a hypothesis is to split it
to the hypotheses hP : P
and hQ : Q
.
example : P ∧ Q → Q ∧ P := by
intro h
rcases h with ⟨hP, hQ⟩
constructor
. exact hQ
. exact hP
Here the angle brackets denote a conjunctive pattern for Lean to match h
against, and the rcases
tactic does the matching, removing h : P ∧ Q
and replacing it with hP : P
and hQ : Q
.
The corresponding proof for disjunction is dual:
example : P ∨ Q → Q ∨ P := by
intro h
rcases h with hP | hQ
. right
exact hP
. left
exact hQ
Here we use a vertical bar with rcases h
because the split is disjunctive,
resulting in two goals: one in which we have hP : P
in the context and one
in which we have hQ : Q
. The right
tactic tells Lean that we want to prove
Q ∨ P
by proving P
, and the left
tactic tells Lean that we want to prove
Q ∨ P
by proving Q
.
Notice that in contrast to casing on a conjunction, which results in one new goal and
two new hypotheses,
casing on a disjunction results in two new goals, each with one new hypothesis.
Recall from Section 8.6 that in the natural deduction proof system, rules for the connectives can be categorized as either introduction rules or elimination rules. Even though we have not yet clarified the relationship between natural deduction and Lean’s internal logic, you should notice that the tactics we have given so far follow a similar pattern:
Given a goal of the form
P → Q
, we can use theintro
tactic to prove it.Given a hypothesis of the form
h : P → Q
, we can use theapply
tactic to use it.Given a goal of the form
P ∧ Q
, we can use theconstructor
tactic to prove it.Given a hypothesis of the form
h : P ∧ Q
, we can use thercases
tactic to use it.Given a goal of the form
P ∨ Q
, we can use theleft
andright
tactics to prove it.Given a hypothesis of the form
h : P ∨ Q
, we can use thercases
tactic to use it.
In Lean, negation ¬ P
is defined to be P → False
.
For most purposes, the two expressions are interchangeable.
This means that you can prove ¬ P
by assuming P
and deriving False
, and if you have hnp : ¬ P
,
you an reduce the task of proving False
to the task of proving P
.
In the next example, the intro
tactic implements the first strategy,
and the apply
tactic implements the second.
example (h : P → Q) : ¬ Q → ¬ P := by
intro hnQ
intro hP
apply hnQ
apply h
exact hP
After the second intro
, we have h : P → Q
, hnQ : ¬ Q
, and hP : P
in the context and False
as the desired conclusion. Applying hnQ
leaves
us the goal of proving Q
, which we do using h
and hP
.
There is no introduction rule for falsity; there is no canonical way to prove False
!
If we assume h : False
, we can reach any conclusion we want,
using either contradiction
or rcases h
.
Intuitively, there is no proof of False
, so if we have h : False
,
there are no cases to consider.
example (h : False) : P := by
contradiction
example (h : False) : P := by
rcases h
As a convenience, if you have h : P
and hnP : ¬ P
in the context,
the contradiction tactic solves the goal automatically.
The principles we have used before all fall within what is known as
intuitionistic logic. Many mathematical arguments require classical logic,
which is embodied in the law of the excluded middle, P ∨ ¬ P
.
This is embodied in the by_cases
tactic, which lets us split on cases.
In the proof below, we have hP : P
in the first case and hnP : ¬ P
in the second.
example (h1 : P → Q) : ¬ P ∨ Q := by
by_cases hP : P
. right
apply h1
exact hP
. left
exact hP
example : ¬ ¬ P → P := by
intro hnnP
by_cases hP : P
. exact hP
. contradiction
Classical reasoning is also embodied in the by_contra
tactic, which allows
us to do proof by contradiction.
example : ¬ ¬ P → P := by
intro hnnP
by_contra hnP
apply hnnP
exact hnP
Applying by_contra hnP
to a goal with conclusion P
adds hnP : ¬ P
to the context
and asks us to prove False
.
You can think of P ↔ Q
as an abbreviation for (P → Q) ∧ (Q → P)
.
It is not implemented exactly that way in Lean, but like a conjunction,
the task of proving such a statement can be split into the forward
and backward directions.
example (h1 : P ↔ Q) (h2 : P) : Q := by
rcases h1 with ⟨hpq, _⟩
apply hpq
exact h2
example (h1 : P ↔ Q) : Q ↔ P := by
rcases h1 with ⟨h2, h3⟩
constructor
. exact h3
. exact h2
In the first example, the underscore omits a label for the hypothesis, since we don’t
needed it. If you use a label, Lean’s built in linter will warn you that the label is unused.
The hypothesis still appears in the context with an inaccessible name, and can be used
by automation. If you replace the underscore with a dash (-
), the rcases
tactic
clears it entirely from the context.
The methods in this section are complete for classical propositional logic,
which is to say, any proof in classical propositional logic can be carried out
using these methods.
That is not to say that these methods are the most efficient; Lean
offers both automation and syntax for writing proofs more compactly.
As far as automation is concerned, all the proofs in this section
can be carried out using the tauto
tactic, which stands for “tautology.”
example (h1 : P → Q) : ¬ P ∨ Q := by
tauto
But writing propositional proofs by hand is a good way to learn how propositional logic works in Lean, and in more complicated proofs we often need to carry out simple logical manipulations by hand.
9.2. Proof terms
Remember that, in Lean’s foundation, everything is expression.
In particular, if an expression P
has type Prop
, then a term p
of type P
is interpreted as a proof of P
.
Lean’s proof language is therefore essentially the same as its programming language,
which means that we can write proofs the same way we write programs.
def prod_swap (α β : Type) : α × β → β × α :=
fun p => Prod.mk (Prod.snd p) (Prod.fst p)
theorem and_swap (P Q : Prop) : P ∧ Q → Q ∧ P :=
fun h => And.intro (And.right h) (And.left h)
def sum_swap (α β : Type) : Sum α β → Sum β α :=
fun s => match s with
| Sum.inl a => Sum.inr a
| Sum.inr b => Sum.inl b
theorem or_swap (P Q : Prop) : P ∨ Q → Q ∨ P :=
fun h => match h with
| Or.inl hp => Or.inr hp
| Or.inr hq => Or.inl hq
Instead of using the identifier def
, it is conventional to use the word theorem
to name the proof of a proposition.
Each keyword serves to assign an expression to an identifier; the main difference
is that theorems are marked opaque, which means that they are generally
not unfolded. A proof written this way is called a proof term.
The tactic proofs described in the previous section are nothing more than instructions
that tell the system how to construct such an expression.
The syntax for proof terms, in turn, is closely related to natural deduction.
The #explode
command prints proof terms in a format that is closer to the presentation of
natural deduction in Section 8.6.
#explode and_swap
#explode or_swap
Don’t look too hard for an exact correspondence, because the details differ. But Lean’s foundation can be seen as a vast generalization of natural deduction, giving rise to its uniform treatment of proofs and programs. The analogy between the two is known as the Curry-Howard correspondence, and it explains the fact that the introduction and elimination rules for the connectives look a lot like the constructors and destructors for the corresponding data types. The examples above show that the introduction and elimination rules for conjunction parallel the constructors and destructors for the pair data type, and the introduction and elimination rules for disjunction parallel the constructors and destructors for the sum data type. In Lean’s foundation, the introduction and elimination rules for implication are exactly lambda abstraction and function application, respectively.
example : P → P ∧ P := fun h => And.intro h h
example (h1 : P → Q) (h2 : P) : Q := h1 h2
In Lean, ¬ P
is defined to be P → False
. The following proof assumes P
and
¬ P
, concludes ⊥
, and then uses the ex falso principle to conclude Q
.
example : P → ¬ P → Q := fun h1 h2 => False.elim (h2 h1)
Lean allows the use of anonymous constructors and projections with proof terms just as with data:
example : P ∧ Q → Q ∧ P :=
fun h => ⟨h.right, h.left⟩
You can use proof terms in a tactic proof:
example : P → Q → (P ∧ Q) ∨ R := by
intro h1 h2
exact Or.inl (And.intro h1 h2)
In fact, under the hood, a tactic proof is essentially a program that constructs the relevant proof. Here are some more examples:
theorem and_swap' : P ∧ Q → Q ∧ P := by
intro h
rcases h with ⟨h1, h2⟩
constructor
. exact h2
. exact h1
theorem or_swap' : P ∨ Q → Q ∨ P := by
intro h
rcases h with h1 | h2
. right; exact h1
. left; exact h2
#print and_swap'
#explode and_swap'
#print or_swap'
#explode or_swap'
Lean’s show_term
tactic will show you the proof term that a tactic block produces,
and it will even given you the option of clicking to replace the tactic block by an explicit
use of that term.
example (P Q : Prop) : P ∧ Q → Q ∧ P := by
intro h
show_term {
rcases h with ⟨h1, h2⟩
constructor
. exact h2
. exact h1}
You can read more about Lean expressions in
Theorem Proving in Lean 4.
For the most part, in this course, we will focus on tactic proofs,
but you should feel free to use a proof term with apply
or exact
when you feel like it.
More importantly, you should keep in mind that at the end of the day, what you are doing is
constructing something like a natural deduction proof, represented as an expression in Lean’s
foundation.
You can therefore think of Lean as an engine for manipulating expressions that can represent
data types, statements, programs, and proofs.
Tactics are pieces of automation that are designed to help us construct these expressions.
9.3. Structured proofs
Tactics allow us to work backward to prove a goal by reducing it to simpler ones.
Lean also provides a means of reasoning forward from hypotheses,
using the have
tactic:
example (h1 : P → Q) (h2 : Q → R) (h3 : P) : R := by
have hQ : Q := by
apply h1
exact h3
apply h2
exact hQ
Here the first line of the proof states an intermediate goal of proving Q
.
The result is named hQ
, which we are then free to use.
Using have
tends to make proofs more readable
because the text displays the stepping stones to the main goal.
It also tends to make proofs more robust,
in that errors are localized to small, compartmentalized parts of the proof.
If you omit the keyword by
, you can use a proof term to fill an easy have
:
example (h1 : P → Q) (h2 : Q → R) (h3 : P) : R := by
have hQ : Q := h1 h3
exact h2 hQ
You can even write have hQ := h1 h3
in the second line of the first proof, omitting the type
ascription, because Lean can figure that out. Of course, this sacrifices readability somewhat.
You can generallly eliminate a have
statement by inlining the result.
For example, replacing hQ
by h1 h3
yields the first proof below:
example (h1 : P → Q) (h2 : Q → R) (h3 : P) : R := by
exact h2 (h1 h3)
example (h1 : P → Q) (h2 : Q → R) (h3 : P) : R := h2 (h1 h3)
The proof by exact h2 (h1 h3)
is equivalent to just presenting the proof term h2 (h1 h3)
,
as we do in the second example.
Lean also has a show
tactic, which declares the goal we are about to solve.
example (h1 : P → Q) (h2 : Q → R) (h3 : P) : R := by
have hQ : Q := h1 h3
show R
exact h2 hQ
example (h1 : P ∧ Q) (h2 : Q → R) : P ∧ R := by
rcases h1 with ⟨hP, hQ⟩
have hR : R := h2 hQ
show P ∧ R
exact ⟨hP, hR⟩
The show
tactic mainly serves to make a proof more readable, though it does a little more work than
we are letting on here. For example, if there are multiple goals left to prove, show
fill focus on the
first one that matches the given statement.
9.4. Exercises
Complete the following proofs in Lean.
import Mathlib.Data.Real.Basic
variable (P Q R S : Prop)
example : P ∧ Q ∧ (P → R) → R := by
sorry
example : (P → Q) ∧ (Q → R) → P → r := by
sorry
example (h : P → Q) (h1 : P ∧ R) : Q ∧ r := by
sorry
example (h : ¬ (P ∧ Q)) : P → ¬ Q := by
sorry
example (h : ¬ (P → Q)) : ¬ Q := by
sorry
example (h : P ∧ ¬ Q) : ¬ (P → Q) := by
sorry
example (h1 : P ∨ Q) (h2 : P → R) : r ∨ Q := by
sorry
example (h1 : P ∨ Q → R) : (P → R) ∧ (Q → R) := by
sorry
example (h1 : P → R) (h2 : Q → R) : P ∨ Q → R := by
sorry
example (h : ¬ (P ∨ Q)) : ¬ P ∧ ¬ Q := by
sorry
example (h : ¬ P ∧ ¬ Q) : ¬ (P ∨ Q) := by
sorry
-- this one requires classical logic!
example (h : ¬ (P ∧ Q)) : ¬ P ∨ ¬ Q := by
sorry
-- this one too
example (h : P → Q) : ¬ P ∨ Q := by
sorry