.. _chapter_decision_procedures_for_first_order_logic:

Decision Procedures for First-Order Logic
=========================================

In the terminology of logic and computer science, a *decision procedure* is an
algorithm that accepts a class of yes/no questions and answers them correctly. Fix a set of first-order sentences, :math:`\Gamma`, which we can think of as a set of axioms of as a description of a
possible state of affairs. Here are some questions we can ask:

- Given  a first-order sentence :math:`A`,
  does :math:`\Gamma` prove :math:`A`, that is, :math:`\Gamma \proves A`?
- Given  a first-order sentence :math:`A`,
  does :math:`\Gamma` entail :math:`A`, that is, :math:`\Gamma \models A`?
- Given  a first-order sentence :math:`A`,
  is :math:`\Gamma \cup \{ A \}` is satisfiable, that is, is there a model
  :math:`\mdl M` of :math:`\Gamma` in which :math:`A` is true?

The first two questions are equivalent. We
have not yet presented deductive systems for first-order logic, but we will do so in
:numref:`Chapter %s <chapter_proof_systems_for_first_order_logic>`, and by the soundness and
completeness theorems for those systems, we will have that a sentence :math:`A` is provable from
a set of hypotheses :math:`\Gamma` if and only if it is entailed by :math:`\Gamma`.
The third question is equivalent to the first two by translation: as with propositional logic,
we have that :math:`\Gamma \models A` if and only if :math:`\Gamma \cup \{ \lnot A \}` is
unsatisfiable. So questions about entailment are equivalent to questions about satisfiability,
negating the formula in question.

Whether or not a problem of the form above is decidable depends on :math:`\Gamma`. For example,
if :math:`\Gamma` is empty, the first question boils down to the question as to whether a sentence
:math:`A` is provable in first-order logic. If the language in question has a binary relation symbol
or two unary function symbols, the question is undecidable. In the special case where the language
has only unary predicate symbols and a single function symbol, the question is decidable.
Neither of these facts are easy to prove.

Interestingly, adding axioms to :math:`\Gamma` can make provability decidable. One way to think
of this is that doing so constrains the types of models that have to be considered.
Given a set of axioms :math:`\Gamma`, deciding whether or not provability is decidable
of often hard, and, for that reason, generally interesting. To show that the answer is "yes,"
one should describe an algorithm, and then there is the further question as to whether the
algorithm can be made to run efficiently on the specific questions we care about. A "no" answer
usually proceeds to showing that a solution would lead to a solution to the halting problem,
or another problem that has been shown to be reducible to it.

The second type question above asks about the truth of a sentence :math:`A` in all models of
:math:`\Gamma`, and the third type of question above asks about the truth of a sentence
:math:`A` in *some* model of :math:`\Gamma`. There are other types of decision procedures we
might consider. Fixing a model :math:`\mdl M`, we can ask:

- Given a first-order formula :math:`A`, possibly with free variables,
  is there a variable assignment :math:`\sigma` such that :math:`A` is true in :math:`\mdl M`
  under :math:`\sigma`, that is, :math:`\models_{\mdl M, \sigma} A`?
- Given a first-order formula :math:`A`, possibly with free variables,
  is :math:`A` is true in :math:`\mdl M` under every assignment :math:`\sigma`?

In the first case, :math:`A` is said to be *satisfiable* in :math:`\mdl M`, and in the second case,
it is said to be :math:`A` *valid* in :math:`\mdl M`. But be careful: the words are being used
in a slightly different sense than before, when satisfiable meant "true in some model" rather than
"true for some assignment" and valid meant "true in all models" rather than
"true for all assignments."

Remember that a sentence is a formula without free variables.
If :math:`A` is a sentence, both questions boil down to the question as to
whether :math:`A` is true in :math:`\mdl M`. Notice that if :math:`A` has free variables
:math:`x_1, x_2, \ldots, x_n`, then the following are all equivalent:

- :math:`A` is satisfiable in :math:`\mdl M`.
- :math:`\ex {x_1, x_2, \ldots, x_n} A` is true in :math:`\mdl M`.
- :math:`\fa {x_1, x_2, \ldots, x_n} \lnot A` is false in :math:`\mdl M`.
- :math:`\lnot A` is not valid in :math:`\mdl M`.

It is confusing that there are so many ways to ask the same question! But you have to get used
to it: logicians and computer scientists slip back and forth between the different ways of
thinking about a problem, sometimes even in the same sentence.

The relationship between the first group of questions, having to do with entailment and
satisfiability, and the second group of questions, having to do with truth in models, is
subtle. Sometimes, for a given model :math:`\mdl M`, there is a natural set of axioms :math:`\Gamma`
such that a sentence is true in :math:`\mdl M` if and only if it is provable from :math:`\Gamma`.
Sometimes, instead, one can show that there is *no* computable set of axioms that has this property.
Life is complicated! The set of all true sentences of a model :math:`\mdl M`
is called "the theory of :math:`\mdl M`, so we can express the last property
by saying that the theory of :math:`\mdl M` is not computable.

To muddy the waters even further, instead of asking questions about all first-order formulas,
we can consider restricted problems where we are only allowed to ask about formulas of a certain
kind. A formula is said to be *quantifier-free* if it has no quantifiers, *universal* if it consists
of any number of universal quantifiers :math:`\forall` (possibly none) followed by a
quantifier-free formula, and *existential* if it consists of any number of existential quantifiers
followed by a quantifier-free formula. In some cases, So we can ask, for a decision procedure for
the provability of universal formulas from a set of axioms or for the satisfiability of an
existential formula in a model, and in some cases, we may have a positive answer even though
the full problem is undecidable.

In this chapter, we will describe, in detail,
a decision procedure for the validity universal formulas in pure first-order logic, that is,
first-order logic without any axioms. We will also describe a decision procedure for the
satisfiability of quantifier-free formulas in the theory of linear equations and
inequalities in the real numbers. Finally, we will state some other important decidability and
undecidability results, to give you a fuller sense of the landscape.
In :numref:`Chapter %s <chapter_using_smt_solvers>`, we will consider SMT solvers,
whose main strength is that they are capable of combining decision procedures for the
quantifier-free parts of various theories and using them together effectively.


.. _section_equality:

Equality
--------

Fix a language, :math:`L`. We will consider equations :math:`s = t` and *disequations*
:math:`u \ne v` between closed terms.
The fact that we are considering closed terms mean that there are no variables to substitute
for; computer scientists sometimes call these *ground* terms.
(As with unification, in some contexts we may want to treat some variables as constant.
What is important here is not whether we call them variables or constants, but, rather,
the fact that we are not considering substitutions.)

The problem we are addressing here is this: given a set of equations and disequations, is
it satisfiable? Notice that here we are asking about satisfiability in *any* model. In particular,
the set

.. math::

  \{ s_1 = t_1, \ldots, s_n = t_n, u_1 \ne v_1, \ldots, u_m \ne v_m \}

is *unsatisfiable* if and only if we have

.. math::

  s_1 = t_1, \ldots, s_n = t_n \proves u_1 = v_1 \lor \cdots \lor u_m = v_m

So we can think of the problem in either way.

For example, consider the following set of sentences:

#. :math:`f(a, a) = b`
#. :math:`g(c, a) = c`
#. :math:`g(c, f(a, a)) = f(g(c, a), g(c, a))`
#. :math:`f(c, c) \ne g(c, b)`

Is it satisfiable?

Before we answer that, let's make some general observations. A set that only has equations and
no disequations is easily satisfiable, namely, in a model with a single element, where every
expression is equal to every other one. Similarly, a set that only has disequations is easily
satisfiable, unless one of the disequations is of the form :math:`t \ne t`.
For that purpose, we can use the term model, where every term is interpreted as itself.
The interesting cases fall in between these two extremes, where the equations and disequations
balance one another.

Coming back to the question, the following considerations show that the answer is "no."
Each of the following is a consequence of the equations above:

5. :math:`g(c, f(a, a)) = g(c, b)` from 1
6. :math:`f(g(c, a), g(c, a)) = f(c, c)` from 2
7. :math:`f(c, c) = g(c, b)` from 3, 5, and 6.

This contradicts the disequation 4 above. To understand what is going on, it is helpful
to think of :math:`f` as addition, :math:`g` as multiplication, :math:`a` as the number 1,
and :math:`b` as the number 2.
But the argument is fully abstract, and shows that the disequation cannot hold in any
model in which all the equations are satisfied.

These considerations encapsulate the main ideas behind the proof of the following theorem:

.. admonition:: Theorem

    The question as to whether a finite set of ground equations and disequations is satisfiable
    is decidable.

The idea behind the proof is to use a *saturation* argument: starting from the equations in
question, we derive new equations until no more equations are derivable.
If we manage to contradict one of the disequations, the original set is not satisfiable.
In the case where no contradiction is found, we will argue that the original set is satisfiable.

To make all this precise, we need a set of rules for deriving equations.

  .. raw:: html

      <div class="math notranslate nohighlight">
      \[\begin{prooftree}
      \AXC{$t = t$}
      \end{prooftree}
      \quad \quad
      \begin{prooftree}
      \AXC{$s = t$}
      \UIC{$t = s$}
      \end{prooftree}
      \quad \quad
      \begin{prooftree}
      \AXC{$r = s$}
      \AXC{$s = t$}
      \BIC{$r = t$}
      \end{prooftree}
      \quad\quad
      \begin{prooftree}
      \AXC{$s_1 = t_1$}
      \AXC{$\ldots$}
      \AXC{$s_n = t_n$}
      \TIC{$f(s_1, \ldots, s_n) = f(t_1, \ldots, t_n)$}
      \end{prooftree}
      \]</div>

  .. raw:: latex

      \begin{center}
      \AXC{$t = t$}
      \DP
      \quad \quad
      \AXC{$s = t$}
      \UIC{$t = s$}
      \DP
      \quad \quad
      \AXC{$r = s$}
      \AXC{$s = t$}
      \BIC{$r = s$}
      \DP
      \quad \quad
      \AXC{$s_1 = t_1$}
      \AXC{$\ldots$}
      \AXC{$s_n = t_n$}
      \TIC{$f(s_1, \ldots, s_n) = f(t_1, \ldots, t_n)$}
      \DP
      \end{center}

The first three rules express the reflexivity, symmetry, and transitivity of equality,
respectively.
The last rule is called the *congruence* rule.
You should convince yourself that using these rules we can derive

  .. raw:: html

      <div class="math notranslate nohighlight">
      \[\begin{prooftree}
      \AXC{$r = s$}
      \UIC{$t[r/x] = t[s/x]$}
      \end{prooftree}
      \]</div>

  .. raw:: latex

      \begin{center}
      \AXC{$r = s$}
      \UIC{$t[r/x] = t[s/x]$}
      \DP
      \end{center}


for any terms :math:`r`, :math:`s`, and :math:`t` and variable :math:`x`.

Returning to our proof plan, we want to show that if applying these rules successively
does not result in a contradiction, then there is a model in which the original equations
and disequations are all true.
But a problem arises: what if the original set contains an equation :math:`a = f(a)`?
Then our algorithm falls into an infinite loop, deriving
:math:`a = f(a) = f(f(a)) = f(f(f(a))) = \ldots`.
The solution is to restrict attention to *subterms* of terms appearing in the original
equations and disequations.
The theorem follows from the following lemma.

.. admonition:: Lemma

    Let :math:`\Gamma` consist of a set of equations and disequations.
    Let :math:`S` be the set of subterms of all the terms occurring in :math:`\Gamma`.
    Let :math:`\Gamma'` be the set of all equations between elements of :math:`S`
    that can be derived from the equations in :math:`\Gamma` using the rules above.
    Then :math:`\Gamma` is satisfiable if and only if no disequation in :math:`\Gamma`
    is the negation of an equation in :math:`\Gamma'`.

The algorithm implicit in this lemma is called *congruence closure*.

.. admonition:: Proof

    One direction of the lemma is easy. Since the equational rules preserve truth in any
    model, if we can derive a contradiction from the equations and disequations in :math:`\Gamma`,
    then :math:`\Gamma` is unsatisfiable. The other direction is harder.
    Since there are only finitely many pairs of terms in :math:`S`, the algorithm necessarily
    terminates.
    We need to show that if it terminates without deriving a contradiction, then there is a model
    that satisfies :math:`\Gamma`.

    Say two elements :math:`s` and :math:`t` are *equivalent*, written :math:`s \equiv t`,
    if they are proved equal from the
    equations in :math:`\Gamma`. The rules guarantee that this is an *equivalence relation*, which is
    to say, it is reflexive, symmetric, and transitive. It is also a *congruence*, which means that
    applying a function symbol to equivalent terms results in equivalent terms.

    To each element :math:`t`, we associate its *equivalence class* :math:`[t]`, defined by

    .. math::

        [t] = \{ s \in S \mid s \equiv t \}.

    In words, :math:`[t]` is the set of terms equivalent to :math:`t`.
    Assuming the algorithm terminates without a contradiction, define a model :math:`\mdl M`
    whose universe consists of all the equivalence classes of elements of :math:`S`
    together with a new element, :math:`\star`. For elements :math:`t_1, \ldots t_n` in :math:`S`,
    interpret each :math:`n`-ary function symbol :math:`f` by the function

    .. math::

        f^{\mdl M}([t_1], \ldots, [t_n]) = \begin{cases}
          [f(t_1, \ldots, t_n)] & \text{if $f(t_1, \ldots, t_n)$ is in $S$} \\
          \star & \text{otherwise}
        \end{cases}

    In other words, what :math:`f^{\mdl M}` does to each equivalence class is determined by what
    :math:`f` does to each of the elements.
    The fact that :math:`\equiv` is a congruence ensures that this makes sense.
    This is just a truncated version of the term model, in which provably equal terms
    are all glued together.

    It is not hard to show that for every term :math:`t` in :math:`S`,
    :math:`\tval{t}_{\mdl M}` is equal to  :math:`[t]`.
    But this is what we need. For every equation :math:`s = t` in :math:`\Gamma`, :math:`s` and :math:`t` are in the
    same equivalence class, so they are equal in the model.
    And if :math:`s` and :math:`t` are not provably equal, then :math:`[s]` and :math:`[t]` are not the same, so
    every disequation :math:`s \ne t` in :math:`\Gamma` is true in :math:`\mdl M` as well.

For examples of the algorithm in action, first let us show that the set

.. math::

  f^3(a) = a, \, f^5(a) = a, \, f(a) \ne a

is unsatisfiable, where :math:`f^n(a)` abbreviates :math:`n`-fold application :math:`f(f(\cdots f(a)))`.
The set of all subterms is

.. math::

  a, \, f(a), \, f^2(a), \, f^3(a), \, f^4(a), \, f^5(a).

We start with the equivalence classes :math:`\{ a, f^3(a) \}` and :math:`\{ a, f^5(a)\}`
as well as all the others subterms in singleton sets.
From :math:`a = f^3(a)` we derive :math:`f(a) = f^4(a)` by congruence, giving rise to the set
:math:`\{ f(a), f^4(a) \}`. Applying congruence again gives rise to the set :math:`\{ f^2(a), f^5(a) \}`,
which is merged with :math:`\{ a, f^5(a)\}` to yield :math:`\{ a, f^2(a), f^5(a) \}`.
Applying congruence again yields :math:`\{ f(a), f^3(a) \}`. (We ignore the term :math:`f^6(a)`.)
This is merged with the set :math:`\{ a, f^3(a) \}` to yield :math:`\{ a, f(a), f^3(a)\}`.
Applying congruence again yields :math:`\{ f(a), f^2(a), f^4(a)\}`, which is merged with
:math:`\{ a, f(a), f^3(a) \}` and :math:`\{ f^2(a), f^5(a) \}` to yield
:math:`\{ a, f(a), f^2(a), f^3(a), f^5(a) \}`.
At this point, we have derived :math:`f(a) = a`, contradicting the disequality in the original set.
So the set is unsatisfiable.

Suppose we start instead with the set

  .. math::

    f^2(a) = a, \, f^4(a) = a, \, f(a) \ne a, \, f(a) \ne b

You can check that in this case, the algorithm terminates with the following three
equivalence classes:

- :math:`[a] = \{ a, f^2(a), f^4(a)\}`
- :math:`[f(a)] = \{ f(a), f^3(a) \}`
- :math:`[b] = \{ b \}`.

We now construct a model :math:`\mdl M` with these elements and an additional element :math:`\star`, with

.. math::

    f^{\mdl M}([a]) & = [f(a)] \\
    f^{\mdl M}([f(a)]) & = [a] \\
    f^{\mdl M}([b]) & = \star \\
    f^{\mdl M}(\star) & = \star

You can check that this satisfies the original set of equations and disequations. We don't really need
to introduce the new element :math:`\star`; you can check that everything works if we replace it by
any of the equivalence classes. We simply find it clearer to use :math:`\star` as a catch-all for
everything that falls outside the scope of the finite set of terms we started with.

Our analysis establishes an interesting property of
first-order logic: it is possible to prove a disjunction :math:`u_1 = v_1 \lor \cdots \lor u_m = v_m`
from a set of equation if and only if it is possible to prove :math:`u_i = v_i` for some :math:`i`.
This is a property known as *convexity*. It relies on the fact that we allow only positive
equations on the right-hand side. For example, :math:`a = b \lor a \ne b` is provable in
first-order logic, but clearly neither disjunct is provable on its own.

We have described the algorithm as working on closed terms, that is, terms with no variables.
Of course, there is no harm if we allow variables in the terms and simply treat them as
constants. The point is that in this formulation of the problem, a hypothesis :math:`f(x) = a`
is interpreted as a statement about one particular :math:`x` that never changes in the
statement of a problem. It is an entirely different problem to consider hypotheses
like :math:`\fa x f(x) = a` for which we are allowed to substitute any term for :math:`x`.
For example, we might want to add axioms like :math:`\fa {x, y, z} (x + y) + z` and
:math:`\fa {x, y}. x + y = y + x` as axioms for the integers or real numbers. The problem of
determining whether a single equation follows from a set of universally
quantified equations is known as the *word problem*, and, in general it is undecidable.

Implementing congruence closure
-------------------------------

Congruence closuure can be implemented efficiently (and *is* implemented efficiently in SMT
solvers) using *union-find* data structures.


[This section needs to be written. In the meanwhile, see the `slides <https://www.cs.cmu.edu/~mheule/15311-s24/slides/congruence-closure.pdf>`_ and the file ``CongruenceClosure.lean``].

.. _deciding_universal_sentences:

Deciding universal sentences
----------------------------

Remember that the following problems are all intertranslatable:

- Given a universal sentence :math:`A`, is :math:`A` valid, that is, true in every model? (Equivalently:
  if :math:`A` provable?)
- Given a quantifier-free formula :math:`A`, does :math:`\models_{\mdl{M}, \sigma} A` hold for every
  model :math:`\mdl M` and every variable assignment :math:`\sigma`?
- Given a quantifier-free formula :math:`A`, does :math:`\models_{\mdl{M}, \sigma} A` hold for *some*
  model :math:`\mdl M` and variable assignment :math:`\sigma`?
- Given an existential sentence :math:`A`, is :math:`A` satisfiable, that is, true in some model?

In particular, a universal formula :math:`\fa {\vec x} A` is valid if and only if :math:`A` holds
for every model and every variable assignment, which happens if and only if :math:`\lnot A` never holds for any model and variable assignment, which happens if and only if :math:`\ex {\vec x} \lnot A` is
not satisfiable. Notice that in each case we are asking questions about *pure* first-order logic,
without any axioms :math:`\Gamma`.

Remember that in first-order logic, the atomic formulas include equations :math:`s = t` and formulas
:math:`R(t_1, \ldots, t_n)`, where :math:`R` is a relation symbol. The set of *literals* include the
negations of those as well. It is not hard to extend congruence closure to an algorithm to determine
the satisfiability of any finite set of literals. To start with, we add the following rule to our
equational proof system:

  .. raw:: html

      <div class="math notranslate nohighlight">
      \[
      \begin{prooftree}
      \AXC{$s_1 = t_1$}
      \AXC{$\ldots$}
      \AXC{$s_n = t_n$}
      \AXC{$R(s_1, \ldots, s_n)$}
      \QuaternaryInfC{$R(t_1, \ldots, t_n)$}
      \end{prooftree}
      \]</div>

  .. raw:: latex

      \begin{center}
      \AXC{$s_1 = t_1$}
      \AXC{$\ldots$}
      \AXC{$s_n = t_n$}
      \AXC{$R(s_1, \ldots, s_n)$}
      \QuaternaryInfC{$R(t_1, \ldots, t_n)$}
      \DP
      \end{center}

Suppose :math:`\Gamma` is a set of literals.
To test the satisfiability of :math:`\Gamma`, we do not have to change much in the previous algorithm.
Using the congruence rule for relations, whenever we have derived :math:`R(s_1, \ldots, s_n)`
and we have also derived equations :math:`s_i = t_i` for every :math:`i`,
we can conclude :math:`R(t_1, \ldots, t_n)`.
The algorithm terminates when we contradict a disequality or another negated atomic formula.
If the algorithm terminates without a contradiction, we build a model as before,
where we simply declare that :math:`R^{\mdl M}([t_1], \ldots, [t_n])` holds if and only if
we have determined that :math:`R(t_1, \ldots, t_n)` in a consequence of the original set.
Another way to think about the algorithm is that we can replace each atomic formula
:math:`R(t_1, \ldots, t_n)`
by an equation :math:`f_R(t_1, \ldots, t_n) = \top` and each negated atomic formula
:math:`\lnot R(t_1, \ldots, t_n)` by a disequation :math:`f_R(t_1, \ldots, t_n) \ne \top` and
run the usual congruence closure algorithm on that.

Now suppose we are given an existential sentence :math:`\ex {x_1, \ldots, x_n} A`
where :math:`A` is quantifier-free, and suppose we want to determine whether it is satisfiable.
Write :math:`A` in disjunctive normal form, that is, as a disjunction
:math:`A_1 \lor \cdots \lor A_n` of conjunctions of literals.
Then :math:`\ex {x_1, \ldots, x_n} A` is satisfiable if and only if one of the formulas
:math:`A_1, \ldots, A_n` is satisfied by some model :math:`\mdl M` and variable assignment
:math:`\sigma`. That reduces the task to determining whether a conjunction of literals is satisfiable,
and we have just explained how to do that.

Since a sentence is valid if and only if its negation is satisfiable, and since the negation
of a universal sentence is an existential sentence, we have shown the following.

.. admonition:: Theorem

  The validity of universal sentences in pure first-order logic is decidable.
  Equivalently, the satisfiability of existential sentences is decidable.


Linear arithmetic
-----------------

We now turn from questions having to do with satisfiability and validity in arbitrary
models to questions about satisfiability in a particular model, namely, the real numbers.
A *linear expression* is one of the form :math:`a_1 x_1 + a_2 x_2 + \cdots + a_n x_n + b`,
where each :math:`a_i` is a rational number, :math:`b` is a rational number,
and each :math:`x_i` is a variable.
We think of the variables :math:`x_i` as ranging over the real numbers.
A *linear constraint* is one of the form :math:`s = t` or :math:`s < t`, where
:math:`s` and :math:`t` are linear expressions. (In practice, we usually include constraints
of the form :math:`s \le t` and sometimes :math:`s \ne t` as well. But the first can be
written :math:`s < t` and the second can be written :math:`s < t \lor t < s`, so questions about
those can be rexpressed in terms of :math:`<` and :math:`=`, and focusing on those will simplify
the presentation below.)

Notice that any linear constraint is equivalent to one of the form :math:`t = 0` or :math:`t > 0`,
since we can move all the terms to one side. For example, the constraint :math:`3 x + 2 y < 3y + 4z`
is equivalent to :math:`-3x + y + 4z > 0`.
An important observation that we will use below is that any linear constraint that involves
a variable :math:`x` can be written as :math:`x = t`, :math:`x < t`, or :math:`t < x`,
where :math:`x` does not occur in :math:`t`.
We do this by simply solving for :math:`x`.
For example, the previous constraint can be expressed as :math:`x < (1/3)y + (4/3)z`.
Remember that dividing both sides of an inequality by a negative number reverses the direction.

In this section we say that a set :math:`\Gamma` of linear constraints is *satisfiable* if and only
if there is an assignment of real
values to the variables that makes them all true. Our first goal is to prove the following.

.. admonition:: Theorem

    The question as to whether a finite set of linear constraints is satisfiable is
    decidable.

.. admonition:: Proof

    We use induction on the number of variables. If there are no variables at all,
    :math:`\Gamma` contains only expressions of the form :math:`b_0 < b_1` or :math:`b_0 = b_1`
    where :math:`b_0` and :math:`b_1` are constants, and we only need to perform
    the comparisons to see whether they are true. Remember that if :math:`\Gamma`
    is the empty set, we take it to be trivially satisfied.

    In the inductive step, :math:`\Gamma` contains a variable.
    If :math:`\Gamma` contains any false constant equations, it is unsatisfiable,
    and it it contains any true constant equations, we can remove them without
    affecting satisfiability.
    If :math:`\Gamma` contains a nontrivial equation with a variable :math:`x`,
    we put
    it in the form :math:`x = t` and then substitute :math:`t` for :math:`x` everywhere.
    The resulting set of constraints has one fewer variable, and clearly
    it is equisatisfiable with the original one.
    Given an assignment to the new set of constraints, we just assign :math:`x`
    the value of :math:`t`.

    So we can now assume that there are no equations in :math:`\Gamma`.
    We can divide the inequalities in :math:`\Gamma` intro three kinds:

    - those that don't contain :math:`x` at all
    - those that can be expressed in the form :math:`s_i < x`
    - those that can be expressed in the form :math:`x < t_j`

    Let :math:`\Gamma'` be the set that results from removing the inequalities
    in the last two categories
    and replacing them with inequalities of the form :math:`s_i < t_j`.
    We claim :math:`\Gamma'` is equisatisfiable with :math:`\Gamma`.
    Clearly any assignment that satisfies :math:`\Gamma` also satisfies :math:`\Gamma'`.
    Conversely, suppose :math:`\sigma` is an assignment that satisfies :math:`\Gamma'`.
    Then, under that assignment, the value of each :math:`s_i` is less than the value
    of every :math:`t_j`. We obtain an assignment satisfying :math:`\Gamma`
    by mapping :math:`x` to any value between the largest :math:`s_i` and the
    smallest :math:`t_j`. (If one of the last two categories is empty, we
    remove the constraints in the other category entirely,
    since they can be satisfied by taking :math:`x` sufficiently large or sufficiently
    small.)

Implementing Fourier-Motzkin
----------------------------

The procedure implicit in this proof is known as the *Fourier-Motzkin* procedure.
The idea can be found in the work of Jean-Baptiste Joseph
Fourier in the early nineteenth century (the same Fourier who gave us Fourier
analysis), but it was rediscovered by multiple people in the nineteenth century, including Theo In the worst case, every elimination step divides the number of equations in half and
then squares it, resulting in doubly exponential behavior.
The procedure works well in practice, though, since in many applications each variable is
contained in only a few equations. (There are obvious heuristics, like choosing a variable
at each stage that minimizes the number of equations at the next stage.)
There is an implementation of the procedure in the file `FourierMotzkin.lean` in the `Examples`
folder,
modulo two components that we ask you to supply.
SMT solvers use much more efficient methods based on the simplex algorithm from linear programming.


A full decision procedure
-------------------------

We can describe the Fourier-Motzkin procedure more explicitly as a decision procedure
for satisfiability of existential sentences in a langauge for the real numbers as follows.
Suppose we are given a problem in linear arithmetic where the variables are labeled
:math:`x_1, x_2, \ldots, x_n` and the
constraints are labeled :math:`c_1, c_2, \ldots, c_m`. Then what we are really asking as to whether
the formula :math:`\ex {x_1, \ldots, x_n} c_1 \land c_2 \land \cdots \land c_m`
is true of the real numbers when the constraints are interpreted in the expected way.
To make this more precise, consider the structure :math:`(\mathbb R, 0, 1, +, <)`
in a language with symbols :math:`0`, :math:`1`, :math:`+`, and :math:`<`.
All the constraints can be expressed in this language, albeit in a clunky way. For example,
we can write :math:`3 x` as :math:`x + x + x`, and express a constraint like :math:`x -(1/2)y + (4/3)z < 0`
as :math:`6x + 8z < 3y`. Alternatively, we can add symbols for scalar multiplication to the language.

We now obtain a decision procedure for arbitrary existential formulas
:math:`\ex {x_1, \ldots, x_n} A` as follows.
Given a formula :math:`\ex x A`, put :math:`A` into negation normal form, so that all the negations
are pushed down to atomic formulas.
Replace :math:`\lnot (s < t)` by :math:`t < s \lor s = t`, and we can replace
:math:`s \ne t` by :math:`s < t \lor t < s`.
(In practice, it is more efficient to include :math:`\le` in the language as well, and use the
fact that :math:`\lnot (s \le t)` is equivalent to :math:`t < s`.)
Putting the result into disjunctive normal form, we can assume that all the atomic
formulas are of the form :math:`s < t` or :math:`s = t`.
We can move the existential quantifiers through the disjunction as we did in :numref:`deciding_universal_sentences` and then apply the Fourier-Motzkin procedure to
each disjunction.

In fact, a modification of the algorithm provides a decision procedure for the satisfiability
of *any* sentence in the language, not just the existential ones.

.. admonition:: Theorem

    The question as to whether a sentence :math:`A` is true in :math:`(\mathbb R, 0, 1, +, <, \le)`
    is decidable.

We will only sketch the details here.
The algorithm uses an important method known as "elimination of quantifiers."
The idea is to successively eliminate quantifiers, one by one, until we are left with a
quantifier-free sentence. We can determine the truth of that by simply calculating.

We will show that any formula :math:`\ex x A`, where :math:`A` is quantifier-free,
is equivalent to a quantifier-free formula :math:`A'` that does not include :math:`x`.
Repeating the process and using the fact that :math:`\fa x A` is equivalent to
:math:`\lnot \ex x \lnot A`, we can eliminate all the quantifiers. We are then
left with a quantifier-free sentence, that is, a boolean combination of equations and
inequalities between closed terms. We can decide the truth of that sentence by evaluating
the terms.

We have already seen all the ideas. The procedure above allows us to write
:math:`\ex x A` as :math:`(\ex x A_1) \lor (\ex x A_2) \lor \cdots \lor (\ex x A_n)`
where each :math:`A_i` is a conjunction of atomic formulas.
So we only need to show how to eliminate an existential quantifier from a conjunction
of constraints of the form :math:`s < t` or :math:`s = t`.
But that is exactly what the pivot step in the Fourier-Motzkin procedure does, and we are done.

It is possible to write down axioms that justify every step of the transformation.
The resulting set of axioms is known as the theory of *linear arithmetic*.
The argument shows that the resulting set of axioms characterizes the structure exactly,
and that the question of provability from those axioms is decidable.

You should also notice that the justification of the procedure only used the fact basic
facts about arithmetic on the real numbers, as well as the fact we can find a real number
between any other two. So the procedure works just the same way, and returns the same answer,
for other structures that satisfy these properties, like the rationals.
In other words, the structure :math:`(\mathbb Q, 0, 1, +, <, \le)` has exactly the same
theory as :math:`(\mathbb R, 0, 1, +, <, \le)`.


Other theories
--------------

What happens if we extend linear arithmetic by adding multiplication between
arbitrary terms? Formally, we are asking about the theory of the real numbers
:math:`(\mathbb{R}, 0, 1, +, \times, <)` with zero, one, addition,
multiplication, and the less-than relation. It is equivalent to extending
linear arithmetic by allowing atomic formulas :math:`p = 0` and :math:`p > 0` where :math:`p` is
an arbitrary polynomial. The theory, known also as the theory of *Real closed
fields*, is still decidable. The theorem was proved by Alfred Tarski before World War II,
but it wasn't published until 1948, after the war.

Returning to the language without multiplication, one can ask what happens if we replace
the real numbers by the integers.
In other words, we can ask whether the truth of sentences in the structure
:math:`(\mathbb Z, 0, 1, +, <)` is decidable.
In contrast to the reals, the order on the integers is *discrete*, since
there is nothing between a value :math:`x` and :math:`x + 1`.
The problem is nonetheless decidable.
The result was first proved  in 1926 by Mojżesz Presburger, a student of Tarski's,
who later died in the Holocaust. The story has it that Tarski did not think
the result was enough for a dissertation, and made him do more work.
The resulting theory is known as *Presburger arithmetic* or *linear integer arithmetic*.

The decision procedure is more complicated than that for linear arithmetic,
and we will not discuss it here.
SMT solvers, however, use efficient implementations of the *existential fragment*
of the theory, which is to say, the satisfiability problem for quantifier-free formulas.

What happens if we add multiplication?
In contrast to the case with the real numbers, however, the theory of the integers
with addition and multiplication is undecidable.
In other words, there is no algorithm to decide truth in the model
:math:`(\mathbb Z, 0, 1, +, \times)`.
This follows from the methods that Gödel used to prove the incompleteness theorems,
and it is also a consequence of *Tarski's theorem* on the undefinability of truth.
The phenomena can be stated in very strong terms: no theory or structure in which one
can interpret a small amount of arithmetic with addition and multiplication is decidable.