Exercise statement
Prove Proposition 2.2.14.
Proposition 2.2.14 (Strong principle of induction). Let
be a natural number, and let
be a property pertaining to an arbitrary natural number
. Suppose that for each
, we have the following implication: if
is true for all natural numbers
, then
is also true. (In particular, this means that
is true, since in this case the hypothesis is vacuous.) Then we can conclude that
is true for all natural numbers
.
Hints
- Define
to be the property that
is true for all
; note that
is vacuously true when
.
How to think about the exercise
My first reaction to reading the proposition is something like “What on earth is going on here?”, and if that’s your reaction too, don’t worry! Even just understanding what we are trying to prove can be a challenge sometimes. In this case, the proposition has a nested “if … then …” clause and multiple variables (
) that make it hard to tell what is going on. In cases like this, we can first ask, is there a simpler statement we can try to prove instead that could still give us the main idea for the full solution? One idea is to take
; in the proposition
just a constant that is fixed throughout, and acts as the “base case” of the strong induction. Picking
as the constant also cleans up the inequality
because every natural number is greater than or equal to zero, so we can simply omit those conditions too.
So by setting
and cleaning things up a bit, what we are trying to show is something like this: Let
be a property pertaining to an arbitrary natural number
. Suppose that for each
, we have the following implication: if
is true for all natural numbers
, then
is also true. Then we can conclude that
is true for all natural numbers
.
Okay, since we are proving a version of strong induction, and in induction we start at some base case
, a natural question to ask now is, can we prove that
is true? The special property of
we are given in this case is that if
is true for all
, then
is also true. But there is no natural number
such that
, so it is vacuously true that
is true for all
. So we see that
is true.
Now can we prove that
is true? The special property of
we are given in this case is that if
is true for all
, then
is also true. But the only natural number
such that
is the number
, and we already know that
is true. So we have
as well.
Can we now prove that
is true? The special property of
we are given in this case is that if
is true for all
, then
is also true. What are the numbers less than
? That would be
and
, and we know both
and
are true. So
is also true.
I hope you see the pattern now. In the general case, we are trying to prove that
is true. The special property of
we are given is that if
is true for all natural numbers
, then
is also true. But by the previous rounds of the proof, we know that
are all true, and
are the only natural numbers less than
. So
is true as well.
So that’s the basic idea. The argument above wasn’t rigorous because (1) we used notation like
and
even though subtraction hasn’t been defined in the book yet, and (2) we just said “by the previous rounds of the proof” and didn’t actually give all the previous rounds of the proof. We need to fill in those previous rounds in a rigorous way, which is where (ordinary) induction comes in.
So now how do we set up the induction? The obvious statement to try to prove by induction is
. We already showed that
is true above, which completes the base case. So if we can prove that
implies
, then we’ll be done. So suppose
is true. The special property of
we are given is that if
is true for all natural numbers
, then
is also true. What are the numbers less than
? That would be
. Now we see the problem. We only know that
is true; we don’t know that
are true, since those statements weren’t part of our inductive hypothesis. So we can’t make use of the special property of
, and we are stuck.
But the nice thing about trying the obvious thing and hitting a dead end is that this can give you insight into the problem and help you reach an actual solution. This is true here. If we think about the reason we couldn’t make use of the special property of
, it was that we only knew
as part of our inductive hypothesis, whereas we wanted to know all of
. So could we make the conjunction “
and
and
and
and
and
” appear as our inductive hypothesis? Yes, we can! We can change the statement we prove by induction from
to “
and
and
and
and
and
”. To get rid of the “
”, we can state this instead as “
is true for all
”. We can call this statement
.
Now let’s see if we can prove
is true for all natural numbers
by induction. For the base case, we have to show
, which is the statement that
is true for all
. But, since the only natural number less than or equal to
is
itself, this is just the assertion that
is true. And we already know that
is true from our discussion above. Now suppose inductively that
is true. We want to show that
is true. The statement
is the assertion that
is true for all
, and the statement
is the assertion that
is true for all
. In more intuitive terms, this means that we know
are all true, and we want to show that
are all true. The only new statement to prove is the claim that
is true, so if we can show just that new statement, then we’ll be done. But the special property of
we are given is that if
is true for all natural numbers
, then
is also true. And we already know that
is true for all natural numbers
(that’s just the claim that
are all true). So we get
, and that closes our induction. Since
is true for all
, this means in particular that
is also true for all
.
The statement
that we used above is almost the same as the property
that is suggested in the hint, but slightly different in that it includes the endpoint
. It doesn’t really matter which of
or
you use. I found
a bit more natural above, but the model solution below uses
as suggested in the hint.
To recap some of the problem-solving strategies I employed above:
- If the full goal is too complicated, try to simplify by e.g. proving a special case. Here, we used
.
- If you don’t know which statement to prove by induction, just pick the obvious one and see how far you get; sometimes a “wrong” approach is a necessary stepping stone to finding the right approach.
- Quantified statements like “
is true for all
” can be tricky for your brain to understand, so just write “
and
and
and
and
and
” instead when you are still figuring out how to approach the proof.
I want to say more, but I’m feeling tired after writing up the proof, so here are some quick comments, mostly intended as notes to myself for when I return to this exercise later:
- I should also say something about why this is called strong induction. (basically the hypothesis contains more assumptions, which makes the statement stronger)
- I should give an example of a problem where using strong induction makes the proof go through but where using regular induction makes it impossible.
- In the proof below I showed at one point that
. This is like splitting by the cases
and
(where the latter ends in contradiction so we don’t need to worry about it). There is an alternative way to split by cases, which is
and
. In the latter case,
is automatically true as Tao writes in the hint. In the former case, we have to further break into
(use
) or
(so we want to show
, but this is vacuously true).
- There are a lot of questions on math SE asking about this exercise. It would be nice if I can go through them in detail at some point to make sure that this post covers all of the common points of confusion.
Model solution 1
Let
to be the property that
is true for all
. Given this definition of
, we can restate the hypothesis for
as follows: for each
, if
is true then
is also true. To go a little more slowly, we know from the statement of Proposition 2.2.14 that
has the following property:
- For each
: if
is true for all
then
is also true.
We can use
to rewrite the above bullet point as follows:
- For each
: if
is true then
is also true.
We will prove by induction that
is true for all natural numbers
.
We first want to show
, which says that
is true for all
. This is vacuously true since there is no natural number strictly less than zero:
means
and
. So we have some natural number
such that
, but this means
by Corollary 2.2.9, which contradicts the fact that
. So
is true for all such numbers, because none exist.
Now suppose inductively that we have shown
is true. We want to show that
is true, i.e. we want to show that
is true for all
. So let
be a natural number and suppose
. Our goal is to show that
is true.
Since
, we claim that either
or
. This is because
implies
by Proposition 2.2.12(e) which implies
by Proposition 2.2.12(d). We know
or
; in the latter case we thus have
, so
. Thus we see that either
or
.
Since
is true, we know that
is true for all
. Thus in the case where
, we already know that
is true.
To complete the proof we must show that
is true in the case
, i.e. we must show that
is true. Since
, by transitivity of order
, so
which means
. Since
we can use the hypothesis for
, which says that if
is true then
is also true. Since
is true by inductive hypothesis, we see that
. But since
in this case, we have
as desired.
This completes the induction, and we have that
is true for all natural numbers
. Our original goal was to show that
is true for all natural numbers
. So let
. Then we know that
is true. Since
, we see that
is true as required.
Model solution 2