Mathematical Proof and the Principles of Mathematics/Logic/The universal quantifier

We've introduced the idea of quantifiers in general, and the universal quantifier in particular. Before going on to the second quantifier we'll give the rules of inference for the first one.

Proving a universal edit

To prove a statement of the form

For all  ,  

first ask the reader to pick an arbitrary constant  , then prove the statement  . The idea is that, since the constant   was chosen at random, with no assumptions other than that it's an object in the universe of discourse, the proof of   is valid hold no matter the choice of  . Therefore   is true for all  

Note that for a proof if implication, the reader is asked to make an assumption, then you as the prover must derive a conclusion. Similarly, in the proof of a universal, the reader is asked to do something, namely pick a constant, then you as the prover must derive a conclusion. So we'll state the rule of inference in a way similar to the way the first rule inference for implication is stated.

If by choosing   as an arbitrary constant one can derive  , then deduce
For all  ,  

With this in mind, we'll make the structure of the proof of a universal will be

Line Statement Justification
1 Choose   Arbitrary constant
(something)
n   ?
n+1 For all  ,   From 1 and n

The indentation is used to show that the letter a introduced on line 1 has no meaning outside the range 1 through n.

(It should be noted here that most books on logic follow a different convention. The rule is often stated as

From   deduce
For all  ,  .

where   is an unbound variable in   and subject to restrictions on where   appears elsewhere in the proof. This may be a valid form of reasoning, but it's not the way a proof is normally written in a mathematical context. For one thing, the expression  , since it has an unbound variable, is really a predicate, not a statement. But proofs in mathematics, aside from the occasional imperative such as "Assume ... ," contain only statements. The rule also requires the distinction between bound and unbound variables. But we're avoiding that distinction and focusing on the difference between statements and predicates instead.)

Example proof #1 edit

We use the rule of inference above to prove:

Prop. 1: For all  , (  implies  ).

First, lay out the structure of the proof as above.

Line Statement Justification
1 Choose   Arbitrary constant
(something)
n   implies   ?
n+1 For all  , (  implies  ) From 1 and n

The task now is to prove   implies  , but   is a statement so we can apply the previously proven proposition

  implies  .

So the proof can be completed as

Line Statement Justification
1 Choose   Arbitrary constant
2   implies   Prop. 1 on Direct proofs for implication.
3 For all  , (  implies  ) From 1 and 2

In prose form this becomes:

Prop. 1: For all  , (  implies  ).
Proof: Let   be arbitrary. Then   implies   by Prop. 1 on Direct proofs for implication. Therefore   implies   for all  .

Using a universal edit

The rule for using a universal quantifier is relatively simple.

If   is a constant then from
For all  ,  
deduce
 .

Basically this says that if   holds for all   then it holds for a particular value,  , of  . Note that this is only valid when   is defined in the current scope.

Example proof #2 edit

We now have all the rules of inference needed to proof statements involving universal quantifiers, so let's put them to work with another example. The classical syllogism

All people are mortal.
Socrates is a person.
Therefore Socrates is mortal.

may be restated in our notation as

For all  , (  implies  ).
 
Therefore  

where   is the predicate   is a person,   is the predicate  , and   is the constant Socrates. This is supposed to be a syllogism, in other words the conclusion is supposed to be valid for any  ,   and  , as long as the first two statements are valid. This amounts to

Prop. 2: For all  , (  implies  ) and   implies  

This is an implication, so start with the standard outline for a direct proof.

Line Statement Justification
1 For all  , (  implies  ) and   Hypothesis
(something)
n   ?
n+1 For all  , (  implies  ) and   implies   From 1 and n

It's probably a good idea to break up the 'and' into separate statements.

Line Statement Justification
1 For all  , (  implies  ) and   Hypothesis
2 For all  , (  implies  ) From 1
3   From 1
(something)
n   ?
n+1 For all  , (  implies  ) and   implies   From 1 and n

We have   and need to derive  , so something like   implies   will do the trick. But we can get that by applying s to the universal quantifier. Filling in the details gives:

Line Statement Justification
1 For all  , (  implies  ) and   Hypothesis
2 For all  , (  implies  ) From 1
3   From 1
4   implies   From 2
5   From 2 and 4
6 For all  , (  implies  ) and   implies   From 1 and 5

Translating categorical propositions edit

Historically, logic dealt with categorical propositions; these are statements that relate two predicates in specific ways. There are four types:

All P are Q.
No P are Q.
Some P are Q.
Some P are not Q.

The first type, which we've already seen in the previous section, becomes

For all  , (  implies  ).

in our notation. The second type may be rephrased as

All P are not Q.

So in our notation it becomes

For all  , (  implies not  ).

Note that we have from propositional logic

  implies not   iff   implies not  .

We leave it as an exercise to prove

No P are Q iff No Q are P.

Now think about what it would mean for the first type

All P are Q.

to be False. There would need to be some object which is a P but not a Q. In other words, the statement of the fourth type

Some P are not Q.

would have to be True. On the other hand, if

Some P are not Q.

is False, then there are no P which are not Q, or put another way,

All P are Q.

So the fourth statement

Some P are not Q.

can be translated as

Not (all P are Q).

or

Not (for all  , (  implies  )).

Similarly, the third statement

Some P are Q.

Can be translated as

Not (no P are Q).

or

Not (for all  , (  implies not  )).

We'll introduce another quantifier in the next page which will make these expressions more tractable. In the mean time, we leave it as an exercise to translate two of the categorical syllogisms

All P are Q.
All Q are R.
Therefore all P are R.

and

All P are Q.
No R are Q.
Therefore No P are R.

into our notation and prove them using the rules of inference we've given up to now.

You may have noticed that results our attempts to translate categorical propositions to our notation have been both more verbose and less like natural language than the originals. So you might well wonder what is the advantage of our notation. One advantage is that our notation is expressive enough to include all mathematical statements while categorical propositions alone are too restrictive. Secondly, the categorical syllogisms do not cover all the valid forms of reasoning which are needed to prove theorems. Consider

All triangles and rectangles are rectilinear figures.
All squares are rectangles with all sides equal.
Therefore all squares are rectilinear figures.

This seems to be a valid syllogism, but since the premises both involve three predicates instead of two, it's not one of the categorical syllogisms. In addition, categorical propositions deal only with predicates and not relations, and it would be impossible to do much mathematics with predicates only.