The Foundations of Science: Science and Hypothesis, The Value of Science, Science and Method

CHAPTER I

Chapter 253,669 wordsPublic domain

ON THE NATURE OF MATHEMATICAL REASONING

The very possibility of the science of mathematics seems an insoluble contradiction. If this science is deductive only in appearance, whence does it derive that perfect rigor no one dreams of doubting? If, on the contrary, all the propositions it enunciates can be deduced one from another by the rules of formal logic, why is not mathematics reduced to an immense tautology? The syllogism can teach us nothing essentially new, and, if everything is to spring from the principle of identity, everything should be capable of being reduced to it. Shall we then admit that the enunciations of all those theorems which fill so many volumes are nothing but devious ways of saying _A_ is _A_?

Without doubt, we can go back to the axioms, which are at the source of all these reasonings. If we decide that these can not be reduced to the principle of contradiction, if still less we see in them experimental facts which could not partake of mathematical necessity, we have yet the resource of classing them among synthetic _a priori_ judgments. This is not to solve the difficulty, but only to baptize it; and even if the nature of synthetic judgments were for us no mystery, the contradiction would not have disappeared, it would only have moved back; syllogistic reasoning remains incapable of adding anything to the data given it: these data reduce themselves to a few axioms, and we should find nothing else in the conclusions.

No theorem could be new if no new axiom intervened in its demonstration; reasoning could give us only the immediately evident verities borrowed from direct intuition; it would be only an intermediary parasite, and therefore should we not have good reason to ask whether the whole syllogistic apparatus did not serve solely to disguise our borrowing?

The contradiction will strike us the more if we open any book on mathematics; on every page the author will announce his intention of generalizing some proposition already known. Does the mathematical method proceed from the particular to the general, and, if so, how then can it be called deductive?

If finally the science of number were purely analytic, or could be analytically derived from a small number of synthetic judgments, it seems that a mind sufficiently powerful could at a glance perceive all its truths; nay more, we might even hope that some day one would invent to express them a language sufficiently simple to have them appear self-evident to an ordinary intelligence.

If we refuse to admit these consequences, it must be conceded that mathematical reasoning has of itself a sort of creative virtue and consequently differs from the syllogism.

The difference must even be profound. We shall not, for example, find the key to the mystery in the frequent use of that rule according to which one and the same uniform operation applied to two equal numbers will give identical results.

All these modes of reasoning, whether or not they be reducible to the syllogism properly so called, retain the analytic character, and just because of that are powerless.

The discussion is old; Leibnitz tried to prove 2 and 2 make 4; let us look a moment at his demonstration.

I will suppose the number 1 defined and also the operation _x_ + 1 which consists in adding unity to a given number _x_.

These definitions, whatever they be, do not enter into the course of the reasoning.

I define then the numbers 2, 3 and 4 by the equalities

(1) 1 + 1 = 2; (2) 2 + 1 = 3; (3) 3 + 1 = 4.

In the same way, I define the operation _x_ + 2 by the relation:

(4) _x_ + 2 = (_x_ + 1) + 1.

That presupposed, we have

2 + 1 + 1 = 3 + 1 (Definition 2), 3 + 1 = 4 (Definition 3), 2 + 2 = (2 + 1) + 1 (Definition 4),

whence

2 + 2 = 4 Q.E.D.

It can not be denied that this reasoning is purely analytic. But ask any mathematician: 'That is not a demonstration properly so called,' he will say to you: 'that is a verification.' We have confined ourselves to comparing two purely conventional definitions and have ascertained their identity; we have learned nothing new. _Verification_ differs from true demonstration precisely because it is purely analytic and because it is sterile. It is sterile because the conclusion is nothing but the premises translated into another language. On the contrary, true demonstration is fruitful because the conclusion here is in a sense more general than the premises.

The equality 2 + 2 = 4 is thus susceptible of a verification only because it is particular. Every particular enunciation in mathematics can always be verified in this same way. But if mathematics could be reduced to a series of such verifications, it would not be a science. So a chess-player, for example, does not create a science in winning a game. There is no science apart from the general.

It may even be said the very object of the exact sciences is to spare us these direct verifications.

III

Let us, therefore, see the geometer at work and seek to catch his process.

The task is not without difficulty; it does not suffice to open a work at random and analyze any demonstration in it.

We must first exclude geometry, where the question is complicated by arduous problems relative to the rôle of the postulates, to the nature and the origin of the notion of space. For analogous reasons we can not turn to the infinitesimal analysis. We must seek mathematical thought where it has remained pure, that is, in arithmetic.

A choice still is necessary; in the higher parts of the theory of numbers, the primitive mathematical notions have already undergone an elaboration so profound that it becomes difficult to analyze them.

It is, therefore, at the beginning of arithmetic that we must expect to find the explanation we seek, but it happens that precisely in the demonstration of the most elementary theorems the authors of the classic treatises have shown the least precision and rigor. We must not impute this to them as a crime; they have yielded to a necessity; beginners are not prepared for real mathematical rigor; they would see in it only useless and irksome subtleties; it would be a waste of time to try prematurely to make them more exacting; they must pass over rapidly, but without skipping stations, the road traversed slowly by the founders of the science.

Why is so long a preparation necessary to become habituated to this perfect rigor, which, it would seem, should naturally impress itself upon all good minds? This is a logical and psychological problem well worthy of study.

But we shall not take it up; it is foreign to our purpose; all I wish to insist on is that, not to fail of our purpose, we must recast the demonstrations of the most elementary theorems and give them, not the crude form in which they are left, so as not to harass beginners, but the form that will satisfy a skilled geometer.

DEFINITION OF ADDITION.--I suppose already defined the operation _x_ + 1, which consists in adding the number 1 to a given number _x_.

This definition, whatever it be, does not enter into our subsequent reasoning.

We now have to define the operation _x_ + _a_, which consists in adding the number _a_ to a given number _x_.

Supposing we have defined the operation

_x_ + (_a_ - 1),

the operation _x_ + _a_ will be defined by the equality

(1) _x_ + _a_ = [_x_ + (_a_ - 1)] + 1.

We shall know then what _x + a_ is when we know what _x_ + (_a_ - 1) is, and as I have supposed that to start with we knew what _x_ + 1 is, we can define successively and 'by recurrence' the operations _x_ + 2, _x_ + 3, etc.

This definition deserves a moment's attention; it is of a particular nature which already distinguishes it from the purely logical definition; the equality (1) contains an infinity of distinct definitions, each having a meaning only when one knows the preceding.

PROPERTIES OF ADDITION.--_Associativity._--I say that

_a_ + (_b_ + _c_) = (_a_ + _b_) + _c_.

In fact the theorem is true for _c_ = 1; it is then written

_a_ + (_b_ + 1) = (_a_ + _b_) + 1,

which, apart from the difference of notation, is nothing but the equality (1), by which I have just defined addition.

Supposing the theorem true for _c_ = [gamma], I say it will be true for _c_ = [gamma] + 1.

In fact, supposing

(_a_ + _b_) + [gamma] = _a_ + (_b_ + [gamma]),

it follows that

[(_a_ + _b_) + [gamma]] + 1 = [_a_ + (_b_ + [gamma])] + 1

or by definition (1)

(_a_ + _b_) + ([gamma] + 1) = _a_ + (_b_ + [gamma] + 1) = _a_ + [_b_ + ([gamma] + 1)],

which shows, by a series of purely analytic deductions, that the theorem is true for [gamma] + 1.

Being true for _c_ = 1, we thus see successively that so it is for _c_ = 2, for _c_ = 3, etc.

_Commutativity._--1º I say that

_a_ + 1 = 1 + _a_.

The theorem is evidently true for _a_ = 1; we can _verify_ by purely analytic reasoning that if it is true for _a_ = [gamma] it will be true for _a_ = [gamma] + 1; for then

([gamma] + 1) + 1 = (1 + [gamma]) + 1 = 1 + ([gamma] + 1);

now it is true for _a_ = 1, therefore it will be true for _a_ = 2, for _a_ = 3, etc., which is expressed by saying that the enunciated proposition is demonstrated by recurrence.

2º I say that

_a_ + _b_ = _b_ + _a_.

The theorem has just been demonstrated for _b_ = 1; it can be verified analytically that if it is true for _b_ = [beta], it will be true for _b_ = [beta] + 1.

The proposition is therefore established by recurrence.

DEFINITION OF MULTIPLICATION.--We shall define multiplication by the equalities.

(1) _a_ × 1 = _a_.

(2) _a_ × _b_ = [_a_ × (_b_ - 1)] + _a_.

Like equality (1), equality (2) contains an infinity of definitions; having defined a × 1, it enables us to define successively: _a_ × 2, _a_ × 3, etc.

PROPERTIES OF MULTIPLICATION.--_Distributivity._--I say that

(_a_ + _b_) × _c_ = (_a_ × _c_) + (_b_ × _c_).

We verify analytically that the equality is true for _c_ = 1; then that if the theorem is true for _c_ = [gamma], it will be true for _c_ = [gamma] + 1.

The proposition is, therefore, demonstrated by recurrence.

_Commutativity._--1º I say that

_a_ × 1 = 1 × _a_.

The theorem is evident for _a_ = 1.

We verify analytically that if it is true for _a_ = [alpha], it will be true for _a_ = [alpha] + 1.

2º I say that

_a_ × _b_ = _b_ × _a_.

The theorem has just been proven for _b_ = 1. We could verify analytically that if it is true for _b_ = [beta], it will be true for _b_ = [beta] + 1.

Here I stop this monotonous series of reasonings. But this very monotony has the better brought out the procedure which is uniform and is met again at each step.

This procedure is the demonstration by recurrence. We first establish a theorem for _n_ = 1; then we show that if it is true of _n_ - 1, it is true of _n_, and thence conclude that it is true for all the whole numbers.

We have just seen how it may be used to demonstrate the rules of addition and multiplication, that is to say, the rules of the algebraic calculus; this calculus is an instrument of transformation, which lends itself to many more differing combinations than does the simple syllogism; but it is still an instrument purely analytic, and incapable of teaching us anything new. If mathematics had no other instrument, it would therefore be forthwith arrested in its development; but it has recourse anew to the same procedure, that is, to reasoning by recurrence, and it is able to continue its forward march.

If we look closely, at every step we meet again this mode of reasoning, either in the simple form we have just given it, or under a form more or less modified.

Here then we have the mathematical reasoning _par excellence_, and we must examine it more closely.

The essential characteristic of reasoning by recurrence is that it contains, condensed, so to speak, in a single formula, an infinity of syllogisms.

That this may the better be seen, I will state one after another these syllogisms which are, if you will allow me the expression, arranged in 'cascade.'

These are of course hypothetical syllogisms.

The theorem is true of the number 1.

Now, if it is true of 1, it is true of 2.

Therefore it is true of 2.

Now, if it is true of 2, it is true of 3.

Therefore it is true of 3, and so on.

We see that the conclusion of each syllogism serves as minor to the following.

Furthermore the majors of all our syllogisms can be reduced to a single formula.

If the theorem is true of _n_ - 1, so it is of _n_.

We see, then, that in reasoning by recurrence we confine ourselves to stating the minor of the first syllogism, and the general formula which contains as particular cases all the majors.

This never-ending series of syllogisms is thus reduced to a phrase of a few lines.

It is now easy to comprehend why every particular consequence of a theorem can, as I have explained above, be verified by purely analytic procedures.

If instead of showing that our theorem is true of all numbers, we only wish to show it true of the number 6, for example, it will suffice for us to establish the first 5 syllogisms of our cascade; 9 would be necessary if we wished to prove the theorem for the number 10; more would be needed for a larger number; but, however great this number might be, we should always end by reaching it, and the analytic verification would be possible.

And yet, however far we thus might go, we could never rise to the general theorem, applicable to all numbers, which alone can be the object of science. To reach this, an infinity of syllogisms would be necessary; it would be necessary to overleap an abyss that the patience of the analyst, restricted to the resources of formal logic alone, never could fill up.

I asked at the outset why one could not conceive of a mind sufficiently powerful to perceive at a glance the whole body of mathematical truths.

The answer is now easy; a chess-player is able to combine four moves, five moves, in advance, but, however extraordinary he may be, he will never prepare more than a finite number of them; if he applies his faculties to arithmetic, he will not be able to perceive its general truths by a single direct intuition; to arrive at the smallest theorem he can not dispense with the aid of reasoning by recurrence, for this is an instrument which enables us to pass from the finite to the infinite.

This instrument is always useful, for, allowing us to overleap at a bound as many stages as we wish, it spares us verifications, long, irksome and monotonous, which would quickly become impracticable. But it becomes indispensable as soon as we aim at the general theorem, to which analytic verification would bring us continually nearer without ever enabling us to reach it.

In this domain of arithmetic, we may think ourselves very far from the infinitesimal analysis, and yet, as we have just seen, the idea of the mathematical infinite already plays a preponderant rôle, and without it there would be no science, because there would be nothing general.

The judgment on which reasoning by recurrence rests can be put under other forms; we may say, for example, that in an infinite collection of different whole numbers there is always one which is less than all the others.

We can easily pass from one enunciation to the other and thus get the illusion of having demonstrated the legitimacy of reasoning by recurrence. But we shall always be arrested, we shall always arrive at an undemonstrable axiom which will be in reality only the proposition to be proved translated into another language.

We can not therefore escape the conclusion that the rule of reasoning by recurrence is irreducible to the principle of contradiction.

Neither can this rule come to us from experience; experience could teach us that the rule is true for the first ten or hundred numbers; for example, it can not attain to the indefinite series of numbers, but only to a portion of this series, more or less long but always limited.

Now if it were only a question of that, the principle of contradiction would suffice; it would always allow of our developing as many syllogisms as we wished; it is only when it is a question of including an infinity of them in a single formula, it is only before the infinite that this principle fails, and there too, experience becomes powerless. This rule, inaccessible to analytic demonstration and to experience, is the veritable type of the synthetic _a priori_ judgment. On the other hand, we can not think of seeing in it a convention, as in some of the postulates of geometry.

Why then does this judgment force itself upon us with an irresistible evidence? It is because it is only the affirmation of the power of the mind which knows itself capable of conceiving the indefinite repetition of the same act when once this act is possible. The mind has a direct intuition of this power, and experience can only give occasion for using it and thereby becoming conscious of it.

But, one will say, if raw experience can not legitimatize reasoning by recurrence, is it so of experiment aided by induction? We see successively that a theorem is true of the number 1, of the number 2, of the number 3 and so on; the law is evident, we say, and it has the same warranty as every physical law based on observations, whose number is very great but limited.

Here is, it must be admitted, a striking analogy with the usual procedures of induction. But there is an essential difference. Induction applied to the physical sciences is always uncertain, because it rests on the belief in a general order of the universe, an order outside of us. Mathematical induction, that is, demonstration by recurrence, on the contrary, imposes itself necessarily because it is only the affirmation of a property of the mind itself.

VII

Mathematicians, as I have said before, always endeavor to _generalize_ the propositions they have obtained, and, to seek no other example, we have just proved the equality:

_a_ + 1 = 1 + _a_

and afterwards used it to establish the equality

_a_ + _b_ = _b_ + _a_

which is manifestly more general.

Mathematics can, therefore, like the other sciences, proceed from the particular to the general.

This is a fact which would have appeared incomprehensible to us at the outset of this study, but which is no longer mysterious to us, since we have ascertained the analogies between demonstration by recurrence and ordinary induction.

Without doubt recurrent reasoning in mathematics and inductive reasoning in physics rest on different foundations, but their march is parallel, they advance in the same sense, that is to say, from the particular to the general.

Let us examine the case a little more closely.

To demonstrate the equality

_a_ + 2 = 2 + _a_

it suffices to twice apply the rule

(1) _a_ + 1 = 1 + _a_

and write

(2) _a_ + 2 = _a_ + 1 + 1 = 1 + _a_ + 1 = 1 + 1 + _a_ = 2 + _a_.

The equality (2) thus deduced in purely analytic way from the equality (1) is, however, not simply a particular ease of it; it is something quite different.

We can not therefore even say that in the really analytic and deductive part of mathematical reasoning we proceed from the general to the particular in the ordinary sense of the word.

The two members of the equality (2) are simply combinations more complicated than the two members of the equality (1), and analysis only serves to separate the elements which enter into these combinations and to study their relations.

Mathematicians proceed therefore 'by construction,' they 'construct' combinations more and more complicated. Coming back then by the analysis of these combinations, of these aggregates, so to speak, to their primitive elements, they perceive the relations of these elements and from them deduce the relations of the aggregates themselves.

This is a purely analytical proceeding, but it is not, however, a proceeding from the general to the particular, because evidently the aggregates can not be regarded as more particular than their elements.

Great importance, and justly, has been attached to this procedure of 'construction,' and some have tried to see in it the necessary and sufficient condition for the progress of the exact sciences.

Necessary, without doubt; but sufficient, no.

For a construction to be useful and not a vain toil for the mind, that it may serve as stepping-stone to one wishing to mount, it must first of all possess a sort of unity enabling us to see in it something besides the juxtaposition of its elements.

Or, more exactly, there must be some advantage in considering the construction rather than its elements themselves.

What can this advantage be?

Why reason on a polygon, for instance, which is always decomposable into triangles, and not on the elementary triangles?

It is because there are properties appertaining to polygons of any number of sides and that may be immediately applied to any particular polygon.

Usually, on the contrary, it is only at the cost of the most prolonged exertions that they could be found by studying directly the relations of the elementary triangles. The knowledge of the general theorem spares us these efforts.

A construction, therefore, becomes interesting only when it can be ranged beside other analogous constructions, forming species of the same genus.

If the quadrilateral is something besides the juxtaposition of two triangles, this is because it belongs to the genus polygon.

Moreover, one must be able to demonstrate the properties of the genus without being forced to establish them successively for each of the species.

To attain that, we must necessarily mount from the particular to the general, ascending one or more steps.

The analytic procedure 'by construction' does not oblige us to descend, but it leaves us at the same level.

We can ascend only by mathematical induction, which alone can teach us something new. Without the aid of this induction, different in certain respects from physical induction, but quite as fertile, construction would be powerless to create science.

Observe finally that this induction is possible only if the same operation can be repeated indefinitely. That is why the theory of chess can never become a science, for the different moves of the same game do not resemble one another.