The Foundations of Science: Science and Hypothesis, The Value of Science, Science and Method

CHAPTER VIII

Chapter 323,964 wordsPublic domain

ENERGY AND THERMODYNAMICS

ENERGETICS.--The difficulties inherent in the classic mechanics have led certain minds to prefer a new system they call _energetics_.

Energetics took its rise as an outcome of the discovery of the principle of the conservation of energy. Helmholtz gave it its final form.

It begins by defining two quantities which play the fundamental rôle in this theory. They are _kinetic energy_, or _vis viva_, and _potential energy_.

All the changes which bodies in nature can undergo are regulated by two experimental laws:

1º The sum of kinetic energy and potential energy is constant. This is the principle of the conservation of energy.

2º If a system of bodies is at _A_ at the time t_{0} and at _B_ at the time t_{1}, it always goes from the first situation to the second in such a way that the _mean_ value of the difference between the two sorts of energy, in the interval of time which separates the two epochs t_{0} and t_{1}, may be as small as possible.

This is Hamilton's principle, which is one of the forms of the principle of least action.

The energetic theory has the following advantages over the classic theory:

1º It is less incomplete; that is to say, Hamilton's principle and that of the conservation of energy teach us more than the fundamental principles of the classic theory, and exclude certain motions not realized in nature and which would be compatible with the classic theory:

2º It saves us the hypothesis of atoms, which it was almost impossible to avoid with the classic theory.

But it raises in its turn new difficulties:

The definitions of the two sorts of energy would raise difficulties almost as great as those of force and mass in the first system. Yet they may be gotten over more easily, at least in the simplest cases.

Suppose an isolated system formed of a certain number of material points; suppose these points subjected to forces depending only on their relative position and their mutual distances, and independent of their velocities. In virtue of the principle of the conservation of energy, a function of forces must exist.

In this simple case the enunciation of the principle of the conservation of energy is of extreme simplicity. A certain quantity, accessible to experiment, must remain constant. This quantity is the sum of two terms; the first depends only on the position of the material points and is independent of their velocities; the second is proportional to the square of these velocities. This resolution can take place only in a single way.

The first of these terms, which I shall call _U_, will be the potential energy; the second, which I shall call _T_, will be the kinetic energy.

It is true that if _T_ + _U_ is a constant, so is any function of _T_ + _U_,

{Phi}(_T_ + _U_).

But this function {Phi}(_T_ + _U_) will not be the sum of two terms the one independent of the velocities, the other proportional to the square of these velocities. Among the functions which remain constant there is only one which enjoys this property, that is _T_ + _U_ (or a linear function of _T_ + _U_, which comes to the same thing, since this linear function may always be reduced to _T_ + _U_ by change of unit and of origin). This then is what we shall call energy; the first term we shall call potential energy and the second kinetic energy. The definition of the two sorts of energy can therefore be carried through without any ambiguity.

It is the same with the definition of the masses. Kinetic energy, or _vis viva_, is expressed very simply by the aid of the masses and the relative velocities of all the material points with reference to one of them. These relative velocities are accessible to observation, and, when we know the expression of the kinetic energy as function of these relative velocities, the coefficients of this expression will give us the masses.

Thus, in this simple case, the fundamental ideas may be defined without difficulty. But the difficulties reappear in the more complicated cases and, for instance, if the forces, in lieu of depending only on the distances, depend also on the velocities. For example, Weber supposes the mutual action of two electric molecules to depend not only on their distance, but on their velocity and their acceleration. If material points should attract each other according to an analogous law, _U_ would depend on the velocity, and might contain a term proportional to the square of the velocity.

Among the terms proportional to the squares of the velocities, how distinguish those which come from _T_ or from _U_? Consequently, how distinguish the two parts of energy?

But still more; how define energy itself? We no longer have any reason to take as definition _T_ + _U_ rather than any other function of _T_ + _U_, when the property which characterized _T_ + _U_ has disappeared, that, namely, of being the sum of two terms of a particular form.

But this is not all; it is necessary to take account, not only of mechanical energy properly so called, but of the other forms of energy, heat, chemical energy, electric energy, etc. The principle of the conservation of energy should be written:

_T_ + _U_ + _Q_ = const.

where _T_ would represent the sensible kinetic energy, _U_ the potential energy of position, depending only on the position of the bodies, _Q_ the internal molecular energy, under the thermal, chemic or electric form.

All would go well if these three terms were absolutely distinct, if _T_ were proportional to the square of the velocities, _U_ independent of these velocities and of the state of the bodies, _Q_ independent of the velocities and of the positions of the bodies and dependent only on their internal state.

The expression for the energy could be resolved only in one single way into three terms of this form.

But this is not the case; consider electrified bodies; the electrostatic energy due to their mutual action will evidently depend upon their charge, that is to say, on their state; but it will equally depend upon their position. If these bodies are in motion, they will act one upon another electrodynamically and the electrodynamic energy will depend not only upon their state and their position, but upon their velocities.

We therefore no longer have any means of making the separation of the terms which should make part of _T_, of _U_ and of _Q_, and of separating the three parts of energy.

If (_T_ + _U_ + _Q_) is constant so is any function [phi](_T_ + _U_ + _Q_).

If _T_ + _U_ + _Q_ were of the particular form I have above considered, no ambiguity would result; among the functions [phi](_T_ + _U_ + _Q_) which remain constant, there would only be one of this particular form, and that I should convene to call energy.

But as I have said, this is not rigorously the case; among the functions which remain constant, there is none which can be put rigorously under this particular form; hence, how choose among them the one which should be called energy? We no longer have anything to guide us in our choice.

There only remains for us one enunciation of the principle of the conservation of energy: _There is something which remains constant_. Under this form it is in its turn out of the reach of experiment and reduces to a sort of tautology. It is clear that if the world is governed by laws, there will be quantities which will remain constant. Like Newton's laws, and, for an analogous reason, the principle of the conservation of energy, founded on experiment, could no longer be invalidated by it.

This discussion shows that in passing from the classic to the energetic system progress has been made; but at the same time it shows this progress is insufficient.

Another objection seems to me still more grave: the principle of least action is applicable to reversible phenomena; but it is not at all satisfactory in so far as irreversible phenomena are concerned; the attempt by Helmholtz to extend it to this kind of phenomena did not succeed and could not succeed; in this regard everything remains to be done. The very statement of the principle of least action has something about it repugnant to the mind. To go from one point to another, a material molecule, acted upon by no force, but required to move on a surface, will take the geodesic line, that is to say, the shortest path.

This molecule seems to know the point whither it is to go, to foresee the time it would take to reach it by such and such a route, and then to choose the most suitable path. The statement presents the molecule to us, so to speak, as a living and free being. Clearly it would be better to replace it by an enunciation less objectionable, and where, as the philosophers would say, final causes would not seem to be substituted for efficient causes.

THERMODYNAMICS.[4]--The rôle of the two fundamental principles of thermodynamics in all branches of natural philosophy becomes daily more important. Abandoning the ambitious theories of forty years ago, which were encumbered by molecular hypotheses, we are trying to-day to erect upon thermodynamics alone the entire edifice of mathematical physics. Will the two principles of Mayer and of Clausius assure to it foundations solid enough for it to last some time? No one doubts it; but whence comes this confidence?

[4] The following lines are a partial reproduction of the preface of my book _Thermodynamique_.

An eminent physicist said to me one day _à propos_ of the law of errors: "All the world believes it firmly, because the mathematicians imagine that it is a fact of observation, and the observers that it is a theorem of mathematics." It was long so for the principle of the conservation of energy. It is no longer so to-day; no one is ignorant that this is an experimental fact.

But then what gives us the right to attribute to the principle itself more generality and more precision than to the experiments which have served to demonstrate it? This is to ask whether it is legitimate, as is done every day, to generalize empirical data, and I shall not have the presumption to discuss this question, after so many philosophers have vainly striven to solve it. One thing is certain; if this power were denied us, science could not exist or, at least, reduced to a sort of inventory, to the ascertaining of isolated facts, it would have no value for us, since it could give no satisfaction to our craving for order and harmony and since it would be at the same time incapable of foreseeing. As the circumstances which have preceded any fact will probably never be simultaneously reproduced, a first generalization is already necessary to foresee whether this fact will be reproduced again after the least of these circumstances shall be changed.

But every proposition may be generalized in an infinity of ways. Among all the generalizations possible, we must choose, and we can only choose the simplest. We are therefore led to act as if a simple law were, other things being equal, more probable than a complicated law.

Half a century ago this was frankly confessed, and it was proclaimed that nature loves simplicity; she has since too often given us the lie. To-day we no longer confess this tendency, and we retain only so much of it as is indispensable if science is not to become impossible.

In formulating a general, simple and precise law on the basis of experiments relatively few and presenting certain divergences, we have therefore only obeyed a necessity from which the human mind can not free itself.

But there is something more, and this is why I dwell upon the point.

No one doubts that Mayer's principle is destined to survive all the particular laws from which it was obtained, just as Newton's law has survived Kepler's laws, from which it sprang, and which are only approximative if account be taken of perturbations.

Why does this principle occupy thus a sort of privileged place among all the physical laws? There are many little reasons for it.

First of all it is believed that we could not reject it or even doubt its absolute rigor without admitting the possibility of perpetual motion; of course we are on our guard at such a prospect, and we think ourselves less rash in affirming Mayer's principle than in denying it.

That is perhaps not wholly accurate; the impossibility of perpetual motion implies the conservation of energy only for reversible phenomena.

The imposing simplicity of Mayer's principle likewise contributes to strengthen our faith. In a law deduced immediately from experiment, like Mariotte's, this simplicity would rather seem to us a reason for distrust; but here this is no longer the case; we see elements, at first sight disparate, arrange themselves in an unexpected order and form a harmonious whole; and we refuse to believe that an unforeseen harmony may be a simple effect of chance. It seems that our conquest is the dearer to us the more effort it has cost us, or that we are the surer of having wrested her true secret from nature the more jealously she has hidden it from us.

But those are only little reasons; to establish Mayer's law as an absolute principle, a more profound discussion is necessary. But if this be attempted, it is seen that this absolute principle is not even easy to state.

In each particular case it is clearly seen what energy is and at least a provisional definition of it can be given; but it is impossible to find a general definition for it.

If we try to enunciate the principle in all its generality and apply it to the universe, we see it vanish, so to speak, and nothing is left but this: _There is something which remains constant_.

But has even this any meaning? In the determinist hypothesis, the state of the universe is determined by an extremely great number _n_ of parameters which I shall call x_{1}, x_{2},... x_{_n_}. As soon as the values of these _n_ parameters at any instant are known, their derivatives with respect to the time are likewise known and consequently the values of these same parameters at a preceding or subsequent instant can be calculated. In other words, these _n_ parameters satisfy _n_ differential equations of the first order.

These equations admit of _n_ - 1 integrals and consequently there are _n_ - 1 functions of x_{1}, x_{2},... x_{_n_}, which remain constant. _If then we say there is something which remains constant_, we only utter a tautology. We should even be puzzled to say which among all our integrals should retain the name of energy.

Besides, Mayer's principle is not understood in this sense when it is applied to a limited system. It is then assumed that _p_ of our parameters vary independently, so that we only have _n_ - _p_ relations, generally linear, between our _n_ parameters and their derivatives.

To simplify the enunciation, suppose that the sum of the work of the external forces is null, as well as that of the quantities of heat given off to the outside. Then the signification of our principle will be:

_There is a combination of these n - p relations whose first member is an exact differential_; and then this differential vanishing in virtue of our _n_ - _p_ relations, its integral is a constant and this integral is called energy.

But how can it be possible that there are several parameters whose variations are independent? That can only happen under the influence of external forces (although we have supposed, for simplicity, that the algebraic sum of the effects of these forces is null). In fact, if the system were completely isolated from all external action, the values of our _n_ parameters at a given instant would suffice to determine the state of the system at any subsequent instant, provided always we retain the determinist hypothesis; we come back therefore to the same difficulty as above.

If the future state of the system is not entirely determined by its present state, this is because it depends besides upon the state of bodies external to the system. But then is it probable that there exist between the parameters _x_, which define the state of the system, equations independent of this state of the external bodies? and if in certain cases we believe we can find such, is this not solely in consequence of our ignorance and because the influence of these bodies is too slight for our experimenting to detect it?

If the system is not regarded as completely isolated, it is probable that the rigorously exact expression of its internal energy will depend on the state of the external bodies. Again, I have above supposed the sum of the external work was null, and if we try to free ourselves from this rather artificial restriction, the enunciation becomes still more difficult.

To formulate Mayer's principle in an absolute sense, it is therefore necessary to extend it to the whole universe, and then we find ourselves face to face with the very difficulty we sought to avoid.

In conclusion, using ordinary language, the law of the conservation of energy can have only one signification, which is that there is a property common to all the possibilities; but on the determinist hypothesis there is only a single possibility, and then the law has no longer any meaning.

On the indeterminist hypothesis, on the contrary, it would have a meaning, even if it were taken in an absolute sense; it would appear as a limitation imposed upon freedom.

But this word reminds me that I am digressing and am on the point of leaving the domain of mathematics and physics. I check myself therefore and will stress of all this discussion only one impression, that Mayer's law is a form flexible enough for us to put into it almost whatever we wish. By that I do not mean it corresponds to no objective reality, nor that it reduces itself to a mere tautology, since, in each particular case, and provided one does not try to push to the absolute, it has a perfectly clear meaning.

This flexibility is a reason for believing in its permanence, and as, on the other hand, it will disappear only to lose itself in a higher harmony, we may work with confidence, supporting ourselves upon it, certain beforehand that our labor will not be lost.

Almost everything I have just said applies to the principle of Clausius. What distinguishes it is that it is expressed by an inequality. Perhaps it will be said it is the same with all physical laws, since their precision is always limited by errors of observation. But they at least claim to be first approximations, and it is hoped to replace them little by little by laws more and more precise. If, on the other hand, the principle of Clausius reduces to an inequality, this is not caused by the imperfection of our means of observation, but by the very nature of the question.

GENERAL CONCLUSIONS ON PART THIRD

The principles of mechanics, then, present themselves to us under two different aspects. On the one hand, they are truths founded on experiment and approximately verified so far as concerns almost isolated systems. On the other hand, they are postulates applicable to the totality of the universe and regarded as rigorously true.

If these postulates possess a generality and a certainty which are lacking to the experimental verities whence they are drawn, this is because they reduce in the last analysis to a mere convention which we have the right to make, because we are certain beforehand that no experiment can ever contradict it.

This convention, however, is not absolutely arbitrary; it does not spring from our caprice; we adopt it because certain experiments have shown us that it would be convenient.

Thus is explained how experiment could make the principles of mechanics, and yet why it can not overturn them.

Compare with geometry: The fundamental propositions of geometry, as for instance Euclid's postulate, are nothing more than conventions, and it is just as unreasonable to inquire whether they are true or false as to ask whether the metric system is true or false.

Only, these conventions are convenient, and it is certain experiments which have taught us that.

At first blush, the analogy is complete; the rôle of experiment seems the same. One will therefore be tempted to say: Either mechanics must be regarded as an experimental science, and then the same must hold for geometry; or else, on the contrary, geometry is a deductive science, and then one may say as much of mechanics.

Such a conclusion would be illegitimate. The experiments which have led us to adopt as more convenient the fundamental conventions of geometry bear on objects which have nothing in common with those geometry studies; they bear on the properties of solid bodies, on the rectilinear propagation of light. They are experiments of mechanics, experiments of optics; they can not in any way be regarded as experiments of geometry. And even the principal reason why our geometry seems convenient to us is that the different parts of our body, our eye, our limbs, have the properties of solid bodies. On this account, our fundamental experiments are preeminently physiological experiments, which bear, not on space which is the object the geometer must study, but on his body, that is to say, on the instrument he must use for this study.

On the contrary, the fundamental conventions of mechanics, and the experiments which prove to us that they are convenient, bear on exactly the same objects or on analogous objects. The conventional and general principles are the natural and direct generalization of the experimental and particular principles.

Let it not be said that thus I trace artificial frontiers between the sciences; that if I separate by a barrier geometry properly so called from the study of solid bodies, I could just as well erect one between experimental mechanics and the conventional mechanics of the general principles. In fact, who does not see that in separating these two sciences I mutilate them both, and that what will remain of conventional mechanics when it shall be isolated will be only a very small thing and can in no way be compared to that superb body of doctrine called geometry?

One sees now why the teaching of mechanics should remain experimental.

Only thus can it make us comprehend the genesis of the science, and that is indispensable for the complete understanding of the science itself.

Besides, if we study mechanics, it is to apply it; and we can apply it only if it remains objective. Now, as we have seen, what the principles gain in generality and certainty they lose in objectivity. It is, therefore, above all with the objective side of the principles that we must be familiarized early, and that can be done only by going from the particular to the general, instead of the inverse.

The principles are conventions and disguised definitions. Yet they are drawn from experimental laws; these laws have, so to speak, been exalted into principles to which our mind attributes an absolute value.

Some philosophers have generalized too far; they believed the principles were the whole science and consequently that the whole science was conventional.

This paradoxical doctrine, called nominalism, will not bear examination.

How can a law become a principle? It expressed a relation between two real terms _A_ and _B_. But it was not rigorously true, it was only approximate. We introduce arbitrarily an intermediary term _C_ more or less fictitious, and _C_ is _by definition_ that which has with _A_ _exactly_ the relation expressed by the law.

Then our law is separated into an absolute and rigorous principle which expresses the relation of _A_ to _C_ and an experimental law, approximate and subject to revision, which expresses the relation of _C_ to _B_. It is clear that, however far this partition is pushed, some laws will always be left remaining.

We go to enter now the domain of laws properly so called.