I.—KINEMATICAL PORTION.
§ 1. Definition of Synchronism.
Let us have a co-ordinate system, in which the Newtonian equations hold. For distinguishing this system from another which will be introduced hereafter, we shall always call it “the stationary system.”
If a material point be at rest in this system, then its position in this system can be found out by a measuring rod, and can be expressed by the methods of Euclidean Geometry, or in Cartesian co-ordinates.
If we wish to describe the motion of a material point, the values of its coordinates must be expressed as functions of time. It is always to be borne in mind that such a mathematical definition has a physical sense, only when we have a clear notion of what is meant by time. We have to take into consideration the fact that those of our conceptions, in which time plays a part, are always conceptions of synchronism. For example, we say that a train arrives here at 7 o’clock; this means that the exact pointing of the little hand of my watch to 7, and the arrival of the train are synchronous events.
It may appear that all difficulties connected with the definition of time can be removed when in place of time, we substitute the position of the little hand of my watch. Such a definition is in fact sufficient, when it is required to define time exclusively for the place at which the clock is stationed. But the definition is not sufficient when it is required to connect by time events taking place at different stations,—or what amounts to the same thing,—to estimate by means of time (zeitlich werten) the occurrence of events, which take place at stations distant from the clock.
Now with regard to this attempt;—the time-estimation of events, we can satisfy ourselves in the following manner. Suppose an observer—who is stationed at the origin of coordinates with the clock—associates a ray of light which comes to him through space, and gives testimony to the event of which the time is to be estimated,—with the corresponding position of the hands of the clock. But such an association has this defect,—it depends on the position of the observer provided with the clock, as we know by experience. We can attain to a more practicable result by the following treatment.
If an observer be stationed at A with a clock, he can estimate the time of events occurring in the immediate neighbourhood of A, by looking for the position of the hands of the clock, which are synchronous with the event. If an observer be stationed at B with a clock,—we should add that the clock is of the same nature as the one at A,—he can estimate the time of events occurring about B. But without further premises, it is not possible to compare, as far as time is concerned, the events at B with the events at A. We have hitherto an A-time, and a B-time, but no time common to A and B. This last time (i.e., common time) can be defined, if we establish by definition that the time which light requires in travelling from A to B is equivalent to the time which light requires in travelling from B to A. For example, a ray of light proceeds from A at A-time tA towards B, arrives and is reflected from B at B-time tB, and returns to A at A-time t′A. According to the definition, both clocks are synchronous, if
tB - tA = t′A - tB.
We assume that this definition of synchronism is possible without involving any inconsistency, for any number of points, therefore the following relations hold:—
1. If the clock at B be synchronous with the clock at A, then the clock at A is synchronous with the clock at B.
2. If the clock at A as well as the clock at B are both synchronous with the clock at C, then the clocks at A and B are synchronous.
Thus with the help of certain physical experiences, we have established what we understand when we speak of clocks at rest at different stations, and synchronous with one another; and thereby we have arrived at a definition of synchronism and time.
In accordance with experience we shall assume that the magnitude
where c is a universal constant.
We have defined time essentially with a clock at rest in a stationary system. On account of its adaptability to the stationary system, we call the time defined in this way as “time of the stationary system.”
§ 2. On the Relativity of Length and Time.
The following reflections are based on the Principle of Relativity and on the Principle of Constancy of the velocity of light, both of which we define in the following way:—
1. The laws according to which the nature of physical systems alter are independent of the manner in which these changes are referred to two co-ordinate systems which have a uniform translators motion relative to each other.
2. Every ray of light moves in the “stationary co-ordinate system” with the same velocity c, the velocity being independent of the condition whether this ray of light is emitted by a body at rest or in motion.[[6]] Therefore
velocity = Path of Light/Interval of time,
where, by ‘interval of time’ we mean time as defined in [§1].
Let us have a rigid rod at rest; this has a length l, when measured by a measuring rod at rest; we suppose that the axis of the rod is laid along the X-axis of the system at rest, and then a uniform velocity v, parallel to the axis of X, is imparted to it. Let us now enquire about the length of the moving rod; this can be obtained by either of these operations.—
(a) The observer provided with the measuring rod moves along with the rod to be measured, and measures by direct superposition the length of the rod:—just as if the observer, the measuring rod, and the rod to be measured were at rest.
(b) The observer finds out, by means of clocks placed in a system at rest (the clocks being synchronous as defined in [§1]), the points of this system where the ends of the rod to be measured occur at a particular time t. The distance between these two points, measured by the previously used measuring rod, this time it being at rest, is a length, which we may call the “length of the rod.”
According to the Principle of Relativity, the length found out by the operation a), which we may call “the length of the rod in the moving system” is equal to the length l of the rod in the stationary system.
The length which is found out by the second method, may be called ‘the length of the moving rod measured from the stationary system.’ This length is to be estimated on the basis of our principle, and we shall find it to be different from l.
In the generally recognised kinematics, we silently assume that the lengths defined by these two operations are equal, or in other words, that at an epoch of time t, a moving rigid body is geometrically replaceable by the same body, which can replace it in the condition of rest.
Relativity of Time.
Let us suppose that the two clocks synchronous with the clocks in the system at rest are brought to the ends A, and B of a rod, i.e., the time of the clocks correspond to the time of the stationary system at the points where they happen to arrive; these clocks are therefore synchronous in the stationary system.
We further imagine that there are two observers at the two watches, and moving with them, and that these observers apply the criterion for synchronism to the two clocks. At the time tA, a ray of light goes out from A, is reflected from B at the time tB, and arrives back at A at time t′A. Taking into consideration the principle of, constancy of the velocity of light, we have
tB - tA = rAB/(c - v),
and t′A - tB = rAB/(c + v),
where rAB is the length of the moving rod, measured in the stationary system. Therefore the observers stationed with the watches will not find the clocks synchronous, though the observer in the stationary system must declare the clocks to be synchronous. We therefore see that we can attach no absolute significance to the concept of synchronism; but two events which are synchronous when viewed from one system, will not be synchronous when viewed from a system moving relatively to this system.
§ 3. Theory of Co-ordinate and Time-Transformation from a stationary system to a system which moves relatively to this with uniform velocity.
Let there be given, in the stationary system two co-ordinate systems, i.e., two series of three mutually perpendicular lines issuing from a point. Let the X-axes of each coincide with one another, and the Y and Z-axes be parallel. Let a rigid measuring rod, and a number of clocks be given to each of the systems, and let the rods and clocks in each be exactly alike each other.
Let the initial point of one of the systems (k) have a constant velocity in the direction of the X-axis of the other which is stationary system K, the motion being also communicated to the rods and clocks in the system (k). Any time t of the stationary system K corresponds to a definite position of the axes of the moving system, which are always parallel to the axes of the stationary system. By t, we always mean the time in the stationary system.
We suppose that the space is measured by the stationary measuring rod placed in the stationary system, as well as by the moving measuring rod placed in the moving system, and we thus obtain the co-ordinates (x, y, z) for the stationary system, and (ξ, η, ζ) for the moving system. Let the time t be determined for each point of the stationary system (which are provided with clocks) by means of the clocks which are placed in the stationary system, with the help of light-signals as described in [§ 1]. Let also the time τ of the moving system be determined for each point of the moving system (in which there are clocks which are at rest relative to the moving system), by means of the method of light signals between these points (in which there are clocks) in the manner described in [§ 1].
To every value of (x, y, z, t) which fully determines the position and time of an event in the stationary system, there correspond a system of values (ξ, η, ζ, τ); now the problem is to find out the system of equations connecting these magnitudes.
Primarily it is clear that on account of the property of homogeneity which we ascribe to time and space, the equations must be linear.
If we put x′ = x - vt, then it is clear that at a point relatively at rest in the system k, we have a system of values (x′ y z) which are independent of time. Now let us find out τ as a function of (x′, y, z, t). For this purpose we have to express in equations the fact that τ is not other than the time given by the clocks which are at rest in the system k which must be made synchronous in the manner described in [§ 1].
Let a ray of light be sent at time τ₀ from the origin of the system k along the X-axis towards x′ and let it be reflected from that place at time τ₁ towards the origin of moving co-ordinates and let it arrive there at time τ₂; then we must have
½ (τ0 + τ2) = τ1
If we now introduce the condition that τ is a function of co-ordinates, and apply the principle of constancy of the velocity of light in the stationary system, we have
It is to be noticed that instead of the origin of co-ordinates, we could select some other point as the exit point for rays of light, and therefore the above equation holds for all values of (x′, y, z, t,).
A similar conception, being applied to the y- and z-axis gives us, when we take into consideration the fact that light when viewed from the stationary system, is always propagated along those axes with the velocity √(c² - v²), we have the questions
∂ ∂τ
---- = 0, ---- = 0.
∂y ∂z
From these equations it follows that τ is a linear function of x′ and t. From equations (1) we obtain
vx′
τ = a (t - --------- )
c2 - v2
where a is an unknown function of v.
With the help of these results it is easy to obtain the magnitudes (ξ, η, ζ) if we express by means of equations the fact that light, when measured in the moving system is always propagated with the constant velocity c (as the principle of constancy of light velocity in conjunction with the principle of relativity requires). For a time τ = 0, if the ray is sent in the direction of increasing ξ, we have
vx′
ξ = cτ, i.e. ξ = a c(t - ------------ )
c2 - v2
Now the ray of light moves relative to the origin of k with a velocity c - v, measured in the stationary system; therefore we have
x′
-------- = t
c - v
Substituting these values of t in the equation for ξ, we obtain
c2
ξ = a ------------ x′
c2 - v2
In an analogous manner, we obtain by considering the ray of light which moves along the y-axis,
vx′
η = cτ = a c(t - ----------- )
c2 - v2
where
y
------------- = t, x′ = 0,
√ (c2 - v2)
Therefore
c
η = a ------------- y,
√ (c2 - v2)
c
ζ = a --------------- z .
√ (c2 - v2)
If for x′, we substitute its value x - tv, we obtain
v.c
τ = φ (v). β (t - ------- ,
c2
ξ = φ (v). β (x - vt) ,
η = φ (v) y
ζ = φ (v) z ,
where
and
φ (v) = ac / √ (c2 - v2) = a / β
is a function of v.
If we make no assumption about the initial position of the moving system and about the null-point of t, then an additive constant is to be added to the right hand side.
We have now to show, that every ray of light moves in the moving system with a velocity c (when measured in the moving system), in case, as we have actually assumed, c is also the velocity in the stationary system; for we have not as yet adduced any proof in support of the assumption that the principle of relativity is reconcilable with the principle of constant light-velocity.
At a time τ = t = 0 let a spherical wave be sent out from the common origin of the two systems of co-ordinates, and let it spread with a velocity c in the system K. If (x, y, z), be a point reached by the wave, we have
x2 + y2 + z2 = c2t2
with the aid of our transformation-equations, let us transform this equation, and we obtain by a simple calculation,
ξ2 + η2 + ζ2 = c2τ2.
Therefore the wave is propagated in the moving system with the same velocity c, and as a spherical wave.[[7]] Therefore we show that the two principles are mutually reconcilable.
In the transformations we have got an undetermined function φ(v), and we now proceed to find it out.
Let us introduce for this purpose a third co-ordinate system k′, which is set in motion relative to the system k, the motion being parallel to the ξ-axis. Let the velocity of the origin be (-v). At the time t = 0, all the initial co-ordinate points coincide, and for t = x = y = z = 0, the time t′ of the system k′ = 0. We shall say that (x′ y′ z′ t′) are the co-ordinates measured in the system k′, then by a two-fold application of the transformation-equations, we obtain
v
τ′ = φ(-v)β(-v){τ + --- ξ}
c2
= φ(v)φ(-v)t,
x′ = φ](v)β(v)(ξ + vτ)
= φ(v)φ(-v)x, etc.
Since the relations between (x′, y′, z′, t′), and (x, y, z, t) do not contain time explicitly, therefore K and k′ are relatively at rest.
It appears that the systems K and k′ are identical.
∴ φ(v)φ(-v) = 1.
Let us now turn our attention to the part of the ξ-axis between (ξ = 0, η = 0, ζ = 0), and (ξ = 0, η = 1, ζ = 0). Let this piece of the y-axis be covered with a rod moving with the velocity v relative to the system K and perpendicular to its axis;—the ends of the rod having therefore the co-ordinates
x1 = vt, y1 = l / φ(v), z1 = 0
x2 = vt, y2 = 0, z2 = 0
Therefore the length of the rod measured in the system K is l/φ(v). For the system moving with velocity (-v), we have on grounds of symmetry,
l l
------ = ------
φ(v) φ(-v)
∴ φ(v) = φ(-v), ∴ φ(v) = 1.
§ 4. The physical significance of the equations obtained concerning moving rigid bodies and moving clocks.
Let us consider a rigid sphere (i.e., one having a spherical figure when tested in the stationary system) of radius R which is at rest relative to the system (K), and whose centre coincides with the origin of K then the equation of the surface of this sphere, which is moving with a velocity v relative to K, is
ξ2 + η2 + ζ2 = R2.
At time t = 0, the equation is expressed by means of (x, y, z, t,) as
A rigid body which has the figure of a sphere when measured in the moving system, has therefore in the moving condition—when considered from the stationary system, the figure of a rotational ellipsoid with semi-axes
Therefore the y and z dimensions of the sphere (therefore of any figure also) do not appear to be modified by the motion, but the x dimension is shortened in the ratio
the shortening is the larger, the larger is v. For v = c, all moving bodies, when considered from a stationary system shrink into planes. For a velocity larger than the velocity of light, our propositions become meaningless; in our theory c plays the part of infinite velocity.
It is clear that similar results hold about stationary bodies in a stationary system when considered from a uniformly moving system.
Let us now consider that a clock which is lying at rest in the stationary system gives the time t, and lying at rest relative to the moving system is capable of giving the time τ; suppose it to be placed at the origin of the moving system k, and to be so arranged that it gives the time τ. How much does the clock gain, when viewed from the stationary system K? We have,
Therefore the clock loses by an amount ½(v²/c²) per second of motion, to the second order of approximation.
From this, the following peculiar consequence follows. Suppose at two points A and B of the stationary system two clocks are given which are synchronous in the sense explained in § 3 when viewed from the stationary system. Suppose the clock at A to be set in motion in the line joining it with B, then after the arrival of the clock at B, they will no longer be found synchronous, but the clock which was set in motion from A will lag behind the clock which had been all along at B by an amount ½t(v²/c²), where t is the time required for the journey.
We see forthwith that the result holds also when the clock moves from A to B by a polygonal line, and also when A and B coincide.
If we assume that the result obtained for a polygonal line holds also for a curved line, we obtain the following law. If at A, there be two synchronous clocks, and if we set in motion one of them with a constant velocity along a closed curve till it comes back to A, the journey being completed in t-seconds, then after arrival, the last mentioned clock will be behind the stationary one by ½t(v²/c²) seconds. From this, we conclude that a clock placed at the equator must be slower by a very small amount than a similarly constructed clock which is placed at the pole, all other conditions being identical.
§ 5. Addition-Theorem of Velocities.
Let a point move in the system k (which moves with velocity v along the x-axis of the system K) according to the equation
where wξ and wη are constants.
It is required to find out the motion of the point relative to the system K. If we now introduce the system of equations in § 3 in the equation of motion of the point, we obtain
The law of parallelogram of velocities hold up to the first order of approximation. We can put
and
i.e., α is put equal to the angle between the velocities v, and w. Then we have—
It should be noticed that v and w enter into the expression for velocity symmetrically. If w has the direction of the ξ-axis of the moving system,
From this equation, we see that by combining two velocities, each of which is smaller than c, we obtain a velocity which is always smaller than c. If we put v = c - χ, and w = c - λ, where χ and λ are each smaller than c,[[8]]
It is also clear that the velocity of light c cannot be altered by adding to it a velocity smaller than c. For this case,
We have obtained the formula for U for the case when v and w have the same direction; it can also be obtained by combining two transformations according to section [§ 3]. If in addition to the systems K, and k, we introduce the system k´, of which the initial point moves parallel to the ξ-axis with velocity w, then between the magnitudes, x, y, z, t and the corresponding magnitudes of k´, we obtain a system of equations, which differ from the equations in [§ 3], only in the respect that in place of v, we shall have to write,
We see that such a parallel transformation forms a group.
We have deduced the kinematics corresponding to our two fundamental principles for the laws necessary for us, and we shall now pass over to their application in electrodynamics.