Contents

Grids, Sensors, and Signals

The idea of a sensor grid was introduced earlier as a way of making measurements on moving objects, and recording their histories as they pass the sensors. With a grid, sensors make only local measurements on local events. That's the simplest way of making measurements of position and time. In this way, it's possible to simply express the fundamental ideas of relativity.

Earlier, we defined a signal as a physical object (matter or radiation) that moves energy from one place to another. This section focuses on radiant signals only (or just signals for short). The best example of a radiant signal is a photon of electromagnetic radiation (light waves, radio waves, and so on). As well as making measurements on passing objects, sensors can also make measurements by detecting radiant signals. The best example of this is an astronomical telescope. It's usually not possible to travel to distant astronomical objects, so astronomers are left with the only source of data left to them: the incoming signals hitting their detectors. It's true that the ultimate source of the signal is distant, but the detection event itself is a local measurement on the signal as it arrives at the local sensor. The only difference is that the signal usually refers back to a distant, non-local object.

Canvas not supported. Please upgrade your browser. Canvas not supported. Please upgrade your browser.

Astronomers make use of all the incoming radiant signals they can get their eager hands on:

These signals:
Part II does not make use of the rel prefix convention used in Part I, where it's used to emphasize the difference between relative and invariant physical quantities. This was useful in Part I especially because it references a number of physical quantities that are invariant in the world of Newton, but relative in the world of Einstein.

In optics, however, almost all physical quantities (frequency, direction, and so on) are relative according to Newton and according to Einstein. So for optics it seems less compelling to make the relative/invariant distinction using the rel convention.

Signals: Energy And Direction

Many different quantities can be measured using a signal. They can be divided into two categories.

First, quantities related to the energy carried by the signal:

Second, quantities related to the direction of the signal:

The quantities listed here are all relative to a given grid. They aren't invariant.

Signals travelling at the speed limit (such as light) must arrive at the detector by travelling on the detector's past light cone. Quantities related to direction can be found by looking at the intersection of the detector's past light cone with the history of the emitter.

Canvas not supported. Please upgrade your browser.

When an object can be approximated as a point, then its history is a curve or a line. When an object has a measurable size, then it's said to be an extended object, and it can't be approximated as a point. To represent an extended object, you can represent it as a collection of points that form a volume in space. In space-time, the history of the object is the collected histories of all those points, taken as a group. That collection makes up a history tube in space time. It corresponds to the volume of points changing position over time.

The diagram above shows the past light cone of a detector intersecting the history tube of an extended object, at one moment in time according to the detector's wristwatch. You can see that, in general, the intersection events are the same as the emission events of the light signal, corresponding to light emitted from the surface of the extended object. The emission events don't all share the same ct value: they aren't simultaneous. This is because the light cone slices across the history tube at a 45° angle. However, the light signal detection events are indeed simultaneous, since the light signals all meet up at the same event.

So, the image of an extended object seen in a detector is made up of light that was emitted at different times. This is simply caused by the differences in travel-time for the light emitted from the different parts of the object.

When the object isn't moving with respect to the detector, the differences in travel-time don't make much difference. When the object is moving with respect to the detector, and when the differences in travel-time are significant, then the differences show up as distortions in the detected image. The larger the differences in travel-time, the greater the distortions.

Light-Slice and Time-Slice

A surface in space-time can intersect the history of a given object. Let's call the intersection of a surface with the history of an object a slice of the history. There are two important ways to slice an object's history:

NameIntersect History WithDefines
Time-slicea surface ct=constant the spatial geometry of an object with respect to a given grid
Light-slicethe past light cone of a detectorthe optical properties of an object in a given detector

Sometimes a light-slice of an object's history in spacetime can be approximated using a plane instead of a cone, with the plane oriented at 45 degrees with respect to the ct axis (and oriented in the right direction). This only works if the object subtends a small angle at the detector. In that case, the curvature of the past light cone is very small in the region of the object's history.

Photon Signals

The remainder of Part II is specific to light, or electromagnetic radiation. Electromagnetic radiation is carried by photons, massless particles that always travel at the speed limit (through empty space). Like any radiant signal, a photon carries a certain amount of energy in a certain direction. Usually, you're interested in exactly those two things - the photon's energy and its direction of motion. Sometimes, the polarization can be of interest too. The energy of a photon is proportional to its frequency. The energy/frequency of a photon is continuous. It's traditional to divide up these continuous values into named ranges or chunks. In order from the lowest to the highest energy/frequency, these chunks are: It's important to note that the energy and the direction of motion of a photon are both specific to a grid, and change from one grid to another. A photon appearing as a gamma-ray photon in one grid can appear, say, as a microwave photon in a second grid, which is boosted with respect to the first grid. It's important to keep this in mind. We investigate below how the energy and direction of motion change during a boost.

The energy and direction of motion of a photon with respect to a grid are represented in a diagram like this:

Canvas not supported. Please upgrade your browser. Canvas not supported. Please upgrade your browser.

These diagrams are similar in structure to regular space-time diagrams, but:

The diagrams are a bit different because they show photons as a wave vector (or propagation vector):

In the diagram above on the left, the blue arrow indicates a photon with higher energy than the green arrow, since it has a greater length. We're making use of various colors for wave vectors, to show their relative energy differences:

Events On The Light Cone With Constant ct

Both the frequency and direction of motion of a photon are specific to a grid. How do they change during a boost? Well, let's go back to space-time events for a moment. If you apply the boost transformation to events that: then you can see also, at the same time, exactly how the energy and direction of a photon change during the same boost.

The selected events appear in a circle centered on the ct axis. This animation shows how these events get boosted. As you can see, the usual toothpaste effect still applies: some events are squeezed toward the origin, while others are stretched away from it.

The important point here is that the boost transformation for these events on the light cone can be interpreted as applying to wave vectors as well. The reason this works is because, as mentioned above, the photon vector is a 4-vector. So, the boost transformation for events also defines how the 4 parts of the wave vector change during a boost, in the same way it does for events. It's the exact same math, and the exact same geometry. So, the boost transformation for these events shows how to calculate the change in frequency and direction of photons.

The circle around the ct axis corresponds to wave vectors having the same energy, but different directions. The generated ellipse corresponds to how these wave vectors appear in the boosted grid.

Canvas not supported. Please upgrade your browser.

The ellipse in the animation shows two important effects:

Direction: Aberration

The sensors in a grid can detect and measure the direction of photons coming from an object (such as a distant star). How is a photon's direction affected by a boost transformation?

There are 3 directions to be aware of:

The detector-direction is exactly opposite the photon-direction.

Both the photon-direction and the detector-direction make a certain angle with the boost-direction. In a boosted grid, the photon-direction is deflected away from the boost-direction. Equivalently, the detector-direction is deflected towards the boost-direction. These are illustrated in the diagrams below.

This effect is called the aberration of light. Like the the pancake effect, aberration is highly directional in nature.

When travelling in a spacecraft at relativistic speeds in interstellar space, the stars would be seen as bunched up towards the direction of motion of the spacecraft. The aberration of light was discovered by accident by James Bradley in 1729.

There are two versions of the aberration formula, one for the photon-direction, and one for the detector-direction. In the formulas below, the prime denotes the boosted grid, and the angles are with respect to the boost-direction.

A boost deflects the detector-direction toward the boost-direction, from D to D':

Canvas not supported. Please upgrade your browser. Canvas not supported. Please upgrade your browser.

A boost deflects the photon-direction away from the boost-direction, from P to P':

Canvas not supported. Please upgrade your browser. Canvas not supported. Please upgrade your browser.

The effect of aberration can be equivalently seen as:

As you can see from the animations, the amount of concentration in the forward direction can become quite high. As a measure of the amount of concentration, this site uses the idea of the half-sky radius, meaning the angular radius from the direction of motion that encloses what would be half the sky in an unboosted grid.

Aberration Preserves Angles

Think of a constellation of stars in the night sky, with the stars being joined together with imaginary lines to make a shape. Those lines make angles with each other. If you apply aberration to the starlight, the directions of the stars change, but the angles made by the constellation lines actually stay the same. This is an example of what mathematicians call a conformal transformation.

Conformal transformations preserve angles, but not lengths. That is:

In the case of aberration of starlight in the night sky, circular shapes of any size transform into other circular shapes.

In relativity, conformal transformations always preserve light-like relations (s2 = 0), but generally distort time-like and space-like relations. That means they preserve light cones and cause-and-effect relations, but generally distort everything else.

Frequency: Doppler Effect

A process that repeats itself is called periodic. Some examples: The time it takes for the process to repeat is called the period, measured in seconds. Periodic behavior is usually measured using the inverse of the period, called the frequency. The unit of frequency is 1/second, which is also called hertz. The shorter the period, the higher the frequency. All of the above examples can be measured in hertz:
It's important to understand that, from the point of view of physics, all of the above measurements are in hertz. That's because 'ticks', 'rotations', 'waves' and so on are dimensionless. They're dimensionless since they represent counts of things, and counts of things are pure numbers, without any physical unit.

Periodic processes can send out periodic signals. The signal is usually electromagnetic radiation moving at the speed limit. These signals can be detected by the sensors in a grid. In this way, each sensor can measure the frequency of the signals received at its own location. There are two cases, depending on whether the signal source and the detector are moving with respect to each other:

Examples using light signals (in green).

Here the signal source and the two detectors are not moving with respect to each other:

Canvas not supported. Please upgrade your browser.

Here the signal source is moving directly away from detector D1 and directly toward detector D2:

Canvas not supported. Please upgrade your browser.

The change in frequency of a signal from a source moving relative to a detector is called the Doppler effect. Here's an animation showing the effect for motion along the x axis. The Doppler effect has two underlying causes, which can work in opposing ways:

A change in frequency is often described as:

Doppler Factor

The change in frequency seen in the Doppler effect is given by a number called the Doppler factor. The Doppler factor is usually given the symbol D. Here's the formula for D (note that D is dimensionless, just like β and γ):

Canvas not supported. Please upgrade your browser. Canvas not supported. Please upgrade your browser.

The Doppler factor D is a bit trickier than β and γ, since it depends on two things, not just one:

For each β, there's always a θ for which D=1, and the effect passes from blueshift to redshift (or vice versa). Here, the angle (from the direction of motion) for which D=1 is called the neutral angle. The neutral angle is always less than or equal to 90°. Objects at the neutral angle have D=1, and are neither blueshifted nor redshifted.

This animation is an excellent way to explore how D changes with β and θ. Note that for the case of θ=90° (the case of transverse motion), D is just the inverse of the warp factor γ. Here's a second animation that combines aberration and the Doppler effect in a single diagram.

The frequency of a signal seen by a detector is the frequency-at-rest multiplied by the Doppler factor D. The Doppler factor appears again and again in formulas related to waves and light. Some examples:

ItemVaries AsItemVaries As
Any FrequencyDPlane Wave Energy Density (J m-3) D2
Photon EnergyDPlane Wave Energy Flux Density (J s-1 m-2)D2
Photon FrequencyDTemperature of black body radiationD
Photon Wave NumberDSolid angleD-2
Photon WavelengthD-1Angular sizeD-1
Plane Wave E,H field intensitiesDDetected energy per sec per steradD4
Aberration sin θD-1Emitted energy per sec per steradD3/γ

Two Variations for the θ Angle

An angle θ appears in the formulas for the Doppler effect and for aberration. It's an angle made with the boost-direction. There are two ways to define the θ angle. Simply put, you can use either θ or π - θ. Because of the identities:

this can change signs in formulas that use a cos(θ). For example, here are two variations of the formula for the Doppler effect, which reflect the two ways of defining θ:

The returned value of D is the same in both cases, but the formula is different simply because of a different convention for defining θ.

Optical Effects as Non-Mysterious Prototypes

To most people, the Doppler effect and aberration are fairly easy to understand. They don't seem particularly strange or contrary to common experience. On the other hand, time dilation and the pancake effect are usually seen as being very unusual, mysterious, and counter to most people's intuition.

There's a point of view, however, in which time dilation and the pancake effect can be seen as non-mysterious as the Doppler effect and aberration. The idea is that these effects are all tied together by the same mathematics: the boost transformation. All of these effects are consequences of how the boost transformation alters its input 4-vectors. In the case of time dilation and the pancake effect, the input 4-vectors are certain events. In the case of the Doppler effect and aberration, the input 4-vectors are the wave vectors (energy and direction of motion of the photon). But in all cases it's just the boost transformation, and only the boost transformation, that controls the physics.

The boost transformation changes an input 4-vector into an output 4-vector. For photons, the end-point of the wave vector stays on the light cone. It moves up and down the light cone (Doppler effect), and it changes its direction on the light cone (aberration). For the events involved with time dilation and the pancake effect, the end-point of the 4-vector (an event) also moves around on a surface, but the surface isn't a light cone, it's a hyperboloid corresponding to a given value of the squared-interval.

A cone and a hyperboloid don't have the same shape. But that difference shouldn't prevent us from viewing the two cases as being, at a high level, essentially the same kind of operation: a boost transformation mapping an input 4-vector to an output 4-vector, subject to the constraint of staying on a surface (either a light cone, or a hyperboloid with constant squared-interval).

From this point of view, it doesn't seem logically consistent to think of time dilation and the pancake effect as mysterious, while simultaneously thinking of the Doppler effect and aberration as non-mysterious.

From Introduction to Special Relativity, by Robert Resnick:

Is the length contraction "real" or "apparent"? We might answer this by posing a similar question. Is the frequency, or wavelength, shift in the Doppler effect real or apparent?

If you define relativistic as meaning any physics that involves β, the ratio of a speed of something to the speed limit, then the first measurements of relativistic effects were made long before the creation of the theory of special relativity in 1905. The Doppler effect for light was predicted and measured in the 1840s. In 1729, the first measurement of stellar aberration was published by James Bradley. Stellar aberration is essentially a measurement of the ratio of the speed of the Earth in its orbit (relative to the center of mass of the solar system) to the speed of light.

In these early years, the formulas for stellar aberration and the Doppler effect were approximations to the full relativistic formulas, and were valid only for low values of β.

Brightness: Beaming Effect

There are different ways of measuring how bright an object is. One fundamental measure is the total amount of energy received per unit area on a detector, per unit time, over all frequencies. This is called the energy flux density. During a boost, the energy flux density varies as the square of the doppler factor D. For a given object, the exact way in which the brightness changes with D depends on: The increase in brightness depends on D raised to some power. Because of this, the increase in brightness can become very high. This is called relativistic beaming. The section on astrophysical jets contains more about relativistic beaming.

For the brightness of stars, astronomers usually measure brightness using a logarithmic scale called stellar magnitude. A bright star may have magnitude 1.0, while a dim star, barely visible to the human eye, might be of magnitude 6.0. The magnitude depends on the logarithm of the energy flux density (per hertz). For example, a difference of 5 magnitudes means a brightness difference of 100 times. For historical reasons, stellar magnitude actually increases as the brightness decreases (which is a bit confusing).

The radiation from stars is often roughly approximated as blackbody radiation, which is characterized by a temperature T expressed in Kelvins. The temperature T corresponds to the surface temperature of the star. The temperature also corresponds to the color of the star: cooler stars are reddish-orange, hotter stars are blue-white. During a boost, the temperature T is multiplied by the Doppler factor D, and the color also changes to match the new temperature.

For a star of surface temperature T, here's an approximate formula (McKinley, Doherty, 1979) for the change in a star's brightness during a boost, expressed in stellar magnitudes, which depends on both D and T (expressed in Kelvins):

Here's a diagram that illustrates the above formula. It shows the change in magnitude for various spectral classes of stars. The spectral class corresponds to the star's surface temperature T.

The Appearance of a Sky Full of Stars

In movies and television, science fiction animations often show a spacecraft travelling through a field of stars. How accurate are those depictions? Do they agree with the rules of relativity? No, they don't. If you were travelling through space with high β in the neighbourhood of the Sun, the basic character what would you see is shown by this interesting simulation. It accounts for:

In summary, as β increases from zero,

After you reach top speed, you wouldn't see "star trails" as you travel through space, as often depicted in science fiction. Space is simply too empty and vast. Even at ultra-relativistic speeds, the chance of a rapid fly-by of a star is very, very small. These star trails are created by animators to simply give an impression of rapid motion. They've used physics that's appropriate for showing rapid motion on the surface of the Earth, but not for interstellar space. If you wanted to see an actual fly-by of a star, you would have to plan it precisely, and aim towards a specific target.

Here are two interesting graphs showing the total number of stars visible from the Earth, their total relative brightness, and how they change with increasing β. These diagrams were created using 2,539,802 stars taken from the Tycho-2 star catalog. The limiting magnitude of 5.0 was selected to correspond to viewing the stars through a spacecraft window. As in the simulation, the direction of motion is towards Polaris. Notice the very strong cut-off at the highest speeds. This is because most of the starlight is blueshifted out of the visible range. Here, for the brightness unit, a single star of magnitude 0.0 has 1 unit of brightness.

Canvas not supported. Please upgrade your browser. Canvas not supported. Please upgrade your browser.

Here's a large image created using the Tycho-2 star catalog. It shows the stars brighter than magnitude 5.0, at β=0.993, and it uses the stereographic projection. As in the simulation, the direction of motion is towards Polaris. The image file is large, and it may take your browser a few minutes to render it. It's best to download the image file directly, and open it in some tool other than a browser. The image differs from the simulation in these ways:

These changes were made to let you better appreciate the overall density of stars. To create such images on your own, using the Tycho-2 star catalog, you can use this desktop software, created explicitly for that purpose.

Angular Size

Extended sources of light, such as a galaxy or a star cluster, have an angular size, the number of degrees it appears to extend across the sky. Point-like sources of light, such as a star, don't have a measurable apparent angular size.

Because of small differences in the amount of aberration between different directions, the angular size of an extended source of light changes during a boost. During a boost, a (small) angular size is divided by the corresponding Doppler factor D. For example, an object appearing at 15° from the direction of motion, with β=0.6, will have its angular size reduced by D(0.6,15°)=1.9027 with respect to the same measurement in a grid with β=0. (The Doppler factor is applied here after accounting for aberration.)

This implies a curious effect. For a travelling spacecraft, items in the forward view will appear to have a smaller angular size, and will seem compressed, and apparently more distant, compared with the appearance from another spacecraft which is not moving in that direction. Conversely, items in the rearward view will appear magnified, and apparently closer, since D will be less than 1.

You can see this effect using the star field simulation, once for β=0 and once for β=0.5:

(There are small distortions in the chart because it projects the celestial sphere onto the flat screen. But in this case, the distortions are pretty small.)

Geometric Outline

A sensor can take an image of an object, using the incoming light. That image will have an outline. Does that outline change during a boost? In general, yes. In a boosted grid, the outline of an image will be distorted from the image seen in an unboosted grid. The cause is the differences in light travel-time mentioned earlier.

Spherical objects are an exception to the rule: the image of a spherical object will always have a circular outline in every grid, no matter what the boost.

There's a second exception as well. When an object has a small angular size as seen from the detector, then:

Apparent Velocity

By itself, a signal doesn't carry any distance information. However, distance information can be available through other means. (For instance, through triangulation using two detectors.) If a distance is available, then it can be combined with the direction, to deduce an apparent velocity (symbol b). Canvas not supported. Please upgrade your browser.

The apparent velocity b is usually split into two parts:

In an inertial grid, the true velocity of an object is extracted from the event data gathered by multiple sensors, strung out along the object's history. Here, we're measuring the apparent velocity b using direction data from a single sensor (located far from the emitter), combined with an independent meausurement of the distance. Remember that the apparent direction comes from the geometric intersection in space-time of the detector's past light cone and the emitter's history. This is another example of a geometric point that doesn't obey the speed limit rule. The apparent velocity of an object is not at all the same as its true velocity. (To emphasize this, it's best to use a different symbol b instead of β.)

Notice the scissors effect in the following diagrams for radial motion. At high β, the history of an approaching emitter can be at a very small angle with respect to the detector's past light cone. Geometrically, it's sort of like the closing of a pair of scissors. As time passes, the intersection point will sweep across the emitter's history. The higher β, the faster the sweeping speed of the intersection point. The same sort of effect is seen even when the motion is not purely radial.

Canvas not supported. Please upgrade your browser. Canvas not supported. Please upgrade your browser.

Here's an animation showing the scissors effect.

The scissors effect means that, in the limiting case, no early warning of an approaching threat is possible if the threat moves towards you at an ultra-relativistic speed. In such cases, the news of the approaching disaster arrives just before the disaster itself.

In general, there's both radial and transverse motion. Here's an animation that shows how both parts of the apparent velocity change with β and θ (the angle the motion makes with the line of sight). It uses these formulas:

Astrophysical Jets

The above effects regarding apparent speed have actually been observed numerous times, in astrophysical jets. These jets explode out of the centers of galaxies at high speeds. They can also occur on a smaller scale, around black holes behaving as microquasars.

The speeds of the material in the jets can be very high, approaching β=1. The transverse motions of blobs in a jet can actually be tracked over the course of a few years (or even a few days, in the case of microquasars). When the distance to the source of the jet is known, then the apparent transverse speed can be calculated from the year-to-year transverse motion. Sometimes the apparent speed of a jet is found to be more than 1 light-year per year.

Here's a series of images of a radio galaxy named 3C111. It shows a blob of matter emitted from the core in late 1996. It moves (downwards, as shown here) about 15 light-years in under 4 years:

This is not a violation of the speed limit rule, since, as stated above, the apparent speed is the speed of a geometric intersection point in space-time.

Here's an animation that lets you explore these jets.

Note that the direction of the jet in space is random. If the jet happens to point roughly towards the Earth, then the scissors effects mentioned above take place. You have to use your imagination a bit with these blobs, since the information in an image is only 2-dimensional, but the motion is 3-dimensional. The emitted blob is not travelling only in the downward direction here. It's direction of motion is mostly towards you. It's important to understand that.

Due to beaming, the apparent brightness of a jet depends on the geometry. If a jet is pointed towards the Earth, the brightness of the jet is increased. If a jet is pointed away from the Earth, its brightness decreases.

Other examples of jets:

In general, it's difficult for astronomers to find the distances to objects. It's interesting to note that these astrophysical jets provide a new, relativistic way of measuring distances.