- (a)
- Plot two complex numbers $z$ and $w$, together with their sum $z + w$, on the same plane to see an example of the parallelogram law of complex addition.
- (b)
- What happens geometrically when you multiply a complex number by $i$? What if you multiply by $i$ a second time? Verify that this is the same as multiplying by $-1$.
Complex Numbers
The complex numbers, denoted $\mathbb{C}$, have the wonderful property that every nonconstant polynomial with complex coefficients has a (complex) root. This fact is known as the Fundamental Theorem of Algebra.
One pleasant aspect of the complex numbers is that, whereas describing the real numbers in terms of the rationals is a rather complicated business, the complex numbers are quite easy to describe in terms of real numbers. Every complex number has the form \begin{equation*} a + bi \end{equation*} where $a$ and $b$ are real numbers, and $i$ is a root of the polynomial $x^2 + 1$. Here $a$ and $b$ are called the real part and the imaginary part of the complex number, respectively. The real numbers are now regarded as special complex numbers of the form $a + 0i = a$, with zero imaginary part. The complex numbers of the form $0 + bi = bi$ with zero real part are called pure imaginary numbers. The complex number $i$ itself is called the imaginary unit and is distinguished by the fact that \begin{equation*} i^2 = -1 \end{equation*} As the terms complex and imaginary suggest, these numbers met with some resistance when they were first used. This has changed; now they are essential in science and engineering as well as mathematics, and they are used extensively. The names persist, however, and continue to be a bit misleading: These numbers are no more “complex” than the real numbers, and the number $i$ is no more “imaginary” than $-1$.
Much as for polynomials, two complex numbers are declared to be equal if and only if they have the same real parts and the same imaginary parts. In symbols, \begin{equation*} a+bi = a^{\prime } + b^{\prime } i \quad \mbox{ if and only if } a = a^{\prime } \mbox{ and } b = b^{\prime } \end{equation*} The addition and subtraction of complex numbers is accomplished by adding and subtracting real and imaginary parts: \begin{align*} (a+bi) + (a^{\prime } + b^{\prime }i) & = (a + a^{\prime }) + (b + b^{\prime })i\\ (a+bi) - (a^{\prime } + b^{\prime }i) & = (a - a^{\prime }) + (b - b^{\prime })i \end{align*}
This is analogous to these operations for linear polynomials $a + bx$ and $a^{\prime} + b^{\prime}x$, and the multiplication of complex numbers is also analogous with one difference: $i^2 = -1$. The definition is \begin{equation*} (a+bi)(a^{\prime } + b^{\prime }i) = (a a^{\prime } - b b^{\prime }) + (a b^{\prime } + b a^{\prime })i \end{equation*} With these definitions of equality, addition, and multiplication, the complex numbers satisfy all the basic arithmetical axioms adhered to by the real numbers (the verifications are omitted). One consequence of this is that they can be manipulated in the obvious fashion, except that $i^2$ is replaced by $-1$ wherever it occurs, and the rule for equality must be observed.
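The multiplication rule can be checked numerically. The following Python sketch is an illustration only: the helper name `multiply` is hypothetical, and a complex number $a+bi$ is represented as the pair `(a, b)`. It compares the definition against Python's built-in `complex` type on one sample pair.

```python
def multiply(a, b, a2, b2):
    """Multiply (a + bi)(a2 + b2*i) using the definition.

    Returns the pair (real part, imaginary part).
    """
    return (a * a2 - b * b2, a * b2 + b * a2)


# Sample pair (chosen for illustration): (3 + 2i)(1 - 4i)
real, imag = multiply(3, 2, 1, -4)
builtin = complex(3, 2) * complex(1, -4)

print(real, imag)   # real and imaginary parts from the definition
print(builtin)      # the same product from the built-in type
```

Expanding by hand gives $(3+2i)(1-4i) = 3 - 12i + 2i - 8i^2 = 11 - 10i$, which is what both computations should report.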
As for real numbers, it is possible to divide by every nonzero complex number $z$. That is, there exists a complex number $w$ such that $wz = 1$. As in the real case, this number $w$ is called the inverse of $z$ and is denoted by $z^{-1}$ or $\frac{1}{z}$. Moreover, if $z = a + bi$, the fact that $z \neq 0$ means that $a \neq 0$ or $b \neq 0$. Hence $a^2 + b^2 \neq 0$, and an explicit formula for the inverse is \begin{equation*} \frac{1}{z} = \frac{a}{a^2 + b^2} - \frac{b}{a^2+b^2}i \end{equation*} In actual calculations, the work is facilitated by two useful notions: the conjugate and the absolute value of a complex number. The next example illustrates the technique.
The key to this technique is that the product in the denominator turned out to be a real number. The situation in general leads to the following notation: If $z = a + bi$ is a complex number, the conjugate of $z$ is the complex number, denoted $\overline{z}$, given by \begin{equation*} \overline{z} = a-bi \quad \mbox{ where } z = a+bi \end{equation*} Hence $\overline{z}$ is obtained from $z$ by negating the imaginary part. Thus $\overline{(2+3i)} = 2-3i$ and $\overline{(1-i)} = 1+i$. If we multiply $z$ by $\overline{z}$, we obtain \begin{equation*} z \overline{z} = a^2 + b^2 \quad \mbox{ where } z = a+bi \end{equation*}
The real number $a^2 + b^2$ is always nonnegative, so we can state the following definition: The absolute value or modulus of a complex number $z = a + bi$, denoted by $|z|$, is the nonnegative square root $\sqrt{a^2+b^2}$; that is, \begin{equation*} |z| = \sqrt{a^2 + b^2} \quad \mbox{ where } z = a+bi \end{equation*} For example, $|2-3i| = \sqrt{4+9} = \sqrt{13}$ and $|1+i| = \sqrt{1+1} = \sqrt{2}$.
Note that if a real number $a$ is viewed as the complex number $a + 0i$, its absolute value (as a complex number) is $|a| = \sqrt{a^2}$, which agrees with its absolute value as a real number.
With these notions in hand, we can describe the technique applied in Example ex:033897 as follows: When converting a quotient $\frac{z}{w}$ of complex numbers to the form $a + bi$, multiply top and bottom by the conjugate $\overline{w}$ of the denominator.
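The conjugate technique for quotients can be sketched in a few lines of Python. This is an illustrative sketch rather than part of the original text: the function name `divide` is hypothetical, the inputs are assumed to be integers, and exact fractions from the standard `fractions` module are used so that no rounding occurs.

```python
from fractions import Fraction


def divide(a, b, c, d):
    """Return (a + bi) / (c + di) as a pair (real, imag) of exact fractions.

    Top and bottom are multiplied by the conjugate c - di, so the
    denominator becomes the real number c^2 + d^2.
    Assumes a, b, c, d are integers.
    """
    denom = c * c + d * d
    if denom == 0:
        raise ZeroDivisionError("cannot divide by the complex number 0")
    # (a + bi)(c - di) = (ac + bd) + (bc - ad)i
    return Fraction(a * c + b * d, denom), Fraction(b * c - a * d, denom)


print(divide(3, 2, 2, 5))   # the quotient (3 + 2i) / (2 + 5i)
print(divide(1, 0, 3, 4))   # the inverse 1 / (3 + 4i)
```

For the first call, $(3+2i)(2-5i) = 16 - 11i$ and $(2+5i)(2-5i) = 29$, so the quotient is $\frac{16}{29} - \frac{11}{29}i$. The second call reproduces the inverse formula $\frac{1}{z} = \frac{a}{a^2+b^2} - \frac{b}{a^2+b^2}i$ for $z = 3+4i$.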
The following list contains the most important properties of conjugates and absolute values. Throughout, $z$ and $w$ denote complex numbers, assumed nonzero wherever they appear in a denominator. \begin{equation*} \label{eqn:complex_number_properties} \begin{array}{llcll} C1. & \overline{z \pm w} = \overline{z} \pm \overline{w} & \quad & C7. & \frac{1}{z} = \frac{1}{|z|^2}\overline{z} \\ C2. & \overline{zw} = \overline{z}~\overline{w} & \quad & C8. & |z| \geq 0 \mbox{ for all complex numbers } z \\ C3. & \overline{\left (\frac{z}{w}\right )} = \frac{\overline{z}}{\hspace{0.05em}\overline{w}\hspace{0.05em}} & \quad & C9. & |z| = 0 \mbox{ if and only if } z=0 \\ C4. & \overline{(\overline{z})} = z & \quad & C10. & |zw| = |z||w| \\ C5. & z \mbox{ is real if and only if } \overline{z} =z & \quad & C11. & |\frac{z}{w}| = \frac{|z|}{|w|} \\ C6. & z\overline{z} = |z|^2 & \quad & C12. & |z+w| \leq |z|+|w| \mbox{ (the Triangle Inequality)} \\ \end{array} \end{equation*} All these properties (except property C12) can (and should) be verified by the reader for arbitrary complex numbers $z$ and $w$. They are not independent; for example, property C10 follows from properties C2 and C6.
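A numerical spot check is no substitute for verifying these properties in general, but it can catch a misremembered formula. The sketch below (an illustration, with sample values chosen arbitrarily) tests a few of the properties using Python's built-in `complex` type, whose `conjugate()` method and `abs()` function give the conjugate and the absolute value.

```python
import math

# Arbitrary sample values; any nonzero w would do.
z = complex(3, 4)
w = complex(1, -2)

checks = {
    # C2: the conjugate of a product is the product of the conjugates
    "C2": (z * w).conjugate() == z.conjugate() * w.conjugate(),
    # C6: z times its conjugate is the real number |z|^2
    "C6": math.isclose((z * z.conjugate()).real, abs(z) ** 2)
          and (z * z.conjugate()).imag == 0,
    # C10 and C11: absolute values respect products and quotients
    "C10": math.isclose(abs(z * w), abs(z) * abs(w)),
    "C11": math.isclose(abs(z / w), abs(z) / abs(w)),
}

for name, ok in checks.items():
    print(name, ok)
```

Comparisons involving square roots use `math.isclose` because floating-point arithmetic is only approximate.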
The triangle inequality, as its name suggests, comes from a geometric representation of the complex numbers analogous to identification of the real numbers with the points of a line. The representation is achieved as follows:
Introduce a rectangular coordinate system in the plane, and identify the complex number $a + bi$ with the point $(a, b)$, as shown in the figure below.
When this is done, the plane is called the complex plane. Note that the point $(a, 0)$ on the $x$ axis now represents the real number $a = a + 0i$, and for this reason, the $x$ axis is called the real axis. Similarly, the $y$ axis is called the imaginary axis. The identification $(a, b) = a + bi$ of the geometric point $(a, b)$ and the complex number $a + bi$ will be used in what follows without comment. For example, the origin will be referred to as $0$.
This representation of the complex numbers in the complex plane gives a useful way of describing the absolute value and conjugate of a complex number $z = a + bi$. The absolute value $|z| = \sqrt{a^2+b^2}$ is just the distance from $z$ to the origin. This makes properties C8 and C9 quite obvious. The conjugate $\overline{z} = a - bi$ of $z$ is just the reflection of $z$ in the real axis ($x$ axis), a fact that makes properties C4 and C5 clear.
Given two complex numbers $z_1 = a_1 + b_1i = (a_1, b_1)$ and $z_2 = a_2 + b_2i = (a_2, b_2)$, the absolute value of their difference \begin{equation} \label{eqn:distance_formula} |z_1 - z_2| = \sqrt{(a_1-a_2)^2 + (b_1 - b_2)^2} \end{equation} is just the distance between them. This gives the complex distance formula:
\begin{equation*} |z_1 - z_2| \mbox{ is the distance between } z_1 \mbox{ and } z_2 \end{equation*}
This useful fact yields a simple verification of the triangle inequality, property C12. Suppose $z$ and $w$ are given complex numbers. Consider the triangle in the figure below whose vertices are $0$, $w$, and $z + w$.
The three sides have lengths $|z|$, $|w|$, and $|z+w|$ by the complex distance formula, so the inequality \begin{equation*} |z+w| \leq |z| + |w| \end{equation*} expresses the obvious geometric fact that the sum of the lengths of two sides of a triangle is at least as great as the length of the third side.
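The distance formula and the triangle inequality are easy to try out numerically. In the following sketch the function names `distance` and `triangle_holds` are hypothetical helpers, and the small tolerance in `triangle_holds` is an assumption added to absorb floating-point rounding in the degenerate case where the triangle collapses to a line segment.

```python
import math


def distance(z1, z2):
    """Distance between two points of the complex plane, |z1 - z2|."""
    return abs(z1 - z2)


def triangle_holds(z, w, tol=1e-12):
    """Check |z + w| <= |z| + |w|, allowing for rounding error."""
    return abs(z + w) <= abs(z) + abs(w) + tol


z1, z2 = complex(1, 2), complex(4, 6)
print(distance(z1, z2))                                # via |z1 - z2|
print(math.hypot(z1.real - z2.real, z1.imag - z2.imag))  # via coordinates
print(triangle_holds(complex(3, 4), complex(-1, 2)))
```

Here $z_1 - z_2 = -3 - 4i$, so both distance computations should give $\sqrt{9 + 16} = 5$.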
The representation of complex numbers as points in the complex plane has another very useful property: It enables us to give a geometric description of the sum and product of two complex numbers. To obtain the description for the sum, let \begin{align*} z & = a+bi = (a,b) \\ w & = c+di = (c,d) \end{align*}
denote two complex numbers. We claim that the four points $0$, $z$, $w$, and $z + w$ form the vertices of a parallelogram. In fact, in the figure below, the lines from $0$ to $z$ and from $w$ to $z + w$ have slopes \begin{equation*} \frac{b-0}{a-0} = \frac{b}{a} \quad \mbox{ and } \quad \frac{(b+d)-d}{(a+c)-c} = \frac{b}{a} \end{equation*} respectively, so these lines are parallel. (If it happens that $a = 0$, then both these lines are vertical.)
Similarly, the lines from $z$ to $z + w$ and from $0$ to $w$ are also parallel, so the figure with vertices $0$, $z$, $w$, and $z + w$ is indeed a parallelogram. Hence, the complex number $z + w$ can be obtained geometrically from $z$ and $w$ by completing the parallelogram. This is sometimes called the parallelogram law of complex addition. Readers who have studied mechanics will recall that velocities and accelerations add in the same way; in fact, these are all special cases of vector addition.
Polar Form
The geometric description of what happens when two complex numbers are multiplied is at least as elegant as the parallelogram law of addition, but it requires that the complex numbers be represented in polar form. Before discussing this, we pause to recall the general definition of the trigonometric functions sine and cosine. An angle $\theta$ in the complex plane is in standard position if it is measured counterclockwise from the positive real axis as indicated in the figure below.
Rather than using degrees to measure angles, it is more natural to use radian measure. This is defined as follows: The circle with its centre at the origin and radius $1$ (called the unit circle) is drawn as in the above figure. It has circumference $2\pi$, and the radian measure of $\theta$ is the length of the arc on the unit circle counterclockwise from $1$ to the point $P$ on the unit circle determined by $\theta$. Hence $90^\circ = \frac{\pi}{2}$, $45^\circ = \frac{\pi}{4}$, $180^\circ = \pi$, and a full circle has the angle $360^\circ = 2\pi$. Angles measured clockwise from $1$ are negative; for example, $-i$ corresponds to $-\frac{\pi}{2}$ (or to $\frac{3\pi}{2}$).
Consider an angle $\theta$ in the range $0 \leq \theta \leq \frac{\pi}{2}$. If $\theta$ is plotted in standard position as in the above figure, it determines a unique point $P$ on the unit circle, and $P$ has coordinates ($\cos \theta$, $\sin \theta$) by elementary trigonometry. However, any angle $\theta$ (acute or not) determines a unique point on the unit circle, so we define the cosine and sine of $\theta$ (written $\cos \theta$ and $\sin \theta$) to be the $x$ and $y$ coordinates of this point. For example, the points \begin{equation*} \begin{array}{llll} 1=(1,0) & i=(0,1) & -1=(-1,0) & -i=(0,-1) \end{array} \end{equation*} plotted in the figure are determined by the angles $0$, $\frac{\pi}{2}$, $\pi$, $\frac{3\pi}{2}$, respectively. Hence \begin{equation*} \begin{array}{lllllll} \cos 0 = 1 & \quad & \cos \frac{\pi }{2} = 0 & \quad & \cos \pi = -1 & \quad & \cos \frac{3\pi }{2} = 0\\ \sin 0 = 0 & & \sin \frac{\pi }{2} = 1 & & \sin \pi = 0 & &\sin \frac{3\pi }{2} = -1 \end{array} \end{equation*}
Now we can describe the polar form of a complex number. Let $z = a + bi$ be a complex number, and write the absolute value of $z$ as \begin{equation*} r = |z| = \sqrt{a^2+b^2} \end{equation*}
If $z \neq 0$, the angle $\theta$ shown in the figure below is called an argument of $z$ and is denoted \begin{equation*} \theta = \mbox{arg} z \end{equation*}
This angle is not unique ($\theta + 2\pi k$ would do as well for any $k = 0, \pm 1, \pm 2, \dots$). However, there is only one argument $\theta$ in the range $-\pi < \theta \leq \pi$, and this is sometimes called the principal argument of $z$.
Referring to the figure below, we find that the real and imaginary parts $a$ and $b$ of $z$ are related to $r$ and $\theta$ by \begin{align*} a &= r \cos \theta \\ b &= r \sin \theta \end{align*}
Hence the complex number $z = a + bi$ has the form \begin{equation*} z = r(\cos \theta + i \sin \theta ) \quad \mbox{where}\quad r = |z|\quad \mbox{and}\quad \theta = \mbox{arg}(z) \end{equation*} The combination $\cos \theta + i \sin \theta$ is so important that a special notation is used: \begin{equation*} e^{i\theta } = \cos \theta + i \sin \theta \end{equation*} This is called Euler’s formula after the great Swiss mathematician Leonhard Euler (1707–1783). With this notation, $z$ is written \begin{equation*} z = r e^{i \theta } \quad \mbox{where}\quad r = |z|\quad \mbox{and}\quad \theta = \mbox{arg}(z) \end{equation*} This is a polar form of the complex number $z$. Of course it is not unique, because the argument can be changed by adding a multiple of $2\pi$.
The two numbers $z_1 = -2 + 2i$ and $z_2 = -i$ are plotted in the complex plane, as shown in the figure below.
The absolute values are \begin{align*} r_1 &= |-2 + 2i| = \sqrt{(-2)^2 + 2^2} = 2\sqrt{2}\\ r_2 &= |-i| = \sqrt{0^2 + (-1)^2} = 1 \end{align*}
By inspection, arguments of $z_1$ and $z_2$ are \begin{align*} \theta _1 &= \mbox{arg}(-2+2i) = \frac{3\pi }{4}\\ \theta _2 &= \mbox{arg}(-i) = \frac{3\pi }{2} \end{align*}
The corresponding polar forms are $z_1 = -2 + 2i = 2\sqrt{2}e^{3\pi i/4}$ and $z_2 = -i = e^{3\pi i/2}$. Of course, we could have taken the argument $-\frac{\pi}{2}$ for $z_2$ and obtained the polar form $z_2 = e^{-\pi i/2}$.
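These polar forms can be reproduced with Python's standard `cmath` module, as in the sketch below (an illustration, not part of the original text). `cmath.polar` returns the absolute value together with the principal argument, so for $-i$ it reports $-\frac{\pi}{2}$ rather than $\frac{3\pi}{2}$; `cmath.rect` converts back from polar to rectangular form.

```python
import cmath
import math

z1 = complex(-2, 2)
z2 = complex(0, -1)

r1, theta1 = cmath.polar(z1)   # (absolute value, principal argument)
r2, theta2 = cmath.polar(z2)

print(r1, theta1)   # compare with 2*sqrt(2) and 3*pi/4
print(r2, theta2)   # compare with 1 and -pi/2

# Both arguments of -i describe the same point of the plane.
same_point = cmath.isclose(cmath.rect(1, 3 * math.pi / 2),
                           cmath.rect(1, -math.pi / 2))
print(same_point)
```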
In Euler’s formula $e^{i\theta} = \cos \theta + i \sin \theta$, the number $e$ is the familiar constant $e = 2.71828\dots$ from calculus. The reason for using $e$ will not be given here; the reason why $\cos \theta + i \sin \theta$ is written as an exponential function of $\theta$ is that the law of exponents holds: \begin{equation*} e^{i\theta } \cdot e^{i\phi } = e^{i (\theta + \phi )} \end{equation*} where $\theta$ and $\phi$ are any two angles. In fact, this is an immediate consequence of the addition identities for $\sin (\theta + \phi)$ and $\cos (\theta + \phi)$:
\begin{align*} e^{i\theta } e^{i\phi } &= (\cos \theta + i \sin \theta ) (\cos \phi + i \sin \phi ) \\ &= (\cos \theta \cos \phi - \sin \theta \sin \phi ) + i (\cos \theta \sin \phi + \sin \theta \cos \phi ) \\ &= \cos (\theta + \phi ) +i \sin (\theta + \phi ) \\ & =e^{i (\theta + \phi )} \end{align*}
This is analogous to the rule $e^a e^b = e^{a+b}$, which holds for real numbers $a$ and $b$, so it is not unnatural to use the exponential notation $e^{i\theta}$ for the expression $\cos \theta + i \sin \theta$. In fact, a whole theory exists wherein functions such as $e^z$, $\sin z$, and $\cos z$ are studied, where $z$ is a complex variable. Many deep and beautiful theorems can be proved in this theory, one of which is the so-called fundamental theorem of algebra mentioned later (Theorem th:034196). We shall not pursue this here.
The geometric description of the multiplication of two complex numbers follows from the law of exponents: if $z_1 = r_1e^{i\theta_1}$ and $z_2 = r_2e^{i\theta_2}$ are complex numbers in polar form, then $z_1z_2 = r_1r_2e^{i(\theta_1 + \theta_2)}$.
In other words, to multiply two complex numbers, simply multiply the absolute values and add the arguments. This simplifies calculations considerably, particularly when we observe that it is valid for any arguments $\theta_1$ and $\theta_2$.
To multiply $(1-i)(1+\sqrt{3}i)$ using polar form, first write each factor in polar form: \begin{align*} & 1-i = \sqrt{2} e^{-i\pi /4} \\ & 1+ \sqrt{3}i = 2e^{i\pi /3} \end{align*}
Hence, by Theorem th:034029, \begin{align*} (1-i)(1+\sqrt{3}i) &= (\sqrt{2} e^{-i\pi /4})(2e^{i\pi /3}) \\ &= 2\sqrt{2} e^{i(-\pi /4 + \pi /3)} \\ &= 2 \sqrt{2} e^{i\pi /12} \end{align*}
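This gives the required product in polar form. Of course, direct multiplication gives $(1-i)(1+\sqrt{3}i) = (\sqrt{3}+1) + (\sqrt{3}-1)i$. Hence, equating real and imaginary parts gives the formulas $\cos (\frac{\pi}{12}) = \frac{\sqrt{3}+1}{2\sqrt{2}}$ and $\sin (\frac{\pi}{12}) = \frac{\sqrt{3}-1}{2\sqrt{2}}$.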
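The agreement between the polar and the direct computation of $(1-i)(1+\sqrt{3}i)$ can be confirmed numerically. The sketch below is an illustration using the standard `cmath` and `math` modules; the variable names are chosen here and do not come from the original text.

```python
import cmath
import math

# Polar data of the two factors: 1 - i and 1 + sqrt(3) i
r1, t1 = math.sqrt(2), -math.pi / 4
r2, t2 = 2.0, math.pi / 3

# Multiply the absolute values and add the arguments.
polar_product = cmath.rect(r1 * r2, t1 + t2)

# Multiply directly in rectangular form.
direct_product = complex(1, -1) * complex(1, math.sqrt(3))

print(polar_product)
print(direct_product)

# The resulting closed forms for cos(pi/12) and sin(pi/12).
cos_formula = (math.sqrt(3) + 1) / (2 * math.sqrt(2))
sin_formula = (math.sqrt(3) - 1) / (2 * math.sqrt(2))
print(math.cos(math.pi / 12), cos_formula)
print(math.sin(math.pi / 12), sin_formula)
```

Each printed pair should agree up to floating-point rounding.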
Roots of Unity
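If a complex number $z = re^{i\theta}$ is given in polar form, the powers assume a particularly simple form. In fact, $z^2 = (re^{i\theta})(re^{i\theta}) = r^2e^{2i\theta}$, $z^3 = z^2 \cdot z = (r^2e^{2i\theta})(re^{i\theta}) = r^3e^{3i\theta}$, and so on. Continuing in this way, it follows by induction that the following theorem holds for any positive integer $n$. The name honors Abraham De Moivre (1667–1754). In symbols, De Moivre’s theorem states that $(e^{i\theta})^n = e^{in\theta}$ for every angle $\theta$ and every integer $n$.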
- Proof
- The case $n > 0$ has been discussed, and the reader can verify the result for $n = 0$. To derive it for $n < 0$, first observe that \begin{equation*} \mbox{if } \quad z = re^{i\theta }\neq 0 \quad \mbox{ then } \quad z^{-1} = \frac{1}{r}~e^{-i\theta } \end{equation*} In fact, $(re^{i\theta})(\frac{1}{r}e^{-i\theta}) = 1e^{i0} = 1$ by the multiplication rule. Now assume that $n$ is negative and write it as $n = -m$, $m > 0$. Then \begin{equation*} (re^{i\theta })^n = [(re^{i\theta })^{-1}]^m = (\frac{1}{r}~e^{-i\theta })^m = r^{-m} e^{i(-m\theta )}=r^ne^{in\theta } \end{equation*} If $r = 1$, this is De Moivre’s theorem for negative $n$.
De Moivre’s theorem can be used to find $n$th roots of complex numbers where $n$ is positive. The next example illustrates this technique.
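De Moivre's theorem, $(e^{i\theta})^n = e^{in\theta}$, can be spot-checked for positive, zero, and negative exponents. The following Python sketch is an illustration only: the function name `de_moivre_holds` and the sample angle are assumptions made here, and the comparison uses `cmath.isclose` because the arithmetic is floating-point.

```python
import cmath


def de_moivre_holds(theta, n):
    """Compare (e^{i theta})^n with e^{i n theta} for an integer n."""
    lhs = cmath.exp(1j * theta) ** n
    rhs = cmath.exp(1j * n * theta)
    return cmath.isclose(lhs, rhs)


# Sample angle (arbitrary) and a range of exponents including negatives.
theta = 0.7
results = [de_moivre_holds(theta, n) for n in range(-6, 7)]
print(all(results))
```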
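To find the cube roots of unity, that is, the complex numbers $z$ with $z^3 = 1$, write $z = re^{i\theta}$ in polar form. By De Moivre’s theorem the equation becomes $r^3e^{3i\theta} = 1e^{0i}$, so $r^3 = 1$ and $3\theta = 0 + 2k\pi$ for some integer $k$. Because $r$ is real and positive, the condition $r^3 = 1$ implies that $r = 1$. However, \begin{equation*} \theta = \frac{2k\pi }{3}, \quad k \mbox{ some integer} \end{equation*}
seems at first glance to yield infinitely many different angles for $z$. However, choosing $k = 0, 1, 2$ gives three possible arguments $\theta$ (where $0 \leq \theta < 2\pi$), and the corresponding roots are \begin{align*} 1e^{0i} & = 1 \\ 1e^{2\pi i/3} & = -\frac{1}{2} + \frac{\sqrt{3}}{2} i \\ 1e^{4\pi i/3} & = -\frac{1}{2} - \frac{\sqrt{3}}{2} i \end{align*}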
These are displayed in the figure below.
All other values of $k$ yield values of $\theta$ that differ from one of these by a multiple of $2\pi$, and so do not give new roots. Hence we have found all the roots.
The same type of calculation gives all complex $n$th roots of unity; that is, all complex numbers $z$ such that $z^n = 1$. As before, write $1 = 1e^{0i}$ and \begin{equation*} z = re^{i\theta } \end{equation*} in polar form. Then $z^n = 1$ takes the form \begin{equation*} r^ne^{ni\theta } = 1e^{0i} \end{equation*} using De Moivre’s theorem. Comparing absolute values and arguments yields \begin{align*} r^n &= 1 \\ n\theta & = 0 + 2k\pi , \quad k \mbox{ some integer} \end{align*}
Hence $r = 1$, and the $n$ values \begin{equation*} \theta = \frac{2k\pi }{n}, \quad k=0, 1, 2, \dots , n-1 \end{equation*} of $\theta$ all lie in the range $0 \leq \theta < 2\pi$. As in Example ex:034107, every choice of $k$ yields a value of $\theta$ that differs from one of these by a multiple of $2\pi$, so these give the arguments of all the possible roots.
The $n$th roots of unity can be found geometrically as the points on the unit circle that cut the circle into $n$ equal sectors, starting at $1$. The case $n = 5$ is shown in the figure below, where the five fifth roots of unity are plotted.
The method just used to find the $n$th roots of unity works equally well to find the $n$th roots of any complex number in polar form. We give one example.
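The polar-form method for $n$th roots translates directly into a short program. The sketch below is an illustration under stated assumptions: the function name `nth_roots` is hypothetical, the input `w` is assumed to be nonzero, and the sample inputs are chosen here rather than taken from the text. Writing $w = re^{i\theta}$, the roots are $r^{1/n}e^{i(\theta + 2k\pi)/n}$ for $k = 0, 1, \dots, n-1$.

```python
import cmath
import math


def nth_roots(w, n):
    """Return the n distinct complex nth roots of a nonzero number w."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    if w == 0:
        raise ValueError("w is assumed to be nonzero")
    r, theta = cmath.polar(w)          # w = r e^{i theta}
    root_r = r ** (1.0 / n)            # positive real nth root of r
    return [cmath.rect(root_r, (theta + 2 * math.pi * k) / n)
            for k in range(n)]


# The cube roots of unity, then the fourth roots of -4.
for z in nth_roots(1, 3):
    print(z)
for z in nth_roots(-4, 4):
    print(z)
```

Taking `w = 1` recovers the $n$th roots of unity. Raising each returned value to the $n$th power should reproduce `w` up to rounding error.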
An expression of the form $ax^2 + bx + c$, where the coefficients $a \neq 0$, $b$, and $c$ are real numbers, is called a real quadratic. A complex number $u$ is called a root of the quadratic if $au^2 + bu + c = 0$. The roots are given by the famous quadratic formula: \begin{equation*} u = \frac{-b \pm \sqrt{b^2 - 4ac}}{2a} \end{equation*} The quantity $d = b^2 - 4ac$ is called the discriminant of the quadratic $ax^2 + bx + c$, and there is no real root if and only if $d < 0$. In this case the quadratic is said to be irreducible. Moreover, the fact that $d < 0$ means that $\sqrt{d} = i\sqrt{|d|}$, so the two (complex) roots are conjugates of each other: \begin{equation*} u = \frac{1}{2a}(-b+i\sqrt{|d|}) \quad \mbox{ and } \quad \overline{u} = \frac{1}{2a}(-b-i\sqrt{|d|}) \end{equation*} The converse of this is true too: Given any nonreal complex number $u$, then $u$ and $\overline{u}$ are the roots of some real irreducible quadratic. Indeed, the quadratic \begin{equation*} x^2 - (u + \overline{u})x + u \overline{u} = (x-u)(x-\overline{u}) \end{equation*} has real coefficients ($u\overline{u} = |u|^2$ and $u + \overline{u}$ is twice the real part of $u$) and so is irreducible because its roots $u$ and $\overline{u}$ are not real.
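Both directions of this discussion can be illustrated in code. In the sketch below the function names `irreducible_roots` and `quadratic_from_root` are hypothetical helpers introduced for illustration; the first applies the quadratic formula when the discriminant is negative, and the second builds the real quadratic $x^2 - (u+\overline{u})x + u\overline{u}$ from a nonreal number $u$.

```python
import math


def irreducible_roots(a, b, c):
    """Return the conjugate pair of roots of ax^2 + bx + c when d < 0."""
    d = b * b - 4 * a * c
    if a == 0 or d >= 0:
        raise ValueError("expected a real quadratic with negative discriminant")
    u = complex(-b / (2 * a), math.sqrt(-d) / (2 * a))
    return u, u.conjugate()


def quadratic_from_root(u):
    """Real coefficients (1, -(u + conj(u)), u*conj(u)) with roots u, conj(u)."""
    return 1.0, -2 * u.real, u.real ** 2 + u.imag ** 2


u, u_bar = irreducible_roots(1, 2, 5)   # x^2 + 2x + 5 has d = -16
print(u, u_bar)
print(quadratic_from_root(u))           # recovers the coefficients 1, 2, 5
```

For $x^2 + 2x + 5$ the discriminant is $d = 4 - 20 = -16$, so the roots are $-1 \pm 2i$.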
Fundamental Theorem of Algebra
As we mentioned earlier, the complex numbers are the culmination of a long search by mathematicians to find a set of numbers large enough to contain a root of every polynomial. The fact that the complex numbers have this property was first proved by Gauss in 1797 when he was 20 years old. The proof is omitted.
If $f(x)$ is a polynomial with complex coefficients, and if $u_1$ is a root, then the Factor Theorem (see Precalculus by Stitz-Zeager, for instance) asserts that \begin{equation*} f(x) = (x-u_1)g(x) \end{equation*} where $g(x)$ is a polynomial with complex coefficients and with degree one less than the degree of $f(x)$. Suppose that $u_2$ is a root of $g(x)$, again by the fundamental theorem. Then $g(x) = (x-u_2)h(x)$, so \begin{equation*} f(x) = (x-u_1)(x-u_2)h(x) \end{equation*} This process continues until the last polynomial to appear is linear. Thus $f(x)$ has been expressed as a product of linear factors. The last of these factors can be written in the form $u(x - u_n)$, where $u$ and $u_n$ are complex (verify this), so the fundamental theorem takes the following form.
This form of the fundamental theorem, when applied to a polynomial with real coefficients, can be used to deduce the following result: every nonconstant polynomial with real coefficients is a product of linear and irreducible quadratic factors, each with real coefficients.
In fact, suppose $f(x)$ has the form \begin{equation*} f(x) = a_nx^n + a_{n-1}x^{n-1} + \ldots + a_1x + a_0 \end{equation*} where the coefficients $a_i$ are real. If $u$ is a complex root of $f(x)$, then we claim first that $\overline{u}$ is also a root. In fact, we have $f(u) = 0$, so \begin{align*} 0 = \overline{0} = \overline{f(u)} & = \overline{a_nu^n + a_{n-1}u^{n-1} + \ldots + a_1u + a_0 } \\ & = \overline{a_nu^n} + \overline{a_{n-1}u^{n-1}} + \ldots + \overline{a_1u} + \overline{a_0 } \\ & = \overline{a}_n\overline{u}^n + \overline{a}_{n-1}\overline{u}^{n-1} + \ldots + \overline{a}_1\overline{u} + \overline{a}_0 \\ & = a_n\overline{u}^n + a_{n-1}\overline{u}^{n-1} + \ldots + a_1\overline{u} + a_0 \\ &= f(\overline{u}) \end{align*}
where $\overline{a}_i = a_i$ for each $i$ because the coefficients $a_i$ are real. Thus if $u$ is a root of $f(x)$, so is its conjugate $\overline{u}$. Of course some of the roots of $f(x)$ may be real (and so equal their conjugates), but the nonreal roots come in pairs, $u$ and $\overline{u}$. By Theorem thm:034221, we can thus write $f(x)$ as a product: \begin{equation} \label{eq:complexproduct} f(x) = a_n(x-r_1)\cdots (x-r_k)(x-u_1)(x-\overline{u}_1)\cdots (x-u_m)(x-\overline{u}_m) \end{equation} where $a_n$ is the coefficient of $x^n$ in $f(x)$; $r_1, r_2, \dots, r_k$ are the real roots; and $u_1, \overline{u}_1, u_2, \overline{u}_2, \dots, u_m, \overline{u}_m$ are the nonreal roots. But the product \begin{equation*} (x-u_j)(x-\overline{u}_j) = x^2 - (u_j + \overline{u}_j)x +(u_j \overline{u}_j) \end{equation*} is a real irreducible quadratic for each $j$ (see the discussion preceding Example ex:034182). Hence (eq:complexproduct) shows that $f(x)$ is a product of linear and irreducible quadratic factors, each with real coefficients. This is the conclusion in Theorem th:034221.
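The key step in this argument, that $f(\overline{u}) = \overline{f(u)}$ when the coefficients are real, can be observed numerically. The sketch below is an illustration only: the helper `evaluate` (Horner's rule) and the sample polynomial $f(x) = x^3 - x^2 + x - 1 = (x-1)(x^2+1)$ are choices made here, not taken from the original text.

```python
def evaluate(coeffs, x):
    """Evaluate a polynomial at x by Horner's rule.

    coeffs lists the coefficients from the highest power down to the
    constant term.
    """
    result = 0
    for c in coeffs:
        result = result * x + c
    return result


# f(x) = x^3 - x^2 + x - 1 has real coefficients and roots 1, i, -i.
coeffs = [1, -1, 1, -1]

u = complex(0, 1)
print(evaluate(coeffs, u), evaluate(coeffs, u.conjugate()))

# For any complex v, f(conj(v)) equals conj(f(v)).
v = complex(2, 3)
print(evaluate(coeffs, v.conjugate()), evaluate(coeffs, v).conjugate())
```

The first line of output shows that $i$ and its conjugate $-i$ are both roots; the second shows the two sides of $f(\overline{v}) = \overline{f(v)}$ for a number $v$ that is not a root.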
Practice Problems
- (a)
- Let $z_1$, $z_2$, $z_3$, $z_4$, and $z_5$ be equally spaced around the unit circle. Show that $z_1 + z_2 + z_3 + z_4 + z_5 = 0$.
- (b)
- Repeat (a) for any $n \geq 2$ points equally spaced around the unit circle.
- (c)
- If $|w| = 1$, show that the sum of the roots of $z^n = w$ is zero.
- (a)
- (challenge problem) only if and
- (b)
- (c)
- (d)
- (e)
- (f)
- (challenge problem) If is a polynomial with rational coefficients and is a root of , then is also a root of .
Text Source
This section was adapted from Appendix A of Keith Nicholson’s Linear Algebra with Applications. (CC-BY-NC-SA)
W. Keith Nicholson, Linear Algebra with Applications, Lyryx 2018, Open Edition, pp. 581–594.