Complex Matrices

$\newenvironment {prompt}{}{} \newcommand {\ungraded }[0]{} \newcommand {\npnoround }[0]{\nprounddigits {-1}} \newcommand {\npnoroundexp }[0]{\nproundexpdigits {-1}} \newcommand {\npunitcommand }[1]{\ensuremath {\mathrm {#1}}} \newcommand {\tdplotsinandcos }[3]{\pgfmathsetmacro {#1}{sin(#3)}\pgfmathsetmacro {#2}{cos(#3)}} \newcommand {\tdplotmult }[3]{\pgfmathsetmacro {#1}{#2*#3}} \newcommand {\tdplotdiv }[3]{\pgfmathsetmacro {#1}{#2/#3}} \newcommand {\tdplotcheckdiff }[5]{\par \par \pgfmathparse { abs(#2 -#1)<#3 } \par \ifthenelse {\equal {\pgfmathresult }{1}}{#4}{#5} } \newcommand {\tdplotsetmaincoords }[2]{\pgfmathsetmacro {\tdplotmaintheta }{#1} \pgfmathsetmacro {\tdplotmainphi }{#2} \tdplotcalctransformmainscreen \tikzset {tdplot_main_coords/.style={x={(\raarot cm,\rbarot cm)},y={(\rabrot cm,\rbbrot cm)},z={(\racrot cm,\rbcrot cm)}}}} \newcommand {\tdplotcalctransformmainscreen }[0]{\tdplotsinandcos {\sintheta }{\costheta }{\tdplotmaintheta }\tdplotsinandcos {\sinphi }{\cosphi }{\tdplotmainphi }\tdplotmult {\stsp }{\sintheta }{\sinphi }\tdplotmult {\stcp }{\sintheta }{\cosphi }\tdplotmult {\ctsp }{\costheta }{\sinphi }\tdplotmult {\ctcp }{\costheta }{\cosphi }\pgfmathsetmacro {\raarot }{\cosphi }\pgfmathsetmacro {\rabrot }{\sinphi }\pgfmathsetmacro {\racrot }{0}\pgfmathsetmacro {\rbarot }{-\ctsp }\pgfmathsetmacro {\rbbrot }{\ctcp }\pgfmathsetmacro {\rbcrot }{\sintheta }\pgfmathsetmacro {\rcarot }{\stsp }\pgfmathsetmacro {\rcbrot }{-\stcp }\pgfmathsetmacro {\rccrot }{\costheta }} \newcommand {\tdplotcalctransformrotmain }[0]{\tdplotsinandcos {\sinalpha }{\cosalpha }{\tdplotalpha } \tdplotsinandcos {\sinbeta }{\cosbeta }{\tdplotbeta } \tdplotsinandcos {\singamma }{\cosgamma }{\tdplotgamma } \tdplotmult {\sasb }{\sinalpha }{\sinbeta } \tdplotmult {\sbsg }{\sinbeta }{\singamma } \tdplotmult {\sasg }{\sinalpha }{\singamma } \tdplotmult {\sasbsg }{\sasb }{\singamma } \tdplotmult {\sacb }{\sinalpha }{\cosbeta } \tdplotmult {\sacg }{\sinalpha }{\cosgamma } \tdplotmult {\sbcg }{\sinbeta }{\cosgamma } \tdplotmult {\sacbsg }{\sacb }{\singamma } \tdplotmult {\sacbcg }{\sacb }{\cosgamma } \tdplotmult {\casb }{\cosalpha }{\sinbeta } \tdplotmult {\cacb }{\cosalpha }{\cosbeta } \tdplotmult {\cacg }{\cosalpha }{\cosgamma } \tdplotmult {\casg }{\cosalpha }{\singamma } \tdplotmult {\cacbsg }{\cacb }{\singamma } \tdplotmult {\cacbcg }{\cacb }{\cosgamma } \pgfmathsetmacro {\raaeul }{\cacbcg -\sasg } \pgfmathsetmacro {\rabeul }{-\cacbsg -\sacg } \pgfmathsetmacro {\raceul }{\casb } \pgfmathsetmacro {\rbaeul }{\sacbcg + \casg } \pgfmathsetmacro {\rbbeul }{-\sacbsg + \cacg } \pgfmathsetmacro {\rbceul }{\sasb } \pgfmathsetmacro {\rcaeul }{-\sbcg } \pgfmathsetmacro {\rcbeul }{\sbsg } \pgfmathsetmacro {\rcceul }{\cosbeta } } \newcommand {\tdplotcalctransformmainrot }[0]{\tdplotsinandcos {\sinalpha }{\cosalpha }{\tdplotalpha } \tdplotsinandcos {\sinbeta }{\cosbeta }{\tdplotbeta } \tdplotsinandcos {\singamma }{\cosgamma }{\tdplotgamma } \tdplotmult {\sasb }{\sinalpha }{\sinbeta } \tdplotmult {\sbsg }{\sinbeta }{\singamma } \tdplotmult {\sasg }{\sinalpha }{\singamma } \tdplotmult {\sasbsg }{\sasb }{\singamma } \tdplotmult {\sacb }{\sinalpha }{\cosbeta } \tdplotmult {\sacg }{\sinalpha }{\cosgamma } \tdplotmult {\sbcg }{\sinbeta }{\cosgamma } \tdplotmult {\sacbsg }{\sacb }{\singamma } \tdplotmult {\sacbcg }{\sacb }{\cosgamma } \tdplotmult {\casb }{\cosalpha }{\sinbeta } \tdplotmult {\cacb }{\cosalpha }{\cosbeta } \tdplotmult {\cacg }{\cosalpha }{\cosgamma } \tdplotmult {\casg }{\cosalpha }{\singamma } \tdplotmult {\cacbsg }{\cacb }{\singamma } \tdplotmult {\cacbcg }{\cacb }{\cosgamma } \pgfmathsetmacro {\raaeul }{\cacbcg -\sasg } \pgfmathsetmacro {\rabeul }{\sacbcg + \casg } \pgfmathsetmacro {\raceul }{-\sbcg } \pgfmathsetmacro {\rbaeul }{-\cacbsg -\sacg } \pgfmathsetmacro {\rbbeul }{-\sacbsg + \cacg } \pgfmathsetmacro {\rbceul }{\sbsg } \pgfmathsetmacro {\rcaeul }{\casb } \pgfmathsetmacro {\rcbeul }{\sasb } \pgfmathsetmacro {\rcceul }{\cosbeta } } \newcommand {\tdplottransformmainrot }[3]{\tdplotcalctransformmainrot \par \pgfmathsetmacro {\tdplotresx }{\raaeul * #1 + \rabeul * #2 + \raceul * #3} \pgfmathsetmacro {\tdplotresy }{\rbaeul * #1 + \rbbeul * #2 + \rbceul * #3} \pgfmathsetmacro {\tdplotresz }{\rcaeul * #1 + \rcbeul * #2 + \rcceul * #3} } \newcommand {\tdplottransformrotmain }[3]{\tdplotcalctransformrotmain \par \pgfmathsetmacro {\tdplotresx }{\raaeul * #1 + \rabeul * #2 + \raceul * #3} \pgfmathsetmacro {\tdplotresy }{\rbaeul * #1 + \rbbeul * #2 + \rbceul * #3} \pgfmathsetmacro {\tdplotresz }{\rcaeul * #1 + \rcbeul * #2 + \rcceul * #3} } \newcommand {\tdplottransformmainscreen }[3]{\tdplotcalctransformmainscreen \par \pgfmathsetmacro {\tdplotresx }{\raarot * #1 + \rabrot * #2 + \racrot * #3} \pgfmathsetmacro {\tdplotresy }{\rbarot * #1 + \rbbrot * #2 + \rbcrot * #3} } \newcommand {\tdplotsetrotatedcoords }[3]{\pgfmathsetmacro {\tdplotalpha }{#1} \pgfmathsetmacro {\tdplotbeta }{#2} \pgfmathsetmacro {\tdplotgamma }{#3} \tdplotcalctransformrotmain \par \tdplotmult {\raaeaa }{\raarot }{\raaeul } \tdplotmult {\rabeba }{\rabrot }{\rbaeul } \tdplotmult {\raceca }{\racrot }{\rcaeul } \tdplotmult {\raaeab }{\raarot }{\rabeul } \tdplotmult {\rabebb }{\rabrot }{\rbbeul } \tdplotmult {\racecb }{\racrot }{\rcbeul } \tdplotmult {\raaeac }{\raarot }{\raceul } \tdplotmult {\rabebc }{\rabrot }{\rbceul } \tdplotmult {\racecc }{\racrot }{\rcceul } \tdplotmult {\rbaeaa }{\rbarot }{\raaeul } \tdplotmult {\rbbeba }{\rbbrot }{\rbaeul } \tdplotmult {\rbceca }{\rbcrot }{\rcaeul } \tdplotmult {\rbaeab }{\rbarot }{\rabeul } \tdplotmult {\rbbebb }{\rbbrot }{\rbbeul } \tdplotmult {\rbcecb }{\rbcrot }{\rcbeul } \tdplotmult {\rbaeac }{\rbarot }{\raceul } \tdplotmult {\rbbebc }{\rbbrot }{\rbceul } \tdplotmult {\rbcecc }{\rbcrot }{\rcceul } \pgfmathsetmacro {\raarc }{\raaeaa + \rabeba + \raceca } \pgfmathsetmacro {\rabrc }{\raaeab + \rabebb + \racecb } \pgfmathsetmacro {\racrc }{\raaeac + \rabebc + \racecc } \pgfmathsetmacro {\rbarc }{\rbaeaa + \rbbeba + \rbceca } \pgfmathsetmacro {\rbbrc }{\rbaeab + \rbbebb + \rbcecb } \pgfmathsetmacro {\rbcrc }{\rbaeac + \rbbebc + \rbcecc } \tikzset {tdplot_rotated_coords/.append style={x={(\raarc cm,\rbarc cm)},y={(\rabrc cm,\rbbrc cm)},z={(\racrc cm,\rbcrc cm)}}}} \newcommand {\tdplotsetrotatedcoordsorigin }[1]{\tikzset {tdplot_rotated_coords/.append style={shift=#1}}} \newcommand {\tdplotresetrotatedcoordsorigin }[0]{\tikzset {tdplot_rotated_coords/.append style={shift={(0,0,0)}}}} \newcommand {\tdplotsetthetaplanecoords }[1]{\tdplotresetrotatedcoordsorigin \tdplotsetrotatedcoords {270 + #1}{270}{0}} \newcommand {\tdplotsetrotatedthetaplanecoords }[1]{\tdplotsetrotatedcoords {\tdplotalpha }{\tdplotbeta }{\tdplotgamma + #1}\tikzset {tdplot_rotated_coords/.append style={y={(\raarc cm,\rbarc cm)},z={(\rabrc cm,\rbbrc cm)},x={(\racrc cm,\rbcrc cm)}}}} \newcommand {\tdplotsetcoord }[4]{\tdplotsinandcos {\sinthetavec }{\costhetavec }{#3}\tdplotsinandcos {\sinphivec }{\cosphivec }{#4}\tdplotmult {\stcpv }{\sinthetavec }{\cosphivec }\tdplotmult {\stspv }{\sinthetavec }{\sinphivec }\coordinate (#1) at ($#2*(\stcpv ,\stspv ,\costhetavec )$); \coordinate (#1xy) at ($#2*(\stcpv ,\stspv ,0)$); \coordinate (#1xz) at ($#2*(\stcpv ,0,\costhetavec )$); \coordinate (#1yz) at ($#2*(0,\stspv ,\costhetavec )$); \coordinate (#1x) at ($#2*(\stcpv ,0,0)$); \coordinate (#1y) at ($#2*(0,\stspv ,0)$); \coordinate (#1z) at ($#2*(0,0,\costhetavec )$); } \newcommand {\tdplotsimplesetcoord }[4]{\tdplotsinandcos {\sinthetavec }{\costhetavec }{#3}\tdplotsinandcos {\sinphivec }{\cosphivec }{#4}\tdplotmult {\stcpv }{\sinthetavec }{\cosphivec }\tdplotmult {\stspv }{\sinthetavec }{\sinphivec }\coordinate (#1) at ($#2*(\stcpv ,\stspv ,\costhetavec )$); } \newcommand {\tdplotsetpolarplotrange }[4]{\pgfmathsetmacro {\tdplotlowerphi }{#3} \pgfmathsetmacro {\tdplotupperphi }{#4} \pgfmathsetmacro {\tdplotlowertheta }{#1} \pgfmathsetmacro {\tdplotuppertheta }{#2} } \newcommand {\tdplotresetpolarplotrange }[0]{\pgfmathsetmacro {\tdplotlowerphi }{0} \pgfmathsetmacro {\tdplotupperphi }{360} \pgfmathsetmacro {\tdplotlowertheta }{0} \pgfmathsetmacro {\tdplotuppertheta }{180} } \newcommand {\tdplotdosurfaceplot }[6]{\par \pgfmathsetmacro {\nextphi }{\curphi + \tdplotsuperfudge *\viewphistep } \par \begin {scope}[opacity=1] \par \par \tdplotcheckdiff {\nextphi }{360}{\origviewphistep }{#2}{} \tdplotcheckdiff {\nextphi }{0}{\origviewphistep }{#2}{} \par \tdplotcheckdiff {\nextphi }{90}{\origviewphistep }{#3}{} \tdplotcheckdiff {\nextphi }{450}{\origviewphistep }{#3}{} \end {scope} \par \foreach \curtheta in{\viewthetastart ,\viewthetainc ,...,\viewthetaend } { \par \pgfmathsetmacro {\curlongitude }{90 -\curphi } \pgfmathsetmacro {\curlatitude }{90 -\curtheta } \par \ifthenelse {\equal {\leftright }{-1.0}}{\pgfmathsetmacro {\curphi }{\curphi -\origviewphistep } }{} \par \pgfmathsetmacro {\tdplottheta }{mod(\curtheta ,360)} \pgfmathsetmacro {\tdplotphi }{mod(\curphi ,360)} \par \pgfmathparse {\tdplotphi <0} \ifthenelse {\equal {\pgfmathresult }{1}}{ \pgfmathsetmacro {\tdplotphi }{\tdplotphi + 360} }{}\par \pgfmathparse {\tdplottheta >\tdplotuppertheta } \pgfmathsetmacro {\logictest }{1 -\pgfmathresult } \par \pgfmathparse {\tdplottheta <\tdplotlowertheta } \pgfmathsetmacro {\logictest }{\logictest * (1 -\pgfmathresult )} \par \pgfmathsetmacro {\tdplottheta }{\tdplottheta + \viewthetastep } \pgfmathparse {\tdplottheta >\tdplotuppertheta } \pgfmathsetmacro {\logictest }{\logictest * (1 -\pgfmathresult )} \par \pgfmathparse {\tdplottheta <\tdplotlowertheta } \pgfmathsetmacro {\logictest }{\logictest * (1 -\pgfmathresult )} \par \pgfmathparse {\tdplotphi >\tdplotupperphi } \pgfmathsetmacro {\logictest }{\logictest * (1 -\pgfmathresult )} \par \pgfmathparse {\tdplotphi <\tdplotlowerphi } \pgfmathsetmacro {\logictest }{\logictest * (1 -\pgfmathresult )} \par \pgfmathsetmacro {\tdplotphi }{\tdplotphi + \viewphistep } \par \pgfmathparse {\tdplotphi <0} \ifthenelse {\equal {\pgfmathresult }{1}}{ \pgfmathsetmacro {\tdplotphi }{\tdplotphi + 360} }{}\par \pgfmathparse {\tdplotphi >\tdplotupperphi } \pgfmathsetmacro {\logictest }{\logictest * (1 -\pgfmathresult )} \par \pgfmathparse {\tdplotphi <\tdplotlowerphi } \pgfmathsetmacro {\logictest }{\logictest * (1 -\pgfmathresult )} \par \par \pgfmathsetmacro {\tdplottheta }{\curtheta } \pgfmathsetmacro {\tdplotphi }{\curphi } \par \ifthenelse {\equal {#6}{parametricfill}}{\ifthenelse {\equal {\logictest }{1.0}}{\pgfmathsetmacro {\radius }{#1} \pgfmathsetmacro {\tdplotr }{\radius *360} \par \pgfmathlessthan {\radius }{0} \pgfmathsetmacro {\phaseshift }{180 * \pgfmathresult } \par \pgfmathsetmacro {\colorarg }{#5} \pgfmathsetmacro {\colorarg }{\colorarg + \phaseshift } \pgfmathsetmacro {\colorarg }{mod(\colorarg ,360)} \par \pgfmathlessthan {\colorarg }{0} \pgfmathsetmacro {\colorarg }{\colorarg + 360*\pgfmathresult } \par \pgfmathdivide {\colorarg }{360} \definecolor {tdplotfillcolor}{hsb}{\pgfmathresult ,1,1} \color {tdplotfillcolor} }{}}{\pgfsetfillcolor {#5} } \pgfsetstrokecolor {#4} \par \ifthenelse {\equal {\leftright }{-1.0}}{\pgfmathsetmacro {\curphi }{\curphi + \origviewphistep } }{} \par \ifthenelse {\equal {\logictest }{1.0}}{\pgfmathsetmacro {\radius }{abs(#1)} \pgfpathmoveto {\pgfpointspherical {\curlongitude }{\curlatitude }{\radius }} \par \pgfmathsetmacro {\tdplotphi }{\curphi + \viewphistep } \pgfmathsetmacro {\radius }{abs(#1)} \pgfpathlineto {\pgfpointspherical {\curlongitude -\viewphistep }{\curlatitude }{\radius }} \par \pgfmathsetmacro {\tdplottheta }{\curtheta + \viewthetastep } \pgfmathsetmacro {\radius }{abs(#1)} \pgfpathlineto {\pgfpointspherical {\curlongitude -\viewphistep }{\curlatitude -\viewthetastep }{\radius }} \par \pgfmathsetmacro {\tdplotphi }{\curphi } \pgfmathsetmacro {\radius }{abs(#1)} \pgfpathlineto {\pgfpointspherical {\curlongitude }{\curlatitude -\viewthetastep }{\radius }} \pgfpathclose \par \pgfusepath {fill,stroke} }{} } } \newcommand {\tdplotshowargcolorguide }[4]{ \par \pgfmathsetmacro {\tdplotx }{#1} \pgfmathsetmacro {\tdploty }{#2} \pgfmathsetmacro {\tdplothuestep }{5} \pgfmathsetmacro {\tdplotxsize }{#3} \pgfmathsetmacro {\tdplotysize }{#4} \par \pgfmathsetmacro {\tdplotyscale }{\tdplotysize /360} \par \foreach \tdplotphi in {0,\tdplothuestep ,...,360} { \pgfmathdivide {\tdplotphi }{360} \definecolor {tdplotfillcolor}{hsb}{\pgfmathresult ,1,1} \color {tdplotfillcolor} \par \pgfmathsetmacro {\tdplotstarty }{\tdploty + \tdplotphi * \tdplotyscale } \pgfmathsetmacro {\tdplotstopy }{\tdplotstarty + \tdplothuestep * \tdplotyscale } \pgfmathsetmacro {\tdplotstartx }{\tdplotx } \pgfmathsetmacro {\tdplotstopx }{\tdplotx + \tdplotxsize } \filldraw [tdplot_screen_coords] (\tdplotstartx ,\tdplotstarty ) rectangle (\tdplotstopx ,\tdplotstopy ); } \par \pgfmathsetmacro {\tdplotstopy }{\tdploty + (360+\tdplothuestep )*\tdplotyscale } \pgfmathsetmacro {\tdplotstopx }{\tdplotx + \tdplotxsize } \par \draw [tdplot_screen_coords] (\tdplotx ,\tdploty ) rectangle (\tdplotstopx ,\tdplotstopy ); \par \node [tdplot_screen_coords,anchor=west,xshift=5pt] at (\tdplotstopx ,\tdploty ) {$0$}; \node [tdplot_screen_coords,anchor=west,xshift=5pt] at (\tdplotstopx ,\tdplotstopy ) {$2\pi $}; \par \pgfmathsetmacro {\tdplotstopy }{\tdploty + (360+\tdplothuestep )/2*\tdplotyscale } \node [tdplot_screen_coords,anchor=west,xshift=5pt] at (\tdplotstopx ,\tdplotstopy ) {$\pi $}; } \newcommand {\tdplotgetpolarcoords }[3]{\pgfmathsetmacro {\vxcalc }{#1} \pgfmathsetmacro {\vycalc }{#2} \pgfmathsetmacro {\vzcalc }{#3} \pgfmathsetmacro {\vcalc }{ sqrt((\vxcalc )^2 + (\vycalc )^2 + (\vzcalc )^2) } \par \pgfmathsetmacro {\vxycalc }{ sqrt((\vxcalc )^2 + (\vycalc )^2) } \par \pgfmathsetmacro {\tdplotrestheta }{asin(\vxycalc /\vcalc )} \pgfmathparse {\vzcalc <0} \ifthenelse {\equal {\pgfmathresult }{1}}{\pgfmathsetmacro {\tdplotrestheta }{180 -\tdplotrestheta } } {} \ifthenelse {\equal {\vxcalc }{0.0}}{\pgfmathparse {\vycalc <0} \ifthenelse {\equal {\pgfmathresult }{1}}{\pgfmathsetmacro {\tdplotresphi }{270} } {\pgfmathparse {\vycalc >0} \ifthenelse {\equal {\pgfmathresult }{1}}{\pgfmathsetmacro {\tdplotresphi }{90} } {\pgfmathsetmacro {\tdplotresphi }{0} } } } {\pgfmathsetmacro {\tdplotresphi }{atan(\vycalc /\vxcalc )} \pgfmathparse {\vxcalc <0} \ifthenelse {\equal {\pgfmathresult }{1}}{\pgfmathsetmacro {\tdplotresphi }{\tdplotresphi +180} } { } \par \pgfmathparse {\tdplotresphi <0} \ifthenelse {\equal {\pgfmathresult }{1}}{\pgfmathsetmacro {\tdplotresphi }{\tdplotresphi +360} } {} } } \newcommand {\vec }[0]{\mathbf } \newcommand {\RR }[0]{\mathbb {R}} \newcommand {\dfn }[0]{\textit } \newcommand {\dotp }[0]{\cdot } \newcommand {\id }[0]{\text {id}} \newcommand {\norm }[1]{\left \lVert #1\right \rVert } \newcommand {\mathtoolsset }[1]{\setkeys {\MT_options_name: }{#1}} \newcommand {\refeq }[1]{\textup {\ref {#1}}} \newcommand {\lparen }[0]{(} \newcommand {\rparen }[0]{)} \newcommand {\ordinarycolon }[0]{:} \newcommand {\MT_test_for_tcb_other:nnnnn }[1]{\if:w t#1\relax \expandafter \MH_use_choice_i:nnnn \else: \if:w c#1\relax \expandafter \expandafter \expandafter \MH_use_choice_ii:nnnn \else: \if:w b#1\relax \expandafter \expandafter \expandafter \expandafter \expandafter \expandafter \expandafter \MH_use_choice_iii:nnnn \else: \expandafter \expandafter \expandafter \expandafter \expandafter \expandafter \expandafter \MH_use_choice_iv:nnnn \fi: \fi: \fi: } \newcommand {\newcases }[6]{\newenvironment {#1}{\MT_start_cases:nnnn {#2}{#3}{#4}{#5}}{\MH_end_cases: \right #6}} \newcommand {\renewcases }[6]{\renewenvironment {#1}{\MT_start_cases:nnnn {#2}{#3}{#4}{#5}}{\MH_end_cases: \right #6}} \newcommand {\SwapAboveDisplaySkip }[0]{\noalign {\vskip -\abovedisplayskip \vskip \abovedisplayshortskip }} \newcommand {\vdotswithin }[1]{{\mathmakebox [\widthof {\ensuremath {{}#1{}}}][c]{{\vdots }}}} \newcommand {\MTFlushSpaceBelow }[0]{\\\noalign {\nobreak \vskip -\lineskip \vskip -\l_MT_shortvdotswithinadjustbelow_dim \vskip -\origjot \vskip \jot }} \newcommand {\mathmbox }[0]{\mathpalette \MT_mathmbox:nn } \newcommand {\prescript }[3]{\mathchoice {\MT_prescript_inner: {#1}{#2}{#3}{\scriptstyle }}{\MT_prescript_inner: {#1}{#2}{#3}{\scriptstyle }}{\MT_prescript_inner: {#1}{#2}{#3}{\scriptscriptstyle }}{\MT_prescript_inner: {#1}{#2}{#3}{\scriptscriptstyle }}} \newcommand {\spreadlines }[1]{\setlength {\jot }{#1}\ignorespaces } \newcommand {\newgathered }[4]{\newenvironment {#1}{\def \MT_gathered_pre: {#2}\def \MT_gathered_post: {#3}\def \MT_gathered_env_end: {#4}\MT_gathered_env }{\endMT_gathered_env }} \newcommand {\renewgathered }[4]{\renewenvironment {#1}{\def \MT_gathered_pre: {#2}\def \MT_gathered_post: {#3}\def \MT_gathered_env_end: {#4}\MT_gathered_env }{\endMT_gathered_env }} \newcommand {\lgathered }[0]{\def \MT_gathered_pre: {}\def \MT_gathered_post: {\hfil }\def \MT_gathered_env_end: {}\MT_gathered_env } \newcommand {\rgathered }[0]{\def \MT_gathered_pre: {\hfil }\def \MT_gathered_post: {}\def \MT_gathered_env_end: {}\MT_gathered_env } \newcommand {\gathered }[0]{\def \MT_gathered_pre: {\hfil }\def \MT_gathered_post: {\hfil }\def \MT_gathered_env_end: {}\MT_gathered_env } \newcommand {\splitfrac }[2]{\genfrac {}{}{0pt}{1}{\textstyle #1\quad \hfill }{\textstyle \hfill \quad \mathstrut #2}} \newcommand {\splitdfrac }[2]{\genfrac {}{}{0pt}{0}{#1\quad \hfill }{\hfill \quad \mathstrut #2}} \newcommand {\HyperFirstAtBeginDocument }[0]{\AtBeginDocument } \newcommand {\dblcolon }[0]{\vcentcolon \mathrel {\mkern -.9mu}\vcentcolon } \newcommand {\coloneqq }[0]{\vcentcolon \mathrel {\mkern -1.2mu}=} \newcommand {\Coloneqq }[0]{\dblcolon \mathrel {\mkern -1.2mu}=} \newcommand {\coloneq }[0]{\vcentcolon \mathrel {\mkern -1.2mu}\mathrel {-}} \newcommand {\Coloneq }[0]{\dblcolon \mathrel {\mkern -1.2mu}\mathrel {-}} \newcommand {\eqqcolon }[0]{=\mathrel {\mkern -1.2mu}\vcentcolon } \newcommand {\Eqqcolon }[0]{=\mathrel {\mkern -1.2mu}\dblcolon } \newcommand {\eqcolon }[0]{\mathrel {-}\mathrel {\mkern -1.2mu}\vcentcolon } \newcommand {\Eqcolon }[0]{\mathrel {-}\mathrel {\mkern -1.2mu}\dblcolon } \newcommand {\colonapprox }[0]{\vcentcolon \mathrel {\mkern -1.2mu}\approx } \newcommand {\Colonapprox }[0]{\dblcolon \mathrel {\mkern -1.2mu}\approx } \newcommand {\colonsim }[0]{\vcentcolon \mathrel {\mkern -1.2mu}\sim } \newcommand {\Colonsim }[0]{\dblcolon \mathrel {\mkern -1.2mu}\sim } \newcommand {\nuparrow }[0]{\MH_nuparrow: } \newcommand {\ndownarrow }[0]{\MH_ndownarrow: } \newcommand {\bigtimes }[0]{\MH_csym_bigtimes: }$

Complex Matrices

Nearly everything we have studied in this book would remain true if the phrase real number were replaced by complex number wherever it occurs. Then we would deal with matrices with complex entries, systems of linear equations with complex coefficients (and complex solutions), determinants of complex matrices, and vector spaces with scalar multiplication by any complex number allowed. Moreover, the proofs of most theorems about (the real version of) these concepts extend easily to the complex case. It is not our intention here to give a full treatment of complex linear algebra. However, we will carry the theory far enough to give another proof of the Real Spectral Theorem (th:PrinAxes).

The set of complex numbers is denoted $\mathbb {C}$ . We will use only the most basic properties of these numbers (mainly conjugation and absolute values), and the reader can find this material in Complex Numbers.

If $n \ge 1$ , we denote the set of all $n$ -tuples of complex numbers by $\mathbb {C}^n$ . As with $\RR ^n$ , these $n$ -tuples will be written either as row or column matrices and will be referred to as vectors. We define vector operations on $\mathbb {C}^n$ as follows:

$\begin{align*} [v_{1}, v_{2}, \ldots, v_{n}] + [w_{1}, w_{2}, \ldots, w_{n}] &= [v_{1} + w_{1}, v_{2} + w_{2}, \ldots, v_{n} + w_{n}] \\ u[v_{1}, v_{2}, \ldots, v_{n}] &= [uv_{1}, uv_{2}, \ldots, uv_{n}] \quad \mbox{ for } u \mbox{ in } \mathbb{C} \end{align*}$ With these definitions, $\mathbb {C}^n$ satisfies the axioms for a vector space (with complex scalars) given in Abstract Vector Spaces. Thus we can speak of spanning sets for $\mathbb {C}^n$ , of linearly independent subsets, and of bases. In all cases, the definitions are identical to the real case, except that the scalars are allowed to be complex numbers. In particular, the standard basis of $\RR ^n$ remains a basis of $\mathbb {C}^n$ , called the standard basis of $\mathbb {C}^n$ .

A Generalization of the Dot Product for Complex Vectors

There is a generalization to $\mathbb {C}^n$ of the dot product in $\RR ^n$ .

Given $\vec {z} = [z_{1}, z_{2}, \ldots , z_{n}]$ and $\vec {w} = [w_{1}, w_{2}, \ldots , w_{n}]$ in $\mathbb {C}^n$ , define their standard inner product $\langle \vec {z}, \vec {w} \rangle$ by $\begin{equation*} \langle \vec{z}, \vec{w} \rangle = z_{1}\overline{w}_{1} + z_{2}\overline{w}_{2} + \ldots + z_{n}\overline{w}_{n} \end{equation*}$ where $\overline {w}$ is the conjugate of the complex number $w$ .

Clearly, if $\vec {z}$ and $\vec {w}$ actually lie in $\RR ^n$ , then $\langle \vec {z}, \vec {w} \rangle = \vec {z} \dotp \vec {w}$ is the usual dot product.

If $\vec {z} = [2, 1 - i, 2i, 3 - i]$ and $\vec {w} = [1 - i, -1, -i, 3 + 2i]$ , then $\begin{align*} \langle \vec{z}, \vec{w} \rangle &= 2(1 + i) + (1 - i)(-1) + (2i)(i) + (3 - i)(3 - 2i) = 6 -6i \\ \langle \vec{z}, \vec{z} \rangle &= 2 \cdot 2 + (1 - i)(1 + i) + (2i)(-2i) + (3 - i)(3 + i) = 20 \end{align*}$

Note that $\langle \vec {z}, \vec {w} \rangle$ is a complex number in general, as opposed to requiring an inner product to be real as we do in Inner Product Spaces. However, if $\vec {w} = \vec {z} = [z_{1}, z_{2}, \ldots , z_{n}]$ , the definition gives $\langle \vec {z}, \vec {z} \rangle = |z_{1}|^{2} + \ldots + |z_{n}|^{2}$ which is a nonnegative real number, equal to $0$ if and only if $\vec {z} = \vec {0}$ . This explains the conjugation in the definition of $\langle \vec {z}, \vec {w} \rangle$ , and it gives th:025575d of the following theorem.

Let $\vec {z}$ , $\vec {z}_{1}$ , $\vec {w}$ , and $\vec {w}_{1}$ denote vectors in $\mathbb {C}^n$ , and let $\lambda$ denote a complex number.

(a): $\langle \vec {z} + \vec {z}_{1}, \vec {w}\rangle = \langle \vec {z}, \vec {w} \rangle + \langle \vec {z}_{1}, \vec {w} \rangle$ and $\langle \vec {z}, \vec {w} + \vec {w}_{1} \rangle = \langle \vec {z}, \vec {w} \rangle + \langle \vec {z}, \vec {w}_{1} \rangle$ .
(b): $\langle \lambda \vec {z}, \vec {w} \rangle = \lambda \langle \vec {z}, \vec {w} \rangle$ and $\langle \vec {z}, \lambda \vec {w} \rangle = \overline {\lambda } \langle \vec {z}, \vec {w} \rangle$ .
(c): $\langle \vec {z}, \vec {w} \rangle = \overline {\langle \vec {w}, \vec {z} \rangle }$ .
(d): $\langle \vec {z}, \vec {z} \rangle \ge 0$ , and $\langle \vec {z}, \vec {z} \rangle = 0$ if and only if $\vec {z} = \vec {0}$ .

Proof: We leave th:025575a and th:025575b to the reader (Practice Problem prb:complex_matrices10), and th:025575d has already been proved. To prove th:025575c, write $\vec {z} = [z_{1}, z_{2}, \ldots , z_{n}]$ and $\vec {w} = [w_{1}, w_{2}, \ldots , w_{n}]$ . Then $\begin{align*} \overline{\langle \vec{w}, \vec{z} \rangle} = (\overline{w_{1}\overline{z}_{1} + \ldots + w_{n}\overline{z}_{n}}) &= \overline{w}_{1}\overline{\overline{z}}_{1} + \ldots + \overline{w}_{n}\overline{\overline{z}}_{n} \\ &= z_{1}\overline{w}_{1} + \ldots + z_{n}\overline{w}_{n} = \langle \vec{z}, \vec{w} \rangle \end{align*}$ $\blacksquare$

As for the dot product on $\RR ^n$ , property th:025575d enables us to define the norm or length $\norm {\vec {z}}$ of a vector $\vec {z} = (z_{1}, z_{2}, \ldots , z_{n})$ in $\mathbb {C}^n$ : $\begin{equation*} \norm{\vec{z}} = \sqrt{\langle \vec{z}, \vec{z} \rangle} = \sqrt{|z_{1}|^2 + |z_{2}|^2 + \ldots + |z_{n}|^2} \end{equation*}$

The only properties of the norm function we will need are the following (the proofs are left to the reader):

If $\vec {z}$ is any vector in $\mathbb {C}^n$ , then

(a): $\norm {\vec {z}} \ge 0$ and $\norm {\vec {z}} = 0$ if and only if $\vec {z} = \vec {0}$ .
(b): $\norm {\lambda \vec {z}} = |\lambda | \norm {\vec {z}}$ for all complex numbers $\lambda$ .

A vector $\vec {u}$ in $\mathbb {C}^n$ is called a unit vector if $\norm {\vec {u}} = 1$ . Property th:025616b in Theorem th:025616 then shows that if $\vec {z} \neq \vec {0}$ is any nonzero vector in $\mathbb {C}^n$ , then $\vec {u} = \frac {1}{\norm {\vec {z}}}\vec {z}$ is a unit vector.

In $\mathbb {C}^4$ , find a unit vector $\vec {u}$ that is a positive real multiple of $\vec {z} = \begin {bmatrix} 1 - i\\ i\\ 2\\ 3 + 4i \end {bmatrix}$ .

$\norm {\vec {z}} = \sqrt {2+1+4+25} = \sqrt {32} = 4\sqrt {2}$ , so take $\vec {u} = \frac {1}{4\sqrt {2}}\vec {z}$ .

A matrix $A = \left [ a_{ij} \right ]$ is called a complex matrix if every entry $a_{ij}$ is a complex number. The notion of conjugation for complex numbers extends to matrices as follows: Define the conjugate of $A = \left [ a_{ij} \right ]$ to be the matrix

$\begin{equation*} \overline{A} = \left[ \begin{array}{c} \overline{a}_{ij} \end{array}\right] \end{equation*}$ obtained from $A$ by conjugating every entry. Then (using Appendix chap:appacomplexnumbers) $\begin{equation*} \overline{A + B} = \overline{A} + \overline{B} \quad \mbox{ and } \quad \overline{AB} = \overline{A} \; \overline{B} \end{equation*}$ holds for all (complex) matrices of appropriate size.

Transposition of complex matrices is defined just as in the real case, and the following notion is fundamental.

The conjugate transpose $A^{H}$ of a complex matrix $A$ is defined by $\begin{equation*} A^H = (\overline{A})^T = \overline{(A^T)} \end{equation*}$

Observe that $A^{H} = A^{T}$ when $A$ is real.

Other notations for $A^{H}$ are $A^\ast$ and $A^\dagger$ .

$\begin{equation*} \left[ \begin{array}{ccr} 3 & 1 - i & 2 + i \\ 2i & 5 + 2i & -i \end{array}\right]^H = \left[ \begin{array}{cc} 3 & -2i \\ 1 + i & 5 - 2i \\ 2 - i & i \end{array}\right] \end{equation*}$

The following properties of $A^{H}$ follow easily from the rules for transposition of real matrices and extend these rules to complex matrices. Note the conjugate in property th:025659c.

Let $A$ and $B$ denote complex matrices, and let $\lambda$ be a complex number.

(a): $(A^{H})^{H} = A$ .
(b): $(A + B)^{H} = A^{H} + B^{H}$ .
(c): $(\lambda A)^H = \overline {\lambda }A^H$ .
(d): $(AB)^{H} = B^{H}A^{H}$ .

Hermitian and Unitary Matrices

If $A$ is a real symmetric matrix, it is clear that $A^{H} = A$ . The complex matrices that satisfy this condition turn out to be the most natural generalization of the real symmetric matrices:

A square complex matrix $A$ is called Hermitian if $A^{H} = A$ , equivalently $\overline {A} = A^T$ .

The name Hermitian honours Charles Hermite (1822–1901), a French mathematician who worked primarily in analysis and is remembered as the first to show that the number $e$ from calculus is transcendental—that is, $e$ is not a root of any polynomial with integer coefficients.

Hermitian matrices are easy to recognize because the entries on the main diagonal must be real, and the “reflection” of each off-diagonal entry in the main diagonal must be the conjugate of that entry.

$\left [ \begin {array}{ccc} 3 & i & 2 + i \\ -i & -2 & -7 \\ 2 - i & -7 & 1 \end {array}\right ]$ is Hermitian, whereas $\left [ \begin {array}{rr} 1 & i \\ i & -2 \end {array}\right ]$ and $\left [ \begin {array}{rr} 1 & i \\ -i & i \end {array}\right ]$ are not.

The following theorem extends Theorem th:dotpSymmetric, and gives a very useful characterization of Hermitian matrices in terms of the standard inner product in $\mathbb {C}^n$ .

An $n \times n$ complex matrix $A$ is Hermitian if and only if $\begin{equation*} \langle A\vec{z}, \vec{w} \rangle = \langle \vec{z}, A\vec{w} \rangle \end{equation*}$ for all $n$ -tuples $\vec {z}$ and $\vec {w}$ in $\mathbb {C}^n$ .

Proof: If $A$ is Hermitian, we have $A^T = \overline {A}$ . If $\vec {z}$ and $\vec {w}$ are columns in $\mathbb {C}^n$ , then $\langle \vec {z}, \vec {w} \rangle = \vec {z}^T\overline {\vec {w}}$ , so $\begin{equation*} \langle A\vec{z}, \vec{w} \rangle =(A\vec{z})^T\overline{\vec{w}} = \vec{z}^TA^T\overline{\vec{w}} = \vec{z}^T\overline{A}\overline{\vec{w}} = \vec{z}^T(\overline{A\vec{w}}) = \langle \vec{z}, A\vec{w} \rangle \end{equation*}$ To prove the converse, let $\vec {e}_{j}$ denote column $j$ of the identity matrix. If $A = \left [ a_{ij} \right ]$ , the condition gives $\begin{equation*} \overline{a}_{ij} = \langle \vec{e}_{i}, A\vec{e}_{j} \rangle = \langle A\vec{e}_{i}, \vec{e}_{j} \rangle = {a}_{ij} \end{equation*}$ Hence $\overline {A} = A^T$ , so $A$ is Hermitian. $\blacksquare$

Let $A$ be an $n \times n$ complex matrix. As in the real case, a complex number $\lambda$ is called an eigenvalue of $A$ if $A\vec {x} = \lambda \vec {x}$ holds for some column $\vec {x} \neq \vec {0}$ in $\mathbb {C}^n$ . In this case $\vec {x}$ is called an eigenvector of $A$ corresponding to $\lambda$ .

Let $A$ be an $n \times n$ complex matrix. Then $c_{A}(z)=\det (zI-A)$ is a degree $n$ polynomial (in $z$ ) called the characteristic polynomial of $A$ .

Some books define the characteristic polynomial to be $\det (A-zI)$ (the left side of the characteristic equation eqn:chareqn). Our definition ensures that the characteristic polynomial is a monic polynomial (i.e., its leading coefficient is 1). The two polynomials will have the same roots, which are the eigenvalues of the matrix $A$ .

If $A$ is an $n \times n$ matrix, the characteristic polynomial $c_{A}(z)$ is a polynomial of degree $n$ and the eigenvalues of $A$ are just the roots of $c_{A}(z)$ . In most of our examples these roots have been real numbers (in fact, the examples have been carefully chosen so this will be the case!); but it need not happen, even when the characteristic polynomial has real coefficients. For example, if $A = \left [ \begin {array}{rr} 0 & 1 \\ -1 & 0 \end {array}\right ]$ then $c_{A}(z) = z^{2} + 1$ has roots $i$ and $-i$ , where $i$ is a complex number satisfying $i^{2} = -1$ . Therefore, we have to deal with the possibility that the eigenvalues of a (real) square matrix might be complex numbers.

For a complex matrix, $c_{A}(z)$ has complex coefficients (possibly nonreal). However, an argument like that given in Exploration exp:slowdown of The Characteristic Equation still works to show that the eigenvalues of $A$ are the roots (possibly complex) of $c_{A}(z)$ .

It is at this point that the advantage of working with complex numbers becomes apparent. The real numbers are incomplete in the sense that the characteristic polynomial of a real matrix may fail to have all its roots real. However, this difficulty does not occur for the complex numbers. The so-called fundamental theorem of algebra ensures that every polynomial of positive degree with complex coefficients has a complex root. Hence every square complex matrix $A$ has a (complex) eigenvalue. Indeed (see th:034210), $c_{A}(z)$ factors completely as follows:

$\begin{equation*} c_{A}(z) = (z -\lambda_{1})(z -\lambda_{2}) \cdots (z -\lambda_{n}) \end{equation*}$ where $\lambda _{1}, \lambda _{2}, \ldots , \lambda _{n}$ are the eigenvalues of $A$ (with possible repetitions due to multiple roots).

The next result extends Theorem th:symmetric_has_ortho_ev, which asserts that eigenvectors of a symmetric real matrix corresponding to distinct eigenvalues are orthogonal. In the complex context, two $n$ -tuples $\vec {z}$ and $\vec {w}$ in $\mathbb {C}^n$ are said to be orthogonal if $\langle \vec {z}, \vec {w} \rangle = 0$ .

Suppose $A$ is a Hermitian matrix.

(a): The eigenvalues of $A$ are real.
(b): Eigenvectors of $A$ corresponding to distinct eigenvalues are orthogonal.

Proof: Let $\lambda$ and $\mu$ be eigenvalues of $A$ with (nonzero) eigenvectors $\vec {z}$ and $\vec {w}$ . Then $A\vec {z} = \lambda \vec {z}$ and $A\vec {w} = \mu \vec {w}$ , so Theorem th:025697 gives $\begin{equation} \label{eigenvalEq} \lambda \langle \vec{z}, \vec{w} \rangle = \langle \lambda \vec{z}, \vec{w} \rangle = \langle A\vec{z}, \vec{w} \rangle = \langle \vec{z}, A\vec{w} \rangle = \langle \vec{z}, \mu \vec{w} \rangle = \overline{\mu} \langle \vec{z}, \vec{w} \rangle \end{equation}$ If $\mu = \lambda$ and $\vec {w} = \vec {z}$ , this becomes $\lambda \langle \vec {z}, \vec {z} \rangle = \overline {\lambda } \langle \vec {z}, \vec {z} \rangle$ . Because $\langle \vec {z}, \vec {z} \rangle = \norm {\vec {z}}^{2} \neq 0$ , this implies $\lambda = \overline {\lambda }$ . Thus $\lambda$ is real, proving (1). Similarly, $\mu$ is real, so equation (eigenvalEq) gives $\lambda \langle \vec {z}, \vec {w} \rangle = \mu \langle \vec {z}, \vec {w} \rangle$ . If $\lambda \neq \mu$ , this implies $\langle \vec {z}, \vec {w} \rangle = 0$ , proving (2). $\blacksquare$

Let $A$ be a real symmetric matrix. Then the eigenvalues of $A$ are real.

Proof: Symmetric real matrices are Hermitian, and so the result follows immediately from Theorem th:025729. $\blacksquare$

The Real Spectral Theorem (th:PrinAxes) asserts that every real symmetric matrix $A$ is orthogonally diagonalizable—that is $U^{T}AU$ is diagonal where $U$ is an orthogonal matrix $(U^{-1} = U^{T})$ . The next theorem identifies the complex analogs of these orthogonal real matrices.

As in the real case, a set of nonzero vectors $\{\vec {z}_{1}, \vec {z}_{2}, \ldots , \vec {z}_{m}\}$ in $\mathbb {C}^n$ is called orthogonal if $\langle \vec {z}_{i}, \vec {z}_{j}\rangle = 0$ whenever $i \neq j$ , and it is orthonormal if, in addition, $\norm {\vec {z}_{i} } = 1$ for each $i$ .

The following are equivalent for an $n \times n$ complex matrix $A$ .

(a): $A$ is invertible and $A^{-1} = A^{H}$ .
(b): The rows of $A$ are an orthonormal set in $\mathbb {C}^n$ .
(c): The columns of $A$ are an orthonormal set in $\mathbb {C}^n$ .

Proof: If $A = \left [ \begin {array}{cccc} | & | & & | \\ \vec {c}_{1} & \vec {c}_{2} & \cdots & \vec {c}_{n} \\ | & | & & | \end {array}\right ]$ is a complex matrix with $j$ th column $\vec {c}_{j}$ , then $A^T\overline {A} = \left [ \langle \vec {c}_{i}, \vec {c}_{j}\rangle \right ]$ , as in Theorem thm:024227. Now th:025759a $\Leftrightarrow$ th:025759b follows, and th:025759a $\Leftrightarrow$ th:025759c is proved in the same way. $\blacksquare$

A square complex matrix $U$ is called unitary if $U^{-1} = U^{H}$ .

Thus a real matrix is unitary if and only if it is orthogonal.

The matrix $A = \left [ \begin {array}{rr} 1 + i & 1 \\ 1 - i & i \end {array}\right ]$ has orthogonal columns, but the rows are not orthogonal. Normalizing the columns gives the unitary matrix $\frac {1}{2}\left [ \begin {array}{rr} 1 + i & \sqrt {2} \\ 1 - i & \sqrt {2}i \end {array}\right ]$ .

Given a real symmetric matrix $A$ , we saw in Orthogonal Matrices and Symmetric Matrices a procedure for finding an orthogonal matrix $Q$ such that $Q^{T}AQ$ is diagonal (see Example ex:DiagonalizeSymmetricMatrix). The following example illustrates Theorem th:025729 and shows that the technique works for complex matrices.

Consider the Hermitian matrix $A = \left [ \begin {array}{cc} 3 & 2 + i \\ 2 - i & 7 \end {array}\right ]$ . Find the eigenvalues of $A$ , find two orthonormal eigenvectors, and so find a unitary matrix $U$ such that $U^{H}AU$ is diagonal.

The characteristic polynomial of $A$ is $\begin{equation*} c_{A}(z) = \mbox{det}(zI - A) = \mbox{det}\left[ \begin{array}{rr} x - 3 & -2 - i \\ -2 + i & x - 7 \end{array}\right] = (z-2)(z-8) \end{equation*}$ Hence the eigenvalues are $2$ and $8$ (both real as expected), and corresponding eigenvectors are $\left [ \begin {array}{cc} 2 + i \\ -1 \end {array}\right ]$ and $\left [ \begin {array}{cc} 1 \\ 2 - i \end {array}\right ]$ (orthogonal as expected). Each has length $\sqrt {6}$ , so let $U = \frac {1}{\sqrt {6}}\left [ \begin {array}{cc} 2 + i & 1 \\ -1 & 2 - i \end {array}\right ]$ be the unitary matrix with the normalized eigenvectors as columns.

Then $U^HAU = \left [ \begin {array}{rr} 2 & 0 \\ 0 & 8 \end {array}\right ]$ is diagonal.

Unitary Diagonalization

An $n \times n$ complex matrix $A$ is called unitarily diagonalizable if $U^{H}AU$ is diagonal for some unitary matrix $U$ . As Example ex:025794 suggests, we are going to prove that every Hermitian matrix is unitarily diagonalizable. However, with only a little extra effort, we can get a very important theorem that has this result as an easy consequence.

A complex matrix is called upper triangular if every entry below the main diagonal is zero. We owe the following theorem to Issai Schur.

Issai Schur (1875–1941) was a German mathematician who did fundamental work in the theory of representations of groups as matrices.

Schur’s Theorem If $A$ is any $n \times n$ complex matrix, there exists a unitary matrix $U$ such that $\begin{equation*} U^HAU = T \end{equation*}$ is upper triangular. Moreover, the entries on the main diagonal of $T$ are the eigenvalues $\lambda _{1}, \lambda _{2}, \ldots , \lambda _{n}$ of $A$ (including multiplicities).

Proof: We use induction on $n$ , mirroring the form of the proof of th:PrinAxes. If $n = 1$ , $A$ is already upper triangular. If $n > 1$ , assume the theorem is valid for $(n - 1) \times (n - 1)$ complex matrices. Let $\lambda _{1}$ be an eigenvalue of $A$ , and let $\vec {y}_{1}$ be an eigenvector with $\norm {\vec {y}_{1}} = 1$ . Next, the (complex analog of the) Gram-Schmidt process provides $\vec {y}_{2}, \ldots , \vec {y}_{n}$ such that $\{\vec {y}_{1}, \vec {y}_{2}, \ldots , \vec {y}_{n}\}$ is an orthonormal basis of $\mathbb {C}^n$ . If $U_{1} = \left [ \begin {array}{cccc} | & | & & | \\ \vec {y}_{1} & \vec {y}_{2} & \cdots & \vec {y}_{n} \\ | & | & & | \end {array}\right ]$ is the matrix with these vectors as its columns, then $\begin{equation*} U_{1}^HAU_{1} = \left[ \begin{array}{cc} \lambda_{1} & X_{1} \\ 0 & A_{1} \end{array}\right] \end{equation*}$ in block form. Now apply induction to find a unitary $(n - 1) \times (n - 1)$ matrix $W_{1}$ such that $W_{1}^HA_{1}W_{1} = T_{1}$ is upper triangular. Then $U_{2} = \left [ \begin {array}{cc} 1 & 0 \\ 0 & W_{1} \end {array}\right ]$ is a unitary $n \times n$ matrix. Hence $U = U_{1}U_{2}$ is unitary (using Theorem th:025759), and $\begin{align*} U^HAU &= U_{2}^H(U_{1}^HAU_{1})U_{2} \\ &= \left[ \begin{array}{cc} 1 & 0 \\ 0 & W_{1}^H \end{array}\right] \left[ \begin{array}{cc} \lambda_{1} & X_{1} \\ 0 & A_{1} \end{array}\right] \left[ \begin{array}{cc} 1 & 0 \\ 0 & W_{1} \end{array}\right] = \left[ \begin{array}{cc} \lambda_{1} & X_{1}W_{1} \\ 0 & T_{1} \end{array}\right] \end{align*}$ is upper triangular. Finally, $A$ and $U^{H}AU = T$ have the same eigenvalues by (the complex version of) Theorem th:properties_similar_eig, and they are the diagonal entries of $T$ because $T$ is upper triangular. $\blacksquare$

The fact that similar matrices have the same traces and determinants gives the following consequence of Schur’s theorem.

Let $A$ be an $n \times n$ complex matrix, and let $\lambda _{1}, \lambda _{2}, \ldots , \lambda _{n}$ denote the eigenvalues of $A$ , including multiplicities. Then $\begin{equation*} \mbox{det }A = \lambda_1\lambda_2 \cdots \lambda_n \quad \mbox{and} \quad \mbox{tr }A = \lambda_1 + \lambda_2 + \cdots + \lambda_n \end{equation*}$

Schur’s theorem asserts that every complex matrix can be “unitarily triangularized.” However, we cannot substitute “unitarily diagonalized” here. In fact, if $A = \left [ \begin {array}{cc} 1 & 1 \\ 0 & 1 \end {array}\right ]$ , there is no invertible complex matrix $U$ at all such that $U^{-1}AU$ is diagonal. However, the situation is much better for Hermitian matrices.

If $A$ is Hermitian, there is a unitary matrix $U$ such that $U^{H}AU$ is diagonal.

Proof: By Schur’s theorem, let $U^{H}AU = T$ be upper triangular where $U$ is unitary. Since $A$ is Hermitian, this gives $\begin{equation*} T^H = (U^HAU)^H = U^HA^HU^{HH} = U^HAU = T \end{equation*}$ This means that $T$ is both upper and lower triangular. Hence $T$ is actually diagonal. $\blacksquare$

The Real Spectral Theorem asserts that a real matrix $A$ is symmetric if and only if it is orthogonally diagonalizable (that is, $P^{T}AP$ is diagonal for some real orthogonal matrix $P$ ). Theorem th:Spectral Theorem is the complex analog of half of this result. However, the converse is false for complex matrices: There exist unitarily diagonalizable matrices that are not Hermitian.

Show that the non-Hermitian matrix $A = \left [ \begin {array}{rr} 0 & 1 \\ -1 & 0 \end {array}\right ]$ is unitarily diagonalizable.

The characteristic polynomial is $c_{A}(z) = z^{2} + 1$ . Hence the eigenvalues are $i$ and $-i$ , and it is easy to verify that $\left [ \begin {array}{r} i \\ -1 \end {array}\right ]$ and $\left [ \begin {array}{r} -1 \\ i \end {array}\right ]$ are corresponding eigenvectors. Moreover, these eigenvectors are orthogonal and both have length $\sqrt {2}$ , so $U = \frac {1}{\sqrt {2}}\left [ \begin {array}{rr} i & -1 \\ -1 & i \end {array}\right ]$ is a unitary matrix such that $U^HAU = \left [ \begin {array}{rr} i & 0 \\ 0 & -i \end {array}\right ]$ is diagonal.

There is a very simple way to characterize those complex matrices that are unitarily diagonalizable. To this end, an $n \times n$ complex matrix $N$ is called normal if $NN^{H} = N^{H}N$ . It is clear that every Hermitian or unitary matrix is normal, as is the matrix $\left [ \begin {array}{rr} 0 & 1 \\ -1 & 0 \end {array}\right ]$ in Example exa:025874. In fact we have the following result.

An $n \times n$ complex matrix $A$ is unitarily diagonalizable if and only if $A$ is normal.

Proof: Assume first that $U^{H}AU = D$ , where $U$ is unitary and $D$ is diagonal. Then $DD^{H} = D^{H}D$ as is easily verified. Because $DD^{H} = U^{H}(AA^{H})U$ and $D^{H}D = U^{H}(A^{H}A)U$ , it follows by cancellation that $AA^{H} = A^{H}A$ .
Conversely, assume $A$ is normal—that is, $AA^{H} = A^{H}A$ . By Schur’s theorem, let $U^{H}AU = T$ , where $T$ is upper triangular and $U$ is unitary. Then $T$ is normal too:
$\begin{equation*} TT^H = U^H(AA^H)U = U^H(A^HA)U = T^HT \end{equation*}$ Hence it suffices to show that a normal $n \times n$ upper triangular matrix $T$ must be diagonal. We induct on $n$ ; it is clear if $n = 1$ . If $n > 1$ and $T = \left [ t_{ij} \right ]$ , then equating $(1, 1)$ -entries in $TT^{H}$ and $T^{H}T$ gives $\begin{equation*} |t_{11}|^2 + |t_{12}|^2 + \ldots + |t_{1n}|^2 = |t_{11}|^2 \end{equation*}$ This implies $t_{12} = t_{13} = \ldots = t_{1n} = 0$ , so $T = \left [ \begin {array}{cc} t_{11} & 0 \\ 0 & T_{1} \end {array}\right ]$ in block form. Hence $T = \left [ \begin {array}{cc} \overline {t}_{11} & 0 \\ 0 & T_{1}^H \end {array}\right ]$ so $TT^{H} = T^{H}T$ implies $T_{1}T_{1}^H = T_{1}T_{1}^H$ . Thus $T_{1}$ is diagonal by induction, and the proof is complete. $\blacksquare$

We conclude this section by using Schur’s theorem (Theorem th:025814) to prove a famous theorem about matrices. Recall that the characteristic polynomial of a square matrix $A$ is defined by $c_{A}(z) = \mbox {det}(zI - A)$ , and that the eigenvalues of $A$ are just the roots of $c_{A}(z)$ .

Cayley-Hamilton Theorem If $A$ is an $n \times n$ complex matrix, then $c_{A}(A) = 0$ ; that is, $A$ is a root of its characteristic polynomial.

Named after the English mathematician Arthur Cayley (1821–1895) and William Rowan Hamilton (1805–1865), an Irish mathematician famous for his work on physical dynamics.

Proof

If $p(z)$ is any polynomial with complex coefficients, then $p(P^{-1}AP) = P^{-1}p(A)P$ for any invertible complex matrix $P$ . Hence, by Schur’s theorem, we may assume that $A$ is upper triangular. Then the eigenvalues $\lambda _{1}, \lambda _{2}, \ldots , \lambda _{n}$ of $A$ appear along the main diagonal, so $\begin{equation*} c_{A}(z) = (z - \lambda_{1})(z - \lambda_{2})(z - \lambda_{3}) \cdots (z -\lambda_{n}) \end{equation*}$ Thus $\begin{equation*} c_{A}(A) = (A - \lambda_{1}I)(A - \lambda_{2}I)(A - \lambda_{3}I) \cdots (A - \lambda_{n}I) \end{equation*}$ Note that each matrix $A - \lambda _{i}I$ is upper triangular. Now observe:

(a): $A - \lambda _{1}I$ has zero first column because column 1 of $A$ is $(\lambda _{1}, 0, 0, \ldots , 0)^{T}$ .
(b): Then $(A - \lambda _{1}I)(A - \lambda _{2}I)$ has the first two columns zero because the second column of $(A - \lambda _{2}I)$ is $(b, 0, 0, \ldots , 0)^{T}$ for some constant $b$ .
(c): Next $(A - \lambda _{1}I)(A - \lambda _{2}I)(A - \lambda _{3}I)$ has the first three columns zero because column 3 of $(A -\lambda _{3}I)$ is $(c, d, 0, \ldots , 0)^{T}$ for some constants $c$ and $d$ .

Continuing in this way we see that $(A - \lambda _{1}I)(A - \lambda _{2}I)(A - \lambda _{3}I) \cdots (A - \lambda _{n}I)$ has all $n$ columns zero; that is, $c_{A}(A) = 0$ . $\blacksquare$

Practice Problems

In each case, compute the norm of the complex vector.

(a): $(1, 1 - i, -2, i)$ $\answer {\sqrt {6}}$
(b): $(1 - i, 1 + i, 1, -1)$
(c): $(2 + i, 1 - i, 2, 0, -i)$ $\answer {\sqrt {13}}$
(d): $(-2, -i, 1 + i, 1 - i, 2i)$

In each case, determine whether the two vectors are orthogonal.

(a): $\begin {bmatrix} 4\\ -3i\\ 2 + i\end {bmatrix}$ , $\begin {bmatrix} i\\ 2\\ 2 - 4i\end {bmatrix}$ YesNo
(b): $\begin {bmatrix} i\\ -i\\ 2 + i \end {bmatrix}$ , $\begin {bmatrix} i\\ i\\ 2 -i \end {bmatrix}$ YesNo
(c): $\begin {bmatrix} 1\\ 1\\ i\\ i\end {bmatrix}$ , $\begin {bmatrix} 1\\ i\\ -i\\ 1\end {bmatrix}$ YesNo
(d): $\begin {bmatrix} 4 + 4i\\ 2 + i\\ 2i\end {bmatrix}$ , $\begin {bmatrix} -1 + i\\ 2\\ 3 - 2i\end {bmatrix}$ YesNo

A subset $U$ of $\mathbb {C}^n$ is called a complex subspace of $\mathbb {C}^n$ if it contains $0$ and if, given $\vec {v}$ and $\vec {w}$ in $U$ , both $\vec {v} + \vec {w}$ and $z\vec {v}$ lie in $U$ ( $z$ any complex number). In each case, determine whether $U$ is a complex subspace of $\mathbb {C}^3$ .

(a): $U = \{(w, \overline {w}, 0) \mid w \mbox { in } \mathbb {C}\}$
Not a subspace. For example, $i(0, 0, 1) = (0, 0, i)$ is not in $U$ .
(b): $U = \{(w, 2w, a) \mid w \mbox { in } \mathbb {C}, a \mbox { in } \RR \}$
(c): $U = \RR ^3$ SubspaceNot a Subspace
(d): $U = \{(v + w, v - 2w, v) \mid v, w \mbox { in } \mathbb {C}\}$

In each case, find a basis over $\mathbb {C}$ , and determine the dimension of the complex subspace $U$ of $\mathbb {C}^3$ (see the previous exercise).

(a): $U = \{(w, v + w, v - iw) \mid v, w \mbox { in } \mathbb {C}\}$
Basis $\{(i, 0, 2), (1, 0, -1)\}$ ; dimension $2$
(b): $U = \{(iv + w, 0, 2v - w) \mid v, w \mbox { in } \mathbb {C}\}$
(c): $U = \{(u, v, w) \mid iu - 3v + (1 - i)w = 0;\ \\ u, v, w \mbox { in } \mathbb {C}\}$
Basis $\{(1, 0, -2i), (0, 1, 1 - i)\}$ ; dimension $2$
(d): $U = \{(u, v, w) \mid 2u + (1 + i)v - iw = 0;\ \\ u, v, w \mbox { in } \mathbb {C}\}$

In each case, determine whether the given matrix is Hermitian, unitary, or normal.

(a): $\left [ \begin {array}{rr} 1 & -i \\ i & i \end {array}\right ]$
Normal only
(b): $\left [ \begin {array}{rr} 2 & 3 \\ -3 & 2 \end {array}\right ]$
(c): $\left [ \begin {array}{rr} 1 & i \\ -i & 2 \end {array}\right ]$
Hermitian (and normal), not unitary
(d): $\left [ \begin {array}{rr} 1 & -i \\ i & -1 \end {array}\right ]$
(e): $\frac {1}{\sqrt {2}} \left [ \begin {array}{rr} 1 & -1 \\ 1 & 1 \end {array}\right ]$
(f): $\left [ \begin {array}{cc} 1 & 1 + i \\ 1 + i & i \end {array}\right ]$
None of these adjectives apply.
(g): $\left [ \begin {array}{cc} 1 + i & 1 \\ -i & -1 + i \end {array}\right ]$
(h): $\frac {1}{\sqrt {2}|z|}\left [ \begin {array}{rr} z & z \\ \overline {z} & -\overline {z} \end {array}\right ]$ , $z \neq 0$
Unitary (and normal); Hermitian if and only if $z$ is real

Show that a matrix $N$ is normal if and only if $\overline {N}N^T = N^T\overline {N}$ .

Let $A = \left [ \begin {array}{cc} z & \overline {v} \\ v & w \end {array}\right ]$ where $v$ , $w$ , and $z$ are complex numbers. Characterize in terms of $v$ , $w$ , and $z$ when $A$ is

(a): Hermitian
(b): unitary
(c): normal.

In each case, find a unitary matrix $U$ such that $U^{H}AU$ is diagonal.

(a): $A = \left [ \begin {array}{rr} 1 & i\\ -i & 1 \end {array}\right ]$
(b): $A = \left [ \begin {array}{cc} 4 & 3 - i \\ 3 + i & 1 \end {array}\right ]$
$U = \frac {1}{\sqrt {14}}\left [ \begin {array}{cc} -2 & 3 - i \\ 3 + i & 2 \end {array}\right ]$ , $U^HAU = \left [ \begin {array}{rr} -1 & 0 \\ 0 & 6 \end {array}\right ]$
(c): $A = \left [ \begin {array}{rr} a & b\\ -b & a \end {array}\right ]$ ; $a$ , $b$ , real
(d): $A = \left [ \begin {array}{cc} 2 & 1 + i\\ 1 - i & 3 \end {array}\right ]$
$U = \frac {1}{\sqrt {3}}\left [ \begin {array}{cc} 1 + i & 1 \\ -1 & 1 - i \end {array}\right ]$ , $U^HAU = \left [ \begin {array}{rr} 1 & 0 \\ 0 & 4 \end {array}\right ]$
(e): $A = \left [ \begin {array}{ccc} 1 & 0 & 1 + i\\ 0 & 2 & 0 \\ 1 - i & 0 & 0 \end {array}\right ]$
(f): $A = \left [ \begin {array}{ccc} 1 & 0 & 0\\ 0 & 1 & 1 + i\\ 0 & 1 - i & 2 \end {array}\right ]$
$U = \frac {1}{\sqrt {3}}\left [ \begin {array}{ccc} \sqrt {3} & 0 & 0 \\ 0 & 1 + i & 1 \\ 0 & -1 & 1 - i \end {array}\right ]$ , $U^HAU = \left [ \begin {array}{rrr} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 3 \end {array}\right ]$

Show that $\langle A \vec {x}, \vec {y} \rangle = \langle \vec {x}, A^{H}\vec {y}\rangle$ holds for all $n \times n$ matrices $A$ and for all $n$ -tuples $\vec {x}$ and $\vec {y}$ in $\mathbb {C}^n$ .

(a): Prove th:025575a and th:025575b of Theorem th:025575.
(b): Prove Theorem th:025616.
$\norm { \lambda Z }^2 = \langle \lambda Z, \lambda Z \rangle = \lambda \overline {\lambda } \langle Z, Z \rangle = |\lambda |^2 \norm { Z }^2$
(c): Prove Theorem th:025659.

a.: Show that $A$ is Hermitian if and only if $\overline {A} = A^T$ .
b.: Show that the diagonal entries of any Hermitian matrix are real.
If the $(k, k)$ -entry of $A$ is $a_{kk}$ , then the $(k, k)$ -entry of $\overline {A}$ is $\overline {a}_{kk}$ so the $(k, k)$ -entry of $(\overline {A})^T = A^{H}$ is $\overline {a}_{kk}$ . This equals $a$ , so $a_{kk}$ is real.

(a): Show that every complex matrix $Z$ can be written uniquely in the form $Z = A + iB$ , where $A$ and $B$ are real matrices.
(b): If $Z = A + iB$ as in (a), show that $Z$ is Hermitian if and only if $A$ is symmetric, and $B$ is skew-symmetric (that is, $B^{T} = -B$ ).

If $Z$ is any complex $n \times n$ matrix, show that $ZZ^{H}$ and $Z + Z^{H}$ are Hermitian.

A complex matrix $B$ is called skew-Hermitian if $B^{H} = -B$ .

a.: Show that $Z - Z^{H}$ is skew-Hermitian for any square complex matrix $Z$ .
b.: If $B$ is skew-Hermitian, show that $B^{2}$ and $iB$ are Hermitian.
Show that $(B^2)^H = B^HB^H = (-B)(-B) = B^2$ ; $(iB)^H = \overline {i}B^H = (-i)(-B) = iB$ .
c.: If $B$ is skew-Hermitian, show that the eigenvalues of $B$ are pure imaginary ( $i \lambda$ for real $\lambda$ ).
d.: Show that every $n \times n$ complex matrix $Z$ can be written uniquely as $Z = A + B$ , where $A$ is Hermitian and $B$ is skew-Hermitian.
If $Z = A + B$ , as given, first show that $Z^{H} = A - B$ , and hence that $A = \frac {1}{2}(Z + Z^{H})$ and $B = \frac {1}{2}(Z - Z^{H})$ .

Let $U$ be a unitary matrix. Show that:

(a): $\norm { U\vec {x} } = \norm { \vec {x} }$ for all columns $\vec {x}$ in $\mathbb {C}^n$ .
(b): $|\lambda | = 1$ for every eigenvalue $\lambda$ of $U$ .

(a): If $Z$ is an invertible complex matrix, show that $Z^{H}$ is invertible and that $(Z^{H})^{-1} = (Z^{-1})^{H}$ .
(b): Show that the inverse of a unitary matrix is again unitary.
If $U$ is unitary, $(U^{-1})^{-1} = (U^{H})^{-1} = (U^{-1})^{H}$ , so $U^{-1}$ is unitary.
(c): If $U$ is unitary, show that $U^{H}$ is unitary.

Let $Z$ be an $m \times n$ matrix such that $Z^{H}Z = I_{n}$ (for example, $Z$ is a unit column in $\mathbb {C}^n$ ).

(a): Show that $V = ZZ^{H}$ is Hermitian and satisfies
$V^{2} = V$ .
(b): Show that $U = I - 2ZZ^{H}$ is both unitary and Hermitian (so $U^{-1} = U^{H} = U$ ).

(a): If $N$ is normal, show that $zN$ is also normal for all complex numbers $z$ .
(b): Show that (a) fails if normal is replaced by Hermitian.
$H = \left [ \begin {array}{rr} 1 & i \\ -i & 0 \end {array}\right ]$ is Hermitian but $iH = \left [ \begin {array}{rr} i & -1 \\ 1 & 0 \end {array}\right ]$ is not.

Show that a real $2 \times 2$ normal matrix is either symmetric or has the form $\left [ \begin {array}{rr} a & b \\ -b & a \end {array}\right ]$ .

If $A$ is Hermitian, show that all the coefficients of $c_{A}(z)$ are real numbers.

(a): If $A = \left [ \begin {array}{rr} 1 & 1 \\ 0 & 1 \end {array}\right ]$ , show that $U^{-1}AU$ is not diagonal for any invertible complex matrix $U$ .
(b): If $A = \left [ \begin {array}{rr} 0 & 1 \\ -1 & 0 \end {array}\right ]$ , show that $U^{-1}AU$ is not upper triangular for any real invertible matrix $U$ .
Let $U = \left [ \begin {array}{rr} a & b \\ c & d \end {array}\right ]$ be real and invertible, and assume that $U^{-1}AU = \left [ \begin {array}{rr} \lambda & \mu \\ 0 & v \end {array}\right ]$ . Then $AU = U\left [ \begin {array}{rr} \lambda & \mu \\ 0 & v \end {array}\right ]$ , and first column entries are $c = a\lambda$ and $-a = c\lambda$ . Hence $\lambda$ is real ( $c$ and $a$ are both real and are not both $0$ ), and $(1 + \lambda ^{2})a = 0$ . Thus $a = 0$ , $c = a\lambda = 0$ , a contradiction.

If $A$ is any $n \times n$ matrix, show that $U^{H}AU$ is lower triangular for some unitary matrix $U$ .

If $A$ is a $3 \times 3$ matrix, show that $A^{2} = 0$ if and only if there exists a unitary matrix $U$ such that $U^{H}AU$ has the form $\left [ \begin {array}{rrr} 0 & 0 & u \\ 0 & 0 & v \\ 0 & 0 & 0 \end {array}\right ]$ or the form $\left [ \begin {array}{rrr} 0 & u & v \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end {array}\right ]$ .

If $A^{2} = A$ , show that rank $A = \mbox {tr}A$ . [Hint: Use Schur’s theorem.]

Text Source

This section was adapted from Section 8.6 of Keith Nicholson’s Linear Algebra with Applications. (CC-BY-NC-SA)

W. Keith Nicholson, Linear Algebra with Applications, Lyryx 2018, Open Edition, pp. 445–456.