Orthogonal Matrices and Symmetric Matrices

Recall that an $n \times n$ matrix $A$ is diagonalizable if and only if it has $n$ linearly independent eigenvectors (see Diagonalizable Matrices and Multiplicity). Moreover, the matrix $P$ with these eigenvectors as columns is a diagonalizing matrix for $A$, that is, $P^{-1}AP$ is diagonal.

As we have seen, the nice bases of $\mathbb{R}^n$ are the orthogonal ones, so a natural question is: which $n \times n$ matrices $A$ have orthogonal eigenvectors, so that the columns of $P$ form an orthogonal basis for $\mathbb{R}^n$? These turn out to be precisely the symmetric matrices (matrices for which $A^T = A$), and this is the main result of this section.

Orthogonal Matrices

Recall that an orthogonal set of vectors is called orthonormal if $\|\mathbf{x}\| = 1$ for each vector $\mathbf{x}$ in the set, and that any orthogonal set $\{\mathbf{v}_1, \mathbf{v}_2, \dots, \mathbf{v}_k\}$ can be “normalized”, i.e. converted into an orthonormal set $\left\{\frac{1}{\|\mathbf{v}_1\|}\mathbf{v}_1, \dots, \frac{1}{\|\mathbf{v}_k\|}\mathbf{v}_k\right\}$. In particular, if an $n \times n$ matrix $A$ has $n$ orthogonal eigenvectors, they can (by normalizing) be taken to be orthonormal. The corresponding diagonalizing matrix (we will use $Q$ instead of $P$) has orthonormal columns, and such matrices are very easy to invert.

Proof
First note that condition th:orthogonal_matrices_a is equivalent to $Q^TQ = I$. Let $\mathbf{q}_1, \mathbf{q}_2, \dots, \mathbf{q}_n$ denote the columns of $Q$. Then $\mathbf{q}_i^T$ is the $i$th row of $Q^T$, so the $(i,j)$-entry of $Q^TQ$ is $\mathbf{q}_i \cdot \mathbf{q}_j$. Thus $Q^TQ = I$ means that $\mathbf{q}_i \cdot \mathbf{q}_j = 0$ if $i \neq j$ and $\mathbf{q}_i \cdot \mathbf{q}_j = 1$ if $i = j$. Hence condition th:orthogonal_matrices_a is equivalent to th:orthogonal_matrices_c. The proof of the equivalence of th:orthogonal_matrices_a and th:orthogonal_matrices_b is similar.
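Octave can be used to see this inverse-by-transpose property numerically. The sketch below is only an illustration: the starting matrix is arbitrary, and its QR factorization is used simply as a convenient way to produce a matrix $Q$ with orthonormal columns.

    %An orthogonal matrix is inverted simply by transposing it.
    [Q,R] = qr([2 1 1; -1 1 1; 0 -1 1]);   %qr produces a Q with orthonormal columns
    transpose(Q)*Q                          %should be the 3x3 identity
    inv(Q) - transpose(Q)                   %should be (numerically) the zero matrix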

It is not enough that the rows of a matrix $A$ are merely orthogonal for $A$ to be an orthogonal matrix. Here is an example.

Let $A = \begin{bmatrix} 2 & 1 & 1 \\ -1 & 1 & 1 \\ 0 & -1 & 1 \end{bmatrix}$.
(a)
Check that matrix $A$ has rows that are orthogonal.
(b)
Check that matrix $A$ has columns that are NOT orthogonal.
(c)
Check that matrix $A$ has rows that are NOT orthonormal.
(d)
Create a matrix $Q$ by normalizing each of the rows of $A$.
(e)
Check that $Q$ is an orthogonal matrix.

Click the arrow to see the answer.

You should get $Q = \begin{bmatrix} \frac{2}{\sqrt{6}} & \frac{1}{\sqrt{6}} & \frac{1}{\sqrt{6}} \\ -\frac{1}{\sqrt{3}} & \frac{1}{\sqrt{3}} & \frac{1}{\sqrt{3}} \\ 0 & -\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} \end{bmatrix}$, and one can check that this is orthogonal in a number of ways.

This exploration can certainly be done by hand (although it takes some time), but it also makes for a very nice Octave exercise.

To use Octave, go to the Sage Math Cell Webpage, copy the code below into the cell, select OCTAVE as the language, and press EVALUATE.

    %Exploration from Section 9.4 Orthogonal Matrices and Symmetric Matrices
 
    A=[2 1 1; -1 1 1; 0 -1 1]  
    %Check that matrix A has rows that are orthogonal.  
    A(1,:)*transpose(A(2,:))  
    A(2,:)*transpose(A(3,:))  
    A(1,:)*transpose(A(3,:))  
    %Check that matrix A has columns that are NOT orthogonal.  
    transpose(A(:,1))*A(:,2)  
    %(This is 1 of 3 calculations to do.)  
    %Check that matrix A in the Octave window has rows that are NOT orthonormal.  
    %(See the results from the first question.)  
    %Create a matrix Q by normalizing each of the rows of A.  
    q1=A(1,:)/norm(A(1,:));  
    q2=A(2,:)/norm(A(2,:));  
    q3=A(3,:)/norm(A(3,:));  
    Q = [q1;q2;q3]  
    %Check that Q is an orthogonal matrix.  
    Q*transpose(Q)  
    %(You may get numbers close to zero in some places you expect to get zero due to rounding error)

We studied the idea of closure when we studied Subspaces of $\mathbb{R}^n$. The next theorem tells us that orthogonal matrices are closed under matrix multiplication.
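As a quick numerical illustration of this closure property, the sketch below multiplies two orthogonal matrices chosen for convenience (a rotation matrix and the normalized matrix $Q$ from the exploration above) and checks that the product is again orthogonal.

    %The product of two orthogonal matrices is again orthogonal.
    t = pi/5;                                         %any angle works here
    P = [cos(t) -sin(t) 0; sin(t) cos(t) 0; 0 0 1];   %a rotation, hence orthogonal
    Q = [2/sqrt(6) 1/sqrt(6) 1/sqrt(6); -1/sqrt(3) 1/sqrt(3) 1/sqrt(3); 0 -1/sqrt(2) 1/sqrt(2)];
    (P*Q)*transpose(P*Q)                              %should be the 3x3 identity (up to rounding)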

Symmetric Matrices

We now shift our focus from orthogonal matrices to another important class of matrices called symmetric matrices. A symmetric matrix is a matrix which is equal to its transpose. We saw a few examples of such matrices in Transpose of a Matrix.

When we began our study of eigenvalues and eigenvectors, we saw numerous examples of matrices with real entries whose eigenvalues were complex numbers. It can be shown that symmetric matrices have only real eigenvalues. We also learned that some matrices are diagonalizable while other matrices are not. It turns out that every symmetric matrix is diagonalizable. In fact, we can say more, but first we need the following definition.
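Both claims are easy to explore numerically. In the Octave sketch below the matrices are chosen arbitrarily for illustration: a rotation matrix (real entries, complex eigenvalues) and a symmetric matrix (real eigenvalues only).

    %A real matrix need not have real eigenvalues...
    B = [0 -1; 1 0];              %rotation by 90 degrees
    eig(B)                        %eigenvalues are i and -i
    %...but a symmetric matrix always does.
    A = [1 2 3; 2 4 5; 3 5 6];    %A equals its transpose
    eig(A)                        %all three eigenvalues are real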

We have learned earlier that when we diagonalize a matrix $A$, we write $P^{-1}AP = D$ for some invertible matrix $P$, where $D$ is diagonal and the diagonal entries of $D$ are the eigenvalues of $A$. We have also learned that the columns of the matrix $P$ are the corresponding eigenvectors of $A$. So when a matrix $A$ is orthogonally diagonalizable, we are able to accomplish the diagonalization using a matrix $Q$ whose columns are eigenvectors of $A$ that form an orthonormal basis for $\mathbb{R}^n$. The following remarkable theorem shows that the matrices that have this property are precisely the symmetric matrices.

Proof
If $A$ is orthogonally diagonalizable, then it is an easy exercise to prove that it is symmetric. You are asked to do this in Practice Problem prob:ortho_diag_implies_symmetric.

To prove the “only if” part of this theorem, we assume $A$ is symmetric, and we need to show it is orthogonally diagonalizable. We proceed by induction on $n$, the size of the symmetric matrix. If $n = 1$, $A$ is already diagonal. If $n > 1$, assume that we know the “only if” statement holds for $(n-1) \times (n-1)$ symmetric matrices. Let $\lambda_1$ be an eigenvalue of $A$, and let $A\mathbf{x}_1 = \lambda_1\mathbf{x}_1$, where $\|\mathbf{x}_1\| = 1$. Next, set $\mathbf{q}_1 = \mathbf{x}_1$, and use the Gram-Schmidt algorithm to find an orthonormal basis $\{\mathbf{q}_1, \mathbf{q}_2, \dots, \mathbf{q}_n\}$ for $\mathbb{R}^n$. Let $Q_1 = \begin{bmatrix} \mathbf{q}_1 & \mathbf{q}_2 & \cdots & \mathbf{q}_n \end{bmatrix}$, so that $Q_1$ is an orthogonal matrix. We have
$$Q_1^TAQ_1 = \begin{bmatrix} \lambda_1 & B \\ \mathbf{0} & A_1 \end{bmatrix},$$
where the block $B$ has dimensions $1 \times (n-1)$, and the $(n-1) \times 1$ block under $\lambda_1$ is a zero matrix, because of the orthogonality of the basis vectors.

Next, using the fact that $A$ is symmetric, we notice that $(Q_1^TAQ_1)^T = Q_1^TA^TQ_1 = Q_1^TAQ_1$, so $Q_1^TAQ_1$ is symmetric. It follows that $B$ is also a zero matrix and that $A_1$ is symmetric. Since $A_1$ is an $(n-1) \times (n-1)$ symmetric matrix, we may apply the inductive hypothesis, so there exists an $(n-1) \times (n-1)$ orthogonal matrix $Q_2$ such that $Q_2^TA_1Q_2 = D_1$ is diagonal. We observe that $Q_3 = \begin{bmatrix} 1 & \mathbf{0} \\ \mathbf{0} & Q_2 \end{bmatrix}$ is orthogonal, and we compute:
$$(Q_1Q_3)^TA(Q_1Q_3) = Q_3^T(Q_1^TAQ_1)Q_3 = \begin{bmatrix} \lambda_1 & \mathbf{0} \\ \mathbf{0} & Q_2^TA_1Q_2 \end{bmatrix} = \begin{bmatrix} \lambda_1 & \mathbf{0} \\ \mathbf{0} & D_1 \end{bmatrix}$$
is diagonal. Because $Q_1Q_3$ is orthogonal by Theorem th:orthogonal_product_inverse (th:orthogonal_product), this completes the proof.

Because the eigenvalues of a real symmetric matrix are real, Theorem th:PrinAxes is also called the Real Spectral Theorem, and the set of distinct eigenvalues is called the spectrum of the matrix. A similar result holds for matrices with complex entries (Theorem th:025890).
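Octave's eig command can be used to see the Real Spectral Theorem in action: for a symmetric input it returns an orthonormal set of eigenvectors, so the matrix of eigenvectors below is orthogonal. The symmetric matrix $A$ is chosen arbitrarily for this sketch.

    %Orthogonally diagonalize a symmetric matrix numerically.
    A = [1 2 3; 2 4 5; 3 5 6];    %symmetric
    [Q,D] = eig(A);               %columns of Q are eigenvectors, D is diagonal
    transpose(Q)*Q                %should be the identity, so Q is orthogonal
    transpose(Q)*A*Q              %should reproduce the diagonal matrix D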

Actually, the fact that the eigenvectors in Example ex:DiagonalizeSymmetricMatrix are orthogonal is no coincidence. These vectors certainly must be linearly independent (they correspond to distinct eigenvalues). We will see that the fact that the matrix is symmetric implies that the eigenvectors are orthogonal. To prove this we need the following useful fact about symmetric matrices.

Proof
Recall that $\mathbf{x} \cdot \mathbf{y} = \mathbf{x}^T\mathbf{y}$ for all columns $\mathbf{x}$ and $\mathbf{y}$. Because $A^T = A$, we get
$$(A\mathbf{x}) \cdot \mathbf{y} = (A\mathbf{x})^T\mathbf{y} = \mathbf{x}^TA^T\mathbf{y} = \mathbf{x}^T(A\mathbf{y}) = \mathbf{x} \cdot (A\mathbf{y}).$$

Proof
Let $A\mathbf{x} = \lambda\mathbf{x}$ and $A\mathbf{y} = \mu\mathbf{y}$, where $\lambda \neq \mu$. We compute
$$\lambda(\mathbf{x} \cdot \mathbf{y}) = (\lambda\mathbf{x}) \cdot \mathbf{y} = (A\mathbf{x}) \cdot \mathbf{y} = \mathbf{x} \cdot (A\mathbf{y}) = \mathbf{x} \cdot (\mu\mathbf{y}) = \mu(\mathbf{x} \cdot \mathbf{y}).$$
Hence $(\lambda - \mu)(\mathbf{x} \cdot \mathbf{y}) = 0$, and so $\mathbf{x} \cdot \mathbf{y} = 0$ because $\lambda \neq \mu$.

Now the procedure for diagonalizing a symmetric matrix is clear. Find the distinct eigenvalues and find orthonormal bases for each eigenspace (the Gram-Schmidt algorithm may be needed when there is a repeated eigenvalue). Then the set of all these basis vectors is orthonormal (by Theorem th:symmetric_has_ortho_ev) and contains $n$ vectors. Here is an example.
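The same procedure can be carried out in Octave. The sketch below uses a symmetric matrix chosen for illustration (not necessarily the matrix of the example just mentioned); its eigenvalue $1$ is repeated, so the Gram-Schmidt step is genuinely needed.

    %Orthogonally diagonalize a symmetric matrix that has a repeated eigenvalue.
    A = [2 1 1; 1 2 1; 1 1 2];           %eigenvalues are 4 and 1 (1 is repeated)
    q1 = [1;1;1]/norm([1;1;1]);          %unit eigenvector for lambda = 4
    %Two independent eigenvectors for lambda = 1: [1;-1;0] and [1;0;-1].
    %Gram-Schmidt turns them into an orthonormal basis of that eigenspace.
    w1 = [1;-1;0];  w2 = [1;0;-1];
    q2 = w1/norm(w1);
    w2 = w2 - (transpose(q2)*w2)*q2;     %remove the component of w2 along q2
    q3 = w2/norm(w2);
    Q = [q1 q2 q3]                        %orthogonal diagonalizing matrix
    transpose(Q)*A*Q                      %diagonal with entries 4, 1, 1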

If we are willing to replace “diagonal” by “upper triangular” in the real spectral theorem, we can weaken the requirement that $A$ is symmetric to insisting only that $A$ has real eigenvalues.

There is also a lower triangular version of this theorem.

Proof
See Practice Problem prob:SchurChallenge.
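Octave has a built-in schur command that computes a factorization of exactly this type: an orthogonal $U$ and an upper triangular $T$ with $U^TAU = T$ whenever the eigenvalues of $A$ are real. The matrix below is an arbitrary non-symmetric example whose eigenvalues happen to be real.

    %Triangularize a non-symmetric matrix with real eigenvalues.
    A = [1 4; 2 3];              %not symmetric; eigenvalues are 5 and -1
    [U,T] = schur(A)             %U is orthogonal, T is upper triangular
    transpose(U)*U               %should be the identity
    transpose(U)*A*U - T         %should be (numerically) the zero matrix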

The eigenvalues of an upper triangular matrix are displayed along the main diagonal. Because $A$ and $Q^TAQ$ have the same determinant and trace whenever $Q$ is orthogonal (for they are similar matrices), Theorem th:Schur gives: if $A$ is an $n \times n$ matrix with real eigenvalues $\lambda_1, \lambda_2, \dots, \lambda_n$ (repeated according to their multiplicities), then $\det A = \lambda_1\lambda_2\cdots\lambda_n$ and $\operatorname{tr} A = \lambda_1 + \lambda_2 + \cdots + \lambda_n$.

This corollary remains true even if the eigenvalues are not real.
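These two identities are easy to confirm numerically; the matrix below is the same arbitrary example used in the sketch above.

    %det(A) is the product of the eigenvalues; trace(A) is their sum.
    A = [1 4; 2 3];                %eigenvalues are 5 and -1
    [det(A), prod(eig(A))]         %both entries should be -5
    [trace(A), sum(eig(A))]        %both entries should be 4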

Practice Problems

Suppose $A$ is orthogonally diagonalizable. Prove that $A$ is symmetric. (This is the easy direction of the “if and only if” in Theorem th:PrinAxes.)
Normalize the rows to make each of the following matrices orthogonal.
If $P$ is a triangular orthogonal matrix, show that $P$ is diagonal and that all diagonal entries are $1$ or $-1$.
We have $P^T = P^{-1}$; the first step is to show that $P^{-1}$ is lower triangular and also upper triangular, and so is diagonal. But then $P = P^{-1}$, so $P^2 = I$. This implies that the diagonal entries of $P$ are all $\pm 1$.
If $P$ is orthogonal, show that $kP$ is orthogonal if and only if $k = 1$ or $k = -1$.
If the first two rows of an orthogonal matrix are and , find all possible third rows.
For each matrix $A$, find an orthogonal matrix $Q$ such that $Q^TAQ$ is diagonal.
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(challenging problem)
(h)
(challenging problem)
Show that the following are equivalent for a symmetric matrix $A$.
(a)
$A$ is orthogonal.
(b)
$A^2 = I$.
(c)
All eigenvalues of $A$ are $\pm 1$.
For (b) if and only if (c), use Theorem th:detofproduct.
We call matrices $A$ and $B$ orthogonally similar (and write $A \overset{\circ}{\sim} B$) if $B = Q^TAQ$ for an orthogonal matrix $Q$.
(a)
Show that $A \overset{\circ}{\sim} A$ for all $A$; that $A \overset{\circ}{\sim} B$ implies $B \overset{\circ}{\sim} A$; and that $A \overset{\circ}{\sim} B$ and $B \overset{\circ}{\sim} C$ imply $A \overset{\circ}{\sim} C$. (This means that “orthogonally similar” is an equivalence relation.)
(b)
Show that the following are equivalent for two symmetric matrices $A$ and $B$.
(i)
$A$ and $B$ are similar.
(ii)
$A$ and $B$ are orthogonally similar.
(iii)
$A$ and $B$ have the same eigenvalues.
Assume that $A$ and $B$ are orthogonally similar (Problem ex:8_2_12).
(a)
If $A$ and $B$ are invertible, show that $A^{-1}$ and $B^{-1}$ are orthogonally similar.
(b)
Show that $A^2$ and $B^2$ are orthogonally similar.
(c)
Show that, if $A$ is symmetric, so is $B$.
If $A$ is symmetric, show that every eigenvalue of $A$ is nonnegative if and only if $A = B^2$ for some symmetric matrix $B$.
Prove the converse of Theorem th:dotpSymmetric:

If $(A\mathbf{x}) \cdot \mathbf{y} = \mathbf{x} \cdot (A\mathbf{y})$ for all $n$-columns $\mathbf{x}$ and $\mathbf{y}$, then $A$ is symmetric.

Show that every eigenvalue of $A$ is zero if and only if $A$ is nilpotent ($A^k = 0$ for some $k \geq 1$).
If $A$ has real eigenvalues, show that $A = B + C$ where $B$ is symmetric and $C$ is nilpotent.
Let $P$ be an orthogonal matrix.
(a)
Show that $\det P = 1$ or $\det P = -1$.
(b)
Give $2 \times 2$ examples of $P$ such that $\det P = 1$ and $\det P = -1$.
(c)
If $\det P = -1$, show that $I + P$ has no inverse.
$P^T(I + P) = (I + P)^T$.
(d)
If $P$ is $n \times n$ and $\det P \neq (-1)^n$, show that $I - P$ has no inverse.
We call a square matrix $E$ a projection matrix if $E^2 = E = E^T$.
(a)
If $E$ is a projection matrix, show that $P = I - 2E$ is orthogonal and symmetric.
(b)
If $P$ is orthogonal and symmetric, show that $E = \frac{1}{2}(I - P)$ is a projection matrix.
(c)
If $U$ is $n \times m$ and $U^TU = I_m$ (for example, a unit column in $\mathbb{R}^n$), show that $E = UU^T$ is a projection matrix.
A matrix that we obtain from the identity matrix by writing its rows in a different order is called a permutation matrix (see Theorem th:LUPA). Show that every permutation matrix is orthogonal.
If the rows $\mathbf{r}_1, \dots, \mathbf{r}_n$ of the $n \times n$ matrix $A = [a_{ij}]$ are orthogonal, show that the $(i,j)$-entry of $A^{-1}$ is $\frac{a_{ji}}{\|\mathbf{r}_j\|^2}$.
(a)
Let $A$ be an $m \times n$ matrix. Show that the following are equivalent.
i.
$A$ has orthogonal rows.
ii.
$A$ can be factored as $A = DP$, where $D$ is invertible and diagonal and $P$ has orthonormal rows.
iii.
$AA^T$ is an invertible, diagonal matrix.
(b)
Show that an $n \times n$ matrix $A$ has orthogonal rows if and only if $A$ can be factored as $A = DP$, where $P$ is orthogonal and $D$ is diagonal and invertible.
Let $B$ be a skew-symmetric matrix; that is, $B^T = -B$. Assume that $B$ is an $n \times n$ matrix.
(a)
Show that $I + B$ is invertible.
By Theorem thm:004553, it suffices to show that $(I + B)\mathbf{x} = \mathbf{0}$, $\mathbf{x}$ in $\mathbb{R}^n$, implies $\mathbf{x} = \mathbf{0}$. Compute $\mathbf{x} \cdot \mathbf{x} = \mathbf{x}^T\mathbf{x}$, and use the fact that $B\mathbf{x} = -\mathbf{x}$ and $B^T = -B$.
(b)
Show that $P = (I - B)(I + B)^{-1}$ is orthogonal.
(c)
Show that every orthogonal matrix $P$ such that $I + P$ is invertible arises as in part (b) from some skew-symmetric matrix $B$.
Solve $P = (I - B)(I + B)^{-1}$ for $B$.
Show that the following are equivalent for an $n \times n$ matrix $P$.
(a)
$P$ is orthogonal.
(b)
$\|P\mathbf{x}\| = \|\mathbf{x}\|$ for all columns $\mathbf{x}$ in $\mathbb{R}^n$.
(c)
$\|P\mathbf{x} - P\mathbf{y}\| = \|\mathbf{x} - \mathbf{y}\|$ for all columns $\mathbf{x}$, $\mathbf{y}$ in $\mathbb{R}^n$.
(d)
$(P\mathbf{x}) \cdot (P\mathbf{y}) = \mathbf{x} \cdot \mathbf{y}$ for all columns $\mathbf{x}$, $\mathbf{y}$.
For (d) $\Rightarrow$ (a), show that column $i$ of $P$ equals $P\mathbf{e}_i$, where $\mathbf{e}_i$ is column $i$ of the identity matrix.
(a)
Show that $\begin{bmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{bmatrix}$ is an orthogonal matrix.
(b)
Show that every $2 \times 2$ orthogonal matrix has the form $\begin{bmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{bmatrix}$ or $\begin{bmatrix} \cos\theta & \sin\theta \\ \sin\theta & -\cos\theta \end{bmatrix}$ for some angle $\theta$.
If $a^2 + b^2 = 1$, then $a = \cos\theta$ and $b = \sin\theta$ for some angle $\theta$.
Modify the proof of Theorem th:PrinAxes to prove Theorem th:Schur.

Text Source

This section was adapted from Section 8.2 of Keith Nicholson’s Linear Algebra with Applications. (CC-BY-NC-SA)

W. Keith Nicholson, Linear Algebra with Applications, Lyryx 2018, Open Edition, p. 424