Diagonalizable Matrices and Multiplicity
Recall that a diagonal matrix is a matrix containing a zero in every entry except those on the main diagonal. More precisely, if $d_{ij}$ is the $ij$ entry of a diagonal matrix $D$, then $d_{ij} = 0$ unless $i = j$. Such matrices look like the following.
$$
D = \begin{bmatrix} * & 0 & \cdots & 0 \\ 0 & * & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & * \end{bmatrix}
$$
where $*$ is a number which might not be zero.
Diagonal matrices have some nice properties, as we demonstrate below.
Let $M = \begin{bmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \\ 7 & 8 & 9 \end{bmatrix}$ and let $D = \begin{bmatrix} 2 & 0 & 0 \\ 0 & -5 & 0 \\ 0 & 0 & 10 \end{bmatrix}$. Compute $MD$ and $DM$ using Octave.
To use Octave, go to the Sage Math Cell Webpage, copy the code below into the cell,
select OCTAVE as the language, and press EVALUATE.
% Understanding multiplication by a diagonal matrix
M=[1 2 3; 4 5 6; 7 8 9]
D=diag([2 -5 10])
%The function diag(vector) creates a square matrix with diagonal vector and the remaining entries zero. The function diag() can also be used to return the diagonal of a matrix. For example, diag(M) would return [1;5;9].
M*D
D*M
Notice the patterns present in the product matrices. Each row of $DM$ is the same as its corresponding row of $M$ multiplied by the scalar which is the corresponding diagonal element of $D$. In the product $MD$, it is the columns of $M$ that have been multiplied by the diagonal elements. These patterns hold in general for any diagonal matrix, and they are fundamental to understanding diagonalization, the process we discuss below.
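For the matrices $M$ and $D$ in this Exploration, carrying out the multiplications by hand confirms the pattern (these are the same products the Octave code above computes):

```latex
MD = \begin{bmatrix} 1 & 2 & 3 \\ 4 & 5 & 6 \\ 7 & 8 & 9 \end{bmatrix}
\begin{bmatrix} 2 & 0 & 0 \\ 0 & -5 & 0 \\ 0 & 0 & 10 \end{bmatrix}
= \begin{bmatrix} 2 & -10 & 30 \\ 8 & -25 & 60 \\ 14 & -40 & 90 \end{bmatrix},
\qquad
DM = \begin{bmatrix} 2 & 4 & 6 \\ -20 & -25 & -30 \\ 70 & 80 & 90 \end{bmatrix}
```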
Let $A$ be an $n \times n$ matrix. Then $A$ is said to be diagonalizable if there exists an invertible matrix $P$ such that
$$
P^{-1}AP = D
$$
where $D$ is a diagonal matrix. In other words, a matrix $A$ is diagonalizable if it is similar to a diagonal matrix, $D$.
If we are given a matrix $A$ that is diagonalizable, then we can write $P^{-1}AP = D$ for some matrix $P$, or, equivalently,
$$
AP = PD \qquad \text{(eq:understand_diag)}
$$
If we pause to examine Equation (eq:understand_diag), the work that we did in Exploration init:multiplydiag can help us to understand how to find the matrix $P$ that will diagonalize $A$. The product $PD$ is formed by multiplying each column of $P$ by a scalar which is the corresponding element on the diagonal of $D$. To restate this, if $\mathbf{x}_i$ is column $i$ in our matrix $P$, then Equation (eq:understand_diag) tells us that
$$
A\mathbf{x}_i = \lambda_i \mathbf{x}_i \qquad \text{(eq:ev_ew_diag)}
$$
where $\lambda_i$ is the $i$th diagonal element of $D$.
Of course, Equation (eq:ev_ew_diag) is very familiar! We see that if we are able to diagonalize a matrix $A$, the columns of matrix $P$ will be the eigenvectors of $A$, and the corresponding diagonal entries of $D$ will be the corresponding eigenvalues of $A$. This is summed up in the following theorem.
An $n \times n$ matrix $A$ is diagonalizable if and only if there is an invertible matrix $P$ given by
$$
P = \begin{bmatrix} \mathbf{x}_1 & \mathbf{x}_2 & \cdots & \mathbf{x}_n \end{bmatrix}
$$
where the columns $\mathbf{x}_i$ are eigenvectors of $A$.
Moreover, if $A$ is diagonalizable, the corresponding eigenvalues of $A$ are the diagonal entries of the diagonal matrix $D$.
Proof
Suppose $P$ is given as above as an invertible matrix whose columns are eigenvectors of $A$. To show that $A$ is diagonalizable, we will show
$$
AP = PD
$$
which is equivalent to $P^{-1}AP = D$. We have
$$
AP = \begin{bmatrix} A\mathbf{x}_1 & A\mathbf{x}_2 & \cdots & A\mathbf{x}_n \end{bmatrix}
$$
while
$$
PD = \begin{bmatrix} \lambda_1 \mathbf{x}_1 & \lambda_2 \mathbf{x}_2 & \cdots & \lambda_n \mathbf{x}_n \end{bmatrix}
$$
We can complete this half of the proof by comparing columns, and noting that
$$
A\mathbf{x}_i = \lambda_i \mathbf{x}_i
$$
for $i = 1, \dots, n$ since the $\mathbf{x}_i$ are eigenvectors of $A$ and the $\lambda_i$ are corresponding eigenvalues of $A$.
Conversely, suppose $A$ is diagonalizable so that $P^{-1}AP = D$. Let
$$
P = \begin{bmatrix} \mathbf{x}_1 & \mathbf{x}_2 & \cdots & \mathbf{x}_n \end{bmatrix}
$$
where the columns are the vectors $\mathbf{x}_i$ and
$$
D = \begin{bmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \lambda_n \end{bmatrix}
$$
Then
$$
AP = PD = \begin{bmatrix} \lambda_1 \mathbf{x}_1 & \lambda_2 \mathbf{x}_2 & \cdots & \lambda_n \mathbf{x}_n \end{bmatrix}
$$
and so
$$
A\mathbf{x}_i = \lambda_i \mathbf{x}_i
$$
showing the $\mathbf{x}_i$ are eigenvectors of $A$ and the $\lambda_i$ are eigenvalues.
Notice that because the matrix $P$ defined above is invertible, it follows that the set of eigenvectors of $A$, $\{\mathbf{x}_1, \mathbf{x}_2, \dots, \mathbf{x}_n\}$, is a basis of $\mathbb{R}^n$.
We demonstrate the concept given in the above theorem in the next example. Note that not only are the columns of the matrix $P$ formed by eigenvectors, but $P$ must be invertible, and therefore must consist of a linearly independent set of eigenvectors.
Let
$$
A = \begin{bmatrix} 2 & 0 & 0 \\ 1 & 4 & -1 \\ -2 & -4 & 4 \end{bmatrix}
$$
Find an invertible matrix $P$ and a diagonal matrix $D$ such that $P^{-1}AP = D$.
We will use eigenvectors of $A$ as the columns of $P$, and the corresponding eigenvalues of $A$ as the diagonal entries of $D$. The eigenvalues of $A$ are $\lambda_1 = 2$, $\lambda_2 = 2$, and $\lambda_3 = 6$. We leave these computations as exercises, as well as the computations to find a basis for each eigenspace.
One possible basis for $\mathcal{S}_2$, the eigenspace corresponding to $2$, is
$$
\left\{ \begin{bmatrix} -2 \\ 1 \\ 0 \end{bmatrix}, \begin{bmatrix} 1 \\ 0 \\ 1 \end{bmatrix} \right\}
$$
while a basis for $\mathcal{S}_6$ is given by
$$
\left\{ \begin{bmatrix} 0 \\ 1 \\ -2 \end{bmatrix} \right\}
$$
We construct the matrix $P$ by using these basis elements as columns.
$$
P = \begin{bmatrix} -2 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & -2 \end{bmatrix}
$$
You can verify (see the Practice Problems) that
$$
P^{-1} = \frac{1}{4}\begin{bmatrix} -1 & 2 & 1 \\ 2 & 4 & 2 \\ 1 & 2 & -1 \end{bmatrix}
$$
Thus,
$$
P^{-1}AP = \frac{1}{4}\begin{bmatrix} -1 & 2 & 1 \\ 2 & 4 & 2 \\ 1 & 2 & -1 \end{bmatrix} \begin{bmatrix} 2 & 0 & 0 \\ 1 & 4 & -1 \\ -2 & -4 & 4 \end{bmatrix} \begin{bmatrix} -2 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & -2 \end{bmatrix} = \begin{bmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 6 \end{bmatrix}
$$
You can see that the result here is a diagonal matrix $D$ where the entries on the main diagonal are the eigenvalues of $A$. Notice that the eigenvalues on the main diagonal must be in the same order as the corresponding eigenvectors in $P$.
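To see the column pattern from Exploration init:multiplydiag at work in this example, compare $AP$ and $PD$ directly, where $P$ has the eigenvectors above as columns and $D = \mathrm{diag}(2, 2, 6)$; each column of $AP$ is the corresponding column of $P$ scaled by its eigenvalue:

```latex
AP = \begin{bmatrix} 2 & 0 & 0 \\ 1 & 4 & -1 \\ -2 & -4 & 4 \end{bmatrix}
\begin{bmatrix} -2 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & -2 \end{bmatrix}
= \begin{bmatrix} -4 & 2 & 0 \\ 2 & 0 & 6 \\ 0 & 2 & -12 \end{bmatrix}
= PD
```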
It is often easier to work with matrices that are diagonalizable, as the next
Exploration demonstrates.
Let $A = \begin{bmatrix} 2 & 0 & 0 \\ 1 & 4 & -1 \\ -2 & -4 & 4 \end{bmatrix}$ and let $D = \begin{bmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 6 \end{bmatrix}$. Would it be easier to compute $A^5$ or $D^5$ if you had to do so by hand, without a computer? Certainly $D^5$ is easier, due to the number of zero entries! Let’s use Octave to compute both.
To use Octave, go to the Sage Math Cell Webpage, copy the code below into the cell,
select OCTAVE as the language, and press EVALUATE.
% Powers of a diagonal matrix are easy to compute
A=[2 0 0; 1 4 -1; -2 -4 4]
D=diag([2 2 6])
HARD = A^5
EZ = D^5
We see that raising a diagonal matrix to a power amounts to simply raising each diagonal entry to that same power, whereas computing $A^5$ directly requires many more calculations. However, we learned in Example ex:diagonalizematrix that $A$ is similar to $D$, and we can use this to make our computation easier. This is because
$$
A^5 = \left(PDP^{-1}\right)^5 = PDP^{-1}PDP^{-1}PDP^{-1}PDP^{-1}PDP^{-1} = PD^5P^{-1}
$$
since each interior product $P^{-1}P$ cancels to the identity. With this in mind, it is not as daunting to calculate $A^5$ by hand. We can compute the product $PD^5$ quite easily since $D^5$ is diagonal, as we learned in Exploration init:multiplydiag. That leaves just one product of matrices to compute by hand to compute $A^5$. And the savings in work would certainly be more pronounced for larger matrices or for powers larger than 5.
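Concretely, for this example the diagonal power is immediate ($2^5 = 32$ and $6^5 = 7776$), so the only real work left is one matrix product with $P$ and one with $P^{-1}$:

```latex
A^5 = P D^5 P^{-1}, \qquad
D^5 = \begin{bmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 6 \end{bmatrix}^5
= \begin{bmatrix} 32 & 0 & 0 \\ 0 & 32 & 0 \\ 0 & 0 & 7776 \end{bmatrix}
```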
To use Octave, go to the Sage Math Cell Webpage, copy the code below into the cell, select OCTAVE as the language, and press EVALUATE.
% Computing A^5 via the diagonalization A = P*D*inv(P)
% P has the eigenvectors of A as its columns (see Example ex:diagonalizematrix)
P=[-2 1 0; 1 0 1; 0 1 -2]
D=diag([2 2 6])
P*D^5*inv(P)
In Exploration exp:motivate_diagonalization, because matrix $A$ was diagonalizable, we were able to cut down on computations. When we chose to work with $P$ and $D$ instead of $A$ we worked with the eigenvalues and eigenvectors of $A$. Each column of $P$ is an eigenvector of $A$, and so we repeatedly made use of the following theorem (with $k = 5$).
Let $A$ be an $n \times n$ matrix and suppose $A\mathbf{x} = \lambda\mathbf{x}$. Then for any positive integer $k$,
$$
A^k\mathbf{x} = \lambda^k\mathbf{x}
$$
Proof
We prove this theorem by induction on $k$. Clearly $A^k\mathbf{x} = \lambda^k\mathbf{x}$ holds when $k = 1$, as that was given. For the inductive step, suppose that we know $A^{k-1}\mathbf{x} = \lambda^{k-1}\mathbf{x}$. Then
$$
A^k\mathbf{x} = A\left(A^{k-1}\mathbf{x}\right) = A\left(\lambda^{k-1}\mathbf{x}\right) = \lambda^{k-1}A\mathbf{x} = \lambda^{k-1}\lambda\mathbf{x} = \lambda^k\mathbf{x}
$$
as desired.
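For instance, applying this with an eigenvector for the eigenvalue $2$ of the matrix $A$ from the earlier example gives the fifth power with no further matrix arithmetic:

```latex
A\mathbf{x} = 2\mathbf{x} \quad \Longrightarrow \quad A^5\mathbf{x} = 2^5\mathbf{x} = 32\,\mathbf{x}
```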
Matrix $A$ from Example ex:diagonalizematrix and Exploration exp:motivate_diagonalization had a repeated eigenvalue of 2. The next theorem and corollary show that matrices which have distinct eigenvalues (where none are repeated) have desirable properties.
Let $A$ be an $n \times n$ matrix, and suppose that $A$ has distinct eigenvalues $\lambda_1, \lambda_2, \dots, \lambda_k$. For each $i$, let $\mathbf{x}_i$ be a $\lambda_i$-eigenvector of $A$. Then $\{\mathbf{x}_1, \mathbf{x}_2, \dots, \mathbf{x}_k\}$ is linearly independent.
Proof
We prove this by induction on $m$, the number of vectors in the set. If $m = 1$, then $\{\mathbf{x}_1\}$ is a linearly independent set because $\mathbf{x}_1 \neq \mathbf{0}$. In general, suppose we have established that the theorem is true for some $m \geq 1$. Given eigenvectors $\mathbf{x}_1, \dots, \mathbf{x}_{m+1}$, suppose
$$
c_1\mathbf{x}_1 + c_2\mathbf{x}_2 + \dots + c_{m+1}\mathbf{x}_{m+1} = \mathbf{0} \qquad \text{(eq:thm_proof_5_5_4_1)}
$$
We must show that each $c_i = 0$. Multiply both sides of (eq:thm_proof_5_5_4_1) on the left by $A$ and use the fact that $A\mathbf{x}_i = \lambda_i\mathbf{x}_i$ to get
$$
c_1\lambda_1\mathbf{x}_1 + c_2\lambda_2\mathbf{x}_2 + \dots + c_{m+1}\lambda_{m+1}\mathbf{x}_{m+1} = \mathbf{0} \qquad \text{(eq:thm_proof_5_5_4_2)}
$$
If we multiply (eq:thm_proof_5_5_4_1) by $\lambda_1$ and subtract the result from (eq:thm_proof_5_5_4_2), the first terms cancel and we obtain
$$
c_2(\lambda_2 - \lambda_1)\mathbf{x}_2 + c_3(\lambda_3 - \lambda_1)\mathbf{x}_3 + \dots + c_{m+1}(\lambda_{m+1} - \lambda_1)\mathbf{x}_{m+1} = \mathbf{0}
$$
Since $\mathbf{x}_2, \dots, \mathbf{x}_{m+1}$ correspond to distinct eigenvalues $\lambda_2, \dots, \lambda_{m+1}$, the set $\{\mathbf{x}_2, \dots, \mathbf{x}_{m+1}\}$ is linearly independent by the induction hypothesis. Hence,
$$
c_2(\lambda_2 - \lambda_1) = 0, \quad c_3(\lambda_3 - \lambda_1) = 0, \quad \dots, \quad c_{m+1}(\lambda_{m+1} - \lambda_1) = 0
$$
and so $c_2 = c_3 = \dots = c_{m+1} = 0$ because the $\lambda_i$ are distinct. It follows that (eq:thm_proof_5_5_4_1) becomes $c_1\mathbf{x}_1 = \mathbf{0}$, which implies that $c_1 = 0$ because $\mathbf{x}_1 \neq \mathbf{0}$, and the proof is complete.
The corollary that follows from this theorem gives a useful tool in determining if $A$ is diagonalizable.
Let $A$ be an $n \times n$ matrix and suppose it has $n$ distinct eigenvalues. Then it follows that $A$ is diagonalizable.
Note that Corollary th:distincteigenvalues is NOT an “if and only if” statement. This means that if $A$ has repeated eigenvalues it is still sometimes possible to diagonalize $A$, as seen in Example ex:diagonalizematrix.
If we are able to diagonalize $A$, say $A = PDP^{-1}$, we say that $PDP^{-1}$ is an eigenvalue decomposition of $A$.
Not every matrix has an eigenvalue decomposition. Sometimes we cannot find an invertible matrix $P$ such that $P^{-1}AP = D$. Consider the following example.
Let $A$ be the following matrix.
If possible, find an invertible matrix $P$ and a diagonal matrix $D$ so that $P^{-1}AP = D$.
We see immediately (how?) the eigenvalues of $A$. To find $P$, the next step would be to find a basis for the corresponding eigenspace $\mathcal{S}_\lambda$. We solve the equation $(A - \lambda I)\mathbf{x} = \mathbf{0}$. Writing this equation as an augmented matrix, we already have a matrix in row echelon form, and we see that the eigenvectors in $\mathcal{S}_\lambda$ are of the form
$$
\begin{bmatrix} t \\ 0 \end{bmatrix} = t\begin{bmatrix} 1 \\ 0 \end{bmatrix}
$$
so a basis for the eigenspace is given by $\left\{ \begin{bmatrix} 1 \\ 0 \end{bmatrix} \right\}$. It is easy to see that we cannot form an invertible matrix $P$, because any two eigenvectors will be of the form $\begin{bmatrix} t \\ 0 \end{bmatrix}$, and so the second row of $P$ would be a row of zeros, and $P$ could not be invertible. Hence $A$ cannot be diagonalized.
We saw earlier in Corollary th:distincteigenvalues that an $n \times n$ matrix with $n$ distinct eigenvalues is diagonalizable. It turns out that there are other useful diagonalizability tests.
Recall that the algebraic multiplicity of an eigenvalue $\lambda$ is the number of times that it occurs as a root of the characteristic polynomial.
The geometric multiplicity of an eigenvalue $\lambda$ is the dimension of the corresponding eigenspace $\mathcal{S}_\lambda$.
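For example, the matrix $A$ of Example ex:diagonalizematrix has characteristic polynomial $(z-2)^2(z-6)$. The eigenvalue $2$ occurs twice as a root, so its algebraic multiplicity is $2$; its eigenspace has a basis of two vectors, so its geometric multiplicity is also $2$:

```latex
\det(zI - A) = (z - 2)^2 (z - 6), \qquad
\lambda = 2: \;\; \text{algebraic multiplicity} = 2, \;\;
\text{geometric multiplicity} = \dim(\mathcal{S}_2) = 2
```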
Consider now the following lemma.
Let $A$ be an $n \times n$ matrix, and let $\mathcal{S}_{\lambda_1}$ be the eigenspace corresponding to the eigenvalue $\lambda_1$ which has algebraic multiplicity $m$. Then
$$
\dim\left(\mathcal{S}_{\lambda_1}\right) \leq m
$$
In other words, the geometric multiplicity of an eigenvalue is less than or equal to the algebraic multiplicity of that same eigenvalue.
Proof
Let $k$ be the geometric multiplicity of $\lambda_1$, i.e., $k = \dim\left(\mathcal{S}_{\lambda_1}\right)$. Suppose $\{\mathbf{x}_1, \dots, \mathbf{x}_k\}$ is a basis for the eigenspace $\mathcal{S}_{\lambda_1}$. Let $P$ be any invertible matrix having $\mathbf{x}_1, \dots, \mathbf{x}_k$ as its first $k$ columns, say
$$
P = \begin{bmatrix} \mathbf{x}_1 & \cdots & \mathbf{x}_k & \mathbf{x}_{k+1} & \cdots & \mathbf{x}_n \end{bmatrix}
$$
In block form we may write
$$
P = \begin{bmatrix} B & C \end{bmatrix}, \qquad P^{-1} = \begin{bmatrix} E \\ F \end{bmatrix}
$$
where $B$ is $n \times k$, $C$ is $n \times (n-k)$, $E$ is $k \times n$, and $F$ is $(n-k) \times n$. We observe $AB = \lambda_1 B$. This implies
$$
P^{-1}AP = \begin{bmatrix} E \\ F \end{bmatrix} A \begin{bmatrix} B & C \end{bmatrix}
= \begin{bmatrix} \lambda_1 EB & EAC \\ \lambda_1 FB & FAC \end{bmatrix}
$$
Therefore, since $EB = I_k$ and $FB = 0$ (both follow from $P^{-1}P = I_n$),
$$
P^{-1}AP = \begin{bmatrix} \lambda_1 I_k & EAC \\ 0 & FAC \end{bmatrix}
$$
We finish the proof by comparing the characteristic polynomials on both sides of this equation, and making use of the fact that similar matrices have the same characteristic polynomials.
We see that the characteristic polynomial of $A$ has $(z - \lambda_1)^k$ as a factor. This tells us that the algebraic multiplicity of $\lambda_1$ is at least $k$, proving the desired inequality.
This result tells us that if $\lambda$ is an eigenvalue of $A$, then the number of linearly independent $\lambda$-eigenvectors is never more than the algebraic multiplicity of $\lambda$. We now use this fact to provide a useful diagonalizability condition.
Let $A$ be an $n \times n$ matrix. Then $A$ is diagonalizable if and only if for each eigenvalue $\lambda$ of $A$, the algebraic multiplicity of $\lambda$ is equal to the geometric multiplicity of $\lambda$.
Proof
Suppose $A$ is diagonalizable and let $\lambda_1, \dots, \lambda_t$ be the distinct eigenvalues of $A$, with algebraic multiplicities $m_1, \dots, m_t$, respectively, and geometric multiplicities $k_1, \dots, k_t$, respectively. Since $A$ is diagonalizable, Theorem th:eigenvectorsanddiagonalizable implies that $k_1 + k_2 + \dots + k_t = n$. By applying Lemma lemma:dimeigenspace $t$ times, we have
$$
n = k_1 + k_2 + \dots + k_t \leq m_1 + m_2 + \dots + m_t = n
$$
which is only possible if $k_i = m_i$ for $i = 1, \dots, t$.
Conversely, if the geometric multiplicity equals the algebraic multiplicity of each eigenvalue, then obtaining a basis for each eigenspace yields $n$ eigenvectors. Applying Theorem th:linindepeigenvectors, we know that these $n$ eigenvectors are linearly independent, so Theorem th:eigenvectorsanddiagonalizable implies that $A$ is diagonalizable.
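As a quick check of this test on a small matrix (chosen here for illustration; it does not appear elsewhere in this section), consider the upper triangular matrix $B$ below. Its characteristic polynomial is $(z-3)^2$, so $\lambda = 3$ has algebraic multiplicity $2$, but $B - 3I$ has rank $1$, so the eigenspace is one-dimensional and the geometric multiplicity is $1$. Since the multiplicities disagree, $B$ is not diagonalizable:

```latex
B = \begin{bmatrix} 3 & 1 \\ 0 & 3 \end{bmatrix}, \qquad
B - 3I = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}, \qquad
\dim\left(\mathcal{S}_3\right) = 1 < 2
```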
Practice Problems
In this exercise you will “fill in the details” of Example ex:diagonalizematrix.
Find the eigenvalues of the matrix
$$
A = \begin{bmatrix} 2 & 0 & 0 \\ 1 & 4 & -1 \\ -2 & -4 & 4 \end{bmatrix}
$$
Find a basis for each eigenspace of the matrix $A$.
Compute the inverse of
$$
P = \begin{bmatrix} -2 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & -2 \end{bmatrix}
$$
Compute $P^{-1}AP$.
Show that computing the inverse of $P$ is not really necessary by comparing the products $AP$ and $PD$.
In each case, decide whether the matrix is diagonalizable. If so, find $P$ such that $P^{-1}AP$ is diagonal.
(a)
Diagonalizable [correct] / Not Diagonalizable
(b)
Diagonalizable [correct] / Not Diagonalizable
(c)
Diagonalizable [correct] / Not Diagonalizable
(d)
Diagonalizable / Not Diagonalizable
Let $A$ denote an upper triangular matrix.
(a)
If all the main diagonal entries of $A$ are distinct, show that $A$ is diagonalizable.
(b)
If all the main diagonal entries of $A$ are equal, show that $A$ is diagonalizable only if it is already diagonal. Click the arrow to see the answer.
The eigenvalues of $A$ are all equal (they are the diagonal elements), say each equals $\lambda$, so if $P^{-1}AP = D$ is diagonal, then $D = \lambda I$. Hence $A = PDP^{-1} = P(\lambda I)P^{-1} = \lambda I$.
(c)
Show that is diagonalizable but that is not diagonalizable.
Let $A$ be a diagonalizable $n \times n$ matrix with eigenvalues $\lambda_1, \lambda_2, \dots, \lambda_n$ (including multiplicities). Show that:
(a)
$\det A = \lambda_1 \lambda_2 \cdots \lambda_n$
(b)
$\operatorname{tr} A = \lambda_1 + \lambda_2 + \dots + \lambda_n$
(a)
Show that two diagonalizable matrices are similar if and only if they have the same eigenvalues with the same multiplicities.