행렬 부호 함수

수학에서 행렬 부호 함수는 복합 부호 함수와 유사한 제곱 행렬의 행렬 함수다.^[1]

1971년 케임브리지 대학의 기술보고서에 소개되었으며, 이후 1980년 학술지에 발표되었다.^[2]^[3]

정의

행렬 부호 함수는 복합 부호 함수의 일반화다.

$\operatorname {csgn}(z)={\begin{case}1&{\text{}}\mathrm {Re}(z)0,\-1&{\text{}}}}}}}$

매트릭스 평가 아날로그 $\operatorname {csgn} (A)$ $\operatorname {csgn} (A)$ ( $\operatorname {csgn} (A)$ $\operatorname {csgn} (A)$ ${\displaystyle \operatorname {csgn} (A$ 부호 함수는 분석적이지 않지만, 가상 축에 고유값이 없는 모든 행렬에 대해 잘 정의되어 있다(예: Jordan-form-based 정의(파생물이 모두 0인 경우).

특성.

정리: $A\in \mathbb {C} ^{n\times n}$ $A\in \mathbb {C} ^{n\times n}$ C $A\in \mathbb {C} ^{n\times n}$ × $A\in \mathbb {C} ^{n\times n}$ ${\$ n $A\in \mathbb {C} ^{n\times n}$ $\operatorname {csgn} (A)^{2}=I$ ( $A$ = $\operatorname {csgn} (A)^{2}=I$ I {\ $displaystyle \operatorname {csgn}^{2}=I$ ^[1] .

정리: $A\in \mathbb {C} ^{n\times n}$ $A\in \mathbb {C} ^{n\times n}$ $A\in \mathbb {C} ^{n\times n}$ n $A\in \mathbb {C} ^{n\times n}$ × $A\in \mathbb {C} ^{n\times n}$ ${\$ n $A\in \mathbb {C} ^{n\times n}$ $\operatorname {csgn} (A)$ ( ( $)$ {\ $displaysty \operatorname {csgn}$ ( $A)}$ 은(는 $\operatorname {csgn} (A)$ ) 대각선으로 가능하며 $\pm 1$ 은 $\pm 1$ ± 1 ${\displaysty \pm1}$ 이다 $\pm 1$ ^[1]

정리:Let $A\in \mathbb {C} ^{n\times n}$ , then $(I+\operatorname {csgn} (A))/2$ is a projector onto the invariant subspace associated with the eigenvalues in the right-half plane, and analogously for ${\displaystyle (I-\operatorname$ ${csgn}(A)/2}$ 및 왼쪽 절반 $(I-\operatorname {csgn} (A))/2$ 평면.^[1]

정리: $A\in \mathbb {C} ^{n\times n}$ $A\in \mathbb {C} ^{n\times n}$ $A\in \mathbb {C} ^{n\times n}$ n $A\in \mathbb {C} ^{n\times n}$ × $A\in \mathbb {C} ^{n\times n}$ ${\$ $A=P{\begin{bmatrix}J_{+}&0\\0&J_{-}\end{bmatrix}}P^{-1}$ = $A=P{\begin{bmatrix}J_{+}&0\\0&J_{-}\end{bmatrix}}P^{-1}$ [ $A=P{\begin{bmatrix}J_{+}&0\\0&J_{-}\end{bmatrix}}P^{-1}$ + $A=P{\begin{bmatrix}J_{+}&0\\0&J_{-}\end{bmatrix}}P^{-1}$ $A=P{\begin{bmatrix}J_{+}&0\\0&J_{-}\end{bmatrix}}P^{-1}$ - $A=P{\begin{bmatrix}J_{+}&0\\0&J_{-}\end{bmatrix}}P^{-1}$ ] $A=P{\begin{bmatrix}J_{+}&0\\0&J_{-}\end{bmatrix}}P^{-1}$ - $A=P{\begin{bmatrix}J_{+}&0\\0&J_{-}\end{bmatrix}}P^{-1}$ ${\$ 을(으)로 한다. $J_{+}&0\\0&$ $J_{-}\end{bmatrix}{bmatrix$ }} $P^{-1}$ 는 J $J_{+}$ + ${\$ 는 양의 실체를 가진 고유값에 해당하고 $J_{+}$ , $J_{-}$ - $J_{-}$ {\ $displaysty J_{-}$ 는 $J_{-}$ 음의 실체를 가진 고유값에 해당하도록 요르단 분해물이다 $A=P{\begin{bmatrix}J_{+}&0\\0&J_{-}\end{bmatrix}}P^{-1}$ .그러면 $\operatorname {csgn} (A)=P{\begin{bmatrix}I_{+}&0\\0&-I_{-}\end{bmatrix}}P^{-1}$ $\operatorname {csgn} (A)=P{\begin{bmatrix}I_{+}&0\\0&-I_{-}\end{bmatrix}}P^{-1}$ ( $\operatorname {csgn} (A)=P{\begin{bmatrix}I_{+}&0\\0&-I_{-}\end{bmatrix}}P^{-1}$ ) $\operatorname {csgn} (A)=P{\begin{bmatrix}I_{+}&0\\0&-I_{-}\end{bmatrix}}P^{-1}$ = P $\operatorname {csgn} (A)=P{\begin{bmatrix}I_{+}&0\\0&-I_{-}\end{bmatrix}}P^{-1}$ [ $\operatorname {csgn} (A)=P{\begin{bmatrix}I_{+}&0\\0&-I_{-}\end{bmatrix}}P^{-1}$ + $\operatorname {csgn} (A)=P{\begin{bmatrix}I_{+}&0\\0&-I_{-}\end{bmatrix}}P^{-1}$ - $\operatorname {csgn} (A)=P{\begin{bmatrix}I_{+}&0\\0&-I_{-}\end{bmatrix}}P^{-1}$ - $\operatorname {csgn} (A)=P{\begin{bmatrix}I_{+}&0\\0&-I_{-}\end{bmatrix}}P^{-1}$ $\operatorname {csgn} (A)=P{\begin{bmatrix}I_{+}&0\\0&-I_{-}\end{bmatrix}}P^{-1}$ - $\operatorname {csgn} (A)=P{\begin{bmatrix}I_{+}&0\\0&-I_{-}\end{bmatrix}}P^{-1}$ ${\$ $P{\begin{bmatrix}$ $I_{+}&0\\0&-I_{-}\end{bmatrix}}P^{-1}}$ , where $I_{+}$ and $I_{-}$ are identity matrices of sizes corresponding to $J_{+}$ and $J_{-}$ , respectively.^[1]

계산 방법

함수는 매트릭스 함수에 대한 일반적인 방법으로 계산할 수 있지만, 전문화된 방법도 있다.

뉴턴 반복

뉴턴 반복은 $\operatorname {csgn} (x)={\sqrt {x^{2}}}/x$ $\operatorname {csgn} (x)={\sqrt {x^{2}}}/x$ ) $\operatorname {csgn} (x)={\sqrt {x^{2}}}/x$ = $\operatorname {csgn} (x)={\sqrt {x^{2}}}/x$ $\operatorname {csgn} (x)={\sqrt {x^{2}}}/x$ / x ${\displaystyle \operatorname {csgn}(x)={\sqrt{x^{2}}/x$ 을(를) 관찰함으로써 도출할 수 있으며, 행렬의 면에서는 $\operatorname {csgn} (A)=A^{-1}{\sqrt {A^{2}}}$ ) = $\operatorname {csgn} (A)=A^{-1}{\sqrt {A^{2}}}$ - $\operatorname {csgn} (A)=A^{-1}{\sqrt {A^{2}}}$ ${\$ \ $\operatorname$ {coperatorname { $coratorname$ }( $A)$ 로 기록할 수 있다 $.$ $A^{-1}{\sqrt{A^{2}}:$ 매트릭스 제곱근을 사용하는 곳 $\operatorname {csgn} (A)=A^{-1}{\sqrt {A^{2}}}$ If we apply the Babylonian method to compute the square root of the matrix $A^{2}$ , that is, the iteration ${\textstyle X_{k+1}={\frac {1}{2}}\left(X_{k}+AX_{k}^{-1}\right)}$ , and define the new iterate $Z_{k$ $=A^{-1}X_{k$ 반복에 도착함

$Z_{k+1}={\frac {1}{2}}\left(Z_{k}+Z_{k}^{-1}\right)$ $Z_{k+1}={\frac {1}{2}}\left(Z_{k}+Z_{k}^{-1}\right)$ + $Z_{k+1}={\frac {1}{2}}\left(Z_{k}+Z_{k}^{-1}\right)$ = 1 $Z_{k+1}={\frac {1}{2}}\left(Z_{k}+Z_{k}^{-1}\right)$ ( Z $Z_{k+1}={\frac {1}{2}}\left(Z_{k}+Z_{k}^{-1}\right)$ + $Z_{k+1}={\frac {1}{2}}\left(Z_{k}+Z_{k}^{-1}\right)$ $Z_{k+1}={\frac {1}{2}}\left(Z_{k}+Z_{k}^{-1}\right)$ - $Z_{k+1}={\frac {1}{2}}\left(Z_{k}+Z_{k}^{-1}\right)$ ) ${\$ 1}:{ $2}}\좌측(Z_{k}+Z_{k}^{-1}\오른쪽$

여기서 일반적으로 $Z_{0}=A$ $Z_{0}=A$ = $Z_{0}=A$ ${\displaystyle Z_{0}=A$ 수렴은 글로벌이며 로컬로 2차이다.^[1]^[2]

뉴턴 반복은 반복체 $Z_{k}$ $Z_{k}$ ${\$ 의 명시적인 역류를 사용한다 $Z_{k}$

뉴턴-슐츠 반복

To avoid the need of an explicit inverse used in the Newton iteration, the inverse can be approximated with one step of the Newton iteration for the inverse, $Z_{k}^{-1}\approx Z_{k}\left(2I-Z_{k}^{2}\right)$ , derived by Schulz(de) in 1933.^[4]이 근사치를 이전 방법으로 대체하면, 새로운 방법은

$Z_{k+1}={\frac {1}{2}}Z_{k}\left(3I-Z_{k}^{2}\right)$ $Z_{k+1}={\frac {1}{2}}Z_{k}\left(3I-Z_{k}^{2}\right)$ + 1 $Z_{k+1}={\frac {1}{2}}Z_{k}\left(3I-Z_{k}^{2}\right)$ = $Z_{k+1}={\frac {1}{2}}Z_{k}\left(3I-Z_{k}^{2}\right)$ 2 $Z_{k+1}={\frac {1}{2}}Z_{k}\left(3I-Z_{k}^{2}\right)$ $Z_{k+1}={\frac {1}{2}}Z_{k}\left(3I-Z_{k}^{2}\right)$ ( $Z_{k+1}={\frac {1}{2}}Z_{k}\left(3I-Z_{k}^{2}\right)$ $Z_{k+1}={\frac {1}{2}}Z_{k}\left(3I-Z_{k}^{2}\right)$ - Z $Z_{k+1}={\frac {1}{2}}Z_{k}\left(3I-Z_{k}^{2}\right)$ $Z_{k+1}={\frac {1}{2}}Z_{k}\left(3I-Z_{k}^{2}\right)$ ) ${\$ 2}} $Z_{k}\왼쪽(3I-Z_{k}^{2}\오른쪽)}$ .

수렴은 (여전히) 이차적이지만, 국부적 $\|I-A^{2}\|<1$ i $\|I-A^{2}\|<1$ - $\|I-A^{2}\|<1$ $\|I-A^{2}\|<1$ < $\|I-A^{2}\|<1$ {\ $displaystyle$ \ $I-A^{2}\$ <1}에 대해서만 보장됨) $\|I-A^{2}\|<1$ ^[1]

적용들

실베스터 방정식의 해법

정리:^[2]^[3]Let $A,B,C\in \mathbb {R} ^{n\times n}$ and assume that $A$ and $B$ are stable, then the unique solution to the Sylvester equation, $AX+XB=C$ , is given by $X$ such that

${\begin{bmatrix}-I&2X\\0>I\end{bmatrix}=\operatorname {csgn}\좌측({\begin{bmatrix}A&-C\\0&-B\end{bmatrix}}오른쪽).$

교정 스케치:결과는 유사성 변환에서 나온다.

${\begin{bmatrix}A&-C\0&-B\end{bmatrix}={\begin{bmatrix}I&X\\0&I\end{bmatrix}{\begin{bmatrix}A&0\\0&-B\end{bmatrix}{\begin{bmatrix}I&X\\0&I\end{bmatrix}^{-1$

그 이후

$\operatorname {csgn} \left({\begin{bmatrix}A&-C\\0&-B\end{bmatrix}\right)={\begin{bmatrix}I&X\\0&I\end{bmatrix}{\begin{bmatrix}I&0\\0&I\end{bmatrix}{\begin{bmatrix}I&-X\\0&I\end{bmatrix},$

$A$ $A$ 및 $A$ $B$ $B$ 의 안정성 때문에 $B$

그 정리는 자연스레 랴푸노프 방정식에도 적용할 수 있다. $그러나$ 구조 때문에 뉴턴 반복은 A $A$ 및 $A^{T}$ $A^{T}$ ${\$ A $^{T}$ 의 역만 포함하도록 단순화된다 $A^{T}$

대수 리카티 방정식의 해법

There is a similar result applicable to the algebraic Riccati equation, $A^{H}P+PA-PFP+Q=0$ .^[1]^[2] Define $V,W\in \mathbb {C} ^{2n\times n}$ as

${\begin{bmatrix}V&W\end{bmatrix}=\operatorname {csgn}\좌측({\begin{bmatrix}A^{H}&Q\\F&-A\end{bmatrix}\right)-{\begin{bmatrix}I&0\\0&I\end{bmatrix}.$

$A-FP$ $F,Q\in \mathbb {C} ^{n\times n}$ , $F,Q\in \mathbb {C} ^{n\times n}$ $F,Q\in \mathbb {C} ^{n\times n}$ $F,Q\in \mathbb {C} ^{n\times n}$ n $F,Q\in \mathbb {C} ^{n\times n}$ n ${\$ }} ^{ $A-FP$ \time n $}}}$ 이(가) 에르미트인이며 $F,Q\in \mathbb {C} ^{n\times n}$ 고유한 안정화 솔루션이 존재한다는 가정 $A-FP$ 하에 $A-FP$ A - F P {\displaystystylease $A-FP}$ 이 $A-FP$ 안정화되며, 그 솔루션은 과의 결정적이지만 일관성이 있다.

$VP=-W.$

교정 스케치:유사성 변환

${\begin{bmatrix}A^{H}&Q\\F&-A\end{bmatrix}={\begin{bmatrix}P&-I\\I&0\end{bmatrix}{\begin{bmatrix}(-A-FP)&-F\\0&(A-FP)\end{bmatrix}{begin{bmatrix}P&-I\\\\\\I&0\end{bmatrix}^{-1$

$A-FP$ - $A-FP$ $A-FP$ $A-FP$ 의 안정성은 $A-FP$

$\left(\operatorname {csgn}) \left({\begin{bmatrix}A^{H}&Q\\F&-A\end{bmatrix}\right)-{\begin{bmatrix}I&0\\0&I\end{bmatrix}\right){\begin{bmatrix}X&-I\\I&0\end{bmatrix}={\begin{bmatrix}X&-I\\I&0\end{bmatrix}{\begin{bmatrix}0&&Y\\\0&-2I\end{bmatrix},$

일부 행렬 $Y\in \mathbb {C} ^{n\times n}$ C $Y\in \mathbb {C} ^{n\times n}$ × $Y\in \mathbb {C} ^{n\times n}$ {\ $displaystyle Y\in \mathb {C}$ ^{ $n\time$ n $}}$ 에 대해 $Y\in \mathbb {C} ^{n\times n}$

행렬 제곱근 계산

매트릭스의 제곱근에 대한 Denman-Bevers $A-PIP=0$ 은 $A-PIP=0$ A - $A-PIP=0$ I $A-PIP=0$ = $A-PIP=0$ $[\displaystyle$ A-PIP $=0}$ 이 $A-PIP=0$ (가) 퇴행 대수 리카티^[3] 방정식이며, 정의상 $솔루션$ P $P$ 이 $P$ (가) $A$ ${\style A$ 의 제곱근이라는 것을 알아냄으로써 매트릭스 부호 함수에 대한 뉴턴 반복에서 도출할 수 있다.

참조

^ ^a ^b ^c ^d ^e ^f ^g ^h Higham, Nicholas J. (2008). Functions of matrices : theory and computation. Society for Industrial and Applied Mathematics. Philadelphia, Pa.: Society for Industrial and Applied Mathematics (SIAM, 3600 Market Street, Floor 6, Philadelphia, PA 19104). ISBN 978-0-89871-777-8. OCLC 693957820.
^ ^a ^b ^c ^d Roberts, J. D. "Linear model reduction and solution of the algebraic Riccati equation by use of the sign function". International Journal of Control. 32 (4): 677–687. doi:10.1080/00207178008922881. ISSN 0020-7179.
^ ^a ^b ^c Denman, Eugene D.; Beavers, Alex N. (1976). "The matrix sign function and computations in systems". Applied Mathematics and Computation. 2 (1): 63–94. doi:10.1016/0096-3003(76)90020-5. ISSN 0096-3003.
^ Schulz, Günther (1933). "Iterative Berechung der reziproken Matrix". ZAMM - Journal of Applied Mathematics and Mechanics / Zeitschrift für Angewandte Mathematik und Mechanik. 13 (1): 57–59. doi:10.1002/zamm.19330130111. ISSN 1521-4001.

[:0-1] ^ ^a ^b ^c ^d ^e ^f ^g ^h Higham, Nicholas J. (2008). Functions of matrices : theory and computation. Society for Industrial and Applied Mathematics. Philadelphia, Pa.: Society for Industrial and Applied Mathematics (SIAM, 3600 Market Street, Floor 6, Philadelphia, PA 19104). ISBN 978-0-89871-777-8. OCLC 693957820.

[:1-2] Roberts, J. D. "Linear model reduction and solution of the algebraic Riccati equation by use of the sign function". International Journal of Control. 32 (4): 677–687. doi:10.1080/00207178008922881. ISSN 0020-7179.

[:2-3] Denman, Eugene D.; Beavers, Alex N. (1976). "The matrix sign function and computations in systems". Applied Mathematics and Computation. 2 (1): 63–94. doi:10.1016/0096-3003(76)90020-5. ISSN 0096-3003.

[4] Schulz, Günther (1933). "Iterative Berechung der reziproken Matrix". ZAMM - Journal of Applied Mathematics and Mechanics / Zeitschrift für Angewandte Mathematik und Mechanik. 13 (1): 57–59. doi:10.1002/zamm.19330130111. ISSN 1521-4001.

[1]

[2]

[3]

[4]

Search