f-확장

확률론에서 ƒ-diversity는 두 확률분포P와 Q의 차이를 측정하는 함수 D_f(PQ)이다.직관이 P와 Q가^{[citation needed]} 주는 승산비의 함수 f에 의해 가중된 차이를 평균으로 생각할 수 있도록 돕는다.

이러한 다이버전스는 알프레드 레니에^[1] 의해 그가 잘 알려진 레니 엔트로피를 소개한 같은 신문에서 소개되었다.그는 마르코프 프로세스에서 이러한 다이버전스가 감소한다는 것을 증명했다.f-다이버겐은 Csiszarr(1963 ), 모리모토(1963), 알리&실비(1966)에 의해 더 독립적으로 연구되었고, 때로는 Csiszar ƒ-Divergenes, Csar-Morimotorgenes 또는 Ali-Silvey 거리로 알려져 있다.

정의

P와 Q가 Q에 대해 절대적으로 연속되도록 공간 Ω에 대한 두 개의 확률 분포가 되도록 한다.그런 다음, f(1) = 0과 같은 볼록함수 f의 경우 Q로부터 P의 f-diversion은 다음과 같이 정의된다.

D_{f}(P\병렬 Q)\equiv \int _{\Oomega }f\왼쪽({\frac {dP}{dQ}\오른쪽)\,dQ.

P와 Q가 모두 Ω의 기준 분포 μ에 대해 절대적으로 연속적인 경우, 이들의 확률 밀도 p와 q는 dP = p dμ, dQ = q dμ를 만족한다.이 경우 f-diversity는 다음과 같이 기록할 수 있다.

D_{f}(P\parallel Q)=\int _{\Oomega }f\왼쪽({\frac {p(x)}{q(x)}}\right)q(x)\,d\mu(x).

f-divergenes는 Taylor 시리즈를 사용하여 표현할 수 있으며, 기형 거리의 가중 합계를 사용하여 다시 쓸 수 있다(Nielsen & Nock(2013년).

f-divergenes의 예

KL-다이버전스, 헬링거 거리, 총변동거리와 같은 많은 일반적인 다이버전스들은 f-다이버전스의 특별한 경우로서 f의 특정한 선택과 일치한다.다음 표에는 확률 분포와 해당 분포가 일치하는 f 함수(cf) 사이의 많은 공통 분해가 나열되어 있다.Liese & Vajda(2006).

발산	해당 f(t)
KL-디버전스	$t\displaystyle t\log t}$
역 KL-다이버전스	$-\log t$
제곱 헬링거 거리	${\displaystyle({\sqrt{t}-1)^{2},2(1-{\sqrt{t})}$
총변동거리	${\frac {1}{2}} t-1 \,$
Pearson $\chi ^{2}$ $\chi ^{2}$ ${\$ }} $\chi ^{2}$ - 디버전스 $\chi ^{2}$	$(t-1)^{2},\,t^{2}-1,\,t^{2}-t$
네이맨 $\chi ^{2}$ $\chi ^{2}$ ${\$ }} $\chi ^{2}$ -디버전스 $\chi ^{2}$ (역 피어슨)	${\frac {1}{t}-1,\, {\frac {1}{t}-t$
α-diver전위	${\begin{cases}{\frac {4}{1-\alpha ^{2}}}{\big (}1-t^{(1+\alpha )/2}{\big )},&{\text{if}}\ \alpha \neq \pm 1,\\t\ln t,&{\text{if}}\ \alpha =1,\\-\ln t,&{\text{if}}\ \alpha =-1\end{cases}}$
옌센-샤논 다이버전스	${\frac {1}{1}:{2}}[(t+1)\log {\big(}{\frac {2}{t+1}{\big )}+t\log t]}$
α-파괴전위(기타 명칭	${\begin{cases}{\frac {t^{\alpha }-t}{\alpha (\alpha -1)}},&{\text{if}}\ \alpha \neq 0,\,\alpha \neq 1,\\t\ln t,&{\text{if}}\ \alpha =1,\\-\ln t,&{\text{if}}\ \alpha =0\end{cases}}$

$f(t)$ ( t $f(t)$ ) ${\displaystyle f(t$ $c(t-1)$ 함수는 $c(t-1)$ $c(t-1)$ ( t - $c(t-1)$ 1 $c(t-1)$ ) ${\displaystyle c(t-1$ )까지 정의되며 $f(t)$ $c(t-1)$ $c$ 서 c $c$ 은 $c$ (는) 일정하다.

특성.

비부정성: ƒ-diversity는 항상 양의 값이다; 측정값 P와 Q가 일치하면 0이다.이는 옌센의 불평등에서 바로 뒤따른다.
$D_{f}(P\!\parallel \!)!Q)=\int \f{\bigg (}{\frac {dP}{dQ}{\bigg )}dQ\geq f{\bigg (}\proc {dP}{dQ}{dQ}}}dQ{\bigg )=0.$
단조로움: κ이 측정값 P와 Q를 그에 상응하여_κ P와_κ Q로 변환하는 임의의 전이 확률이라면,
$D_{f}(P\!\parallel \!)!Q)\geq D_{f}(P_{\kappa }\\parallel \!Q_{\kappa }).$
{P, Q}에 대한 충분한 통계량에서 전환을 유도하는 경우에만 여기에서 동등성이 유지된다.
조인트 볼록도: 0 ≤ λ 1 1의 경우
$D_{f}{\Big (}\lambda P_{1}+(1-\lambda )P_{2}\parallel \lambda Q_{1}+(1-\lambda )Q_{2}{\Big )}\leq \lambda D_{f}(P_{1}\!\parallel \!Q_{1}+(1-\lambda )D_{f}(P_{2}\!\parallel \!Q_{2}).$
$\mathbb {R} _{+}^{2}$ 는 R $\mathbb {R} _{+}^{2}$ + $\mathbb {R} _{+}^{2}$ ${\$ }}의 $(p,q)\mapsto qf(p/q)$ 매핑 $(p,q)\mapsto qf(p/q)$ $(p,q)\mapsto qf(p/q)$ {\ $displaystyle$ \mathb {R}{+}^{2 $\mathbb {R} _{+}^{2}$

In particular, the monotonicity implies that if a Markov process has a positive equilibrium probability distribution $P^{*}$ then $D_{f}(P(t)\parallel P^{*})$ is a monotonic (non-increasing) function of time, where the probability distribution ${\displays$ $tyle P(t)}$ 은 $P(t)$ Kolmogorov 전진 방정식(또는 마스터 방정식)의 솔루션으로, 마르코프 프로세스에서 확률 분포의 시간 진화를 설명하는 데 사용된다.이것은 모든 $D_{f}(P(t)\parallel P^{*})$ D $D_{f}(P(t)\parallel P^{*})$ ( $D_{f}(P(t)\parallel P^{*})$ ( $D_{f}(P(t)\parallel P^{*})$ ) $D_{f}(P(t)\parallel P^{*})$ ∥ $D_{f}(P(t)\parallel P^{*})$ $D_{f}(P(t)\parallel P^{*})$ ) $D_{f}(P(t)\병렬 P^{*}}$ 이(가) 콜모고로프 전진 방정식의 랴푸노프 함수라는 $D_{f}(P(t)\parallel P^{*})$ 것을 의미한다.Reverse statement is also true: If $H(P)$ is a Lyapunov function for all Markov chains with positive equilibrium $P^{*}$ and is of the trace-form ( $H(P)=\sum _{i}f(P_{i},P_{i}^{*})$ ) then $H(P)=D_{f}(P(t)\parallel P^{*})$ $H(P)=D_{f}(P(t)\parallel P^{*})$ ) $H(P)=D_{f}(P(t)\parallel P^{*})$ P ^[2]^[3]function ) {\ $displaystyle H(P)=D_{f}(P(t)\parallel$ P $H(P)=D_{f}(P(t)\parallel P^{*})$ 일부 볼록 함수 f.예를 들어, Bregman 다이버전트는 일반적으로 그러한 속성을 가지고 있지 않으며 마르코프 프로세스에서 증가할 수 있다.^[4]

재무해석

한 쌍의 확률 분포는 한 분포가 공식 확률을 정의하고 다른 분포가 실제 확률을 포함하는 운명의 게임으로 볼 수 있다.실제 확률을 알면 플레이어가 게임에서 이익을 얻을 수 있다.다수의 합리적인 참여자들에게 기대수익률은 profit-diversity와 같은 일반적인 형태를 가진다.^[5]

참고 항목

참조

^ Rényi, Alfréd (1961). On measures of entropy and information (PDF). The 4th Berkeley Symposium on Mathematics, Statistics and Probability, 1960. Berkeley, CA: University of California Press. pp. 547–561. Eq. (4.20)
^ Gorban, Pavel A. (15 October 2003). "Monotonically equivalent entropies and solution of additivity equation". Physica A. 328 (3–4): 380–390. arXiv:cond-mat/0304131. Bibcode:2003PhyA..328..380G. doi:10.1016/S0378-4371(03)00578-8. S2CID 14975501.
^ Amari, Shun'ichi (2009). Leung, C.S.; Lee, M.; Chan, J.H. (eds.). Divergence, Optimization, Geometry. The 16th International Conference on Neural Information Processing (ICONIP 20009), Bangkok, Thailand, 1--5 December 2009. Lecture Notes in Computer Science, vol 5863. Berlin, Heidelberg: Springer. pp. 185–193. doi:10.1007/978-3-642-10677-4_21.
^ Gorban, Alexander N. (29 April 2014). "General H-theorem and Entropies that Violate the Second Law". Entropy. 16 (5): 2408–2432. arXiv:1212.6767. Bibcode:2014Entrp..16.2408G. doi:10.3390/e16052408.
^ Soklakov, Andrei N. (2020). "Economics of Disagreement—Financial Intuition for the Rényi Divergence". Entropy. 22 (8): 860. arXiv:1811.08308. Bibcode:2020Entrp..22..860S. doi:10.3390/e22080860. PMC 7517462. PMID 33286632.

Csiszár, I. (1963). "Eine informationstheoretische Ungleichung und ihre Anwendung auf den Beweis der Ergodizitat von Markoffschen Ketten". Magyar. Tud. Akad. Mat. Kutato Int. Kozl. 8: 85–108.
Morimoto, T. (1963). "Markov processes and the H-theorem". J. Phys. Soc. Jpn. 18 (3): 328–331. Bibcode:1963JPSJ...18..328M. doi:10.1143/JPSJ.18.328.
Ali, S. M.; Silvey, S. D. (1966). "A general class of coefficients of divergence of one distribution from another". Journal of the Royal Statistical Society, Series B. 28 (1): 131–142. JSTOR 2984279. MR 0196777.
Csiszár, I. (1967). "Information-type measures of difference of probability distributions and indirect observation". Studia Scientiarum Mathematicarum Hungarica. 2: 229–318.
Csiszár, I.; Shields, P. (2004). "Information Theory and Statistics: A Tutorial" (PDF). Foundations and Trends in Communications and Information Theory. 1 (4): 417–528. doi:10.1561/0100000004. Retrieved 2009-04-08.
Liese, F.; Vajda, I. (2006). "On divergences and informations in statistics and information theory". IEEE Transactions on Information Theory. 52 (10): 4394–4412. doi:10.1109/TIT.2006.881731. S2CID 2720215.
Nielsen, F.; Nock, R. (2013). "On the Chi square and higher-order Chi distances for approximating f-divergences". IEEE Signal Processing Letters. 21: 10–13. arXiv:1309.3029. Bibcode:2014ISPL...21...10N. doi:10.1109/LSP.2013.2288355. S2CID 4152365.
Coeurjolly, J-F.; Drouilhet, R. (2006). "Normalized information-based divergences". arXiv:math/0604246.

[1] Rényi, Alfréd (1961). On measures of entropy and information (PDF). The 4th Berkeley Symposium on Mathematics, Statistics and Probability, 1960. Berkeley, CA: University of California Press. pp. 547–561. Eq. (4.20)

[2] Gorban, Pavel A. (15 October 2003). "Monotonically equivalent entropies and solution of additivity equation". Physica A. 328 (3–4): 380–390. arXiv:cond-mat/0304131. Bibcode:2003PhyA..328..380G. doi:10.1016/S0378-4371(03)00578-8. S2CID 14975501.

[3] Amari, Shun'ichi (2009). Leung, C.S.; Lee, M.; Chan, J.H. (eds.). Divergence, Optimization, Geometry. The 16th International Conference on Neural Information Processing (ICONIP 20009), Bangkok, Thailand, 1--5 December 2009. Lecture Notes in Computer Science, vol 5863. Berlin, Heidelberg: Springer. pp. 185–193. doi:10.1007/978-3-642-10677-4_21.

[4] Gorban, Alexander N. (29 April 2014). "General H-theorem and Entropies that Violate the Second Law". Entropy. 16 (5): 2408–2432. arXiv:1212.6767. Bibcode:2014Entrp..16.2408G. doi:10.3390/e16052408.

[5] Soklakov, Andrei N. (2020). "Economics of Disagreement—Financial Intuition for the Rényi Divergence". Entropy. 22 (8): 860. arXiv:1811.08308. Bibcode:2020Entrp..22..860S. doi:10.3390/e22080860. PMC 7517462. PMID 33286632.

[1]

[2]

[3]

[4]

[5]

Search

f-확장

네임스페이스

더

목차

정의

f-divergenes의 예

특성.

재무해석

참고 항목

참조