범위연결문법

레인지 결합 문법(Range Concatenation Grammer, RCG)은 중국어 번호와 독일어 단어 순서 뒤엉키기 등 자연 언어의 여러 현상을 특징짓기 위해 1998년 피에르 불리에가 개발한 문법 형식주의다.^[2]

이론적인 관점에서, 다항식 시간에 구문 분석할 수 있는 모든 언어는 양성 범위 결합 그래머라고 불리는 RCG의 하위 집합에 속하며, 호혜적으로 사용된다.^[4]

그로닝크의 문자 그대로의 움직임 문법 그래머(LMGs)에 변형된 것으로 의도되었지만, RCG는 문법적 과정을 제작이라기보다는 증거로 취급한다.LMG는 시작 술어에서 단자 문자열을 생성하는 반면, RCG는 시작 술어(단자 문자열의 술어)를 빈 문자열로 감소시키는 것을 목표로 하고 있으며, 이는 언어에서 단자 문자열 멤버쉽의 증거를 구성한다.

설명

형식 정의

포지티브 범위 연결 문법(PRCG)은 튜플 $G=(N,~T,~V,~S,~P)$ = $G=(N,~T,~V,~S,~P)$ $G=(N,~T,~V,~S,~P)$ $G=(N,~T,~V,~S,~P)$ $G=(N,~T,~V,~S,~P)$ P $G=(N,~T,~V,~S,~P)$ ) ${\displaystyle G=(N,$ ~ $T,$ ~ $V,$ ~ $S,$ ~ $P)}$ 이며 $G=(N,~T,~V,~S,~P)$ 여기서:

$N$ ${\displaystyle N$ T $T$ 및 $T$ $V$ $V$ 은 $V$ (의존적으로) 술어 이름, 터미널 기호 및 변수 이름의 분리된 유한 집합이다.각 술어 이름에는 $\dim :N\rightarrow \mathbb {N} \setminus \{0\}$ N $\dim :N\rightarrow \mathbb {N} \setminus \{0\}$ → N $\dim :N\rightarrow \mathbb {N} \setminus \{0\}$ { $\dim :N\rightarrow \mathbb {N} \setminus \{0\}$ ${\displaystyle \dim$ : $N\rightarrow \mathb$ {N $} \setminus \{0\}}}}$ 이(가) 부여된 연관성이 있다 $\dim :N\rightarrow \mathbb {N} \setminus \{0\}$
$S\in N$ $S\in N$ $S\in N$ $S\in N$ 은 $S\in N$ (는) 시작 술어 이름이며 $\dim(S)=1$ = $\dim(S)=1$ ${\displaystyle \dim(S)=1$ }을 $($ 를) 확인하십시오 $\dim(S)=1$
$P$ is a finite set of clauses of the form $\psi _{0}\rightarrow \psi _{1}\ldots \psi _{m}$ , where the $\psi _{i}$ are predicates of the form ${\displaystyle A_{i}(\alpha _{1},\ldots ,\alpha$ $A_{i}\in N$ {\ $dim(A_{i}}})$ $A_{i}\in N$ $\alpha _{i}\in (T\cup V)^{\star }$ $\alpha _{i}\in (T\cup V)^{\star }$ $∪$ V $\alpha _{i}\in (T\cup V)^{\star }$ $\alpha _{i}\in (T\cup V)^{\star }$ ${\$ V $)^{\star$ $\alpha _{i}\in (T\cup V)^{\star }$ 이 $A_{i}(\alpha _{1},\ldots ,\alpha _{\dim(A_{i})})$ (가) 있는 $_{\$ dim(A_ ${$ $i$ }}}}}}}}}}.

A Negative Range Concatenation Grammar (NRCG) is defined like a PRCG, but with the addition that some predicates occurring in the right-hand side of a clause can have the form ${\overline {A_{i}(\alpha _{1},\ldots ,\alpha _{\dim(A_{i})})}}$ . Such predicates are called n자아의 술어

범위 연계 문법은 긍정적이거나 부정적이다.PRCG는 기술적으로 NRCG이지만 음성 술어의 부재(PRCG) 또는 존재(NRCG)를 강조하기 위해 이 용어를 사용한다.

A range in a word $w\in T^{\star }$ is a couple $\langle l,r\rangle _{w}$ , with $0\leq l\leq r\leq n$ , where $n$ is the length of $w$ . Two ranges ${\displaystyle$ $\langle l_{1},r_{1}\rangle _{w}}$ and $\langle l_{2},r_{2}\rangle _{w}$ can be concatenated iff $r_{1}=l_{2}$ , and we then have: ${\displaystyle \langle l_{1},r_{1}\$ $랑글 _{w}\cdot \langle l_{2},r_{2},\angle _{w}=\langle l_{1},r_{2}\angle _{w$

For a word $w=w_{1}w_{2}\ldots w_{n}$ , with $w_{i}\in T$ , the dotted notation for ranges is: ${\displaystyle \langle l,r\rangle _{w}=w_{1}\ldots w_{l-1}\bullet w_{l}\ldots w_{$ $r-1}\properties w_$ {r $}\ldots$ w_ ${n$

문자열 인식

LMGs와 마찬가지로 RCG 절에는 일반 $A(x_{1},...,x_{n})\to \alpha$ A $A(x_{1},...,x_{n})\to \alpha$ , $A(x_{1},...,x_{n})\to \alpha$ . $A(x_{1},...,x_{n})\to \alpha$ . . , $A(x_{1},...,x_{n})\to \alpha$ $A(x_{1},...,x_{n})\to \alpha$ ) $A(x_{1},...,x_{n})\to \alpha$ → $A(x_{1},...,x_{n})\to \alpha$ ${\displaystyle A(x_{1$ }, $...,x_{n})\to \alpha}$ 가 있는데 $A(x_{1},...,x_{n})\to \alpha$ 여기서 RCG에서는 $\alpha$ $\alpha$ 이 빈 문자열 또는 술어의 문자열이다 $\alpha$ .인수 $x_{i}$ $x_{i}$ ${\$ 는 LMG와 같이 실제 인수 값과 일치하는 패턴으로 구성된 터미널 기호 및/또는 변수 기호의 문자열로 구성되며 $x_{i}$ , 인접한 변수는 파티션에 대한 일치를 구성하므로 인수 x $y$ ${\displaystyle xy$ 는 두 변수를 사용하여 리터럴 문자열과 일치한다. $x=\epsilon ,\ y=ab;\ x=a,\ y=b;\ x=ab,\ y=\epsilon$ 의 $세$ 가지 다른 $ab$ $x=\epsilon ,\ y=ab;\ x=a,\ y=b;\ x=ab,\ y=\epsilon$ : x = $x=\epsilon ,\ y=ab;\ x=a,\ y=b;\ x=ab,\ y=\epsilon$ $x=\epsilon ,\ y=ab;\ x=a,\ y=b;\ x=ab,\ y=\epsilon$ = a ; x $x=\epsilon ,\ y=ab;\ x=a,\ y=b;\ x=ab,\ y=\epsilon$ = a $x=\epsilon ,\ y=ab;\ x=a,\ y=b;\ x=ab,\ y=\epsilon$ $x=\epsilon ,\ y=ab;\ x=a,\ y=b;\ x=ab,\ y=\epsilon$ = $x=\epsilon ,\ y=ab;\ x=a,\ y=b;\ x=ab,\ y=\epsilon$ y $x=\epsilon ,\ y=ab;\ x=a,\ y=b;\ x=ab,\ y=\epsilon$ = $x=\epsilon ,\ y=ab;\ x=a,\ y=b;\ x=ab,\ y=\epsilon$ $x=\epsilon ,\ y=ab;\ x=a,\ y=b;\ x=ab,\ y=\epsilon$ = $,$ ϵ, ϵ, ϵ, \ $displaystyle$ x $=\epsilon,\y=a,\ y=b;\x=ab,\y$ =y=y=y $=$ y=\ $epsilon$ $x=\epsilon ,\ y=ab;\ x=a,\ y=b;\ x=ab,\ y=\epsilon$

술어 용어는 양수(성공할 때 빈 문자열을 생성함)와 음수(실패할 때 빈 문자열을 생성함/양수 용어가 빈 문자열을 생성하지 않는 경우)의 두 가지 형태로 나타난다.음의 용어는 ${\overline {A(x_{1},...,x_{n})}}$ 1, ${\overline {A(x_{1},...,x_{n})}}$ . ${\overline {A(x_{1},...,x_{n})}}$ . . ${\overline {A(x_{1},...,x_{n})}}$ , ${\overline {A(x_{1},...,x_{n})}}$ )의 ${\overline {A(x_{1},...,x_{n})}}$ {\ $displaystyle {\overline$ {A $(x_{1$ }, $...,$ x_ ${n}}}}$ 에서와 같이 양의 용어와 같은 것으로 표시된다 ${\overline {A(x_{1},...,x_{n})}}$

The rewrite semantics for RCGs is rather simple, identical to the corresponding semantics of LMGs. Given a predicate string $A(\alpha _{1},...,\alpha _{n})$ , where the symbols $\alpha _{i}$ are terminal strings, if there is a rule $A(x_{1},...,x_{n})\to \beta$ $A(x_{1},...,x_{n})\to \beta$ $A(x_{1},...,x_{n})\to \beta$ ${\displaystyle A(x_{1},...,x_{n}})\$ to $\beta }$ 의 술어 문자열과 $A(x_{1},...,x_{n})\to \beta$ 일치하는 각 $x_{i}$ $x_{i}$ ${\$ 의 변수 대신 $\beta$ ${\displaystystyle \beta }$ 로 대체 $x_{i}$

For example, given the rule $A(x,ayb)\to B(axb,y)$ , where $x$ and $y$ are variable symbols and $a$ and $b$ are terminal symbols, the predicate string $A(a,abb)$ can be rewritten as $B(aab,b)$ , because $A(a,abb)$ matches $A(x,ayb)$ when $x=a,\ y=b$ . Similarly, if there were a rule ${\displaystyle A(x,ayb)\to$ $A$ $(x,x)\ A(y$ $,$ $y)},$ A $A(a,abb)$ $A(a,abb)$ ) ${\$ $displaystyle$ $A(a,abb)}$ 은(는 $)$ $(,$ $A(a,a)\ A(b,b)$ $b)로$ 다시 쓸 수 있다 $A(a,abb)$ $A(a,a)\ A(b,b)$

문자열 $\alpha$ $\alpha$ 의 증명/인식은 $S(\alpha )$ $S(\alpha )$ 이(가) 빈 문자열을 생성한다는 $S(\alpha )$ 것을 보여줌으로써 이루어진다 $\alpha$ .개별 재작성 단계의 경우, 복수의 대체 변수 일치가 가능한 경우, 전체 증거를 성공으로 이끌 수 있는 모든 재작성을 고려한다.따라서 초기 문자열 $S(\alpha )$ ) ${\displaystyle$ S $(\alpha )}$ 에서 빈 문자열을 생산하는 방법이 적어도 하나 이상 있다면 $S(\alpha )$ 그 증거는 실패하는 다른 방법이 얼마나 많은지에 관계없이 성공으로 간주된다.

예

RCG는 다음과 같이 $\{www:w\in \{a,b\}^{*}\}$ 비선형 인덱스 언어 $\{www:w\in \{a,b\}^{*}\}$ { $\{www:w\in \{a,b\}^{*}\}$ $\{www:w\in \{a,b\}^{*}\}$ : $\{www:w\in \{a,b\}^{*}\}$ w w : $\{www:w\in \{a,b\}^{*}\}$ w $\{www:w\in \{a,b\}^{*}\}$ { a , $\{www:w\in \{a,b\}^{*}\}$ $\{www:w\in \{a,b\}^{*}\}$ } ${\displaystyle \{www:w\\in$ \{ $a,b\}^{*}\}}}}}}}}$ 을(를) 인식할 수 있다.

x, y 및 z를 변수 기호로 지정:

$(\displaystyle S(xyz)\to A(x,y,z)}$

$(\displaystyle A(ax,ay,az)\to A(x,y,z)}$

$A(bx, by,bz)\to A(x,y,z)$

$\\displaystyle A(\epsilon ,\epsilon ,\epsilon )\to \epsilon }$

압밥에 대한 증거는 그때 있다.

$(\displaystyle S(abbabbabb)\오른쪽 화살표 A(abb,abb,abb)\오른쪽 화살표 A(bb,bb,bb)\Rightarrow A(b,b,b)\Rightarrow A(\epsilon ,\epsilon ,\epsilon )\Rightarrow \epsilon }$

또는 범위에 대해 보다 정확한 점 표기법 사용:

$S(\bullet {}abbabbabb\bullet {})\Rightarrow A(\bullet {}abb\bullet {}abbabb,abb\bullet {}abb\bullet {}abb,abbabb\bullet {}abb\bullet {})\Rightarrow A(a\bullet {}bb\bullet {}abbabb,abba\bullet {}bb\bullet {}abb,abbabba\bullet {}bb\bullet {})$ $\Rightarrow \bb\bullet {}abbabb,abbabbab\bullet {}bb\bullet {}B\balllet {}\Rightarrow A(\epsilon ,\epsilon )\Rightarrow \epsilon$

참조

^ Boullier, Pierre (Jan 1998). Proposal for a Natural Language Processing Syntactic Backbone (PDF) (Technical report). Vol. 3342. INRIA Rocquencourt (France).
^ Pierre Boullier (1999). "Chinese Numbers, MIX, Scrambling, and Range Concatenation Grammars" (PDF). Proc. EACL. pp. 53–60. Archived from the original (PDF) on 2003-05-15.
^ Eberhard Bertsch and Mark-Jan Nederhof (Oct 2001). "On the complexity of some extensions of RCG parsing" (PDF). Proceedings of the Seventh International Workshop on Parsing Technologies (Beijing). pp. 66–77.
^ Laura Kallmeyer (2010). Parsing Beyond Context-Free Grammars. Springer Science & Business Media. p. 37. ISBN 978-3-642-14846-0. 베르트슈를 인용하여 네데르호프(2001)^[3]

[boullier1998-1] Boullier, Pierre (Jan 1998). Proposal for a Natural Language Processing Syntactic Backbone (PDF) (Technical report). Vol. 3342. INRIA Rocquencourt (France).

[boullier1999-2] Pierre Boullier (1999). "Chinese Numbers, MIX, Scrambling, and Range Concatenation Grammars" (PDF). Proc. EACL. pp. 53–60. Archived from the original (PDF) on 2003-05-15.

[3] Eberhard Bertsch and Mark-Jan Nederhof (Oct 2001). "On the complexity of some extensions of RCG parsing" (PDF). Proceedings of the Seventh International Workshop on Parsing Technologies (Beijing). pp. 66–77.

[Kallmeyer2010-4] Laura Kallmeyer (2010). Parsing Beyond Context-Free Grammars. Springer Science & Business Media. p. 37. ISBN 978-3-642-14846-0. 베르트슈를 인용하여 네데르호프(2001)^[3]

[2]

[4]

[3]

Search

범위연결문법

네임스페이스

더

목차

설명

형식 정의

문자열 인식

예

참조