RISS 검색 - 국내학술지논문 상세보기

국문 초록 (Abstract)

입력문장이 길어질수록 구문분석의 정확률은 크게 낮아진다. 따라서 긴 문장의 구문분석 정확률을 높이기 위해 장문분할 방법들이 많이 연구되었다. 중국어는 고립어로서 자연언어처리에 도움을 줄 수 있는 굴절이나 어미정보가 없는 대신 쉼표를 비교적 많이, 또 정확히 사용하고 있어서 이러한 쉼표사용이 장문분할에 도움을 줄 수 있다. 본 논문에서는 중국어 문장에서 쉼표 주변의 문맥을 파악하여 해당 쉼표위치에 문장분할이 가능한지 Support Vector Machine을 이용해 판단하고자 한다. 쉼표의 분류의 정확률이 87.1%에 이르고, 이 분할모델을 적용한 후 구문분석한 결과, 의존트리의 정확률이 5.6% 증가했다.

번역하기

입력문장이 길어질수록 구문분석의 정확률은 크게 낮아진다. 따라서 긴 문장의 구문분석 정확률을 높이기 위해 장문분할 방법들이 많이 연구되었다. 중국어는 고립어로서 자연언어처리에 ...

다국어 초록 (Multilingual Abstract)

The longer the input sentences, the worse the parsing results. To improve the parsing performance, many methods about long sentence segmentation have been reserarched. As an isolating language, Chinese sentence has fewer cues for sentence segmentation. However, the average frequency of comma usage in Chinese is higher than that of other languages. The syntactic information that the comma conveys can play an important role in long sentence segmentation of Chinese languages. This paper proposes a method for classifying commas in Chinese sentences according to the context where the comma occurs. Then, sentences are segmented using the classification result. The experimental results show that the accuracy of the comma classification reaches 87.1%, and with our segmentation model, the dependency parsing accuracy of our parser is improved by 5.6%.

번역하기

목차 (Table of Contents)

요약
Abstract
1. 서론
2. 기존연구
3. 구문분석을 위한 쉼표의 분류

요약
Abstract
1. 서론
2. 기존연구
3. 구문분석을 위한 쉼표의 분류
4. 쉼표의 분류를 위한 자질 추출
5. 실험
6. 기존연구와의 비교 및 결론
참고문헌
저자소개

참고문헌 (Reference)

1 Geoffrey Nunberg, "the linguistics of punctuation" 18 : 1990.

2 Shui-fang Lin, "study and application of punctuation" People’s Publisher 2000.

3 B. Say, "current approaches to punctuation in computational linguistics" 30 (30): 457-469, 1997

4 V.J. Leffa, "clause processing in complex sentences" 937-943, 1998.

5 B. Jones, "What’s the point? A(computational) theory of punctuation" 1996.

6 D.M.Bikel, "Two statistical parsing models applied to the Chinese Treebank" 1-6, 2000.

7 B. Jones, "Towards testing the syntax of punctuation" 363-365, 1996.

8 V N. Vapnik, "The nature of statistical learning theory" Springer-Verlag 1995

9 N. Xue, "The bracketing Guidelines for the Penn Chinese Treebank University of Pennsylvania" 2000.

10 H. Yamada, "Statistical Dependency Analysis with Support Vector Machines" 195-206, 2003