RISS 검색 - 국내학술지논문

1
Ambiguity Resolution in Chinese Word Segmentation

( Sun Maosong ) 한국언어정보학회 1995 국제 워크샵 Vol.1995 No.-
- 원문보기
A new method for Chinese word segmentation named Conditional F&BMM (Forward and Backward Maximal Matching) \vhich incorporates both bigram statistics (ie., mutual infonnation and difference of t-test between Chinese characters) and linguistic rules for ambiguity resolution is proposed in this paper The key characteristics of this model are the use of: (i) statistics which can be automatically derived from any raw corpus, (ii) a rule base for disambiguation with consistency and controlled size to be built up in a systematic way.
2
A Well-formed Chinese Lexicon for Word Segmentation and Part-of-speech Tagging

Sun Maosong,Kang Shiyong 한국어정보학회 2002 한국어정보학 Vol.7ㆍ8 No.-
- 원문보기
3
대륙, 홍콩, 대만 정보기술용어의 초보적 비교와 분석

Sun Maosong,Benjamin K Tsou 한국어정보학회 2002 한국어정보학 Vol.5ㆍ6 No.-
- 원문보기
4
Identification of Chinese Personal Names in Unrestricted Texts

( Lawrence Cheung ),( Benjamin K Tsou ),( Maosong Sun ) 한국언어정보학회 2002 국제 워크샵 Vol.2002 No.-
- 원문보기
Automatic identification of Chinese personal names in unrestricted texts is a key task in Chinese word segmentation, and can affect other NLP tasks such as word segmentation and information retrieval, if it is not properly addressed. This paper (1) demonstrates the problems of Chinese personal name identification in some IT applications, (2) analyzes the structure of Chinese personal names, and (3) further presents the relevant processing strategies. The geographical differences of Chinese personal names between Beijing and Hong Kong are highlighted at the end. It shows that variation in names across different Chinese communities constitutes a critical factor in designing Chinese personal name identification algorithm.

상세검색

RISS 보유자료

상세검색

해외전자자료

연관 검색어 추천