http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.
변환된 중국어를 복사하여 사용하시면 됩니다.
SINICA CORPUS; Design Methodology for Balanced Corpora
( Keh Jiann Chen ),( Chu Ren Huang ),( Li Ping Chang ),( Hui Li Hsu ) 한국언어정보학회 1996 국제 워크샵 Vol.1996 No.-
The Academia Sinica Balanced Corpus (Sinica Corpus) is the first balanced Chinese corpus with part-of-speech tagging. The corpus (Sinica 2.0) is open to the research community through the WWW (http://www.sinica.edu.tw/ftms-bin/ kiwi.sh). Current size of the corpus is 3.5 million words, and the immediate expansion target is five million words. Each text In the corpus is classified and marked according to five criteria: genre, style, mode, topic, and source. The feature values of these classifications are assigned in a hierarchy. Subcorpora can be defined with a specific set of attributes to serve different research purposes. Texts in the corpus are segmented according to the word segmentation standard proposed by the ROC Computational Linguistic Society. Each segmented word is tagged with its part-of-speech. Linguistic patterns and language structures can be extracted from the tagged corpus via a corpus inspection program which has the functions of KWIC searching, filtering, statistics, printing, and collocation.
Modality and Modal Sense Representation in E-HowNet
( You Shan Chung ),( Shu Ling Huang ),( Keh Jiann Chen ) 한국언어정보학회 2007 학술대회 논문집 Vol.2007 No.-
This paper explains how we define and represent modality in E-HowNet. Following Lyons (1977, reviewed in Hsieh 2003, among others), we hold that modals express a speaker``s opinion or attitude toward a proposition and hence have a pragmatic dimension and recognize five kinds of modal categories, i.e. epistemic, deontic, ability, volition and expectation modality. We then present a representational formalism that contains the three most basic components of modal meaning: modal category, positive or negative and strength. Such a formula can define not only modal words but also words that contain modal meanings and cope with co-compositions of modals and the negation construction.