http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.
변환된 중국어를 복사하여 사용하시면 됩니다.
음성합성을 위한 C-ToBI기반의 중국어 운율 경계와 F0 contour 생성
김승원,정옥,이근배,김병창,Kim, Seung-Won,Zheng, Yu,Lee, Gary-Geunbae,Kim, Byeong-Chang 대한음성학회 2005 말소리 Vol.53 No.-
Prosody Generation Based on C-ToBI Representation for Text-to-SpeechSeungwon Kim, Yu Zheng, Gary Geunbae Lee, Byeongchang KimProsody modeling is critical in developing text-to-speech (TTS) systems where speech synthesis is used to automatically generate natural speech. In this paper, we present a prosody generation architecture based on Chinese Tone and Break Index (C-ToBI) representation. ToBI is a multi-tier representation system based on linguistic knowledge to transcribe events in an utterance. The TTS system which adopts ToBI as an intermediate representation is known to exhibit higher flexibility, modularity and domain/task portability compared with the direct prosody generation TTS systems. However, the cost of corpus preparation is very expensive for practical-level performance because the ToBI labeled corpus has been manually constructed by many prosody experts and normally requires a large amount of data for accurate statistical prosody modeling. This paper proposes a new method which transcribes the C-ToBI labels automatically in Chinese speech. We model Chinese prosody generation as a classification problem and apply conditional Maximum Entropy (ME) classification to this problem. We empirically verify the usefulness of various natural language and phonology features to make well-integrated features for ME framework.
고품위 중국어 TTS(Text-to-Speech) 시스템 구축을 위한 자연어 처리 연구
성윤숙(Yoon Suk Seong),김병창(Byeong Chang Kim),정옥(Yu Zheng),이근배(Gary Geun Bae Lee) 언어과학회 2004 언어과학연구 Vol.30 No.-
Overall architecture of the Chinese `TTS system consists of natural language processing module, grapheme-to-phoneme conversion and prosody prediction module, and speech signal synthesis module. POSTAG/C is a system which combines word segmentation module based on rules and a dictionary with a POS tagging module based on HMM (Hidden Markov Model). The word dictionary was automatically acquired by POS tagged corpus and the system has high portability to serve both GB texts as well as Big5 texts. The word segmenting module is composed of two parts. Input Chinese texts go through a pre-segmenting process at first. The pre-segmenting process splits input texts into short phrases using some special characters that are symbols, numbers, foreign language characters, and mono-character words. Next, we segment in more detailed using four heuristic rules. The two-phase unknown word prediction process extracts candidate boundaries where unknown words possibly occur in a sentence at first. Next, we predict the category of the words with these boundaries using an SVM-based method. The method of our grapheme-to-phoneme conversion is a dictionary-based and rules-based hybrid approach. In this process, our module converts the decided Pinyin strings into more natural forms with standard Chinese phonetic rules.