다수 화자 음성 변환을 위한 VAE의 잠재 공간 disentanglement와 변환 경로 학습|RISS 상세보기

다국어 입력

あぁかがさざただなはばぱまやゃらわゎんいぃきぎしじちぢにひびぴみりうぅくぐすずつづっぬふぶぷむゆゅるえぇけげせぜてでねへべぺめれおぉこごそぞとどのほぼぽもよょろを

アァカサザタダナハバパマヤャラワヮンイィキギシジチヂニヒビピミリウゥクグスズツヅッヌフブプムユュルエェケゲセゼテデヘベペメレオォコゴソゾトドノホボポモヨョロヲ ―

http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.

변환된 중국어를 복사하여 사용하시면 됩니다.

예시)

中文 을 입력하시려면 zhongwen을 입력하시고 space를누르시면됩니다.
北京 을 입력하시려면 beijing을 입력하시고 space를 누르시면 됩니다.

ㅥ ㅦ ㅧ ㅨ ㅩ ㅪ ㅫ ㅬ ㅭ ㅮ ㅯ ㅰ ㅱ ㅲ ㅳ ㅴ ㅵ ㅶ ㅷ ㅸ ㅹ ㅺ ㅻ ㅼ ㅽ ㅾ ㅿ ㆀ ㆁ ㆂ ㆃ ㆄ ㆅ ㆆ ㆇ ㆈ ㆉ ㆊ ㆋ ㆌ ㆍ ㆎ

Α Β Γ Δ Ε Ζ Η Θ Ι Κ Λ Μ Ν Ξ Ο Π Ρ Σ Τ Υ Φ Χ Ψ Ω α β γ δ ε ζ η θ ι κ λ μ ν ξ ο π ρ σ τ υ φ χ ψ ω

á à Á À é è É È ç Ç ê

Ä Ö Ü ä ö ü ß

ְ ֳ ֲ ֱ ָ ַ ֵ ֶ ִ ֹ ּ ֻ ׂ ׁ ּ פ ם ן ו ט א ר ק ף ך ל ח י ע כ ג ד ש ץ ת צ מ נ ה ב

‘ ’ “ ” 〔〕〈〉「」『』【】＂（）［］｛｝

± × ÷ ≠ ≤ ≥ ∞ ∴ ♂ ♀ ∠ ⊥ ⌒ ∂ ∇ ≡ ≒ ≪ ≫ √ ∽ ∝ ∵ ∫ ∬ ∈ ∋ ⊆ ⊇ ⊂ ⊃ ∪ ∩ ∧ ∨ ￢ ⇒ ⇔ ∀ ∃ ∮ ∑ ∏ ＋－＜＝＞

、。 · ‥ … ¨ 〃 ― ∥ ＼ ∼ ´ ～ ˇ ˘ ˝ ˚ ˙ ¸ ˛ ¡ ¿ ː ！＇，．／：；？＾＿｀｜

½ ⅓ ⅔ ¼ ¾ ⅛ ⅜ ⅝ ⅞ ¹ ² ³ ⁴ ⁿ ₁ ₂ ₃ ₄

Æ Ð Ħ Ĳ Ł Ø Œ Þ Ŧ Ŋ æ đ ð ħ ı ĳ ĸ ŀ ł ø œ ß þ ŧ ŋ ŉ

А Б В Г Д Е Ё Ж З И Й К Л М Н О П Р С Т У Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я а б в г д е ё ж з и й к л м н о п р с т у ф х ц ч ш щ ъ ы ь э ю я

′ ″ ℃ Å ￠￡￥ ¤ ℉ ‰ ＄％Ｆ￦㎕㎖㎗ ℓ ㎘㏄㎣㎤㎥㎦㎙㎚㎛㎜㎝㎞㎟㎠㎡㎢㏊㎍㎎㎏㏏㎈㎉㏈㎧㎨㎰㎱㎲㎳㎴㎵㎶㎷㎸㎹㎀㎁㎂㎃㎄㎺㎻㎽㎾㎿㎐㎑㎒㎓㎔ Ω ㏀㏁㎊㎋㎌㏖㏅㎭㎮㎯㏛㎩㎪㎫㎬㏝㏐㏓㏃㏉㏜㏆

§ ※ ☆ ★ ○ ● ◎ ◇ ◆ □ ■ △ ▽ → ← ↑ ↓ ↔ 〓 ◁ ◀ ▷ ▶ ♤ ♠ ♡ ♥ ♧ ♣ ⊙ ◈ ▣ ◐ ◑ ▒ ▤ ▥ ▨ ▧ ▦ ▩ ♨ ☏ ☎ ☜ ☞ ¶ † ‡ ↕ ↗ ↙ ↖ ↘ ♭ ♩ ♪ ♬ ㉿㈜ № ㏇ ™ ㏂㏘ ℡ ＃＆＊＠ ª º

ⅰ ⅱ ⅲ ⅳ ⅴ ⅵ ⅶ ⅷ ⅸ ⅹ Ⅰ Ⅱ Ⅲ Ⅳ Ⅴ Ⅵ Ⅶ Ⅷ Ⅸ Ⅹ

ا ب ت ث ج ح خ د ذ ر ز س ش ص ض ط ظ ع غ ف ق ک ل م ن ه و ی

최근 검색 목록
전체삭제 닫기

RISS 인기검색어

다수 화자 음성 변환을 위한 VAE의 잠재 공간 disentanglement와 변환 경로 학습

한글로보기

https://www.riss.kr/link?id=T15529396

저자

이건녕
발행사항
서울 : 高麗大學敎, 2020
학위논문사항

학위논문(석사) -- 高麗大學敎大學院 , 컴퓨터學科 , 2020
발행연도
2020
작성언어
한국어
KDC
004 판사항(6)
DDC
004 판사항(23)
발행국(도시)
서울
형태사항
vii, 50장 : 도표 ; 26 cm
일반주기명

지도교수: 陸東錫
VAE는 "Variational Autoencoder"의 약어임
참고문헌 수록
DOI식별코드
10.23186/korea.000000127387.11009.0000950
소장기관
- 고려대학교 도서관
- 국립중앙도서관

0
상세조회
0
다운로드
0
내보내기

서지정보 열기

부가정보

다국어 초록 (Multilingual Abstract)

One of the most critical obstacles in voice conversion is the requirement of parallel training data, which contain the same linguistic content utterances spoken by different speakers. Collecting such parallel data is highly expensive process, therefore many works attempted to use non-parallel training data for voice conversion. One of such successful approaches is using cycle-consistent adversarial networks (CycleGAN), which utilize the cycle consistency loss. The major drawback of CycleGAN based methods, however, is that they can handle only one-to-one voice conversion from a source speaker to a target speaker, which makes it difficult to use for general-purpose cases requiring many-to-many voice conversion among multiple speakers. Another group of approaches using variational autoencoder (VAE) can handle many-to-many voice conversion, but their sound qualities are much lower than that of CycleGAN based methods. In this paper, we propose new methods of the conversion path training and disentangling latent space vector for many-to-many voice conversion and improving the sound qualities.

번역하기

One of the most critical obstacles in voice conversion is the requirement of parallel training data, which contain the same linguistic content utterances spoken by different speakers. Collecting such parallel data is highly expensive process, therefor...

One of the most critical obstacles in voice conversion is the requirement of parallel training data, which contain the same linguistic content utterances spoken by different speakers. Collecting such parallel data is highly expensive process, therefore many works attempted to use non-parallel training data for voice conversion. One of such successful approaches is using cycle-consistent adversarial networks (CycleGAN), which utilize the cycle consistency loss. The major drawback of CycleGAN based methods, however, is that they can handle only one-to-one voice conversion from a source speaker to a target speaker, which makes it difficult to use for general-purpose cases requiring many-to-many voice conversion among multiple speakers. Another group of approaches using variational autoencoder (VAE) can handle many-to-many voice conversion, but their sound qualities are much lower than that of CycleGAN based methods. In this paper, we propose new methods of the conversion path training and disentangling latent space vector for many-to-many voice conversion and improving the sound qualities.

더보기

목차 (Table of Contents)

제 1 장 서론 1
제 2 장 관련 연구 4
2.1. Conditional Variational Autoencoder 4
2.2. Disentangled Latent Space 6
2.2.1. Speaker Classifier 6

제 1 장 서론 1
제 2 장 관련 연구 4
2.1. Conditional Variational Autoencoder 4
2.2. Disentangled Latent Space 6
2.2.1. Speaker Classifier 6
2.3. Training Methods of Conversion Path 8
2.3.1. Auxiliary Classifier 8
2.3.2. Cross Generative Adverarial Network 10
제 3 장 제안하는 모델 13
3.1. Entropy of latent space vector 13
3.2. Cycle Consistency Loss 15
3.3. Multiple Decoders 17
3.4. Merge Models 19
제 4 장 실험 22
4.1. 학습 데이터 22
4.2. VAE 모델 구조 22
4.3. 파라메타 조정 23
4.4. 객관적 평가 25
4.5. 주관적 평가 41
제 5 장 결론 및 향후 과제 44
참고 문헌 45

더보기

분석정보

View

상세정보조회

0

Usage

원문다운로드

0

대출신청

0

복사신청

0

EDDS신청

0

동일 주제 내 활용도 TOP

주제

연도별 연구동향

연도별 활용동향

연관논문

연구자 네트워크맵

공동연구자 (7)

더보기

유사연구자 (20) 활용도상위20명

더보기

이 자료와 함께 이용한 RISS 자료

나만을 위한 추천자료

서지정보
부가정보
분석정보

해외이동버튼