Improving seq2seq by revising attention mechanism for speech recognition

https://www.riss.kr/link?id=T14924509

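Author

임단
Publication
Seoul : Graduate School, Korea University, 2018
Thesis information

Thesis (Master's) -- Korea University Graduate School, Department of Computer Science (College of Informatics), August 2018
Year of publication
2018
Language
English
Keywords

speech recognition ; attention
Country (city) of publication
Seoul
Physical description
iii, 34 leaves : charts ; 26 cm
General notes

Advisor: 육동석
Appendix: A. alignment examples
References: leaves 32-34
UCI identifier
I804:11009-000000081643
DOI
10.23186/korea.000000081643.11009.0000815
Holding institutions
- Korea University Science Library
- Korea University Library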


Multilingual Abstract

Sequence-to-sequence (seq2seq) models are designed to learn a mapping from an arbitrarily sized input sequence to an output sequence. Although these models are versatile enough to have been applied successfully to a variety of domains, their adaptation to speech recognition should be reconsidered, because the implicit alignment between a speech signal and its output sequence differs from that in other domains. Moreover, a speech signal is usually much longer than its corresponding text label sequence.
In this thesis, I modified the attention mechanisms of sequence-to-sequence models so that they perform better for speech recognition. The revised model uses a double attention mechanism instead of the conventional single attention mechanism so that it can attend to the relevant part of the input sequence more easily. Moreover, I generalized the existing hybrid score function and achieved the best results with a multiplicative score function.
Experimental results on the TIMIT dataset showed that the proposed modifications achieve fast convergence and improved recognition performance.
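
For readers unfamiliar with the attention step the abstract refers to, the following is a minimal sketch of a generic single attention mechanism with a content-based score. It is not the thesis's double attention or its hybrid score function, whose details this record does not give. The names `attend`, `dot_score`, and `softmax`, the dot-product score, and the toy vectors are all assumptions chosen for illustration.

```python
import math


def softmax(scores):
    """Normalize raw scores into attention weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot_score(decoder_state, encoder_state):
    """Illustrative content-based score (assumption): a plain dot product."""
    return sum(d * h for d, h in zip(decoder_state, encoder_state))


def attend(decoder_state, encoder_states, score_fn=dot_score):
    """Return (weights, context) for a single decoder step."""
    # One score per encoder frame, comparing it with the decoder state.
    scores = [score_fn(decoder_state, h) for h in encoder_states]
    weights = softmax(scores)
    # The context vector is the attention-weighted sum of encoder frames.
    dim = len(encoder_states[0])
    context = [
        sum(w * h[i] for w, h in zip(weights, encoder_states))
        for i in range(dim)
    ]
    return weights, context


# Toy input (hypothetical): four encoder frames of dimension 2.
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
decoder_state = [2.0, 2.0]
weights, context = attend(decoder_state, encoder_states)
```

Because `score_fn` is a parameter, a different score (for example one that also takes the previous step's alignment into account) could be swapped in without changing the rest of the step. In speech recognition the list of encoder frames is typically far longer than the output label sequence, which is the mismatch the abstract points to.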


Table of Contents

1 Introduction 1
2 Background 4
2.1 Automatic speech recognition 4
2.1.1 Acoustic feature vector 5
2.1.2 Evaluation metric 7
2.2 Sequence-to-sequence model 7
2.2.1 Learning 9
2.2.2 Decoding 10
2.3 Attention mechanism 11
2.3.1 Content-based score function 12
2.3.2 Hybrid score function 13
3 Proposed modification 15
3.1 Double attention mechanism 15
3.2 Multiplicative hybrid score function 18
4 Experiment 20
4.1 Data description 20
4.2 Training details 20
4.3 Results 22
5 Conclusion 25
Appendices 27
A Alignment examples 28
Bibliography 32
