RISS 학술연구정보서비스

검색
다국어 입력

http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.

변환된 중국어를 복사하여 사용하시면 됩니다.

예시)
  • 中文 을 입력하시려면 zhongwen을 입력하시고 space를누르시면됩니다.
  • 北京 을 입력하시려면 beijing을 입력하시고 space를 누르시면 됩니다.
닫기
    인기검색어 순위 펼치기

    RISS 인기검색어

      Improving seq2seq by revising attention mechanism for speech recognition

      한글로보기

      https://www.riss.kr/link?id=T14924509

      • 0

        상세조회
      • 0

        다운로드
      서지정보 열기
      • 내보내기
      • 내책장담기
      • 공유하기
      • 오류접수

      부가정보

      다국어 초록 (Multilingual Abstract) kakao i 다국어 번역

      Sequence-to-sequence models (seq2seq) have been designed to learn a mapping from arbitrary sized input sequence to an output sequence. Although the models are so versatile that it have been applied to variety of domain successfully, its adapta- tion for speech recognition should be reconsidered in that im- plicit alignment between speech signal and its output sequence is different from other domains. Moreover, speech signal is usu- ally much longer than its corresponding text label sequence.
      In this thesis, I modified attention mechanisms of sequence- to-sequence models so that it can perform better for speech recognition. The revised model used double attention mecha- nism instead of conventional single attention mechanism so that it can attend relevant part of input sequence more easily. More- over, I generalized existing hybrid score function and achieved best results with multiplicative score function.
      Experimental results on TIMIT dataset showed that pro- posed modifications achieve fast convergence and improved recog- nition performance.
      번역하기

      Sequence-to-sequence models (seq2seq) have been designed to learn a mapping from arbitrary sized input sequence to an output sequence. Although the models are so versatile that it have been applied to variety of domain successfully, its adapta- tion f...

      Sequence-to-sequence models (seq2seq) have been designed to learn a mapping from arbitrary sized input sequence to an output sequence. Although the models are so versatile that it have been applied to variety of domain successfully, its adapta- tion for speech recognition should be reconsidered in that im- plicit alignment between speech signal and its output sequence is different from other domains. Moreover, speech signal is usu- ally much longer than its corresponding text label sequence.
      In this thesis, I modified attention mechanisms of sequence- to-sequence models so that it can perform better for speech recognition. The revised model used double attention mecha- nism instead of conventional single attention mechanism so that it can attend relevant part of input sequence more easily. More- over, I generalized existing hybrid score function and achieved best results with multiplicative score function.
      Experimental results on TIMIT dataset showed that pro- posed modifications achieve fast convergence and improved recog- nition performance.

      더보기

      목차 (Table of Contents)

      • 1 Introduction 1
      • 2 Background 4
      • 2.1 Automatic speech recognition 4
      • 2.1.1 Acoustic feature vector 5
      • 2.1.2 Evaluation metric 7
      • 1 Introduction 1
      • 2 Background 4
      • 2.1 Automatic speech recognition 4
      • 2.1.1 Acoustic feature vector 5
      • 2.1.2 Evaluation metric 7
      • 2.2 Sequence-to-sequence model 7
      • 2.2.1 Learning 9
      • 2.2.2 Decoding 10
      • 2.3 Attention mechanism 11
      • 2.3.1 Content-based score function 12
      • 2.3.2 Hybrid score function 13
      • 3 Proposed modification 15
      • 3.1 Double attention mechanism 15
      • 3.2 Multiplicative hybrid score function 18
      • 4 Experiment 20
      • 4.1 Data description 20
      • 4.2 Training details 20
      • 4.3 Results 22
      • 5 Conclusion 25
      • Appendices 27
      • A Alignment examples 28
      • Bibliography 32
      더보기

      분석정보

      View

      상세정보조회

      0

      Usage

      원문다운로드

      0

      대출신청

      0

      복사신청

      0

      EDDS신청

      0

      동일 주제 내 활용도 TOP

      더보기

      주제

      연도별 연구동향

      연도별 활용동향

      연관논문

      연구자 네트워크맵

      공동연구자 (7)

      유사연구자 (20) 활용도상위20명

      이 자료와 함께 이용한 RISS 자료

      나만을 위한 추천자료

      해외이동버튼