RISS 학술연구정보서비스

검색
다국어 입력

http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.

변환된 중국어를 복사하여 사용하시면 됩니다.

예시)
  • 中文 을 입력하시려면 zhongwen을 입력하시고 space를누르시면됩니다.
  • 北京 을 입력하시려면 beijing을 입력하시고 space를 누르시면 됩니다.
닫기
    인기검색어 순위 펼치기

    RISS 인기검색어

      딥러닝을 위한 파이프라인 방식 확률적 경사 하강법 = Pipelined stochastic gradient descent for training deep neural network models

      한글로보기

      https://www.riss.kr/link?id=T15362679

      • 0

        상세조회
      • 0

        다운로드
      서지정보 열기
      • 내보내기
      • 내책장담기
      • 공유하기
      • 오류접수

      부가정보

      다국어 초록 (Multilingual Abstract) kakao i 다국어 번역

      Parallel train algorithms for deep neural networks (DNNs) are needed to train substantial data. Deep learning has been rapidly growing since 2006 after the introduction of deep belief nets, which DNNs are initialized by the restricted Boltzmann machine. Deep learning has performed well in a variety of classification problems. Many deep learning applications typically perform better with more data. It takes a lot of time for DNN to train a large data set. As data become large, faster train method is needed.
      Many parallel learning algorithms are introducing various approximation to speed up. Stochastic gradient descent (SGD) is the most widely used method for training DNNs. Since SGD is an inherently sequential process, the parallelization of SGD is difficult. Delayed gradient problems occur while sequential processes are parallelized.
      To avoid the problem of gradient mismatch due to delayed gradients, we improve Pipelined SGD by storing model parameters of each module regarding to the time delay.
      The proposed method showed the speedup of x2.25 using 4-GPU without significant performance degradation for Cifar10 dataset.
      번역하기

      Parallel train algorithms for deep neural networks (DNNs) are needed to train substantial data. Deep learning has been rapidly growing since 2006 after the introduction of deep belief nets, which DNNs are initialized by the restricted Boltzmann machin...

      Parallel train algorithms for deep neural networks (DNNs) are needed to train substantial data. Deep learning has been rapidly growing since 2006 after the introduction of deep belief nets, which DNNs are initialized by the restricted Boltzmann machine. Deep learning has performed well in a variety of classification problems. Many deep learning applications typically perform better with more data. It takes a lot of time for DNN to train a large data set. As data become large, faster train method is needed.
      Many parallel learning algorithms are introducing various approximation to speed up. Stochastic gradient descent (SGD) is the most widely used method for training DNNs. Since SGD is an inherently sequential process, the parallelization of SGD is difficult. Delayed gradient problems occur while sequential processes are parallelized.
      To avoid the problem of gradient mismatch due to delayed gradients, we improve Pipelined SGD by storing model parameters of each module regarding to the time delay.
      The proposed method showed the speedup of x2.25 using 4-GPU without significant performance degradation for Cifar10 dataset.

      더보기

      목차 (Table of Contents)

      • 1. Introduction 1
      • 2. Background 3
      • 2.1 Backpropagation (BP) 3
      • 2.2 Data Parallelism 5
      • 2.2.1 Synchronous SGD (SSGD) 6
      • 1. Introduction 1
      • 2. Background 3
      • 2.1 Backpropagation (BP) 3
      • 2.2 Data Parallelism 5
      • 2.2.1 Synchronous SGD (SSGD) 6
      • 2.2.2 Elastic Averaging SGD (EASGD) 7
      • 2.2.3 Asynchronous SGD (ASGD) 9
      • 2.3 Model Parallelism 10
      • 2.3.1 Pipelined SGD 11
      • 3. 병렬 알고리즘 복잡도 비교 13
      • 3.1 complexity 비교 13
      • 3.1.1 Backpropagation complexity 13
      • 3.1.2 Parallel algorithms 분석 14
      • 4 Proposed Model 16
      • 4.1 Pipelined SGD 개선 16
      • 5. Experiment 18
      • 5.1 MNIST 실험 조건 18
      • 5.2 MNIST 학습 속도 18
      • 5.3 MNIST 정확도 20
      • 5.4 Cifar10 실험 조건 21
      • 5.5 Cifar10 실험 결과 21
      • 6. 결론 및 향후 과제 24
      • 7. References 25
      • Appendix 29
      • Appendix A. backpropagation의 복잡도 29
      더보기

      분석정보

      View

      상세정보조회

      0

      Usage

      원문다운로드

      0

      대출신청

      0

      복사신청

      0

      EDDS신청

      0

      동일 주제 내 활용도 TOP

      더보기

      주제

      연도별 연구동향

      연도별 활용동향

      연관논문

      연구자 네트워크맵

      공동연구자 (7)

      유사연구자 (20) 활용도상위20명

      이 자료와 함께 이용한 RISS 자료

      나만을 위한 추천자료

      해외이동버튼