http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.
변환된 중국어를 복사하여 사용하시면 됩니다.
딥 러닝 기반의 행동 간 유사도 예측을 통한 연속 행동 공간에서의 최적 행동 탐색
정영빈(Yeongbin Jeong),김민구(Mingu Kim),박지수(Jisoo Park),서일홍(Il Hong Suh) 대한전자공학회 2019 대한전자공학회 학술대회 Vol.2019 No.11
Many real-world planning problems require choosing actions in a continuous action space. The continuous action space is so large that it is often discretized using domain knowledge. However, discretization of the action space causes loss of information. There is a kernel based MCTS algorithm that solves this problem through similarity between actions and finds a good action in the continuous action space. However, the algorithm does not take into account the current state and the result of taking action. This problem leads to quite inefficient action search. In this paper, we use deep neural networks to solve these problems and propose a more efficient action search algorithm.