군집 드론 간 회피 및 교전을 위한 강화학습 기반의 정책 보정과 안전 보장 필터링 기법|RISS 상세보기

국문 초록 (Abstract)

본 논문은 위협 지도를 기반으로 한 군집 드론 간 회피 및 교전 기동의 최적화를 위해, 강화학습 기반의 정책 보정과 제어 장벽 함수를 이용한 안전 보장 필터링 기법을 제안한다. 먼저, 위협도 가중치가 반영된 공간 및 시간 보상함수를 설계하고 제어 장벽 함수를 이용해 개별 드론의 정책을 학습하였다. 이후 학습된 정책의 제어 명령은 안전 필터르 통해 충돌 및 제한 위반을 사전에 방지하도록 조정하였다. 제안하는 기법은 시뮬레이션을 통해 다수의 드론이 위협 밀집 지역을 회피하면서 효율적인 교전 궤적을 생성함을 확인하였다. 본 연구는 강화학습 기반의 최적 정책과 제어 안정성 필터링의 융합을 통해, 군집 드론의 자율 교전에서 안전성과 효율성을 동시에 확보할 수 있음을 보여준다.

번역하기

본 논문은 위협 지도를 기반으로 한 군집 드론 간 회피 및 교전 기동의 최적화를 위해, 강화학습 기반의 정책 보정과 제어 장벽 함수를 이용한 안전 보장 필터링 기법을 제안한다. 먼저, 위협...

다국어 초록 (Multilingual Abstract)

This paper presents a reinforcement-learning-based policy correction and safety-assured filtering method for optimizing evasion and engagement maneuvers among drone swarms using a threat map. A spatio-temporal reward function, weighted by local threat levels, is designed to guide the learning process, and individual drone policies are trained under safety constraints enforced by control barrier functions (CBFs). The control commands produced by the learned PPO policy are subsequently refined through a CBF-based safety filter to prevent collisions, constraint violations, and unsafe maneuvers during cooperative operations. Simulation results demonstrate that the proposed framework enables multiple drones to avoid high-threat regions while generating tactically efficient engagement trajectories under dynamic and uncertain conditions. The study shows that integrating RL-based optimal policy generation with CBF-based safety filtering offers both performance and formal safety guarantees, providing a practical and scalable approach for autonomous swarm-level evasion and engagement.

번역하기

상세검색

RISS 보유자료

상세검색

해외전자자료

군집 드론 간 회피 및 교전을 위한 강화학습 기반의 정책 보정과 안전 보장 필터링 기법 = Policy Correction and Safety-Assured Filtering Based on Reinforcement Learning for Cooperative Evasion and Engagement among Drone Swarms

부가정보

동일학술지(권/호) 다른 논문

분석정보

연관 공개강의(KOCW)

이 자료와 함께 이용한 RISS 자료

나만을 위한 추천자료