이상 음향 탐지를 위한 경량 Diffusion Transformer 모델의 설계 및 구현 = Design and Implementation of a Lightweight Diffusion Transformer Model for Anomalous Sound Detection|RISS 상세보기

국문 초록 (Abstract)

이상 음향 탐지(Anomalous Sound Detection)는 산업 설비의 작동음을 분석하여 비정상 상태를 조기에 식별하는 기술로, 예지 정비 및 스마트 팩토리 구현의 핵심 요소로 주목받고 있다. 특히 음향 기반 모니터링은 비접촉 방식으로 실시간 상태 감시가 가능하다는 장점을 지니지만, 실제 산업 환경에서는 이상 데이터 부족, 다양한 기계 유형 및 환경 조건에 대한 일반화 성능 저하, 그리고 엣지 디바이스의 제한된 연산 자원과 같은 요인으로 인해 실용적 적용이 제한된다. 이에 따라 정상 데이터만으로 학습 가능하고 다양한 기계에 강건하며, 자원 제약적인 환경에서도 실시간 추론이 가능한 경량 이상 음향 탐지 모델에 관한 연구가 필요하다.
본 연구에서는 이상 음향 탐지를 위한 경량 Diffusion Transformer 모델인 DiAT를 설계하고, 엣지 디바이스에서의 실시간 추론 성능을 평가하였다. 제안하는 모델은 Diffusion Transformer 구조를 기반으로 하며, Agent-Attention 메커니즘을 통합하여 시계열 음향 특징의 장거리 의존성을 효율적으로 포착하면서도 계산 복잡도를 선형 수준으로 유지하도록 구성하였다. 음향 신호의 시간-주파수 정보를 효과적으로 표현하기 위해 FDWT(Framed Discrete Wavelet Transform) 전처리 기법을 제안하였다. 이상 데이터 부족 문제를 해결하기 위해 정상 데이터만을 사용한 비지도 학습 전략을 적용하였다. 이와 같은 전략을 통해 산업 기계 음향 데이터 세트에 대해 평가한 결과, 평균 AUC 0.92를 달성하였다. 또한 경량 Diffusion Transformer 모델을 통해 Raspberry Pi 4 및 Raspberry Pi Zero 2 W와 같은 엣지 디바이스에서 실시간 이상 탐지 가능함을 입증하였다.

번역하기

이상 음향 탐지(Anomalous Sound Detection)는 산업 설비의 작동음을 분석하여 비정상 상태를 조기에 식별하는 기술로, 예지 정비 및 스마트 팩토리 구현의 핵심 요소로 주목받고 있다. 특히 음향 기...

다국어 초록 (Multilingual Abstract)

Anomalous Sound Detection (ASD) is a technology that identifies abnormal conditions early by analyzing the operational sounds of industrial equipment, and it has gained attention as a core component for predictive maintenance and smart factory implementation. In particular, audio-based monitoring offers the advantage of real-time condition monitoring through non-contact methods. However, practical applications in actual industrial environments are limited by factors such as scarcity of anomalous data, degraded generalization performance across diverse machine types and environmental conditions, and constrained computational resources on edge devices. Accordingly, research on lightweight anomalous sound detection models that can be trained with only normal data, are robust to various machines, and enable real-time inference in resource-constrained environments is necessary.
In this study, we designed DiAT, a lightweight Diffusion Transformer model for anomalous sound detection, and evaluated its real-time inference performance on edge devices. The proposed model is based on a Diffusion Transformer architecture and incorporates an Agent-Attention mechanism to efficiently capture long-range dependencies in time-series acoustic features while maintaining computational complexity at a linear level. To effectively represent the time-frequency information of acoustic signals, we proposed a Framed Discrete Wavelet Transform (FDWT) preprocessing technique. To address the scarcity of anomalous data, we applied an unsupervised learning strategy using only normal data. Through this strategy, evaluation on industrial machine acoustic datasets achieved an average AUC of 0.92.
Furthermore, the lightweight Diffusion Transformer model confirmed the feasibility of real-time anomaly detection on Raspberry Pi 4 and Raspberry Pi Zero 2 W.

번역하기

목차 (Table of Contents)

1. 서 론 1
2. 연구 배경 및 관련 연구 4
2.1 연구 배경 4
2.2 관련 연구 7
2.3 문제 정의 및 접근 방법 12

1. 서 론 1
2. 연구 배경 및 관련 연구 4
2.1 연구 배경 4
2.2 관련 연구 7
2.3 문제 정의 및 접근 방법 12
3. 이상 음향 탐지를 위한 경량 Diffusion Transformer 모델 14
3.1 시스템 설계 개요 14
3.2 제안하는 딥러닝 모델 설계 15
3.3 온디바이스 시스템 및 모델 최적화 26
4. 실험 및 성능 평가 28
4.1 데이터 세트 28
4.2 실험 환경 및 성능 평가 지표 31
4.3 기존 연구와 비교 32
4.4 전처리 기법에 따른 성능 평가 35
4.5 에이전트 어텐션 적용에 따른 성능 평가 40
4.6 온디바이스 성능 평가 43
5. 결론 및 향후 연구 과제 45

상세검색

RISS 보유자료

상세검색

해외전자자료

이상 음향 탐지를 위한 경량 Diffusion Transformer 모델의 설계 및 구현 = Design and Implementation of a Lightweight Diffusion Transformer Model for Anomalous Sound Detection

부가정보

분석정보

연관 공개강의(KOCW)

이 자료와 함께 이용한 RISS 자료

나만을 위한 추천자료