Spatio-Temporal Transformer Network for Abnormal Human Action Detection in Surveillance Video

https://www.riss.kr/link?id=A110060607

Authors: Saravit Soeng (Chungbuk National University); 김성익 (U1 University); 조완섭 (Chungbuk National University)
Publisher: Korea Big Data Society (사단법인 한국빅데이터학회)
Journal: The Korea Journal of BigData (한국빅데이터학회 학회지)
Volume/Issue: Vol.10 No.2 [2025]
Publication year: 2025
Language: English
Keywords: Abnormal Action Detection; Transformer; Spatio-Temporal; CLIP; Deep Learning; Surveillance Video
Indexing: KCI-listed
Document type: Academic journal article
Pages: 331-343 (13 pages)
DOI: http://dx.doi.org/10.36498/kbigdt.2025.10.2.331
Provider: KCI, KISS

Multilingual Abstract

Human action analysis is crucial for identifying abnormal behaviors linked to security threats, unusual events, and potentially suspicious activities in surveillance and public settings. However, video-based abnormal action detection still presents significant challenges, particularly in complex, real-world scenarios. This study proposes a deep learning approach for abnormal human action detection that integrates robust feature extraction using a pre-trained CLIP Image Encoder with a Transformer-based sequential model. The proposed method effectively captures both spatial (visual) and temporal action characteristics across video sequences. Rich visual features, representing the scene and subject’s appearance, are extracted directly from video frames using the CLIP image encoder and fed into an encoder-only Transformer model to classify action sequences as abnormal or normal. The model was evaluated on the Surveillance Perspective Human Action Recognition (SPHAR) dataset, achieving high classification accuracy and real-time performance. Experimental results demonstrate the effectiveness and robustness of the proposed method in detecting abnormal human actions from a surveillance perspective.
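
The data flow described in the abstract (per-frame visual features, then a Transformer-style encoder over the frame sequence, then an abnormal/normal decision) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: random vectors stand in for real CLIP image embeddings, the weights are untrained, and the single attention head, residual connection, mean pooling, and sigmoid output are simplifications chosen here for brevity. All function and parameter names are hypothetical.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def matvec(weights, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in weights]


def positional_encoding(t, dim):
    """Sinusoidal encoding so the model can tell frame order apart."""
    out = []
    for i in range(dim):
        angle = t / (10000 ** (2 * (i // 2) / dim))
        out.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
    return out


def self_attention(frames, wq, wk, wv):
    """Single-head scaled dot-product attention across the frame sequence."""
    queries = [matvec(wq, f) for f in frames]
    keys = [matvec(wk, f) for f in frames]
    values = [matvec(wv, f) for f in frames]
    scale = math.sqrt(len(keys[0]))
    attended = []
    for q in queries:
        weights = softmax([sum(a * b for a, b in zip(q, k)) / scale for k in keys])
        attended.append([sum(w * v[c] for w, v in zip(weights, values))
                         for c in range(len(values[0]))])
    return attended


def abnormal_probability(frame_features, params):
    """Map a sequence of per-frame feature vectors to P(abnormal)."""
    dim = len(frame_features[0])
    # Temporal information: add a position code to each frame's spatial features.
    x = [[f + p for f, p in zip(frame, positional_encoding(t, dim))]
         for t, frame in enumerate(frame_features)]
    attended = self_attention(x, params["wq"], params["wk"], params["wv"])
    # Residual connection, then mean pooling over time (an assumed pooling choice).
    hidden = [[a + b for a, b in zip(xi, ai)] for xi, ai in zip(x, attended)]
    pooled = [sum(column) / len(hidden) for column in zip(*hidden)]
    logit = sum(w * p for w, p in zip(params["w_out"], pooled)) + params["b_out"]
    return 1.0 / (1.0 + math.exp(-logit))


def random_params(dim, rng):
    """Untrained placeholder weights; a real model would learn these."""
    bound = 1.0 / math.sqrt(dim)

    def matrix():
        return [[rng.uniform(-bound, bound) for _ in range(dim)] for _ in range(dim)]

    return {"wq": matrix(), "wk": matrix(), "wv": matrix(),
            "w_out": [rng.uniform(-bound, bound) for _ in range(dim)], "b_out": 0.0}


rng = random.Random(0)
DIM, NUM_FRAMES = 8, 16  # toy sizes; real CLIP embeddings are much wider
clip_like_features = [[rng.gauss(0.0, 1.0) for _ in range(DIM)]
                      for _ in range(NUM_FRAMES)]
params = random_params(DIM, rng)
print(abnormal_probability(clip_like_features, params))
```

In practice the placeholder features would be replaced by embeddings from a pre-trained CLIP image encoder, and the attention block by a full multi-layer encoder with feed-forward sublayers and normalization, trained on labelled clips.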

