http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.
변환된 중국어를 복사하여 사용하시면 됩니다.
OCR 및 Average Hash 기법을 사용한 이미지 기반 유해사이트 판별 방법
박시현(Si-Hyeon Park),유성민(Seong-Min You),송동호(Dong-Ho Song),이광재(Kwangjae Lee) 대한전기학회 2023 전기학회논문지 P Vol.72P No.2
Recently, websites containing harmful information such as gambling, illegal drugs, pornography, and prostitution are exposed to the public. These harmful sites cause damage to copyright holders and related service industries, and cause various social problems. In this paper, we propose an image-based harmful site identification system using OCR and Average Hash techniques to identify and classify harmful sites. This system uses the characteristic that most gambling banner advertisements repeatedly use similar images, and analyzes the similarity with the average hash value of the banner advertisement image. And using Easy OCR, it determines whether the phrase written in the banner advertisement is harmful or not. To evaluate the performance of the proposed idea, a program was created to determine harmfulness by collecting and analyzing the sites banner advertisement image when the site name was entered, and it was confirmed that the discrimination accuracy was 84%. In addition, since the information collected while running the program is stored in the database, trends in harmful sites can be identified. This will be effectively used to search for harmful sites that are expected to occur.