RISS 검색 - 국내학술지논문

무료
기관 내 무료
유료

내보내기
내책장담기
한글로보기

정확도순

내림차순

내림차순

10개씩 출력

1
인쇄 문서 영상의 단어 단위 속성 인식

곽희규(Heekue Kwag),김수형(Soohyung Kim) 한국정보과학회 2001 정보과학회논문지 : 소프트웨어 및 응용 Vol.28 No.5
- 원문보기 2
  ScienceON
  
  DBpia
본 논문은 문서 영상에 존재하는 개별 단어들에 대한 속성정보 추출 방법을 제안한다. 단어 단위의 속성 인식은 단어 영상 매칭의 정확도 및 속도 개선, OCR 시스템에서 인식률 향상, 문서의 재생산 등 다양한 응용 가치를 찾을 수 있으며, 메타정보(meta-information) 추출을 통해 영상 검색(image retrieval)이나 요약(summary) 생성 등에 활용할 수 있다. 제안하는 시스템에서 고려하는 단어 영상의 속성은 언어의 종류(한글, 영문), 스타일(볼드, 이탤릭, 보통, 밑줄), 문자 크기(10, 12, 14포인트), 문자 개수(한글: 2, 3, 4, 5, 영문: 4, 5, 6, 7, 8, 9, 10), 서체(명조, 고딕)의 다섯 가지 정보이다. 속성 인식을 위한 특징은, 언어 종류 인식에 2개, 스타일 인식에 3개, 문자 크기와 개수는 각각 1개, 한글 서체 인식은 1개, 영문 서체 인식은 2개를 사용한다. 분류기는 신경망, 2차형 판별함수(QDF), 선형 판별함수(LDF)를 계층적으로 구성한다. 다섯 가지 속성이 조합된 26,400개의 단어 영상을 사용한 실험을 통해, 제안된 방법이 소수의 특징만으로도 우수한 속성 인식 성능을 보임을 입증하였다. In this paper, we propose a method to extract attribute information of individual words in machine-printed document images. The recognition of the word-level attributes helps in improving the accuracy and speed of the word image matching, the OCR performance, and the reproduction of the document. It is also useful in retrieving document image and creating summary, by an extraction of some meta-information from printed documents. The attributes of word images considered in the proposed system are language(Korean, English), style(bold, italic, regular, underline), size(10, 12, 14 points), the number of characters(Korean: 2, 3, 4, 5, English: 4, 5, 6, 7, 8, 9, 10), and typeface(Myung-jo, Gothic). Two features are used for the language classification, three for the style, one for the size, one for the number of characters, one for the Korean typeface, and two for the English typeface classification, respectively. The classifiers are constructed hierarchically using the neural network, quadratic discriminant function, and linear discriminant function. An experiment using 26,400 word images combining the five attributes demonstrates how the proposed method shows high performance of attribute recognition with only a small number of features.
2
태양광 분산전원 발전설비 O&M을 위한 드론활용 실시간 불량 패널 검출 기술

김량수(Ryangsoo Kim),류지형(Jihyoung Ryu),왕기철(Gicheol Wang),김재인(Jaein Kim),곽희규(Heekue Kwag),허철균(Cheolgyun Heo),박형준(Hyoungjun Park),김성창(Sungchang Kim) 대한전기학회 2022 대한전기학회 학술대회 논문집 Vol.2022 No.7
- 원문보기

내보내기
내책장담기
한글로보기

정확도순

내림차순

내림차순

10개씩 출력

맨처음 페이지로 1 맨끝 페이지로

상세검색

RISS 보유자료

상세검색

해외전자자료

연관 검색어 추천