RISS Search - Domestic Journal Articles

Optical Character Recognition for Hindi Language Using a Neural-network Approach

Yadav, Divakar; Sanchez-Cuadrado, Sonia; Morato, Jorge. Korea Information Processing Society, 2013. Journal of Information Processing Systems, Vol. 9, No. 1.
Hindi is the most widely spoken language in India, with more than 300 million speakers. Because characters in Hindi text are not separated from one another as they are in English, Optical Character Recognition (OCR) systems developed for Hindi have very poor recognition rates. In this paper we propose an OCR system for printed Hindi text in Devanagari script that uses an Artificial Neural Network (ANN) to improve recognition efficiency. One of the major reasons for the poor recognition rate is error in character segmentation. Touching characters in scanned documents further complicate segmentation, which makes designing an effective character segmentation technique a major problem. A general OCR system follows four major steps: preprocessing, character segmentation, feature extraction, and finally classification and recognition. The preprocessing tasks considered in the paper are conversion of grayscale images to binary images, image rectification, and segmentation of the document's textual contents into paragraphs, lines, words, and finally basic symbols. The basic symbols, obtained as the fundamental unit from the segmentation process, are recognized by the neural classifier. In this work, three feature extraction techniques are used to improve the recognition rate: histogram of projection based on mean distance, histogram of projection based on pixel value, and vertical zero crossing. These feature extraction techniques are powerful enough to extract features of even distorted characters and symbols. The neural classifier is a back-propagation neural network with two hidden layers. The classifier is trained and tested on printed Hindi texts. A correct recognition rate of approximately 90% is achieved.
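The abstract names its preprocessing and feature steps without defining them, so the following is only a minimal sketch of how binarization, projection-based segmentation, and vertical zero crossing are commonly computed. The function names, the fixed threshold of 128, and the exact feature definitions are assumptions for illustration, not the authors' implementation; the mean-distance histogram and the neural classifier are not sketched because the abstract gives no detail on them.

```python
import numpy as np

def binarize(gray, threshold=128):
    """Map a grayscale image (0-255) to binary: 1 = ink, 0 = background.
    The fixed threshold is an assumption; the paper's method is not stated."""
    return (np.asarray(gray) < threshold).astype(np.uint8)

def projection_histogram(binary, axis):
    """Count ink pixels along `axis` (axis=1: one count per row,
    axis=0: one count per column)."""
    return binary.sum(axis=axis)

def split_runs(profile):
    """Return (start, end) pairs of consecutive non-zero entries in a 1-D
    profile; gaps of zeros are treated as separators between lines or symbols."""
    runs, start = [], None
    for i, value in enumerate(profile):
        if value > 0 and start is None:
            start = i
        elif value == 0 and start is not None:
            runs.append((start, i))
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs

def vertical_zero_crossings(binary):
    """For each column, count background/ink transitions scanning top to bottom."""
    return np.abs(np.diff(binary.astype(np.int8), axis=0)).sum(axis=0)

# Tiny synthetic 5x6 "scan": two symbols separated by one blank column.
gray = np.full((5, 6), 255)
gray[1:4, 1] = 0
gray[1, 2] = 0
gray[1:4, 4] = 0

binary = binarize(gray)
row_profile = projection_histogram(binary, axis=1)
col_profile = projection_histogram(binary, axis=0)
line_spans = split_runs(row_profile)    # rows that contain text
symbol_spans = split_runs(col_profile)  # column ranges of candidate symbols
crossings = vertical_zero_crossings(binary)
```

Splitting on empty columns like this fails for touching characters, which is the segmentation difficulty the abstract highlights; Devanagari's connecting headline would also typically need to be handled before a column split is useful.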
