Extracting Keywords from large numbers of Documents
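Putu Y. Kusmawan, 권준호 (Kwon Joon-ho). Korean Institute of Information Scientists and Engineers (한국정보과학회), 2014. Database Research (데이타베이스 연구), Vol. 30, No. 2.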
With the growth of Internet activity, the use of digital text documents has become widespread. An enormous number of documents in various forms are now freely available over the Internet. The main challenge in this situation is to build a system that can quickly analyse a large set of documents and express them in a better form of representation. In this paper, we propose a new technique to extract keywords from a large set of documents by word correlation analysis. This analysis considers the frequencies of words, the correlation between adjacent words, and the transitive correlation between words in a document. We then represent the results as a graph to provide a better visualization of the document. Moreover, we make our technique scalable by implementing it with Map/Reduce algorithms. Experimental results show that our method can effectively extract keywords from large sets of documents.
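The abstract does not give the paper's formulas, so the following is only an illustrative sketch of the general idea, not the authors' method. It counts word frequencies and adjacent-word co-occurrences in a map/reduce style, then builds a weighted word graph. The function names (`map_document`, `reduce_counts`, `build_graph`, `top_keywords`), the edge weight (pair count divided by the smaller of the two word frequencies), and the scoring rule are all assumptions made for illustration. Transitive correlation and stop-word filtering are not modeled here.

```python
import re
from collections import Counter
from itertools import chain


def tokenize(text):
    # Lowercase the text and keep alphabetic runs only (a simplifying assumption).
    return re.findall(r"[a-z]+", text.lower())


def map_document(text):
    """Map step: emit (key, 1) for every word and every adjacent word pair."""
    words = tokenize(text)
    for w in words:
        yield ("word", w), 1
    for a, b in zip(words, words[1:]):
        if a != b:
            # Sort the pair so that (a, b) and (b, a) share one key.
            yield ("pair", tuple(sorted((a, b)))), 1


def reduce_counts(pairs):
    """Reduce step: sum the emitted values per key."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return totals


def build_graph(totals):
    """Build an undirected word graph; the edge weight is a hypothetical
    correlation: pair count / min(frequency of either word)."""
    graph = {}
    for (kind, item), count in totals.items():
        if kind == "pair":
            a, b = item
            weight = count / min(totals[("word", a)], totals[("word", b)])
            graph.setdefault(a, {})[b] = weight
            graph.setdefault(b, {})[a] = weight
    return graph


def top_keywords(docs, k=3):
    """Rank words by frequency times total edge weight (an assumed score)."""
    totals = reduce_counts(chain.from_iterable(map_document(d) for d in docs))
    graph = build_graph(totals)
    score = {w: totals[("word", w)] * sum(nbrs.values())
             for w, nbrs in graph.items()}
    return sorted(score, key=lambda w: (-score[w], w))[:k]
```

In a real Map/Reduce deployment, `map_document` would run per document on separate workers and `reduce_counts` would run per key after the shuffle phase; here both run in a single process to keep the sketch self-contained.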