RISS Academic Research Information Service

      • KCI-listed

        Semantic Conceptual Relational Similarity Based Web Document Clustering for Efficient Information Retrieval Using Semantic Ontology

        ( Selvalakshmi B ),( Subramaniam M ),( Sathiyasekar K ) 한국인터넷정보학회 2021 KSII Transactions on Internet and Information Syst Vol.15 No.9

        In the rapidly growing web era, web publication is largely about access to web resources. As the web grows, search engines face challenges both in indexing web pages and in producing results for user queries. Methods discussed in the literature for clustering web documents fall short on clustering accuracy. The proposed Semantic Conceptual Relational Similarity (SCRS) based clustering algorithm addresses this problem by measuring similarity through two relationships of a document: the number of semantic relations of a document class that the input document covers, and the number of conceptual relations that the input document covers with respect to that class. Given a data set Ds, the method estimates the SCRS measure of each document Di against every available document class. The class with the maximum SCRS is identified, and the document is indexed under that class. The SCRS measure is computed from the semantic relevancy of the input document to each document of a class. Similarly, a Query Relational Semantic Score (QRSS) is measured for the input query against each class of documents. Based on the QRSS value, the document class is identified, and its documents are retrieved and ranked by QRSS to produce the final population of results. In both cases, the semantic measures are estimated from the concepts available in a semantic ontology. The proposed method produced efficient indexing results and also improved search efficiency.
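
The class-assignment step described in this abstract can be sketched as follows. The abstract does not give the SCRS formula, so the score below (the product of two coverage ratios) is an assumption made only for illustration, and the names `coverage`, `scrs_like_score`, and `assign_class` are hypothetical. Relations are modeled as plain `(concept, related concept)` pairs rather than entries of a real ontology.

```python
from typing import Dict, Set, Tuple

Relation = Tuple[str, str]  # hypothetical (concept, related concept) pair


def coverage(doc_relations: Set[Relation], class_relations: Set[Relation]) -> float:
    """Fraction of the class relations that the document also contains."""
    if not class_relations:
        return 0.0
    return len(class_relations & doc_relations) / len(class_relations)


def scrs_like_score(doc_relations: Set[Relation],
                    class_semantic: Set[Relation],
                    class_conceptual: Set[Relation]) -> float:
    """Assumed combination: semantic coverage times conceptual coverage."""
    return (coverage(doc_relations, class_semantic)
            * coverage(doc_relations, class_conceptual))


def assign_class(doc_relations: Set[Relation],
                 classes: Dict[str, Dict[str, Set[Relation]]]):
    """Score the document against every class and pick the maximum."""
    scores = {
        name: scrs_like_score(doc_relations, c["semantic"], c["conceptual"])
        for name, c in classes.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


classes = {
    "sports": {
        "semantic": {("football", "sport"), ("goal", "football")},
        "conceptual": {("team", "player")},
    },
    "finance": {
        "semantic": {("bank", "money")},
        "conceptual": {("loan", "interest")},
    },
}
doc = {("football", "sport"), ("team", "player")}
best, scores = assign_class(doc, classes)
print(best, scores)
```

The same arg-max pattern would apply to the query side, with a QRSS-style score in place of the document score.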

      • KCI-listed

        Design of Document-HTML Generation Technique for Authorized Electronic Document Communication

        황현천(Hyun-Cheon Hwang),김우제(Woo-Je Kim) 한국산업경영시스템학회 2021 한국산업경영시스템학회지 Vol.44 No.1

        Electronic document communication over digital channels is becoming increasingly important with the advent of the paperless age. PDF-based electronic documents replace paper documents by providing content integrity and independence from particular devices and software, but they do not offer a strong customer experience to mobile device users. HTML5-based electronic documents, by contrast, offer an enhanced customer experience through features such as responsive web technology for mobile users, but they are weak on content integrity because HTML5 has no specification for it. In this paper, we design Document-HTML, which provides both content integrity and a strong customer experience by declaring HTML5 constraint rules and extended tags that contain a PKI-based digital signature. We analyze existing electronic documents used in a major financial enterprise to develop a sample. We also verify Document-HTML by experimenting with sample HTML electronic communication documents and analyzing the PKI equation. Document-HTML documents can be used for authorized electronic document communication and can provide a strong customer experience in the mobile environment between an enterprise and a user.

      • KCI-listed

        An Analysis of Nursing Students' Electronic Document Use and Document Reading Behavior (간호학과 학생들의 전자형태 문서이용 및 문서읽기행태에 대한 분석)

        나경식,이지수 한국정보관리학회 2014 정보관리학회지 Vol.31 No.3

        The purpose of this study is to analyze undergraduate nursing students' use of electronic documents and their reading behavior. To this end, the study surveyed the reading behavior that nursing students experienced over the previous semester and analyzed their document format preferences and related factors, based on questionnaire responses from 509 respondents. The results show that nursing students generally prefer electronic documents to printed documents, and that they also prefer the electronic format for documents they want to keep. About 94% or more of respondents spent 30 minutes or more finding the information they needed; the most used access route was the 'Naver' search engine, the most sought topic was class-related information, and the most common place to find and use information was home. In particular, respondents preferred electronic documents while on the move, mainly because such documents are convenient to carry and easy to access. The findings are intended to improve the understanding of how undergraduate nursing students read printed and electronic documents, to support the design, development, and improvement of medical digital library services and tools in the nursing field, and, more broadly, to provide data for improving user services in digital libraries.

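      • KCI-listed

        A Multi-Vector Document Embedding Methodology Based on Semantic Decomposition of Complex Documents (복합 문서의 의미적 분해를 통한 다중 벡터 문서 임베딩 방법론)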

        박종인,김남규 한국지능정보시스템학회 2019 지능정보연구 Vol.25 No.3

        With the rapidly increasing demand for text data analysis, research and investment in text mining are being actively pursued not only in academia but also in various industries. Text mining is generally conducted in two steps. In the first step, the text of the collected documents is tokenized and structured to convert the original documents into a computer-readable form. In the second step, tasks such as document classification, clustering, and topic modeling are conducted according to the purpose of the analysis. Until recently, text mining studies focused on applications of the second step. However, with the discovery that the text structuring process substantially influences the quality of the analysis results, various embedding methods have been actively studied to improve that quality by preserving the meaning of words and documents when text data is represented as vectors. Unlike structured data, which can be directly used in a variety of operations and traditional analysis techniques, unstructured text must first go through a structuring task that transforms the original document into a form the computer can understand. Mapping arbitrary objects into a space of a specific dimension while maintaining their algebraic properties, in order to structure text data, is called "embedding". Recently, attempts have been made to embed not only words but also sentences, paragraphs, and entire documents. In particular, as the demand for document embedding analysis increases rapidly, many algorithms have been developed to support it. Among them, doc2Vec, which extends word2Vec and embeds each document into one vector, is the most widely used. However, the traditional document embedding method represented by doc2Vec generates a vector for each document using all the words included in the document. As a result, the document vector is affected not only by core words but also by miscellaneous words. Additionally, traditional document embedding schemes usually map each document to a single vector, so it is difficult to represent a complex document with multiple subjects accurately. In this paper, we propose a new multi-vector document embedding method to overcome these limitations. This study targets documents that explicitly separate body content and keywords. For a document without keywords, the method can be applied after extracting keywords through various analysis methods; since this is not the core subject of the proposed method, we describe the process of applying it to documents with predefined keywords. The proposed method consists of (1) Parsing, (2) Word Embedding, (3) Keyword Vector Extraction, (4) Keyword Clustering, and (5) Multiple-Vector Generation. Specifically, all text in a document is tokenized, and each token is represented as an N-dimensional real-valued vector through word embedding. Then, to overcome the influence of miscellaneous words, the vectors corresponding to the keywords of each document are extracted to form a set of keyword vectors for each document. Next, clustering is conducted on each document's set of keyword vectors to identify the multiple subjects included in the document. Finally, a multi-vector is generated from the keyword vectors that constitute each cluster. Experiments on 3,147 academic papers revealed that the traditional single-vector approach cannot properly map complex documents because of interference among subjects in each vector. With the proposed multi-vector based method, we ascertained that complex documents ...

        Research on structuring unstructured text data has recently been active in support of diverse text data analyses. Because the existing document embedding method represented by doc2Vec builds a vector from every word in a document, the document vector is influenced not only by core words but also by surrounding words. In addition, because existing methods represent one document as one vector, it is difficult to map accurately a complex document that combines multiple topics. To overcome these two limitations, this paper proposes a new multi-vector document embedding methodology. Specifically, the proposed methodology vectorizes a document using only its core words rather than all words, and decomposes the topics the document contains so that a single document is represented as a set of multiple vectors. Experiments on a total of 3,147 papers collected from KISS confirmed the vector distortion that occurs when a complex document is represented as a single vector, and confirmed that the proposed methodology, which semantically decomposes a complex document into multiple vectors, corrects this distortion and embeds each document more accurately.
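
Steps (3) to (5) of the pipeline described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract names neither the clustering algorithm nor how a cluster becomes a vector, so the sketch assumes a small deterministic k-means and takes the mean of each cluster. The toy `embedding` table stands in for a trained word embedding, and all function names are hypothetical.

```python
from typing import Dict, List, Sequence

Vector = List[float]


def mean(vectors: Sequence[Vector]) -> Vector:
    """Component-wise mean of equally sized vectors."""
    n = len(vectors)
    return [sum(component) / n for component in zip(*vectors)]


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors: List[Vector], k: int, iterations: int = 10) -> List[List[Vector]]:
    """Tiny k-means; the first k vectors seed the centroids (assumption)."""
    centroids = [list(v) for v in vectors[:k]]
    clusters: List[List[Vector]] = []
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for v in vectors:
            nearest = min(range(k), key=lambda i: sq_dist(v, centroids[i]))
            clusters[nearest].append(v)
        # Keep the old centroid when a cluster ends up empty.
        centroids = [mean(c) if c else centroids[i] for i, c in enumerate(clusters)]
    return [c for c in clusters if c]


def multi_vector_embedding(keywords: List[str],
                           embedding: Dict[str, Vector],
                           k: int) -> List[Vector]:
    """Keyword vector extraction, keyword clustering, one vector per cluster."""
    keyword_vectors = [embedding[w] for w in keywords if w in embedding]
    k = min(k, len(keyword_vectors))
    if k == 0:
        return []
    return [mean(cluster) for cluster in kmeans(keyword_vectors, k)]


# Toy 2-dimensional embedding with two obvious topics.
embedding = {
    "stock": [1.0, 0.0], "bond": [0.9, 0.1],
    "soccer": [0.0, 1.0], "goal": [0.1, 0.9],
}
vectors = multi_vector_embedding(["stock", "soccer", "bond", "goal"], embedding, k=2)
print(vectors)
```

A document that mixes finance and sports keywords ends up with two vectors, one near each topic, instead of a single averaged vector that sits between them.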

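      • KCI-listed

        Design and Implementation of a Document-HTML System for Trusted Authorized Electronic Document Communication (신뢰적 공인전자문서 소통을 위한 Document-HTML 시스템의 설계 및 구현)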

        황현천 ( Hyun-Cheon Hwang ),김우제 ( Woo-Je Kim ) 조선대학교 공학기술연구원 2021 공학기술논문지 Vol.14 No.2

        Electronic documents, rather than paper documents, have become essential for communication in the digital era. An electronic document such as a PDF, which has the same layout as a paper document, is delivered through a channel such as e-mail. In the mobile era, delivering an HTML document to a mobile application has also become a means of communication. However, unlike a PDF document, an HTML document is composed together with related external resource files such as images, and there is no explicit specification that guarantees content integrity against unauthorized modification. This study investigates a Document-HTML specification that guarantees document integrity as a PDF document does and provides a better customer experience in the mobile environment based on HTML5. Furthermore, we designed and implemented a Document-HTML system in a cloud computing environment. The system provides a stateless REST API for creating and validating Document-HTML documents, and a Document-HTML document provides RSA cryptography level content integrity by using a PKI-based digital signature. The Document-HTML system can be used as a secure customer communication platform in enterprises to provide content integrity and a better user experience in the mobile environment.
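
The integrity problem this abstract raises, an HTML page plus external resource files that can each be modified, can be illustrated with a digest that covers all of them. This is only a sketch under assumptions: it is not the Document-HTML specification, the function names are hypothetical, and it computes a SHA-256 digest rather than an RSA signature. In a PKI scheme the digest would additionally be signed with the issuer's private key, a step omitted here because the Python standard library has no RSA support.

```python
import hashlib
import hmac
from typing import Dict


def content_digest(html: str, resources: Dict[str, bytes]) -> str:
    """SHA-256 digest over the HTML text and every external resource."""
    h = hashlib.sha256()
    h.update(hashlib.sha256(html.encode("utf-8")).digest())
    # Sort by name so the digest does not depend on dictionary order.
    for name in sorted(resources):
        h.update(hashlib.sha256(name.encode("utf-8")).digest())
        h.update(hashlib.sha256(resources[name]).digest())
    return h.hexdigest()


def is_unmodified(html: str, resources: Dict[str, bytes], expected: str) -> bool:
    """Recompute the digest and compare it in constant time."""
    return hmac.compare_digest(content_digest(html, resources), expected)


html = "<html><body><img src='logo.png'><p>Statement</p></body></html>"
resources = {"logo.png": b"\x89PNG-example-bytes"}
issued = content_digest(html, resources)

print(is_unmodified(html, resources, issued))                    # unchanged content
print(is_unmodified(html, {"logo.png": b"tampered"}, issued))    # altered resource
```

Hashing each part separately before combining them keeps a change in a resource name from being confused with a change in resource content.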

      • Efficient Document Similarity Detection Using Weighted Phrase Indexing

        Papias Niyigena,Zhang Zuping,Mansoor Ahmed Khuhro,Damien Hanyurwimfura 보안공학연구지원센터 2016 International Journal of Multimedia and Ubiquitous Vol.11 No.5

        Document similarity techniques mostly rely on single-term analysis of the documents in a data set. To improve the efficiency and effectiveness of document similarity detection, many researchers have developed and presented more informative feature terms. In this paper, we present a phrase weight index, which indexes the documents in the data set based on important phrases. Phrasal indexing aims to reduce the ambiguity inherent in words considered in isolation and thereby improve the effectiveness of document similarity computation. The method presented here inherits the term tf-idf weighting scheme to identify important phrases in the collection: it computes the weight of each phrase in the document collection, and the phrases that satisfy a given threshold are identified as important and indexed. The data dimensionality that hinders the performance of document similarity methods is addressed by creating an offline index of the important phrases for every document. The evaluation experiments indicate that the presented method is very effective for document similarity detection and that its quality surpasses both the traditional phrase-based approach, which ignores dimensionality reduction, and other methods that use single-word tf-idf.
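
The idea of weighting phrases with tf-idf and indexing only those above a threshold can be sketched as follows. The abstract does not define what counts as a phrase or the exact weighting, so this sketch assumes word bigrams and the textbook form tf × ln(N / df); `bigrams` and `build_phrase_index` are hypothetical names, not the authors' code.

```python
import math
from collections import Counter, defaultdict
from typing import Dict, List


def bigrams(text: str) -> List[str]:
    """Adjacent word pairs, used here as a stand-in for phrases."""
    tokens = text.lower().split()
    return [" ".join(pair) for pair in zip(tokens, tokens[1:])]


def build_phrase_index(docs: Dict[str, str],
                       threshold: float) -> Dict[str, Dict[str, float]]:
    """Map each important phrase to the documents containing it and its weight."""
    phrase_counts = {doc_id: Counter(bigrams(text)) for doc_id, text in docs.items()}

    # Document frequency: in how many documents each phrase appears.
    doc_freq: Counter = Counter()
    for counts in phrase_counts.values():
        doc_freq.update(counts.keys())

    n_docs = len(docs)
    index: Dict[str, Dict[str, float]] = defaultdict(dict)
    for doc_id, counts in phrase_counts.items():
        total = sum(counts.values())
        for phrase, count in counts.items():
            weight = (count / total) * math.log(n_docs / doc_freq[phrase])
            if weight >= threshold:  # keep only the important phrases
                index[phrase][doc_id] = weight
    return dict(index)


docs = {
    "d1": "machine learning improves document similarity",
    "d2": "machine learning needs data",
    "d3": "wooden tablets record official documents",
}
index = build_phrase_index(docs, threshold=0.2)
for phrase, postings in sorted(index.items()):
    print(phrase, postings)
```

Because the index is built once, offline, a similarity query only has to compare the retained phrases, which is where the dimensionality reduction described above comes from.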

      • KCI-listed

        The Format and Writing Materials of Ancient Korean Document Wooden Tablets (韓國 古代 文書木簡의 書式과 서사재료)

        홍승우 경북대학교 인문학술원 2022 동서인문 Vol.- No.19

        This paper analyzes the format of official documents recorded on ancient Korean wooden tablets in relation to the characteristics of the writing material, as part of an effort to shed light on the characteristics of ancient Korean wooden tablet culture. The study focuses mainly on Baekje documents related to household registers and on Silla's four-sided wooden tablet documents that were sent and received. The Bokam-ri documents and the Gungnamji documents are similar in content but differ in format. The Bokam-ri documents divide one side into several sections and enter each item in a new section so that the contents can be grasped visually. This format reflects the characteristics of wooden tablets well. The Gungnamji documents, on the other hand, are written in-line, with a blank space separating each item. This format is typical of paper documents and conveys information through text content alone. Several four-sided wooden tablets used for official documents sent and received between officials were excavated from Seongsan Mountain Fortress in Haman and Wolseong Moat in Gyeongju. These were written in-line just like paper documents and, as such, conveyed content accurately through, for example, specific terms that aid understanding of the content or line changes suited to the four-sided tablet. These wooden tablets have been classified as stand-alone documents that are complete in content. Nonetheless, based on the format characteristics identified in this paper, it appears that the documents could have been produced and used as parts of a series of documents. Indeed, not every wooden tablet presents complete content by itself. This is because these ancient Korean documents on wooden tablets were produced at a time when paper documents prevailed and wooden tablets served supplementary purposes, for example as attached documents. In conclusion, while the format of ancient Korean documents written on wooden tablets was basically more suitable for paper documents, formats typical of wooden tablets as a writing material were also used.

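      • KCI-listed

        A Study on the Documentary Status of Electronic and Electronized Documents in Criminal Justice Procedures for Simple Traffic Cases (단순 교통사건 형사사법절차상 전자(화)문서의 문서성에 대한 고찰)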

        박수호,이상진 경찰대학 경찰대학 2019 경찰학연구 Vol.19 No.1

        This paper examines the laws that regulate electronic documents and electronized documents (paper documents converted to electronic form) and the problems arising from them. It considers the documentary status of electronic documents that the police write in practice, such as records of statements and protocols of suspect examination, and of electronized documents created by electronically processing evidentiary documents submitted by investigation cooperators such as witnesses. It also proposes a way to replace investigation documents written on paper with electronic or electronized documents under current law. To solidify the evidentiary capacity, as documents, of the electronic and electronized documents distributed in the criminal justice information system, the paper makes four points. First, an electronic document must be written and electronically signed by a police officer, as is currently done. Second, the signature and seal of the confirming person on records concerning witnesses, victims, and suspects should be replaced by an electronic signature made with an accredited certificate; likewise, when paper documents are converted into electronized documents, the system should be improved so that the civil petitioner who submits them signs electronically with an accredited certificate. Third, with regard to the evidentiary capacity of electronized documents, the facility and training requirements that follow from the Regulations on the Procedure and the Method of Writing an Electronized Document (Ministry of Science and ICT Notice No. 2017-7) and the Regulations on the Electronized Documents Writing Facility or the Detailed Matters of the Equipment Certifying Standard (Ministry of Knowledge Economy Notice No. 2012-67) must be complied with. Fourth, criminal law needs to be reviewed and interpreted so that electronic and electronized documents are recognized as having the same effect as paper documents. The authors hope that electronic and electronized documents will come to be recognized as equal in effect to paper documents in criminal justice procedure, so that paper documents no longer need to be produced.

      • KCI candidate

        The Development of Ancient Chinese Official Documents and the Current State of Research (中國 古代公文書 發展過程과 硏究現況)

        딩 샤오 창,심재권 서울대학교 규장각한국학연구원 2009 규장각 Vol.34 No.-

        The development of China's ancient official documents can be divided into four stages: nascent, initial, growth, and settlement-and-decline. Since the 1980s, the study of Chinese practical writing styles has progressed remarkably and produced notable results. Current research can be characterized as follows. Research on documents in China is carried out in departments of secretarial studies, document studies, and applied writing, by the relevant organizations, professors, and researchers at specialized institutions. The objects and directions of study are: (1) document compilation organizations and their personnel; (2) document-writing skills; (3) styles of official documents; (4) formulas and methods for processing official documents; (5) systems and regulations for processing official documents; (6) the language and style of official documents. These studies naturally cover ancient as well as contemporary documents and emphasize practice as well as theory. There are many related studies and significant related texts, but in terms of level and results, very few are of excellent quality. Most studies concern official documents, mainly as practical writing. The main constraints on the study of practical styles are an absolute shortage of researchers and an insufficient research foundation. These conditions have recently been improving gradually, though they remain at a low level. The participation of departments from diverse fields is broadening the group of researchers working on practical styles and is consequently having a good influence on the field. The development and change of government systems are objects of study in departments of history. The close connection between government systems and systems of official documents led some historians to take an interest in ancient documents, which produced many results. Since the beginning of the last century, many ancient bamboo slips and documents have been excavated in Dunhuang, Turfan, Yunmeng, Guodian, and elsewhere. These provide historical studies with important new sources from the Warring States period through the Five Dynasties period, and new fields of study have appeared as a result. Studies of these ancient records touch on ancient practical styles, and many related studies have consequently produced results. A great number of ancient Chinese records were taken overseas, so studies on them have accumulated in many countries.

      • KCI-listed

        A Document Integration Technique Using an FCA-Based Hierarchical Structure (FCA 기반 계층적 구조를 이용한 문서 통합 기법)

        김태환(Taehwan Kim),전호철(Hocheol Jeon),최종민(Joongmin Choi) 한국지능정보시스템학회 2011 지능정보연구 Vol.17 No.3

        The World Wide Web is a very large distributed digital information space. From its origins in 1991, the web has grown to encompass diverse information resources such as personal home pages, online digital libraries, and virtual museums. Some estimates suggest that the web currently includes over 500 billion pages in the deep web. The ability to search and retrieve information from the web efficiently and effectively is an enabling technology for realizing its full potential. With powerful workstations and parallel processing technology, efficiency is not a bottleneck. In fact, some existing search tools sift through gigabyte-size precompiled web indexes in a fraction of a second. But retrieval effectiveness is a different matter. Current search tools retrieve too many documents, of which only a small fraction are relevant to the user query. Furthermore, the most relevant documents do not necessarily appear at the top of the query output order. Also, current search tools cannot retrieve, from a gigantic number of documents, the documents related to a retrieved document. The most important problem for many current search systems is to increase the quality of search, that is, to provide related documents and to reduce the number of unrelated documents in the search results as far as possible. For this problem, CiteSeer proposed ACI (Autonomous Citation Indexing) of the articles on the World Wide Web. A "citation index" indexes the links between articles that researchers make when they cite other articles. Citation indexes are very useful for a number of purposes, including literature search and analysis of the academic literature. References contained in academic articles give credit to previous work in the literature and provide a link between the "citing" and "cited" articles. A citation index indexes the citations that an article makes, linking the articles with the cited works. Citation indexes were originally designed mainly for information retrieval. The citation links allow navigating the literature in unique ways. Papers can be located independent of language and of the words in the title, keywords, or document. A citation index allows navigation backward in time (the list of cited articles) and forward in time (which subsequent articles cite the current article?). But CiteSeer cannot index links between articles that researchers do not make, because it indexes only the links that researchers create when they cite other articles. For the same reason, CiteSeer does not scale easily. These problems motivate the design of a more effective search system. This paper presents a method that extracts the subject and predicate of each sentence in a document. A document is converted into a tabular form in which each extracted predicate is checked against its possible subjects and objects. We build a hierarchical graph of a document from the table and then integrate the graphs of the documents. From the graph of the entire document set, the area of each document is calculated relative to the integrated documents, and relations among documents are marked by comparing document areas. The paper also proposes a method for structural integration of documents that retrieves documents from the graph, making it easier for the user to find information. We compared the performance of the proposed approaches with the Lucene search engine using the formulas for ranking. As a result, the F-measure is about 60%, an improvement of about 15%.
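
The backward and forward navigation that this abstract attributes to a citation index can be sketched with two adjacency maps. This is a minimal illustration, not CiteSeer's ACI implementation or the paper's graph method; the class `CitationIndex` and its method names are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Set


class CitationIndex:
    """Stores citation links in both directions for fast navigation."""

    def __init__(self) -> None:
        self._cites: Dict[str, Set[str]] = defaultdict(set)     # article -> works it cites
        self._cited_by: Dict[str, Set[str]] = defaultdict(set)  # article -> works citing it

    def add_citation(self, citing: str, cited: str) -> None:
        """Record that `citing` references `cited`."""
        self._cites[citing].add(cited)
        self._cited_by[cited].add(citing)

    def backward(self, article: str) -> Set[str]:
        """Backward in time: the articles that `article` cites."""
        return set(self._cites.get(article, ()))

    def forward(self, article: str) -> Set[str]:
        """Forward in time: the later articles that cite `article`."""
        return set(self._cited_by.get(article, ()))


idx = CitationIndex()
idx.add_citation("B", "A")
idx.add_citation("C", "A")
idx.add_citation("C", "B")

print(sorted(idx.backward("C")))  # what C cites
print(sorted(idx.forward("A")))   # who cites A
```

The structure also shows the limitation the abstract points out: two related articles with no citation between them have no edge in either map, so the index alone cannot connect them.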
