RISS 검색 - 국내학술지논문

무료
기관 내 무료
유료

내보내기
내책장담기
한글로보기

정확도순

내림차순

내림차순

10개씩 출력

1
영어자원문법을 활용한 신경망 기계번역의 데이터 증강과 성능 평가

왕규현 ( Wang Guehyun ),송상헌 ( Song Sanghoun ) 서강대학교 언어정보연구소 2021 언어와 정보 사회 Vol.42 No.-
- 원문보기 2
  KCI
  
  KISS
Machine translation commonly involves both analysis and generation across different human languages. This implies that parallel corpora of a large size are essential to create a theoretically reliable and practically robust translation model. However, as is well known, the parallel corpora between Korean and English are insufficient. In this respect, this study expands the data by means of English Resource Grammar (Flickinger 2000) to improve the translation model between the languages. Then, it looks at whether the neural machine translation model performs better with the augmented data. Unfortunately, it turns out the translation models based on augmented data exhibit rather lower BLEU scores. This study further discusses the reason for the unsatisfactory scores and raises the necessity of human evaluation as a next step.
2
사용자 이력을 활용한 앙상블 방식의 도서 난이도 연구

왕규현 ( Wang Guehyun ),이수진 ( Lee Soojin ),정희원 ( Jung Heewon ),임누리 ( Lim Nuri ) 한국독서학회 2022 독서연구 Vol.- No.65
- 원문보기
본 연구는 도서 텍스트에 대해 사용자 이력을 바탕으로 대용량 데이터를 구축하고, 적합한 기계학습 모델을 이용하여 난이도를 분석하는 데 목적이 있다. 한국어 텍스트 난이도 측정 선행연구에서 대용량의 데이터와 적절한 모델을 활용한 분석이 부족함을 살피고, 이를 해소하기 위해 실제 기업의 서비스 데이터를 활용하여 대용량의 데이터를 구축하고 앙상블 방식의 기계학습 모델을 이용해 난이도를 측정했다. 도서 텍스트 데이터는 단어 수준, 문장 수준, 문단 수준의 19개의 지표로 분석되었고 사용자 이력을 통해 설정된 나이를 예측하는 방식으로 모델을 훈련하였다. 모델의 평가를 위해 전문가 평정을 받은 읽기능력 진단검사 지문을 활용하여 사용자 이력 기반 데이터의 적절성과 앙상블 방식 기계학습 모델의 성능이 뛰어남을 보였다. 본 연구는 대용량 데이터를 구축하기 어려운 텍스트 난이도 측정 분야에서 사용자 이력을 통한 나이 설정이 적절하다는 것을 밝힌 점과 설명력 있는 기계학습 모델을 활용한 분석이 효과적이라는 점을 소개한 것에 그 의의가 있다. 본 연구 결과는 텍스트 난이도 측정 분야의 데이터 설정 및 모델 활용에 도움을 줄 것으로 기대한다. The purpose of this study is to construct large data for redability assesment based on user histories, and to analyze the difficulty using a suitable machine learning model. In previous Korean redability assesment studies, we examined the lack of analysis using large data and appropriate machine learning models. In this paper, book text data was analyzed with 19 features at word level, sentence level, and paragraph level, and the model was trained in a way to predict the age set through user history. To evaluate the model, the text data that received an expert rating were used, As a result of the evaluation, it was shown that the user history-based data and model are suitable. This study is meaningful in that it revealed that age setting through user history is appropriate in the field of redability assesment, where it is difficult to construct large data, and that analysis using an explanatory machine learning model is effective. The results of this study are expected to help the field of redability assesment to set data and model.
3
DeepKLM - 통사 실험을 위한 전산 언어모델 라이브러리 -

이규민 ( Lee Gyu-min ),김성태 ( Seongtae Kim ),김현수 ( Hyunsoo Kim ),박권식 ( Kwonsik Park ),신운섭 ( Unsub Shin ),왕규현 ( Guehyun Wang ),박명관 ( Myung-kwan Park ),송상헌 ( Sanghoun Song ) 연세대학교 언어정보연구원(구 연세대학교 언어정보개발원) 2021 언어사실과 관점 Vol.52 No.-
- 원문보기 2
  KCI
  
  KISS
This paper introduces DeepKLM, a deep learning library for syntactic experiments. The library enables researchers to use the state-of-the-art deep computational language model, based on BERT (Bidirectional Encoder Representations from Transformers). The library, written in Python, works to fill the masked part of a sentence with a specific token, similar to the Cloze task in the traditional language experiments. The output value of surprisal is related to human language processing in terms of speed and complexity. The library additionally provides two visualization tools of the heatmap and the attention head visualization. This article also provides two case studies of NPIs and reflexives employing the library. The library has room for improvement in that the BERT-based components are not entirely on par with those in human language sentences. Despite such limits, the case studies imply that the library enables us to assess human and deep learning machines’ language ability.

내보내기
내책장담기
한글로보기

정확도순

내림차순

내림차순

10개씩 출력

맨처음 페이지로 1 맨끝 페이지로

상세검색

RISS 보유자료

상세검색

해외전자자료

연관 검색어 추천