RISS Academic Research Information Service


      KCI Excellent Accredited Journal

      Combinations of Text Preprocessing and Word Embedding Suitable for Neural Network Models for Document Classification

      https://www.riss.kr/link?id=A105451690

      Additional Information

      Multilingual Abstract

      Neural networks with word embedding have recently been used for document classification. Researchers have concentrated on designing new architectures or optimizing model parameters to increase performance. However, most recent studies have overlooked text preprocessing and word embedding, in that the description of the text preprocessing used is often insufficient and a particular pretrained word-embedding model is typically adopted without any plausible reason. Our paper shows that finding a suitable combination of text preprocessing and word embedding can be one of the important factors required to enhance performance. We conducted experiments on the AG's News dataset to compare the possible combinations, along with zero versus random padding and the presence or absence of fine-tuning. We used pretrained word-embedding models such as skip-gram, GloVe, and fastText. For diversity, we also used an average of multiple pretrained embeddings (Average), a randomly initialized embedding (Random), and a skip-gram model trained on the task data (AGNews-Skip). In addition, we used three advanced neural networks for the sake of generality. Experimental results based on OOV (out-of-vocabulary) word statistics suggest the necessity of these comparisons and indicate a suitable combination of text preprocessing and word embedding.
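
      The OOV statistics mentioned above are straightforward to reproduce for any preprocessing/embedding pair. The sketch below is a minimal illustration, not the paper's code: the two preprocessing pipelines are assumed stand-ins, and embedding_vocab can be any set of tokens, for example built from a gensim KeyedVectors vocabulary.

          from nltk.tokenize import word_tokenize  # requires NLTK's 'punkt' tokenizer data

          def oov_rate(texts, embedding_vocab, preprocess):
              """Fraction of distinct corpus tokens missing from the embedding vocabulary."""
              types = set()
              for text in texts:
                  types.update(preprocess(text))
              missing = sum(1 for tok in types if tok not in embedding_vocab)
              return missing / max(len(types), 1)

          # Two illustrative preprocessing variants (assumptions, not the paper's exact ones).
          def simple_lower(text):
              return text.lower().split()   # bare whitespace split, lowercased

          def nltk_cased(text):
              return word_tokenize(text)    # NLTK tokenization, original casing kept

          # Usage sketch: compare pipelines against one pretrained vocabulary, e.g.
          #   vocab = set(KeyedVectors.load_word2vec_format(path).key_to_index)
          #   for pipe in (simple_lower, nltk_cased):
          #       print(pipe.__name__, oov_rate(ag_news_texts, vocab, pipe))

      Because pretrained vocabularies differ in casing and tokenization conventions, the same corpus yields a different OOV rate under each pipeline, which is why the abstract argues the combination should be compared rather than fixed in advance.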

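      The zero/random padding and fine-tuning conditions likewise reduce to two switches applied while the embedding matrix is assembled. Below is a minimal PyTorch sketch under assumed conventions (row 0 reserved for the padding token; pretrained is any word-to-vector mapping such as a gensim KeyedVectors object); it illustrates the knobs the experiments vary, not the authors' implementation.

          import numpy as np
          import torch
          import torch.nn as nn

          def build_embedding(vocab, pretrained, dim, pad_init="zero", fine_tune=True):
              # vocab      : dict token -> row index (index 0 reserved for padding)
              # pretrained : mapping supporting `tok in pretrained` and `pretrained[tok]`
              # pad_init   : "zero" or "random", the padding-vector condition
              # fine_tune  : if True, embedding weights are updated during training
              rng = np.random.default_rng(0)
              weights = np.zeros((len(vocab) + 1, dim), dtype=np.float32)
              if pad_init == "random":
                  weights[0] = rng.uniform(-0.25, 0.25, dim)  # "zero" leaves row 0 as zeros
              for token, idx in vocab.items():
                  if token in pretrained:
                      weights[idx] = pretrained[token]
                  else:
                      # OOV token: small random vector, a common convention
                      weights[idx] = rng.uniform(-0.25, 0.25, dim)
              return nn.Embedding.from_pretrained(
                  torch.from_numpy(weights), freeze=not fine_tune, padding_idx=0
              )

      Setting freeze=not fine_tune keeps the pretrained vectors fixed when fine-tuning is absent; with fine-tuning present, the classifier adapts them to the task data during training, which is the condition the abstract compares.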


      Journal History

      Date         Event                  Detail                                                                        Accreditation
      2021         Evaluation scheduled   Subject to application for continued evaluation (accreditation maintained)   -
      2016-01-01   Evaluation             Selected as an Excellent Accredited Journal (continued evaluation)           -
      2015-01-01   Evaluation             Accredited journal status maintained (accreditation maintained)              KCI Accredited
      2002-01-01   Evaluation             Journal consolidation (accreditation maintained)                             KCI Accredited

      Journal Citation Metrics (base year 2016)

      WOS-KCI Combined IF (2-year)   0.19
      KCI IF (2-year)                0.19
      KCI IF (3-year)                0.19
      KCI IF (4-year)                0.2
      KCI IF (5-year)                0.18
      Centrality Index (3-year)      0.373
      Immediacy Index                0.07
