HierarchicalPatentQA: Construction and Performance Evaluation of a Question-Answering Dataset Leveraging the Structural Characteristics of Patent Documents

https://www.riss.kr/link?id=A109809720

Authors

Yang-Hoon Ham (함양훈) ; Yeong-Eun Moon (문영은) ; Ye-Ji Eum (엄예지) ; Junseok Lee (이준석)
Publisher
Korean Institute of Intelligent Systems (한국지능시스템학회)
Journal
Journal of Korean Institute of Intelligent Systems (한국지능시스템학회논문지)
Volume/Issue

Vol.35 No.3 [2025]
Publication year
2025
Language
Korean
Keywords

Natural language processing (자연어처리) ; Big data (빅데이터) ; Patent data (특허데이터) ; LLM ; EXAONE
KDC
003
Indexing status
KCI-indexed
Document type
Academic journal
Publisher URL
http://www.fuzzy.or.kr
Pages

285-292 (8 pages)
Provider
DBpia


Multilingual Abstract

This study proposes a methodology for constructing a new QA dataset that exploits the structural characteristics of patent documents. Existing patent search systems struggle to capture the overall context of a document because they rely on fragmentary searches over the title, abstract, and claims. To overcome this limitation, this study analyzed 3,000 patent documents from 2000 to 2021 and constructed a dataset of 1,071 question-answer pairs covering sections such as the background technology, the technical field, and the implementation details of the invention. Questions and answers were generated from the structure of each patent document using the hierarchical reasoning framework of the EXAONE 3.5 7.8B model together with a Retrieval-Augmented Generation (RAG) method. A KoELECTRA model trained on the constructed dataset achieved an EM score of 0.943 and an F1 score of 0.986, a significant performance improvement over existing patent QA benchmarks. The study's contribution is a dataset construction methodology based on the hierarchical structure of patent documents, which suggests a new direction for patent information processing.
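The abstract reports EM (exact match) and F1 without defining them. As a rough illustration only, the sketch below shows how these two scores are commonly computed for extractive QA in the SQuAD style. The normalization (lowercasing, whitespace collapsing) and whitespace tokenization are assumptions made for this example; the paper's actual evaluation script is not described on this page, and Korean QA benchmarks often use character-level overlap instead. The function names `normalize`, `exact_match`, `token_f1`, and `evaluate` are hypothetical.

```python
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace (an assumed, minimal normalization)."""
    return " ".join(text.lower().split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def token_f1(prediction: str, reference: str) -> float:
    """Harmonic mean of token precision and recall over whitespace tokens."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    if not pred_tokens or not ref_tokens:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


def evaluate(pairs):
    """Average EM and F1 over a non-empty list of (prediction, reference) pairs."""
    n = len(pairs)
    em = sum(exact_match(p, r) for p, r in pairs) / n
    f1 = sum(token_f1(p, r) for p, r in pairs) / n
    return em, f1
```

Under these definitions F1 gives partial credit for answers that overlap the reference without matching it exactly, so a corpus-level F1 is never lower than EM, which is consistent with the reported 0.986 versus 0.943.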


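Other articles in the same journal issue

Korean Institute of Intelligent Systems paper submission guidelines and other notices
- Korean Institute of Intelligent Systems
- Editorial office (editor)
- 2025
- KCI-indexed
Intelligent FPGA-Based Image Enhancement Using Wavelet Filtering and Superpixel Segmentation for Real-Time Vision Systems
- Korean Institute of Intelligent Systems
- Islam Md Tarikul (타리쿨 이슬람)
- 2025
- KCI-indexed
Safety Evaluation of Autoencoder-Based Reconstructed Images
- Korean Institute of Intelligent Systems
- Young-Ho Ko (고영호)
- 2025
- KCI-indexed
Grasping Deformable Objects via Impedance Control of a Three-Fingered Robot Manipulator
- Korean Institute of Intelligent Systems
- Song Woo Kim (김송우)
- 2025
- KCI-indexed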
