바로가기메뉴

본문 바로가기 주메뉴 바로가기

logo

자동 발췌문/요약 시스템 구축에 관한 연구- 학술지 논문기사를 중심으로 -

A Study on the Construction of the Automatic Extracts and Summaries: On the Basis of Scientific Journal Articles

한국문헌정보학회지 / Journal of the Korean Society for Library and Information Science, (P)1225-598X; (E)2982-6292
2005, v.39 no.3, pp.139-163
이태영 (전북대학교)
  • 다운로드 수
  • 조회수

초록

코퍼스 기반의 제 방법, 담화구조의 수사역할, 유사문장의 통합을 이용하여 발췌문과 기초적 요약문을 자동으로 작성하는 방법론을 구축하였다. 코퍼스에 따른 기법들의 효율적 한계치를 사전에 확인하였고 발췌/요약문의 신축적 작성을 위해서 요약문을 이루는 문장들의 수사역할을 목적, 배경, 방법, 결과, 결론 등으로 정하고 각각의 발췌기를 적용하였다. 발췌 성공률은 90%이었다. 수사역할별로 선정된 문장의 합성과 분리를 위하여 유사도 공식을 이용한 유사문장의 통합, 불필요한 의미의 수식절, 삽입절의 제거, 짧은 문장들과 연결이 가능한 문장들의 합성을 시도하였다. 높은 발췌 성공률을 바탕으로 문장의 수사역할, 절의 용언어미 표징, 단서적 어구와 소재를 가미한 문장 정리 시스템의 개발이 요망된다.

keywords
Automatic Summaries, Extraction Methods, Location, Rhetorical Roles, Web, 단서, 발췌기법, 소재, 수사역할, 웹시스템, 의미범주, 자동발췌문, 자동요약, 중요어, Automatic Summaries, Extraction Methods, Location, Rhetorical Roles, Web

Abstract

Various corpus-based approaches, rhetorical roles of discourse structure, and unifications of similar sentences were applied to construct the automatic Ext/Sums(extracts and summaries). Rhetorical roles of sentences like objective, method, background, result, conclusion, etc. for making elastic Ext/Sums were established and extraction engines according to respective role were prepared. The 90% of Success rate in extracting the important sentences of sample articles was accomplished. Rearranging the selected sentences, it used unification of similar sentences using the cosine coefficient equation, deletion of unnecessary modification and insertion clauses, junction of short sentences, and connection of sentences able to link. They suggest the methods applying rhetorical roles of sentences, meaning and signature of noun and verb in clauses, and cue words and location will be researched to construct the more effective Ext/Sums.

keywords
Automatic Summaries, Extraction Methods, Location, Rhetorical Roles, Web, 단서, 발췌기법, 소재, 수사역할, 웹시스템, 의미범주, 자동발췌문, 자동요약, 중요어, Automatic Summaries, Extraction Methods, Location, Rhetorical Roles, Web

참고문헌

1.

동적 시소러스의 구축에 관한 실험적 연구,

2.

한국어 초록 작성의 자동화에 대한 연구-미생물학분야 학술지의 논문을 대상으로-,

3.

(.1999.) A Trainable Summarizer with Konwledge Acquired from Robust NLP Techniques” quo- ted in I. Mani and M.T. Maybury . 1999. Advanced in Automatic Text Summarization. Cambridge the MIT Press.,

4.

(1997.) “Using Lexical Chains for Text Summari- zation” In Proceedings of the Work- shop on Intelligent Scalable Text Summarization at the ACL/EACL Conference,

5.

(1997.) “Salience- based Content Characterization of Text Documents In Proceedings of the Workshop on Intelligent Scalable Text summarization at the ACL/ EACL Conference,

6.

(1995.) and L. Rau. “Automatic condensation of Electronic Publications by Sentence Selection.,

7.

G. G. 1999. Introduction to Mordern Information Retrieval. London, Library Association Publishing.

8.

(1998.270.) L. L. 1970. “Experiments in Automatic Extracting and Indexing. 313-334. quoted in F. W. Lancaster. Indexing and Abstracting in Theory and Prac- tice. London,

9.

(1998.269.) H. P. 1969. “New Methods in Automatic Extracting. Journal. of ACM 377-391. quoted in F. W. Lancaster. Indexing and Abstrac- ting in Theory and Practice,

10.

. “Lexical Chains as representation of context for the detection and correc- tion of malapropisms”. In Fellbaum An Electronic Lexical Database and Some of its Applications. Cambridge The MIT Press.,

11.

E. and C. Lin. 1999. “Automated Text Summarization in SUMMARIST” In Proceedings of the Workshop on Gaps and Bridges in NL Planning and Generation,

12.

(k.s.1999.“automaticsummarizing) quoted in I. Mani and M.T. Maybury . 1999. Advanced in Automatic Text Sum- marization. Cambridge the MIT Press.,

13.

. Proceedings of the Eighteenth Annual International ACM Conference on Research and Development in Infor- maton Retrieval,

14.

and V. Martinez. 2004. “Automatic Generation of Domain Representations Using Thesaurus Struc- tures,

15.

. 1999. Advanced in Automatic Text Sum- marization. Cambridge the MIT Press.,

16.

and J. Du- mortier. 1999. “Abstracting of Legal Cases The Pontential of Clustering Based on the Selection of Represen- tative Objects.,

17.

S. H. and D. H. Jang. 1999. “Development and Evaluation of a Statistically-based Document Summari- zation System quoted in I. Mani and M.T. Maybury . 1999. Ad- vanced in Automatic Text Summari- zation. Cambridge the MIT Press.,

18.

C. D. 1990. “Constructing Literature Abstract by Computer,

19.

J. E. et al. 1971. “Automatic Abst- racting and Indexing. II. Production of Indicative Abstracts by Appli- cation of Contextual Inference and Syntactic Coherence Criteria.,

20.

S. and M. Moens. 1999. “Argu- mentive classification of extracted sentences as a first step towards flexible abstracting quoted in I. Mani and M.T. Maybury . 1999. Advanced in Automatic Text Sum- marization. Cambridge the MIT Press,

21.

. In W. Burchart and K. Hulker Walter de Gruyter. quoted in I. Mani. 2001. Automatic Summari- zation. Amsterdam, John Benjamins Publishing Company

22.

(2004) 질의응답을 위한 복수문서 요약에 관한 실헙적 연구,

23.

최인숙, (2000) 술어기반 문형정보를 이용한 자동요약시스템에 관한 연구,

한국문헌정보학회지