Summarization of Text-based Documents with a Determination of Latent Topical Sections and Information-Rich Sentences

Cited by: 9
Authors
Alguliev, R. M. [1]
Alyguliev, R. M. [1]
Affiliation
[1] Natl Acad Sci Azerbaijan, Inst Informat Technol, Ul Agaev 9, AZ-1141 Baku, Azerbaijan
Keywords
summarization; clustering; optimal number of clusters; information-rich sentence; neural networks
DOI
10.3103/S0146411607030030
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
A method is proposed for summarizing text-based documents. The method discovers latent topical sections and information-rich sentences. Its basis, the clustering of sentences, is formulated mathematically as a quadratic integer programming problem. An algorithm is developed that determines the optimal number of clusters to a specified precision. The synthesis of a neural network for solving the integer quadratic programming problem is also described.
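The abstract does not give the formulation itself, so the following is only a minimal sketch of what clustering sentences as a quadratic 0-1 program can look like. Everything in it is an assumption made for illustration: the binary bag-of-words cosine similarity, the objective (total within-cluster dissimilarity), the function names `cosine` and `cluster_sentences`, and the sample sentences. The paper solves the problem with a neural network and also determines the number of clusters; this sketch instead takes the number of clusters `k` as given and enumerates all assignments, which is feasible only for a handful of sentences.

```python
from itertools import product
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two word sets (binary term weights)."""
    if not a or not b:
        return 0.0
    return len(a & b) / sqrt(len(a) * len(b))


def cluster_sentences(sentences, k):
    """Exhaustively solve a tiny quadratic 0-1 clustering problem.

    Illustrative objective (an assumption, not taken from the paper):
        minimize  sum_c sum_{i<j} d[i][j] * x[i][c] * x[j][c]
    where x[i][c] = 1 if sentence i belongs to cluster c. Each sentence is
    in exactly one cluster (encoded by giving it a single label) and no
    cluster may be empty.
    """
    bags = [set(s.lower().split()) for s in sentences]
    n = len(bags)
    # Pairwise dissimilarity matrix d[i][j] = 1 - cosine similarity.
    d = [[1.0 - cosine(bags[i], bags[j]) for j in range(n)] for i in range(n)]

    best_cost, best_labels = float("inf"), None
    for labels in product(range(k), repeat=n):
        if len(set(labels)) < k:
            continue  # skip assignments that leave a cluster empty
        # The product x[i][c] * x[j][c] is 1 exactly when labels match.
        cost = sum(
            d[i][j]
            for i in range(n)
            for j in range(i + 1, n)
            if labels[i] == labels[j]
        )
        if cost < best_cost:
            best_cost, best_labels = cost, labels
    return best_labels, best_cost


if __name__ == "__main__":
    # Hypothetical sentences covering two topics.
    demo = [
        "cats chase mice",
        "mice fear cats",
        "stocks fell today",
        "stocks rose today",
    ]
    print(cluster_sentences(demo, k=2))
```

The search space grows as k to the power n, which is one reason a dedicated solver of the kind the abstract describes is needed for real documents.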
Pages: 132-140
Number of pages: 9