A statistical approach for modeling inter-document semantic relationships in digital libraries

Cited by: 6
Authors
Muralikumar, Jeyavaishnavi [1]
Seelan, Sri Ananda [1]
Vijayakumar, Narendranath [1]
Balasubramanian, Vidhya [1]
Affiliation
[1] Amrita Univ, Amrita Sch Engn, Dept Comp Sci & Engn, Amrita Vishwa Vidyapeetham, Coimbatore, Tamil Nadu, India
Keywords
Relatedness; Information retrieval; Digital libraries; Statistical modeling
DOI
10.1007/s10844-016-0423-6
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
E-learning repositories and digital libraries are fast becoming important sources of information and learning material. Such systems must therefore provide services that support the learning needs of their users. When a retrieval system shows how its documents relate to each other semantically, users are free to choose among different materials and to direct their study in a focused manner. This calls for a model that identifies types of document relationships that address different aspects of learning. This article defines three such types and a unique statistical model that can automatically identify them in technical and scientific documents. The model defines measures that quantify the degree of relatedness based on distinct statistical patterns exhibited by the terms common to a pair of documents. This approach does not strictly require a knowledge base or hypertext to identify the characteristic relationship between two documents. Such a statistical model can be extended to build further relatedness types and can be used alongside other techniques in digital library recommendation engines. Our experiments over a large number of technical documents show that our techniques effectively extract the different types of relationships between documents.
Pages: 477-498
Number of pages: 22