Information retrieval methodology for aiding scientific database search
Date
2019-01-01
Authors
Marcos-Pablos, S.
García-Peñalvo, F. J.
Journal Title
Journal ISSN
Volume Title
Publisher
Springer
Abstract
During literature reviews, and specially when conducting systematic literature reviews (SLRs), finding and screening relevant papers during scientific document search may involve managing and processing large amounts of unstruc-tured text data. In those cases where the search topic is difficult to establish or has fuzzy limits, researchers require to broaden the scope of the search and, in conse-quence, data from retrieved scientific publications may become huge and uncorre-lated. However, through a convenient analysis of these data the researcher may be able to discover new knowledge which may be hidden within the search output, thus exploring the limits of the search and enhancing the review scope. With that aim, this paper presents an iterative methodology that applies text mining and machine learning techniques to a downloaded corpus of abstracts from scientific databases, combining automatic processing algorithms with tools for supervised decision making in an iterative process sustained on the researchers’ judgement, so as to adapt, screen and tune the search output. The paper ends showing a work-ing example that employs a set of developed scripts that implement the different stages of the proposed methodology
Description
Keywords
Information retrieval, Systematic literature review, Text mining, Vector Space Model, Support Vector Machine
Citation
Marcos-Pablos, S., & García-Peñalvo, F. J. (2019). Information retrieval methodology for aiding scientific database search. Soft Computing, doi:10.1007/s00500-018-3568-0