Title: Information retrieval methodology for aiding scientific database search
Authors: Marcos-Pablos, S.
García-Peñalvo, F. J.
Keywords: Information retrieval
Systematic literature review
Text mining
Vector Space Model
Support Vector Machine
Issue Date: 1-Jan-2019
Publisher: Springer
Citation: Marcos-Pablos, S., & García-Peñalvo, F. J. (2019). Information retrieval methodology for aiding scientific database search. Soft Computing, doi:10.1007/s00500-018-3568-0
Abstract: During literature reviews, and especially when conducting systematic literature reviews (SLRs), finding and screening relevant papers during scientific document search may involve managing and processing large amounts of unstructured text data. In those cases where the search topic is difficult to establish or has fuzzy limits, researchers need to broaden the scope of the search and, in consequence, data from retrieved scientific publications may become huge and uncorrelated. However, through a convenient analysis of these data the researcher may be able to discover new knowledge which may be hidden within the search output, thus exploring the limits of the search and enhancing the review scope. With that aim, this paper presents an iterative methodology that applies text mining and machine learning techniques to a downloaded corpus of abstracts from scientific databases, combining automatic processing algorithms with tools for supervised decision making in an iterative process sustained on the researchers’ judgement, so as to adapt, screen and tune the search output. The paper ends by showing a working example that employs a set of developed scripts that implement the different stages of the proposed methodology.
ISSN: 1432-7643
Appears in Collections: Publications

Files in This Item:
File: SoftComputing_DecissionSuportTools_postprint.pdf
Description: Article
Size: 3,25 MB
Format: Adobe PDF
