A Real-time Cancer-Covid Gene-Set Based Biomedical Document Classification and Ranking Framework for Large Databases

International Journal of Electronics and Communication Engineering
© 2023 by SSRG - IJECE Journal
Volume 10 Issue 9
Year of Publication : 2023
Authors : Jose Mary Golamari, D. Haritha
pdf
How to Cite?

Jose Mary Golamari, D. Haritha, "A Real-time Cancer-Covid Gene-Set Based Biomedical Document Classification and Ranking Framework for Large Databases," SSRG International Journal of Electronics and Communication Engineering, vol. 10,  no. 9, pp. 72-80, 2023. Crossref, https://doi.org/10.14445/23488549/IJECE-V10I9P108

Abstract:

Identifying and ranking gene and disease patterns are essential for analyzing and ranking biomedical documents in current biomedical repositories. However, the presence of noise, uncertainty, and missing values in most biomedical databases, coupled with their diverse features and varying levels of gene and disease patterns, makes identifying and ranking high-dimensional patterns across different repositories a complex and challenging task. Data classification algorithms rely on MeSH terms or user-specific keywords to classify documents in conventional biomedical repositories. Nevertheless, these algorithms use static methods to establish relationships among gene sets, which may need to be revised for accurate analysis and ranking of biomedical documents. Locating cancer and COVID genes associated with diseases and their patterns in biomedical repositories is a difficult task. A novel Cancer-Covid gene/disease document classification and ranking approach has been suggested, employing a cross-gene model with machine learning techniques. The proposed method employs an optimized Glove feature extraction technique and an advanced classification model to identify significant features from biomedical documents. Experimental results indicate that this feature extraction method is more effective than other existing techniques in predicting gene-disease relationships in various biomedical documents. 

Keywords:

Cross-domain analysis, Cancer genesets, Covid gene sets. 

References:

[1] Xin Shao et al., “A Clinical Genomics-Guided Prioritizing Strategy Enables Selecting Proper Cancer Cell Lines for Biomedical Research,” iScience, vol. 23, no. 11, 2020. 
[CrossRef] [Google Scholar] [Publisher Link]
[2] Yongjing Lin et al., “A Document Clustering and Ranking System for Exploring MEDLINE Citations,” Journal of the American Medical Informatics Association, vol. 14, no. 5, pp. 651–661, 2007. 
[CrossRef] [Google Scholar] [Publisher Link]
[3] Thulasi Bikku, and Radhika Paturi, “A Novel Somatic Cancer Gene-Based Biomedical Document Feature Ranking and Clustering Model,” Informatics in Medicine Unlocked, vol. 16, 2019. 
[CrossRef] [Google Scholar] [Publisher Link]
[4] Gaelen P. Adam et al., “A Novel Tool that Allows Interactive Screening of Pubmed Citations Showed Promise for the Semi-Automation of Identification of Biomedical Literature,” Journal of Clinical Epidemiology, vol. 150, pp. 63–71, 2022. 
[CrossRef] [Google Scholar] [Publisher Link]
[5] Mercedes García Carrillo et al., “Academic Dependency: The Influence of the Prevailing International Biomedical Research Agenda on Argentina’s CONICET,” Heliyon, vol. 8, no. 11, 2022. 
[CrossRef] [Google Scholar] [Publisher Link]
[6] P. Dhanalakshmi, K. Ramani, and B. Eswara Reddy, “An Improved Rank Based Disease Prediction Using Web Navigation Patterns on Bio-Medical Databases,” Future Computing and Informatics Journal, vol. 2, no. 2, pp. 133–147, 2017. 
[CrossRef] [Google Scholar] [Publisher Link]
[7] Saeid Balaneshinkordan, and Alexander Kotov, “Bayesian Approach to Incorporating Different Types of Biomedical Knowledge Bases into Information Retrieval Systems for Clinical Decision Support in Precision Medicine,” Journal of Biomedical Informatics, vol. 98, 2019. 
[CrossRef] [Google Scholar] [Publisher Link]
[8] Lena Maier-Hein et al., “BIAS: Transparent Reporting of Biomedical Image Analysis Challenges,” Medical Image Analysis, vol. 66, 2020. 
[CrossRef] [Google Scholar] [Publisher Link]
[9] Fei Zhu et al., “Biomedical Text Mining and its Applications in Cancer Research,” Journal of Biomedical Informatics, vol. 46, no. 2, pp. 200–211, 2013. 
[CrossRef] [Google Scholar] [Publisher Link]
[10] Chloé Cabot, Stéfan Darmoni, and Lina F. Soualmia, “Cimind: A Phonetic-Based Tool for Multilingual Named Entity Recognition in Biomedical Texts,” Journal of Biomedical Informatics, vol. 94, 2019. 
[CrossRef] [Google Scholar] [Publisher Link]
[11] Maciej Rybinski, Jerry Xu, and Sarvnaz Karimi, “Clinical Trial Search: Using Biomedical Language Understanding Models for Re-Ranking,” Journal of Biomedical Informatics, vol. 109, 2020. 
[CrossRef] [Google Scholar] [Publisher Link]
[12] Muhammad Abulaish, Md. Aslam Parwez, and Jahiruddin, “DiseaSE: A Biomedical Text Analytics System for Disease Symptom Extraction and Characterization,” Journal of Biomedical Informatics, vol. 100, 2019. 
[CrossRef] [Google Scholar] [Publisher Link]
[13] Nicholas C. Ide et al., “Essie: A Concept-Based Search Engine for Structured Biomedical Text,” Journal of the American Medical Informatics Association, vol. 14, no. 3, pp. 253–263, 2007. 
[CrossRef] [Google Scholar] [Publisher Link]
[14] Muhammad Ali Ibrahim et al., “GHS-NET a Generic Hybridized Shallow Neural Network for Multi-Label Biomedical Text Classification,” Journal of Biomedical Informatics, vol. 116, 2021. 
[CrossRef] [Google Scholar] [Publisher Link]   
[15] Jiho Noh, and Ramakanth Kavuluru, “Improved Biomedical word Embeddings in the Transformer Era,” Journal of Biomedical Informatics, vol. 120, 2021. 
[CrossRef] [Google Scholar] [Publisher Link]
[16] Tuan Manh Lai, ChengXiang Zhai, and Heng Ji, “KEBLM: Knowledge-Enhanced Biomedical Language Models,” Journal of Biomedical Informatics, vol. 143, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[17] Haixia Shang, and Zhi-Ping Liu, “Network-Based Prioritization of Cancer Biomarkers by Phenotype-Driven Module Detection and Ranking,” Computational and Structural Biotechnology Journal, vol. 20, pp. 206–217, 2022. 
[CrossRef] [Google Scholar] [Publisher Link]
[18] Satya S. Sahoo et al., “ProvCaRe: Characterizing Scientific Reproducibility of Biomedical Research Studies Using Semantic Provenance Metadata,” International Journal of Medical Informatics, vol. 121, pp. 10–18, 2019. 
[CrossRef] [Google Scholar] [Publisher Link]
[19] Sarvnaz Karimi, Justin Zobel, and Falk Scholer, “Quantifying the Impact of Concept Recognition on Biomedical Information Retrieval,” Information Processing & Management, vol. 48, no. 1, pp. 94–106, 2012. 
[CrossRef] [Google Scholar] [Publisher Link]
[20] Zan-Xia Jin et al., “Ranking via Partial Ordering for Answer Selection,” Information Sciences, vol. 538, pp. 358–371, 2020. 
[CrossRef] [Google Scholar] [Publisher Link]
[21] Grace Wang et al., “Representation of Women in Diagnostic Radiology Residency Programs: Does National Institutes of Health Program Ranking Matter?,” Journal of the American College of Radiology, vol. 18, no. 1, pp. 185–191, 2021. 
[CrossRef] [Google Scholar] [Publisher Link]
[22] Shuang Zhu et al., “Research Trends in Biomedical Applications of Two-Dimensional Nanomaterials over the Last Decade – A Bibliometric Analysis,” Advanced Drug Delivery Reviews, vol. 188, 2022. 
[CrossRef] [Google Scholar] [Publisher Link]