Swarm Intelligence-based Hierarchical Clustering for Identification of ncRNA using Covariance Search Model

Lustiana Pratiwi, Yun Huoy Choo, Azah Kamilah Muda, Satrya Fajri Pratama

Research output: Contribution to journalArticlepeer-review

Abstract

Covariance Model (CM) has been quite effective in finding potential members of existing families of non-coding Ribonucleic Acid (ncRNA) identification and has provided excellent accuracy in genome sequence database. However, it has significant drawbacks with family-specific search. An existing Hierarchical Agglomerative Clustering (HAC) technique merged overlapping sequences which is known as combined CM (CCM). However, the structural information will be discarded, and the sequence features of each family will be significantly diluted as the number of original structures increases. Additionally, it can only find members of the existing families and is not useful in finding potential members of novel ncRNA families. Furthermore, it is also important to construct generic sequence models which can be used to recognize new potential members of novel ncRNA families and define unknown ncRNA sequence as the potential members for known families. To achieve these objectives, this study proposes to implement Particle Swarm Optimization (PSO) and Genetic Algorithm (GA) to ensure the CCMs have the best quality for every level of dendrogram hierarchy. This study will also apply distance matrix as the criteria to measure the compatibility between two CMs. The proposed techniques will be using five gene families with fifty sequences from each family from Rfam database which will be divided into training and testing dataset to test CMs combination method. The proposed techniques will be compared to the existing HAC in terms of identification accuracy, sum of bit-scores, and processing time, where each of these performance measurements will be statistically validated.

Original languageEnglish
Pages (from-to)822-831
Number of pages10
JournalInternational Journal of Advanced Computer Science and Applications
Volume13
Issue number11
DOIs
Publication statusPublished - 2022

Keywords

  • Covariance model
  • Hierarchical clustering
  • Ncrna identification
  • Swarm intelligence

Fingerprint

Dive into the research topics of 'Swarm Intelligence-based Hierarchical Clustering for Identification of ncRNA using Covariance Search Model'. Together they form a unique fingerprint.

Cite this