loading page

Investigation of protein sequence similarity based on physio-chemical properties of amino acids
  • +2
  • Ranjeet Rout,
  • Hari Pandey,
  • Sanchit Sindhwani,
  • Saiyed Umer,
  • Smitarani Pati
Ranjeet Rout
National Institute of Technology Srinagar
Author Profile
Hari Pandey
Edge Hill University
Author Profile
Sanchit Sindhwani
National Institute of Technology Jalandhar
Author Profile
Saiyed Umer
Aliah University
Author Profile
Smitarani Pati
National Institute of Technology Jalandhar
Author Profile

Abstract

Comparison of protein sequence similarity is a significant study. By virtue of this method, we can expose the evolutionary relationship among protein sequences. So, it is required to design effective computational algorithms that can compare the similarities among the colossal amount of sequences. The aim of this research is to develop efficient tools in the field of protein sequences comparison and phylogenetic study. The proposed method performs a feature generation process based on the physio-chemical properties of amino acids that best describes the revolutionary relationship among the species in a protein family. The protein sequences are transferred into an Eighty dimensional feature vector among the group of amino acids. Finally, four different datasets were used to validate the accuracy of the proposal and a correlation coefficient of 0.94417 of ND5 dataset using ClustalW has been found. This is much higher than some of the methods. At last the result explains the effectiveness in the similarity analysis among genome sequences.