Debashree Bandyopadhyay - Authorea

Debashree Bandyopadhyay

Assistant Professor

Hyderabad

Public Documents 4

Consortium of “consistent amino acid substitutions” on Influenza A (H1N1) viral prote...

Syeda Lubna

and 1 more

August 31, 2023

Influenza-A-virus (IAV) is one of the common threats to humankind since 1918. The viral proteome is frequently substituted leading to new strains and recurrent pandemics. Despite knowing the effects of single amino acid substitutions on individual viral proteins, effects of collective substitutions on viral infection remain elusive. Here, we addressed the question whether the “consistent amino substitutions” occur in consortium on functional domains and protein-protein interaction (PPI) sites, impacting overall viral infection and host immune responses. By definition, “consistent substitutions” occur on “all” the IAV strains isolated in a particular year. Big protein data (563370 sequences and 9824 PPI) and bioinformatics techniques were exploited to address this question. Total one-hundred-and-five “consistent substitutions” were mapped on IAV proteome. Fifty of those emerged on viral functional domains and PPIs, engaged in the specific stages of IAV infection, namely, i) cell surface entry and exit, ii) nuclear import, vRNP assembly and nuclear export and iii) antagonizing immune responses. The study for the first time showed that consortium of “consistent substitutions” emerged on protein functional domains and PPIs, impacting specific stages of viral infection, rather than a single protein, and presumably navigate viral escape from human immune response.

Hist-i-fy: Multiple histidine function prediction based on protein sequences using de...

Debashree Bandyopadhyay

and 4 more

June 21, 2023

Histidine (His) is the most reactive amino acid at enzyme active sites. Multiple post-translational modifications (functions) are reported for His side chains. The high-throughput sequencing techniques produce a large number of protein sequences without functional annotations at the amino acid level. Experimental characterization of His functions in proteins is laborious and time-consuming. Computational characterization based on protein sequences may complement the need. There are only a handful of Histidine function prediction tools available and those annotate only a single function. Here we curated a dataset of active Histidine with known functions based on protein sequences obtained from UniProt database (sample size n=1584) and trained against four machine learning methods. The convolution neural network (CNN) model (“ Hist-i-fy”) performed the best with 75% overall accuracy. The external validation of Hist-i-fy on phosphorylated histidine data (sample size 34) showed 94.1% prediction accuracy. For the first time, we report multiple His function prediction, based on protein sequences using deep neural networks. The inputs to the model are i) protein sequence containing His, and ii) the His residue number. The model predicts one out of the eight histidine functions, namely, acetylation, ribosylation, glycosylation, hydroxylation, methylation, oxidation, phosphorylation, and protein splicing. The novelty of the work is, it predicts maximum number of histidine functions at a time with optimal performance. There is a scope of improvement in the model upon availability of a larger dataset. The model is available as a web application ([https://histify.streamlit.app/](https://histify.streamlit.app/)) and a stand-alone code [https://github.com/dibyansu24-maker/Histify](https://github.com/dibyansu24-maker/Histify)).

Recent mutations in NS1 protein of H1N1 influenza virus isolated from India during 20...

Lubna Syeda

and 5 more

March 30, 2022

The routine influenza (H1N1) surveillance in India started almost a decade ago. The fluctuation in the number of deaths and cases in different Indian states over the last decade presumably indicated the possible changes in the viral sequence and in the immune response of the host. To track these changes, we have chosen NS1 protein that invades host antiviral immune response. Objective of this study was to identify the recent mutations on NS1 protein from Indian isolates. The sequences of NS1 proteins from H1N1 strains isolated in India over a decade were obtained from publicly available databases. Multiple sequence alignment, phylogeny and surface hydrophilicity analyses were performed to confirm the consistent mutations on NS1 protein, evolved chronologically in India. Total eight mutations were identified, two in RNA-binding domain (RBD), five in effector domain (ED) and one in the linker region. Three mutations were reported first time in this study at the sequence positions, 2, 80 and 155; those evolved either in 2017 or in 2019. These recent mutations were associated with conservative substitutions in the alternative domains of NS1 protein, namely, i) D2E and E55D, ii) T80A and A155T and iii) E55K and K131E. A gradual shift of NS1 antigenic regions (surface hydrophilicity) was observed from ED to RBD domains along the time line. The possible consequences of these mutations on host-pathogen interactions were hypothesized based on the sequence positions of NS1 mutations belonging to various cellular-binding sub-domains. The hypothesis is subject to further experimental and computational verification.

DeepCys: structure-based multiple cysteine function prediction method trained on deep...

Vamsi Mohan Nallapareddy

and 4 more

October 16, 2020

Cysteine (Cys) is the most reactive amino acid participating in a wide range of biological functions. In-silico predictions complement the experiments to meet the need of functional characterization. Multiple Cys function prediction algorithm is scarce, in contrast to specific function prediction algorithms. Here we present a deep neural network-based multiple Cys function prediction, available on web-server (DeepCys) (https://deepcys.herokuapp.com/). DeepCys model was trained and tested on two independent datasets curated from protein crystal structures. This prediction method requires three inputs, namely, PDB identifier (ID), chain ID and residue ID for a given Cys and outputs the probabilities of four cysteine functions, namely, disulphide, metal-binding, thioether and sulphenylation and predicts the most probable Cys function. The algorithm exploits the local and global protein properties, like, sequence and secondary structure motifs, buried fractions, microenvironments and protein/enzyme class. DeepCys outperformed most of the multiple and specific Cys function algorithms. This method can predict maximum number of cysteine functions. Moreover, for the first time, explicitly predicts thioether function. This tool was used to elucidate the cysteine functions on domains of unknown functions (DUFs) belonging to cytochrome C oxidase subunit-II (COX2) like transmembrane domains. Apart from the web-server, a standalone program is also available on GitHub (https://github.com/vam-sin/deepcys)