ABSTRACT
With the development of deep DNA sequencing techniques, the cost for detecting mutations in the human genome falls significantly. Numerous non-synonymous single nucleotide polymorphisms (nsSNPs) have been identified and many of them are associated with human disease. One of the long-standing challenges is to understand how nsSNPs change protein structure and further affect their function. While it is impractical to solve all the mutated protein structures experimentally, it is quite feasible to model the mutated structures in silico. Toward this goal, we are building a publicly available structure database (SNP2Structure) to facilitate our research endeavors. Compared with the existing web portals with a similar aim, ours has three major advantages. First, we corrected the existing sequence mapping discrepancies presented in others. Although the percentage of erroneously mapped structures is small, it is critical to correct such errors. Second, our portal offers comparison of two structures simultaneously. Third, the mutated structures are available to download locally for further investigations. We believe SNP2Structure will be a valuable tool to the research community to understand the functional impact of disease-causing nsSNPs.
- Collins, F. S., Brooks, L. D. et al. (1998). A DNA polymorphism discovery resource for research on human genetic variation. Genome Res 8(12): 1229--1231.Google ScholarCross Ref
- Ng, P. C. and Henikoff, S. (2006). Predicting the effects of amino acid substitutions on protein function. Annu Rev Genomics Hum Genet 7: 61--80.Google ScholarCross Ref
- Wang, Z. and Moult, J. (2001). SNPs, protein structure, and disease. Hum Mutat 17(4): 263--270.Google ScholarCross Ref
- Chasman, D. and Adams, R. M. (2001). Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: structure-based assessment of amino acid variation. J Mol Biol 307(2): 683--706.Google ScholarCross Ref
- Saunders, C. T. and Baker, D. (2002). Evaluation of structural and evolutionary contributions to deleterious mutation prediction. J Mol Biol 322(4): 891--901.Google ScholarCross Ref
- Ramensky, V., Bork, P. et al. (2002). Human non-synonymous SNPs: server and survey. Nucleic Acids Res 30(17): 3894--3900.Google ScholarCross Ref
- Ng, P. C. and Henikoff, S. (2003). SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res 31(13): 3812--3814.Google ScholarCross Ref
- Ferrer-Costa, C., Gelpi, J. L. et al. (2005). PMUT: a web-based tool for the annotation of pathological mutations on proteins. Bioinformatics 21(14): 3176--3178. Google ScholarDigital Library
- Reva, B., Antipin, Y. et al. (2007). Determinants of protein function revealed by combinatorial entropy optimization. Genome Biol 8(11): R232.Google ScholarCross Ref
- Bromberg, Y., Yachdav, G. et al. (2008). SNAP predicts effect of mutations on protein function. Bioinformatics 24(20): 2397--2398. Google ScholarDigital Library
- Ryan, M., Diekhans, M. et al. (2009). LS-SNP/PDB: annotated non-synonymous SNPs mapped to Protein Data Bank structures. Bioinformatics 25(11): 1431--1432. Google ScholarDigital Library
- Schwarz, J. M., Rodelsperger, C. et al. (2010). MutationTaster evaluates disease-causing potential of sequence alterations. Nat Methods 7(8): 575--576.Google ScholarCross Ref
- Adzhubei, I. A., Schmidt, S. et al. (2010). A method and server for predicting damaging missense mutations. Nat Methods 7(4): 248--249.Google ScholarCross Ref
- Sherry, S. T., Ward, M. H. et al. (2001). dbSNP: the NCBI database of genetic variation. Nucleic Acids Res 29(1): 308--311.Google ScholarCross Ref
- Berman, H. M., Westbrook, J. et al. (2000). The Protein Data Bank. Nucleic Acids Res 28(1): 235--242.Google ScholarCross Ref
- Wu, C. H., Apweiler, R. et al. (2006). The Universal Protein Resource (UniProt): an expanding universe of protein information. Nucleic Acids Res 34(Database issue): D187--191.Google Scholar
- Pruitt, K. D., Tatusova, T. et al. (2007). NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res 35(Database issue): D61--65.Google Scholar
- Sali, A. and Blundell, T. L. (1993). Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol 234(3): 779--815.Google ScholarCross Ref
Index Terms
- SNP2Structure: A public database for mapping and modeling nsSNPs on human protein structures
Recommendations
In silico investigation on the inhibitory effect of fungal secondary metabolites on RNA dependent RNA polymerase of SARS-CoV-II: A docking and molecular dynamic simulation study
AbstractThe newly emerged Coronavirus Disease 2019 (COVID-19) rapidly outspread worldwide and now is one of the biggest infectious pandemics in human society. In this study, the inhibitory potential of 99 secondary metabolites obtained from ...
Highlights- Fungal secondary metabolites are tested against coronavirus RdRp computationally.
VARUN: Discovering Extensible Motifs under Saturation Constraints
The discovery of motifs in biosequences is frequently torn between the rigidity of the model on one hand and the abundance of candidates on the other hand. In particular, motifs that include wild cards or “don't cares” escalate exponentially with their ...
An improved sequence-based prediction protocol for protein-protein interactions using amino acids substitution matrix and rotation forest ensemble classifiers
Protein-protein Interactions (PPIs) play important roles in a wide variety of cellular processes, including metabolic cycles, DNA transcription and replication, and signaling cascades. High-throughput biological experiments for identifying PPIs are ...
Comments