1,814 results match your criteria Proteins: Structure Function and Bioinformatics[Journal]


MultiDomainBenchmark: a multi-domain query and subject database suite.

BMC Bioinformatics 2019 Feb 14;20(1):77. Epub 2019 Feb 14.

National Center for Biotechnology Information, Bethesda, National Institutes of Health, 8600 Rockville Pike, Bethesda, 20894, MD, USA.

Background: Genetic sequence database retrieval benchmarks play an essential role in evaluating the performance of sequence searching tools. To date, all phylogenetically diverse benchmarks known to the authors include only query sequences with single protein domains. Domains are the primary building blocks of protein structure and function. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-019-2660-5DOI Listing
February 2019

Identification and analysis of structurally critical fragments in HopS2.

BMC Bioinformatics 2019 Feb 4;19(Suppl 13):552. Epub 2019 Feb 4.

Department of Molecular Biology and Biotechnology, Tezpur University, Tezpur, Assam, 784028, India.

Background: Among the diverse roles of the Type III secretion-system (T3SS), one of the notable functions is that it serves as unique nano machineries in gram-negative bacteria that facilitate the translocation of effector proteins from bacteria into their host. These effector proteins serve as potential targets to control the pathogenicity conferred to the bacteria. Despite being ideal choices to disrupt bacterial systems, it has been quite an ordeal in the recent times to experimentally reveal and establish a concrete sequence-structure-function relationship for these effector proteins. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2551-1DOI Listing
February 2019

Disentangling the complexity of low complexity proteins.

Brief Bioinform 2019 Jan 30. Epub 2019 Jan 30.

Institute of Organismic and Molecular Evolution, Johannes Gutenberg University of Mainz, Mainz, Germany.

There are multiple definitions for low complexity regions (LCRs) in protein sequences, with all of them broadly considering LCRs as regions with fewer amino acid types compared to an average composition. Following this view, LCRs can also be defined as regions showing composition bias. In this critical review, we focus on the definition of sequence complexity of LCRs and their connection with structure. Read More

View Article

Download full-text PDF

Source
https://academic.oup.com/bib/advance-article/doi/10.1093/bib
Publisher Site
http://dx.doi.org/10.1093/bib/bbz007DOI Listing
January 2019
3 Reads
9.617 Impact Factor

A comprehensive review and comparison of existing computational methods for intrinsically disordered protein and region prediction.

Brief Bioinform 2019 01;20(1):330-346

School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, China.

Intrinsically disordered proteins and regions are widely distributed in proteins, which are associated with many biological processes and diseases. Accurate prediction of intrinsically disordered proteins and regions is critical for both basic research (such as protein structure and function prediction) and practical applications (such as drug development). During the past decades, many computational approaches have been proposed, which have greatly facilitated the development of this important field. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bib/bbx126DOI Listing
January 2019
9 Reads

KORP: Knowledge-based 6D potential for fast protein and loop modeling.

Bioinformatics 2019 Jan 14. Epub 2019 Jan 14.

Department of Biological Chemical Physics, Rocasolano Institute of Physical Chemistry C.S.I.C., Serrano 119, Madrid, Spain.

Motivation: Knowledge-based statistical potentials constitute a simpler and easier alternative to physics-based potentials in many applications, including folding, docking, and protein modeling. Here, to improve the effectiveness of the current approximations, we attempt to capture the 6-dimensional (6D) nature of residue-residue interactions from known protein structures using a simple backbone-based representation.

Results: We have developed KORP, a knowledge-based pairwise potential for proteins that depends on the relative position and orientation between residues. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btz026DOI Listing
January 2019
2 Reads

Optimising orbit counting of arbitrary order by equation selection.

BMC Bioinformatics 2019 Jan 15;20(1):27. Epub 2019 Jan 15.

Ghent University - imec, IDLab, Technologiepark 15, Ghent, 9052, Belgium.

Background: Graphlets are useful for bioinformatics network analysis. Based on the structure of Hočevar and Demšar's ORCA algorithm, we have created an orbit counting algorithm, named Jesse. This algorithm, like ORCA, uses equations to count the orbits, but unlike ORCA it can count graphlets of any order. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2483-9DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6334470PMC
January 2019
1 Read

CoExpresso: assess the quantitative behavior of protein complexes in human cells.

BMC Bioinformatics 2019 Jan 9;20(1):17. Epub 2019 Jan 9.

Department of Biochemistry and Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, Odense M, 5230, Denmark.

Background: Translational and post-translational control mechanisms in the cell result in widely observable differences between measured gene transcription and protein abundances. Herein, protein complexes are among the most tightly controlled entities by selective degradation of their individual proteins. They furthermore act as control hubs that regulate highly important processes in the cell and exhibit a high functional diversity due to their ability to change their composition and their structure. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2573-8DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6327379PMC
January 2019
2 Reads

A model to predict the function of hypothetical proteins through a nine-point classification scoring schema.

BMC Bioinformatics 2019 Jan 8;20(1):14. Epub 2019 Jan 8.

Bioclues.org, Kukatpally, Hyderabad, 500072, India.

Background: Hypothetical proteins [HP] are those that are predicted to be expressed in an organism, but no evidence of their existence is known. In the recent past, annotation and curation efforts have helped overcome the challenge in understanding their diverse functions. Techniques to decipher sequence-structure-function relationship, especially in terms of functional modelling of the HPs have been developed by researchers, but using the features as classifiers for HPs has not been attempted. Read More

View Article

Download full-text PDF

Source
https://bmcbioinformatics.biomedcentral.com/articles/10.1186
Publisher Site
http://dx.doi.org/10.1186/s12859-018-2554-yDOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6325861PMC
January 2019
7 Reads
2.576 Impact Factor

A nearest-neighbors network model for sequence data reveals new insight into genotype distribution of a pathogen.

BMC Bioinformatics 2018 Dec 12;19(1):475. Epub 2018 Dec 12.

School of Electrical Engineering and Computer Science, Washington State University, Pullman, WA, USA.

Background: Sequence similarity networks are useful for classifying and characterizing biologically important proteins. Threshold-based approaches to similarity network construction using exact distance measures are prohibitively slow to compute and rely on the difficult task of selecting an appropriate threshold, while similarity networks based on approximate distance calculations compromise useful structural information.

Results: We present an alternative network representation for a set of sequence data that overcomes these drawbacks. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2453-2DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6291930PMC
December 2018
2 Reads

Improving Prediction of Protein Secondary Structure, Backbone Angles, Solvent Accessibility, and Contact Numbers by Using Predicted Contact Maps and an Ensemble of Recurrent and Residual Convolutional Neural Networks.

Bioinformatics 2018 Dec 7. Epub 2018 Dec 7.

School of Information and Communication Technology, Griffith University, Gold Coast, Australia.

Motivation: Sequence-based prediction of one dimensional structural properties of proteins has been a long-standing subproblem of protein structure prediction. Recently, prediction accuracy has been significantly improved due to the rapid expansion of protein sequence and structure libraries and advances in deep learning techniques, such as residual convolutional networks (ResNets) and Long-Short-Term Memory Cells in Bidirectional Recurrent Neural Networks (LSTM-BRNNs). Here we leverage an ensemble of LSTM-BRNN and ResNet models, together with predicted residue-residue contact maps, to continue the push towards the attainable limit of prediction for 3- and 8-state secondary structure, backbone angles (θ, τ, ϕ, and ψ), half-sphere exposure, contact numbers, and solvent accessible surface area (ASA). Read More

View Article

Download full-text PDF

Source
https://academic.oup.com/bioinformatics/advance-article/doi/
Publisher Site
http://dx.doi.org/10.1093/bioinformatics/bty1006DOI Listing
December 2018
2 Reads

Representativeness of variation benchmark datasets.

BMC Bioinformatics 2018 Nov 29;19(1):461. Epub 2018 Nov 29.

Protein Structure and Bioinformatics, Department of Experimental Medical Science, Lund University, BMC B13, SE-221 84, Lund, Sweden.

Background: Benchmark datasets are essential for both method development and performance assessment. These datasets have numerous requirements, representativeness being one. In the case of variant tolerance/pathogenicity prediction, representativeness means that the dataset covers the space of variations and their effects. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2478-6DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6267811PMC
November 2018
2 Reads
2.576 Impact Factor

Distinguishing crystallographic from biological interfaces in protein complexes: role of intermolecular contacts and energetics for classification.

BMC Bioinformatics 2018 Nov 30;19(Suppl 15):438. Epub 2018 Nov 30.

Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, The Netherlands.

Background: Study of macromolecular assemblies is fundamental to understand functions in cells. X-ray crystallography is the most common technique to solve their 3D structure at atomic resolution. In a crystal, however, both biologically-relevant interfaces and non-specific interfaces resulting from crystallographic packing are observed. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2414-9DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6266931PMC
November 2018
3 Reads

A Structural Homology Approach for Computational Protein Design with Flexible Backbone.

Bioinformatics 2018 Nov 29. Epub 2018 Nov 29.

LISBP, Université de Toulouse, CNRS, INRA, INSA, Toulouse, France.

Motivation: Structure-based Computational Protein design (CPD) plays a critical role in advancing the field of protein engineering. Using an all-atom energy function, CPD tries to identify amino acid sequences that fold into a target structure and ultimately perform a desired function. Energy functions remain however imperfect and injecting relevant information from known structures in the design process should lead to improved designs. Read More

View Article

Download full-text PDF

Source
https://academic.oup.com/bioinformatics/advance-article/doi/
Publisher Site
http://dx.doi.org/10.1093/bioinformatics/bty975DOI Listing
November 2018
12 Reads

Computational prediction of inter-species relationships through omics data analysis and machine learning.

BMC Bioinformatics 2018 Nov 20;19(Suppl 14):420. Epub 2018 Nov 20.

School of Business and Engineering Vaud (HEIG-VD), University of Applied Sciences Western Switzerland (HES-SO), Route. de Cheseaux 1, Yverdon-Les-Bains, 1400, Switzerland.

Background: Antibiotic resistance and its rapid dissemination around the world threaten the efficacy of currently-used medical treatments and call for novel, innovative approaches to manage multi-drug resistant infections. Phage therapy, i.e. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2388-7DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245486PMC
November 2018
3 Reads

Automated shape-based clustering of 3D immunoglobulin protein structures in chronic lymphocytic leukemia.

BMC Bioinformatics 2018 Nov 20;19(Suppl 14):414. Epub 2018 Nov 20.

Information Technologies Institute, Centre for Research and Technology Hellas, 6th km Harilaou-Thermi Road, Thessaloniki, Greece.

Background: Although the etiology of chronic lymphocytic leukemia (CLL), the most common type of adult leukemia, is still unclear, strong evidence implicates antigen involvement in disease ontogeny and evolution. Primary and 3D structure analysis has been utilised in order to discover indications of antigenic pressure. The latter has been mostly based on the 3D models of the clonotypic B cell receptor immunoglobulin (BcR IG) amino acid sequences. Read More

View Article

Download full-text PDF

Source
https://bmcbioinformatics.biomedcentral.com/articles/10.1186
Publisher Site
http://dx.doi.org/10.1186/s12859-018-2381-1DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245605PMC
November 2018
8 Reads

The predictive performance of short-linear motif features in the prediction of calmodulin-binding proteins.

BMC Bioinformatics 2018 Nov 20;19(Suppl 14):410. Epub 2018 Nov 20.

School of Computer Science, University of Windsor, Windsor, Ontario, Canada.

Background: The prediction of calmodulin-binding (CaM-binding) proteins plays a very important role in the fields of biology and biochemistry, because the calmodulin protein binds and regulates a multitude of protein targets affecting different cellular processes. Computational methods that can accurately identify CaM-binding proteins and CaM-binding domains would accelerate research in calcium signaling and calmodulin function. Short-linear motifs (SLiMs), on the other hand, have been effectively used as features for analyzing protein-protein interactions, though their properties have not been utilized in the prediction of CaM-binding proteins. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2378-9DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245490PMC
November 2018
7 Reads

Computational discovery of direct associations between GO terms and protein domains.

BMC Bioinformatics 2018 Nov 20;19(Suppl 14):413. Epub 2018 Nov 20.

Université de Lorraine, CNRS, Inria, LORIA, Nancy, F-54500, France.

Background: Families of related proteins and their different functions may be described systematically using common classifications and ontologies such as Pfam and GO (Gene Ontology), for example. However, many proteins consist of multiple domains, and each domain, or some combination of domains, can be responsible for a particular molecular function. Therefore, identifying which domains should be associated with a specific function is a non-trivial task. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2380-2DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245584PMC
November 2018
8 Reads

Learning protein binding affinity using privileged information.

BMC Bioinformatics 2018 Nov 15;19(1):425. Epub 2018 Nov 15.

Biomedical Informatics Research Laboratory (BIRL), Department of Computer and Information Sciences (DCIS), Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, ISL, 45650, Pakistan.

Background: Determining protein-protein interactions and their binding affinity are important in understanding cellular biological processes, discovery and design of novel therapeutics, protein engineering, and mutagenesis studies. Due to the time and effort required in wet lab experiments, computational prediction of binding affinity from sequence or structure is an important area of research. Structure-based methods, though more accurate than sequence-based techniques, are limited in their applicability due to limited availability of protein structure data. Read More

View Article

Download full-text PDF

Source
https://bmcbioinformatics.biomedcentral.com/articles/10.1186
Publisher Site
http://dx.doi.org/10.1186/s12859-018-2448-zDOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6238365PMC
November 2018
10 Reads

Topology independent structural matching discovers novel templates for protein interfaces.

Bioinformatics 2018 Sep;34(17):i787-i794

Division of Bioinformatics, Department of Physics, Chemistry and Biology, Linköping University, Linköping SE, Sweden.

Motivation: Protein-protein interactions (PPI) are essential for the function of the cellular machinery. The rapid growth of protein-protein complexes with known 3D structures offers a unique opportunity to study PPI to gain crucial insights into protein function and the causes of many diseases. In particular, it would be extremely useful to compare interaction surfaces of monomers, as this would enable the pinpointing of potential interaction surfaces based solely on the monomer structure, without the need to predict the complete complex structure. Read More

View Article

Download full-text PDF

Source
https://academic.oup.com/bioinformatics/article/34/17/i787/5
Publisher Site
http://dx.doi.org/10.1093/bioinformatics/bty587DOI Listing
September 2018
7 Reads

HITS-PR-HHblits: protein remote homology detection by combining PageRank and Hyperlink-Induced Topic Search.

Brief Bioinform 2018 Nov 7. Epub 2018 Nov 7.

Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China.

As one of the most important fundamental problems in protein sequence analysis, protein remote homology detection is critical for both theoretical research (protein structure and function studies) and real world applications (drug design). Although several computational predictors have been proposed, their detection performance is still limited. In this study, we treat protein remote homology detection as a document retrieval task, where the proteins are considered as documents and its aim is to find the highly related documents with the query documents in a database. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bib/bby104DOI Listing
November 2018
4 Reads

Analysis of drug resistance in HIV protease.

BMC Bioinformatics 2018 Oct 22;19(Suppl 11):362. Epub 2018 Oct 22.

Department of Computer Science, 25 Park Place, Atlanta, GA 30303, USA.

Background: Drug resistance in HIV is the major problem limiting effective antiviral therapy. Computational techniques for predicting drug resistance profiles from genomic data can accelerate the appropriate choice of therapy. These techniques can also be used to select protease mutants for experimental studies of resistance and thereby assist in the development of next-generation therapies. Read More

View Article

Download full-text PDF

Source
https://bmcbioinformatics.biomedcentral.com/articles/10.1186
Publisher Site
http://dx.doi.org/10.1186/s12859-018-2331-yDOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6196403PMC
October 2018
9 Reads

The EVcouplings Python framework for coevolutionary sequence analysis.

Bioinformatics 2018 Oct 9. Epub 2018 Oct 9.

Department of Systems Biology, Harvard Medical School, Boston, MA, USA.

Summary: Coevolutionary sequence analysis has become a commonly used technique for de novo prediction of the structure and function of proteins, RNA, and protein complexes. We present the EVcouplings framework, a fully integrated open-source application and Python package for coevolutionary analysis. The framework enables generation of sequence alignments, calculation and evaluation of evolutionary couplings (ECs), and de novo prediction of structure and mutation effects. Read More

View Article

Download full-text PDF

Source
https://academic.oup.com/bioinformatics/advance-article/doi/
Publisher Site
http://dx.doi.org/10.1093/bioinformatics/bty862DOI Listing
October 2018
6 Reads

Fast design of arbitrary length loops in proteins using InteractiveRosetta.

BMC Bioinformatics 2018 Sep 24;19(1):337. Epub 2018 Sep 24.

Department of Biology, Rensselaer Polytechnic Institute, Troy, NY, USA.

Background: With increasing interest in ab initio protein design, there is a desire to be able to fully explore the design space of insertions and deletions. Nature inserts and deletes residues to optimize energy and function, but allowing variable length indels in the context of an interactive protein design session presents challenges with regard to speed and accuracy.

Results: Here we present a new module (INDEL) for InteractiveRosetta which allows the user to specify a range of lengths for a desired indel, and which returns a set of low energy backbones in a matter of seconds. Read More

View Article

Download full-text PDF

Source
https://bmcbioinformatics.biomedcentral.com/articles/10.1186
Publisher Site
http://dx.doi.org/10.1186/s12859-018-2345-5DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6154894PMC
September 2018
5 Reads

ECPred: a tool for the prediction of the enzymatic functions of protein sequences based on the EC nomenclature.

BMC Bioinformatics 2018 Sep 21;19(1):334. Epub 2018 Sep 21.

European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK.

Background: The automated prediction of the enzymatic functions of uncharacterized proteins is a crucial topic in bioinformatics. Although several methods and tools have been proposed to classify enzymes, most of these studies are limited to specific functional classes and levels of the Enzyme Commission (EC) number hierarchy. Besides, most of the previous methods incorporated only a single input feature type, which limits the applicability to the wide functional space. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2368-yDOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6150975PMC
September 2018
4 Reads

Phage spanins: diversity, topological dynamics and gene convergence.

BMC Bioinformatics 2018 Sep 15;19(1):326. Epub 2018 Sep 15.

Center for Phage Technology, Department of Biochemistry and Biophysics, Texas A&M University, 2128 TAMU, College Station, TX, 77843-2128, USA.

Background: Spanins are phage lysis proteins required to disrupt the outer membrane. Phages employ either two-component spanins or unimolecular spanins in this final step of Gram-negative host lysis. Two-component spanins like Rz-Rz1 from phage lambda consist of an integral inner membrane protein: i-spanin, and an outer membrane lipoprotein: o-spanin, that form a complex spanning the periplasm. Read More

View Article

Download full-text PDF

Source
https://bmcbioinformatics.biomedcentral.com/articles/10.1186
Publisher Site
http://dx.doi.org/10.1186/s12859-018-2342-8DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6139136PMC
September 2018
9 Reads

Improving Protein Function Prediction Using Protein Sequence and GO-term Similarities.

Bioinformatics 2018 Aug 29. Epub 2018 Aug 29.

Delft Bioinformatics Lab, Delft University of Technology, Mekelweg 4, 2628CD, Delft, the Netherlands.

Motivation: Most automatic functional annotation methods assign Gene Ontology (GO) terms to proteins based on annotations of highly similar proteins. We advocate that proteins that are less similar are still informative. Also, despite their simplicity and structure, GO terms seem to be hard for computers to learn, in particular the Biological Process ontology, which has the most terms (>29,000). Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty751DOI Listing
August 2018
2 Reads

Predicting overlapping protein complexes based on core-attachment and a local modularity structure.

BMC Bioinformatics 2018 Aug 22;19(1):305. Epub 2018 Aug 22.

College of Computer Science and Technology, Jilin University, No. 2699 Qianjin Street, Changchun, 130012, China.

Background: In recent decades, detecting protein complexes (PCs) from protein-protein interaction networks (PPINs) has been an active area of research. There are a large number of excellent graph clustering methods that work very well for identifying PCs. However, most of existing methods usually overlook the inherent core-attachment organization of PCs. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2309-9DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6106838PMC
August 2018
5 Reads

StructureProfiler: An all-in-one Tool for 3D Protein Structure Profiling.

Bioinformatics 2018 Aug 16. Epub 2018 Aug 16.

ZBH - Center for Bioinformatics, Universität Hamburg, Hamburg, Germany.

Motivation: Three-dimensional protein structures are important starting points for elucidating protein function and applications like drug design. Computational methods in this area rely on high quality validation data sets which are usually manually assembled. Due to the increase in published structures as well as the increasing demand for specially tailored validation data sets, automatic procedures should be adopted. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty692DOI Listing
August 2018
3 Reads

BROCKMAN: deciphering variance in epigenomic regulators by k-mer factorization.

BMC Bioinformatics 2018 07 3;19(1):253. Epub 2018 Jul 3.

Klarman Cell Observatory, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.

Background: Variation in chromatin organization across single cells can help shed important light on the mechanisms controlling gene expression, but scale, noise, and sparsity pose significant challenges for interpretation of single cell chromatin data. Here, we develop BROCKMAN (Brockman Representation Of Chromatin by K-mers in Mark-Associated Nucleotides), an approach to infer variation in transcription factor (TF) activity across samples through unsupervised analysis of the variation in DNA sequences associated with an epigenomic mark.

Results: BROCKMAN represents each sample as a vector of epigenomic-mark-associated DNA word frequencies, and decomposes the resulting matrix to find hidden structure in the data, followed by unsupervised grouping of samples and identification of the TFs that distinguish groups. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2255-6DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6029352PMC
July 2018
11 Reads

HFSP: high speed homology-driven function annotation of proteins.

Bioinformatics 2018 07;34(13):i304-i312

Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ, USA.

Motivation: The rapid drop in sequencing costs has produced many more (predicted) protein sequences than can feasibly be functionally annotated with wet-lab experiments. Thus, many computational methods have been developed for this purpose. Most of these methods employ homology-based inference, approximated via sequence alignments, to transfer functional annotations between proteins. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty262DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022561PMC
July 2018
3 Reads

DisruPPI: structure-based computational redesign algorithm for protein binding disruption.

Bioinformatics 2018 07;34(13):i245-i253

Department of Computer Science, Dartmouth, Hanover, NH, USA.

Motivation: Disruption of protein-protein interactions can mitigate antibody recognition of therapeutic proteins, yield monomeric forms of oligomeric proteins, and elucidate signaling mechanisms, among other applications. While designing affinity-enhancing mutations remains generally quite challenging, both statistically and physically based computational methods can precisely identify affinity-reducing mutations. In order to leverage this ability to design variants of a target protein with disrupted interactions, we developed the DisruPPI protein design method (DISRUpting Protein-Protein Interactions) to optimize combinations of mutations simultaneously for both disruption and stability, so that incorporated disruptive mutations do not inadvertently affect the target protein adversely. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty274DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022686PMC
July 2018
2 Reads

A novel methodology on distributed representations of proteins using their interacting ligands.

Bioinformatics 2018 07;34(13):i295-i303

Department of Computer Engineering, Bogazici University, Istanbul, Turkey.

Motivation: The effective representation of proteins is a crucial task that directly affects the performance of many bioinformatics problems. Related proteins usually bind to similar ligands. Chemical characteristics of ligands are known to capture the functional and mechanistic properties of proteins suggesting that a ligand-based approach can be utilized in protein representation. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty287DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022674PMC
July 2018
15 Reads

mol2sphere: spherical decomposition of multi-domain molecules for visualization and coarse grained spatial modeling.

Bioinformatics 2018 Nov;34(22):3948-3950

Department of Molecular Biology and Biophysics, UConn Health, Farmington, CT, USA.

Motivation: Proteins, especially those involved in signaling pathways are composed of functional modules connected by linker domains with varying degrees of flexibility. To understand the structure-function relationships in these macromolecules, it is helpful to visualize the geometric arrangement of domains. Furthermore, accurate spatial representation of domain structure is necessary for coarse-grain models of the multi-molecular interactions that comprise signaling pathways. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty487DOI Listing
November 2018
2 Reads

SSMART: sequence-structure motif identification for RNA-binding proteins.

Bioinformatics 2018 Dec;34(23):3990-3998

Berlin Institute for Medical Systems Biology, Max Delbruck Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany.

Motivation: RNA-binding proteins (RBPs) regulate every aspect of RNA metabolism and function. There are hundreds of RBPs encoded in the eukaryotic genomes, and each recognize its RNA targets through a specific mixture of RNA sequence and structure properties. For most RBPs, however, only a primary sequence motif has been determined, while the structure of the binding sites is uncharacterized. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty404DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6247937PMC
December 2018
10 Reads

Sequence-based prediction of physicochemical interactions at protein functional sites using a function-and-interaction-annotated domain profile database.

BMC Bioinformatics 2018 06 1;19(1):204. Epub 2018 Jun 1.

College of Biotechnology and Pharmaceutical Engineering, Nanjing Tech University, Biotech Building Room B1-404, 30 South Puzhu Road, Jiangsu, 211816, Nanjing, People's Republic of China.

Background: Identifying protein functional sites (PFSs) and, particularly, the physicochemical interactions at these sites is critical to understanding protein functions and the biochemical reactions involved. Several knowledge-based methods have been developed for the prediction of PFSs; however, accurate methods for predicting the physicochemical interactions associated with PFSs are still lacking.

Results: In this paper, we present a sequence-based method for the prediction of physicochemical interactions at PFSs. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2206-2DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5984826PMC
June 2018
3 Reads

Applying graph theory to protein structures: an Atlas of coiled coils.

Bioinformatics 2018 Oct;34(19):3316-3323

School of Chemistry, University of Bristol, Bristol, UK.

Motivation: To understand protein structure, folding and function fully and to design proteins de novo reliably, we must learn from natural protein structures that have been characterized experimentally. The number of protein structures available is large and growing exponentially, which makes this task challenging. Indeed, computational resources are becoming increasingly important for classifying and analyzing this resource. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty347DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157074PMC
October 2018
3 Reads

GapRepairer: a server to model a structural gap and validate it using topological analysis.

Bioinformatics 2018 Oct;34(19):3300-3307

Centre of New Technologies, University of Warsaw, Warsaw, Poland.

Motivation: Over 25% of protein structures possess unresolved fragments. On the other hand, approximately 6% of protein chains have non-trivial topology (and form knots, slipknots, lassos and links). As the topology is fundamental for the proper function of proteins, modeling of topologically correct structures is decisive in various fields, including biophysics, biotechnology and molecular biology. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty334DOI Listing
October 2018
5 Reads

Predicting gene structure changes resulting from genetic variants via exon definition features.

Bioinformatics 2018 Nov;34(21):3616-3623

Program in Computational Biology and Bioinformatics, Duke University, Durham, NC, USA.

Motivation: Genetic variation that disrupts gene function by altering gene splicing between individuals can substantially influence traits and disease. In those cases, accurately predicting the effects of genetic variation on splicing can be highly valuable for investigating the mechanisms underlying those traits and diseases. While methods have been developed to generate high quality computational predictions of gene structures in reference genomes, the same methods perform poorly when used to predict the potentially deleterious effects of genetic changes that alter gene splicing between individuals. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty324DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6198862PMC
November 2018
5 Reads

Structural disorder of plasmid-encoded proteins in Bacteria and Archaea.

BMC Bioinformatics 2018 04 25;19(1):158. Epub 2018 Apr 25.

Bio-lab, Institute of General and Physical Chemistry, P.O.B. 45, Studentski trg 12/V, Belgrade, 11001, Serbia.

Background: In the last decade and a half it has been firmly established that a large number of proteins do not adopt a well-defined (ordered) structure under physiological conditions. Such intrinsically disordered proteins (IDPs) and intrinsically disordered (protein) regions (IDRs) are involved in essential cell processes through two basic mechanisms: the entropic chain mechanism which is responsible for rapid fluctuations among many alternative conformations, and molecular recognition via short recognition elements that bind to other molecules. IDPs possess a high adaptive potential and there is special interest in investigating their involvement in organism evolution. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2158-6DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5922023PMC
April 2018
4 Reads

Ultra-fast global homology detection with Discrete Cosine Transform and Dynamic Time Warping.

Bioinformatics 2018 Sep;34(18):3118-3125

Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium.

Motivation: Evolutionary information is crucial for the annotation of proteins in bioinformatics. The amount of retrieved homologs often correlates with the quality of predicted protein annotations related to structure or function. With a growing amount of sequences available, fast and reliable methods for homology detection are essential, as they have a direct impact on predicted protein annotations. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty309DOI Listing
September 2018
5 Reads

SPIDR: small-molecule peptide-influenced drug repurposing.

BMC Bioinformatics 2018 04 16;19(1):138. Epub 2018 Apr 16.

Department of Chemistry and Biochemistry, Boise State University, Boise, USA.

Background: Conventional de novo drug design is costly and time consuming, making it accessible to only the best resourced research organizations. An emergent approach to new drug development is drug repurposing, in which compounds that have already gone through some level of clinical testing are examined for efficacy against diseases divergent than their original application. Repurposing of existing drugs circumvents the time and considerable cost of early stages of drug development, and can be accelerated by using software to screen existing chemical databases to identify suitable drug candidates. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2153-yDOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5902895PMC
April 2018
4 Reads

COZOID: contact zone identifier for visual analysis of protein-protein interactions.

BMC Bioinformatics 2018 04 6;19(1):125. Epub 2018 Apr 6.

Faculty of Informatics, Masaryk University, Brno, Czech Republic.

Background: Studying the patterns of protein-protein interactions (PPIs) is fundamental for understanding the structure and function of protein complexes. The exploration of the vast space of possible mutual configurations of interacting proteins and their contact zones is very time consuming and requires the proteomic expert knowledge.

Results: In this paper, we propose a novel tool containing a set of visual abstraction techniques for the guided exploration of PPI configuration space. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2113-6DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5889581PMC
April 2018
6 Reads

DeepSol: a deep learning framework for sequence-based protein solubility prediction.

Bioinformatics 2018 Aug;34(15):2605-2613

Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar.

Motivation: Protein solubility plays a vital role in pharmaceutical research and production yield. For a given protein, the extent of its solubility can represent the quality of its function, and is ultimately defined by its sequence. Thus, it is imperative to develop novel, highly accurate in silico sequence-based protein solubility predictors. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/bty166DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6355112PMC
August 2018
4 Reads

Inferring RNA sequence preferences for poorly studied RNA-binding proteins based on co-evolution.

BMC Bioinformatics 2018 03 12;19(1):96. Epub 2018 Mar 12.

Department of Computer Science, University of British Columbia, Vancouver, Canada.

Background: Characterizing the binding preference of RNA-binding proteins (RBP) is essential for us to understand the interaction between an RBP and its RNA targets, and to decipher the mechanism of post-transcriptional regulation. Experimental methods have been used to generate protein-RNA binding data for a number of RBPs in vivo and in vitro. Utilizing the binding data, a couple of computational methods have been developed to detect the RNA sequence or structure preferences of the RBPs. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2091-8DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5848454PMC
March 2018
8 Reads

Comparative assessment of strategies to identify similar ligand-binding pockets in proteins.

BMC Bioinformatics 2018 03 9;19(1):91. Epub 2018 Mar 9.

Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, USA.

Background: Detecting similar ligand-binding sites in globally unrelated proteins has a wide range of applications in modern drug discovery, including drug repurposing, the prediction of side effects, and drug-target interactions. Although a number of techniques to compare binding pockets have been developed, this problem still poses significant challenges.

Results: We evaluate the performance of three algorithms to calculate similarities between ligand-binding sites, APoc, SiteEngine, and G-LoSA. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2109-2DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5845264PMC
March 2018
2 Reads

Alignment-free clustering of large data sets of unannotated protein conserved regions using minhashing.

BMC Bioinformatics 2018 03 5;19(1):83. Epub 2018 Mar 5.

School of EECS, Washington State University, 355 NE Spokane St, Pullman, 99164, USA.

Background: Clustering of protein sequences is of key importance in predicting the structure and function of newly sequenced proteins and is also of use for their annotation. With the advent of multiple high-throughput sequencing technologies, new protein sequences are becoming available at an extraordinary rate. The rapid growth rate has impeded deployment of existing protein clustering/annotation tools which depend largely on pairwise sequence alignment. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2080-yDOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5838936PMC
March 2018
12 Reads

BetaSerpentine: a bioinformatics tool for reconstruction of amyloid structures.

Bioinformatics 2018 02;34(4):599-608

Structural Bioinformatics and Molecular Modeling, Centre de Recherche en Biologie Cellulaire de Montpellier, CNRS, Université Montpellier, Montpellier 34293, France.

Motivation: Numerous experimental studies have suggested that polypeptide chains of large amyloidogenic regions zig-zag in β-serpentine arrangements. These β-serpentines are stacked axially and form the superpleated β-structure. Despite this progress in the understanding of amyloid folds, the determination of their 3D structure at the atomic level is still a problem due to the polymorphism of these fibrils and incompleteness of experimental structural data. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btx629DOI Listing
February 2018
7 Reads

Exploring the potential of 3D Zernike descriptors and SVM for protein-protein interface prediction.

BMC Bioinformatics 2018 02 6;19(1):35. Epub 2018 Feb 6.

Department of Information Engineering, University of Padova, via Gradenigo 6/A, Padova, 35131, Italy.

Background: The correct determination of protein-protein interaction interfaces is important for understanding disease mechanisms and for rational drug design. To date, several computational methods for the prediction of protein interfaces have been developed, but the interface prediction problem is still not fully understood. Experimental evidence suggests that the location of binding sites is imprinted in the protein structure, but there are major differences among the interfaces of the various protein types: the characterising properties can vary a lot depending on the interaction type and function. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2043-3DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5802066PMC
February 2018
4 Reads

Grid-based prediction of torsion angle probabilities of protein backbone and its application to discrimination of protein intrinsic disorder regions and selection of model structures.

BMC Bioinformatics 2018 02 1;19(1):29. Epub 2018 Feb 1.

Institute for Glycomics and School of Information and Communication Technology, Griffith University, Parklands Dr, Southport, QLD, 4222, Australia.

Background: Protein structure can be described by backbone torsion angles: rotational angles about the N-Cα bond (φ) and the Cα-C bond (ψ) or the angle between Cα-Cα-Cα (θ) and the rotational angle about the Cα-Cα bond (τ). Thus, their accurate prediction is useful for structure prediction and model refinement. Early methods predicted torsion angles in a few discrete bins whereas most recent methods have focused on prediction of angles in real, continuous values. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2031-7DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5796405PMC
February 2018
4 Reads

Large scale analysis of protein conformational transitions from aqueous to non-aqueous media.

BMC Bioinformatics 2018 01 30;19(1):27. Epub 2018 Jan 30.

Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Sáenz Peña 352, B1876BXD, Bernal, Provincia de Buenos Aires, Argentina.

Background: Biocatalysis in organic solvents is nowadays a common practice with a large potential in Biotechnology. Several studies report that proteins which are co-crystallized or soaked in organic solvents preserve their fold integrity showing almost identical arrangements when compared to their aqueous forms. However, it is well established that the catalytic activity of proteins in organic solvents is much lower than in water. Read More

View Article

Download full-text PDF

Source
http://dx.doi.org/10.1186/s12859-018-2044-2DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5791380PMC
January 2018
2 Reads