Obtaining diffraction-quality crystals remains one of the major bottlenecks in structural biology. The ability to predict the chances of crystallization from the amino-acid sequence of a protein can, at least partly, address this problem by allowing a crystallographer to select homologs that are more likely to succeed and/or to modify the sequence of the target to avoid features that are detrimental to successful crystallization. In 2007, the now widely used XtalPred algorithm [Slabinski et al. (2007), Protein Sci. 16, 2472–2482] was developed. XtalPred classifies proteins into five 'crystallization classes' based on a simple statistical analysis of the physicochemical features of a protein. Here, towards the same goal, advanced machine-learning methods are applied and, in addition, the predictive potential of further protein features is tested: predicted surface ruggedness, hydrophobicity, side-chain entropy of surface residues and amino-acid composition of the predicted protein surface. The new XtalPred-RF (random forest) achieves a significant improvement in the prediction of crystallization success over the original XtalPred. To illustrate this, XtalPred-RF was tested by revisiting target selection from 271 Pfam families targeted by the Joint Center for Structural Genomics (JCSG) in PSI-2, and it was estimated that the number of targets entered into the protein-production and crystallization pipeline could have been reduced by 30% without lowering the number of families for which the first structures were solved. The prediction improvement depends on the subset of targets used as a testing set and reaches 100% (i.e. twofold) for the top class of predicted targets.
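As a rough illustration of what sequence-derived features of this kind can look like, the sketch below computes the sequence length, the mean Kyte-Doolittle hydropathy (GRAVY) and the amino-acid composition of a protein sequence. It is not the XtalPred-RF feature set: the function name `sequence_features`, the choice of hydropathy scale and the feature names are assumptions made for illustration, and surface-specific features such as ruggedness or side-chain entropy would require additional predictions not shown here.

```python
from collections import Counter

# Kyte-Doolittle hydropathy scale; using it here is an illustrative
# assumption, not necessarily the scale used by XtalPred-RF.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def sequence_features(seq: str) -> dict:
    """Return simple whole-sequence features (hypothetical feature names).

    Non-standard residue codes (e.g. 'X') are ignored.
    """
    residues = [aa for aa in seq.upper() if aa in KD]
    if not residues:
        raise ValueError("sequence contains no standard amino acids")

    n = len(residues)
    counts = Counter(residues)

    features = {
        "length": n,
        # GRAVY: mean hydropathy over all residues.
        "gravy": sum(KD[aa] for aa in residues) / n,
    }
    # Fractional amino-acid composition, one feature per residue type.
    for aa in sorted(KD):
        features[f"frac_{aa}"] = counts[aa] / n
    return features


if __name__ == "__main__":
    # Arbitrary example sequence, not taken from the article.
    example = sequence_features("MKTAYIAKQRQISFVKSHFSRQ")
    print(example["length"], round(example["gravy"], 3))
```

Feature vectors of this form, one per protein, could then be passed to any random-forest implementation together with known crystallization outcomes as training labels.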

Subscribe to Acta Crystallographica Section D: Biological Crystallography
