A new linear function for modelling the background in whole-powder-pattern fitting has been derived by applying LASSO (least absolute shrinkage and selection operator) together with a tree search. The background function (BGF) consists of terms $b_{nL}(2\theta/180)^{n/2}$ and $b_{nH}(1 - 2\theta/180)^{n/2}$ for the low- and high-angle sides, respectively. To find the best fit for a given data set without overfitting, some parameters of the BGF should be fixed at zero while the others are varied. The LASSO algorithm can select the variables automatically in linear regression analysis. However, the set of adjustable parameters it selects for the best-fit BGF differs from one data set to another, so LASSO yields multiple solutions depending on the data set used. Tree structures were constructed by regarding the individual LASSO solutions as nodes. The root node has the maximum number of adjustable parameters, P. P decreases by one with each descending level of the tree, and leaf nodes have just one parameter. By evaluating the individual solutions (nodes) by their $\chi^2$ index, the best-fit single path from a root node to a leaf node was found. The present BGF can be used simply by varying P in the range 1–10. The BGF thus derived as a final single solution was incorporated into computer programs for Pawley-based whole-powder-pattern decomposition and Rietveld refinement, and its performance was tested against the polynomials currently in wide use as the BGF. The present BGF has been demonstrated to be stable and to give an excellent fit, comparable to that of polynomials but with fewer adjustable parameters and without introducing undulation into the calculated background curve. Basic algorithms used in statistics and machine learning have thus been shown to be useful in developing an analytical model in X-ray crystallography.
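Because the BGF is linear in its coefficients, fitting it for a fixed choice of terms reduces to ordinary linear least squares. The following Python sketch illustrates only that point and is not the published implementation. The function names, the exponent lists `low_n` and `high_n`, the synthetic coefficients and the assumption that 2θ is given in degrees are all hypothetical choices made for illustration; the abstract does not state which values of n are retained.

```python
import numpy as np

def bgf_design_matrix(two_theta_deg, low_n, high_n):
    """Build the columns (2θ/180)^(n/2) for n in low_n, followed by
    (1 - 2θ/180)^(n/2) for n in high_n. 2θ is assumed to be in degrees."""
    x = np.asarray(two_theta_deg, dtype=float) / 180.0
    low = [x ** (n / 2.0) for n in low_n]
    high = [(1.0 - x) ** (n / 2.0) for n in high_n]
    return np.column_stack(low + high)

def fit_background(two_theta_deg, intensity, low_n, high_n):
    """Least-squares estimate of the coefficients for the chosen terms.
    Terms not listed in low_n/high_n are implicitly fixed at zero."""
    A = bgf_design_matrix(two_theta_deg, low_n, high_n)
    y = np.asarray(intensity, dtype=float)
    coeffs, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coeffs, A @ coeffs

# Hypothetical example: P = 4 adjustable parameters on a synthetic,
# noise-free background (illustrative values only, not from the article).
two_theta = np.linspace(5.0, 150.0, 300)
low_n, high_n = [1, 2], [1, 3]
true_b = np.array([40.0, -10.0, 25.0, 60.0])
A = bgf_design_matrix(two_theta, low_n, high_n)
y = A @ true_b

b_fit, y_fit = fit_background(two_theta, y, low_n, high_n)
```

Here the number of adjustable parameters P is simply `len(low_n) + len(high_n)`. Choosing which terms to keep for each P is the part the article addresses with LASSO and the tree search, and it is not reproduced in this sketch.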

Supporting information

Rietveld powder data file (CIF format) https://doi.org/10.1107/S1600576720016751/tu5002sup1.rtv
Contains datablock znvoprofile



