Back to Search Start Over

Unsupervised learning approaches to characterizing heterogeneous samples using X-ray single-particle imaging

Authors :
Yulong Zhuang
Salah Awel
Anton Barty
Richard Bean
Johan Bielecki
Martin Bergemann
Benedikt J. Daurer
Tomas Ekeberg
Armando D. Estillore
Hans Fangohr
Klaus Giewekemeyer
Mark S. Hunter
Mikhail Karnevskiy
Richard A. Kirian
Henry Kirkwood
Yoonhee Kim
Jayanath Koliyadu
Holger Lange
Romain Letrun
Jannik Lübke
Abhishek Mall
Thomas Michelat
Andrew J. Morgan
Nils Roth
Amit K. Samanta
Tokushi Sato
Zhou Shen
Marcin Sikorski
Florian Schulz
John C. H. Spence
Patrik Vagovic
Tamme Wollweber
Lena Worbs
P. Lourdu Xavier
Oleksandr Yefanov
Filipe R. N. C. Maia
Daniel A. Horke
Jochen Küpper
N. Duane Loh
Adrian P. Mancuso
Henry N. Chapman
Kartik Ayyer
Source :
IUCrJ, Vol 9, Iss 2, Pp 204-214 (2022)
Publication Year :
2022
Publisher :
International Union of Crystallography, 2022.

Abstract

One of the outstanding analytical problems in X-ray single-particle imaging (SPI) is the classification of structural heterogeneity, which is especially difficult given the low signal-to-noise ratios of individual patterns and the fact that even identical objects can yield patterns that vary greatly when orientation is taken into consideration. Proposed here are two methods which explicitly account for this orientation-induced variation and can robustly determine the structural landscape of a sample ensemble. The first, termed common-line principal component analysis (PCA), provides a rough classification which is essentially parameter free and can be run automatically on any SPI dataset. The second method, utilizing variation auto-encoders (VAEs), can generate 3D structures of the objects at any point in the structural landscape. Both these methods are implemented in combination with the noise-tolerant expand–maximize–compress (EMC) algorithm and its utility is demonstrated by applying it to an experimental dataset from gold nanoparticles with only a few thousand photons per pattern. Both discrete structural classes and continuous deformations are recovered. These developments diverge from previous approaches of extracting reproducible subsets of patterns from a dataset and open up the possibility of moving beyond the study of homogeneous sample sets to addressing open questions on topics such as nanocrystal growth and dynamics, as well as phase transitions which have not been externally triggered.

Details

Language :
English
ISSN :
20522525
Volume :
9
Issue :
2
Database :
Directory of Open Access Journals
Journal :
IUCrJ
Publication Type :
Academic Journal
Accession number :
edsdoj.46dd9fdeb55640e18e8540d7e995bd4e
Document Type :
article
Full Text :
https://doi.org/10.1107/S2052252521012707