Dynamic Classification for Materials-Informatics: Mining the Solar Cell Space

Abraham Yosipof, Anna Khalemsky, Roy Gelbard, Hanoch Senderowitz

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

The ever-growing data acquisition speed represents a challenge for data analysis in materials sciences in general and the field of solar cells in particular. This is because many unsupervised and supervised learning algorithms require model re-derivation when presented with new samples which are markedly different from those used for model construction. Dynamic segmentation addresses this problem by continuously updating the clusters structure, for example, by splitting old clusters or opening new ones, as new samples are presented. In this work we present the application of a Dynamic Classification Unit (DCU) to the study of the photovoltaic space. Using a database of 1165 metal oxide-based solar cells, constructed from five libraries, we demonstrate that the DCU algorithm, when initiated with only 10 % of the database, correctly classified 82 % of the remaining, 90 % samples. At the same time the algorithm unveiled the presence of interesting trends, outliers and compositional activity cliffs. These abilities may prove useful for the analysis of the photovoltaic space and in turn may contribute to the design of solar cells with improved properties. We suggest that DCU and other dynamic clustering methods will find wide applications in the rapidly developing field of materials informatics.

Original languageEnglish
Article number2000173
JournalMolecular Informatics
Volume41
Issue number1
Early online date1 Sep 2020
DOIs
StatePublished - Jan 2022

Bibliographical note

Publisher Copyright:
© 2020 Wiley-VCH GmbH

Fingerprint

Dive into the research topics of 'Dynamic Classification for Materials-Informatics: Mining the Solar Cell Space'. Together they form a unique fingerprint.

Cite this