Abstract
This work describes the integration of several data mining and machine learning tools for researching Photovoltaic (PV) solar cells libraries into a unified workflow embedded within a GUI-supported Decision Support System (DSS), named PV Analyzer. The analyzer's workflow is composed of several data analysis components including basic statistical and visualization methods as well as an algorithm for building predictive machine learning models. The analyzer allows for the identification of interesting trends within the libraries, not easily observable using simple bi-parametric correlations. This may lead to new insights into factor affecting solar cells performances with the ultimate goal of designing better solar cells. The analyzer was developed using MATLAB version R2014a and consequently could be easily extended by adding additional tools and algorithms. Furthermore, while in our hands, the analyzer has been primarily used in the area of PV cells, is it equally applicable to the analysis of any other dataset composed of activities as dependent variables and descriptors as independent variables.
Original language | English |
---|---|
Article number | 1800067 |
Journal | Molecular Informatics |
Volume | 37 |
Issue number | 9 |
DOIs | |
State | Published - Sep 2018 |
Bibliographical note
Publisher Copyright:© 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim
Funding
The raw data utilized in this manuscript and which are provided in the supporting information were produced in the laboratories of Prof. Arie Zaban at Bar-Ilan University, Ramat Gan, Israel and Professor Elvira Fortunato at the Materials Science Department of New University of Lisbon, Portugal. We are thankful to the members of both labs for sharing the data with us. The authors also acknowledge COST action CA16235 “Performance and Reliability of Photovoltaic Systems: Evaluations of Large-Scale Monitoring Data” (PEARL-PV) for travel support.
Funders | Funder number |
---|---|
Materials Science Department of New University of Lisbon | |
European Cooperation in Science and Technology | CA16235 |
Keywords
- Data mining
- Decision Support System
- Machine learning
- Material-Informatics
- Photovoltaic
- QSAR
- RANSAC
- Solar cell