Abstract
Reinforcement-Learning-based solutions have achieved many successes in numerous complex tasks. However, their training process may be unstable, and achieving convergence can be difficult, expensive, and in some instances impossible. We propose herein an approach that enables the integration of strong formal verification methods in order to improve the learning process as well as prove convergence. During the learning process, formal methods serve as experts to identify weaknesses in the learned model, improve it, and even lead it to converge. By evaluating our approach on several common problems, which have already been studied and solved by classical methods, we demonstrate the strength and potential of our core idea of incorporating formal methods into the training process of Reinforcement Learning methods.
Original language | English |
---|---|
Title of host publication | SOFSEM 2023 |
Subtitle of host publication | Theory and Practice of Computer Science - 48th International Conference on Current Trends in Theory and Practice of Computer Science, SOFSEM 2023, Proceedings |
Editors | Leszek Gasieniec |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 342-355 |
Number of pages | 14 |
ISBN (Print) | 9783031231001 |
DOIs | |
State | Published - 2023 |
Event | 48th International Conference on Current Trends in Theory and Practice of Computer Science, SOFSEM 2023 - Nový Smokovec, Slovakia Duration: 15 Jan 2023 → 18 Jan 2023 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 13878 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 48th International Conference on Current Trends in Theory and Practice of Computer Science, SOFSEM 2023 |
---|---|
Country/Territory | Slovakia |
City | Nový Smokovec |
Period | 15/01/23 → 18/01/23 |
Bibliographical note
Publisher Copyright:© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Funding
Acknowledgments. This work is supported by the Horizon 2020 research and innovation programme for the Bio4Comp project under grant agreement number 732482 and by the ISRAEL SCIENCE FOUNDATION (Grant No. 190/19). We would like to thank Assaf Grundman and Shlomi Mamman for their work and feedback on an early version of this project, and the Data Science Institute at Bar-Ilan University.
Funders | Funder number |
---|---|
Israel Science Foundation | 190/19 |
Horizon 2020 | 732482 |
Keywords
- Formal verification
- Model checking
- Q-learning
- Reinforcement learning