Abstract
We study the synthesis of control for a system that interacts with a black-box environment, based on deep learning. The goal is to minimize the number of interaction failures. The current state of the environment is unavailable to the controller, hence its operation depends on a limited view of the history. We suggest a reinforcement learning framework of training a Recurrent Neural Network (RNN) to control such a system. We experiment with various parameters: loss function, exploration/exploitation ratio, and size of lookahead. We designed examples that capture various potential control difficulties. We present experiments performed with the toolkit DyNet.
Original language | English |
---|---|
Title of host publication | Leveraging Applications of Formal Methods, Verification and Validation |
Subtitle of host publication | Engineering Principles - 9th International Symposium on Leveraging Applications of Formal Methods, ISoLA 2020, Proceedings |
Editors | Tiziana Margaria, Bernhard Steffen |
Publisher | Springer Science and Business Media Deutschland GmbH |
Pages | 457-472 |
Number of pages | 16 |
ISBN (Print) | 9783030614690 |
DOIs | |
State | Published - 2020 |
Event | 9th International Symposium on Leveraging Applications of Formal Methods, Verification and Validation, ISoLA 2020 - Rhodes, Greece Duration: 20 Oct 2020 → 30 Oct 2020 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 12477 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 9th International Symposium on Leveraging Applications of Formal Methods, Verification and Validation, ISoLA 2020 |
---|---|
Country/Territory | Greece |
City | Rhodes |
Period | 20/10/20 → 30/10/20 |
Bibliographical note
Publisher Copyright:© 2020, Springer Nature Switzerland AG.
Funding
S. Iosti and S. Bensalem—The research performed by these authors was partially funded by H2020-ECSEL grants CPS4EU 2018-IA call - Grant Agreement number 826276. D. Peled and K. Aharon—The research performed by these authors was partially funded by ISF grants “Runtime Measuring and Checking of Cyber Physical Systems” (ISF award 2239/15) and “Efficient Runtime Verification for Systems with Lots of Data and its Applications” (ISF award 1464/18).
Funders | Funder number |
---|---|
H2020-ECSEL | |
Horizon 2020 Framework Programme | 826276 |
Israel Science Foundation | 1464/18, 2239/15 |