Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning

Jan Kaiser, Chenran Xu, Annika Eichler, Andrea Santamaria Garcia, O. Stein, E. Bründermann, W. Kuropka, H. Dinter, F. Mayet, T. Vinatier, F. Burkart, H. Schlarb

Published 2023 in arXiv.org

ABSTRACT

Online tuning of real-world plants is a complex optimisation problem that continues to require manual intervention by experienced human operators. Autonomous tuning is a rapidly expanding field of research, where learning-based methods, such as Reinforcement Learning-trained Optimisation (RLO) and Bayesian optimisation (BO), hold great promise for achieving outstanding plant performance and reducing tuning times. Which algorithm to choose in different scenarios, however, remains an open question. Here we present a comparative study using a routine task in a real particle accelerator as an example, showing that RLO generally outperforms BO, but is not always the best choice. Based on the study's results, we provide a clear set of criteria to guide the choice of algorithm for a given tuning task. These can ease the adoption of learning-based autonomous tuning solutions to the operation of complex real-world plants, ultimately improving the availability and pushing the limits of operability of these facilities, thereby enabling scientific and engineering advancements.

PUBLICATION RECORD

Publication year
2023
Venue
arXiv.org
Publication date
2023-06-06
Fields of study
Physics, Computer Science, Engineering
Identifiers
DOI: 10.48550/arXiv.2306.03739; arXiv: 2306.03739
Source metadata
Semantic Scholar

REFERENCES

Magnetic control of tokamak plasmas through deep reinforcement learning
2022, cited by this paper
Bayesian optimization of the beam injection process into a storage ring
2022, cited by this paper
Learning-based Optimisation of Particle Accelerators Under Partial Observability Without Real-World Training
2022, influential reference
Development of an operation trajectory design algorithm for control of multiple 0D parameters using deep reinforcement learning in KSTAR
2022, cited by this paper
BADGER: THE MISSING OPTIMIZER IN ACR
2022, cited by this paper
Commissioning Results and Electron Beam Characterization with the S-Band Photoinjector at SINBAD-ARES
2021, cited by this paper
Application of Deep Reinforcement Learning to Thermal Control of Space Telescope
2021, cited by this paper
Toward autonomous additive manufacturing: Bayesian optimization on a 3D printer
2021, cited by this paper
Bayesian Optimization of a Laser-Plasma Accelerator
2021, cited by this paper
Learning to Optimize: A Primer and A Benchmark
2021, cited by this paper
Appendix to: BOTORCH: A Framework for Efficient Monte-Carlo Bayesian Optimization
2021, cited by this paper
Deep reinforcement learning for smart calibration of radio telescopes
2021, cited by this paper
Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings
2021, cited by this paper
First Steps Toward an Autonomous Accelerator, a Common Project Between DESY and KIT
2021, cited by this paper
Feedforward beta control in the KSTAR tokamak by deep reinforcement learning
2021, cited by this paper
Accelerated Deep Reinforcement Learning for Fast Feedback of Beam Dynamics at KARA
2021, cited by this paper
Towards Piston Fine Tuning of Segmented Mirrors through Reinforcement Learning
2020, cited by this paper
Intelligent Thermal Control Strategy Based on Reinforcement Learning for Space Telescope
2020, cited by this paper
Basic Reinforcement Learning Techniques to Control the Intensity of a Seeded Free-Electron Laser
2020, cited by this paper
Autonomous Control of a Particle Accelerator using Deep Reinforcement Learning
2020, cited by this paper
Real-time artificial intelligence for accelerator control: A study at the Fermilab Booster
2020, cited by this paper
MB2C: Model-Based Deep Reinforcement Learning for Multi-zone Building Control
2020, cited by this paper
Bayesian Optimization for Radio Resource Management: Open Loop Power Control
2020, cited by this paper
Sample-efficient reinforcement learning for CERN accelerator control
2020, cited by this paper
Policy gradient methods for free-electron laser and terahertz source optimization and stabilization at the FERMI free-electron laser at Elettra
2020, cited by this paper
Online tuning and light source control using a physics-informed Gaussian process
2019, cited by this paper
Feedback Design for Control of the Micro-Bunching Instability based on Reinforcement Learning
2019, cited by this paper
Spatial Variation
2019, cited by this paper
Challenges of Real-World Reinforcement Learning
2019, cited by this paper
SciPy 1.0: fundamental algorithms for scientific computing in Python
2019, cited by this paper
Bayesian Optimization of a Free-Electron Laser
2019, cited by this paper
Solving Rubik's Cube with a Robot Hand
2019, cited by this paper
Bayesian Optimization for Dynamic Problems
2018, cited by this paper
Structural mechanism for nucleotide-driven remodeling of the AAA-ATPase unfoldase in the activated human 26S proteasome
2018, cited by this paper
Addressing Function Approximation Error in Actor-Critic Methods
2018, cited by this paper
Online storage ring optimization using dimension-reduction and genetic algorithms
2018, cited by this paper
Online Optimisation of the MAX IV 3 GeV Ring Dynamic Aperture
2018, cited by this paper
Optimizing Chemical Reactions with Deep Reinforcement Learning
2017, cited by this paper
Learning to Optimize Neural Nets
2017, cited by this paper
Real-time control using Bayesian optimization: A case study in airborne wind energy systems
2017, cited by this paper
Domain randomization for transferring deep neural networks from simulation to the real world
2017, cited by this paper
Learning to Optimize
2016, cited by this paper
Bayesian Optimization of FEL Performance at LCLS
2016, cited by this paper
Progress in Automatic Software-based Optimization of Accelerator Performance
2016, cited by this paper
Taking the Human Out of the Loop: A Review of Bayesian Optimization
2016, cited by this paper
Bayesian optimization for maximum power point tracking in photovoltaic power plants
2016, cited by this paper
Learning to learn by gradient descent by gradient descent
2016, cited by this paper
Multi-objective particle swarm and genetic algorithm for the optimization of the LANSCE linac operation
2014, cited by this paper
MACHINE BASED OPTIMIZATION USING GENETIC ALGORITHMS IN A STORAGE RING
2014, cited by this paper
An algorithm for online optimization of accelerators
2013, cited by this paper
Contextual Gaussian Process Bandit Optimization
2011, cited by this paper
Efficient Global Optimization of Expensive Black-Box Functions
1998, cited by this paper
Dynamic Programming
1993, cited by this paper
A Simplex Method for Function Minimization
1965, cited by this paper
ACCELERATING LINEAR BEAM DYNAMICS SIMULATIONS FOR MACHINE LEARNING APPLICATIONS
year unknown, cited by this paper

CITED BY

Research on LLRF Feedback Control Algorithm Based on Reinforcement Learning
2025, cites this paper
Cheetah: Bridging the Gap Between Machine Learning and Particle Accelerator Physics with High-Speed, Differentiable Simulations
2024, influential citation
Microsecond-Latency Feedback at a Particle Accelerator by Online Reinforcement Learning on Hardware
2024, cites this paper
Bayesian optimization algorithms for accelerator physics
2023, cites this paper
HOW CAN MACHINE LEARNING HELP FUTURE LIGHT SOURCES?
year unknown, cites this paper
THE REINFORCEMENT LEARNING FOR AUTONOMOUS ACCELERATORS COLLABORATION
year unknown, cites this paper