Description
Robust accelerator control increasingly relies on data-driven optimisation, yet balancing adaptability with safety remains challenging. Simulation-driven, physics-informed reinforcement learning (RL) relies on soft constraints without formal safety guarantees, while classical response-matrix inversion (RMI) becomes suboptimal under noise and hard actuator limits. Using the AWAKE electron beam-steering task at CERN as a high-fidelity benchmark, we formulate beam steering as a stochastic control problem in a linear Markov decision process (MDP) with continuous state and action spaces and realistic constraints, and we compare RMI, the nominally optimal linear controller (Kalman quadratic programming, KalmanQP), Gaussian-process model predictive control (GP-MPC), and RL.
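As a minimal illustration of the RMI baseline described above, the sketch below solves the linear steering correction in least squares and then clips the result to hard actuator limits; the clipping step is precisely where nominal optimality is lost under constraints. The response matrix, offsets, and limits here are toy values, not AWAKE data, and the function name `rmi_correction` is illustrative.

```python
import numpy as np

def rmi_correction(R, y, u_min, u_max):
    """Response-matrix inversion: solve R @ du ~= -y in least squares,
    then clip to actuator limits (where nominal optimality breaks down)."""
    du, *_ = np.linalg.lstsq(R, -y, rcond=None)
    return np.clip(du, u_min, u_max)

# Toy example: 3 beam-position-monitor readings, 2 corrector magnets.
R = np.array([[1.0, 0.0],
              [0.5, 1.0],
              [0.2, 0.8]])          # assumed response matrix
y = np.array([0.4, 0.1, -0.3])     # measured orbit offsets
du = rmi_correction(R, y, u_min=-1.0, u_max=1.0)
```

Under measurement noise, `y` no longer lies near the column space of `R`, so the clipped least-squares step drifts from the true optimum, which motivates the learned controllers compared in this work.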
Our main contribution is a causal GP-MPC scheme that embeds the beamline’s causal layout directly into the GP prior and kernel. This structural inductive bias reduces model complexity, improves conditioning, and enables accurate multi-step prediction from limited data. In simulations on the measured response matrix, RMI and KalmanQP perform well in benign conditions, but their nominal optimality is brittle: performance degrades sharply under noise. PPO learns robust policies but is data-inefficient. Structured GP-MPC bridges these extremes, leveraging the RMI-based physical prior for high sample efficiency and a learned residual to surpass the robustness of standard controllers. Taken together, the results indicate that causally structured learning offers a promising route to data-efficient, interpretable, and deployable control strategies for complex accelerator systems.
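One way to picture the causal structural bias is input masking: because a corrector magnet can only affect monitors downstream of it, the per-monitor GP need only see upstream correctors, which shrinks the input dimension and conditions the kernel. The sketch below is an assumed illustration of that idea (the names `causal_kernel`, `rbf`, and the mask construction are hypothetical, not the paper's implementation).

```python
import numpy as np

def rbf(X1, X2, lengthscale=1.0):
    """Standard squared-exponential (RBF) kernel between two input sets."""
    d2 = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / lengthscale**2)

def causal_kernel(X1, X2, mask):
    """Restrict the GP inputs to the correctors upstream of one monitor;
    the boolean mask encodes the beamline's causal layout in the prior."""
    return rbf(X1[:, mask], X2[:, mask])

n_correctors = 5
# Lower-triangular causality: the monitor at position 2 only sees
# correctors 0..2 (everything downstream is masked out).
mask_bpm2 = np.arange(n_correctors) <= 2

X = np.random.default_rng(0).normal(size=(10, n_correctors))
K = causal_kernel(X, X, mask_bpm2)   # 10x10 Gram matrix for that monitor
```

Stacking one such masked GP per monitor yields a block-triangular model that mirrors the physical response matrix, which is the structural prior the abstract credits for sample efficiency and multi-step accuracy.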
| Student |
|---|
| Yes |