BQIT:23 – Sample-efficient Model-based Reinforcement Learning for Quantum Control

Irtaza Khalid, Carrie Weidner, S.G. Schirmer, Edmond Jonckheere, Frank C. Langbein. Sample-efficient Model-based Reinforcement Learning for Quantum Control. Poster, BQIT:23, 2023. [PDF:poster]

Our model-based reinforcement learning (RL) algorithm reduces the sample complexity for time-dependent noisy quantum gate control tasks by at least an order of magnitude over model-free RL.
The model is a differentiable ordinary differential equation (ODE) within our Learnable Hamiltonian Model-Based Soft-Actor Critic (LH-MBSAC) algorithm.
We encode a partially characterised Hamiltonian in the model and only learn the time-independent term.
The learned model can be leveraged to further optimize RL controllers using GRAPE.
LH-MBSAC is a step towards bridging the gap between theoretical and experimental quantum control by reducing the experimental resource requirements for RL control.

Cite this page as 'Frank C Langbein, "BQIT:23 – Sample-efficient Model-based Reinforcement Learning for Quantum Control," Ex Tenebris Scientia, 10th April 2023, https://langbein.org/bqit23/ [accessed 10th August 2025]'.

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Frank C Langbein
Ex Tenebris Scientia

Frank C Langbein
Ex Tenebris Scientia

BQIT:23 – Sample-efficient Model-based Reinforcement Learning for Quantum Control