Basic protocols in quantum reinforcement learning with superconducting circuits

Lucas Lamata

Sci Rep. 2017 May 9;7(1):1609. doi: 10.1038/s41598-017-01711-6
Abstract

Superconducting circuit technologies have recently achieved quantum protocols involving closed feedback loops. Quantum artificial intelligence and quantum machine learning are emerging fields within quantum technologies that may enable quantum devices to acquire information from the outer world and improve themselves via a learning process. Here we propose the implementation of basic protocols in quantum reinforcement learning, with superconducting circuits employing feedback-loop control. We introduce diverse scenarios for proof-of-principle experiments with state-of-the-art superconducting circuit technologies and analyze their feasibility in the presence of imperfections. The field of quantum artificial intelligence implemented with superconducting circuits paves the way for enhanced quantum control and quantum computation protocols.


Conflict of interest statement

The author declares that they have no competing interests.

Figures

Figure 1
Scheme of reinforcement learning. In each learning cycle, an Agent, denoted by S, interacts with an Environment, denoted by E, performing some Action (A) on it and gathering information, or Percept (P), about its relation to it. The information obtained is then used to decide a strategy for optimizing the agent, based on a Reward Criterion, whose aim may be to maximize a Learning Fidelity. Afterwards, a new cycle begins. The situation in the quantum realm is similar, and can range between having a quantum version of the agent, of the environment, or of both, as well as interactions between them through quantum and/or classical channels with feedforward.
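The act-perceive-reward cycle in the caption can be sketched classically. The following is a minimal, illustrative loop (all names and the 0.1 fidelity increment are hypothetical, not from the paper): the agent acts on a two-valued environment, perceives whether its action matched, and either reinforces its current strategy or switches action.

```python
def learning_cycle(agent_policy, environment_state, n_cycles=100):
    """Minimal reinforcement-learning loop: act, perceive, reward, update.
    All names and numbers here are illustrative, not from the paper."""
    fidelity = 0.0
    for _ in range(n_cycles):
        action = agent_policy["action"]             # Action (A) on the environment
        percept = int(action == environment_state)  # Percept (P): did the action match?
        reward = percept                            # Reward criterion: 1 if matched
        if reward:
            fidelity = min(1.0, fidelity + 0.1)     # reinforce the current strategy
        else:
            agent_policy["action"] = 1 - action     # otherwise, try the other action
    return agent_policy, fidelity
```

After enough cycles the agent's action matches the environment and the learning fidelity saturates at its maximum.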
Figure 2
Quantum reinforcement learning for one qubit. We depict the circuit representation of the proposed learning protocol. S, E and R denote the agent, environment and register qubits, respectively. CNOT gates between E and R, as well as between S and R, are depicted with the standard notation. M is a measurement in the chosen computational basis, while U_S and U_R are local operations on the agent and register, respectively, conditional on the measurement outcomes via a classical feedback loop. The double lines denote classical information being fed forward. The protocol can be iterated upon changes in the environment.
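The core feedback primitive in the caption (entangle the environment qubit with the register via a CNOT, measure the register, and feed the outcome forward into a conditional local operation on the agent) can be sketched numerically. The following is an illustrative single-shot NumPy simulation, not the paper's implementation; the environment angle `theta` is an arbitrary choice unknown to the agent.

```python
import numpy as np

rng = np.random.default_rng(0)

# Single-qubit X gate and the CNOT (control = first qubit)
X = np.array([[0, 1], [1, 0]], dtype=complex)
CNOT = np.array([[1, 0, 0, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1],
                 [0, 0, 1, 0]], dtype=complex)

def measure_qubit(state, qubit, n_qubits, rng):
    """Projectively measure one qubit of an n-qubit state vector in the
    computational basis; return (outcome, collapsed normalized state)."""
    idx = np.arange(2 ** n_qubits)
    mask1 = ((idx >> (n_qubits - 1 - qubit)) & 1) == 1  # indices where qubit is 1
    p1 = (np.abs(state) ** 2)[mask1].sum()
    outcome = int(rng.random() < p1)
    keep = mask1 if outcome else ~mask1
    collapsed = np.where(keep, state, 0)
    return outcome, collapsed / np.linalg.norm(collapsed)

# Environment qubit E in a state unknown to the agent; register R in |0>
theta = 1.1  # hypothetical angle
E = np.array([np.cos(theta / 2), np.sin(theta / 2)], dtype=complex)
ER = np.kron(E, np.array([1, 0], dtype=complex))

# CNOT from E to R copies E's computational-basis population onto R
ER = CNOT @ ER
outcome, ER = measure_qubit(ER, qubit=1, n_qubits=2, rng=rng)

# Feedback: the conditional local operation U_S flips the agent S (initially |0>)
# so that it matches the measured environment outcome
S = np.array([1, 0], dtype=complex)
if outcome == 1:
    S = X @ S
```

After the feedback step the agent qubit is aligned with the measurement outcome, which is the single-shot essence of the measurement-plus-feedforward loop the circuit depicts.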
Figure 3
Quantum reinforcement learning for a multiqubit system, I. We depict the circuit representation of the proposed learning protocol. S, E and R denote the agent, environment and register two-qubit states, respectively. CNOT gates between the respective pairs of qubits in E and R, as well as in S and R, are depicted with the standard notation. M_1 is a measurement in the chosen computational basis on the first qubit of the register, and M_2 is a measurement in the chosen computational basis on the two-qubit register state, while U_S and U_R are local operations on the agent and register, respectively, conditional on the measurement outcomes via a classical feedback loop. The narrow double lines denote classical information being fed forward, while the wider horizontal double lines denote two-qubit states. The protocol can be iterated upon changes in the environment via reset of the agent.
Figure 4
Quantum reinforcement learning for a multiqubit system, II. We depict the circuit representation of the proposed learning protocol. S, E and R denote the agent, environment and register two-qubit states, respectively. CNOT gates between the respective pairs of qubits in E and R, as well as in S and R, are depicted with the standard notation. In this case, the measurement M acts on both register qubits and is performed in the chosen computational basis. U_S and U_R are local operations on the agent and register, respectively, conditional on the measurement outcomes via a classical feedback loop. The narrow double lines denote classical information being fed forward, while the wider horizontal double lines denote two-qubit states. The protocol can be iterated upon changes in the environment via reset of the agent.
Figure 5
Scheme of the proposed implementation. In the most complex example proposed, we consider six superconducting qubits inside a 3D cavity, distributed in two rows along the cavity axis (an alternative configuration would be two three-qubit columns perpendicular to the cavity axis). Amp denotes the amplification process, C represents the controller device, and U is a local operation on the qubits conditioned on the classical feedback loop.


