Complex Vector Gain-Based Annealer For Minimizing XY Hamiltonians
Complex Vector Gain-Based Annealer For Minimizing XY Hamiltonians
ṙi = γi ri − ri3 + α ∑ Jij rj cos (θi − θj ) , (4) The CoVeGA model operates through a system of
(1)
j N two-dimensional complex-field vectors Ψi = (ψi =
rj (1) (1) (2) (2) (2)
θ̇i = −α ∑ Jij sin (θi − θj ) . (5) ri eiθi , ψi = ri eiθi ). It utilizes annealing,
j ri symmetry-breaking bifurcation, gradient descent, and
mode selection to drive the system to the global min-
Starting from below the steady state threshold in the imum. The Hamiltonian is the sum of three terms
vacuum state ri = 0, all oscillators are pumped equally. H = H1 + αH2 + H3 , where
Then, depending on the structure of J, nonzero ampli-
tudes emerge at different rates for each oscillator as the 1 N 2 2
pumping intensity increases. The feedback mechanism H1 = ∑ (γi (t) − ∣∣Ψi ∣∣2 ) , (7)
2 i=1
of Eq. (2) adjusts each oscillator so that they all reach
1 N
equal amplitudes at the steady state threshold. H2 = − ∑ Jij (Ψi ⋅ Ψ∗j + Ψ∗i ⋅ Ψj ) , (8)
Only under the condition of equal amplitudes at the 2 i,j
steady state will Eq. (5) reach the minimum of the XY N
Hamiltonian. The sum of the steady states of Eq. (4) H3 = −P (t) ∑ (Ψj ⊙ Ψ∗i ) ⋅ [Q(Ψ∗j ⊙ Ψi )] . (9)
gives N = ∑N i=1 γi + α/2 ∑i,j Jij cos (θi − θj ), so the global
i,j=1
minimum of the XY model corresponds to the smallest
Here, ⊙ indicates element-wise multiplication, and Q is
effective injection ∑i γi . Close to the threshold, Eq. (5)
a 2 × 2 permutation matrix given by Q = ( 01 10 ). As the
becomes fully analogous to the Kuramoto model
effective gain γi (t) increases with time t from negative
N (effective losses) to positive values, H1 anneals between
θ̇i = −α ∑ Jij sin (θi − θj ) . (6) a convex function with minimum at ∣∣Ψi ∣∣22 = 0 for all i,
j=1 to nonzero amplitudes. Writing the complex vectors in
H2 using their polar coordinates gives
Equation (1) can be adapted to minimize Ising Hamilto-
nians by restricting the state space of the phase, which (1) (1) (1) (2) (2) (2)
H2 = − ∑ Jij (ri rj cos θij + ri rj cos θij ) , (10)
we detail in Appendix B. i,j
Gain-based systems described by Eq. (1) can still set- (k) (k) (k)
tle in local minima during amplitude bifurcation, which where we define θij ≡ θi −θj . Equation (10) is analo-
limits the probability of finding the global minimum. To gous to the second term on the right-hand side of Eq. (3),
combat this, we introduce the complex vector gain-based but now we have two terms corresponding to the two di-
annealer (CoVeGA) that exploits the advantages of ex- mensions in the complex vector space that CoVeGA op-
tended spatial dimensions. In this model, continuous XY erates in. The effective gain is subject to the feedback
3
D= ∑ (1 − ∣Cijk ∣) , (14)
{ijk}∈T
IV. RESULTS
Figure 5. The probability distribution of recovering a state Figure 6. The probability distribution of recovering a state
with excitation parameter D for CoVeGA, one-dimensional with excitation parameter D for CoVeGA, one-dimensional
Stuart-Landau, SVL, Kuramoto, and BFGS methods on N = Stuart-Landau, SVL, Kuramoto, and BFGS methods on N =
12 (blue), 102 (orange), and 204 (green) 4-regular Möbius 16 (blue), 100 (orange), and 194 (green) triangular lattice
ladder graphs. One thousand runs are used to calculate the graphs. One thousand runs starting from random initial con-
probability of recovering excitation parameters D for each ditions are used to calculate the probability for each method
value of N . Parameter values for CoVeGA are taken from and for each value of N .
Table (I) in Appendix E [43], and so are values for the Stuart-
Landau network solver where applicable. For SVL, m = 1.0,
ξ ∼ N (0, 0.1), and γ = 0.9. In all cases, a fixed time step of imity gap metric to evaluate performance. The proximity
∆t = 0.1 is used with annealing time length T = 1000. gap provides a measure of the distance between any solu-
tion state and the best-known state by comparing their
objective values. More precisely, it is defined as the ra-
through its gain-based annealing strategy in higher- tio of the objective value obtained by a given method to
dimensional spaces. In Figs. (5) and (6), we compare the best objective value found among all methods. Here,
CoVeGA to these models and the Broyden-Fletcher- the objective value is given by the XY Hamiltonian HXY .
Goldfarb-Shanno (BFGS) algorithm on 4-regular Möbius When comparing optimization methods on basic Kura-
ladder graphs and triangular lattice graphs, respectively. towskian graphs, we choose p = 0.1, the hardest value of
BFGS is a classical quasi-Newton method for uncon- p, which is the probability that a coupling is set to zero.
strained nonlinear optimization that approximates the From Fig. (3)(f), we see that this occurs when p = 0.1,
Hessian matrix to guide the search for a local minimum. where the average sample variance s2 of Kuramoto runs
By leveraging a multidimensional approach, the CoV- is maximized.
eGA model recovers the ground state (D = 0) of 4-regular Figure (7)(a) compares the objective values achieved
Möbius ladder and triangular lattice graphs more often by CoVeGA with those obtained by one-dimensional
than any of the compared methods. As a result, the dis- Stuart-Landau, SVL, Kuramoto, and BFGS methods
tribution of excitations D for CoVeGA is skewed closer for N = 64 and N = 144 basic Kuratowskian graphs.
to D = 0 for all tested values of N , compared to the We define the quality improvement of CoVeGA over
single-dimension Stuart-Landau network, and this effect another method X in terms of objective values O as
becomes more pronounced as N increases. While the an- (OX − OCoVeGA )/OCoVeGA , showcasing these metrics in
nealing schedules used in these approaches and in SVL Fig. (7)(b), where X represents the best-performing com-
demonstrate efficient convergence to low-energy solutions peting method for each instance.
compared to Kuramoto and BFGS, CoVeGA’s higher di- These results demonstrate that CoVeGA consis-
mensionality enables XY spins to traverse paths connect- tently achieves ground states more reliably than one-
ing minima and converge to the ground state. This en- dimensional models, particularly in larger and more com-
hances system robustness, making the final system less plex systems.
sensitive to initial conditions.
Next, we consider basic Kuratowskian graphs, which
exhibit significant difficulty for standard gradient descent V. CONCLUSIONS
methods, as illustrated in Fig. (3) for small values of the
parameter p. However, the randomness of the couplings This paper presents the Complex Vector Gain-Based
can introduce statistical convergence issues in small sam- Annealer (CoVeGA). This approach combines multidi-
ple sizes. Despite this, graphs with random couplings are mensional continuous spin systems, gain-based oper-
popular for benchmarking physical simulators [45–47]. ations, and soft-amplitude annealing to optimize XY
Because basic Kuratowskian graph instances do not Hamiltonians on challenging graph structures. By ex-
have analytically known ground states, we use the prox- ploiting higher-dimensional spaces, CoVeGA improves
7
with reservoir dynamics ṅR = −(b0 +b1 ∣ψi ∣2 )nR +Pi , where
g, b0 , and b1 are dimensionless parameters, and Pi is
the pumping intensity. Networks of oscillators are con-
structed by coupling multiple lasers or condensates. In
this case, using the tight-binding approximation from
the mean-field complex Ginzburg-Landau equation, or
the mean-field Maxwell-Bloch equations for laser cavi-
ties, Eq. (A1) becomes
⎡ ⎤
⎢ ⎥
ψ̇i = −i∣ψi ∣2 ψi − ψi + (1 − ig) ⎢Ri ψi + α ∑ Jij ψj ⎥ , (A2)
⎢ j≠i
⎥
⎣ ⎦
0.1. For CoVeGA, a linear annealing schedule is cho- N (0, 0.1). The Kuramoto model is implemented with-
sen such that P (t) = βt. Each time evolution is to a out hyperparameters, given by Eq. (6) with α = 1. The
maximum time T = 1000. The total parameter space BFGS algorithm is ran with the SciPy library using the
for CoVeGA is {ε, α, β, γ(0)}, with optimal choices de- scipy.optimize.minimize function.
duced by the Python machine-learner online optimiza-
tion package (M-LOOP) [43]. The hyperparameter α is Graph Type N ε α′ β γ(0)
scaled by the inverse of the largest positive eigenvalue of 12 0.032 2.882 0.020 −0.139
coupling matrix J such that α = α′ /λmax , and the initial 4-Regular Möbius 102 0.022 2.402 0.003 −0.453
effective gain is γi (0) = γ(0) for all i. For a selection
of graph sizes for 4-regular Möbius ladder graphs, tri- 204 0.002 2.529 0.008 −1.235
angular lattice graphs, and basic Kuratowskian graphs, 16 0.045 1.129 0.018 −0.900
the optimal parameter choices are detailed in Table (I). Triangular 100 0.034 2.280 0.004 −0.526
Since the value of λmax varies for random instances of 196 0.010 1.301 0.005 −0.986
basic Kuratowskian graphs, the values of α′ in Table (I) 16 0.047 1.148 0.016 −0.295
for this case are chosen as the best over a sample of 100 Kuratowskian 64 0.023 2.763 0.007 −1.275
p = 0.1 graph instances.
144 0.012 2.745 0.014 −1.285
For results obtained using the one-dimensional Stuart-
Landau network given by Eq. (1), parameters ε, α, and Table I. Optimal sets of hyperparameters for different graphs
γ(0) are taken from Table (I). For SVL we fix m = 1.0, and sizes.
b = 0.9, and the Gaussian noise ξ is sampled from
[1] M. W. Johnson, M. H. Amin, S. Gildert, T. Lanting, son, V. Čeperić, et al., Nature Communications 11, 249
F. Hamze, N. Dickson, R. Harris, A. J. Berkley, J. Jo- (2020).
hansson, P. Bunyk, et al., Nature 473, 194 (2011). [14] K. Kim, M.-S. Chang, S. Korenblit, R. Islam, E. E. Ed-
[2] V. S. Denchev, S. Boixo, S. V. Isakov, N. Ding, R. Bab- wards, J. K. Freericks, G.-D. Lin, L.-M. Duan, and
bush, V. Smelyanskiy, J. Martinis, and H. Neven, Phys- C. Monroe, Nature 465, 590 (2010).
ical Review X 6, 031015 (2016). [15] N. G. Berloff, M. Silva, K. Kalinin, A. Askitopoulos,
[3] F. Arute, K. Arya, R. Babbush, D. Bacon, J. C. Bardin, J. D. Töpfer, P. Cilibrizzi, W. Langbein, and P. G.
R. Barends, R. Biswas, S. Boixo, F. G. Brandao, D. A. Lagoudakis, Nature Materials (2017).
Buell, et al., Nature 574, 505 (2019). [16] K. P. Kalinin, A. Amo, J. Bloch, and N. G. Berloff,
[4] P. L. McMahon, A. Marandi, Y. Haribara, R. Hamerly, Nanophotonics 9, 4127 (2020).
C. Langrock, S. Tamate, T. Inagaki, H. Takesue, S. Ut- [17] M. Vretenar, B. Kassenberg, S. Bissesar, C. Toebes, and
sunomiya, K. Aihara, et al., Science 354, 614 (2016). J. Klaers, Physical Review Research 3, 023167 (2021).
[5] T. Inagaki, Y. Haribara, K. Igarashi, T. Sonobe, S. Ta- [18] A. Litvinenko, R. Khymyn, R. Ovcharov, and
mate, T. Honjo, A. Marandi, P. L. McMahon, T. Umeki, J. Åkerman, arXiv preprint arXiv:2311.06830 (2023).
K. Enbutsu, et al., Science 354, 603 (2016). [19] A. Lucas, Frontiers in Physics 2, 5 (2014).
[6] Y. Yamamoto, K. Aihara, T. Leleu, K.-i. Kawarabayashi, [20] M. Yamaoka, C. Yoshimura, M. Hayashi, T. Okuyama,
S. Kako, M. Fejer, K. Inoue, and H. Takesue, npj Quan- H. Aoki, and H. Mizuno, IEEE Journal of Solid-State
tum Information 3, 1 (2017). Circuits 51, 303 (2015).
[7] T. Honjo, T. Sonobe, K. Inaba, T. Inagaki, T. Ikuta, [21] K. Tanahashi, S. Takayanagi, T. Motohashi, and
Y. Yamada, T. Kazama, K. Enbutsu, T. Umeki, R. Kasa- S. Tanaka, Journal of the Physical Society of Japan 88,
hara, et al., Science Advances 7, eabh0952 (2021). 061010 (2019).
[8] F. Cai, S. Kumar, T. Van Vaerenbergh, X. Sheng, R. Liu, [22] A. Momeni, B. Rahmani, B. Scellier, L. G. Wright, P. L.
C. Li, Z. Liu, M. Foltin, S. Yu, Q. Xia, et al., Nature McMahon, C. C. Wanjura, Y. Li, A. Skalli, N. G. Berloff,
Electronics 3, 409 (2020). T. Onodera, et al., arXiv preprint arXiv:2406.03372
[9] M. Babaeian, D. T. Nguyen, V. Demir, M. Akbulut, (2024).
P.-A. Blanche, Y. Kaneda, S. Guha, M. A. Neifeld, [23] M. Gilli, D. Maringer, and E. Schumann, Numerical
and N. Peyghambarian, Nature Communications 10, 1 Methods and Optimization in Finance (Academic Press,
(2019). 2019).
[10] V. Pal, S. Mahler, C. Tradonsky, A. A. Friesem, and [24] R. Z. Wang, J. S. Cummins, M. Syed, N. Stroev,
N. Davidson, Physical Review Research 2, 033008 (2020). G. Pastras, J. Sakellariou, S. Tsintzos, A. Askitopou-
[11] M. Parto, W. Hayenga, A. Marandi, D. N. los, D. Veraldi, M. C. Strinati, et al., arXiv preprint
Christodoulides, and M. Khajavikhan, Nature Ma- arXiv:2406.01400 (2024).
terials 19, 725 (2020). [25] J. M. Kosterlitz and D. J. Thouless, in Basic Notions Of
[12] D. Pierangeli, G. Marcucci, and C. Conti, Physical Re- Condensed Matter Physics (CRC Press, 2018) pp. 493–
view Letters 122, 213902 (2019). 515.
[13] C. Roques-Carmes, Y. Shen, C. Zanoci, M. Prabhu, [26] P. Minnhagen, Reviews of Modern Physics 59, 1001
F. Atieh, L. Jing, T. Dubček, C. Mao, M. R. John- (1987).
10
[27] W. H. Zurek, in Formation and Interactions of Topo- vokin, N. Berloff, and P. Lagoudakis, Physical Review
logical Defects: Proceedings of a NATO Advanced Study X 6, 031032 (2016).
Institute on Formation and Interactions of Topological [39] M. C. Strinati, D. Pierangeli, and C. Conti, Physical
Defects (Springer, 1995) pp. 349–378. Review Applied 16, 054022 (2021).
[28] P.-G. De Gennes and J. Prost, The Physics of Liquid [40] A. B. Ayoub and D. Psaltis, Scientific Reports 11, 1
Crystals, 83 (Oxford University Press, 1993). (2021).
[29] D. Gallina and G. Pastor, Physical Review X 10, 021068 [41] K. P. Kalinin and N. G. Berloff, arXiv preprint
(2020). arXiv:2008.00466 (2020).
[30] K. A. Dill, S. B. Ozkan, M. S. Shell, and T. R. Weikl, [42] S. Istrail, in Proceedings of the thirty-second annual ACM
Annu. Rev. Biophys. 37, 289 (2008). symposium on Theory of computing (2000) pp. 87–96.
[31] E. J. Candes, Y. C. Eldar, T. Strohmer, and V. Voronin- [43] P. B. Wigley, P. J. Everitt, A. van den Hengel, J. W.
ski, SIAM Review 57, 225 (2015). Bastian, M. A. Sooriyabandara, G. D. McDonald, K. S.
[32] M. Syed and N. G. Berloff, IEEE Journal of Selected Hardman, C. D. Quinlivan, P. Manju, C. C. Kuhn, et al.,
Topics in Quantum Electronics 29, 1 (2023). Scientific Reports 6, 25890 (2016).
[33] J. S. Cummins, H. Salman, and N. G. Berloff, arXiv [44] D. Subires, F. J. Gómez-Ruiz, A. Ruiz-Garcı́a, D. Alonso,
preprint arXiv:2311.17359 (2023). and A. Del Campo, Physical Review Research 4, 023104
[34] J. S. Cummins and N. G. Berloff, arXiv preprint (2022).
arXiv:2403.16608 (2024). [45] R. Hamerly, T. Inagaki, P. L. McMahon, D. Venturelli,
[35] K. P. Kalinin and N. G. Berloff, Scientific Reports 8, A. Marandi, T. Onodera, E. Ng, C. Langrock, K. Inaba,
17791 (2018). T. Honjo, et al., Science Advances 5, eaau0823 (2019).
[36] E. Boros and P. L. Hammer, Discrete Applied Mathe- [46] M. P. Harrigan, K. J. Sung, M. Neeley, K. J. Satzinger,
matics 123, 155 (2002). F. Arute, K. Arya, J. Atalaya, J. C. Bardin, R. Barends,
[37] N. Stroev and N. G. Berloff, Physical Review Letters S. Boixo, et al., Nature Physics 17, 332 (2021).
126, 050504 (2021). [47] F. Böhm, G. Verschaffelt, and G. Van der Sande, Nature
[38] H. Ohadi, R. Gregory, T. Freegarde, Y. Rubo, A. Ka- Communications 10, 1 (2019).