<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="en"><front><journal-meta><journal-id journal-id-type="publisher-id">najo</journal-id><journal-title-group><journal-title xml:lang="en">Nanosystems: Physics, Chemistry, Mathematics</journal-title><trans-title-group xml:lang="ru"><trans-title>Наносистемы: физика, химия, математика</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">2220-8054</issn><issn pub-type="epub">2305-7971</issn><publisher><publisher-name>Университет ИТМО</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.17586/2220-8054-2023-14-6-613-625</article-id><article-id custom-type="elpub" pub-id-type="custom">najo-124</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>MATHEMATICS</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>МАТЕМАТИКА</subject></subj-group></article-categories><title-group><article-title>Toward nanomagnetic implementation of energy-based machine learning</article-title><trans-title-group xml:lang="ru"><trans-title>На пути к наномагнитным реализациям энергетических моделей машинного обучения</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0001-8789-3267</contrib-id><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Лобанов</surname><given-names>И. С.</given-names></name><name name-style="western" xml:lang="en"><surname>Lobanov</surname><given-names>I. S.</given-names></name></name-alternatives><bio xml:lang="en"><p>Igor S. Lobanov</p><p>Lomonosova Str. 9, Saint Petersburg, 191002</p></bio><email xlink:type="simple">lobanov.igor@gmail.com</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff xml:lang="en" id="aff-1"><institution>Faculty of Physics, ITMO University</institution><country>Russian Federation</country></aff><pub-date pub-type="collection"><year>2023</year></pub-date><pub-date pub-type="epub"><day>02</day><month>06</month><year>2025</year></pub-date><volume>14</volume><issue>6</issue><fpage>613</fpage><lpage>625</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Lobanov I.S., 2025</copyright-statement><copyright-year>2025</copyright-year><copyright-holder xml:lang="ru">Лобанов И.С.</copyright-holder><copyright-holder xml:lang="en">Lobanov I.S.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://nanojournal.ifmo.ru/jour/article/view/124">https://nanojournal.ifmo.ru/jour/article/view/124</self-uri><abstract><p>Some approaches to machine learning (ML) such as Boltzmann machines (BM) can be reformulated as energy based models, which are famous for being trained by minimization of free energy. In the standard contrastive divergence (CD) learning the model parameters optimization is driven by competition of relaxation forces appearing in the target system and the model one. It is tempting to implement a physical device having natural relaxation dynamics matching minimization of the loss function of the ML model. In the article, we propose a general approach for the design of such devices. We systematically reduce the BM, the restricted BM and BM for classification problems to energy based models. For each model we describe a device capable of learning model parameters by relaxation. We compare simulated dynamics of the models using CD, Monte-Carlo method and Langevin dynamics. Benchmarks of the proposed devices on generation and classification of hand-written digits from MNIST dataset are provided.</p></abstract><trans-abstract xml:lang="ru"><p>Некоторые подходы к машинному обучению (МО), такие как машины Больцмана (МБ),  могут быть переформулированы в виде моделей, обучение которых основано на минимизации свободной энергии. При обучении таких моделей стандартным методом контрастивной дивергенции (КД) динамика параметров модели обусловлена конкуренцией релаксационных сил в целевой системе и в модельной. В статье мы предлагаем общий подход к созданию физических устройств, релаксационная динамика которых соответствует минимизации функции потерь соответствующей модели МО. Мы систематически сводим модель МБ, ограниченной МБ и МБ для задач классификации к энергетическим моделям. Для каждой модели мы описываем устройство, обучающееся путем релаксации. Моделирование релаксационной динамики проведено методами КД, Монте-Карло и на основе динамики Ланжевена. Приводятся результаты симуляции предлагаемых устройств для решения задач генерации и классификации рукописных цифр из набора данных MNIST.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>Машинное обучение</kwd><kwd>машина Больцмана</kwd><kwd>энергетические модели</kwd><kwd>диссипативное обучение</kwd></kwd-group><kwd-group xml:lang="en"><kwd>Machine learning</kwd><kwd>Boltzmann machine</kwd><kwd>energy based model</kwd><kwd>dissipative training</kwd></kwd-group><funding-group><funding-statement xml:lang="en">The work is supported by Russian Science Foundation grant 22-22-00565: https://rscf.ru/en/project/22-22-00565/</funding-statement></funding-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Min B., Ross H., Sulem E., et al. Recent Advances in Natural Language Processing via Large Pre-trained Language Models: A Survey. ACM Computing Surveys, 2023, 56(2), P. 1–40.</mixed-citation><mixed-citation xml:lang="en">Min B., Ross H., Sulem E., et al. Recent Advances in Natural Language Processing via Large Pre-trained Language Models: A Survey. ACM Computing Surveys, 2023, 56(2), P. 1–40.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Chen Y., Nazhamaiti M., Xu H., et al. All-analog photoelectronic chip for high-speed vision tasks. Nature, 2023, 623(7985), P. 48–57.</mixed-citation><mixed-citation xml:lang="en">Chen Y., Nazhamaiti M., Xu H., et al. All-analog photoelectronic chip for high-speed vision tasks. Nature, 2023, 623(7985), P. 48–57.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Grollier J., Querlioz D., Camsari K.Y., et al. Neuromorphic spintronics. Nature Electronics, 2020, 3(7), P. 360–370.</mixed-citation><mixed-citation xml:lang="en">Grollier J., Querlioz D., Camsari K.Y., et al. Neuromorphic spintronics. Nature Electronics, 2020, 3(7), P. 360–370.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Li S., Kang W., Zhang X., et al. Magnetic skyrmions for unconventional computing. Materials Horizons, 2021, 8(3), P. 854–868.</mixed-citation><mixed-citation xml:lang="en">Li S., Kang W., Zhang X., et al. Magnetic skyrmions for unconventional computing. Materials Horizons, 2021, 8(3), P. 854–868.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Yokouchi T., Sugimoto S., Rana B., et al. Pattern recognition with neuromorphic computing using magnetic fieldinduced dynamics of skyrmions. Science Advances, 2022, 8(39), P. eabq5652.</mixed-citation><mixed-citation xml:lang="en">Yokouchi T., Sugimoto S., Rana B., et al. Pattern recognition with neuromorphic computing using magnetic fieldinduced dynamics of skyrmions. Science Advances, 2022, 8(39), P. eabq5652.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Gu K., Guan Y., Hazra B.K., et al. Three-dimensional racetrack memory devices designed from free- standing magnetic heterostructures. Nature Nanotechnology, 2022, 17(10), P. 1065–1071.</mixed-citation><mixed-citation xml:lang="en">Gu K., Guan Y., Hazra B.K., et al. Three-dimensional racetrack memory devices designed from free- standing magnetic heterostructures. Nature Nanotechnology, 2022, 17(10), P. 1065–1071.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Blasing R., Khan A.A., Filippou P.C., et al. Magnetic Racetrack Memory: From Physics to the Cusp of ApplicationsWithin a Decade. Proceedings of the IEEE, 2020, 108(8), P. 1303–1321.</mixed-citation><mixed-citation xml:lang="en">Blasing R., Khan A.A., Filippou P.C., et al. Magnetic Racetrack Memory: From Physics to the Cusp of ApplicationsWithin a Decade. Proceedings of the IEEE, 2020, 108(8), P. 1303–1321.</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Marullo C., Agliari E. Boltzmann Machines as Generalized Hopfield Networks: A Review of Recent Results and Outlooks. Entropy, 2020, 23(1), 34 pp.</mixed-citation><mixed-citation xml:lang="en">Marullo C., Agliari E. Boltzmann Machines as Generalized Hopfield Networks: A Review of Recent Results and Outlooks. Entropy, 2020, 23(1), 34 pp.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Khater A., Abou Ghantous M. Magnetic properties of 2D nano-islands I: Ising spin model. Journal of Magnetism and Magnetic Materials, 2011, 323(22), P. 2717–2726.</mixed-citation><mixed-citation xml:lang="en">Khater A., Abou Ghantous M. Magnetic properties of 2D nano-islands I: Ising spin model. Journal of Magnetism and Magnetic Materials, 2011, 323(22), P. 2717–2726.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Yang S., Shin J., Kim T., et al. Integrated neuromorphic computing networks by artificial spin synapses and spin neurons. NPG Asia Materials, 2021, 13(1), P. 1–10.</mixed-citation><mixed-citation xml:lang="en">Yang S., Shin J., Kim T., et al. Integrated neuromorphic computing networks by artificial spin synapses and spin neurons. NPG Asia Materials, 2021, 13(1), P. 1–10.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Dixit V., Selvarajan R., Alam M.A., et al. Training Restricted Boltzmann MachinesWith a D-Wave Quantum Annealer. Frontiers in Physics, 2021, 9, 589626, 10 pp.</mixed-citation><mixed-citation xml:lang="en">Dixit V., Selvarajan R., Alam M.A., et al. Training Restricted Boltzmann MachinesWith a D-Wave Quantum Annealer. Frontiers in Physics, 2021, 9, 589626, 10 pp.</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Lecun Y., Chopra S., Hadsell R., et al. A tutorial on energy-based learning. In G. Bakir, T. Hofman, B. Scholkopt, A. Smola, B. Taskar, editors, Predicting structured data. MIT Press. 2006.</mixed-citation><mixed-citation xml:lang="en">Lecun Y., Chopra S., Hadsell R., et al. A tutorial on energy-based learning. In G. Bakir, T. Hofman, B. Scholkopt, A. Smola, B. Taskar, editors, Predicting structured data. MIT Press. 2006.</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Dayan P., Hinton G.E., Neal R.M., et al. The Helmholtz Machine. 1999. Neural Comput., 1995, 7(5), P. 889–904.</mixed-citation><mixed-citation xml:lang="en">Dayan P., Hinton G.E., Neal R.M., et al. The Helmholtz Machine. 1999. Neural Comput., 1995, 7(5), P. 889–904.</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Teh Y.W., Welling M., Osindero S., et al. Energy-Based Models for Sparse Overcomplete Representa- tions. Journal of Machine Learning Research, 2003, 4, P. 1235–1260.</mixed-citation><mixed-citation xml:lang="en">Teh Y.W., Welling M., Osindero S., et al. Energy-Based Models for Sparse Overcomplete Representa- tions. Journal of Machine Learning Research, 2003, 4, P. 1235–1260.</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Du Y., Mordatch I. Implicit Generation and Modeling with Energy Based Models. In Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc.</mixed-citation><mixed-citation xml:lang="en">Du Y., Mordatch I. Implicit Generation and Modeling with Energy Based Models. In Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc.</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Lobanov I. Spin Boltzmann machine. Nanosystems: Physics, Chemistry, Mathematics, 2022, 13(6), P. 593–607.</mixed-citation><mixed-citation xml:lang="en">Lobanov I. Spin Boltzmann machine. Nanosystems: Physics, Chemistry, Mathematics, 2022, 13(6), P. 593–607.</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">Kiraly B., Knol E.J., Van Weerdenburg W.M.J., et al. An atomic Boltzmann machine capable of self- adaption. Nature Nanotechnology, 2021, 16(4), P. 414–420.</mixed-citation><mixed-citation xml:lang="en">Kiraly B., Knol E.J., Van Weerdenburg W.M.J., et al. An atomic Boltzmann machine capable of self- adaption. Nature Nanotechnology, 2021, 16(4), P. 414–420.</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">Aksenova E., Dobrun L., Kovshik A., et al. Magnetic Field-Induced Macroscopic Alignment of Liquid- Crystalline Lanthanide Complexes. Crystals, 2019, 9(10), P. 499.</mixed-citation><mixed-citation xml:lang="en">Aksenova E., Dobrun L., Kovshik A., et al. Magnetic Field-Induced Macroscopic Alignment of Liquid- Crystalline Lanthanide Complexes. Crystals, 2019, 9(10), P. 499.</mixed-citation></citation-alternatives></ref><ref id="cit19"><label>19</label><citation-alternatives><mixed-citation xml:lang="ru">Yair O., Michaeli T. Contrastive Divergence Learning is a Time Reversal Adversarial Game. URL/arXiv: https://doi.org/10.48550/arXiv.2012.03295.</mixed-citation><mixed-citation xml:lang="en">Yair O., Michaeli T. Contrastive Divergence Learning is a Time Reversal Adversarial Game. URL/arXiv: https://doi.org/10.48550/arXiv.2012.03295.</mixed-citation></citation-alternatives></ref><ref id="cit20"><label>20</label><citation-alternatives><mixed-citation xml:lang="ru">Colombo S., Pedrozo-Peafiel E., Adiyatullin A.F., et al. Time-reversal-based quantum metrology with many-body entangled states. Nature Physics, 2022, 18(8), P. 925–930.</mixed-citation><mixed-citation xml:lang="en">Colombo S., Pedrozo-Peafiel E., Adiyatullin A.F., et al. Time-reversal-based quantum metrology with many-body entangled states. Nature Physics, 2022, 18(8), P. 925–930.</mixed-citation></citation-alternatives></ref><ref id="cit21"><label>21</label><citation-alternatives><mixed-citation xml:lang="ru">Purcell E.M., Pound R.V. A Nuclear Spin System at Negative Temperature. Physical Review, 1951, 81(2), P. 279–280.</mixed-citation><mixed-citation xml:lang="en">Purcell E.M., Pound R.V. A Nuclear Spin System at Negative Temperature. Physical Review, 1951, 81(2), P. 279–280.</mixed-citation></citation-alternatives></ref><ref id="cit22"><label>22</label><citation-alternatives><mixed-citation xml:lang="ru">Hakonen P., Lounasmaa O.V. Negative Absolute Temperatures: “Hot” Spins in Spontaneous Magnetic Order. Science, 1994, 265(5180), P. 1821– 1825.</mixed-citation><mixed-citation xml:lang="en">Hakonen P., Lounasmaa O.V. Negative Absolute Temperatures: “Hot” Spins in Spontaneous Magnetic Order. Science, 1994, 265(5180), P. 1821– 1825.</mixed-citation></citation-alternatives></ref><ref id="cit23"><label>23</label><citation-alternatives><mixed-citation xml:lang="ru">Medley P., Weld D.M., Miyake H., et al. Spin Gradient Demagnetization Cooling of Ultracold Atoms. Physical Review Letters, 2011, 106(19), 195301, 4 pp.</mixed-citation><mixed-citation xml:lang="en">Medley P., Weld D.M., Miyake H., et al. Spin Gradient Demagnetization Cooling of Ultracold Atoms. Physical Review Letters, 2011, 106(19), 195301, 4 pp.</mixed-citation></citation-alternatives></ref><ref id="cit24"><label>24</label><citation-alternatives><mixed-citation xml:lang="ru">Ramsey N.F. Thermodynamics and Statistical Mechanics at Negative Absolute Temperatures. Physical Review, 1956, 103(1), P. 20–28.</mixed-citation><mixed-citation xml:lang="en">Ramsey N.F. Thermodynamics and Statistical Mechanics at Negative Absolute Temperatures. Physical Review, 1956, 103(1), P. 20–28.</mixed-citation></citation-alternatives></ref><ref id="cit25"><label>25</label><citation-alternatives><mixed-citation xml:lang="ru">Baldovin M., Iubini S., Livi R., et al. Statistical mechanics of systems with negative temperature. Physics Reports, 2021, 923, P. 1–50.</mixed-citation><mixed-citation xml:lang="en">Baldovin M., Iubini S., Livi R., et al. Statistical mechanics of systems with negative temperature. Physics Reports, 2021, 923, P. 1–50.</mixed-citation></citation-alternatives></ref><ref id="cit26"><label>26</label><citation-alternatives><mixed-citation xml:lang="ru">Deng L. The mnist database of handwritten digit images for machine learning research. IEEE Signal Processing Magazine, 2012, 29(6), P. 141–142.</mixed-citation><mixed-citation xml:lang="en">Deng L. The mnist database of handwritten digit images for machine learning research. IEEE Signal Processing Magazine, 2012, 29(6), P. 141–142.</mixed-citation></citation-alternatives></ref><ref id="cit27"><label>27</label><citation-alternatives><mixed-citation xml:lang="ru">Hoffmann M., Blgel S. Systematic derivation of realistic spin models for beyond-Heisenberg solids. Physical Review B, 2020, 101(2), P. 024418.</mixed-citation><mixed-citation xml:lang="en">Hoffmann M., Blgel S. Systematic derivation of realistic spin models for beyond-Heisenberg solids. Physical Review B, 2020, 101(2), P. 024418.</mixed-citation></citation-alternatives></ref><ref id="cit28"><label>28</label><citation-alternatives><mixed-citation xml:lang="ru">Yoshioka N., Akagi Y., Katsura H. Transforming generalized Ising models into Boltzmann machines. Physical Review E, 2019, 99(3), P. 032113.</mixed-citation><mixed-citation xml:lang="en">Yoshioka N., Akagi Y., Katsura H. Transforming generalized Ising models into Boltzmann machines. Physical Review E, 2019, 99(3), P. 032113.</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
