Feedforward beta control in the KSTAR tokamak by deep reinforcement learning

被引：47

作者：

Seo, Jaemin ^{[1
]}

Na, Y. S. ^{[1
]}

Kim, B. ^{[1
]}

Lee, C. Y. ^{[1
]}

Park, M. S. ^{[1
]}

Park, S. J. ^{[1
]}

Lee, Y. H. ^{[2
]}

机构：

[1] Seoul Natl Univ, Dept Nucl Engn, Seoul, South Korea

[2] Korea Inst Fus Energy, Daejeon, South Korea

来源：

NUCLEAR FUSION | 2021年 / 61卷 / 10期

基金：

新加坡国家研究基金会;

关键词：

machine learning; reinforcement learning; beta control; data-driven simulation; KSTAR; tokamak; GENERAL AXISYMMETRICAL EQUILIBRIA; RECONSTRUCTION; PARAMETERS; PLASMAS;

D O I：

10.1088/1741-4326/ac121b

中图分类号：

O35 [流体力学]; O53 [等离子体物理学];

学科分类号：

070204 ; 080103 ; 080704 ;

摘要：

In this work, we address a new feedforward control scheme for the normalized beta (beta (N)) in tokamak plasmas, using the deep reinforcement learning (RL) technique. The deep RL algorithm optimizes an artificial decision-making agent that adjusts the discharge scenario to obtain a given target beta (N) from the state-action-reward sets explored by its own trial and error in a virtual tokamak environment. The virtual environment for the RL training is constructed using a long short-term memory (LSTM) network that imitates the plasma responses to external actuator controls, which is trained using five years' worth of KSTAR experimental data. The RL agent then experiences numerous discharges with different actuator controls in the LSTM simulator, and its internal parameters are optimized in the direction of maximizing the reward. We analyze a series of KSTAR experiments conducted with the RL-determined scenarios to validate the feasibility of the beta control scheme in a real device. We discuss the successes and limitations of feedforward beta control by RL, and suggest a future research path for this area of study.

引用

页数：14

共 51 条

[31]

Krogh A., 1995, Advances in Neural Information Processing Systems 7, P231

[32] RECONSTRUCTION OF CURRENT PROFILE PARAMETERS AND PLASMA SHAPES IN TOKAMAKS [J].

LAO, LL ;

STJOHN, H ;

STAMBAUGH, RD ;

KELLMAN, AG ;

PFEIFFER, W .

NUCLEAR FUSION, 1985, 25 (11) :1611-1622

[33] Development of integrated suite of codes and its validation on KSTAR [J].

Lee, C. Y. ;

Seo, J. ;

Park, S. J. ;

Lee, J. G. ;

Kim, S. K. ;

Kim, B. ;

Byun, C. S. ;

Lee, Y. S. ;

Gwak, J. W. ;

Kang, J. ;

Jung, L. ;

Kim, H. -S. ;

Hong, S. -H. ;

Na, Yong-Su .

NUCLEAR FUSION, 2021, 61 (09)

[34] Development of advanced inductive scenarios for ITER [J].

Luce, T. C. ;

Challis, C. D. ;

Ide, S. ;

Joffrin, E. ;

Kamada, Y. ;

Politzer, P. A. ;

Schweinzer, J. ;

Sips, A. C. C. ;

Stober, J. ;

Giruzzi, G. ;

Kessel, C. E. ;

Murakami, M. ;

Na, Y. -S. ;

Park, J. M. ;

Polevoi, A. R. ;

Budny, R. V. ;

Citrin, J. ;

Garcia, J. ;

Hayashi, N. ;

Hobirk, J. ;

Hudson, B. F. ;

Imbeaux, F. ;

Isayama, A. ;

McDonald, D. C. ;

Nakano, T. ;

Oyama, N. ;

Parail, V. V. ;

Petrie, T. W. ;

Petty, C. C. ;

Suzuki, T. ;

Wade, M. R. .

NUCLEAR FUSION, 2014, 54 (01)

[35] High performance stationary discharges in the DIII-D tokamak [J].

Luce, TC ;

Wade, MR ;

Ferron, JR ;

Politzer, PA ;

Hyatt, AW ;

Sips, ACC ;

Murakami, M .

PHYSICS OF PLASMAS, 2004, 11 (05) :2627-2636

[36] Self-consistent core-pedestal transport simulations with neural network accelerated models [J].

Meneghini, O. ;

Smith, S. P. ;

Snyder, P. B. ;

Staebler, G. M. ;

Candy, J. ;

Belli, E. ;

Lao, L. ;

Kostuk, M. ;

Luce, T. ;

Luda, T. ;

Park, J. M. ;

Poli, F. .

NUCLEAR FUSION, 2017, 57 (08)

[37] Modeling of transport phenomena in tokamak plasmas with neural networks [J].

Meneghini, O. ;

Luna, C. J. ;

Smith, S. P. ;

Lao, L. L. .

PHYSICS OF PLASMAS, 2014, 21 (06)

[38] Human-level control through deep reinforcement learning [J].

Mnih, Volodymyr ;

Kavukcuoglu, Koray ;

Silver, David ;

Rusu, Andrei A. ;

Veness, Joel ;

Bellemare, Marc G. ;

Graves, Alex ;

Riedmiller, Martin ;

Fidjeland, Andreas K. ;

Ostrovski, Georg ;

Petersen, Stig ;

Beattie, Charles ;

Sadik, Amir ;

Antonoglou, Ioannis ;

King, Helen ;

Kumaran, Dharshan ;

Wierstra, Daan ;

Legg, Shane ;

Hassabis, Demis .

NATURE, 2015, 518 (7540) :529-533

[39] Identification of models for current profile modification in ASDEX upgrade [J].

Na, Yong-Su ;

Sips, A. C. C. ;

Treutterer, W. .

FUSION SCIENCE AND TECHNOLOGY, 2006, 50 (04) :490-502

[40] On hybrid scenarios in KSTAR [J].

Na, Yong-Su ;

Lee, Y. H. ;

Byun, C. S. ;

Kim, S. K. ;

Lee, C. Y. ;

Park, M. S. ;

Yang, S. M. ;

Kim, B. ;

Jeon, Y. -M. ;

Choi, G. J. ;

Citrin, J. ;

Juhn, J. W. ;

Kang, J. S. ;

Kim, H. -S. ;

Kim, J. H. ;

Ko, W. H. ;

Kwon, J. -M. ;

Lee, W. C. ;

Woo, M. H. ;

Yi, S. ;

Yoon, S. W. ;

Yun, G. S. .

NUCLEAR FUSION, 2020, 60 (08)

← 1 2 3 4 5 6 →