ログイン
Language:

WEKO3

  • トップ
  • ランキング
To
lat lon distance
To

Field does not validate



インデックスリンク

インデックスツリー

メールアドレスを入力してください。

WEKO

One fine body…

WEKO

One fine body…

アイテム

  1. 02 情報科学
  2. 01 学術雑誌論文

Scalable reinforcement learning for plant-wide control of vinyl acetate monomer process

http://hdl.handle.net/10061/0002000120
http://hdl.handle.net/10061/0002000120
e4c76e85-287c-4f31-a882-482b0734693a
アイテムタイプ 学術雑誌論文 / Journal Article(1)
公開日 2024-02-07
タイトル
タイトル Scalable reinforcement learning for plant-wide control of vinyl acetate monomer process
言語
言語 eng
キーワード
主題Scheme Other
主題 Chemical process control
キーワード
主題Scheme Other
主題 Reinforcement learning
キーワード
主題Scheme Other
主題 Vinyl acetate monomer
資源タイプ
資源タイプ journal article
アクセス権
アクセス権 open access
著者 Zhu, Lingwei

× Zhu, Lingwei

en Zhu, Lingwei

Search repository
Cui, Yunduan

× Cui, Yunduan

en Cui, Yunduan

Search repository
Takami, Go

× Takami, Go

en Takami, Go

Search repository
Kanokogi, Hiroaki

× Kanokogi, Hiroaki

en Kanokogi, Hiroaki

Search repository
松原, 崇充

× 松原, 崇充

WEKO 181
e-Rad_Researcher 20508056

ja 松原, 崇充

ja-Kana マツバラ, タカミツ

en Matsubara, Takamitsu

Search repository
抄録
内容記述タイプ Abstract
内容記述 This paper explores a reinforcement learning (RL) approach that designs automatic control strategies in a large-scale chemical process control scenario as the first step for leveraging an RL method to intelligently control real-world chemical plants. The huge number of units for chemical reactions as well as feeding and recycling the materials of a typical chemical process induces a vast amount of samples and subsequent prohibitive computation complexity in RL for deriving a suitable control policy due to high-dimensional state and action spaces. To tackle this problem, a novel RL algorithm: Factorial Fast-food Dynamic Policy Programming (FFDPP) is proposed. By introducing a factorial framework that efficiently factorizes the action space, Fast-food kernel approximation that alleviates the curse of dimensionality caused by the high dimensionality of state space, into Dynamic Policy Programming (DPP) that achieves stable learning even with insufficient samples. FFDPP is evaluated in a commercial chemical plant simulator for a Vinyl Acetate Monomer (VAM) process. Experimental results demonstrate that without any knowledge of the model, the proposed method successfully learned a stable policy with reasonable computation resources to produce a larger amount of VAM product with comparative performance to a state-of-the-art model-based control.
書誌情報 en : Control Engineering Practice

巻 97, 発行日 2020-02-10
出版者
出版者 Elsevier
ISSN
収録物識別子タイプ EISSN
収録物識別子 0967-0661
出版者版DOI
関連タイプ isReplacedBy
識別子タイプ DOI
関連識別子 https://doi.org/10.1016/j.conengprac.2020.104331
出版者版URI
関連タイプ isReplacedBy
識別子タイプ URI
関連識別子 https://www.sciencedirect.com/science/article/pii/S0967066120300186
権利
権利情報Resource http://creativecommons.org/licenses/by-nc-nd/4.0/
権利情報 c 2020 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license(http://creativecommons.org/licenses/by-nc-nd/4.0/).
著者版フラグ
出版タイプ NA
戻る
0
views
See details
Views

Versions

Ver.1 2024-02-07 09:49:45.509680
Show All versions

Share

Share
tweet

Cite as

Other

print

エクスポート

OAI-PMH
  • OAI-PMH JPCOAR 2.0
  • OAI-PMH JPCOAR 1.0
  • OAI-PMH DublinCore
  • OAI-PMH DDI
Other Formats
  • JSON
  • BIBTEX
  • ZIP

コミュニティ

確認

確認

確認


Powered by WEKO3


Powered by WEKO3