(629h) Continuous Control of a Polymerization System with Deep Reinforcement Learning

Conference

AIChE Annual Meeting

Year

2018

Proceeding

2018 AIChE Annual Meeting

Group

Computing and Systems Technology Division

Session

Modeling, Control, and Optimization of Manufacturing Systems

Time

Thursday, November 1, 2018 - 10:13am to 10:32am

Authors

Ma, Y. - Presenter, Louisiana State University

Zhu, W., Chemical Engineering Department, Louisiana State U

Benton, M. G., Louisiana State University

Romagnoli, J. A., Louisiana State University

Improving and deriving noble control laws for polymerization reaction has been one of the uppermost tasks in the chemical industry. There have been extensive studies in control industries from proportional controls to various forms of model predictive controls, which have been benefiting in current chemical processes and plants[1]. Recent breakthroughs in deep learning area have started to inspire the development of Artificial Intelligence(AI)-based controllers in chemical reaction control area thanks to its known success in wide-range applications from gaming to robotics control[2]. These applications utilize deep reinforcement learning (Deep RL), of which the ultimate goal is to enable computers to make human-like policies based on AIâ€™s exploration of the environment[3]. However, some challenges are still faced for applying Deep RL in controlling chemical reactions, since the inputs of chemical reactors are usually high-dimensional, and the system dynamics are often sensitive and has considerable time-delay effect[2].

Previous studies have been done in controlling a free-radical polymerization process by following real-time measurement of weight-average molar mass[1], where the specific molar mass distribution can be achieved by following a trajectory of weight-average molar mass with respect to time[1]. In this work, we developed a deep learning based controller for a free radical poly-acrylamide polymerization system using Deep Deterministic Policy Gradient (DDPG). The DDPG utilizes actor-critic structure that is able to predict actions of infinite dimensions[4]. The controller calculates the control action a_t, which consists of the monomer flow rate F_m and initiator flow rate F_i to adjust at each time step t, and the system response of the action is recorded as a state s_t. The network is trained to maximize the cumulative reward r that accounts for the distance between current output and target output for each iteration[5]. The network is trained on an established kinetic model of the polymerization reaction, and the controller gradually learns the policy through exploration of the system. Then convergence is achieved when the average cumulative reward reaches a desired threshold. In our experiment, the controller successfully has learned the control policy to follow the target trajectory of the weight-average molar mass.

Overall the smart controller has shown robust control over a range of operating conditions, which indicates the deep reinforcement learning based approachâ€™s capability in controlling a nonlinear dynamic semi-batch system.

Reference

[1] N. Ghadipasha, W. Zhu, J. A. Romagnoli, T. Mcafee, T. Zekoski, and W. F. Reed, â€œOnline Optimal Feedback Control of Polymerization Reactors : Application to Polymerization of Acrylamide âˆ’ Water âˆ’ Potassium Persulfate ( KPS ) System,â€ 2017.

[2] S. P. K. Spielberg, R. B. Gopaluni, and P. D. Loewen, â€œDeep Reinforcement Learning Approaches for Process Controlâ€, 2017

[3] V. Mnih, D. Silver, and M. Riedmiller, â€œPlaying Atari with Deep Reinforcement Learning,â€ pp. 1â€“9.

[4] T. P. Lillicrap et al., â€œContinuous learning control with deep reinforcement learning,â€ , 2016.

[5] D. Silver, G. Lever, D. Technologies, G. U. Y. Lever, and U. C. L. Ac, â€œDeterministic Policy Gradient Algorithms.â€

Topics

Process Automation & Control

Checkout

This paper has an Extended Abstract file available; you must purchase the conference proceedings to access it.

Checkout

Do you already own this?

Pricing

Individuals

AIChE Pro Members	$150.00
AIChE Graduate Student Members	Free
AIChE Undergraduate Student Members	Free
AIChE Explorer Members	$225.00
Non-Members	$225.00

Technical

Professional/Personal Growth

Societal Needs

Leadership

2024 mRNA Technology Conference

5th Engineering Cosmetics and Consumer Products Conference

Upcoming Conferences & Events

2024 DIERS Virtual Spring Meeting

2024 Pacific Northwest Student Regional Conference

2024 Western Student Regional Conference

CCPS Middle East Regional Meeting

Hydrogen Fueling Station Safety

Streamlining Permit-to-Work Processes With a Digital Solution

6th Middle East Process Engineering Conference and Exhibition

Quantum Computing and Artificial Intelligence Applications Workshop

2024 Offshore Technology Conference

CEP: April 2024

CEP: March 2024

Explore Areas of Advancement:

Learning Center:

Want to be an Entrepreneur? Personal Stories From Three Successful Entrepreneurs Who Have Traveled This Path.

(629h) Continuous Control of a Polymerization System with Deep Reinforcement Learning

AIChE Annual Meeting

2018

2018 AIChE Annual Meeting

Computing and Systems Technology Division

Modeling, Control, and Optimization of Manufacturing Systems

Thursday, November 1, 2018 - 10:13am to 10:32am

Authors

Topics

Checkout

Do you already own this?

Pricing

Individuals

More Conference Links

Cancelation Policy

Code of Conduct

Beware of Hotel and Attendee-list Scams

Code of Conduct

Beware of Hotel and Attendee-list Scams