|
|
Dynamic response surface methodology using Lasso regression for organic pharmaceutical synthesis |
Yachao Dong1,2, Christos Georgakis2( ), Jacob Santos-Marques2, Jian Du1 |
1. Institute of Chemical Process Systems Engineering, School of Chemical Engineering, Dalian University of Technology, Dalian 116024, China 2. Department of Chemical and Biological Engineering and Systems Research Institute, Tufts University, Medford, MA 02155, USA |
|
|
Abstract To study the dynamic behavior of a process, time-resolved data are collected at different time instants during each of a series of experiments, which are usually designed with the design of experiments or the design of dynamic experiments methodologies. For utilizing such time-resolved data to model the dynamic behavior, dynamic response surface methodology (DRSM), a data-driven modeling method, has been proposed. Two approaches can be adopted in the estimation of the model parameters: stepwise regression, used in several of previous publications, and Lasso regression, which is newly incorporated in this paper for the estimation of DRSM models. Here, we show that both approaches yield similarly accurate models, while the computational time of Lasso is on average two magnitude smaller. Two case studies are performed to show the advantages of the proposed method. In the first case study, where the concentrations of different species are modeled directly, DRSM method provides more accurate models compared to the models in the literature. The second case study, where the reaction extents are modeled instead of the species concentrations, illustrates the versatility of the DRSM methodology. Therefore, DRSM with Lasso regression can provide faster and more accurate data-driven models for a variety of organic synthesis datasets.
|
Keywords
data-driven modeling
pharmaceutical organic synthesis
Lasso regression
dynamic response surface methodology
|
Corresponding Author(s):
Christos Georgakis
|
Online First Date: 13 July 2021
Issue Date: 10 January 2022
|
|
1 |
C W Coley, N S Eyke, K F Jensen. Autonomous discovery in the chemical sciences part I: progress. Angewandte Chemie International Edition, 2020, 59: 2–38
|
2 |
R Van de Vijver, N M Vandewiele, P L Bhoorasingh, B L Slakman, F S Khanshan, H H Carstensen, M F Reyniers, G B Marin, R H West, K M Van Geem. Automatic mechanism and kinetic model generation for gas- and solution-phase processes: a perspective on best practices, recent advances, and future challenges. International Journal of Chemical Kinetics, 2015, 47(4): 199–231
https://doi.org/10.1002/kin.20902
|
3 |
F Qian, L Tao, W Sun, W Du. Development of a free radical kinetic model for industrial oxidation of p-xylene based on artificial neural network and adaptive immune genetic algorithm. Industrial & Engineering Chemistry Research, 2012, 51(8): 3229–3237
https://doi.org/10.1021/ie200737x
|
4 |
H Shi, T Zhou. Computational design of heterogeneous catalysts and gas separation materials for advanced chemical processing. Frontiers of Chemical Science and Engineering, 2021, 15(1): 49–59
https://doi.org/10.1007/s11705-020-1959-0
|
5 |
J A Selekman, J Qiu, K Tran, J Stevens, V Rosso, E Simmons, Y Xiao, J Janey. High-throughput automation in chemical process development. Annual Review of Chemical and Biomolecular Engineering, 2017, 8(1): 525–547
https://doi.org/10.1146/annurev-chembioeng-060816-101411
|
6 |
S Caron, N M Thomson. Pharmaceutical process chemistry: evolution of a contemporary data-rich laboratory environment. Journal of Organic Chemistry, 2015, 80(6): 2943–2958
https://doi.org/10.1021/jo502879m
|
7 |
J Ulrich, P Frohberg. Problems, potentials and future of industrial crystallization. Frontiers of Chemical Science and Engineering, 2013, 7(1): 1–8
https://doi.org/10.1007/s11705-013-1304-y
|
8 |
K V Gernaey, A E Cervera-Padrell, J M Woodley. A perspective on PSE in pharmaceutical process development and innovation. Computers & Chemical Engineering, 2012, 42: 15–29
https://doi.org/10.1016/j.compchemeng.2012.02.022
|
9 |
W Yue, X Chen, W Gui, Y Xie, H Zhang. A knowledge reasoning fuzzy-Bayesian network for root cause analysis of abnormal aluminum electrolysis cell condition. Frontiers of Chemical Science and Engineering, 2017, 11(3): 414–428
https://doi.org/10.1007/s11705-017-1663-x
|
10 |
D C Montgomery. Design and Analysis of Experiments. 8th edition. Hoboken: John Wiley & Sons, 2008
|
11 |
N Klebanov, C Georgakis. Dynamic response surface models: a data-driven approach for the analysis of time-varying process outputs. Industrial & Engineering Chemistry Research, 2016, 55(14): 4022–4034
https://doi.org/10.1021/acs.iecr.5b03572
|
12 |
Z Wang, C Georgakis. New dynamic response surface methodology for modeling nonlinear processes over semi-infinite time horizons. Industrial & Engineering Chemistry Research, 2017, 56(38): 10770–10782
https://doi.org/10.1021/acs.iecr.7b02381
|
13 |
Y Dong, C Georgakis, J Mustakis, J M Hawkins, L Han, K Wang, J P McMullen, S T Grosser, K Stone. Constrained version of the dynamic response surface methodology for modeling pharmaceutical reactions. Industrial & Engineering Chemistry Research, 2019, 58(30): 13611–13621
https://doi.org/10.1021/acs.iecr.9b00731
|
14 |
N R Domagalski, B C Mack, J E Tabora. Analysis of design of experiments with dynamic responses. Organic Process Research & Development, 2015, 19(11): 1667–1682
https://doi.org/10.1021/acs.oprd.5b00143
|
15 |
K Wang, L Han, J Mustakis, B Li, J Magano, D B Damon, A Dion, M T Maloney, R Post, R Li. Kinetic and data-driven reaction analysis for pharmaceutical process development. Industrial & Engineering Chemistry Research, 2020, 59(6): 2409–2421
https://doi.org/10.1021/acs.iecr.9b03578
|
16 |
E Alpaydin. Introduction to Machine Learning. 3rd edition. Cambridge: MIT Press, 2014
|
17 |
S Boyd, N Parikh, E Chu, B Peleato, J Eckstein. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends in Machine Learning, 2011, 3(1): 1–122
https://doi.org/10.1561/2200000016
|
18 |
S García-Muñoz, S Dolph, H W Ward II. Handling uncertainty in the establishment of a design space for the manufacture of a pharmaceutical product. Computers & Chemical Engineering, 2010, 34(7): 1098–1107
https://doi.org/10.1016/j.compchemeng.2010.02.027
|
19 |
J Santos-Marques, C Georgakis, J Mustakis, J M Hawkins. From DRSM models to the identification of the reaction stoichiometry in a complex pharmaceutical case study. AIChE Journal. American Institute of Chemical Engineers, 2019, 65(4): 1173–1185
https://doi.org/10.1002/aic.16515
|
20 |
Y Dong, C Georgakis, J Mustakis, J M Hawkins, L Han, K Wang, J P McMullen, S T Grosser, K Stone. Stoichiometry identification of pharmaceutical reactions using the constrained dynamic response surface methodology. AIChE Journal. American Institute of Chemical Engineers, 2019, 65(11): e16726
https://doi.org/10.1002/aic.16726
|
21 |
N Huri, M Feder. In selecting the Lasso regularization parameter via Bayesian principles, 2016 IEEE International Conference on the Science of Electrical Engineering (ICSEE), 2016, 1–5
|
22 |
D C Montgomery, E A Peck, G G Vining. Introduction to Linear Regression Analysis. 5th edition. London: Wiley, 2012
|
23 |
G H Golub, M Heath, G Wahba. Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics, 1979, 21(2): 215–223
https://doi.org/10.1080/00401706.1979.10489751
|
24 |
G Hanrahan, K Lu. Application of factorial and response surface methodology in modern experimental design and optimization. Critical Reviews in Analytical Chemistry, 2006, 36(3-4): 141–151
https://doi.org/10.1080/10408340600969478
|
25 |
G Singh, R S Pai, V K Devi. Response surface methodology and process optimization of sustained release pellets using Taguchi orthogonal array design and central composite design. Journal of Advanced Pharmaceutical Technology & Research, 2012, 3(1): 30–40
|
26 |
M A Bezerra, R E Santelli, E P Oliveira, L S Villar, L A Escaleira. Response surface methodology (RSM) as a tool for optimization in analytical chemistry. Talanta, 2008, 76(5): 965–977
https://doi.org/10.1016/j.talanta.2008.05.019
|
27 |
Y Dong, C Georgakis, J Mustakis, H Lu, J P McMullen. Optimization of pharmaceutical reactions using the dynamic response surface methodology. Computers & Chemical Engineering, 2020, 135: 106778
https://doi.org/10.1016/j.compchemeng.2020.106778
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|