TY - JOUR
T1 - Deep hybrid modeling of a HEK293 process
T2 - Combining long short-term memory networks with first principles equations
AU - Ramos, João R. C.
AU - Pinto, José
AU - Poiares-Oliveira, Gil
AU - Peeters, Ludovic
AU - Dumas, Patrick
AU - Oliveira, Rui
N1 - Funding Information:
The authors acknowledge GlaxoSmithKline Biologicals SA for providing the experimental data used in this study. JP acknowledges PhD grant SFRD/BD14610472019, Fundação para a Ciência e Tecnologia (FCT), Portugal.
Publisher Copyright:
© 2024 The Authors. Biotechnology and Bioengineering published by Wiley Periodicals LLC.
PY - 2024/5
Y1 - 2024/5
N2 - The combination of physical equations with deep learning is becoming a promising methodology for bioprocess digitalization. In this paper, we investigate for the first time the combination of long short-term memory (LSTM) networks with first principles equations in a hybrid workflow to describe human embryonic kidney 293 (HEK293) culture dynamics. Experimental data of 27 extracellular state variables in 20 fed-batch HEK293 cultures were collected in a parallel high throughput 250 mL cultivation system in an industrial process development setting. The adaptive moment estimation method with stochastic regularization and cross-validation were employed for deep learning. A total of 784 hybrid models with varying deep neural network architectures, depths, layers sizes and node activation functions were compared. In most scenarios, hybrid LSTM models outperformed classical hybrid Feedforward Neural Network (FFNN) models in terms of training and testing error. Hybrid LSTM models revealed to be less sensitive to data resampling than FFNN hybrid models. As disadvantages, Hybrid LSTM models are in general more complex (higher number of parameters) and have a higher computation cost than FFNN hybrid models. The hybrid model with the highest prediction accuracy consisted in a LSTM network with seven internal states connected in series with dynamic material balance equations. This hybrid model correctly predicted the dynamics of the 27 state variables (R2 = 0.93 in the test data set), including biomass, key substrates, amino acids and metabolic by-products for around 10 cultivation days.
AB - The combination of physical equations with deep learning is becoming a promising methodology for bioprocess digitalization. In this paper, we investigate for the first time the combination of long short-term memory (LSTM) networks with first principles equations in a hybrid workflow to describe human embryonic kidney 293 (HEK293) culture dynamics. Experimental data of 27 extracellular state variables in 20 fed-batch HEK293 cultures were collected in a parallel high throughput 250 mL cultivation system in an industrial process development setting. The adaptive moment estimation method with stochastic regularization and cross-validation were employed for deep learning. A total of 784 hybrid models with varying deep neural network architectures, depths, layers sizes and node activation functions were compared. In most scenarios, hybrid LSTM models outperformed classical hybrid Feedforward Neural Network (FFNN) models in terms of training and testing error. Hybrid LSTM models revealed to be less sensitive to data resampling than FFNN hybrid models. As disadvantages, Hybrid LSTM models are in general more complex (higher number of parameters) and have a higher computation cost than FFNN hybrid models. The hybrid model with the highest prediction accuracy consisted in a LSTM network with seven internal states connected in series with dynamic material balance equations. This hybrid model correctly predicted the dynamics of the 27 state variables (R2 = 0.93 in the test data set), including biomass, key substrates, amino acids and metabolic by-products for around 10 cultivation days.
KW - deep hybrid modeling
KW - deep learning
KW - feedforward neural networks
KW - HEK293 cells
KW - long short-term memory network
KW - machine learning
UR - http://www.scopus.com/inward/record.url?scp=85185140352&partnerID=8YFLogxK
U2 - 10.1002/bit.28668
DO - 10.1002/bit.28668
M3 - Article
C2 - 38343176
AN - SCOPUS:85185140352
SN - 0006-3592
VL - 121
SP - 1554
EP - 1568
JO - Biotechnology and Bioengineering
JF - Biotechnology and Bioengineering
IS - 5
ER -