publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
- CMAMESpatially-aware diffusion models with cross-attention for global field reconstruction with sparse observationsYilin Zhuang, Sibo Cheng, and Karthik DuraisamyComputer Methods in Applied Mechanics and Engineering, 2025
Diffusion models have gained attention for their ability to represent complex distributions and incorporate uncertainty, making them ideal for robust predictions in the presence of noisy or incomplete data. In this study, we develop and enhance score-based diffusion models in field reconstruction tasks, where the goal is to estimate complete spatial fields from partial observations. We introduce a condition encoding approach to construct a tractable mapping between observed and unobserved regions using a learnable integration of sparse observations and interpolated fields as an inductive bias. With refined sensing representations and an unraveled temporal dimension, our method can handle arbitrary moving sensors and effectively reconstruct fields. Furthermore, we conduct a comprehensive benchmark of our approach against a deterministic interpolation-based method across various static and time-dependent PDEs. Our study attempts to addresses the gap in strong baselines for evaluating performance across varying sampling hyperparameters, noise levels, and conditioning methods. Our results show that diffusion models with cross-attention and the proposed conditional encoding generally outperform other methods under noisy conditions, although the deterministic method excels with noiseless data. Additionally, both the diffusion models and the deterministic method surpass the numerical approach in accuracy and computational cost for the steady problem. We also demonstrate the ability of the model to capture possible reconstructions and improve the accuracy of fused results in covariance-based correction tasks using ensemble sampling.
@article{ZHUANG2025117623, title = {Spatially-aware diffusion models with cross-attention for global field reconstruction with sparse observations}, volume = {435}, issn = {0045-7825}, url = {https://www.sciencedirect.com/science/article/pii/S0045782524008776}, doi = {https://doi.org/10.1016/j.cma.2024.117623}, journal = {Computer Methods in Applied Mechanics and Engineering}, author = {Zhuang, Yilin and Cheng, Sibo and Duraisamy, Karthik}, year = {2025}, keywords = {Diffusion model, Generative AI, Global field reconstruction, Inverse problems}, pages = {117623}, }
2024
- CoCoGen: Physically-Consistent and Conditioned Score-based Generative Models for Forward and Inverse ProblemsChristian Jacobsen, Yilin Zhuang, and Karthik DuraisamyOct 2024arXiv:2312.10527 [cs]
Recent advances in generative artificial intelligence have had a significant impact on diverse domains spanning computer vision, natural language processing, and drug discovery. This work extends the reach of generative models into physical problem domains, particularly addressing the efficient enforcement of physical laws and conditioning for forward and inverse problems involving partial differential equations (PDEs). Our work introduces two key contributions: firstly, we present an efficient approach to promote consistency with the underlying PDE. By incorporating discretized information into score-based generative models, our method generates samples closely aligned with the true data distribution, showcasing residuals comparable to data generated through conventional PDE solvers, significantly enhancing fidelity. Secondly, we showcase the potential and versatility of score-based generative models in various physics tasks, specifically highlighting surrogate modeling as well as probabilistic field reconstruction and inversion from sparse measurements. A robust foundation is laid by designing unconditional score-based generative models that utilize reversible probability flow ordinary differential equations. Leveraging conditional models that require minimal training, we illustrate their flexibility when combined with a frozen unconditional model. These conditional models generate PDE solutions by incorporating parameters, macroscopic quantities, or partial field measurements as guidance. The results illustrate the inherent flexibility of score-based generative models and explore the synergy between unconditional score-based generative models and the present physically-consistent sampling approach, emphasizing the power and flexibility in solving for and inverting physical fields governed by differential equations, and in other scientific machine learning tasks.
@misc{jacobsenCoCoGenPhysicallyConsistentConditioned2024, title = {{CoCoGen}: {Physically}-{Consistent} and {Conditioned} {Score}-based {Generative} {Models} for {Forward} and {Inverse} {Problems}}, shorttitle = {{CoCoGen}}, url = {http://arxiv.org/abs/2312.10527}, doi = {10.48550/arXiv.2312.10527}, urldate = {2025-02-23}, publisher = {arXiv}, author = {Jacobsen, Christian and Zhuang, Yilin and Duraisamy, Karthik}, month = oct, year = {2024}, note = {arXiv:2312.10527 [cs]}, keywords = {Computer Science - Machine Learning}, file = {Jacobsen et al_2024_CoCoGen.pdf:C\:\\Users\\TonyZ\\Zotero\\storage\\IFWBKVFZ\\Jacobsen et al_2024_CoCoGen.pdf:application/pdf;Snapshot:C\:\\Users\\TonyZ\\Zotero\\storage\\GBTDCB24\\2312.html:text/html}, }
- Int. J. Multiph. FlowMachine learning and physics-driven modelling and simulation of multiphase systemsNausheen Basha, Rossella Arcucci, Panagiota Angeli, and 31 more authorsInternational Journal of Multiphase Flow, Oct 2024
We highlight the work of a multi-university collaborative programme, PREMIERE (PREdictive Modelling with QuantIfication of UncERtainty for MultiphasE Systems), which is at the intersection of multi-physics and machine learning, aiming to enhance predictive capabilities in complex multiphase flow systems across diverse length and time scales. Our contributions encompass a variety of approaches, including the Design of Experiments for nanoparticle synthesis optimisation, Generalised Latent Assimilation models for drop coalescence prediction, Bayesian regularised artificial neural networks, eXtreme Gradient Boosting for microdroplet formation prediction, and a sub-sampling based adversarial neural network for predicting slug flow behaviour in two-phase pipe flows. Additionally, we introduce a generalised latent assimilation technique, Long Short-Term Memory networks for sequence forecasting mixing performance in stirred and static mixers, active learning via Bayesian optimisation to recover coalescence model parameters for high current density electrolysers, Gaussian process regression for drop size distribution predictions for sprays, and acoustic emission signal inversion using gradient boosting machines to characterise particle size distribution in fluidised beds. We also offer perspectives on the development of a shape optimisation framework that leverages the use of a multi-fidelity multiphase emulator. The results presented have applications in chemical synthesis, microfluidics, product manufacturing, and green hydrogen generation.
@article{BASHA2024104936, title = {Machine learning and physics-driven modelling and simulation of multiphase systems}, volume = {179}, issn = {0301-9322}, url = {https://www.sciencedirect.com/science/article/pii/S0301932224002131}, doi = {https://doi.org/10.1016/j.ijmultiphaseflow.2024.104936}, journal = {International Journal of Multiphase Flow}, author = {Basha, Nausheen and Arcucci, Rossella and Angeli, Panagiota and Anastasiou, Charitos and Abadie, Thomas and Casas, César Quilodrán and Chen, Jianhua and Cheng, Sibo and Chagot, Loïc and Galvanin, Federico and Heaney, Claire E. and Hossein, Fria and Hu, Jinwei and Kovalchuk, Nina and Kalli, Maria and Kahouadji, Lyes and Kerhouant, Morgan and Lavino, Alessio and Liang, Fuyue and Nathanael, Konstantia and Magri, Luca and Lettieri, Paola and Materazzi, Massimiliano and Erigo, Matteo and Pico, Paula and Pain, Christopher C. and Shams, Mosayeb and Simmons, Mark and Traverso, Tullio and Valdes, Juan Pablo and Wolffs, Zef and Zhu, Kewei and Zhuang, Yilin and Matar, Omar K}, year = {2024}, keywords = {Hybrid methods, Machine Learning, Microfluidics, Multi-fidelity, Multiphase, Numerical simulations}, pages = {104936}, }
- CMAMEMulti-domain encoder–decoder neural networks for latent data assimilation in dynamical systemsSibo Cheng, Yilin Zhuang, Lyes Kahouadji, and 4 more authorsComputer Methods in Applied Mechanics and Engineering, Oct 2024
High-dimensional dynamical systems often require computationally intensive physics-based simulations, making full physical space data assimilation impractical. Latent data assimilation methods perform assimilation in reduced-order latent space for efficiency but struggle with complex, nonlinear state-observation mappings. Recent solutions like Generalized Latent Data Assimilation (GLA) and Latent Space Data Assimilation (LSDA) address heterogeneous latent spaces by incorporating surrogate mapping functions but introduce computational costs and uncertainties. Furthermore, current algorithms that integrate data assimilation and deep learning still face limitations when it comes to handling non-explicit mapping functions. To address these challenges, this paper introduces a novel deep-learning-based data assimilation scheme, named Multi-domain Encoder–Decoder Latent Data Assimilation (MEDLA), capable of handling diverse data sources by sharing a common latent space. The proposed approach significantly reduces the computational burden since the complex mapping functions are mimicked by the multi-domain encoder–decoder neural network. It also enhances assimilation accuracy by minimizing interpolation and approximation errors. Extensive numerical experiments from three different test cases assess MEDLA’s performance in high dimensional dynamical systems, benchmarking it against state-of-the-art latent data assimilation methods. The numerical results consistently underscore MEDLA’s superiority in managing multi-scale observational data and tackling intricate, non-explicit mapping functions.
@article{CHENG2024117201, title = {Multi-domain encoder–decoder neural networks for latent data assimilation in dynamical systems}, volume = {430}, issn = {0045-7825}, url = {https://www.sciencedirect.com/science/article/pii/S0045782524004572}, doi = {https://doi.org/10.1016/j.cma.2024.117201}, journal = {Computer Methods in Applied Mechanics and Engineering}, author = {Cheng, Sibo and Zhuang, Yilin and Kahouadji, Lyes and Liu, Che and Chen, Jianhua and Matar, Omar K. and Arcucci, Rossella}, year = {2024}, keywords = {Data assimilation, Data fusion, Deep learning, Dynamical systems}, pages = {117201}, }
2023
- INDIN 2023Semi-supervised Variational Autoencoders for Regression: Application to Soft SensorsYilin Zhuang*, Zhuobin Zhou*, Burak Alakent, and 1 more authorIn 2023 IEEE 21st International Conference on Industrial Informatics (INDIN), Jul 2023ISSN: 2378-363X
We present the development of a semi-supervised regression method using variational autoencoders (VAE) for soft sensing of process quality variables. Recently, use of VAEs was proposed for regression applications based on variational inference. In this work, We extend this approach of supervised VAEs for regression to make it learn from both labelled and unlabelled data leading to a semi-supervised VAE for regression (SSVAER) formulation. The probabilistic regressor resulting from the variational approach makes it possible to estimate the variance of the predictions simultaneously, which provides a means for online uncertainty quantification for soft sensors. We provide an extensive comparative study of SSVAER with previously proposed semi-supervised learning methods on two soft sensing benchmark problems using fixed-size datasets, where we vary the percentage of labelled data available for training. In these experiments, SSVAER achieves the lowest test errors in 11 of the 20 studied cases, compared to other methods where the second best method gets 4 lowest test errors out of the 20.
@inproceedings{zhuangSemisupervisedVariationalAutoencoders2023, title = {Semi-supervised {Variational} {Autoencoders} for {Regression}: {Application} to {Soft} {Sensors}}, shorttitle = {Semi-supervised {Variational} {Autoencoders} for {Regression}}, url = {https://ieeexplore.ieee.org/document/10218227}, doi = {10.1109/INDIN51400.2023.10218227}, urldate = {2025-02-23}, booktitle = {2023 {IEEE} 21st {International} {Conference} on {Industrial} {Informatics} ({INDIN})}, author = {Zhuang, Yilin and Zhou, Zhuobin and Alakent, Burak and Mercangöz, Mehmet}, month = jul, year = {2023}, note = {ISSN: 2378-363X}, keywords = {Estimation, Job shop scheduling, Neurons, Semi-supervised learning, Semisupervised learning, soft sensors, Soft sensors, Training, Uncertainty, uncertainty analysis, variational autoencoder}, pages = {1--8}, file = {IEEE Xplore Abstract Record:C\:\\Users\\TonyZ\\Zotero\\storage\\YZ8TTIP7\\10218227.html:text/html;Zhuang et al_2023_Semi-supervised Variational Autoencoders for Regression.pdf:C\:\\Users\\TonyZ\\Zotero\\storage\\NDWUB2IL\\Zhuang et al_2023_Semi-supervised Variational Autoencoders for Regression.pdf:application/pdf}, }
2022
- Lab on a ChipEnsemble latent assimilation with deep learning surrogate model: application to drop interaction in a microfluidics deviceYilin Zhuang, Sibo Cheng, Nina Kovalchuk, and 4 more authorsLab on a chip, Jul 2022Publisher: The Royal Society of Chemistry
A major challenge in the field of microfluidics is to predict and control drop interactions. This work develops an image-based data-driven model to forecast drop dynamics based on experiments performed on a microfluidics device. Reduced-order modelling techniques are applied to compress the recorded images into low-dimensional spaces and alleviate the computational cost. Recurrent neural networks are then employed to build a surrogate model of drop interactions by learning the dynamics of compressed variables in the reduced-order space. The surrogate model is integrated with real-time observations using data assimilation. In this paper we developed an ensemble-based latent assimilation algorithm scheme which shows an improvement in terms of accuracy with respect to the previous approaches. This work demonstrates the possibility to create a reliable data-driven model enabling a high fidelity prediction of drop interactions in microfluidics device. The performance of the developed system is evaluated against experimental data (i.e., recorded videos), which are excluded from the training of the surrogate model.
@article{D2LC00303A, title = {Ensemble latent assimilation with deep learning surrogate model: application to drop interaction in a microfluidics device}, volume = {22}, url = {http://dx.doi.org/10.1039/D2LC00303A}, doi = {10.1039/D2LC00303A}, number = {17}, journal = {Lab on a chip}, author = {Zhuang, Yilin and Cheng, Sibo and Kovalchuk, Nina and Simmons, Mark and Matar, Omar K. and Guo, Yi-Ke and Arcucci, Rossella}, year = {2022}, note = {Publisher: The Royal Society of Chemistry}, pages = {3187--3202}, }
- Comput. Ind.A hybrid data-driven and mechanistic model soft sensor for estimating CO2 concentrations for a carbon capture pilot plantYilin Zhuang, Yixuan Liu, Akhil Ahmed, and 4 more authorsComputers in Industry, Jul 2022
Integrating post-combustion carbon capture and storage (CCS) facilities into fossil fuel power plants is considered an important step for reaching global carbon emission reduction targets. When the number of gas analyzers in such CCS units is limited, variations in the load of the power plant pose a challenge to determine the gaseous CO2 concentration profile in the absorber. A dynamic hybrid model for estimating the carbon capture absorber’s gaseous CO2 concentration profile has been proposed in this study. The model is built using actual process data collected from a carbon capture pilot plant and it combines data-driven and reaction kinetic (mechanistic) modeling approaches to act as a soft sensor along the absorber column. A subset of the process data is used for training the data-driven models and for estimating the parameters of the mechanistic model respectively. Dimensionality reduction techniques are applied to the data-driven model to reduce the input size and hence the size of the dynamic model elements. The outputs of the two models are fused by comparing the computed covariance matrices. A particular challenge for this work is that the collected process data has missing spatial labels and temporal values for the CO2 concentration measurements. The presented models are obtained using an encoding and interpolation approach for the missing information. For comparison, an alternative approach based on semi-supervised learning has been implemented. The performance of the resulting soft sensors is verified by using process data from previously unseen operating conditions. The soft sensor based on the proposed hybrid model outperforms the soft sensor trained by semi-supervised autoencoder. Overall the results indicate that the proposed approach can estimate the CO2 concentration percentage with an average root mean squared error of 0.123.
@article{ZHUANG2022103747, title = {A hybrid data-driven and mechanistic model soft sensor for estimating {CO2} concentrations for a carbon capture pilot plant}, volume = {143}, issn = {0166-3615}, url = {https://www.sciencedirect.com/science/article/pii/S0166361522001440}, doi = {https://doi.org/10.1016/j.compind.2022.103747}, journal = {Computers in Industry}, author = {Zhuang, Yilin and Liu, Yixuan and Ahmed, Akhil and Zhong, Zhengang and del Rio Chanona, Ehecatl A. and Hale, Colin P. and Mercangöz, Mehmet}, year = {2022}, keywords = {Carbon capture pilot plant, Data-driven modeling, Long short term memory, Soft sensor}, pages = {103747}, }
2020
- APS DFDBenchmarking of CFD solvers for the simulation of two-phase jetsYilin Zhuang, Gabriel FN Gonçalves, Cristian Ricardo Constante Amores, and 2 more authorsIn APS division of fluid dynamics meeting abstracts, Jul 2020
Multiphase flows have received significant interest due to their occurrence in a multitude of natural and industrial applications. In this study, we perform the benchmarking of solvers for the simulation of two-phase jets. The Navier-Stokes equation are solved under the assumption of the single-fluid formulation and therefore, the interfacial location must be also determined as part of the solution. We evaluate solvers based on both interface-capturing methods (such as Basilisk and OpenFOAM) and interface-tracking methods. We consider the scenario of a water drop formation from a cylindrical nozzle and the accuracy of the different solvers is compared with experimental results, in terms of jet-shape and drop-size. A thorough study in terms of the computational requirements (e.g., grid resolution) and the computational cost is also performed.