TY - GEN
T1 - Simulation of Scientific Experiments with Generative Models
AU - Veretennikov, Stepan
AU - Minartz, Koen
AU - Menkovski, Vlado
AU - Gumuscu, Burcu
AU - de Boer, Jan
PY - 2022
Y1 - 2022
N2 - Lab experiments are a crucial part of research in the natural sciences. High-throughput screening is leveraged to generate hypotheses by evaluating a wide range of experimental parameter values and accumulating a wealth of data on the corresponding experimental outcomes. The data is subsequently analyzed to design new rounds of experiments. While discriminative models have previously proven useful for screening data analytics, they do not account for the randomness inherent to lab experiments and do not have the capacity to capture the potentially high-dimensional relationship between the experiment input parameters and outcomes. Instead, we take a data-driven simulation perspective on the problem. Inspired by biomaterials research experiments, we consider a case where both the input parameter space and the outcome space have a high-dimensional (image) representation. We propose a deep generative model that serves simultaneously as a simulation model of the experiment, i.e., it generates potential outcomes conditioned on the experiment input, and as a tool for inverse design, i.e., it generates instances of inputs that could lead to a given experiment outcome. A proof-of-concept evaluation on a synthetic dataset shows that the model learns the embedded relationship between the properties of the input and of the output in a probabilistic manner and supports experiment simulation and design application scenarios.
KW - Biomaterials engineering
KW - Disentangled latent space
KW - Generative models
KW - Simulation of experiments
UR - http://www.scopus.com/inward/record.url?scp=85128702239&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-01333-1_27
DO - 10.1007/978-3-031-01333-1_27
M3 - Conference contribution
SN - 978-3-031-01332-4
T3 - Lecture Notes in Computer Science
SP - 341
EP - 353
BT - Advances in Intelligent Data Analysis XX
A2 - Bouadi, Tassadit
A2 - Fromont, Elisa
A2 - Hüllermeier, Eyke
PB - Springer
T2 - 20th International Symposium on Intelligent Data Analysis, IDA 2022
Y2 - 20 April 2022 through 22 April 2022
ER -