TY - JOUR
T1 - Age-Based Maintenance under Population Heterogeneity: Optimal Exploration and Exploitation
AU - Dursun, İpek
AU - Akçay, Alp
AU - van Houtum, Geert-Jan
PY - 2022/9/16
Y1 - 2022/9/16
N2 - We consider a system with a finite lifespan and a single critical component that is subject to random failures. An age-based replacement policy is applied to preventively replace the component before its failure. The components used for replacement come from either a weak population or a strong population, referred to as population heterogeneity. However, the true population type is unknown to the decision maker. By considering that the decision maker has a belief on the probability of having a weak population, we build a partially observable Markov decision process model with the objective of minimizing the total cost over the lifespan of the system. The resulting optimal policy updates the belief variable in a Bayesian fashion by using the data obtained over the course of the system lifespan, and it denotes when to execute preventive replacement. It optimally balances the trade-off between the cost of learning the true population type (via deliberately delaying the preventive replacement time to better learn the population type) and the cost of maintenance activities. By addressing this so-called exploration-exploitation trade-off, we generate insights on the optimal policy and compare its performance with existing heuristic approaches from the literature. We also characterize a lower bound to the optimal cost, allowing us to determine the value of resolving the uncertainty on the population type.
AB - We consider a system with a finite lifespan and a single critical component that is subject to random failures. An age-based replacement policy is applied to preventively replace the component before its failure. The components used for replacement come from either a weak population or a strong population, referred to as population heterogeneity. However, the true population type is unknown to the decision maker. By considering that the decision maker has a belief on the probability of having a weak population, we build a partially observable Markov decision process model with the objective of minimizing the total cost over the lifespan of the system. The resulting optimal policy updates the belief variable in a Bayesian fashion by using the data obtained over the course of the system lifespan, and it denotes when to execute preventive replacement. It optimally balances the trade-off between the cost of learning the true population type (via deliberately delaying the preventive replacement time to better learn the population type) and the cost of maintenance activities. By addressing this so-called exploration-exploitation trade-off, we generate insights on the optimal policy and compare its performance with existing heuristic approaches from the literature. We also characterize a lower bound to the optimal cost, allowing us to determine the value of resolving the uncertainty on the population type.
KW - Age-based replacement
KW - Bayesian learning
KW - Maintenance
KW - Partially observable Markov decision processes
KW - Population heterogeneity
UR - http://www.scopus.com/inward/record.url?scp=85121589448&partnerID=8YFLogxK
U2 - 10.1016/j.ejor.2021.11.038
DO - 10.1016/j.ejor.2021.11.038
M3 - Article
SN - 0377-2217
VL - 301
SP - 1007
EP - 1020
JO - European Journal of Operational Research
JF - European Journal of Operational Research
IS - 3
ER -