Skip to main navigation Skip to search Skip to main content

Maintenance Planning with Deterioration by a Reinforcement Learning Approach - a Semiconductor Simulation Study

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

37 Downloads (Pure)

Abstract

Manufacturing companies are often faced with deteriorating production systems, which can greatly impact their overall performance. Scheduling the preventive maintenance activities optimally is vital for these companies. This study utilizes historical data on product quality deterioration in the last machine of a real-world serial, buffered production line to refine maintenance decisions. Currently, maintenance intervals for the last machine are fixed. The objective is to improve this policy by increasing the production system’s
long-term throughput. A simplified two-machine-one-buffer (2M1B) system is modeled to devise and assess policies. Various optimization techniques are applied, with the average reward Reinforcement Learning (RL) technique showing the most promising results in numerical experiments. The RL-optimized policy exhibits significant potential by considering product quality deterioration, buffer levels ahead of the last machine, and the machine states of the last two machines.
Original languageEnglish
Title of host publicationProceedings of the 2024 Winter Simulation Conference
EditorsH. Lam, E. Azar, D. Batur, S. Gao, X. Wie, S.R. Hunter, M.D. Rossetti
PublisherINFORMS Institute for Operations Research and the Management Sciences
Pages1761-1772
Number of pages12
ISBN (Electronic)979-8-3315-3420-2
Publication statusPublished - 22 Jan 2025
Event2024 Winter Simulation Conference - Orlando, United States
Duration: 15 Dec 202418 Dec 2024

Conference

Conference2024 Winter Simulation Conference
Country/TerritoryUnited States
CityOrlando
Period15/12/2418/12/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Fingerprint

Dive into the research topics of 'Maintenance Planning with Deterioration by a Reinforcement Learning Approach - a Semiconductor Simulation Study'. Together they form a unique fingerprint.

Cite this