Please use this identifier to cite or link to this item: https://www.um.edu.mt/library/oar/handle/123456789/131782
Title: Learning circuit placement techniques through reinforcement learning with adaptive rewards
Authors: Vassallo, Luke
Bajada, Josef
Keywords: Printed circuits -- Design and construction
Reinforcement learning -- Mathematical models
Simulated annealing (Mathematics)
Electronic circuit design -- Automation
Markov processes
Issue Date: 2024-03
Publisher: Institute of Electrical and Electronics Engineers
Citation: Vassallo, L., & Bajada, J. (2024, March). Learning Circuit Placement Techniques Through Reinforcement Learning with Adaptive Rewards. Proceedings of the 2024 Design, Automation and Test in Europe Conference (DATE), IEEE. Valencia, Spain. 1-6.
Abstract: Placement is the initial step of Printed Circuit Board (PCB) physical design and demands considerable time and domain expertise. Placement quality impacts the performance of subsequent tasks, and the generation of an optimal placement is known to be, at the very least, NP-complete. While stochastic optimisation and analytic techniques have had some success, they often lack the intuitive understanding of human engineers. In this study, we propose a novel end-to-end Machine Learning (ML) approach to learn fundamental placement techniques and use experience to optimise PCB layouts efficiently. To achieve this, we formulate the PCB placement problem as a Markov Decision Process (MDP) and use Reinforcement Learning (RL) to learn general placement techniques. The agent-driven data collection process generates highly diverse and consistent data points sufficient for learning general policies without expert knowledge under the guidance of an adaptive reward signal. Compared to state-of-the-art simulated annealing approaches on unseen circuits, the resulting policies trained with TD3 and SAC, on average, yield 17% and 21% reduction in post-routing wirelength. Qualitative analysis shows that the policies learn fundamental placement techniques and demonstrate an understanding of the underlying problem dynamics. Collectively, they demonstrate emergent collaborative or competitive behaviours and faster placement convergence, sometimes exceeding an order of magnitude.
URI: https://www.um.edu.mt/library/oar/handle/123456789/131782
Appears in Collections:Scholarly Works - FacICTAI

Files in This Item:
File Description SizeFormat 
Learning circuit placement techniques through reinforcement learning with adaptive rewards 2024.pdf
  Restricted Access
768.27 kBAdobe PDFView/Open Request a copy


Items in OAR@UM are protected by copyright, with all rights reserved, unless otherwise indicated.