A Scalable Hardware Architecture for Efficient Learning of Recurrent Neural Networks at the Edge

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

214 Downloads (Pure)

Samenvatting

Edge devices can execute pre-trained Artificial Intelligence (AI) models optimized on large Graphical Processing Units (GPU) but often need fine-tuning for real-world data. This process, known as edge learning, is crucial for personalized learning for tasks such as speech and gesture recognition and often requires recurrent neural networks (RNNs). However, training RNNs on edge devices faces challenges due to limited resources. We propose a system for RNN training through sequence partitioning using the Forward Propagation Through Time (FPTT) training method, facilitating edge learning. Our optimized HW/SW co-design for FPTT is the first of its kind. In our work, we have implemented the complete computational process for training Long Short-Term Memory (LSTM) networks using FPTT, and we have optimized and explored the hardware architecture leveraging the Chipyard framework. Our findings indicate considerable memory savings, with only a slight increase in latency, when training small-batch size sequential MNIST (S-MNIST) data.
Originele taal-2Engels
Titel2024 IFIP/IEEE 32nd International Conference on Very Large Scale Integration, VLSI-SoC 2024
UitgeverijInstitute of Electrical and Electronics Engineers
Aantal pagina's4
ISBN van elektronische versie979-8-3315-3967-2
DOI's
StatusGepubliceerd - 3 dec. 2024
EvenementIFIP/IEEE International Conference on Very Large Scale Integration, VLSI-SoC 2024
- Morocco, Tanger, Marokko
Duur: 6 okt. 20249 okt. 2024
https://vlsisoc2024.nl/

Congres

CongresIFIP/IEEE International Conference on Very Large Scale Integration, VLSI-SoC 2024
Verkorte titelVLSI-SoC 2024
Land/RegioMarokko
StadTanger
Periode6/10/249/10/24
Internet adres

Financiering

This work has been funded by the Dutch Organization for Scientific Research (NWO) with Grant KICH1.ST04.22.021.

FinanciersFinanciernummer
Netherlands Organisation for Applied Scientific Research - TNOKICH1.ST04.22.021

    Vingerafdruk

    Duik in de onderzoeksthema's van 'A Scalable Hardware Architecture for Efficient Learning of Recurrent Neural Networks at the Edge'. Samen vormen ze een unieke vingerafdruk.

    Citeer dit