Doorgaan naar hoofdnavigatie Doorgaan naar zoeken Ga verder naar hoofdinhoud

Scaling Up Quantization-Aware Neural Architecture Search for Efficient Deep Learning on the Edge

  • Yao Lu
  • , Hiram Rayo Torres Rodriguez
  • , Sebastian Vogel
  • , Nick van de Waterlaat
  • , Pavol Jancura

Onderzoeksoutput: Hoofdstuk in Boek/Rapport/CongresprocedureConferentiebijdrageAcademicpeer review

51 Downloads (Pure)

Samenvatting

Neural Architecture Search (NAS) has become the de-facto approach for designing accurate and efficient networks for edge devices. Since models are typically quantized for edge deployment, recent work has investigated quantization-aware NAS (QA-NAS) to search for highly accurate and efficient quantized models. However, existing QA-NAS approaches, particularly few-bit mixed-precision (FB-MP) methods, do not scale to larger tasks. Consequently, QA-NAS has mostly been limited to low-scale tasks and tiny networks. In this work, we present an approach to enable QA-NAS (INT8 and FB-MP) on large-scale tasks by leveraging the block-wise formulation introduced by block-wise NAS. We demonstrate strong results for the semantic segmentation task on the Cityscapes dataset, finding FB-MP models 33% smaller and INT8 models 17.6% faster than DeepLabV3 (INT8) without compromising task performance.
Originele taal-2Engels
TitelCODAI '23
SubtitelProceedings of the 2023 Workshop on Compilers, Deployment, and Tooling for Edge AI
Plaats van productieNew York
UitgeverijAssociation for Computing Machinery, Inc.
Aantal pagina's5
ISBN van elektronische versie979-8-4007-0337-9
DOI's
StatusGepubliceerd - 10 jun. 2024
Evenement2023 IEEE/ACM International Workshop on Compilers, Deployment, and Tooling for Edge AI , CODAI 2023 - Hamburg, Duitsland
Duur: 21 sep. 202321 sep. 2023

Workshop

Workshop2023 IEEE/ACM International Workshop on Compilers, Deployment, and Tooling for Edge AI , CODAI 2023
Verkorte titelCODAI 2023
Land/RegioDuitsland
StadHamburg
Periode21/09/2321/09/23

Financiering

This work was supported by Key Digital Technologies Joint Undertaking (KDT JU) in EdgeAI \"Edge AI Technologies for Optimised Performance Embedded Processing\" project, grant agreement No 101097300.

Vingerafdruk

Duik in de onderzoeksthema's van 'Scaling Up Quantization-Aware Neural Architecture Search for Efficient Deep Learning on the Edge'. Samen vormen ze een unieke vingerafdruk.

Citeer dit