Samenvatting
The degree of DLP parallelism in applications is not fixed and varies due to different computational characteristics of applications. On the contrary, most of the processors today include single-width SIMD (vector) hardware to exploit DLP. However, single-width SIMD architectures may not be optimal to serve applications with varying DLP and they may cause performance and energy inefficiency. We propose the usage of VLIW processors with multiple native vector-widths to better serve applications with changing DLP. SHAVE is an example of such VLIW processor and provides hardware support for the native 32-bit and 128-bit wide vector operations. This paper researches and implements the mixed-length SIMD code generation support for SHAVE processor. More specifically, we target generating 32-bit and 128/64-bit SIMD code for the native 32-bit and 128-bit wide vector units of SHAVE processor. In this way, we improved the performance of compiler generated SIMD code by reducing the number of overhead operations and by increasing the SIMD hardware utilization. Experimental results demonstrated that our methodology implemented in the compiler improves the performance of synthetic benchmarks up to 47%.
Originele taal-2 | Engels |
---|---|
Titel | ASAP 2015 - 26th IEEE International Conference on Application-specific Systems, Architectures and Processors, 27-29 July 2015, Toronto, Canada |
Plaats van productie | Toronto, Canada |
Uitgeverij | Institute of Electrical and Electronics Engineers |
Pagina's | 181-188 |
DOI's | |
Status | Gepubliceerd - 2015 |
Evenement | conference; ASAP; 2015-07-27; 2015-07-29 - Duur: 27 jul. 2015 → 29 jul. 2015 |
Congres
Congres | conference; ASAP; 2015-07-27; 2015-07-29 |
---|---|
Periode | 27/07/15 → 29/07/15 |
Ander | ASAP |