Creating and sharing knowledge for telecommunications

To PiM or Not to PiM: The case for in-memory inferencing of quantized CNNs at the edge

Falcão, G. ; Ferreira, J.

Queue Vol. 20, Nº 6, pp. 9 - 34, December, 2022.

ISSN (print): 1542-7730
ISSN (online): 1542-7749

Scimago Journal Ranking: 0,42 (in 2022)

Digital Object Identifier: 10.1145/3580503

Download Full text PDF ( 3 MBs)

Downloaded 2 times

Abstract
As artificial intelligence becomes a pervasive tool for the billions of IoT (Internet of things) devices at the edge, the data movement bottleneck imposes severe limitations on the performance and autonomy of these systems. PiM (processing-in-memory) is emerging as a way of mitigating the data movement bottleneck while satisfying the stringent performance, energy efficiency, and accuracy requirements of edge imaging applications that rely on CNNs (convolutional neural networks).