MACC-SRAM: A Multistep Accumulation Capacitor-Coupling In-Memory Computing SRAM Macro for Deep Convolutional Neural Networks

IEEE JOURNAL OF SOLID-STATE CIRCUITS（2024）

引用 0|浏览9

摘要

This article presents multistep accumulation capacitor coupling static random-access memory (MACC-SRAM), capacitor-based in-memory computing (IMC) SRAM macro for 4-b deep convolutional neural network (DNN) inference. The macro can simultaneously activate all its 128 $\times$ 128 custom 9T1C bitcells to perform the vector–matrix multiplication (VMM). MACC-SRAM also integrates 128 stepwise-charging and discharging input drivers (SCD-IDRs) to efficiently convert the digital codes of the input activations into analog voltages in a 2-b serial fashion. As a result, it can save up to 66% of the capacitor-driving energy. Also, the macro adopts an adder-first architecture to reduce the analog-to-digital (A/D) conversion overhead for the analog-mixed-signal (AMS) computation. The partial sums of the four adjacent rows, representing different bit positions in the 4-b weights, are first accumulated with an analog switched-capacitor adder and then converted to digital codes by a 6-bit successive approximation register (SAR) analog-to-digital converter (ADC). Compared with the ADC-first architecture, where partial sums of each row are first converted to digital codes and then accumulated in the digital domain, the adder-first architecture can save 60.7% of the area and 66.7% of the energy consumption of the A/D conversion. Moreover, the co-optimization of the DNN model by increasing the sparsity further reduces 39.4% of the capacitor-driving energy with a neglectable 0.4% DNN accuracy loss. We prototyped the macro in 28 nm technology, and the measurement shows an energy efficiency of 163 tera-operations per second per watt (TOPS/W) and a throughput of 211 giga-operations per second (GOPS) for a 4-b/4-b DNN model under 0.9 V supply, among the highest of the recent IMC SRAMs.

查看译文

关键词

Capacitive coupling computing,in-memory computing (IMC),sparsity-optimized deep convolutional neural network (DNN),stepwise-charging and discharging input driver (SCD-IDR)

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

您的评分 :

暂无评分

数据免责声明

页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果，我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问，可以通过电子邮件方式联系我们：report@aminer.cn