One-Time Programmable Memory for Ultra-Low Power ANN Inference Accelerator With Security Against Thermal Fault Injection

Memory technologies for in-memory computing (IMC) based neural network (NN) operations in edge devices face two primary challenges. First, these technologies often exhibit an excessive average power due to the high currents in the memory’s low and high resistance state (LRS and HRS). Seco...

Full description

Saved in:
Bibliographic Details
Main Authors: Shreyas Deshmukh, Ankit Bende, Diti Sanghai, Vivek Saraswat, Anmol Biswas, Abhishek Kadam, Shubham Patil, Ajay Kumar Singh, Veeresh Deshpande, Udayan Ganguly
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Journal of the Electron Devices Society
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10771781/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Memory technologies for in-memory computing (IMC) based neural network (NN) operations in edge devices face two primary challenges. First, these technologies often exhibit an excessive average power due to the high currents in the memory&#x2019;s low and high resistance state (LRS and HRS). Second, their vulnerability to tampering compromises device reliability in security-sensitive Internet of Things (IoT) applications. To tackle these challenges, we propose the use of one-time programmable memory (OTPM) for IMC-based thermal attack-secured and ultra-low powered NN accelerator with write disablement to prevent re-programming/tampering. In this article, first, we show that HRS for OTP memory is exceptionally high (~190 M<inline-formula> <tex-math notation="LaTeX">$\Omega $ </tex-math></inline-formula>), but the higher current in LRS (~100 mA) presents a challenge of high average power. We address this challenge by introducing a select transistor gate voltage biasing scheme with the 1T-1OTPM bit cell to reduce LRS current significantly while still trading off the excessive HRS/LRS ratio of 1.5 M to about 1 k. This reduces average power by <inline-formula> <tex-math notation="LaTeX">$\sim 1000\times $ </tex-math></inline-formula> without degrading CIFAR-10 classification performance. Second, we evaluate the impact of OTPM size scaling on HRS and LRS distributions. Though scaling reduces HRS current, we show that moderately scaled 10F<inline-formula> <tex-math notation="LaTeX">$\times 10$ </tex-math></inline-formula>F OTPM bit cells produce the lowest maximum current in HRS compared to the more aggressively scaled 1F<inline-formula> <tex-math notation="LaTeX">$\times 1$ </tex-math></inline-formula>F, which suffers from significant process-induced variability. Third, we show the robustness to temperature, which enables security from thermal attacks.
ISSN:2168-6734