Smoothed per-tensor weight quantization: a robust solution for neural network deployment

This paper introduces a novel method to improve quantization outcomes for per-tensor weight quantization, focusing on enhancing computational efficiency and compatibility with resource-constrained hardware. Addressing the inherent challenges of depth-wise convolutions, the proposed smooth quantizati...

Full description

Saved in:
Bibliographic Details
Main Author: Xin Chang
Format: Article
Language:English
Published: Polish Academy of Sciences 2025-07-01
Series:International Journal of Electronics and Telecommunications
Subjects:
Online Access:https://journals.pan.pl/Content/135755/23_4966_Chang_L_sk.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!