A lip reading method based on adaptive pooling attention Transformer

Lip reading technology establishes the mapping relationship between lip movements and specific language characters by processing a series of consecutive lip images, thereby enabling semantic information recognition. Existing methods mainly use recurrent networks for spatiotemporal modeling of sequen...

Full description

Saved in:
Bibliographic Details
Main Authors: YAO Yun, HU Zhenxiao, DENG Tao, WANG Xiao
Format: Article
Language:Chinese
Published: POSTS&TELECOM PRESS Co., LTD 2025-06-01
Series:智能科学与技术学报
Subjects:
Online Access:http://www.cjist.com.cn/zh/article/doi/10.11959/j.issn.2096-6652.202515/
Tags: Add Tag
No Tags, Be the first to tag this record!