A lip reading method based on adaptive pooling attention Transformer
Lip reading technology maps lip movements to specific language characters by processing a sequence of consecutive lip images, thereby enabling recognition of semantic information. Existing methods mainly use recurrent networks for spatiotemporal modeling of sequen...
Main Authors: | YAO Yun, HU Zhenxiao, DENG Tao, WANG Xiao |
---|---|
Format: | Article |
Language: | Chinese |
Published: | POSTS&TELECOM PRESS Co., LTD, 2025-06-01 |
Series: | 智能科学与技术学报 (Chinese Journal of Intelligent Science and Technology) |
Online Access: | http://www.cjist.com.cn/zh/article/doi/10.11959/j.issn.2096-6652.202515/ |
Similar Items
- IEAM: Integrating Edge Enhancement and Attention Mechanism with Multi-Path Complementary Features for Salient Object Detection in Remote Sensing Images
  by: Fubin Zhang, et al.
  Published: (2025-06-01)
- CAGNet: A Network Combining Multiscale Feature Aggregation and Attention Mechanisms for Intelligent Facial Expression Recognition in Human-Robot Interaction
  by: Dengpan Zhang, et al.
  Published: (2025-06-01)
- MRFB-Net: A Novel Attention Pooling Network With Modified Receptive Field Block for Uterine Fibroid Segmentation
  by: Yun Jiang, et al.
  Published: (2025-01-01)
- Drought tolerance gene pool in developing adaptive varieties of durum wheat identified in study nurseries under the Kazakhstan-Siberian program
  by: M. G. Evdokimov, et al.
  Published: (2017-11-01)
- An interpretable hybrid graph pooling scheme for system-scale adaptive small-signal stability assessment
  by: Jiyu Huang, et al.
  Published: (2025-09-01)