Word-embedding Based Text Vectorization Using Clustering

It is known that in natural language processing tasks, representing texts as fixed-length vectors using word-embedding models makes sense when the vectorized texts are short. The longer the texts being compared, the worse the approach works. This situation is due to the f...

Bibliographic Details
Main Authors: Vitaly I. Yuferev, Nikolai A. Razin
Format: Article
Language: English
Published: Yaroslavl State University 2021-10-01
Series: Моделирование и анализ информационных систем
Online Access: https://www.mais-journal.ru/jour/article/view/1529