Word-embedding Based Text Vectorization Using Clustering

It is known that in natural language processing tasks, representing texts as fixed-length vectors using word-embedding models makes sense when the vectorized texts are short. The longer the texts being compared, the worse the approach works. This situation is due to the f...

Bibliographic Details
Main Authors:	Vitaly I. Yuferev, Nikolai A. Razin
Format:	Article
Language:	English
Published:	Yaroslavl State University 2021-10-01
Series:	Моделирование и анализ информационных систем (Modeling and Analysis of Information Systems)
Subjects:	word embedding; fasttext; tf-idf; averaging; clustering; text similarity; distance; text ranking
Online Access:	https://www.mais-journal.ru/jour/article/view/1529

Similar Items