Synergy-CLIP: Extending CLIP With Multi-Modal Integration for Robust Representation Learning

Multi-modal representation learning has become a pivotal area in artificial intelligence, enabling the integration of diverse modalities such as vision, text, and audio to solve complex problems. However, existing approaches predominantly focus on bimodal interactions, such as image-text pairs, whic...

Bibliographic Details
Main Authors: Sangyeon Cho, Jangyeong Jeon, Mingi Kim, Junyeong Kim
Format: Article
Language: English
Published: IEEE 2025-01-01
Series: IEEE Access
Online Access: https://ieeexplore.ieee.org/document/10962132/

Similar Items