Image First or Text First? Optimising the Sequencing of Modalities in Large Language Model Prompting and Reasoning Tasks

Our study investigates how the sequencing of text and image inputs within multi-modal prompts affects the reasoning performance of Large Language Models (LLMs). Through empirical evaluations of three major commercial LLM vendors—OpenAI, Google, and Anthropic—alongside a user study on interaction str...

תיאור מלא

שמור ב:

מידע ביבליוגרפי
Main Authors:	Grant Wardle, Teo Sušnjak
פורמט:	Article
שפה:	אנגלית
יצא לאור:	MDPI AG 2025-06-01
סדרה:	Big Data and Cognitive Computing
נושאים:	multi-modal prompting interactive AI systems user-guided AI adaptation multi-modal large language models modality fusion multi-modal reasoning
גישה מקוונת:	https://www.mdpi.com/2504-2289/9/6/149
תגים:	הוספת תג אין תגיות, היה/י הראשונ/ה לתייג את הרשומה!

אינטרנט

https://www.mdpi.com/2504-2289/9/6/149

Image First or Text First? Optimising the Sequencing of Modalities in Large Language Model Prompting and Reasoning Tasks

אינטרנט

פריטים דומים