Image First or Text First? Optimising the Sequencing of Modalities in Large Language Model Prompting and Reasoning Tasks

Our study investigates how the sequencing of text and image inputs within multi-modal prompts affects the reasoning performance of Large Language Models (LLMs). Through empirical evaluations of three major commercial LLM vendors—OpenAI, Google, and Anthropic—alongside a user study on interaction str...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Grant Wardle, Teo Sušnjak
Format:	Artikel
Sprache:	Englisch
Veröffentlicht:	MDPI AG 2025-06-01
Schriftenreihe:	Big Data and Cognitive Computing
Schlagworte:	multi-modal prompting interactive AI systems user-guided AI adaptation multi-modal large language models modality fusion multi-modal reasoning
Online-Zugang:	https://www.mdpi.com/2504-2289/9/6/149
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Schreiben Sie den ersten Kommentar!

Image First or Text First? Optimising the Sequencing of Modalities in Large Language Model Prompting and Reasoning Tasks

Ähnliche Einträge