Image First or Text First? Optimising the Sequencing of Modalities in Large Language Model Prompting and Reasoning Tasks

Our study investigates how the sequencing of text and image inputs within multi-modal prompts affects the reasoning performance of Large Language Models (LLMs). Through empirical evaluations of three major commercial LLM vendors—OpenAI, Google, and Anthropic—alongside a user study on interaction str...

Bibliographic Details
Main Authors: Grant Wardle, Teo Sušnjak
Format: Article
Language: English
Published: MDPI AG 2025-06-01
Series: Big Data and Cognitive Computing
Online Access: https://www.mdpi.com/2504-2289/9/6/149