Advancing Object Detection in Transportation with Multimodal Large Language Models (MLLMs): A Comprehensive Review and Empirical Testing
This study aims to comprehensively review and empirically evaluate the application of multimodal large language models (MLLMs) and Large Vision Models (VLMs) in object detection for transportation systems. In the first fold, we provide a background about the potential benefits of MLLMs in transporta...
Saved in:
Main Authors: | Huthaifa I. Ashqar, Ahmed Jaber, Taqwa I. Alhadidi, Mohammed Elhenawy |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2025-06-01
|
Series: | Computation |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-3197/13/6/133 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Leveraging Bird Eye View Video and Multimodal Large Language Models for Real-Time Intersection Control and Reasoning
by: Sari Masri, et al.
Published: (2025-05-01) -
SRP-CR: Semantic and Representational Priors for Diffusion-Based Cloud Removal
by: Zhentao Zou, et al.
Published: (2025-01-01) -
When Multimodal Large Language Models Meet Computer Vision: Progressive GPT Fine-Tuning and Stress Testing
by: Konstantinos I. Roumeliotis, et al.
Published: (2025-01-01) -
Efficacy of Autonomous Vehicle’s Adaptive Decision-Making Based on Large Language Models Across Multiple Driving Scenarios
by: Guanzhi Xiong, et al.
Published: (2025-01-01) -
AI for Data Quality Auditing: Detecting Mislabeled Work Zone Crashes Using Large Language Models
by: Shadi Jaradat, et al.
Published: (2025-05-01)