Advancing Object Detection in Transportation with Multimodal Large Language Models (MLLMs): A Comprehensive Review and Empirical Testing

QR Code

Advancing Object Detection in Transportation with Multimodal Large Language Models (MLLMs): A Comprehensive Review and Empirical Testing

This study aims to comprehensively review and empirically evaluate the application of multimodal large language models (MLLMs) and Large Vision Models (VLMs) in object detection for transportation systems. In the first fold, we provide a background about the potential benefits of MLLMs in transporta...

Full description

Saved in:

Bibliographic Details
Main Authors:	Huthaifa I. Ashqar, Ahmed Jaber, Taqwa I. Alhadidi, Mohammed Elhenawy
Format:	Article
Language:	English
Published:	MDPI AG 2025-06-01
Series:	Computation
Subjects:	multimodal large language models (MLLMs) end-to-end object detection large vision models (VLMs) autonomous driving intelligent transportation systems (ITS)
Online Access:	https://www.mdpi.com/2079-3197/13/6/133
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Leveraging Bird Eye View Video and Multimodal Large Language Models for Real-Time Intersection Control and Reasoning
by: Sari Masri, et al.
Published: (2025-05-01)

SRP-CR: Semantic and Representational Priors for Diffusion-Based Cloud Removal
by: Zhentao Zou, et al.
Published: (2025-01-01)

When Multimodal Large Language Models Meet Computer Vision: Progressive GPT Fine-Tuning and Stress Testing
by: Konstantinos I. Roumeliotis, et al.
Published: (2025-01-01)

Efficacy of Autonomous Vehicle’s Adaptive Decision-Making Based on Large Language Models Across Multiple Driving Scenarios
by: Guanzhi Xiong, et al.
Published: (2025-01-01)

AI for Data Quality Auditing: Detecting Mislabeled Work Zone Crashes Using Large Language Models
by: Shadi Jaradat, et al.
Published: (2025-05-01)

Coherent Interpretation of Entire Visual Field Test Reports Using a Multimodal Large Language Model (ChatGPT)
by: Jeremy C. K. Tan
Published: (2025-04-01)

END TO END LEARNING FOR A DRIVING SIMULATOR
by: V. F. Alexeev, et al.
Published: (2019-06-01)

An Overview of Autonomous Parking Systems: Strategies, Challenges, and Future Directions
by: Javier Santiago Olmos Medina, et al.
Published: (2025-07-01)

SMILE: A Small Multimodal Dataset Capturing Roadside Behavior in Indian Driving Conditions
by: Mayur Anand Pandya, et al.
Published: (2025-01-01)

DUAL INVERTER-FED DRIVES WITH THE SYNCHRONISED MULTILEVEL VOLTAGE WAVEFORMS
by: Oleschuk V.I., et al.
Published: (2006-04-01)

Multimodal LLM for enhanced Alzheimer’s Disease diagnosis: Interpretable feature extraction from Mini-Mental State Examination data
by: Meiwei Zhang, et al.
Published: (2025-09-01)

An Architecture for Intelligent Tutoring in Virtual Reality: Integrating LLMs and Multimodal Interaction for Immersive Learning
by: Mohamed El Hajji, et al.
Published: (2025-06-01)

Enhancing Autonomous Driving Perception: A Practical Approach to Event-Based Object Detection in CARLA and ROS
by: Jingxiang Feng, et al.
Published: (2025-05-01)

Optimization and application of vision-based large models in educational scenarios
by: XU Yuepeng, et al.
Published: (2025-01-01)

Urban Greening Analysis: A Multimodal Large Language Model for Pinpointing Vegetation Areas in Adverse Weather Conditions
by: Hanzhang Liu, et al.
Published: (2025-06-01)

Few-shot learning for novel object detection in autonomous driving
by: Yifan Zhuang, et al.
Published: (2025-12-01)

PathVLM-Eval: Evaluation of open vision language models in histopathology
by: Nauman Ullah Gilal, et al.
Published: (2025-08-01)

Evaluation of the correlation between six scoring systems for assessing the severity of end-stage liver disease and intraoperative blood loss during liver transplantation: a retrospective study
by: Amer Majeed, et al.
Published: (2025-06-01)

Ex Vivo and Simulation Comparison of Leakage in End-to-End Versus End-to-Side Anastomosed Porcine Large Intestine
by: Youssef Fahmy, et al.
Published: (2025-06-01)

A multimodal framework for enhancing E-commerce information management using vision transformers and large language models
by: Anitha Balachandran, et al.
Published: (2025-12-01)

Managing the reliability of structures in the mechanism of implementation of end-to-end technologies in innovative ecosystem of the department
by: V. V. Galayko, et al.
Published: (2024-11-01)

Laparoscopic central pancreatectomy with end-to-end pancreatic anastomosis (with video)
by: Gang Wang, et al.
Published: (2025-08-01)

Ayatollah Javadi Amoli's Perspective on Redemption: Inclusive End or Dominant End?
by: Tahereh Salehi, et al.
Published: (2024-06-01)

WiFi-TSN low latency conversion architecture
by: Wang Bo, et al.
Published: (2022-04-01)

Application of Terminal Audio Mixing in Multi-Bandwidth End-to-End Encrypted Voice Conference
by: Chi-Hung Lien, et al.
Published: (2025-05-01)

Art appreciation based on graph retrieval augmented generation and few-shot learning
by: LIU Tianyang, et al.
Published: (2025-01-01)

Generative AI Models (2018–2024): Advancements and Applications in Kidney Care
by: Fnu Neha, et al.
Published: (2025-04-01)

End-to-end Standardization of Original Medicines when Determining Related Impurities
by: Yu. E. Generalova, et al.
Published: (2023-12-01)

Endgame strategy
by: Shereshevsky, M. I.
Published: (1985)

A Survey of Deep Learning-Driven 3D Object Detection: Sensor Modalities, Technical Architectures, and Applications
by: Xiang Zhang, et al.
Published: (2025-06-01)

Reliable QoE Prediction in IMVCAs Using an LMM-Based Agent
by: Michael Sidorov, et al.
Published: (2025-07-01)

RADAR: Reasoning AI-Generated Image Detection for Semantic Fakes
by: Haochen Wang, et al.
Published: (2025-07-01)

LST-BEV: Generating a Long-Term Spatial–Temporal Bird’s-Eye-View Feature for Multi-View 3D Object Detection
by: Qijun Feng, et al.
Published: (2025-06-01)

Simulation of Electromagnetic Field of a Powerful Electrical Machine
by: D. I. Hvalin, et al.
Published: (2021-04-01)

Use of End-to-End Tool for the Analysis of the Digital Governance of Ports
by: Nicoletta González-Cancelas, et al.
Published: (2024-06-01)

A Review of DEtection TRansformer: From Basic Architecture to Advanced Developments and Visual Perception Applications
by: Liang Yu, et al.
Published: (2025-06-01)

End-to-End Interrupted Sampling Repeater Jamming Countermeasure Network Under Low Signal-to-Noise Ratio
by: Gane Dai, et al.
Published: (2025-06-01)

End-to-end design of a competitive car with a high level of handling and safety indicators
by: N. A. Volkova, et al.
Published: (2022-07-01)

Online Persian/Arabic Writer Identification using Gated Recurrent Unit Neural Networks
by: Mahsa Aliakbarzadeh, et al.
Published: (2024-02-01)

End user computing : management, applications, and technology /
by: Panko, Raymond R.
Published: (1988)