YUNet_LLMClaimReport: An Enhanced Automobile Insurance Fraud Detection and Automated Claim Report Generation Using Large Language Models

The detection of fraud in automobile insurance and processing of automobile claims are still a challenge since manual inspections are carried out and automated systems are fragmented. The state-of-the-art approaches (convolutional neural networks, statistical models, etc.) tend to focus on separate...

Full description

Saved in:
Bibliographic Details
Main Authors: P. Anand Kumar, S. Sountharrajan
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11062644/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The detection of fraud in automobile insurance and processing of automobile claims are still a challenge since manual inspections are carried out and automated systems are fragmented. The state-of-the-art approaches (convolutional neural networks, statistical models, etc.) tend to focus on separate tasks (damage detection, fraud classification, etc.), without being integrated with report generation on an end-to-end basis, and without making sufficient use of unstructured information (e.g., images, claim descriptions). In this research, YUNet_LLMClaimReport, a new framework is proposed that combines YOLOv11, U-Net, and a fine-tuned GPT-3.5-turbo large language model to automatically generate claim reports based on the detections and segmentation. The model is trained on a custom dataset of 4,000 vehicle images within the Egyptian insurance market (70:20:10 training, validation and test split), using data augmentation, dropout (0.3) and L2 regularisation (l =0.01) to ensure robustness. The evaluation measures are mean average precision (mAP50: 0.9424), intersection over union (IoU: 0.85), claim report accuracy (CRA: 0.92+/−0.008), BLEU (0.85+/−0.01), and ROUGE-L (0.82+/−0.01). The framework runs at 8.3 FPS on NVIDIA Jetson Nano and saves processing time by 60 percent. Weaknesses are single-run measures because of computational limitations and factual inconsistencies in reports (10%). The model can be generalized to the Kaggle Car Damage Detection dataset (mAP50: 0.90), providing a scalable solution to insurance processes. The code and sample data are open sourced.
ISSN:2169-3536