Performance Comparison of Large Language Models for Efficient Literature Screening

Background: Systematic reviewers face a growing body of biomedical literature, making early-stage article screening increasingly time-consuming. In this study, we assessed six large language models (LLMs)—OpenHermes, Flan T5, GPT-2, Claude 3 Haiku, GPT-3.5 Turbo, and GPT-4o—for th...

Bibliographic Details
Main Authors: Maria Teresa Colangelo, Stefano Guizzardi, Marco Meleti, Elena Calciolari, Carlo Galli
Format: Article
Language: English
Published: MDPI AG 2025-05-01
Series: BioMedInformatics
Online Access: https://www.mdpi.com/2673-7426/5/2/25