Performance Comparison of Large Language Models for Efficient Literature Screening

<b>Background:</b> Systematic reviewers face a growing body of biomedical literature, making early-stage article screening increasingly time-consuming. In this study, we assessed six large language models (LLMs)—OpenHermes, Flan T5, GPT-2, Claude 3 Haiku, GPT-3.5 Turbo, and GPT-4o—for th...

Full description

Saved in:

Bibliographic Details
Main Authors:	Maria Teresa Colangelo, Stefano Guizzardi, Marco Meleti, Elena Calciolari, Carlo Galli
Format:	Article
Language:	English
Published:	MDPI AG 2025-05-01
Series:	BioMedInformatics
Subjects:	systematic review large language models literature screening artificial intelligence
Online Access:	https://www.mdpi.com/2673-7426/5/2/25
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://www.mdpi.com/2673-7426/5/2/25

Performance Comparison of Large Language Models for Efficient Literature Screening

Internet

Similar Items