DeepSeek-R1 outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in bilingual complex ophthalmology reasoning

Purpose: To evaluate the accuracy and reasoning ability of DeepSeek-R1 and three recently released large language models (LLMs) in bilingual complex ophthalmology cases. Methods: A total of 130 multiple-choice questions (MCQs) related to diagnosis (n ​= ​39) and management (n ​= ​91) were collected...

Full description

Saved in:
Bibliographic Details
Main Authors: Pusheng Xu, Yue Wu, Kai Jin, Xiaolan Chen, Mingguang He, Danli Shi
Format: Article
Language:English
Published: Elsevier 2025-08-01
Series:Advances in Ophthalmology Practice and Research
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2667376225000290
Tags: Add Tag
No Tags, Be the first to tag this record!