DeepSeek-R1 outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in bilingual complex ophthalmology reasoning

Purpose: To evaluate the accuracy and reasoning ability of DeepSeek-R1 and three recently released large language models (LLMs) in bilingual complex ophthalmology cases. Methods: A total of 130 multiple-choice questions (MCQs) related to diagnosis (n = 39) and management (n = 91) were collected...

Bibliographic Details
Main Authors:	Pusheng Xu, Yue Wu, Kai Jin, Xiaolan Chen, Mingguang He, Danli Shi
Format:	Article
Language:	English
Published:	Elsevier 2025-08-01
Series:	Advances in Ophthalmology Practice and Research
Subjects:	Large language models; DeepSeek; Gemini; OpenAI; Clinical decision support; Reasoning ability
Online Access:	http://www.sciencedirect.com/science/article/pii/S2667376225000290

Similar Items