DeepSeek-R1 outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in bilingual complex ophthalmology reasoning
Purpose: To evaluate the accuracy and reasoning ability of DeepSeek-R1 and three recently released large language models (LLMs) in bilingual complex ophthalmology cases. Methods: A total of 130 multiple-choice questions (MCQs) related to diagnosis (n = 39) and management (n = 91) were collected...
Saved in:
Main Authors: | Pusheng Xu, Yue Wu, Kai Jin, Xiaolan Chen, Mingguang He, Danli Shi |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2025-08-01
|
Series: | Advances in Ophthalmology Practice and Research |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2667376225000290 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Generative AI in Pragmatics: Assessing the Accuracy of Automated Speech Act Classification in Pinter’s The Birthday Party
by: Tadej Todorović, et al.
Published: (2025-06-01) -
DeepSeek: Another step forward in the diagnosis of oral lesions
by: Márcio Diniz-Freitas, et al.
Published: (2025-07-01) -
Large Language Models for Transforming Healthcare: A Perspective on DeepSeek‐R1
by: Jinsong Zhou, et al.
Published: (2025-06-01) -
Pre-operative T-stage discrimination in gallbladder cancer using machine learning and DeepSeek-R1
by: Joongwon Chae, et al.
Published: (2025-08-01) -
Advancing Software Vulnerability Detection with Reasoning LLMs: DeepSeek-R1′s Performance and Insights
by: Wenting Qin, et al.
Published: (2025-06-01)