How well can LLMs grade essays in Arabic?
This research assesses the effectiveness of state-of-the-art large language models (LLMs), including ChatGPT, Llama, Aya, Jais, and ACEGPT, in the task of Arabic automated essay scoring (AES) using the AR-AES dataset. It explores various evaluation methodologies, including zero-shot, few-shot in con...
Saved in:
Main Authors: | Rayed Ghazawi, Edwin Simpson |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2025-12-01
|
Series: | Computers and Education: Artificial Intelligence |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2666920X2500089X |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
AutoTA: A Dynamic Intent-Based Virtual Teaching Assistant for Students Using Open Source LLMs
by: Rajashree Dahal, et al.
Published: (2025-01-01) -
Statistics is not measurement: The inbuilt semantics of psychometric scales and language-based models obscures crucial epistemic differences
by: Jana Uher
Published: (2025-06-01) -
LLMs in Cyber Security: Bridging Practice and Education
by: Hany F. Atlam
Published: (2025-07-01) -
Leveraging LLMs for COVID-19 Fake News Generation and Detection: A Comparative Analysis on Twitter Data
by: Hong N. Dao, et al.
Published: (2025-01-01) -
On protecting the data privacy of Large Language Models (LLMs) and LLM agents: A literature review
by: Biwei Yan, et al.
Published: (2025-06-01)