How well can LLMs grade essays in Arabic?

QR Code

How well can LLMs grade essays in Arabic?

This research assesses the effectiveness of state-of-the-art large language models (LLMs), including ChatGPT, Llama, Aya, Jais, and ACEGPT, in the task of Arabic automated essay scoring (AES) using the AR-AES dataset. It explores various evaluation methodologies, including zero-shot, few-shot in con...

Full description

Saved in:

Bibliographic Details
Main Authors:	Rayed Ghazawi, Edwin Simpson
Format:	Article
Language:	English
Published:	Elsevier 2025-12-01
Series:	Computers and Education: Artificial Intelligence
Subjects:	Automatic essay scoring (AES) Natural language processing (NLP) Large language models (LLMs) Arabic language
Online Access:	http://www.sciencedirect.com/science/article/pii/S2666920X2500089X
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

AutoTA: A Dynamic Intent-Based Virtual Teaching Assistant for Students Using Open Source LLMs
by: Rajashree Dahal, et al.
Published: (2025-01-01)

Statistics is not measurement: The inbuilt semantics of psychometric scales and language-based models obscures crucial epistemic differences
by: Jana Uher
Published: (2025-06-01)

LLMs in Cyber Security: Bridging Practice and Education
by: Hany F. Atlam
Published: (2025-07-01)

Leveraging LLMs for COVID-19 Fake News Generation and Detection: A Comparative Analysis on Twitter Data
by: Hong N. Dao, et al.
Published: (2025-01-01)

On protecting the data privacy of Large Language Models (LLMs) and LLM agents: A literature review
by: Biwei Yan, et al.
Published: (2025-06-01)

Smart Building Recommendations with LLMs: A Semantic Comparison Approach
by: Ioannis Papaioannou, et al.
Published: (2025-06-01)

Language structure and translation : essays /
by: Nida, Eugene A.
Published: (1975)

Similarities And Differences Between Gpsg And Hpsg Grammars Applied To The Arabic Language
by: Abdelmadjid Achit, et al.
Published: (2011-12-01)

Development and Evaluation of Learning Portfolio Query System Based on LangChain Framework
by: Nien-Lin Hsueh, et al.
Published: (2025-04-01)

Rooted in and beyond interaction: A systematic review of interactive affordances of chatbots for language learning amidst the rise of large language models
by: Yunfei Du, et al.
Published: (2025-09-01)

How to write essays /
by: Lewis, Roger
Published: (1979)

Evaluation of Commentators’ Opinions on “Clear Arabic Language” (Arabic: لسانٍ عربیّ مبینٍ) in the Holy Quran
by: Ahmad Karimi, et al.
Published: (2023-07-01)

ResDecode: Accelerating Large Language Models Inference via Residual Decoding Heads
by: Ziqian Zeng, et al.
Published: (2025-06-01)

Large Language Models in Healthcare and Medical Applications: A Review
by: Subhankar Maity, et al.
Published: (2025-06-01)

How to write themes and essays /
by: McCall, John
Published: (1989)

Integrating Arab Cultural Elements in Arabic Language Education
by: Achmad Sulton, et al.
Published: (2025-04-01)

Harmonizing organ-at-risk structure names using open-source large language models
by: Adrian Thummerer, et al.
Published: (2025-07-01)

Large Language Model and Traditional Machine Learning Scoring of Evolutionary Explanations: Benefits and Drawbacks
by: Yunlong Pan, et al.
Published: (2025-05-01)

The evolution of language models: From N-Grams to LLMs, and beyond
by: Mohammad Ghaseminejad Raeini
Published: (2025-09-01)

Leave as Fast as You Can: Using Generative AI to Automate and Accelerate Hospital Discharge Reports
by: Alex Trejo Omeñaca, et al.
Published: (2025-05-01)

Comparative Analysis of BERT and GPT for Classifying Crisis News with Sudan Conflict as an Example
by: Yahya Masri, et al.
Published: (2025-07-01)

Decoding the Mystery: How Can LLMs Turn Text Into Cypher in Complex Knowledge Graphs?
by: Ioanna Mandilara, et al.
Published: (2025-01-01)

Large Language Models: A Structured Taxonomy and Review of Challenges, Limitations, Solutions, and Future Directions
by: Pejman Peykani, et al.
Published: (2025-07-01)

From the Arabic press a language reader in economic and social affairs
by: Nahmad, H. M.
Published: (1970)

The Dangerous Effects of a Frustratingly Easy LLMs Jailbreak Attack
by: Marco Bombieri, et al.
Published: (2025-01-01)

From digital traces to public vaccination behaviors: leveraging large language models for big data classification
by: Yoo Jung Oh, et al.
Published: (2025-07-01)

Overview of deep learning and large language models in machine translation: a special perspective on the Arabic language
by: Sanaa Abou Elhamayed, et al.
Published: (2025-06-01)

Deep Learning-Driven Labor Education and Skill Assessment: A Big Data Approach for Optimizing Workforce Development and Industrial Relations
by: Dan Peng
Published: (2025-01-01)

Potential of Artificial Intelligence Tools for Text Evaluation and Feedback Provision
by: S. V. Bogolepova
Published: (2025-03-01)

Regional Corpus Of Modern Standard Arabic
by: Ahmed Abdelali, et al.
Published: (2011-12-01)

Upcoming: Assessing the Potential Challenges of Paid LLMs and Inequities in Language Classrooms
by: Aditi Jhaveri
Published: (2025-07-01)

The Contribution of Rapid Automatized Naming Skills and Phonological Awareness to Arabic Language Reading Fluency: A Path Analysis
by: Абдулазіз Альшахрані
Published: (2023-04-01)

Arabic As a Medium for the Internalization of Worship Values
by: Aufia Aisa, et al.
Published: (2025-07-01)

Using Graph Mining Method in Analyzing Turkish Loanwords Derived from Arabic Language
by: Abbood Kirebut Jassim, et al.
Published: (2022-12-01)

Meticulous Thought Defender: Fine-Grained Chain-of-Thought (CoT) for Detecting Prompt Injection Attacks of Large Language Models
by: Lijuan Shi, et al.
Published: (2025-01-01)

Pidginization and creolization the case of Arabic
by: Versteegh, Kees
Published: (1984)

Rethinking survey development in health research with AI-driven methodologies
by: Hakan Kuru
Published: (2025-07-01)

Large Language Models’ Trustworthiness in the Light of the EU AI Act—A Systematic Mapping Study
by: Md Masum Billah, et al.
Published: (2025-07-01)

What are the future directions for microplastics characterization? A regex-llama data mining approach for identifying emerging trends
by: FERNANDO GOMES, et al.
Published: (2025-07-01)

Efficacy of Autonomous Vehicle’s Adaptive Decision-Making Based on Large Language Models Across Multiple Driving Scenarios
by: Guanzhi Xiong, et al.
Published: (2025-01-01)