What is it about?

This study investigates whether ChatGPT can function as a reliable and fair evaluator of student academic work in higher education, with particular attention to grading reliability, evaluative validity, and potential bias across social science disciplines.


Why is it important?

The study empirically advances understanding of the reliability–validity paradox in AI-assisted assessment by demonstrating that grading consistency does not equate to pedagogical fairness, positioning leniency bias as a structural outcome of large language model design rather than random error.

Perspectives

Practical implications

ChatGPT is best suited for formative feedback and diagnostic support, not summative grading. Institutions should adopt calibrated human–AI hybrid assessment models and clear governance frameworks.

Social implications

Unregulated AI grading risks normalizing grade inflation and misrepresenting academic achievement, potentially undermining trust in educational credentials.

Dr Hisham Al Ghunaimi

Read the Original

This page is a summary of: Reliable but not rigorous: Evaluating ChatGPT's reliability, validity, and bias in automated academic grading, Social Sciences & Humanities Open, June 2026, Elsevier.
DOI: 10.1016/j.ssaho.2026.102788.
