Are MOOC Learning Analytics Results Trustworthy? With Fake Learners, They Might Not Be!

Giora Alexandron, Lisa Y. Yoo, Jose A. Ruiperez-Valiente, Sunbok Lee, David E. Pritchard

Research output: Contribution to journalArticlepeer-review

Abstract

The rich data that Massive Open Online Courses (MOOCs) platforms collect on the behavior of millions of users provide a unique opportunity to study human learning and to develop data-driven methods that can address the needs of individual learners. This type of research falls into the emerging field of learning analytics. However, learning analytics research tends to ignore the issue of the reliability of results that are based on MOOCs data, which is typically noisy and generated by a largely anonymous crowd of learners. This paper provides evidence that learning analytics in MOOCs can be significantly biased by users who abuse the anonymity and open-nature of MOOCs, for example by setting up multiple accounts, due to their amount and aberrant behavior. We identify these users, denoted fake learners, using dedicated algorithms. The methodology for measuring the bias caused by fake learners' activity combines the ideas of Replication Research and Sensitivity Analysis. We replicate two highly-cited learning analytics studies with and without fake learners data, and compare the results. While in one study, the results were relatively stable against fake learners, in the other, removing the fake learners' data significantly changed the results. These findings raise concerns regarding the reliability of learning analytics in MOOCs, and highlight the need to develop more robust, generalizable and verifiable research methods.

Original languageEnglish
Pages (from-to)484-506
Number of pages23
JournalInternational Journal of Artificial Intelligence in Education
Volume29
Issue number4
Early online date2 Jul 2019
DOIs
StatePublished - Dec 2019

All Science Journal Classification (ASJC) codes

  • Education
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Are MOOC Learning Analytics Results Trustworthy? With Fake Learners, They Might Not Be!'. Together they form a unique fingerprint.

Cite this