TY - GEN
T1 - On the use of Bayesian probabilistic matrix factorization for predicting student performance in online learning environments
AU - Kim, Jinho
AU - Park, Jung Yeon
AU - van Den Noortgate, Wim
N1 - Publisher Copyright:
© This work is licensed under a Creative Commons License CC BY-NC-ND 4.0
PY - 2020
Y1 - 2020
N2 - Thanks to the advances in digital educational technology, online learning (or e-learning) environments such as Massive Open Online Course (MOOC) have been rapidly growing. In the online educational systems, however, there are two inherent challenges in predicting the performance of students and providing personalized supports to them: sparse data and the cold-start problem. To overcome such challenges, this article aims to employ a pertinent machine learning algorithm, the Bayesian Probabilistic Matrix Factorization (BPMF) that can enhance the prediction by incorporating background information on the side of students and/or items. An experimental study with two prediction scenarios and 24 experimental conditions was conducted to study the BPMF based on real online learning data. The results show that the lower rate of missingness and the appropriate dimensionality of latent features provided better prediction accuracy in both prediction scenarios. The use of side information enhanced the prediction accuracy but the effect was diminished for the high dimensional latent features when the data are sparse. The methodological value, applicability, and practical implications of the BPMF and side information to the online educational systems were also discussed.
AB - Thanks to the advances in digital educational technology, online learning (or e-learning) environments such as Massive Open Online Course (MOOC) have been rapidly growing. In the online educational systems, however, there are two inherent challenges in predicting the performance of students and providing personalized supports to them: sparse data and the cold-start problem. To overcome such challenges, this article aims to employ a pertinent machine learning algorithm, the Bayesian Probabilistic Matrix Factorization (BPMF) that can enhance the prediction by incorporating background information on the side of students and/or items. An experimental study with two prediction scenarios and 24 experimental conditions was conducted to study the BPMF based on real online learning data. The results show that the lower rate of missingness and the appropriate dimensionality of latent features provided better prediction accuracy in both prediction scenarios. The use of side information enhanced the prediction accuracy but the effect was diminished for the high dimensional latent features when the data are sparse. The methodological value, applicability, and practical implications of the BPMF and side information to the online educational systems were also discussed.
KW - Bayesian Probabilistic Matrix Factorization
KW - Digital educational technology
KW - Machine learning
KW - Online educational systems
KW - Online learning
KW - Student performance prediction
UR - http://www.scopus.com/inward/record.url?scp=85088390143&partnerID=8YFLogxK
U2 - 10.4995/HEAd20.2020.11137
DO - 10.4995/HEAd20.2020.11137
M3 - Conference contribution
AN - SCOPUS:85088390143
T3 - International Conference on Higher Education Advances
SP - 751
EP - 759
BT - HEAd 2020 - 6th International Conference on Higher Education Advances
PB - Universitat Politecnica de Valencia
T2 - 6th International Conference on Higher Education Advances, HEAd 2020
Y2 - 2 June 2020 through 5 June 2020
ER -