TY - JOUR
T1 - Predicting Academic Performance through Data Mining
T2 - A Systematic Literature
AU - Daza, Alfredo
AU - Guerra, Carlos
AU - Cervera, Noemí
AU - Burgos, Erwin
N1 - Publisher Copyright:
© 2022. Alfredo daza Vergaray et al; published by UIKTEN. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 License.
PY - 2022/5/1
Y1 - 2022/5/1
N2 - The main objective of this work is to make a systematic review of the literature on the prediction of the academic performance of university students by applying data mining techniques. For this purpose, an exhaustive search was carried out and after the analysis of the documentation collected, aspects such as: methodology, attributes, selection algorithms, techniques, tools, and metrics were considered, which served as the basis for the elaboration of this document. The results of the study showed that the most used methodology is KDD(database knowledge extraction), the most important attribute to achieve prediction is CGPA(academic performance), the most commonly used variable selection algorithm is InfoGain-AttributeEval, among the most efficient techniques are Naïve Bayes, Neural Networks (MLP) and Decision Tree (J48), the most used tools for the development of the models is the Weka software and finally the metrics necessary to determine the effectiveness of the model were Precision and Recall.
AB - The main objective of this work is to make a systematic review of the literature on the prediction of the academic performance of university students by applying data mining techniques. For this purpose, an exhaustive search was carried out and after the analysis of the documentation collected, aspects such as: methodology, attributes, selection algorithms, techniques, tools, and metrics were considered, which served as the basis for the elaboration of this document. The results of the study showed that the most used methodology is KDD(database knowledge extraction), the most important attribute to achieve prediction is CGPA(academic performance), the most commonly used variable selection algorithm is InfoGain-AttributeEval, among the most efficient techniques are Naïve Bayes, Neural Networks (MLP) and Decision Tree (J48), the most used tools for the development of the models is the Weka software and finally the metrics necessary to determine the effectiveness of the model were Precision and Recall.
KW - Academic performance
KW - Academic performance in college students
KW - Data mining
KW - Prediction
UR - https://www.scopus.com/pages/publications/85131447674
U2 - 10.18421/TEM112-57
DO - 10.18421/TEM112-57
M3 - Article
AN - SCOPUS:85131447674
SN - 2217-8309
VL - 11
SP - 939
EP - 949
JO - TEM Journal
JF - TEM Journal
IS - 2
ER -