Analysis of Factors Effecting PISA 2015 Mathematics Literacy via Educational Data Mining

Gure O. B., Kayri M., Erdoğan F.

EGITIM VE BILIM-EDUCATION AND SCIENCE, vol.45, no.202, pp.393-415, 2020 (SSCI) identifier identifier

  • Publication Type: Article / Article
  • Volume: 45 Issue: 202
  • Publication Date: 2020
  • Doi Number: 10.15390/eb.2020.8477
  • Journal Indexes: Social Sciences Citation Index (SSCI), Scopus, EBSCO Education Source, TR DİZİN (ULAKBİM)
  • Page Numbers: pp.393-415
  • Keywords: PISA, Mathematics literacy, Educational Data Mining, Multi-layer Perceptron, Random Forest, RANDOM FOREST, MULTILAYER PERCEPTRON, CLASSIFICATION, CLASSIFIERS, STUDENTS
  • Van Yüzüncü Yıl University Affiliated: Yes


The aim of this study is to determine the factors affecting PISA 2015 Mathematics literacy by using data mining methods such as Multilayer Perceptron Artificial Neural Networks and Random Forest. Cause and effect relation within the context of the study was tried to be discovered by means of data mining methods at the level of deep learning. In terms of Prediction Ability, the findings of the method whose performance was high were accepted as the factors determining the qualifications in Mathematics literacy in Turkey. In this study, the information, which was collected from a total of 4422 students, 215 (49%) of whom were boys and 2257 (51%) of whom were girls participating in PISA 2015 test, was used. The scores, which the students, having gone in for PISA 2015 test, got from mathematics test, and dependent variables and 25 variables, which were thought to have connection with dependent variables institutionally, were included in the analysis as predictors. As a result of analysis, it was witnessed that Random Forest (RF) method made prediction with smaller errors in terms of a number of performance indicators. The factors that random forest method found important after anxiety variable are Turkish success level of students, mother education level, motivation level, the belief in epistemology, interest level of teachers and class disciplinary environment, respectively. The statistical meaning, significance and impact levels of other variables were tackled together with their details in this study. It is expected that this study will set an example for data mining use in the process of educational studies and that the factors whose affects were found out about the students' mathematics literacy will shed light on National Education system.