Abstract
Modeling and analysis of students’ performance is a common task that is aimed at identifying important factors that affect the learning process. Typically, the analysis uses one-dimensional input parameters. However, with the advancement of data collections tools, many of the gathered educational datasets have become high-dimensional. Hence, the use of standard statistical methods may be limited in cases that the initial data unit is a vector. This paper proposes to use vector input units, which consist of student performance trajectories, for identifying statistical differences in college performances for several populations of college students. Two kernel based methods named diffusion maps and the kernel two-sample test are utilized. Diffusion maps generates a low-dimensional representation of the data, in which important characteristic factors are identified. The kernel two-sample test is a statistical test for comparing whether high-dimensional samples are drawn from two different probability distributions. The two methods are combined into a unified framework. Two case studies, which are processed similarly, are presented. The first tests for significant distributional differences between students with or without learning disabilities. Our results show that these groups’ performances is significantly different. The second case-study analyzes whether the SAT score impacts students’ performance throughout their 4-year of studies. It was found that significant distribution differences in performance are only present for groups of students having a very high or a very low SAT score. Thus, the SAT score is only weakly correlated to students’ college performance.
Original language | English |
---|---|
Pages (from-to) | 492-503 |
Number of pages | 12 |
Journal | Engineering Applications of Artificial Intelligence |
Volume | 85 |
DOIs | |
State | Published - Oct 2019 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2019 Elsevier Ltd
Funding
This research was supported by the National Institute for Testing and Evaluation, Israel and by an internal research grant from Afeka-Tel Aviv College of Engineering, Israel . The authors would like to thank Prof. Ronald R. Coifman for fruitful conversations. The authors would like to thank the anonymous reviewers for their detailed comments, which improved the presentation of the paper.
Funders | Funder number |
---|---|
Afeka-Tel Aviv College of Engineering, Israel | |
National Institute for Testing and Evaluation |
Keywords
- Data visualization
- Diffusion maps
- Educational data analysis
- Kernel two-sample test
- Manifold learning