Abstract
This paper constructs a predictive model of student reading literacy based on data from students who participated in the Program for International Student Assessment from four provinces/municipalities of China, i.e., Beijing, Shanghai, Jiangsu and Zhejiang. We calculated the contribution of influencing factors in the model by using eXtreme Gradient Boosting algorithm and sHapley additive exPlanations values, and get the following findings: Factors that have the greatest impact on students’ reading literacy are from individual and family levels, with school-level factors taking a relative back seat. The most important influencing factors at individual level are reading metacognition and reading interest. The most important factors at family level are ESCS and language environment, and dialect is negative for reading literacy, whereas proficiency in both a dialect and Mandarin plays a positive role. At the school level, the most important factors are time dedicated to learning and class discipline, and we found that there is an optimal value for learning time, which suggests that reasonable learning time is beneficial, but overextended learning time may make academic performance worse instead of improving it.