Abstract
Machine learning (ML) models recently led to major breakthroughs in predictive tasks in the natural sciences. Yet their benefits for the social sciences are less evident, as even high-profile studies on the prediction of life trajectories have shown to be largely unsuccessful – at least when measured in traditional criteria of scientific success. This paper tries to shed light on this remarkable performance gap. Comparing two social science case studies to a paradigm example from the natural sciences, we argue that, in addition to explanation, prediction is an important goal of social science – and we identify constraints that impede pure ML prediction from being successful in that field. As a remedy, we outline elements of an integrative modelling approach that combines explanatory models and predictive ML models.