  1.
    Beyond generalization: a theory of robustness in machine learning. Thomas Grote & Timo Freiesleben - 2023 - Synthese 202 (4):1-28.
    The term robustness is ubiquitous in modern Machine Learning (ML). However, its meaning varies depending on context and community. Researchers either focus on narrow technical definitions, such as adversarial robustness, natural distribution shifts, and performativity, or they simply leave open what exactly they mean by robustness. In this paper, we provide a conceptual analysis of the term robustness, with the aim of developing a common language that allows us to weave together different strands of robustness research. We define robustness as (...)
    8 citations
  2.
    The Intriguing Relation Between Counterfactual Explanations and Adversarial Examples. Timo Freiesleben - 2021 - Minds and Machines 32 (1):1-33.
    The same method that creates adversarial examples (AEs) to fool image classifiers can be used to generate counterfactual explanations (CEs) that explain algorithmic decisions. This observation has led researchers to consider CEs as AEs by another name. We argue that the relationship to the true label and the tolerance with respect to proximity are two properties that formally distinguish CEs and AEs. Based on these arguments, we introduce CEs, AEs, and related concepts mathematically in a common framework. Furthermore, we show connections between current (...) A minimal formal sketch of the distinction is given after this entry.
    4 citations
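    As a rough formal sketch of the distinction the abstract names (simplified from standard formulations in the literature, not quoted from the paper itself): for a classifier $f$, an input $x$ with true label $y$, a distance $d$, and a perceptibility tolerance $\varepsilon$, an adversarial example must be misclassified relative to the true label while staying within the tolerance,

        \[ x_{\mathrm{AE}} \in \{\, x' : f(x') \neq y \ \text{and} \ d(x, x') \leq \varepsilon \,\}, \]

    whereas a counterfactual explanation minimizes proximity subject to reaching a desired prediction $y'$, with no reference to the true label and no hard tolerance,

        \[ x_{\mathrm{CE}} \in \operatorname*{arg\,min}_{x'} \ d(x, x') \quad \text{subject to} \quad f(x') = y'. \]

    On this reading, the two formal differences are visible directly: AEs are defined against the true label $y$ and bounded by $\varepsilon$, while CEs are defined against a desired outcome $y'$ and merely penalize distance.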
  3.
    Scientific Inference with Interpretable Machine Learning: Analyzing Models to Learn About Real-World Phenomena. Timo Freiesleben, Gunnar König, Christoph Molnar & Álvaro Tejero-Cantero - 2024 - Minds and Machines 34 (3):1-39.
    To learn about real-world phenomena, scientists have traditionally used models with clearly interpretable elements. However, modern machine learning (ML) models, while powerful predictors, lack this direct elementwise interpretability (e.g. neural network weights). Interpretable machine learning (IML) offers a solution by analyzing models holistically to derive interpretations. Yet, current IML research is focused on auditing ML models rather than leveraging them for scientific inference. Our work bridges this gap, presenting a framework for designing IML methods, termed 'property descriptors', that illuminate not just (...) A toy illustration of the general IML workflow is given after this entry.
    1 citation
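    As a toy illustration of the general IML workflow the abstract describes, analyzing a fitted black-box model to derive an interpretation, here is a short sketch using permutation feature importance from scikit-learn. This is a generic, well-known IML method, not the paper's 'property descriptor' framework; the dataset and model choices are arbitrary.

        from sklearn.datasets import load_diabetes
        from sklearn.ensemble import RandomForestRegressor
        from sklearn.inspection import permutation_importance
        from sklearn.model_selection import train_test_split

        # Fit a black-box predictor on a standard regression dataset.
        X, y = load_diabetes(return_X_y=True, as_frame=True)
        X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
        model = RandomForestRegressor(random_state=0).fit(X_train, y_train)

        # Interpret the *model*, not the raw data: permutation importance
        # measures how much held-out performance drops when one feature
        # is shuffled, a model-agnostic property of the fitted predictor.
        result = permutation_importance(model, X_test, y_test,
                                        n_repeats=10, random_state=0)
        ranked = sorted(zip(X.columns, result.importances_mean),
                        key=lambda t: -t[1])
        for name, mean in ranked:
            print(f"{name}: {mean:.3f}")

    Whether such a model property licenses a claim about the underlying phenomenon, rather than only about the model, is exactly the inferential gap the paper addresses.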