Abstract
What makes safety claims about general purpose AI systems, such as large language models, trustworthy? We show that it is not the capabilities of security tools such as alignment and red teaming procedures, but rather the security practices built on these tools, that have reconfigured the image of AI safety and made such claims acceptable. After showing what causes the gap between the capabilities of security tools and the desired safety guarantees, we critically investigate how AI security practices attempt to fill this gap and identify several shortcomings in diversity and participation. We find that these security practices are part of securitization processes that aim to support the (commercial) development of general purpose AI systems whose trustworthiness can only be imperfectly tested rather than guaranteed. We conclude by offering several improvements to current AI security practices.