Possibilities and challenges in the moral growth of large language models: a philosophical perspective

Ethics and Information Technology 27 (1):1-11 (2025)

Abstract

With the rapid expansion of parameters in large language models (LLMs) and the application of Reinforcement Learning from Human Feedback (RLHF), the moral competence of LLMs has grown noticeably. However, several questions warrant further exploration: Is it really possible for LLMs to fully align with human values through RLHF? How can their current moral growth be philosophically contextualized? We identify similarities between the moral growth of LLMs and Deweyan ethics with respect to the discourse of human moral development. We then use Dewey’s theory, on an experimental basis, to examine and further explain the extent to which the current alignment pathway enables the development of LLMs. A beating experiment serves as the foundational case for analyzing the moral competence of LLMs across parameter scales and across stages including basic moral cognition, moral dilemma judgment, and moral behavior. The results demonstrate that the moral competence of the GPT series has improved significantly, and Dewey’s Impulse-Habit-Character theory of moral development can explain why: the moral competence of LLMs has been enhanced through experience-based learning supported by human feedback. Nevertheless, the moral development of LLMs through RLHF remains constrained and does not reach the character stage described by Dewey, possibly because they lack self-consciousness. This fundamental difference between humans and LLMs underscores both the limits of LLMs’ moral growth and the challenges of applying RLHF for AI alignment, and it highlights the need for external societal governance and legal regulation.
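Because the abstract’s argument turns on what RLHF actually does to a model, it may help to recall the standard formulation of that alignment pathway. The sketch below assumes the widely cited InstructGPT-style setup; the paper’s own notation and experimental prompts are not reproduced here. A reward model r_φ is first fit to human pairwise preferences, and the policy π_θ is then optimized against that learned reward under a KL penalty that keeps it close to the pre-RLHF reference model:

```latex
% Hedged sketch of the standard RLHF objectives (InstructGPT-style assumption;
% this is not the paper's own notation).
% Step 1: reward model r_phi fit to human preferences, where y_w is the response
% annotators preferred over y_l for prompt x:
\mathcal{L}_{\mathrm{RM}}(\phi) =
  -\,\mathbb{E}_{(x,\,y_w,\,y_l)\sim\mathcal{D}}
  \left[\log \sigma\!\left(r_\phi(x, y_w) - r_\phi(x, y_l)\right)\right]

% Step 2: the policy \pi_\theta is tuned to maximize the learned reward, while a
% KL penalty (weight \beta) keeps it near the supervised reference model \pi_{\mathrm{ref}}:
\max_{\theta}\;
  \mathbb{E}_{x\sim\mathcal{D},\; y\sim\pi_\theta(\cdot\mid x)}
  \left[r_\phi(x, y)\right]
  \;-\;\beta\,
  \mathbb{E}_{x\sim\mathcal{D}}
  \left[\mathrm{KL}\!\left(\pi_\theta(\cdot\mid x)\,\|\,\pi_{\mathrm{ref}}(\cdot\mid x)\right)\right]
```

Read against Dewey’s Impulse-Habit-Character scheme, this mechanism amounts to habit formation driven by external preference signals rather than self-directed character development, which is consistent with the limitation the abstract identifies.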

