Modelling Human Values for Value-Aware Multi-Agent Systems

Year
2024
Hosting
Excerpt only (CC-BY-NC-SA-4.0)

Attribution

Abstract & full text
arxiv.org/abs/2402.06359 (CC-BY-NC-SA-4.0)

Abstract

One of today's most pressing societal challenges is building AI systems whose behaviour, or the behaviour it enables within communities of interacting human and artificial agents, aligns with relevant human values. To address this challenge, we propose a formal computational framework for representing human values that provides the foundational structures required for value-aware reasoning in multi-agent systems. To our knowledge, this has not been attempted as yet, which is surprising given the growing volume of research integrating human values into AI systems. Taking as our starting point the wealth of research in human values from the field of social psychology, we set out to provide a formal model which captures value relations, value importance, and computational semantics in order to support the evaluation of behaviour with respect to values and the development of value-aware decision-making mechanisms in agent-based systems. We demonstrate how the model supports the evaluation of behaviour in terms of value alignment across a real-world scenario, establishing a bridge between abstract human values and concrete agent behaviour. We illustrate how our model captures key concepts from social psychology research and outline a roadmap for incorporating values as first-class constructs in multi-agent systems.
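
As a rough illustration of what "evaluating behaviour with respect to values" could look like in code, the sketch below pairs each value with an importance weight and a semantics function that scores a world state, then measures how a state change affects the weighted satisfaction. It is not the paper's formalism: the `Value` class, the `alignment` function, the state keys, and the numeric weights are all hypothetical names and choices made only for this example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# A world state is modelled here as a flat mapping of named measurements
# (an assumption made for this sketch, not taken from the paper).
State = Dict[str, float]


@dataclass(frozen=True)
class Value:
    """Hypothetical value record: a name, an importance weight, and a
    semantics function giving the degree to which a state satisfies it."""
    name: str
    importance: float
    satisfaction: Callable[[State], float]


def alignment(values: List[Value], before: State, after: State) -> float:
    """Importance-weighted mean change in satisfaction from `before` to `after`.

    Positive results mean the behaviour that caused the transition promoted
    the values overall; negative results mean it demoted them.
    """
    total_importance = sum(v.importance for v in values)
    if total_importance == 0:
        return 0.0
    weighted_change = sum(
        v.importance * (v.satisfaction(after) - v.satisfaction(before))
        for v in values
    )
    return weighted_change / total_importance


# Toy scenario with two illustrative values and made-up measurements.
fairness = Value("fairness", 0.75, lambda s: 1.0 - s["income_gap"])
wealth = Value("wealth", 0.25, lambda s: s["mean_income"])

before = {"income_gap": 0.5, "mean_income": 0.5}
after = {"income_gap": 0.25, "mean_income": 0.5}

score = alignment([fairness, wealth], before, after)
print(score)
```

In this toy run the behaviour narrows the income gap without changing mean income, so the score is positive and driven entirely by the fairness term. A weighted average is only one possible aggregation; a model that also captures relations between values would need a richer structure than a flat list.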