Robots in the Middle: Evaluating LLMs in Dispute Resolution

Jinzhe Tan*, Hannes Westermann, Nikhil Reddy Pottanigari, Jaromír Šavelka, Sébastien Meeùs, Mia Godet, Karim Benyekhlef

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

Abstract

Mediation is a dispute resolution method featuring a neutral third-party (mediator) who intervenes to help the individuals resolve their dispute. In this paper, we investigate to what extent large language models (LLMs) are able to act as mediators. We investigate whether LLMs are able to analyze dispute conversations, select suitable intervention types, and generate appropriate intervention messages. Using a novel, manually created dataset of 50 dispute scenarios, we conduct a blind evaluation comparing LLMs with human annotators across several key metrics. Overall, the LLMs showed strong performance, even outperforming our human annotators across key dimensions. Specifically, in 62% of the cases, the LLMs chose intervention types that were rated as better than or equivalent to those chosen by humans. Moreover, in 84% of the cases, the intervention messages generated by the LLMs were rated as better than or equal to the intervention messages written by humans. LLMs likewise performed favourably on metrics such as impartiality, understanding and contextualization. Our results demonstrate the potential of integrating AI in online dispute resolution (ODR) platforms.
Original languageEnglish
Title of host publicationLegal Knowledge and Information Systems - JURIX 2024
Subtitle of host publication37th Annual Conference
EditorsJaromir Savelka, Jakub Harasta, Tereza Novotna, Jakub Misek
PublisherIOS Press
Pages168-179
Number of pages12
Volume395
ISBN (Electronic)9781643685625
DOIs
Publication statusPublished - 1 Jan 2024
Event37th Annual Conference on Legal Knowledge and Information Systems, JURIX 2024 - Brno, Czech Republic
Duration: 11 Dec 202413 Dec 2024
https://jurix.nl/

Publication series

SeriesFrontiers in Artificial Intelligence and Applications
Volume395
ISSN0922-6389

Conference

Conference37th Annual Conference on Legal Knowledge and Information Systems, JURIX 2024
Abbreviated titleJURIX 2024
Country/TerritoryCzech Republic
CityBrno
Period11/12/2413/12/24
Internet address

Keywords

  • access to justice
  • ai & law
  • artificial intelligence
  • chatgpt
  • large language models
  • online dispute resolution

Fingerprint

Dive into the research topics of 'Robots in the Middle: Evaluating LLMs in Dispute Resolution'. Together they form a unique fingerprint.

Cite this