Using Named Entities for Computer-Automated Verbal Deception Detection

Bennett Kleinberg; Maximilian Mozes; Arnoud Arntz; Bruno Verschuere

doi:10.1111/1556-4029.13645

Corresponding author: Bennett Kleinberg

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

There is an increasing demand for automated verbal deception detection systems. We propose named entity recognition (NER; i.e., the automatic identification and extraction of information from text) to model three established theoretical principles: (i) truth tellers provide accounts that are richer in detail, (ii) contain more contextual references (specific persons, locations, and times), and (iii) deceivers tend to withhold potentially checkable information. We test whether NER captures these theoretical concepts and can automatically identify truthful versus deceptive hotel reviews. We extracted the proportion of named entities with two NER tools (spaCy and Stanford's NER) and compared the discriminative ability to a lexicon word count approach (LIWC) and a measure of sentence specificity (speciteller). Named entities discriminated truthful from deceptive hotel reviews above chance level, and outperformed the lexicon approach and sentence specificity. This investigation suggests that named entities may be a useful addition to existing automated verbal deception detection approaches.
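
As a rough illustration of what "the proportion of named entities" could look like in practice, the sketch below divides entity counts by the number of tokens in a review. The abstract does not state the exact normalization, so the token-count denominator, the function name entity_proportions, and the example labels are assumptions made here for illustration. The entity labels are expected to come from an NER tool such as spaCy or Stanford's NER, which this sketch does not call.

```python
from collections import Counter


def entity_proportions(n_tokens, entity_labels):
    """Return per-label and overall named-entity proportions for one text.

    n_tokens: number of tokens in the text (assumed denominator).
    entity_labels: one label per recognized entity, e.g. ["PERSON", "GPE"].
    """
    if n_tokens <= 0:
        raise ValueError("n_tokens must be positive")
    counts = Counter(entity_labels)
    # Share of tokens accounted for by each entity type.
    proportions = {label: count / n_tokens for label, count in counts.items()}
    # Overall share across all entity types.
    proportions["ALL"] = len(entity_labels) / n_tokens
    return proportions


# Hypothetical input: a 20-token review in which an NER tool found 3 entities.
example = entity_proportions(20, ["PERSON", "GPE", "DATE"])
```

Per-review proportions of this kind could then serve as features for comparing truthful and deceptive reviews. Which classifier or statistical test to apply is not specified in the abstract and is left open here.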

Original language	English
Pages (from-to)	714-723
Number of pages	10
Journal	Journal of Forensic Sciences
Volume	63
Issue number	3
DOIs	https://doi.org/10.1111/1556-4029.13645
Publication status	Published - 2018

Keywords

forensic science
computational linguistics
deception detection
named entity recognition
linguistic inquiry and word count
reality monitoring
criteria-based content analysis
CLASSIFICATION
METAANALYSIS
RECOGNITION
CUES

Cite this

@article{74f0bf0478b44618b8e71218b3a362f3,
  title = "Using Named Entities for Computer-Automated Verbal Deception Detection",
  abstract = "There is an increasing demand for automated verbal deception detection systems. We propose named entity recognition (NER; i.e., the automatic identification and extraction of information from text) to model three established theoretical principles: (i) truth tellers provide accounts that are richer in detail, (ii) contain more contextual references (specific persons, locations, and times), and (iii) deceivers tend to withhold potentially checkable information. We test whether NER captures these theoretical concepts and can automatically identify truthful versus deceptive hotel reviews. We extracted the proportion of named entities with two NER tools (spaCy and Stanford's NER) and compared the discriminative ability to a lexicon word count approach (LIWC) and a measure of sentence specificity (speciteller). Named entities discriminated truthful from deceptive hotel reviews above chance level, and outperformed the lexicon approach and sentence specificity. This investigation suggests that named entities may be a useful addition to existing automated verbal deception detection approaches.",
  keywords = "forensic science, computational linguistics, deception detection, named entity recognition, linguistic inquiry and word count, reality monitoring, criteria-based content analysis, CLASSIFICATION, METAANALYSIS, RECOGNITION, CUES",
  author = "Bennett Kleinberg and Maximilian Mozes and Arnoud Arntz and Bruno Verschuere",
  year = "2018",
  doi = "10.1111/1556-4029.13645",
  language = "English",
  volume = "63",
  pages = "714--723",
  journal = "Journal of Forensic Sciences",
  issn = "0022-1198",
  publisher = "Wiley",
  number = "3",
}

TY - JOUR
T1 - Using Named Entities for Computer-Automated Verbal Deception Detection
AU - Kleinberg, Bennett
AU - Mozes, Maximilian
AU - Arntz, Arnoud
AU - Verschuere, Bruno
PY - 2018
Y1 - 2018
N2 - There is an increasing demand for automated verbal deception detection systems. We propose named entity recognition (NER; i.e., the automatic identification and extraction of information from text) to model three established theoretical principles: (i) truth tellers provide accounts that are richer in detail, (ii) contain more contextual references (specific persons, locations, and times), and (iii) deceivers tend to withhold potentially checkable information. We test whether NER captures these theoretical concepts and can automatically identify truthful versus deceptive hotel reviews. We extracted the proportion of named entities with two NER tools (spaCy and Stanford's NER) and compared the discriminative ability to a lexicon word count approach (LIWC) and a measure of sentence specificity (speciteller). Named entities discriminated truthful from deceptive hotel reviews above chance level, and outperformed the lexicon approach and sentence specificity. This investigation suggests that named entities may be a useful addition to existing automated verbal deception detection approaches.
AB - There is an increasing demand for automated verbal deception detection systems. We propose named entity recognition (NER; i.e., the automatic identification and extraction of information from text) to model three established theoretical principles: (i) truth tellers provide accounts that are richer in detail, (ii) contain more contextual references (specific persons, locations, and times), and (iii) deceivers tend to withhold potentially checkable information. We test whether NER captures these theoretical concepts and can automatically identify truthful versus deceptive hotel reviews. We extracted the proportion of named entities with two NER tools (spaCy and Stanford's NER) and compared the discriminative ability to a lexicon word count approach (LIWC) and a measure of sentence specificity (speciteller). Named entities discriminated truthful from deceptive hotel reviews above chance level, and outperformed the lexicon approach and sentence specificity. This investigation suggests that named entities may be a useful addition to existing automated verbal deception detection approaches.
KW - forensic science
KW - computational linguistics
KW - deception detection
KW - named entity recognition
KW - linguistic inquiry and word count
KW - reality monitoring
KW - criteria-based content analysis
KW - CLASSIFICATION
KW - METAANALYSIS
KW - RECOGNITION
KW - CUES
U2 - 10.1111/1556-4029.13645
DO - 10.1111/1556-4029.13645
M3 - Article
C2 - 28940300
SN - 0022-1198
VL - 63
SP - 714
EP - 723
JO - Journal of Forensic Sciences
JF - Journal of Forensic Sciences
IS - 3
ER -