TY - GEN
T1 - Knowledge Base Construction from Pre-trained Language Models by Prompt Learning
AU - Ning, Xiao
AU - Celebi, Remzi
N1 - Funding Information:
Thanks to Shuai Wang, an excellent software engineer from Amazon, who introduced several practical scripts that automated running the code and significantly increased the efficiency of the experiments. Furthermore, the experimental part of this research was made possible, in part, using the Data Science Research Infrastructure (DSRI) hosted at Maastricht University.
Publisher Copyright:
© 2022 Copyright for this paper by its authors.
PY - 2022/1/1
Y1 - 2022/1/1
N2 - Pre-trained language models (LMs) have advanced the state of the art for many semantic tasks and have also proven effective for extracting knowledge from the models themselves. Although several works have explored the capability of LMs for constructing knowledge bases, including through prompt learning, this potential has not yet been fully exploited. In this work, we propose a method for extracting factual knowledge from LMs for given subject-relation pairs and explore the most effective strategy for generating the blank object entities for each relation of the triples. We design prompt templates for each relation using personal knowledge and descriptive information available on the web, such as Wikidata. Our LM probing approach is tested on the dataset provided by the International Semantic Web Conference (ISWC 2022) LM-KBC Challenge. To cope with the varying performance across relations, we design a parameter selection strategy for each relation. On the test dataset, we obtain an F1-score of 49.35%, which is higher than the baseline of 31.08%.
AB - Pre-trained language models (LMs) have advanced the state of the art for many semantic tasks and have also proven effective for extracting knowledge from the models themselves. Although several works have explored the capability of LMs for constructing knowledge bases, including through prompt learning, this potential has not yet been fully exploited. In this work, we propose a method for extracting factual knowledge from LMs for given subject-relation pairs and explore the most effective strategy for generating the blank object entities for each relation of the triples. We design prompt templates for each relation using personal knowledge and descriptive information available on the web, such as Wikidata. Our LM probing approach is tested on the dataset provided by the International Semantic Web Conference (ISWC 2022) LM-KBC Challenge. To cope with the varying performance across relations, we design a parameter selection strategy for each relation. On the test dataset, we obtain an F1-score of 49.35%, which is higher than the baseline of 31.08%.
KW - Information Extraction
KW - Link Prediction
KW - Pre-trained language model
KW - Prompt learning
M3 - Conference article in proceedings
VL - 3274
T3 - CEUR Workshop Proceedings
SP - 46
EP - 54
BT - Knowledge Base Construction from Pre-trained Language Models 2022
T2 - 2022 Semantic Web Challenge on Knowledge Base Construction from Pre-Trained Language Models
Y2 - 1 January 2022 through 1 October 2022
ER -