基于知识提示的应急预案少样本关系抽取方法

doi:10.16265/j.cnki.issn1003-3033.2024.12.0308

摘要/Abstract

摘要：

为从少样本应急预案文本中精准、快速实现关系抽取,提出一种基于知识提示的K最近邻关系抽取模型(KMKP)。首先,使用融入关系语义的可学习实体类型标记构建提示模板,强化输入对预训练语言模型(PLM)的提示引导效果;其次,利用边界损失函数优化模型训练,使PLM学习应急领域下的特定依赖关系,实现对PLM中掩码标记符[MASK]预测的结构化约束;然后,以训练数据创建无梯度应急知识存储数据库,结合K最近邻(KNN)算法构建知识查询机制,捕捉训练数据和预测数据之间的特征联系,无梯度范式校正PLM的预测结果;最后,在4个公开数据集的少样本设置下(1-,8-,16-shot)进行试验验证与分析。结果表明:KMKP对比最好模型KnowPrompt,F₁值平均提升2.1%、2.8%、1.9%。在少样本(16-shot)应急预案实例测试中,KMKP关系抽取准确率达到91.02%,KMKP能有效缓解少样本场景下模型的灾难性遗忘和过拟合问题。

关键词: 知识提示, 少样本, 应急预案, 关系抽取, 数据增强, K最近邻(KNN)关系抽取模型(KMKP)

Abstract:

In order to accurately and quickly achieve relation extraction from few-shot emergency plan texts, KMKP based on knowledge prompts was proposed. First, a prompt template was constructed, utilizing learnable typed entity markers that incorporate relation semantics. The effectiveness of input guidance on the pre-trained language model (PLM) was thereby enhanced by these markers. Second, the boundary loss function was utilized to optimize model training, enabling the PLM to learn specific dependency relationships in the emergency domain and apply structured constraints to [MASK] predictions. Third, a gradient-free emergency knowledge storage database was created using the training data, and a knowledge retrieval mechanism was constructed by integrating KNN algorithm. The feature connections between training and prediction data can be captured through this mechanism and the gradient-free normation was used to correct the predictions of PLM. Finally, the experimental validation and analysis were performed using four public datasets under few-shot settings (1-, 8-, and 16-shot). The results show that compared to the state-of-the-art model, KnowPrompt, F1 score is boosted by an average of 2.1%, 2.8%, and 1.9% by KMKP. In a 16-shot emergency plan instance test, a relation extraction accuracy of 91.02% is achieved by KMKP. Catastrophic forgetting and overfitting issues in few-shot scenarios are effectively mitigated.

Key words: knowledge-prompted, few-shot, emergency plan, relation extraction, data augmentation, k-nearest neighbor(KNN) relationship extraction model based on knowledge prompts (KMKP)

中图分类号:

X913安全系统学

张凯, 陈强, 倪凯, 张玉金. 基于知识提示的应急预案少样本关系抽取方法[J]. 中国安全科学学报, 2024, 34(12): 213-222.

ZHANG Kai, CHEN Qiang, NI Kai, ZHANG Yujin. Knowledge-prompted few-shot relation extraction for emergency plan texts[J]. China Safety Science Journal, 2024, 34(12): 213-222.

图/表 14

图1

表1

表2

表3

表4

表5

表6

表7

表8

图2

图3

图4

图5

图6

参考文献 27

[1]	高光涵. 总体应急预案的府际差异与量化评价:基于29个省级预案文本的比较分析[J]. 北京工业大学学报:社会利学版, 2023, 23(6):113-128.
	GAO Guanghan. Differences and quantitative evaluation of intergovernmental overall emergency plans-comparative analysis based on the texts of 29 provincial plan[J]. Journal of Beijing University of Technology: Social Sciences Edition, 2023, 23(6):113-128.
[2]	冯双剑, 李尧远. 应急管理学科建设调查分析及建议[J]. 中国应急管理, 2022(8):66-77.
[3]	杨继星, 房玉东, 边路, 等. 应急救援数字化战场体系研究与应用探索[J]. 中国安全科学学报, 2023, 33(10):240-246. doi: 10.16265/j.cnki.issn1003-3033.2023.10.2199
	YANG Jixing, FANG Yudong, BIAN Lu, et al. Research and application exploration of digital battlefield system for emergency rescue[J]. China Safety Science Journal, 2023, 33(10):240-246. doi: 10.16265/j.cnki.issn1003-3033.2023.10.2199
[4]	宋敦江, 杨霖, 钟少波. 基于BERT的灾害三元组信息抽取优化研究[J]. 中国安全科学学报, 2022, 32(2):115-120. doi: 10.16265/j.cnki.issn1003-3033.2022.02.016
	SONG Dunjiang, YANG Lin, ZHONG Shaobo. Research on optimization of disaster triplet information extraction based on BERT[J]. China Safety Science Journal, 2022, 32(2):115-120. doi: 10.16265/j.cnki.issn1003-3033.2022.02.016
[5]	王浩畅, 刘如意. 基于预训练模型的关系抽取研究综述[J]. 计算机与现代化, 2023(1):49-57,94.
	WANG Haochang, LIU Ruyi. Review of relation extraction based on pre-training language model[J]. Computer and Modernization, 2023(1):49-57,94.
[6]	WANG Lihu, LIU Xuemei, LIU Yang, et al. Emergency entity relationship extraction for water diversion project based on pre-trained model and multi-featured graph convolutional network[J]. Plos One, 2023, 18(10):DOI: 10.1371/journal.pone.0292004.
[7]	许娜, 梁燕翔, 王亮, 等. 基于知识图谱的煤矿建设安全领域知识管理研究[J]. 中国安全科学学报, 2024, 34(5):28-35. doi: 10.16265/j.cnki.issn1003-3033.2024.05.0835
	XU Na, LIANG Yanxiang, WANG Liang, et al. Research on knowledge management in coal mine construction safety field based on knowledge graph[J]. China Safety Science Journal, 2024, 34(5):28-35. doi: 10.16265/j.cnki.issn1003-3033.2024.05.0835
[8]	LIU Xuemei, LU Hankang, LI Hairui. Intelligent generation method of emergency plan for hydraulic engineering based on knowledge graph:take the south-to-north water diversion project as an example[J]. LHB-hydroscience Journal, 2022, 108(1):DOI: 10.1080/27678490.2022.2153629.
[9]	BELKIN M, HSU D, MA Siyuan, et al. Reconciling modern machine-learning practice and the classical bias-variance trade-off[J]. Proceedings of the National Academy of Sciences, 2019, 116(32):15849-15 854.
[10]	PENG Hao, GAO Tianyu, HAN Xu, et al. Learning from context or names? an empirical study on neural relation extraction[C]. the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020:3661-3672.
[11]	ZHOW Wenxuan, CHEN Muhao. An improved baseline for sentence-level relation extraction[C]. the 2^nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12^th International Joint Conference on Natural Language Processing (Volume 2:Short Papers),2022:161-168.
[12]	LIU Junbao, QIN Xizhong, MA Xiaoqin, et al. FREDA: few-shot relation extraction based on data augmentation[J]. Applied Sciences, 2023, 13(14):DOI: 10.3390/app13148312.
[13]	NASAR Z, JAFFRY S W, MALIK M K. Named entity recognition and relation extraction: state-of-the-art[J]. ACM Computing Surveys (CSUR), 2021, 54(1):1-39.
[14]	SHIN T, RAZEGHI Y, LOGAN R L, et al. Autoprompt: eliciting knowledge from language models with automatically generated prompts[C]. the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP),2020:4222-4235.
[15]	PETERSON L E. K-nearest neighbor[J]. Scholarpedia, 2009, 4(2):DOI: 10.4249/scholarpedia.1883.
[16]	XU Benfeng, WANG Quan, MAO Zhendong, et al. kNN prompting: beyond-context learning with calibration-free nearest neighbor inference[C]. the 11^th International Conference on Learning Representations,2023: 1-24.
[17]	HUANG Anzhong, XU Rui, CHEN Yu, et al. Research on multi-label user classification of social media based on ML-KNN algorithm[J]. Technological Forecasting and Social Change, 2023,188:DOI: 10.1016/j.techfore.2022.122271.
[18]	HENDRICKX I, KIM S N, KOZAREVA Z, et al. Semeval-2010 task 8: multi-way classification of semantic relations between pairs of nominals[C]. the 5^th International Workshop on Semantic Evaluation, 2010:33-38.
[19]	ZHANG Yuhao, ZHONG Victor, CHEN Danqi, et al. Position-aware attention and supervised data improve slot filling[C]. Conference on Empirical Methods in Natural Language Processing, 2017:35-45.
[20]	黄子麒, 胡建鹏. 实体类别增强的汽车领域嵌套命名实体识别[J]. 计算机应用, 2024, 44(2):377-384. doi: 10.11772/j.issn.1001-9081.2023020239
	HUANG Ziqi, HU Jianpeng. Entity category enhanced nested named entity recognition in automotive domain[J]. Journal of Computer Applications, 2024, 44(2):377-384. doi: 10.11772/j.issn.1001-9081.2023020239
[21]	WU Shanchan, HE Yifan. Enriching pre-trained language model with entity information for relation classification[C]. the 28^th ACM International Conference on Information and Knowledge Management,2019:2361-2364.
[22]	XUE Fuzhao, SUN Aixin, ZHANG Hao, et al. Gdpnet: refining latent multi-view graph for relation extraction[C]. the AAAI Conference on Artificial Intelligence,2021:DOI: 10.48550/arXiv.2012.06780.
[23]	HAN Xu, ZHAO Weilin, DING Ning, et al. Ptr: prompt tuning with rules for text classification[J]. AI Open, 2022, 3:182-192.
[24]	CHEN Xiang, ZHANG Ningyu, XIE Xin, et al. Knowprompt: knowledge-aware prompt-tuning with synergistic optimization for relation extraction[C]. the ACM Web Conference 2022:2778-2788.
[25]	FEDUS W, GOODFELLOW I, DAI A M. Maskgan: better text generation via filling in the[C]. International Conference on Learning Representations, 2018:1-17.
[26]	WEI J. Good-enough example extrapolation[C]. the 2021 Conference on Empirical Methods in Natural Language Processing,2021:5923-5929.
[27]	周义棋, 刘畅, 龙增, 等. 电网应急预案知识图谱构建方法与应用[J]. 中国安全生产科学技术, 2023, 19(1):5-13.
	ZHOU Yiqi, LIU Chang, LONG Zeng, et al. Construction method and application of knowledge graph in emergency plans for power grid[J]. Journal of Safety Science and Technology, 2023, 19(1):5-13.

方法	超参数复杂度	计算复杂度	训练时间复杂度	可解释性
Maskgan	高	极高	极高	弱
SMOTE	低	低	中	弱
GE3	无	低	中	弱
KMKP	无	低	低	强

实例	关系标签	头实体类型	尾实体类型	实体类型先验概率分布p
企业将有关情况报告人民政府	上下级	职责部门	职责部门	p(职责部门)=4/13 p(指挥体系)=2/13 p(工作组)=3/13 p(岗位)=1/13 p(职责部门)=2/13 p(职责内容)=1/13
现场指挥部下设综合组、抢险救援组…	设立	指挥体系	工作组/岗位
市政府分管副市长担任现场指挥部指挥长	担任	部门成员	岗位
市较大生产安全事故应急指挥部成员单位由市委宣传部、市发改委等单位组成	组成单位	指挥体系/工作组	职责部门
市消防救援支队参与事故应急救援和处置工作	执行	职责部门/工作组	职责内容

数据集		train	vel	test	label
中文	CCL2022	2 399	300	301	2
中文	人物关系抽取	10 000	1 000	1 000	12
英文	SemEval	6 507	1 493	2 717	19
英文	TACRED	68 124	22 631	15 509	42

参数	名称	取值
PLM	PLM	中文:roberta-chinese-large 英文:roberta-large
batch_size	训练批次	8
epoch	训练轮次	30
lr	学习率	5e-5
max_length	最大文本长度	256
optimizer	优化器	AdamW
t_beta	边界损失函数权重	0.05
knn_topk	KNN实例数据量	8
knn_lambda	矫正因子权重	0.3
gamma	边界值	1

模型		中文		英文		均值
模型		人物关系抽取	CCL2022	SemEval	TACRED	均值
PLM	Fine-Tuning	72.0	99.7	87.6	68.7	82.0
	R-BERT	73.1	99.3	89.3	69.4	82.8
	GDPNet	74.7	98.6	88.7	71.5	83.4
PT 预训练模型	PTR	—	—	89.9	72.4	—
	KnowPrompt	79.7	99.3	90.2	72.4	85.4
	KMKP	83.2 (+3.5)	99.6 (-0.1)	90.5 (+0.3)	72.8 (+0.4)	86.5 (+1.1)