中国安全科学学报 ›› 2026, Vol. 36 ›› Issue (3): 66-73.doi: 10.16265/j.cnki.issn1003-3033.2026.03.0874

• 安全技术与工程 • 上一篇    下一篇

融合LLM-RAG-KG的电力生产安全事故问答大模型*

晋良海1,2(), 张倩2,3, 徐童欣1,2, 陈云1,2, 彭仲文4   

  1. 1 三峡大学 水电工程施工与管理湖北省重点实验室, 湖北 宜昌 443002
    2 三峡大学 水利与环境学院, 湖北 宜昌 443002
    3 三峡大学 经济与管理学院, 湖北 宜昌 443002
    4 中国铁建大桥工程局集团有限公司, 天津 300300
  • 收稿日期:2025-09-10 修回日期:2025-12-15 出版日期:2026-03-31
  • 作者简介:

    晋良海 (1973—),男,四川简阳人,教授,博士生导师,全国水利工程专业学位优秀指导教师,主要从事安全工效学研究。 E-mail:

    张 倩,工程师。

  • 基金资助:
    国家自然科学基金资助(52179136); 教育部人文社科规划基金资助(21YJA630038); 中国铁建大桥工程局集团有限公司科研项目(SDHZ2022012); 湖北省自然科学基金资助(2024AFD153)

A large model for analyzing power production safety accidents integrating LLM, RAG and KG

JIN Lianghai1,2(), ZHANG Qian2,3, XU Tongxin1,2, CHEN Yun1,2, PENG Zhongwen4   

  1. 1 Hubei Key Laboratory of Construction and Management in Hydropower Engineering, China Three Gorges University, Yichang Hubei 443002, China
    2 College of Hydraulic & Environmental Engineering, China Three Gorges University, Yichang Hubei 443002, China
    3 School of Economics and Management, China Three Gorges University, Yichang Hubei 443002, China
    4 China Railway Construction Bridge Engineering Group Co., Ltd., Tianjin 300300, China
  • Received:2025-09-10 Revised:2025-12-15 Published:2026-03-31

摘要:

为解决传统分析方法在面对电力系统多因素非线性交互的复杂特性时存在专业知识整合不足、因果推理可解释性弱等固有局限,提出一种融合大型语言模型(LLM)、检索增强生成(RAG)和知识图谱(KG)的电力生产安全事故分析大模型;构建包含知识检索、知识推理、答案生成与效果评估4个核心模块框架:基于RAG技术从专业文本中精准检索相关知识,利用KG结构化推理事故实体和关系,以弥补检索盲区;通过LLM生成专业、可解释的事故因果分析答案,通过主观专家评分与自动评估指标(ROUGE)、双语替换学习(BLEU)等客观指标全面评估系统。结果表明:在电力生产安全事故分析场景中,RAG 与 KG 的知识增强技术对具备一定规模参数的基础模型有普适性性能提升,能帮助模型精准捕捉设备故障传导链等专业关联,提升事故致因挖掘与结果演化推理质量;DeepSeek-R1、Qwen2.5-72B等大模型在该模式下解析专业术语、梳理多因素关联的准确性显著提高,其中 DeepSeek-R1 综合评分达 4.05 分,更满足领域精度要求;增强效果存在模型能力阈值,Qwen2.5-72B 增强后能高效解析跨区域电网故障联动等复杂逻辑,且平衡性能与部署成本,适配企业实际需求,而 Qwen2.5-14B 等小模型因基础推理能力有限,引入外部知识后难处理专业信息,性能下降,无法满足专业性要求。

关键词: 大型语言模型(LLM), 检索增强生成(RAG), 知识图谱(KG), 电力生产安全事故, 事故分析, DeepSeek-R1

Abstract:

In order to address the inherent limitations of traditional analysis methods—such as insufficient integration of professional knowledge and weak interpretability of causal reasoning—when dealing with the complex characteristics of multi-factor nonlinear interactions in power systems, a large model for power production safety accident analysis was proposed that integrates LLM, RAG, and KG. A framework with four core modules was built: knowledge retrieval, knowledge reasoning, answer generation, and performance evaluation. RAG technology was used to accurately retrieve relevant knowledge from professional texts, and KG was leveraged for structured reasoning on accident entities and relationships to make up for retrieval blind spots. Finally, LLM was employed to generate professional and interpretable answers for accident causal analysis. The study comprehensively evaluated the system through subjective expert scoring and objective metrics like ROUGE and BLEU, and results show that in the scenario of power production safety accident analysis, the knowledge enhancement technology of RAG and KG provides universal performance improvement for basic models with a certain parameter scale—it helps models accurately capture professional correlations such as equipment fault transmission chains and enhances the quality of accident cause mining and result evolution reasoning. Large models including DeepSeek-R1 and Qwen2.5-72B significantly improved in the accuracy of parsing professional terms and organizing multi-factor correlations under this mode, among which DeepSeek-R1 achieved a comprehensive score of 4.05, better meeting the precision requirements of the field; meanwhile, there is a model capability threshold for the enhancement effect: after enhancement, Qwen2.5-72B can efficiently parse complex logics like cross-regional power grid fault linkage, balances performance and deployment costs, and is suitable for enterprises' practical needs, while smaller models such as Qwen2.5-14B, due to limited basic reasoning capabilities, fail to process professional information effectively after introducing external knowledge, leading to performance degradation and inability to meet professional requirements.

Key words: large language models (LLM), retrieval-augmented generation (RAG), knowledge graph (KG), electric power production safety, accident analysis, DeepSeek-R1

中图分类号: