中国安全科学学报 ›› 2026, Vol. 36 ›› Issue (6): 222-228.doi: 10.16265/j.cnki.issn1003-3033.2026.06.0640

• 公共安全与应急管理 • 上一篇    下一篇

数据-知识融合驱动的城市供水管网风险评估

李泽玉1(), 李素贞1,2,**()   

  1. 1 同济大学 土木工程学院, 上海 200092
    2 同济大学 土木工程防灾减灾全国重点实验室, 上海 200092
  • 收稿日期:2025-12-02 修回日期:2026-03-14 出版日期:2026-06-28
  • 通信作者:
    ** 李素贞(1978—),女,江西赣州人,博士,研究员,主要从事生命线工程智能运维、结构健康监测、光纤传感等方面的研究。E-mail:
  • 作者简介:

    李泽玉 (1997—),男,河北石家庄人,博士,主要从事生命线工程智能运维。E-mail:

  • 基金资助:
    国家自然科学基金资助(52378525); 国家重点研发计划(2024YFC3808800)

Integration of data and knowledge for risk assessment of urban water supply networks

Li Zeyu1(), Li Suzhen1,2,**()   

  1. 1 School of Structural Engineering, Tongji University, Shanghai 200092, China
    2 State Key Laboratory of Disaster Research in Civil Engineering, Tongji University, Shanghai 200092, China
  • Received:2025-12-02 Revised:2026-03-14 Published:2026-06-28

摘要:

针对现有数据驱动方法常因数据质量问题导致模型鲁棒性不足,且常忽视专家先验知识价值的问题,提出一种基于高斯过程分类(GPC)与高斯混合模型(GMM)的数据-知识融合的城市供水管网风险评估方法。首先,通过专家打分规则获取风险评分,作为知识融合的基础;其次,将评分与管道物理属性、运行环境等客观属性共同构成特征矩阵,采用GPC模型预测各管段的失效概率;然后,聚类分析预测概率分布,应用GMM科学划分风险等级;最后,基于加拿大Kitchener市管段及失效记录的数据集开展案例验证。结果表明:所提数据-知识融合模型在不同数据场景下均表现出色,在1%更换率下,最高可避免30%的潜在故障;风险排序的曲线下面积为0.94;风险分级的卡方系数大于0.1,决定系数超过0.7;加性解释方法 (SHAP) 分析验证了专家打分的关键作用。

关键词: 数据-知识融合, 城市供水管网, 风险评估, 高斯过程分类(GPC), 高斯混合模型(GMM), 加性解释方法 (SHAP)

Abstract:

To address the insufficient robustness of existing data-driven methods caused by data quality problems and the frequent neglect of expert prior knowledge, a data-knowledge fusion risk assessment method for urban water supply networks was proposed based on GPC and GMM. First, risk scores were obtained by expert scoring rules and were used as the basis for knowledge fusion. Second, a feature matrix was constructed by combining the scores with objective attributes, including pipeline physical properties and operating environment. A GPC model was used to predict the failure probability of each pipe segment. Third, cluster analysis was performed on the predicted probability distribution, and a Gaussian mixture model was applied to classify risk levels scientifically. Finally, experiments were conducted on a dataset of pipe segments and failure records from Kitchener, Canada. The results show that the proposed data-knowledge fusion model performs well under different data scenarios, and at a 1% replacement rate, up to 30% of potential failures are prevented. The area under the curve of 0.94 is achieved for risk ranking. The chi-square coefficient for risk classification exceeds 0.1, and the coefficient of determination is greater than 0.7. SHAP analysis verifies the key role of expert scoring.

Key words: data-knowledge fusion, urban water supply network, risk assessment, gaussian process classification (GPC), Gaussian mixture model (GMM), SHapley additive explanations(SHAP)

中图分类号: