中国安全科学学报 ›› 2022, Vol. 32 ›› Issue (6): 123-130.doi: 10.16265/j.cnki.issn1003-3033.2022.06.2634

• 安全工程技术 • 上一篇    下一篇

基于国内外铁路运营数据的列车运行时间预测模型

唐涛1(), 甘婧2   

  1. 1 中铁第四勘察设计院集团有限公司,湖北 武汉 430063
    2 南京邮电大学 现代邮政学院,江苏 南京 210003
  • 收稿日期:2022-01-10 修回日期:2022-04-20 出版日期:2022-06-28 发布日期:2022-12-28
  • 作者简介:

    唐 涛 (1991—),男,江苏淮安人,硕士,工程师,主要从事境外铁路工程研究。E-mail:

    甘婧,讲师

  • 基金资助:
    铁四院科研课题(2018K094)

A train running time prediction model based on domestic and foreign railway operation data

TANG Tao1(), GAN Jing2   

  1. 1 China Railway Siyuan Survey and Design Group Co., Ltd., Wuhan Hubei 430063,China
    2 School of Modern Posts, Nanjing University of Posts and Telecommunications, Nanjing Jiangsu 210003,China
  • Received:2022-01-10 Revised:2022-04-20 Online:2022-06-28 Published:2022-12-28

摘要:

为了准确预测列车运行时间,考虑铁路调度工作准确性和及时性需求,构建列车区间运行时间预测模型。首先,提取并分析列车时刻表数据,确定可能影响列车区间运行时间的因素;考虑列车运行时间数据分布的偏态性,引入Box-Cox转换正态化处理数据;然后,基于决策树以及网格搜索算法,分别优化模型输入特征和超参数,提升模型性能;最后,应用装箱梯度提升树(HGBT),基于优化后的特征和超参数,建立列车区间运行时间预测模型,并分别利用我国某铁路线路和欧洲某铁路线路的运营数据评估各阶段工作。结果表明:Box-Cox转换可显著提高数据正态性,提升列车运行时间预测模型的拟合效果;网格搜索算法可同时提高列车运行时间模型效率和精度;相较于其他常用的运行时间预测模型,HGBT模型具有高精度和高效率的优势。

关键词: 铁路运营数据, 列车运行时间, 预测模型, Box-Cox转换, 决策树, 装箱梯度提升树(HGBT)

Abstract:

In order to accurately predict train running time while considering requirement on accuracy and timeliness of its operation, a time prediction model was established. Firstly, data of railway timetable were extracted and analyzed to determine possible influencing factors of its running time, and Box-Cox transformation was used to normalize operation data considering their non-normality. Then, the model's input features and hyper-parameters were optimized based on decision tree and grid-search algorithm, respectively, and its performance was improved. Finally, a train running time prediction model was established by adopting HGBT based on optimized input features and hyper-parameters, and operation data of a Chinese and European railway were utilized to evaluate works at each stage. The results show that Box-Cox transformation can significantly improve data normality and goodness-of-fit of the prediction model, while grid-search algorithm can simultaneously improve the model's efficiency and accuracy. Compared with other commonly used machine learning algorithms for train running time prediction, the proposed HGBT model features higher accuracy and efficiency.

Key words: railway operation data, train running time, prediction model, Box-Cox transformation, decision tree, histogram-based gradient boosting tree (HGBT)