交通运输系统工程与信息 ›› 2011, Vol. 11 ›› Issue (4): 61-76.

• 系统工程理论与方法 • 上一篇    下一篇

调查问卷回答缺项对交通方式选择模型的影响:发展中国家城市的比较分析(英文)

李 纲 * ,  藤原章正,  张峻屹   

  1. 广岛大学,大学院国际协力研究科, 广岛 739-0025, 日本
  • 收稿日期:2010-12-13 修回日期:2011-02-24 出版日期:2011-08-25 发布日期:2011-11-28
  • 作者简介:李纲(1982-),男,博士生.

Effects of Item Nonresponses on the Performance of Travel Mode Choice Models for Developing Cities: A Comparative Analysis

LI Gang , FUJIWARA Akimasa, ZHANG Jun-yi   

  1. Graduate School for International Development and Cooperation, Hiroshima University, Hiroshima 739-0025, Japan
  • Received:2010-12-13 Revised:2011-02-24 Online:2011-08-25 Published:2011-11-28

摘要: 为提高交通规划与决策质量,有必要妥善处理交通调查无回答问题. 但是在相关研究中尤其在发展中国家此问题常被忽视. 本文统计分析了发展中城市居民出行调查中项目无回答的缺失模式,进一步分析项目无回答作为模型缺失数据对基于期望最大化的数据修补的多项Logit集计的出行方式选择模型的影响. 选择三个发展中城市代表调查无回答的三个等级做案例分析. 首先,根据项目无回答缺失模式,利用Pearson开方检验得出重要的出行者社会经济属性和出行背景因素. 其次,利用基于缺失模式分析的期望最大化估计,处理缺失数据得到无偏数据集作为基准. 此外,利用独立样本的T检验检验了零假设,即有无期望最大化估计情况下的模型参数估计是相同的. 用基于R平方系数的内在效度检验计算估计情况下的模型预测偏差. 最后,考虑缺失数据的影响,评估一个重要的指标——出行时间价值. 结果表明,在发展中城市,问卷回答者和不回答者的社会经济背景存在很大差异. 缺失率和缺失模式均对出行方式选择模型的参数和预测精度有很大影响. 考虑缺失模式的出行时间价值计算表明, 项目无回答对出行时间价值的影响被过高估计了.

关键词: 城市交通, 缺失模式, 期望最大化修补方法, 出行方式选择, 多项Logit集计模型, 发展中城市, 出行时间价值

Abstract: To enhance the quality of transportation planning and policy making, it is necessary to properly deal with the nonresponse issues in transport surveys. However, such nonresponse issues especially in developing countries have been ignored in literature. This paper first statistically identifies the missing patterns of item nonresponse (INR) in person trip survey data collected in developing cities and then analyzes the effects of INR on the performance of travel mode choice model (an aggregated multinomial logit model) based on expectation-maximization (EM) imputation method. As a case study, three developing cities representing three levels of INR are analyzed as follow. Firstly, the statistically significant social-economic attributes of trip makers and trip-context factors are identified with respect to INR in the missing pattern analysis by using Chi-Square test method. Secondly, EM imputation based on missing pattern analysis is applied to deal with missing data to obtain the unbiased data set as a benchmark. Thirdly, the null hypothesis that the model parameters estimated with and without imputation are equal is statistically tested using independent-sample T tests and further the internal validity performed in terms of R-squared coefficients is used to identify the discrepancy of model predictions between with and without imputations. Finally, one critical indicator-value of travel time (VOTT) is evaluated considering the effects of missing data. The results confirm that the respondents and non-respondents are quite different in terms of the social-economic background in the developing cities and further show that not only the missing rates but also the missing patterns greatly affect the performance of mode choice model in terms of model parameters and the prediction ability. The calculation of VOTT reveals that the VOTT affected by INR tends to be overestimated.

Key words: urban traffic, missing pattern, EM imputation, travel mode choice, aggregated multinomial logit model, developing cities, value of travel time

中图分类号: