Advancing Association Rule Base on Gini Impurity Statistic for Predicting Transportation Mode Choice

  • Jiajia Zhang
  • Tao Feng
  • Zhengkui Lin
  • Harry J.P. Timmermans

Research output: Contribution to conference › Paper › Academic

Abstract

Recently, machine learning approaches have been applied to predict transportation mode choice as an alternative to the more commonly used discrete choice models. General class association rules (CARs) have been introduced as a promising machine learning method, but the interpretability of the prediction results in terms of the underlying behavioral decision-making process has remained a concern. To improve CARs, this study proposes a more advanced association rule model (named CARGIGI) with stronger interpretability. The model builds on the original CARIG approach, which uses the information gain (IG) statistic to improve predictive accuracy. In the new model, the Gini impurity (GI) statistic is used to generate new rules that improve predictive accuracy and to calculate the relative importance of the variables, the relative importance of the variable levels, and the weight of rules in the transportation mode decision process. The weight of rules is introduced as a new pruning indicator to improve predictive accuracy, while the relative importance of the levels of a variable is used to enhance the behavioral interpretability of the results. The suggested approach is applied to the 2015 Dutch National Travel Survey. Results indicate that travel distance, OV card usage frequency, travel time, and travel purpose are the most important variables, while travel party and gender are the least important variables for predicting transportation mode choice. In addition, a 10-fold cross-validation test is conducted to validate the advanced model. The results show that the newly proposed model outperforms both the selected machine learning algorithms and the multinomial logit (MNL) model.
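
For readers unfamiliar with the statistic, the following is a minimal Python sketch of the standard Gini impurity, 1 - sum_k p_k^2 over the class shares p_k, and of the impurity reduction obtained by splitting on one variable. It is only an illustration of the general idea of ranking variables by impurity reduction. The abstract does not give the paper's exact formulas for variable importance, level importance, or rule weight, so the function names, the reduction measure, and the toy records below are all assumptions and are not taken from the study or the survey data.

```python
from collections import Counter


def gini_impurity(labels):
    """Standard Gini impurity, 1 - sum_k p_k^2, of a sequence of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = Counter(labels)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def gini_reduction(records, variable, target):
    """Decrease in Gini impurity of `target` after grouping records by `variable`.

    Assumption: used here as an illustrative importance score; the paper's
    own importance and rule-weight definitions may differ.
    """
    parent = gini_impurity([r[target] for r in records])
    n = len(records)
    groups = {}
    for r in records:
        groups.setdefault(r[variable], []).append(r[target])
    # Size-weighted impurity of the groups formed by each level of the variable.
    weighted = sum(len(g) / n * gini_impurity(g) for g in groups.values())
    return parent - weighted


# Hypothetical toy records, not data from the Dutch National Travel Survey.
trips = [
    {"distance": "short", "gender": "f", "mode": "bike"},
    {"distance": "short", "gender": "m", "mode": "bike"},
    {"distance": "long", "gender": "f", "mode": "car"},
    {"distance": "long", "gender": "m", "mode": "car"},
]

for variable in ("distance", "gender"):
    print(variable, gini_reduction(trips, variable, "mode"))
```

In this toy data, distance separates the two modes completely while gender does not separate them at all, so distance receives the larger reduction. The per-level impurities computed inside `gini_reduction` are the kind of quantity a level-importance measure could be built from, again as an assumption.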
Original language: English
Number of pages: 21
Publication status: Published - 2021
Event: 100th Transportation Research Board Annual Meeting - Washington, United States
Duration: 21 Jan 2021 → 29 Jan 2021

Conference

Conference: 100th Transportation Research Board Annual Meeting
Country/Territory: United States
City: Washington
Period: 21/01/21 → 29/01/21

Keywords

  • Gini impurity, Class association rules, Weight of rules, Transportation mode choice
