TY - JOUR
T1 - Attitudes on Autonomous Vehicle Adoption using Interpretable Gradient Boosting Machine
AU - Lee, Dongwoo
AU - Mulrow, John
AU - Haboucha, Chana Joanne
AU - Derrible, Sybil
AU - Shiftan, Yoram
N1 - Publisher Copyright:
© National Academy of Sciences: Transportation Research Board 2019.
PY - 2019/11
Y1 - 2019/11
N2 - This article applies machine learning (ML) to develop a choice model on three choice alternatives related to autonomous vehicles (AV): regular vehicle (REG), private AV (PAV), and shared AV (SAV). The learned model is used to examine users’ preferences and behaviors on AV uptake by car commuters. Specifically, this study applies gradient boosting machine (GBM) to stated preference (SP) survey data (i.e., panel data). GBM notably possesses more interpretable features than other ML methods as well as high predictive performance for panel data. The prediction performance of GBM is evaluated by conducting a 5-fold cross-validation and shows around 80% accuracy. To interpret users’ behaviors, variable importance (VI) and partial dependence (PD) were measured. The results of VI indicate that trip cost, purchase cost, and subscription cost are the most influential variables in selecting an alternative. Moreover, the attitudinal variables Pro-AV Sentiment and Environmental Concern are also shown to be significant. The article also examines the sensitivity of choice by using the PD of the log-odds on selected important factors. The results inform both the modeling of transportation technology uptake and the configuration and interpretation of GBM that can be applied for policy analysis.
AB - This article applies machine learning (ML) to develop a choice model on three choice alternatives related to autonomous vehicles (AV): regular vehicle (REG), private AV (PAV), and shared AV (SAV). The learned model is used to examine users’ preferences and behaviors on AV uptake by car commuters. Specifically, this study applies gradient boosting machine (GBM) to stated preference (SP) survey data (i.e., panel data). GBM notably possesses more interpretable features than other ML methods as well as high predictive performance for panel data. The prediction performance of GBM is evaluated by conducting a 5-fold cross-validation and shows around 80% accuracy. To interpret users’ behaviors, variable importance (VI) and partial dependence (PD) were measured. The results of VI indicate that trip cost, purchase cost, and subscription cost are the most influential variables in selecting an alternative. Moreover, the attitudinal variables Pro-AV Sentiment and Environmental Concern are also shown to be significant. The article also examines the sensitivity of choice by using the PD of the log-odds on selected important factors. The results inform both the modeling of transportation technology uptake and the configuration and interpretation of GBM that can be applied for policy analysis.
UR - http://www.scopus.com/inward/record.url?scp=85068252954&partnerID=8YFLogxK
U2 - 10.1177/0361198119857953
DO - 10.1177/0361198119857953
M3 - Article
AN - SCOPUS:85068252954
SN - 0361-1981
VL - 2673
SP - 865
EP - 878
JO - Transportation Research Record
JF - Transportation Research Record
IS - 11
ER -