TY - GEN
T1 - On Inferring a Meaningful Similarity Metric for Customer Behaviour
AU - Hassani, Marwan
A2 - van den Berg, Sophie
A2 - Dong, Yuxiao
A2 - Kourtellis, Nicolas
A2 - Hammer, Barbara
A2 - Lozano, Jose A.
N1 - Publisher Copyright:
© 2021, Springer Nature Switzerland AG.
PY - 2021/9/10
Y1 - 2021/9/10
N2 - In omnichannel customer service environments, where no real process is enforced, a wide variety of customer journey variants exists. This variety makes it complex to find process improvement opportunities. Modeling the journeys as traces is an essential step before discovering an explainable model of various behaviours. Trace clustering helps improvement efforts by separating the journeys into homogeneous subsets in terms of behaviour and purpose. For this, a one-size-fits-all distance metric has been used so far in the literature. This paper shows that a domain-informed similarity metric will improve customer journey clustering compared to a generic one. We propose SIMPRIM framework, which uses clustering quality metrics to develop a similarity metric that maximizes the separability of the journeys in a low dimensional space while agreeing with existing process knowledge. Experimental evaluation on real life use cases of a large telecom company and a benchmark dataset show that, compared to a generic metric, respectively a 46% and 39% improvement can be obtained in terms of the internal clustering quality while keeping the external clustering quality equal. We also show that the inferred metric can be useful for prediction applications.
AB - In omnichannel customer service environments, where no real process is enforced, a wide variety of customer journey variants exists. This variety makes it complex to find process improvement opportunities. Modeling the journeys as traces is an essential step before discovering an explainable model of various behaviours. Trace clustering helps improvement efforts by separating the journeys into homogeneous subsets in terms of behaviour and purpose. For this, a one-size-fits-all distance metric has been used so far in the literature. This paper shows that a domain-informed similarity metric will improve customer journey clustering compared to a generic one. We propose SIMPRIM framework, which uses clustering quality metrics to develop a similarity metric that maximizes the separability of the journeys in a low dimensional space while agreeing with existing process knowledge. Experimental evaluation on real life use cases of a large telecom company and a benchmark dataset show that, compared to a generic metric, respectively a 46% and 39% improvement can be obtained in terms of the internal clustering quality while keeping the external clustering quality equal. We also show that the inferred metric can be useful for prediction applications.
KW - Customer journey clustering
KW - Similarity metric
UR - https://www.scopus.com/pages/publications/85115731939
U2 - 10.1007/978-3-030-86517-7_15
DO - 10.1007/978-3-030-86517-7_15
M3 - Conference contribution
AN - SCOPUS:85115731939
SN - 978-3-030-86516-0
VL - Part V
T3 - Lecture Notes in Computer Science (LNCS)
SP - 234
EP - 250
BT - Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track
PB - Springer
CY - Cham
T2 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML PKDD 2021
Y2 - 13 September 2021 through 17 September 2021
ER -