Looking deeper into deep learning model: attribution-based explanations of TextCNN

Wenting Xiong, Iftitahu Ni'mah, Juan M. G. Huesca, Werner van Ipenburg, Jan Veldsink, Mykola Pechenizkiy

Research output: Contribution to conferencePaperAcademic

33 Downloads (Pure)

Abstract

Layer-wise Relevance Propagation (LRP) and saliency maps have been recently used to explain the predictions of Deep Learning models, specifically in the domain of text classification. Given different attribution-based explanations to highlight relevant words for a predicted class label, experiments based on word deleting perturbation is a common evaluation method. This word removal approach, however, disregards any linguistic dependencies that may exist between words or phrases in a sentence, which could semantically guide a classifier to a particular prediction. In this paper, we present a feature-based evaluation framework for comparing the two attribution methods on customer reviews (public data sets) and Customer Due Diligence (CDD) extracted reports (corporate data set). Instead of removing words based on the relevance score, we investigate perturbations based on embedded features removal from intermediate layers of Convolutional Neural Networks. Our experimental study is carried out on embedded-word, embedded-document, and embedded-ngrams explanations. Using the proposed framework, we provide a visualization tool to assist analysts in reasoning toward the model's final prediction.
Original languageEnglish
Number of pages9
Publication statusPublished - 8 Nov 2018
EventNIPS 2018 Workshop on Challenges and Opportunities for AI in Financial Services: the Impact of Fairness, Explainability, Accuracy, and Privacy - Montreal, Canada
Duration: 7 Dec 20187 Dec 2018

Conference

ConferenceNIPS 2018 Workshop on Challenges and Opportunities for AI in Financial Services
Country/TerritoryCanada
CityMontreal
Period7/12/187/12/18
OtherFEAP-AI4Fin 2018

Keywords

  • cs.IR
  • cs.LG
  • stat.ML

Fingerprint

Dive into the research topics of 'Looking deeper into deep learning model: attribution-based explanations of TextCNN'. Together they form a unique fingerprint.

Cite this