A framework for product description classification in e-commerce

D. Vandic, F. Frasincar, U. Kaymak

Research output: Contribution to journalArticleAcademicpeer-review

9 Citations (Scopus)
4 Downloads (Pure)

Abstract

We propose the Hierarchical Product Classification (HPC) framework for the purpose of classifying products using a hierarchical product taxonomy. The framework uses a classification system with multiple classification nodes, each residing on a different level of the taxonomy. The innovative part of the framework stems from the definition of classification recipes that can be used to construct high-quality classifier nodes, using the product descriptions in the most optimal way. These classifier recipes are specifically tailored for the e-commerce domain. The use of these classifier recipes enables flexible classifiers that adjust to the taxonomy depth-specific characteristics of product taxonomies. Furthermore, in order to gain insight into which components are required to perform high quality product classification, we evaluate several feature selection methods and classification techniques in the context of our framework. Based on 3000 product descriptions obtained from Amazon.com, HPC achieves an overall accuracy of 76.80% for product classification. Using 110 categories from CircuitCity.com and Amazon.com, we obtain a precision of 93.61% for mapping the categories to the taxonomy of shopping.com.
Original languageEnglish
Pages (from-to)1-27
Number of pages27
JournalJournal of Web Engineering
Volume17
Issue number1-2
DOIs
Publication statusPublished - Mar 2018

Keywords

  • Product descriptions
  • hierarchical clustering
  • feature selection
  • e-commerce
  • Feature selection
  • E-commerce
  • Hierarchical clustering

Fingerprint

Dive into the research topics of 'A framework for product description classification in e-commerce'. Together they form a unique fingerprint.

Cite this