Product categorisation: recent work

Product categorisation: recent work

Earlier work: Automatic categorization in product catalog (state-of-the-art)

Product classification automatically predicts a taxonomy path in a predefined hierarchy given a textual description or title. For efficiency, a suitable representation for the document feature vector and fast algorithms for prediction are required. A distributional semantics representation and two-level ensemble approach [Gupta 16] utilize (wrt taxonomy tree) path-wise, node-wise and depth-wise classifiers to reduce error in the final task.?

No alt text provided for this image

Data size, category skewness and noisy metadata are challenges in product categorization. DeepCN [Ha 16] is an end-to-end model using multiple RNNs dedicated to metadata attributes for generating features from text metadata and fully connected layers for classifying item categories from the generated features. Categorization errors are propagated back through fully connected layers to RNNs for weight update in the learning process. This allows diverse attributes to be integrated into a common representation thus overcoming sparsity and scalability.

No alt text provided for this image

Fine-grained leaf categories in taxonomies are defined by the most descriptive/specific words of products. Finding them remains challenging due to blurred concepts (i.e. multiple equivalent or synonymous categories), unstable category vocabulary (i.e. emerging new products and evolving language habits), and lack of labeled data. Proposed model (NPC) [Chen 19] to address these is equipped with a character-level convolutional embedding layer to learn compositional word representations, and a spiral residual layer to extract word context annotations capturing complex long range dependencies and structural information. A product is categorized by jointly recognizing categories from description and predicting categories from predefined category vocabularies. Furthermore, to avoid extensive human labor, NPC is able to adapt to weak labels, generated by mining the search logs, where the customer’s behaviors naturally connect products with categories.

No alt text provided for this image

Product classification in a multi-level (or hierarchical) setting is not suitable for a dynamic taxonomy. An open-world model (OW Learning) instead can automatically classify a product based on a set of categories in an e-commerce platform with various sellers. An emerging product not belonging to any existing category herein should be classified as unseen. Further, this unseen set may keep growing. When the number of products belonging to a new category is large enough, it should be added to the existing set. An open-world model should easily accommodate this addition with a low cost of training since it is impractical to retrain the model from scratch every time a new class is added. L2AC [Hu 19] leverages the huge amount of product descriptions from public datasets and forms thus the OWL task : each path to a leaf node tree-structured category system is considered as a class; products belonging to multiple classes are removed to ensure the classes have no overlap.

No alt text provided for this image

Existing algorithms take title/description as input to classify a product into a leaf category. A paradigm based on machine translation (MT) instead [Tan 20] converts a product’s natural language description into a sequence of tokens representing a root-to-leaf path in a product taxonomy based on the vocabulary of categories. This generation allows new paths to be created based on product titles. Although such system-created paths utilize existing nodes in a product taxonomy tree, paths (which are permutations of nodes) need not pre-exist in the tree. When the paths are added to the tree to form new edges between previously unconnected nodes in a tree, they transform the tree into a DAG, which offers a richer representation of the product. Such category paths better accommodate a user’s many conceptualizations of a product thus promoting user-friendly navigation and are more adaptable to new products.

No alt text provided for this image

Building category trees that reflect users’ dynamic interests is carried out by taxonomists. This manual construction however leads to outdated trees as it is hard to keep track of market trends. While taxonomists can identify candidate categories, i.e. sets of items with a shared label, most such categories cannot simultaneously exist in the tree, as platforms set a bound on the number of categories an item may belong to. To address this setting, tree construction is formalized [Avron 22] where categories are maximally similar to desirable candidate categories while satisfying combinatorial requirements and provide a model that captures practical considerations. Two heuristic algorithms, are demonstrated to be effective over datasets from real-life e-commerce platforms, far exceeding the worst-case bounds. A natural special case is identified for which a solution with tight approximation guarantees is devised. The approach facilitates continual updates and maintaining consistency with an existing tree. Finally, input candidate categories derived from result sets to recent search queries are included to reflect dynamic user interests and trends.

No alt text provided for this image

[Gupta 16] Product Classification in E-Commerce using Distributional Semantics

[Ha 16] Large-Scale Item Categorization in e-Commerce Using Multiple Recurrent Neural Networks

[Chen 19] Fine-Grained Product Categorization in E-commerce

[Hu 19] Open-world Learning and Application to Product Classification

[Tan 20] E-Commerce Product Categorization via Machine Translation

[Avron 22] Automated Category Tree Construction in E-Commerce

We’re launching our fused multimodal foundational model for e-commerce with categorization benchmarks. We’d love to hear more about your categorization work!

要查看或添加评论,请登录

Muthusamy Chelliah的更多文章

社区洞察

其他会员也浏览了