Aspect/sentiment-aware review summarization (Recent)
Existing unsupervised opinion summarization techniques follow a two-stage framework: first creating a synthetic review-summary paired dataset and then feeding it into a generative summarization model for supervised training. However, these methods mainly focus on semantic similarity when creating the synthetic dataset, ignoring the consistency of aspects and sentiments within synthetic pairs. Such inconsistency also introduces a gap between training and inference of the summarization model. To alleviate this problem, ConsistSum [Ke 22] first extracts preliminary review/summary pairs from the raw corpus by evaluating the distance between aspect and sentiment distributions. Each preliminary summary is then refined with constrained Metropolis-Hastings sampling to produce a highly consistent synthetic dataset. In the summarization phase, the generative model T5 is fine-tuned with an additional loss for predicting the aspect/opinion distribution.
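The pair-extraction step can be sketched roughly as below. This is a toy Python approximation: the hand-written `ASPECT_WORDS` lexicon, the keyword-count distributions, and the symmetrized-KL distance with a `max_distance` threshold are all illustrative assumptions; ConsistSum itself derives aspect/sentiment distributions with trained models and uses its own distance criterion.

```python
import math

# Hypothetical aspect lexicon; ConsistSum estimates aspect/sentiment
# distributions with trained components, which keyword counts only approximate.
ASPECT_WORDS = {
    "food": ["food", "pizza", "taste"],
    "service": ["service", "staff", "waiter"],
    "price": ["price", "cheap", "expensive"],
}

def aspect_distribution(text, smoothing=1e-3):
    """Normalized aspect-keyword counts (smoothed so the KL term is finite)."""
    tokens = text.lower().split()
    counts = {a: sum(tokens.count(w) for w in words) + smoothing
              for a, words in ASPECT_WORDS.items()}
    total = sum(counts.values())
    return {a: c / total for a, c in counts.items()}

def kl_divergence(p, q):
    return sum(p[a] * math.log(p[a] / q[a]) for a in p)

def select_pairs(reviews, max_distance=0.5):
    """Treat each review in turn as a candidate summary; keep a
    (reviews, summary) pair when the aspect distribution of the candidate
    is close (symmetrized KL) to that of the remaining reviews."""
    pairs = []
    for i, cand in enumerate(reviews):
        others = [r for j, r in enumerate(reviews) if j != i]
        pooled = aspect_distribution(" ".join(others))
        d_cand = aspect_distribution(cand)
        dist = 0.5 * (kl_divergence(pooled, d_cand) +
                      kl_divergence(d_cand, pooled))
        if dist < max_distance:
            pairs.append((others, cand))
    return pairs
```

The same scheme extends to sentiment by adding a second distribution (e.g., over polarity labels) and summing the two distances.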
Previous summarization approaches select multiple reviews and their summary based on textual similarities between reviews, resulting in an information mismatch between the review input and the summary. [Liu 22] instead converts each review into a mix of structured and unstructured data: opinion/aspect pairs (OAs) and implicit sentences (ISs). A new method synthesizes training pairs with such mix-structured data as input and a textual summary as output, and designs a summarization model with an OA encoder and an IS encoder.
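The review-to-OA/IS conversion might be approximated with a crude heuristic like the one below; the `ASPECTS`/`OPINIONS` lexicons and the opinion-precedes-aspect adjacency rule are illustrative assumptions, not [Liu 22]'s actual extractor.

```python
# Hypothetical lexicons; [Liu 22] extracts OAs with a dedicated extraction
# model, which this word-adjacency heuristic only approximates.
ASPECTS = {"food", "service", "price", "room"}
OPINIONS = {"great", "terrible", "friendly", "cheap", "slow"}

def split_review(review):
    """Split a review into opinion/aspect pairs (OAs) and implicit
    sentences (ISs): sentences yielding no pair stay unstructured."""
    oas, iss = [], []
    for sent in review.lower().split("."):
        tokens = sent.split()
        pairs = [(tokens[i], tokens[i + 1]) for i in range(len(tokens) - 1)
                 if tokens[i] in OPINIONS and tokens[i + 1] in ASPECTS]
        if pairs:
            oas.extend(pairs)
        elif tokens:
            iss.append(sent.strip())
    return oas, iss
```

The two outputs then feed the OA encoder and the IS encoder respectively.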
Semantic Autoencoder (SemAE) [Chowdhury 22] performs extractive summarization in an unsupervised manner. SemAE uses dictionary learning to implicitly capture semantic information from reviews, learning a latent representation of each sentence over semantic units, where each semantic unit is supposed to capture an abstract concept. These representations are leveraged to identify representative opinions among hundreds of reviews. SemAE can also perform controllable summarization to generate aspect-specific summaries.
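A minimal sketch of SemAE-style selection, assuming precomputed sentence embeddings `S` and a learned dictionary of semantic-unit vectors `D` are given; the softmax attention and mean-similarity ranking below are simplifications of the paper's dictionary-learning and selection machinery.

```python
import numpy as np

def latent_repr(S, D):
    """Represent each sentence as a distribution over semantic units
    via softmax attention against the dictionary D (rows sum to 1)."""
    scores = S @ D.T / np.sqrt(S.shape[1])
    e = np.exp(scores - scores.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def extract_summary(S, D, k=2):
    """Pick the k sentences whose unit distributions are most similar
    (cosine) to the corpus mean distribution, i.e. the most
    representative opinions (simplified selection)."""
    A = latent_repr(S, D)
    mean = A.mean(axis=0)
    sims = (A @ mean) / (np.linalg.norm(A, axis=1) * np.linalg.norm(mean))
    return np.argsort(-sims)[:k]
```

Aspect-specific summaries would replace the corpus mean with a distribution concentrated on the semantic units associated with the target aspect.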
Two simple yet effective unsupervised approaches [Shen 23] generate both aspect-specific and general opinion summaries by training on synthetic datasets constructed from aspect-related review content. Seed Words Based Leave-One-Out (SW-LOO) identifies aspect-related portions of reviews simply by exact-matching aspect seed words. Natural Language Inference Based Leave-One-Out (NLI-LOO) identifies aspect-related sentences using an NLI model, in a more general setting that requires no seed words.
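The SW-LOO construction can be sketched as follows; the `SEED_WORDS` lists are hypothetical placeholders for the paper's aspect seed words, and sentence splitting on periods is a simplification.

```python
# Hypothetical seed words; [Shen 23] supplies its own per-aspect seed lists.
SEED_WORDS = {
    "rooms": ["room", "bed", "bathroom"],
    "service": ["service", "staff", "reception"],
}

def aspect_sentences(review, aspect):
    """Keep sentences that exact-match any seed word of the aspect."""
    seeds = SEED_WORDS[aspect]
    return [s.strip() for s in review.lower().split(".")
            if any(w in s.split() for w in seeds)]

def sw_loo_pairs(reviews, aspect):
    """Leave one review out: its aspect-related sentences serve as the
    pseudo summary, and the aspect-related content of the remaining
    reviews serves as the synthetic input."""
    pairs = []
    for i, held_out in enumerate(reviews):
        summary = aspect_sentences(held_out, aspect)
        source = [s for j, r in enumerate(reviews) if j != i
                  for s in aspect_sentences(r, aspect)]
        if summary and source:
            pairs.append((source, summary))
    return pairs
```

NLI-LOO follows the same leave-one-out scheme but replaces the seed-word filter with an NLI model scoring whether a sentence entails an aspect hypothesis.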