Aspect Extraction in Domain Lexicon Generation: A New Frequency-Based Approach

Document Type

Article

Publication Date

1-1-2024

Abstract

Domain sentimental lexicon building become an attractive field in recent years. This is due to the increased number of users' generated data through the internet besides the different sentiments of opinion words in different contexts. Domain lexicons mainly consist of opinion pairs and their associated sentiment. Any opinion pair is formed by a domain word and one of its associated opinion words. Therefore, to generate a domain lexicon from a domain corpus, domain word extraction is needed with their associated opinion words. One of the traditional approaches is frequency-based approaches. However, the ambiguity problem is a big concern of these approaches. This paper introduced a frequency-based equation that considers the context of the words for domain word extraction. The equation was tested on five Amazon reviews datasets and it proved its efficiency over other used frequency-based equations in terms of recall and precision. Therefore, more related lexicons to the domains were generated.

Keywords

Feature extraction, Data mining, Frequency-domain analysis, Social networking (online), Sentiment analysis, Semantics, Accuracy, Statistical analysis, Text processing, Text mining, Aspect, domain lexicon, frequency-based, sentiment analysis, statistical, word extraction, context

Divisions

infosystem

Funders

Universiti Malaya (UM) International Collaboration (ST005-2023)

Publication Title

IEEE Access

Volume

12

Publisher

Institute of Electrical and Electronics Engineers

Publisher Location

445 HOES LANE, PISCATAWAY, NJ 08855-4141 USA

This document is currently not available here.

Share

COinS