Aspect Extraction in Domain Lexicon Generation: A New Frequency-Based Approach
Document Type
Article
Publication Date
1-1-2024
Abstract
Domain sentimental lexicon building become an attractive field in recent years. This is due to the increased number of users' generated data through the internet besides the different sentiments of opinion words in different contexts. Domain lexicons mainly consist of opinion pairs and their associated sentiment. Any opinion pair is formed by a domain word and one of its associated opinion words. Therefore, to generate a domain lexicon from a domain corpus, domain word extraction is needed with their associated opinion words. One of the traditional approaches is frequency-based approaches. However, the ambiguity problem is a big concern of these approaches. This paper introduced a frequency-based equation that considers the context of the words for domain word extraction. The equation was tested on five Amazon reviews datasets and it proved its efficiency over other used frequency-based equations in terms of recall and precision. Therefore, more related lexicons to the domains were generated.
Keywords
Feature extraction, Data mining, Frequency-domain analysis, Social networking (online), Sentiment analysis, Semantics, Accuracy, Statistical analysis, Text processing, Text mining, Aspect, domain lexicon, frequency-based, sentiment analysis, statistical, word extraction, context
Divisions
infosystem
Funders
Universiti Malaya (UM) International Collaboration (ST005-2023)
Publication Title
IEEE Access
Volume
12
Publisher
Institute of Electrical and Electronics Engineers
Publisher Location
445 HOES LANE, PISCATAWAY, NJ 08855-4141 USA