Feature selection for location metonymy using augmented bag-of-words

Document Type

Article

Publication Date

1-1-2022

Abstract

Location metonymy resolution deals with location names that are used non-literally, which causes problems in several natural language processing tasks such as named entity recognition and geographical parsing. Many studies have attempted to classify accurately whether a location is used literally or metonymically. However, most of the approaches that performed well employed a considerable amount of resources along with complex machine learning models, while those that reduced the resources suffered a decline in performance due to data sparseness. This study proposes a novel feature selection approach that uses bag-of-words and augments it with GloVe embeddings to obtain features that can be recognized based on the context of the sentence. We then implement a minimalist deep learning model, making the entire classification task as light as possible. The study found that relying solely on the given datasets to identify features, without depending on other external resources, can achieve remarkable results despite the small size of the datasets. Evaluated against state-of-the-art methods, our method, which eliminates noise based on context while using only low-cost resources, outperformed all previous methods, with an accuracy of 99.2% on the WIMCOR dataset.
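The abstract does not spell out how the bag-of-words is augmented, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that a bag built from training sentences is extended with words whose embedding lies close to a word already in the bag. The toy three-dimensional vectors stand in for pretrained GloVe embeddings, and the function names, the cosine threshold, and the example sentences are all hypothetical.

```python
import math
from collections import Counter

# Toy 3-d vectors standing in for pretrained GloVe embeddings (hypothetical values).
TOY_EMBEDDINGS = {
    "signed": [0.9, 0.1, 0.0],
    "agreed": [0.8, 0.2, 0.1],
    "announced": [0.85, 0.15, 0.05],
    "visited": [0.1, 0.9, 0.0],
    "travelled": [0.15, 0.85, 0.1],
    "in": [0.0, 0.8, 0.3],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def bag_of_words(sentences):
    """Count lowercased whitespace tokens over a list of sentences."""
    counts = Counter()
    for sentence in sentences:
        counts.update(sentence.lower().split())
    return counts


def augment_bag(bag, embeddings, threshold=0.95):
    """Return the bag's words plus any embedded word close to an in-bag word.

    The threshold is an assumed hyperparameter, not a value from the paper.
    """
    augmented = set(bag)
    for candidate, cand_vec in embeddings.items():
        if candidate in augmented:
            continue
        for word in bag:
            if word in embeddings and cosine(embeddings[word], cand_vec) >= threshold:
                augmented.add(candidate)
                break
    return augmented


def featurize(sentence, vocabulary):
    """Map a sentence to a count vector over an ordered vocabulary."""
    counts = Counter(sentence.lower().split())
    return [counts[word] for word in vocabulary]


# Hypothetical sentences in which the location is used metonymically.
metonymic_sentences = ["France signed the treaty", "Germany agreed to the deal"]
bag = bag_of_words(metonymic_sentences)
vocabulary = sorted(augment_bag(bag, TOY_EMBEDDINGS))
features = featurize("Paris announced new rules", vocabulary)
```

In this sketch, a word such as "announced" never occurs in the training sentences but still becomes a feature because its vector is close to "signed", which is one way an embedding-based augmentation could ease the data sparseness the abstract mentions. The resulting count vectors would then be passed to a small classifier.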

Keywords

Task analysis, Organizations, Feature extraction, Standards organizations, Syntactics, Natural language processing, Electronic mail, Text classification, Metonymy resolution, Deep learning, Feature selection, Bag-of-words, Natural language understanding

Divisions

ai

Funders

Universiti Malaya Research University (Grant No: GPF091A-2020 & TR001D-2018A)

Publication Title

IEEE Access

Volume

10

Publisher

Institute of Electrical and Electronics Engineers

Publisher Location

445 HOES LANE, PISCATAWAY, NJ 08855-4141 USA

