lnu.sePublications
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Semi-Supervised Approach for Aspect Category Detection and Aspect Term Extraction from Opinionated Text
Sukkur IBA University, Pakistan.
Sukkur IBA University, Pakistan.
Norwegian University of Science and Technology (NTNU), Norway.
Linnaeus University, Faculty of Technology, Department of Informatics. Linnaeus University, Faculty of Technology, Department of computer science and media technology (CM).ORCID iD: 0000-0002-0199-2377
Show others and affiliations
2023 (English)In: Computers, Materials and Continua, ISSN 1546-2218, E-ISSN 1546-2226, Vol. 77, no 1, p. 115-137Article in journal (Refereed) Published
Abstract [en]

The Internet has become one of the significant sources for sharing information and expressing users' opinions about products and their interests with the associated aspects. It is essential to learn about product reviews; however, to react to such reviews, extracting aspects of the entity to which these reviews belong is equally important. Aspect-based Sentiment Analysis (ABSA) refers to aspects extracted from an opinionated text. The literature proposes different approaches for ABSA; however, most research is focused on supervised approaches, which require labeled datasets with manual sentiment polarity labeling and aspect tagging. This study proposes a semi-supervised approach with minimal human supervision to extract aspect terms by detecting the aspect categories. Hence, the study deals with two main sub-tasks in ABSA, named Aspect Category Detection (ACD) and Aspect Term Extraction (ATE). In the first sub-task, aspects categories are extracted using topic modeling and filtered by an oracle further, and it is fed to zero-shot learning as the prompts and the augmented text. The predicted categories are the input to find similar phrases curated with extracting meaningful phrases (e.g., Nouns, Proper Nouns, NER (Named Entity Recognition) entities) to detect the aspect terms. The study sets a baseline accuracy for two main sub-tasks in ABSA on the Multi-Aspect Multi-Sentiment (MAMS) dataset along with SemEval-2014 Task 4 sub-task 1 to show that the proposed approach helps detect aspect terms via aspect categories.

Place, publisher, year, edition, pages
Tech Science Press , 2023. Vol. 77, no 1, p. 115-137
National Category
Language Technology (Computational Linguistics)
Research subject
Computer and Information Sciences Computer Science, Computer Science
Identifiers
URN: urn:nbn:se:lnu:diva-125433DOI: 10.32604/cmc.2023.040638OAI: oai:DiVA.org:lnu-125433DiVA, id: diva2:1809069
Available from: 2023-11-01 Created: 2023-11-01 Last updated: 2023-11-07Bibliographically approved

Open Access in DiVA

fulltext(1020 kB)20 downloads
File information
File name FULLTEXT01.pdfFile size 1020 kBChecksum SHA-512
0044e63d45d32c00c4f509e7a1463fb80a21d6b82c06c25ad52548c1c9201c65b3c95c98b5a955bfa5939373f89c65e8972a677d8275e1cb8763d9bbd973617f
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

Kastrati, Zenun

Search in DiVA

By author/editor
Kastrati, Zenun
By organisation
Department of InformaticsDepartment of computer science and media technology (CM)
In the same journal
Computers, Materials and Continua
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
Total: 20 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 183 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf