Stance Classification of Social Media Texts for Under-Resourced Scenarios in Social Sciences
Uppsala University, Sweden. ORCID iD: 0000-0001-9938-2675
Linnaeus University, Faculty of Technology, Department of computer science and media technology (CM). Linköping University, Sweden. (ISOVIS; DISA-DH) ORCID iD: 0000-0002-1907-7820
2022 (English). In: Data, E-ISSN 2306-5729, Vol. 7, no 11, article id 159. Article in journal (Refereed), Published
Abstract [en]

In this work, we explore the performance of supervised stance classification methods for social media texts in under-resourced languages when only limited amounts of labeled data are available. In particular, we focus on the possibilities and limitations of applying classic machine learning versus deep learning in the social sciences. To this end, we use a training dataset of 5.7K messages posted on Flashback Forum, a Swedish discussion platform, supplemented with the previously published ABSAbank-Imm annotated dataset, and evaluate various model parameters and configurations to achieve the best training results given the character of the data. Our experiments indicate that classic machine learning models achieve results that are on par with or even outperform those of neural networks and, thus, could be given priority when considering machine learning approaches for similar knowledge domains, tasks, and data. At the same time, modern pre-trained language models provide useful and convenient pipelines for obtaining vectorized data representations that can be combined with classic machine learning algorithms. We discuss the implications of their use in such scenarios and outline directions for further research.

Place, publisher, year, edition, pages
MDPI, 2022. Vol. 7, no 11, article id 159
Keywords [en]
text mining, machine learning, deep learning, neural networks, stance classification, computational social science, social media, supervised learning, sentiment classification, Swedish language data
National Category
Language Technology (Computational Linguistics); Social Sciences Interdisciplinary
Research subject
Social Sciences; Computer and Information Sciences Computer Science; Computer and Information Sciences Computer Science, Computer Science
Identifiers
URN: urn:nbn:se:lnu:diva-117570
DOI: 10.3390/data7110159
ISI: 000895323600001
Scopus ID: 2-s2.0-85149444003
OAI: oai:DiVA.org:lnu-117570
DiVA, id: diva2:1711488
Funder
European Commission, INEA/CEF/ICT/A2020/2394203
Note

This paper is an extended version of our paper published in the Proceedings of the Swedish Workshop on Data Science (SweDS ’21), Växjö, Sweden, December 2–3, 2021.

Available from: 2022-11-17. Created: 2022-11-17. Last updated: 2023-05-11. Bibliographically approved.

Open Access in DiVA

fulltext (605 kB), 164 downloads
File information
File name: FULLTEXT01.pdf; File size: 605 kB; Checksum: SHA-512
c2bbcc23bf22aa9e6515f786530e769d1082c76c776f55cfc4bb3aba178c7a1d3114614207859d8081e32eb2b5f026f7ecc9e68ab06fcd3ccb2ef5beff80fee2
Type: fulltext; Mimetype: application/pdf

Authority records

Yantseva, Victoria; Kucher, Kostiantyn
