lnu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Automatic subject indexing of Swedish LGBTQ+ fiction
University of Gothenburg, Sweden.
University of Borås, Sweden.
Linnaeus University, Faculty of Arts and Humanities, Department of Languages. Linnaeus University, Faculty of Arts and Humanities, Department of Cultural Sciences.ORCID iD: 0000-0002-0930-644X
Linnaeus University, Faculty of Arts and Humanities, Department of Cultural Sciences. (Library and Information Science)ORCID iD: 0000-0003-4169-4777
Show others and affiliations
2024 (English)In: Presented at Huminfra Conference (HiC), Gothenburg, 10 jan 2024 - 11 jan 2024, 2024Conference paper, Oral presentation with published abstract (Refereed)
Sustainable development
SDG 5: Achieve gender equality and empower all women and girls
Abstract [en]

Fiction is a challenging genre for automatic theme identification. Unlike other types of documents, such as physics academic papers, fiction does not always name the concepts it addresses, but rather implies them through subtle clues. Fiction also uses metaphors intentionally to convey deeper meanings. To make Swedish LGBTQ+ fiction more accessible, the Queerlit database (https://queerlit.dh.gu.se/) provides subject indexing by information professionals. They use the QLIT thesaurus (based on Homosaurus) for LGBTQ+ themes and Swedish Subject Headings (SAO – Svenska Ämnesord) for non-LGBTQ+ themes. The indexing is comprehensive and retrospective, assigning terms to previously published Swedish fiction.</p><p>This work aims to determine to what degree and under which conditions is it possible to automatically assign subject index terms from QLIT, in order to estimate the usefulness of automatic tools to support subject indexing conducted by information professionals. This process may require a large number of training documents which are not available (the entire Queerlit database has about 2000 works indexed and QLIT has about 800 terms, while SAO is much bigger). Therefore, another approach will be explored – whether automatically extracted terms from the texts provide the potential to complement existing, professionally assigned terms from QLIT and SAO. We experiment with zero-shot classification transformers and topic modeling.</p><p>The proposed paper will present the intermediate results of different methods applied to available texts from the QLIT database. It is important to note that the project is currently in an exploratory phase and that the presentation is intended to showcase how different approaches have both failed and succeeded. We also intend to highlight areas of possible applicability specifically from the perspective afforded by the QLIT thesaurus, i.e., the appropriateness of the methods for Swedish LGBTQ+ fiction. We will also discuss the challenges and limitations of automatic theme identification for fiction, especially for LGBTQ+ themes that are often implicit or nuanced.

Place, publisher, year, edition, pages
2024.
National Category
Information Studies
Research subject
Humanities, Library and Information Science
Identifiers
URN: urn:nbn:se:lnu:diva-127766DOI: 10.3384/ecp205ISBN: 978-91-8075-512-2 (print)OAI: oai:DiVA.org:lnu-127766DiVA, id: diva2:1837603
Conference
HiC 2024, Huminfra Conference, Gothenburg, Sweden, 10-11 January, 2024
Available from: 2024-02-14 Created: 2024-02-14 Last updated: 2025-01-21Bibliographically approved

Open Access in DiVA

Presentation(804 kB)35 downloads
File information
File name FULLTEXT01.pdfFile size 804 kBChecksum SHA-512
db45a09dfe6a6051104b6584dd22e04601742fefa642195b68d8494a081408a4b3053b9299739539984df7c47067f996035934c8d8d194a17fe2adf87df38ea1
Type fulltextMimetype application/pdf

Other links

Publisher's full textHiC Book of Abstracts

Authority records

Ihrmark, DanielGolub, Koraljka

Search in DiVA

By author/editor
Ihrmark, DanielGolub, Koraljka
By organisation
Department of LanguagesDepartment of Cultural Sciences
Information Studies

Search outside of DiVA

GoogleGoogle Scholar
Total: 35 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 265 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf