lnu.sePublications
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Automatic subject indexing of oral history interviews with Whisper and Claude
Linnaeus University, Faculty of Arts and Humanities, Department of Cultural Sciences.ORCID iD: 0009-0003-4070-3798
2025 (English)Conference paper, Poster (with or without abstract) (Refereed)
Abstract [en]

In the archival media trinity of text, image and sound, the latter presents particular challenges for users’ searching and browsing. While a user can manually or digitally browse through texts and images to locate items of interest, sound needs to be played more or less in real time to do the same. Audio files can naturally be described and transcribed – digital or digitized audio files of speech automatically so – thus facilitating full-text search. However, the familiar Achilles’ heel of full-text search, namely the ambiguity of natural language, remains.

Enter the hallmark trade of librarianship: subject indexing. When subject index terms have been assigned to individual sections of an audio file, a user searching for these or similar terms can locate precisely where in the retrieved audio files the subject is discussed. With state-of-the-art AI systems, even this can be done automatically, thereby decimating the amount of time needed to index audio files, from real time to as fast as the system can process them. This heralds a brave new future for the accessibility and searchability of oral history archives.

This poster presents a pilot study on automatically transcribing interviews in Swedish from oral history archives using OpenAI’s Whisper, describing the content and assigning subject index terms to sections using Claude from Anthropic, and visualizing the results. The accuracy of the results depends on many factors, including sound quality, accents of the speakers, the amount of language mixing etc. The results are very promising, however, suggesting automatic subject indexing of interviews to be a worthwhile research direction going forward.

Place, publisher, year, edition, pages
2025.
Keywords [en]
artificial intelligence, archives, oral history, indexing
National Category
Information Studies
Identifiers
URN: urn:nbn:se:lnu:diva-138419OAI: oai:DiVA.org:lnu-138419DiVA, id: diva2:1956997
Conference
Digital Dreams and Practices, Digital Humanities in Nordic and Baltic Countries 9th Conference, Tartu, Estonia 5-7,03,2025
Part of project
Artificial Intelligence as a risk and opportunity for the authenticity of archives, Wallenberg Foundations
Funder
Wallenberg AI, Autonomous Systems and Software Program – Humanity and Society (WASP-HS)Available from: 2025-05-08 Created: 2025-05-08 Last updated: 2025-05-08Bibliographically approved

Open Access in DiVA

fulltext(247 kB)13 downloads
File information
File name FULLTEXT01.pdfFile size 247 kBChecksum SHA-512
732a3dc3cd76d77db4f6ef7493c9d8013706d07df9a80941e98d217dfc4c0cae3c29435c4d675bc9e1c0409f0790f0357ac0e472a97923fe990eb3f8c1232fbc
Type fulltextMimetype application/pdf

Other links

Programme Book of abstracts

Authority records

Widegren, Johannes

Search in DiVA

By author/editor
Widegren, Johannes
By organisation
Department of Cultural Sciences
Information Studies

Search outside of DiVA

GoogleGoogle Scholar
Total: 13 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 156 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf